2025-05-07T19:42:32.6820400Z Current runner version: '2.323.0' 2025-05-07T19:42:32.6825986Z Runner name: 'i-032cc121644da911c' 2025-05-07T19:42:32.6826928Z Machine name: 'ip-10-0-4-57' 2025-05-07T19:42:32.6829518Z ##[group]GITHUB_TOKEN Permissions 2025-05-07T19:42:32.6831505Z Contents: read 2025-05-07T19:42:32.6832056Z Metadata: read 2025-05-07T19:42:32.6832651Z Packages: read 2025-05-07T19:42:32.6833401Z ##[endgroup] 2025-05-07T19:42:32.6835924Z Secret source: None 2025-05-07T19:42:32.6836939Z Prepare workflow directory 2025-05-07T19:42:33.2955343Z Prepare all required actions 2025-05-07T19:42:33.2998151Z Getting action download info 2025-05-07T19:42:34.0497950Z Download action repository 'actions/checkout@v4' (SHA:11bd71901bbe5b1630ceea73d27597364c9af683) 2025-05-07T19:42:34.3264510Z Download action repository 'actions/upload-artifact@v4' (SHA:ea165f8d65b6e75b540449e92b4886f43607fa02) 2025-05-07T19:42:34.8263519Z Complete job name: build_artifact (x86, linux.24xlarge, default, 3.13, 11.8.0, clang) 2025-05-07T19:42:34.9244916Z A job started hook has been configured by the self-hosted runner administrator 2025-05-07T19:42:34.9359171Z ##[group]Run '/home/ec2-user/runner-scripts/before_job.sh' 2025-05-07T19:42:34.9369473Z shell: /usr/bin/bash --noprofile --norc -e -o pipefail {0} 2025-05-07T19:42:34.9370338Z ##[endgroup] 2025-05-07T19:42:36.0555663Z Runner Type: linux.24xlarge 2025-05-07T19:42:36.0556159Z Instance Type: c5.24xlarge 2025-05-07T19:42:36.0556462Z AMI Name: unknown 2025-05-07T19:42:36.0581551Z AMI ID: ami-071226ecf16aa7d96 2025-05-07T19:42:41.1263230Z ##[group]Checking docker version 2025-05-07T19:42:41.1278673Z ##[command]/usr/bin/docker version --format '{{.Server.APIVersion}}' 2025-05-07T19:42:41.1495297Z '1.44' 2025-05-07T19:42:41.1517120Z Docker daemon API version: '1.44' 2025-05-07T19:42:41.1517734Z ##[command]/usr/bin/docker version --format '{{.Client.APIVersion}}' 2025-05-07T19:42:41.1707434Z '1.44' 2025-05-07T19:42:41.1718816Z Docker client API 
version: '1.44' 2025-05-07T19:42:41.1724093Z ##[endgroup] 2025-05-07T19:42:41.1728132Z ##[group]Clean up resources from previous jobs 2025-05-07T19:42:41.1733111Z ##[command]/usr/bin/docker ps --all --quiet --no-trunc --filter "label=52a9e0" 2025-05-07T19:42:41.1909400Z ##[command]/usr/bin/docker network prune --force --filter "label=52a9e0" 2025-05-07T19:42:41.2051338Z ##[endgroup] 2025-05-07T19:42:41.2051767Z ##[group]Create local container network 2025-05-07T19:42:41.2060719Z ##[command]/usr/bin/docker network create --label 52a9e0 github_network_935158e13aba4e44929564f3b9c47480 2025-05-07T19:42:41.4729983Z 9d8373b1bfaa2cb4e51ccd682252b647f027ad8c09bc6a2c88550475ab61d4af 2025-05-07T19:42:41.4747298Z ##[endgroup] 2025-05-07T19:42:41.4771377Z ##[group]Starting job container 2025-05-07T19:42:41.4792409Z ##[command]/usr/bin/docker pull amazonlinux:2023 2025-05-07T19:42:41.7128294Z 2023: Pulling from library/amazonlinux 2025-05-07T19:42:41.7742642Z 1c3112c87ab2: Pulling fs layer 2025-05-07T19:42:42.3331881Z 1c3112c87ab2: Verifying Checksum 2025-05-07T19:42:42.3332882Z 1c3112c87ab2: Download complete 2025-05-07T19:42:44.1388714Z 1c3112c87ab2: Pull complete 2025-05-07T19:42:44.1552434Z Digest: sha256:cb5b4c509d62ae388f674c139ae5e8281fc160c217d474445e912043e1941988 2025-05-07T19:42:44.1602945Z Status: Downloaded newer image for amazonlinux:2023 2025-05-07T19:42:44.1632449Z docker.io/library/amazonlinux:2023 2025-05-07T19:42:44.1722439Z ##[command]/usr/bin/docker create --name 3b54c127a5cb47de8f35c3c3802a9fab_amazonlinux2023_18a699 --label 52a9e0 --workdir /__w/FBGEMM/FBGEMM --network github_network_935158e13aba4e44929564f3b9c47480 --user root -e "HOME=/github/home" -e GITHUB_ACTIONS=true -e CI=true -v "/var/run/docker.sock":"/var/run/docker.sock" -v "/home/ec2-user/actions-runner/_work":"/__w" -v "/home/ec2-user/actions-runner/externals":"/__e":ro -v "/home/ec2-user/actions-runner/_work/_temp":"/__w/_temp" -v 
"/home/ec2-user/actions-runner/_work/_actions":"/__w/_actions" -v "/home/ec2-user/actions-runner/_work/_tool":"/__w/_tool" -v "/home/ec2-user/actions-runner/_work/_temp/_github_home":"/github/home" -v "/home/ec2-user/actions-runner/_work/_temp/_github_workflow":"/github/workflow" --entrypoint "tail" amazonlinux:2023 "-f" "/dev/null" 2025-05-07T19:42:44.5020214Z 2b02554cc61113bb96fb80bbab95670dde250cea5f4d3e11972b04e9d3bcf9fd 2025-05-07T19:42:44.5045110Z ##[command]/usr/bin/docker start 2b02554cc61113bb96fb80bbab95670dde250cea5f4d3e11972b04e9d3bcf9fd 2025-05-07T19:42:45.0285516Z 2b02554cc61113bb96fb80bbab95670dde250cea5f4d3e11972b04e9d3bcf9fd 2025-05-07T19:42:45.0305906Z ##[command]/usr/bin/docker ps --all --filter id=2b02554cc61113bb96fb80bbab95670dde250cea5f4d3e11972b04e9d3bcf9fd --filter status=running --no-trunc --format "{{.ID}} {{.Status}}" 2025-05-07T19:42:45.0468861Z 2b02554cc61113bb96fb80bbab95670dde250cea5f4d3e11972b04e9d3bcf9fd Up Less than a second 2025-05-07T19:42:45.0494928Z ##[command]/usr/bin/docker inspect --format "{{range .Config.Env}}{{println .}}{{end}}" 2b02554cc61113bb96fb80bbab95670dde250cea5f4d3e11972b04e9d3bcf9fd 2025-05-07T19:42:45.0647496Z HOME=/github/home 2025-05-07T19:42:45.0647953Z GITHUB_ACTIONS=true 2025-05-07T19:42:45.0648197Z CI=true 2025-05-07T19:42:45.0648563Z PATH=/usr/local/sbin:/usr/local/bin:/usr/sbin:/usr/bin:/sbin:/bin 2025-05-07T19:42:45.0668899Z ##[endgroup] 2025-05-07T19:42:45.0679019Z ##[group]Waiting for all services to be ready 2025-05-07T19:42:45.0680907Z ##[endgroup] 2025-05-07T19:42:45.0756407Z ##[group]Run yum update -y; yum install -y binutils findutils git pciutils sudo tar wget which 2025-05-07T19:42:45.0757222Z yum update -y; yum install -y binutils findutils git pciutils sudo tar wget which 2025-05-07T19:42:45.0758116Z shell: bash --noprofile --norc -e -o pipefail {0} 2025-05-07T19:42:45.0758481Z env: 2025-05-07T19:42:45.0758720Z PRELUDE: .github/scripts/setup_env.bash 2025-05-07T19:42:45.0759067Z BUILD_ENV: 
build_binary 2025-05-07T19:42:45.0759336Z BUILD_TARGET: default 2025-05-07T19:42:45.0759621Z BUILD_VARIANT: cuda 2025-05-07T19:42:45.0759905Z BUILD_CUDA_VERSION: 11.8.0 2025-05-07T19:42:45.0760190Z ##[endgroup] 2025-05-07T19:42:45.9808036Z Amazon Linux 2023 repository 65 MB/s | 37 MB 00:00 2025-05-07T19:42:52.6075412Z Last metadata expiration check: 0:00:07 ago on Wed May 7 19:42:45 2025. 2025-05-07T19:42:53.1657095Z Dependencies resolved. 2025-05-07T19:42:53.1833781Z Nothing to do. 2025-05-07T19:42:53.1834834Z Complete! 2025-05-07T19:42:53.4246978Z Last metadata expiration check: 0:00:08 ago on Wed May 7 19:42:45 2025. 2025-05-07T19:42:53.4871635Z Dependencies resolved. 2025-05-07T19:42:53.5096531Z ======================================================================================== 2025-05-07T19:42:53.5097367Z Package Arch Version Repository Size 2025-05-07T19:42:53.5098047Z ======================================================================================== 2025-05-07T19:42:53.5098416Z Installing: 2025-05-07T19:42:53.5098886Z binutils x86_64 2.41-50.amzn2023.0.3 amazonlinux 5.3 M 2025-05-07T19:42:53.5099463Z findutils x86_64 1:4.8.0-2.amzn2023.0.2 amazonlinux 539 k 2025-05-07T19:42:53.5099980Z git x86_64 2.47.1-1.amzn2023.0.2 amazonlinux 54 k 2025-05-07T19:42:53.5100484Z pciutils x86_64 3.7.0-3.amzn2023.0.2 amazonlinux 93 k 2025-05-07T19:42:53.5101010Z sudo x86_64 1.9.15-1.p5.amzn2023.0.1 amazonlinux 1.3 M 2025-05-07T19:42:53.5101492Z tar x86_64 2:1.34-1.amzn2023.0.4 amazonlinux 879 k 2025-05-07T19:42:53.5101958Z wget x86_64 1.21.3-1.amzn2023.0.4 amazonlinux 779 k 2025-05-07T19:42:53.5103003Z which x86_64 2.21-26.amzn2023.0.2 amazonlinux 42 k 2025-05-07T19:42:53.5103403Z Installing dependencies: 2025-05-07T19:42:53.5103798Z cracklib x86_64 2.9.6-27.amzn2023.0.2 amazonlinux 82 k 2025-05-07T19:42:53.5104347Z cyrus-sasl-lib x86_64 2.1.27-18.amzn2023.0.3 amazonlinux 786 k 2025-05-07T19:42:53.5105204Z elfutils-debuginfod-client x86_64 0.188-3.amzn2023.0.2 
amazonlinux 41 k 2025-05-07T19:42:53.5105812Z git-core x86_64 2.47.1-1.amzn2023.0.2 amazonlinux 4.7 M 2025-05-07T19:42:53.5106346Z git-core-doc noarch 2.47.1-1.amzn2023.0.2 amazonlinux 2.8 M 2025-05-07T19:42:53.5106878Z gnutls x86_64 3.8.3-6.amzn2023.0.1 amazonlinux 1.1 M 2025-05-07T19:42:53.5107387Z groff-base x86_64 1.22.4-7.amzn2023.0.2 amazonlinux 1.0 M 2025-05-07T19:42:53.5107900Z gzip x86_64 1.12-1.amzn2023.0.1 amazonlinux 160 k 2025-05-07T19:42:53.5108406Z hwdata noarch 0.384-1.amzn2023.0.3 amazonlinux 1.6 M 2025-05-07T19:42:53.5108931Z jansson x86_64 2.14-0.amzn2023 amazonlinux 46 k 2025-05-07T19:42:53.5109456Z kmod-libs x86_64 29-2.amzn2023.0.5 amazonlinux 62 k 2025-05-07T19:42:53.5109948Z less x86_64 608-2.amzn2023.0.2 amazonlinux 168 k 2025-05-07T19:42:53.5110594Z libcbor x86_64 0.7.0-3.amzn2023.0.2 amazonlinux 57 k 2025-05-07T19:42:53.5111100Z libdb x86_64 5.3.28-49.amzn2023.0.2 amazonlinux 756 k 2025-05-07T19:42:53.5111593Z libeconf x86_64 0.4.0-1.amzn2023.0.3 amazonlinux 28 k 2025-05-07T19:42:53.5112114Z libedit x86_64 3.1-38.20210714cvs.amzn2023.0.2 amazonlinux 108 k 2025-05-07T19:42:53.5112618Z libfdisk x86_64 2.37.4-1.amzn2023.0.4 amazonlinux 153 k 2025-05-07T19:42:53.5113215Z libfido2 x86_64 1.10.0-2.amzn2023.0.2 amazonlinux 95 k 2025-05-07T19:42:53.5113733Z libmetalink x86_64 0.1.3-14.amzn2023.0.2 amazonlinux 31 k 2025-05-07T19:42:53.5114294Z libpwquality x86_64 1.4.4-6.amzn2023.0.2 amazonlinux 106 k 2025-05-07T19:42:53.5114932Z libsemanage x86_64 3.4-5.amzn2023.0.2 amazonlinux 121 k 2025-05-07T19:42:53.5115492Z libutempter x86_64 1.2.1-4.amzn2023.0.2 amazonlinux 26 k 2025-05-07T19:42:53.5116012Z nano x86_64 8.3-1.amzn2023 amazonlinux 706 k 2025-05-07T19:42:53.5116514Z ncurses x86_64 6.2-4.20200222.amzn2023.0.6 amazonlinux 394 k 2025-05-07T19:42:53.5117008Z nettle x86_64 3.10.1-1.amzn2023.0.1 amazonlinux 573 k 2025-05-07T19:42:53.5117524Z openldap x86_64 2.4.57-6.amzn2023.0.7 amazonlinux 256 k 2025-05-07T19:42:53.5118033Z openssh x86_64 
8.7p1-8.amzn2023.0.14 amazonlinux 454 k 2025-05-07T19:42:53.5118588Z openssh-clients x86_64 8.7p1-8.amzn2023.0.14 amazonlinux 708 k 2025-05-07T19:42:53.5119130Z pam x86_64 1.5.1-8.amzn2023.0.4 amazonlinux 542 k 2025-05-07T19:42:53.5119635Z pciutils-libs x86_64 3.7.0-3.amzn2023.0.2 amazonlinux 41 k 2025-05-07T19:42:53.5120222Z perl-AutoLoader noarch 5.74-477.amzn2023.0.6 amazonlinux 22 k 2025-05-07T19:42:53.5120767Z perl-B x86_64 1.80-477.amzn2023.0.6 amazonlinux 179 k 2025-05-07T19:42:53.5121315Z perl-Carp noarch 1.50-458.amzn2023.0.2 amazonlinux 29 k 2025-05-07T19:42:53.5121902Z perl-Class-Struct noarch 0.66-477.amzn2023.0.6 amazonlinux 22 k 2025-05-07T19:42:53.5122525Z perl-Data-Dumper x86_64 2.174-460.amzn2023.0.2 amazonlinux 55 k 2025-05-07T19:42:53.5123130Z perl-Digest noarch 1.20-1.amzn2023.0.2 amazonlinux 26 k 2025-05-07T19:42:53.5123787Z perl-Digest-MD5 x86_64 2.58-2.amzn2023.0.2 amazonlinux 36 k 2025-05-07T19:42:53.5124394Z perl-DynaLoader x86_64 1.47-477.amzn2023.0.6 amazonlinux 26 k 2025-05-07T19:42:53.5125144Z perl-Encode x86_64 4:3.15-462.amzn2023.0.2 amazonlinux 1.7 M 2025-05-07T19:42:53.5125918Z perl-Errno x86_64 1.30-477.amzn2023.0.6 amazonlinux 15 k 2025-05-07T19:42:53.5126489Z perl-Error noarch 1:0.17029-5.amzn2023.0.2 amazonlinux 41 k 2025-05-07T19:42:53.5127120Z perl-Exporter noarch 5.74-459.amzn2023.0.2 amazonlinux 31 k 2025-05-07T19:42:53.5127722Z perl-Fcntl x86_64 1.13-477.amzn2023.0.6 amazonlinux 21 k 2025-05-07T19:42:53.5128303Z perl-File-Basename noarch 2.85-477.amzn2023.0.6 amazonlinux 18 k 2025-05-07T19:42:53.5128929Z perl-File-Find noarch 1.37-477.amzn2023.0.6 amazonlinux 26 k 2025-05-07T19:42:53.5129524Z perl-File-Path noarch 2.18-2.amzn2023.0.2 amazonlinux 36 k 2025-05-07T19:42:53.5130125Z perl-File-Temp noarch 1:0.231.100-2.amzn2023.0.2 amazonlinux 60 k 2025-05-07T19:42:53.5131678Z perl-File-stat noarch 1.09-477.amzn2023.0.6 amazonlinux 17 k 2025-05-07T19:42:53.5132285Z perl-FileHandle noarch 2.03-477.amzn2023.0.6 amazonlinux 16 k 
2025-05-07T19:42:53.5132911Z perl-Getopt-Long noarch 1:2.52-2.amzn2023.0.2 amazonlinux 60 k 2025-05-07T19:42:53.5133512Z perl-Getopt-Std noarch 1.12-477.amzn2023.0.6 amazonlinux 16 k 2025-05-07T19:42:53.5134093Z perl-Git noarch 2.47.1-1.amzn2023.0.2 amazonlinux 42 k 2025-05-07T19:42:53.5134682Z perl-HTTP-Tiny noarch 0.078-1.amzn2023.0.3 amazonlinux 56 k 2025-05-07T19:42:53.5135231Z perl-IO x86_64 1.43-477.amzn2023.0.6 amazonlinux 87 k 2025-05-07T19:42:53.5135797Z perl-IPC-Open3 noarch 1.21-477.amzn2023.0.6 amazonlinux 23 k 2025-05-07T19:42:53.5136386Z perl-MIME-Base64 x86_64 3.16-2.amzn2023.0.2 amazonlinux 31 k 2025-05-07T19:42:53.5136980Z perl-Net-SSLeay x86_64 1.94-1.amzn2023.0.1 amazonlinux 392 k 2025-05-07T19:42:53.5137553Z perl-POSIX x86_64 1.94-477.amzn2023.0.6 amazonlinux 97 k 2025-05-07T19:42:53.5138111Z perl-PathTools x86_64 3.78-459.amzn2023.0.2 amazonlinux 85 k 2025-05-07T19:42:53.5138716Z perl-Pod-Escapes noarch 1:1.07-458.amzn2023.0.2 amazonlinux 20 k 2025-05-07T19:42:53.5139322Z perl-Pod-Perldoc noarch 3.28.01-459.amzn2023.0.3 amazonlinux 84 k 2025-05-07T19:42:53.5139942Z perl-Pod-Simple noarch 1:3.42-2.amzn2023.0.2 amazonlinux 215 k 2025-05-07T19:42:53.5140536Z perl-Pod-Usage noarch 4:2.01-2.amzn2023.0.2 amazonlinux 41 k 2025-05-07T19:42:53.5141268Z perl-Scalar-List-Utils x86_64 4:1.56-459.amzn2023.0.2 amazonlinux 71 k 2025-05-07T19:42:53.5141894Z perl-SelectSaver noarch 1.02-477.amzn2023.0.6 amazonlinux 12 k 2025-05-07T19:42:53.5142461Z perl-Socket x86_64 4:2.032-1.amzn2023.0.2 amazonlinux 55 k 2025-05-07T19:42:53.5143016Z perl-Storable x86_64 1:3.21-458.amzn2023.0.2 amazonlinux 96 k 2025-05-07T19:42:53.5143566Z perl-Symbol noarch 1.08-477.amzn2023.0.6 amazonlinux 15 k 2025-05-07T19:42:53.5144170Z perl-Term-ANSIColor noarch 5.01-459.amzn2023.0.2 amazonlinux 48 k 2025-05-07T19:42:53.5144772Z perl-Term-Cap noarch 1.17-458.amzn2023.0.2 amazonlinux 22 k 2025-05-07T19:42:53.5145339Z perl-TermReadKey x86_64 2.38-9.amzn2023.0.2 amazonlinux 36 k 
2025-05-07T19:42:53.5146019Z perl-Text-ParseWords noarch 3.30-458.amzn2023.0.2 amazonlinux 17 k 2025-05-07T19:42:53.5146654Z perl-Text-Tabs+Wrap noarch 2021.0726-1.amzn2023.0.1 amazonlinux 22 k 2025-05-07T19:42:53.5147278Z perl-Time-Local noarch 2:1.300-5.amzn2023.0.2 amazonlinux 34 k 2025-05-07T19:42:53.5147833Z perl-URI noarch 5.09-1.amzn2023.0.2 amazonlinux 108 k 2025-05-07T19:42:53.5148380Z perl-base noarch 2.27-477.amzn2023.0.6 amazonlinux 17 k 2025-05-07T19:42:53.5148945Z perl-constant noarch 1.33-459.amzn2023.0.2 amazonlinux 23 k 2025-05-07T19:42:53.5149489Z perl-if noarch 0.60.800-477.amzn2023.0.6 amazonlinux 14 k 2025-05-07T19:42:53.5150049Z perl-interpreter x86_64 4:5.32.1-477.amzn2023.0.6 amazonlinux 71 k 2025-05-07T19:42:53.5150580Z perl-lib x86_64 0.65-477.amzn2023.0.6 amazonlinux 15 k 2025-05-07T19:42:53.5151113Z perl-libnet noarch 3.13-2.amzn2023.0.2 amazonlinux 126 k 2025-05-07T19:42:53.5151646Z perl-libs x86_64 4:5.32.1-477.amzn2023.0.6 amazonlinux 2.0 M 2025-05-07T19:42:53.5152204Z perl-mro x86_64 1.23-477.amzn2023.0.6 amazonlinux 29 k 2025-05-07T19:42:53.5152816Z perl-overload noarch 1.31-477.amzn2023.0.6 amazonlinux 46 k 2025-05-07T19:42:53.5153611Z perl-overloading noarch 0.02-477.amzn2023.0.6 amazonlinux 13 k 2025-05-07T19:42:53.5154196Z perl-parent noarch 1:0.238-458.amzn2023.0.2 amazonlinux 14 k 2025-05-07T19:42:53.5154785Z perl-podlators noarch 1:4.14-458.amzn2023.0.2 amazonlinux 112 k 2025-05-07T19:42:53.5155347Z perl-subs noarch 1.03-477.amzn2023.0.6 amazonlinux 12 k 2025-05-07T19:42:53.5155909Z perl-vars noarch 1.05-477.amzn2023.0.6 amazonlinux 13 k 2025-05-07T19:42:53.5156470Z shadow-utils x86_64 2:4.9-12.amzn2023.0.4 amazonlinux 1.1 M 2025-05-07T19:42:53.5157016Z systemd-libs x86_64 252.23-3.amzn2023 amazonlinux 613 k 2025-05-07T19:42:53.5157563Z util-linux x86_64 2.37.4-1.amzn2023.0.4 amazonlinux 2.2 M 2025-05-07T19:42:53.5158105Z util-linux-core x86_64 2.37.4-1.amzn2023.0.4 amazonlinux 432 k 2025-05-07T19:42:53.5158561Z Installing 
weak dependencies: 2025-05-07T19:42:53.5159014Z nano-default-editor noarch 8.3-1.amzn2023 amazonlinux 10 k 2025-05-07T19:42:53.5159635Z perl-IO-Socket-IP noarch 0.41-3.amzn2023.0.2 amazonlinux 42 k 2025-05-07T19:42:53.5160251Z perl-IO-Socket-SSL noarch 2.075-1.amzn2023.0.2 amazonlinux 218 k 2025-05-07T19:42:53.5160855Z perl-Mozilla-CA noarch 20200520-4.amzn2023.0.2 amazonlinux 13 k 2025-05-07T19:42:53.5161469Z perl-NDBM_File x86_64 1.15-477.amzn2023.0.6 amazonlinux 23 k 2025-05-07T19:42:53.5162046Z sudo-python-plugin x86_64 1.9.15-1.p5.amzn2023.0.1 amazonlinux 56 k 2025-05-07T19:42:53.5162413Z 2025-05-07T19:42:53.5162512Z Transaction Summary 2025-05-07T19:42:53.5162805Z ======================================================================================== 2025-05-07T19:42:53.5163134Z Install 107 Packages 2025-05-07T19:42:53.5163284Z 2025-05-07T19:42:53.5163430Z Total download size: 38 M 2025-05-07T19:42:53.5163690Z Installed size: 151 M 2025-05-07T19:42:53.5163951Z Downloading Packages: 2025-05-07T19:42:53.8218706Z (1/107): cracklib-2.9.6-27.amzn2023.0.2.x86_64. 3.7 MB/s | 82 kB 00:00 2025-05-07T19:42:53.8344260Z (2/107): elfutils-debuginfod-client-0.188-3.amz 3.6 MB/s | 41 kB 00:00 2025-05-07T19:42:53.8410072Z (3/107): cyrus-sasl-lib-2.1.27-18.amzn2023.0.3. 19 MB/s | 786 kB 00:00 2025-05-07T19:42:53.8636571Z (4/107): binutils-2.41-50.amzn2023.0.3.x86_64.r 82 MB/s | 5.3 MB 00:00 2025-05-07T19:42:53.8688215Z (5/107): findutils-4.8.0-2.amzn2023.0.2.x86_64. 16 MB/s | 539 kB 00:00 2025-05-07T19:42:53.8701314Z (6/107): git-2.47.1-1.amzn2023.0.2.x86_64.rpm 1.9 MB/s | 54 kB 00:00 2025-05-07T19:42:53.8914581Z (7/107): gnutls-3.8.3-6.amzn2023.0.1.x86_64.rpm 56 MB/s | 1.1 MB 00:00 2025-05-07T19:42:53.9127793Z (8/107): git-core-2.47.1-1.amzn2023.0.2.x86_64. 
99 MB/s | 4.7 MB 00:00 2025-05-07T19:42:53.9243960Z (9/107): git-core-doc-2.47.1-1.amzn2023.0.2.noa 53 MB/s | 2.8 MB 00:00 2025-05-07T19:42:53.9319704Z (10/107): groff-base-1.22.4-7.amzn2023.0.2.x86_ 27 MB/s | 1.0 MB 00:00 2025-05-07T19:42:53.9356022Z (11/107): gzip-1.12-1.amzn2023.0.1.x86_64.rpm 17 MB/s | 160 kB 00:00 2025-05-07T19:42:53.9456306Z (12/107): hwdata-0.384-1.amzn2023.0.3.noarch.rp 85 MB/s | 1.6 MB 00:00 2025-05-07T19:42:53.9476125Z (13/107): jansson-2.14-0.amzn2023.x86_64.rpm 3.2 MB/s | 46 kB 00:00 2025-05-07T19:42:53.9494220Z (14/107): kmod-libs-29-2.amzn2023.0.5.x86_64.rp 5.2 MB/s | 62 kB 00:00 2025-05-07T19:42:53.9532571Z (15/107): less-608-2.amzn2023.0.2.x86_64.rpm 23 MB/s | 168 kB 00:00 2025-05-07T19:42:53.9586018Z (16/107): libcbor-0.7.0-3.amzn2023.0.2.x86_64.r 6.6 MB/s | 57 kB 00:00 2025-05-07T19:42:53.9638525Z (17/107): libdb-5.3.28-49.amzn2023.0.2.x86_64.r 53 MB/s | 756 kB 00:00 2025-05-07T19:42:53.9654686Z (18/107): libeconf-0.4.0-1.amzn2023.0.3.x86_64. 2.3 MB/s | 28 kB 00:00 2025-05-07T19:42:53.9678477Z (19/107): libedit-3.1-38.20210714cvs.amzn2023.0 13 MB/s | 108 kB 00:00 2025-05-07T19:42:53.9730403Z (20/107): libfido2-1.10.0-2.amzn2023.0.2.x86_64 14 MB/s | 95 kB 00:00 2025-05-07T19:42:53.9758229Z (21/107): libfdisk-2.37.4-1.amzn2023.0.4.x86_64 15 MB/s | 153 kB 00:00 2025-05-07T19:42:53.9774743Z (22/107): libmetalink-0.1.3-14.amzn2023.0.2.x86 3.4 MB/s | 31 kB 00:00 2025-05-07T19:42:53.9801753Z (23/107): libpwquality-1.4.4-6.amzn2023.0.2.x86 16 MB/s | 106 kB 00:00 2025-05-07T19:42:53.9838788Z (24/107): libutempter-1.2.1-4.amzn2023.0.2.x86_ 4.5 MB/s | 26 kB 00:00 2025-05-07T19:42:53.9865681Z (25/107): libsemanage-3.4-5.amzn2023.0.2.x86_64 15 MB/s | 121 kB 00:00 2025-05-07T19:42:53.9920795Z (26/107): nano-8.3-1.amzn2023.x86_64.rpm 60 MB/s | 706 kB 00:00 2025-05-07T19:42:53.9947567Z (27/107): nano-default-editor-8.3-1.amzn2023.no 1.0 MB/s | 10 kB 00:00 2025-05-07T19:42:53.9989232Z (28/107): ncurses-6.2-4.20200222.amzn2023.0.6.x 36 MB/s | 394 
kB 00:00 2025-05-07T19:42:54.0042115Z (29/107): nettle-3.10.1-1.amzn2023.0.1.x86_64.r 50 MB/s | 573 kB 00:00 2025-05-07T19:42:54.0072016Z (30/107): openldap-2.4.57-6.amzn2023.0.7.x86_64 21 MB/s | 256 kB 00:00 2025-05-07T19:42:54.0118912Z (31/107): openssh-8.7p1-8.amzn2023.0.14.x86_64. 39 MB/s | 454 kB 00:00 2025-05-07T19:42:54.0206016Z (32/107): pam-1.5.1-8.amzn2023.0.4.x86_64.rpm 45 MB/s | 542 kB 00:00 2025-05-07T19:42:54.0260599Z (33/107): openssh-clients-8.7p1-8.amzn2023.0.14 41 MB/s | 708 kB 00:00 2025-05-07T19:42:54.0281581Z (34/107): pciutils-3.7.0-3.amzn2023.0.2.x86_64. 5.7 MB/s | 93 kB 00:00 2025-05-07T19:42:54.0304786Z (35/107): pciutils-libs-3.7.0-3.amzn2023.0.2.x8 4.7 MB/s | 41 kB 00:00 2025-05-07T19:42:54.0319418Z (36/107): perl-AutoLoader-5.74-477.amzn2023.0.6 3.8 MB/s | 22 kB 00:00 2025-05-07T19:42:54.0356732Z (37/107): perl-B-1.80-477.amzn2023.0.6.x86_64.r 26 MB/s | 179 kB 00:00 2025-05-07T19:42:54.0379126Z (38/107): perl-Carp-1.50-458.amzn2023.0.2.noarc 4.3 MB/s | 29 kB 00:00 2025-05-07T19:42:54.0395936Z (39/107): perl-Class-Struct-0.66-477.amzn2023.0 3.5 MB/s | 22 kB 00:00 2025-05-07T19:42:54.0435085Z (40/107): perl-Digest-1.20-1.amzn2023.0.2.noarc 4.9 MB/s | 26 kB 00:00 2025-05-07T19:42:54.0459069Z (41/107): perl-Data-Dumper-2.174-460.amzn2023.0 5.8 MB/s | 55 kB 00:00 2025-05-07T19:42:54.0475968Z (42/107): perl-Digest-MD5-2.58-2.amzn2023.0.2.x 4.6 MB/s | 36 kB 00:00 2025-05-07T19:42:54.0494656Z (43/107): perl-DynaLoader-1.47-477.amzn2023.0.6 4.8 MB/s | 26 kB 00:00 2025-05-07T19:42:54.0608763Z (44/107): perl-Encode-3.15-462.amzn2023.0.2.x86 114 MB/s | 1.7 MB 00:00 2025-05-07T19:42:54.0627916Z (45/107): perl-Errno-1.30-477.amzn2023.0.6.x86_ 1.0 MB/s | 15 kB 00:00 2025-05-07T19:42:54.0640892Z (46/107): perl-Error-0.17029-5.amzn2023.0.2.noa 2.8 MB/s | 41 kB 00:00 2025-05-07T19:42:54.0693033Z (47/107): perl-Exporter-5.74-459.amzn2023.0.2.n 4.0 MB/s | 31 kB 00:00 2025-05-07T19:42:54.0712348Z (48/107): perl-Fcntl-1.13-477.amzn2023.0.6.x86_ 3.1 MB/s | 21 
kB 00:00 2025-05-07T19:42:54.0723695Z (49/107): perl-File-Basename-2.85-477.amzn2023. 2.2 MB/s | 18 kB 00:00 2025-05-07T19:42:54.0787992Z (50/107): perl-File-Find-1.37-477.amzn2023.0.6. 2.9 MB/s | 26 kB 00:00 2025-05-07T19:42:54.0798069Z (51/107): perl-File-Path-2.18-2.amzn2023.0.2.no 5.2 MB/s | 36 kB 00:00 2025-05-07T19:42:54.0811658Z (52/107): perl-File-Temp-0.231.100-2.amzn2023.0 7.4 MB/s | 60 kB 00:00 2025-05-07T19:42:54.0876886Z (53/107): perl-File-stat-1.09-477.amzn2023.0.6. 3.2 MB/s | 17 kB 00:00 2025-05-07T19:42:54.0886648Z (54/107): perl-FileHandle-2.03-477.amzn2023.0.6 2.4 MB/s | 16 kB 00:00 2025-05-07T19:42:54.0909853Z (55/107): perl-Getopt-Long-2.52-2.amzn2023.0.2. 6.7 MB/s | 60 kB 00:00 2025-05-07T19:42:54.0942885Z (56/107): perl-Getopt-Std-1.12-477.amzn2023.0.6 3.4 MB/s | 16 kB 00:00 2025-05-07T19:42:54.0961619Z (57/107): perl-Git-2.47.1-1.amzn2023.0.2.noarch 6.2 MB/s | 42 kB 00:00 2025-05-07T19:42:54.0982514Z (58/107): perl-HTTP-Tiny-0.078-1.amzn2023.0.3.n 8.2 MB/s | 56 kB 00:00 2025-05-07T19:42:54.1014142Z (59/107): perl-IO-1.43-477.amzn2023.0.6.x86_64. 13 MB/s | 87 kB 00:00 2025-05-07T19:42:54.1036477Z (60/107): perl-IO-Socket-IP-0.41-3.amzn2023.0.2 7.9 MB/s | 42 kB 00:00 2025-05-07T19:42:54.1071812Z (61/107): perl-IO-Socket-SSL-2.075-1.amzn2023.0 25 MB/s | 218 kB 00:00 2025-05-07T19:42:54.1094635Z (62/107): perl-IPC-Open3-1.21-477.amzn2023.0.6. 3.2 MB/s | 23 kB 00:00 2025-05-07T19:42:54.1111272Z (63/107): perl-MIME-Base64-3.16-2.amzn2023.0.2. 4.5 MB/s | 31 kB 00:00 2025-05-07T19:42:54.1133214Z (64/107): perl-Mozilla-CA-20200520-4.amzn2023.0 2.3 MB/s | 13 kB 00:00 2025-05-07T19:42:54.1197867Z (65/107): perl-Net-SSLeay-1.94-1.amzn2023.0.1.x 49 MB/s | 392 kB 00:00 2025-05-07T19:42:54.1217236Z (66/107): perl-NDBM_File-1.15-477.amzn2023.0.6. 1.9 MB/s | 23 kB 00:00 2025-05-07T19:42:54.1235359Z (67/107): perl-POSIX-1.94-477.amzn2023.0.6.x86_ 9.2 MB/s | 97 kB 00:00 2025-05-07T19:42:54.1261916Z (68/107): perl-PathTools-3.78-459.amzn2023.0.2. 
15 MB/s | 85 kB 00:00 2025-05-07T19:42:54.1291794Z (69/107): perl-Pod-Escapes-1.07-458.amzn2023.0. 4.2 MB/s | 20 kB 00:00 2025-05-07T19:42:54.1312131Z (70/107): perl-Pod-Perldoc-3.28.01-459.amzn2023 12 MB/s | 84 kB 00:00 2025-05-07T19:42:54.1346126Z (71/107): perl-Pod-Simple-3.42-2.amzn2023.0.2.n 27 MB/s | 215 kB 00:00 2025-05-07T19:42:54.1363578Z (72/107): perl-Pod-Usage-2.01-2.amzn2023.0.2.no 5.6 MB/s | 41 kB 00:00 2025-05-07T19:42:54.1387941Z (73/107): perl-Scalar-List-Utils-1.56-459.amzn2 11 MB/s | 71 kB 00:00 2025-05-07T19:42:54.1400667Z (74/107): perl-SelectSaver-1.02-477.amzn2023.0. 2.3 MB/s | 12 kB 00:00 2025-05-07T19:42:54.1462274Z (75/107): perl-Storable-3.21-458.amzn2023.0.2.x 14 MB/s | 96 kB 00:00 2025-05-07T19:42:54.1468075Z (76/107): perl-Symbol-1.08-477.amzn2023.0.6.noa 2.2 MB/s | 15 kB 00:00 2025-05-07T19:42:54.1491481Z (77/107): perl-Socket-2.032-1.amzn2023.0.2.x86_ 5.6 MB/s | 55 kB 00:00 2025-05-07T19:42:54.1528190Z (78/107): perl-Term-ANSIColor-5.01-459.amzn2023 9.4 MB/s | 48 kB 00:00 2025-05-07T19:42:54.1539670Z (79/107): perl-Term-Cap-1.17-458.amzn2023.0.2.n 3.5 MB/s | 22 kB 00:00 2025-05-07T19:42:54.1561016Z (80/107): perl-TermReadKey-2.38-9.amzn2023.0.2. 5.0 MB/s | 36 kB 00:00 2025-05-07T19:42:54.1596083Z (81/107): perl-Text-ParseWords-3.30-458.amzn202 3.3 MB/s | 17 kB 00:00 2025-05-07T19:42:54.1606235Z (82/107): perl-Text-Tabs+Wrap-2021.0726-1.amzn2 3.6 MB/s | 22 kB 00:00 2025-05-07T19:42:54.1630816Z (83/107): perl-Time-Local-1.300-5.amzn2023.0.2. 5.3 MB/s | 34 kB 00:00 2025-05-07T19:42:54.1671109Z (84/107): perl-URI-5.09-1.amzn2023.0.2.noarch.r 20 MB/s | 108 kB 00:00 2025-05-07T19:42:54.1683765Z (85/107): perl-base-2.27-477.amzn2023.0.6.noarc 2.3 MB/s | 17 kB 00:00 2025-05-07T19:42:54.1699943Z (86/107): perl-constant-1.33-459.amzn2023.0.2.n 3.5 MB/s | 23 kB 00:00 2025-05-07T19:42:54.1730026Z (87/107): perl-if-0.60.800-477.amzn2023.0.6.noa 2.6 MB/s | 14 kB 00:00 2025-05-07T19:42:54.1753892Z (88/107): perl-interpreter-5.32.1-477.amzn2023. 
14 MB/s | 71 kB 00:00 2025-05-07T19:42:54.1768737Z (89/107): perl-lib-0.65-477.amzn2023.0.6.x86_64 2.4 MB/s | 15 kB 00:00 2025-05-07T19:42:54.1852464Z (90/107): perl-libnet-3.13-2.amzn2023.0.2.noarc 10 MB/s | 126 kB 00:00 2025-05-07T19:42:54.1967040Z (91/107): perl-libs-5.32.1-477.amzn2023.0.6.x86 97 MB/s | 2.0 MB 00:00 2025-05-07T19:42:54.1977352Z (92/107): perl-mro-1.23-477.amzn2023.0.6.x86_64 1.4 MB/s | 29 kB 00:00 2025-05-07T19:42:54.1999065Z (93/107): perl-overload-1.31-477.amzn2023.0.6.n 3.6 MB/s | 46 kB 00:00 2025-05-07T19:42:54.2046984Z (94/107): perl-overloading-0.02-477.amzn2023.0. 2.0 MB/s | 13 kB 00:00 2025-05-07T19:42:54.2067662Z (95/107): perl-parent-0.238-458.amzn2023.0.2.no 1.7 MB/s | 14 kB 00:00 2025-05-07T19:42:54.2083177Z (96/107): perl-podlators-4.14-458.amzn2023.0.2. 13 MB/s | 112 kB 00:00 2025-05-07T19:42:54.2109519Z (97/107): perl-subs-1.03-477.amzn2023.0.6.noarc 2.0 MB/s | 12 kB 00:00 2025-05-07T19:42:54.2157526Z (98/107): perl-vars-1.05-477.amzn2023.0.6.noarc 2.0 MB/s | 13 kB 00:00 2025-05-07T19:42:54.2235301Z (99/107): shadow-utils-4.9-12.amzn2023.0.4.x86_ 79 MB/s | 1.1 MB 00:00 2025-05-07T19:42:54.2319547Z (100/107): sudo-1.9.15-1.p5.amzn2023.0.1.x86_64 61 MB/s | 1.3 MB 00:00 2025-05-07T19:42:54.2337131Z (101/107): sudo-python-plugin-1.9.15-1.p5.amzn2 3.4 MB/s | 56 kB 00:00 2025-05-07T19:42:54.2394568Z (102/107): systemd-libs-252.23-3.amzn2023.x86_6 45 MB/s | 613 kB 00:00 2025-05-07T19:42:54.2476312Z (103/107): tar-1.34-1.amzn2023.0.4.x86_64.rpm 70 MB/s | 879 kB 00:00 2025-05-07T19:42:54.2545025Z (104/107): util-linux-core-2.37.4-1.amzn2023.0. 
30 MB/s | 432 kB 00:00 2025-05-07T19:42:54.2640537Z (105/107): util-linux-2.37.4-1.amzn2023.0.4.x86 76 MB/s | 2.2 MB 00:00 2025-05-07T19:42:54.2698157Z (106/107): wget-1.21.3-1.amzn2023.0.4.x86_64.rp 41 MB/s | 779 kB 00:00 2025-05-07T19:42:54.2705511Z (107/107): which-2.21-26.amzn2023.0.2.x86_64.rp 2.6 MB/s | 42 kB 00:00 2025-05-07T19:42:54.2725151Z -------------------------------------------------------------------------------- 2025-05-07T19:42:54.2727939Z Total 50 MB/s | 38 MB 00:00 2025-05-07T19:42:55.3323128Z Running transaction check 2025-05-07T19:42:55.3783003Z Transaction check succeeded. 2025-05-07T19:42:55.3783928Z Running transaction test 2025-05-07T19:42:55.7493854Z Transaction test succeeded. 2025-05-07T19:42:55.7495643Z Running transaction 2025-05-07T19:42:56.4440142Z Preparing : 1/1 2025-05-07T19:42:56.4585837Z Installing : systemd-libs-252.23-3.amzn2023.x86_64 1/107 2025-05-07T19:42:56.4816917Z Installing : nettle-3.10.1-1.amzn2023.0.1.x86_64 2/107 2025-05-07T19:42:56.5017729Z Installing : gnutls-3.8.3-6.amzn2023.0.1.x86_64 3/107 2025-05-07T19:42:56.5065314Z Installing : util-linux-core-2.37.4-1.amzn2023.0.4.x86_64 4/107 2025-05-07T19:42:56.5129774Z Running scriptlet: util-linux-core-2.37.4-1.amzn2023.0.4.x86_64 4/107 2025-05-07T19:42:56.5220396Z Installing : pciutils-libs-3.7.0-3.amzn2023.0.2.x86_64 5/107 2025-05-07T19:42:56.5480159Z Installing : ncurses-6.2-4.20200222.amzn2023.0.6.x86_64 6/107 2025-05-07T19:42:56.5538395Z Installing : nano-8.3-1.amzn2023.x86_64 7/107 2025-05-07T19:42:56.5587109Z Installing : nano-default-editor-8.3-1.amzn2023.noarch 8/107 2025-05-07T19:42:56.6076665Z Installing : libsemanage-3.4-5.amzn2023.0.2.x86_64 9/107 2025-05-07T19:42:56.6135788Z Installing : shadow-utils-2:4.9-12.amzn2023.0.4.x86_64 10/107 2025-05-07T19:42:56.6413414Z Running scriptlet: libutempter-1.2.1-4.amzn2023.0.2.x86_64 11/107 2025-05-07T19:42:56.6465086Z Installing : libutempter-1.2.1-4.amzn2023.0.2.x86_64 11/107 2025-05-07T19:42:56.6531963Z Installing 
: libmetalink-0.1.3-14.amzn2023.0.2.x86_64 12/107 2025-05-07T19:42:56.6594222Z Installing : libfdisk-2.37.4-1.amzn2023.0.4.x86_64 13/107 2025-05-07T19:42:56.6646845Z Installing : libedit-3.1-38.20210714cvs.amzn2023.0.2.x86_64 14/107 2025-05-07T19:42:56.6790716Z Installing : libeconf-0.4.0-1.amzn2023.0.3.x86_64 15/107 2025-05-07T19:42:56.6841509Z Installing : libdb-5.3.28-49.amzn2023.0.2.x86_64 16/107 2025-05-07T19:42:56.6900978Z Installing : libcbor-0.7.0-3.amzn2023.0.2.x86_64 17/107 2025-05-07T19:42:56.6974341Z Installing : libfido2-1.10.0-2.amzn2023.0.2.x86_64 18/107 2025-05-07T19:42:56.7041172Z Installing : less-608-2.amzn2023.0.2.x86_64 19/107 2025-05-07T19:42:56.7101538Z Installing : kmod-libs-29-2.amzn2023.0.5.x86_64 20/107 2025-05-07T19:42:56.7537197Z Installing : jansson-2.14-0.amzn2023.x86_64 21/107 2025-05-07T19:42:56.7625860Z Installing : hwdata-0.384-1.amzn2023.0.3.noarch 22/107 2025-05-07T19:42:56.7777959Z Installing : gzip-1.12-1.amzn2023.0.1.x86_64 23/107 2025-05-07T19:42:56.8216660Z Installing : cracklib-2.9.6-27.amzn2023.0.2.x86_64 24/107 2025-05-07T19:42:56.8404822Z Installing : pam-1.5.1-8.amzn2023.0.4.x86_64 25/107 2025-05-07T19:42:56.9226664Z Installing : libpwquality-1.4.4-6.amzn2023.0.2.x86_64 26/107 2025-05-07T19:42:56.9227807Z Installing : util-linux-2.37.4-1.amzn2023.0.4.x86_64 27/107 2025-05-07T19:42:56.9228325Z warning: /etc/adjtime created as /etc/adjtime.rpmnew 2025-05-07T19:42:56.9228588Z 2025-05-07T19:42:56.9435593Z Running scriptlet: util-linux-2.37.4-1.amzn2023.0.4.x86_64 27/107 2025-05-07T19:42:56.9776346Z Running scriptlet: openssh-8.7p1-8.amzn2023.0.14.x86_64 28/107 2025-05-07T19:42:56.9976158Z Installing : openssh-8.7p1-8.amzn2023.0.14.x86_64 28/107 2025-05-07T19:42:57.0044617Z Installing : openssh-clients-8.7p1-8.amzn2023.0.14.x86_64 29/107 2025-05-07T19:42:57.1159479Z Running scriptlet: openssh-clients-8.7p1-8.amzn2023.0.14.x86_64 29/107 2025-05-07T19:42:57.2655607Z Installing : git-core-2.47.1-1.amzn2023.0.2.x86_64 30/107 
2025-05-07T19:42:57.2795974Z Installing : git-core-doc-2.47.1-1.amzn2023.0.2.noarch 31/107 2025-05-07T19:42:57.3201268Z Running scriptlet: groff-base-1.22.4-7.amzn2023.0.2.x86_64 32/107 2025-05-07T19:42:57.3299625Z Installing : groff-base-1.22.4-7.amzn2023.0.2.x86_64 32/107 2025-05-07T19:42:57.3371222Z Running scriptlet: groff-base-1.22.4-7.amzn2023.0.2.x86_64 32/107 2025-05-07T19:42:57.3444470Z Installing : perl-Digest-1.20-1.amzn2023.0.2.noarch 33/107 2025-05-07T19:42:57.3540527Z Installing : perl-Digest-MD5-2.58-2.amzn2023.0.2.x86_64 34/107 2025-05-07T19:42:57.3594799Z Installing : perl-B-1.80-477.amzn2023.0.6.x86_64 35/107 2025-05-07T19:42:57.3640797Z Installing : perl-FileHandle-2.03-477.amzn2023.0.6.noarch 36/107 2025-05-07T19:42:57.3698604Z Installing : perl-AutoLoader-5.74-477.amzn2023.0.6.noarch 37/107 2025-05-07T19:42:57.3787285Z Installing : perl-Data-Dumper-2.174-460.amzn2023.0.2.x86_64 38/107 2025-05-07T19:42:57.3854461Z Installing : perl-libnet-3.13-2.amzn2023.0.2.noarch 39/107 2025-05-07T19:42:57.3953277Z Installing : perl-base-2.27-477.amzn2023.0.6.noarch 40/107 2025-05-07T19:42:57.4173856Z Installing : perl-URI-5.09-1.amzn2023.0.2.noarch 41/107 2025-05-07T19:42:57.4262036Z Installing : perl-Net-SSLeay-1.94-1.amzn2023.0.1.x86_64 42/107 2025-05-07T19:42:57.4312477Z Installing : perl-Text-Tabs+Wrap-2021.0726-1.amzn2023.0.1.noa 43/107 2025-05-07T19:42:57.4365331Z Installing : perl-Mozilla-CA-20200520-4.amzn2023.0.2.noarch 44/107 2025-05-07T19:42:57.4427785Z Installing : perl-if-0.60.800-477.amzn2023.0.6.noarch 45/107 2025-05-07T19:42:57.4490543Z Installing : perl-IO-Socket-IP-0.41-3.amzn2023.0.2.noarch 46/107 2025-05-07T19:42:57.4548048Z Installing : perl-Time-Local-2:1.300-5.amzn2023.0.2.noarch 47/107 2025-05-07T19:42:57.4635480Z Installing : perl-File-Path-2.18-2.amzn2023.0.2.noarch 48/107 2025-05-07T19:42:57.4697250Z Installing : perl-IO-Socket-SSL-2.075-1.amzn2023.0.2.noarch 49/107 2025-05-07T19:42:57.4752080Z Installing : 
perl-Pod-Escapes-1:1.07-458.amzn2023.0.2.noarch 50/107 2025-05-07T19:42:57.4814151Z Installing : perl-Class-Struct-0.66-477.amzn2023.0.6.noarch 51/107 2025-05-07T19:42:57.4877051Z Installing : perl-POSIX-1.94-477.amzn2023.0.6.x86_64 52/107 2025-05-07T19:42:57.4933944Z Installing : perl-Term-ANSIColor-5.01-459.amzn2023.0.2.noarch 53/107 2025-05-07T19:42:57.4973812Z Installing : perl-IPC-Open3-1.21-477.amzn2023.0.6.noarch 54/107 2025-05-07T19:42:57.5035250Z Installing : perl-subs-1.03-477.amzn2023.0.6.noarch 55/107 2025-05-07T19:42:57.5102603Z Installing : perl-File-Temp-1:0.231.100-2.amzn2023.0.2.noarch 56/107 2025-05-07T19:42:57.5156326Z Installing : perl-HTTP-Tiny-0.078-1.amzn2023.0.3.noarch 57/107 2025-05-07T19:42:57.5269985Z Installing : perl-Term-Cap-1.17-458.amzn2023.0.2.noarch 58/107 2025-05-07T19:42:57.5355828Z Installing : perl-Pod-Simple-1:3.42-2.amzn2023.0.2.noarch 59/107 2025-05-07T19:42:57.5426702Z Installing : perl-Socket-4:2.032-1.amzn2023.0.2.x86_64 60/107 2025-05-07T19:42:57.5480941Z Installing : perl-SelectSaver-1.02-477.amzn2023.0.6.noarch 61/107 2025-05-07T19:42:57.5526084Z Installing : perl-Symbol-1.08-477.amzn2023.0.6.noarch 62/107 2025-05-07T19:42:57.5601338Z Installing : perl-File-stat-1.09-477.amzn2023.0.6.noarch 63/107 2025-05-07T19:42:57.5704768Z Installing : perl-podlators-1:4.14-458.amzn2023.0.2.noarch 64/107 2025-05-07T19:42:57.5780470Z Installing : perl-Pod-Perldoc-3.28.01-459.amzn2023.0.3.noarch 65/107 2025-05-07T19:42:57.5835042Z Installing : perl-Fcntl-1.13-477.amzn2023.0.6.x86_64 66/107 2025-05-07T19:42:57.5890858Z Installing : perl-Text-ParseWords-3.30-458.amzn2023.0.2.noarc 67/107 2025-05-07T19:42:57.5960248Z Installing : perl-mro-1.23-477.amzn2023.0.6.x86_64 68/107 2025-05-07T19:42:57.6035413Z Installing : perl-IO-1.43-477.amzn2023.0.6.x86_64 69/107 2025-05-07T19:42:57.6096743Z Installing : perl-overloading-0.02-477.amzn2023.0.6.noarch 70/107 2025-05-07T19:42:57.6162857Z Installing : perl-Pod-Usage-4:2.01-2.amzn2023.0.2.noarch 
71/107 2025-05-07T19:42:57.6213804Z Installing : perl-Errno-1.30-477.amzn2023.0.6.x86_64 72/107 2025-05-07T19:42:57.6265657Z Installing : perl-File-Basename-2.85-477.amzn2023.0.6.noarch 73/107 2025-05-07T19:42:57.6324063Z Installing : perl-Getopt-Std-1.12-477.amzn2023.0.6.noarch 74/107 2025-05-07T19:42:57.6411041Z Installing : perl-MIME-Base64-3.16-2.amzn2023.0.2.x86_64 75/107 2025-05-07T19:42:57.6491529Z Installing : perl-Scalar-List-Utils-4:1.56-459.amzn2023.0.2.x 76/107 2025-05-07T19:42:57.6554322Z Installing : perl-constant-1.33-459.amzn2023.0.2.noarch 77/107 2025-05-07T19:42:57.6620721Z Installing : perl-Storable-1:3.21-458.amzn2023.0.2.x86_64 78/107 2025-05-07T19:42:57.6668626Z Installing : perl-overload-1.31-477.amzn2023.0.6.noarch 79/107 2025-05-07T19:42:57.6719301Z Installing : perl-parent-1:0.238-458.amzn2023.0.2.noarch 80/107 2025-05-07T19:42:57.6785108Z Installing : perl-vars-1.05-477.amzn2023.0.6.noarch 81/107 2025-05-07T19:42:57.6840480Z Installing : perl-Getopt-Long-1:2.52-2.amzn2023.0.2.noarch 82/107 2025-05-07T19:42:57.6899097Z Installing : perl-DynaLoader-1.47-477.amzn2023.0.6.x86_64 83/107 2025-05-07T19:42:57.6952867Z Installing : perl-Carp-1.50-458.amzn2023.0.2.noarch 84/107 2025-05-07T19:42:57.7005964Z Installing : perl-Exporter-5.74-459.amzn2023.0.2.noarch 85/107 2025-05-07T19:42:57.7092356Z Installing : perl-NDBM_File-1.15-477.amzn2023.0.6.x86_64 86/107 2025-05-07T19:42:57.7626223Z Installing : perl-PathTools-3.78-459.amzn2023.0.2.x86_64 87/107 2025-05-07T19:42:57.8581215Z Installing : perl-Encode-4:3.15-462.amzn2023.0.2.x86_64 88/107 2025-05-07T19:42:57.8711303Z Installing : perl-libs-4:5.32.1-477.amzn2023.0.6.x86_64 89/107 2025-05-07T19:42:57.8791299Z Installing : perl-interpreter-4:5.32.1-477.amzn2023.0.6.x86_6 90/107 2025-05-07T19:42:57.8857635Z Installing : perl-Error-1:0.17029-5.amzn2023.0.2.noarch 91/107 2025-05-07T19:42:57.8918208Z Installing : perl-File-Find-1.37-477.amzn2023.0.6.noarch 92/107 2025-05-07T19:42:57.8989739Z Installing 
: perl-TermReadKey-2.38-9.amzn2023.0.2.x86_64 93/107 2025-05-07T19:42:57.9037365Z Installing : perl-lib-0.65-477.amzn2023.0.6.x86_64 94/107 2025-05-07T19:42:57.9103904Z Installing : perl-Git-2.47.1-1.amzn2023.0.2.noarch 95/107 2025-05-07T19:42:57.9183224Z Installing : git-2.47.1-1.amzn2023.0.2.x86_64 96/107 2025-05-07T19:42:57.9391900Z Installing : elfutils-debuginfod-client-0.188-3.amzn2023.0.2. 97/107 2025-05-07T19:42:57.9520624Z Installing : cyrus-sasl-lib-2.1.27-18.amzn2023.0.3.x86_64 98/107 2025-05-07T19:42:57.9601736Z Installing : openldap-2.4.57-6.amzn2023.0.7.x86_64 99/107 2025-05-07T19:42:58.0009908Z Installing : sudo-python-plugin-1.9.15-1.p5.amzn2023.0.1.x86_ 100/107 2025-05-07T19:42:58.1234817Z Installing : sudo-1.9.15-1.p5.amzn2023.0.1.x86_64 101/107 2025-05-07T19:42:58.1326936Z Installing : binutils-2.41-50.amzn2023.0.3.x86_64 102/107 2025-05-07T19:42:58.1431230Z Running scriptlet: binutils-2.41-50.amzn2023.0.3.x86_64 102/107 2025-05-07T19:42:58.1738208Z Installing : pciutils-3.7.0-3.amzn2023.0.2.x86_64 103/107 2025-05-07T19:42:58.1835995Z Installing : wget-1.21.3-1.amzn2023.0.4.x86_64 104/107 2025-05-07T19:42:58.2080748Z Installing : which-2.21-26.amzn2023.0.2.x86_64 105/107 2025-05-07T19:42:58.2292172Z Installing : tar-2:1.34-1.amzn2023.0.4.x86_64 106/107 2025-05-07T19:42:58.2380692Z Installing : findutils-1:4.8.0-2.amzn2023.0.2.x86_64 107/107 2025-05-07T19:42:58.2491421Z Running scriptlet: pam-1.5.1-8.amzn2023.0.4.x86_64 107/107 2025-05-07T19:42:59.0137754Z Running scriptlet: findutils-1:4.8.0-2.amzn2023.0.2.x86_64 107/107 2025-05-07T19:42:59.0138639Z Verifying : binutils-2.41-50.amzn2023.0.3.x86_64 1/107 2025-05-07T19:42:59.0139184Z Verifying : cracklib-2.9.6-27.amzn2023.0.2.x86_64 2/107 2025-05-07T19:42:59.0139725Z Verifying : cyrus-sasl-lib-2.1.27-18.amzn2023.0.3.x86_64 3/107 2025-05-07T19:42:59.0140281Z Verifying : elfutils-debuginfod-client-0.188-3.amzn2023.0.2. 
4/107 2025-05-07T19:42:59.0140865Z Verifying : findutils-1:4.8.0-2.amzn2023.0.2.x86_64 5/107 2025-05-07T19:42:59.0141370Z Verifying : git-2.47.1-1.amzn2023.0.2.x86_64 6/107 2025-05-07T19:42:59.0141987Z Verifying : git-core-2.47.1-1.amzn2023.0.2.x86_64 7/107 2025-05-07T19:42:59.0142506Z Verifying : git-core-doc-2.47.1-1.amzn2023.0.2.noarch 8/107 2025-05-07T19:42:59.0143337Z Verifying : gnutls-3.8.3-6.amzn2023.0.1.x86_64 9/107 2025-05-07T19:42:59.0143889Z Verifying : groff-base-1.22.4-7.amzn2023.0.2.x86_64 10/107 2025-05-07T19:42:59.0144408Z Verifying : gzip-1.12-1.amzn2023.0.1.x86_64 11/107 2025-05-07T19:42:59.0144927Z Verifying : hwdata-0.384-1.amzn2023.0.3.noarch 12/107 2025-05-07T19:42:59.0145412Z Verifying : jansson-2.14-0.amzn2023.x86_64 13/107 2025-05-07T19:42:59.0145917Z Verifying : kmod-libs-29-2.amzn2023.0.5.x86_64 14/107 2025-05-07T19:42:59.0146419Z Verifying : less-608-2.amzn2023.0.2.x86_64 15/107 2025-05-07T19:42:59.0146913Z Verifying : libcbor-0.7.0-3.amzn2023.0.2.x86_64 16/107 2025-05-07T19:42:59.0147438Z Verifying : libdb-5.3.28-49.amzn2023.0.2.x86_64 17/107 2025-05-07T19:42:59.0147937Z Verifying : libeconf-0.4.0-1.amzn2023.0.3.x86_64 18/107 2025-05-07T19:42:59.0148483Z Verifying : libedit-3.1-38.20210714cvs.amzn2023.0.2.x86_64 19/107 2025-05-07T19:42:59.0149098Z Verifying : libfdisk-2.37.4-1.amzn2023.0.4.x86_64 20/107 2025-05-07T19:42:59.0149593Z Verifying : libfido2-1.10.0-2.amzn2023.0.2.x86_64 21/107 2025-05-07T19:42:59.0150121Z Verifying : libmetalink-0.1.3-14.amzn2023.0.2.x86_64 22/107 2025-05-07T19:42:59.0150629Z Verifying : libpwquality-1.4.4-6.amzn2023.0.2.x86_64 23/107 2025-05-07T19:42:59.0151189Z Verifying : libsemanage-3.4-5.amzn2023.0.2.x86_64 24/107 2025-05-07T19:42:59.0151750Z Verifying : libutempter-1.2.1-4.amzn2023.0.2.x86_64 25/107 2025-05-07T19:42:59.0152266Z Verifying : nano-8.3-1.amzn2023.x86_64 26/107 2025-05-07T19:42:59.0152919Z Verifying : nano-default-editor-8.3-1.amzn2023.noarch 27/107 2025-05-07T19:42:59.0153651Z Verifying : 
ncurses-6.2-4.20200222.amzn2023.0.6.x86_64 28/107 2025-05-07T19:42:59.0154199Z Verifying : nettle-3.10.1-1.amzn2023.0.1.x86_64 29/107 2025-05-07T19:42:59.0154735Z Verifying : openldap-2.4.57-6.amzn2023.0.7.x86_64 30/107 2025-05-07T19:42:59.0155311Z Verifying : openssh-8.7p1-8.amzn2023.0.14.x86_64 31/107 2025-05-07T19:42:59.0190593Z Verifying : openssh-clients-8.7p1-8.amzn2023.0.14.x86_64 32/107 2025-05-07T19:42:59.0191292Z Verifying : pam-1.5.1-8.amzn2023.0.4.x86_64 33/107 2025-05-07T19:42:59.0191856Z Verifying : pciutils-3.7.0-3.amzn2023.0.2.x86_64 34/107 2025-05-07T19:42:59.0192657Z Verifying : pciutils-libs-3.7.0-3.amzn2023.0.2.x86_64 35/107 2025-05-07T19:42:59.0193570Z Verifying : perl-AutoLoader-5.74-477.amzn2023.0.6.noarch 36/107 2025-05-07T19:42:59.0194163Z Verifying : perl-B-1.80-477.amzn2023.0.6.x86_64 37/107 2025-05-07T19:42:59.0194755Z Verifying : perl-Carp-1.50-458.amzn2023.0.2.noarch 38/107 2025-05-07T19:42:59.0195367Z Verifying : perl-Class-Struct-0.66-477.amzn2023.0.6.noarch 39/107 2025-05-07T19:42:59.0195954Z Verifying : perl-Data-Dumper-2.174-460.amzn2023.0.2.x86_64 40/107 2025-05-07T19:42:59.0196575Z Verifying : perl-Digest-1.20-1.amzn2023.0.2.noarch 41/107 2025-05-07T19:42:59.0197125Z Verifying : perl-Digest-MD5-2.58-2.amzn2023.0.2.x86_64 42/107 2025-05-07T19:42:59.0197731Z Verifying : perl-DynaLoader-1.47-477.amzn2023.0.6.x86_64 43/107 2025-05-07T19:42:59.0198299Z Verifying : perl-Encode-4:3.15-462.amzn2023.0.2.x86_64 44/107 2025-05-07T19:42:59.0198882Z Verifying : perl-Errno-1.30-477.amzn2023.0.6.x86_64 45/107 2025-05-07T19:42:59.0199617Z Verifying : perl-Error-1:0.17029-5.amzn2023.0.2.noarch 46/107 2025-05-07T19:42:59.0200186Z Verifying : perl-Exporter-5.74-459.amzn2023.0.2.noarch 47/107 2025-05-07T19:42:59.0200786Z Verifying : perl-Fcntl-1.13-477.amzn2023.0.6.x86_64 48/107 2025-05-07T19:42:59.0201399Z Verifying : perl-File-Basename-2.85-477.amzn2023.0.6.noarch 49/107 2025-05-07T19:42:59.0201983Z Verifying : 
perl-File-Find-1.37-477.amzn2023.0.6.noarch 50/107 2025-05-07T19:42:59.0202779Z Verifying : perl-File-Path-2.18-2.amzn2023.0.2.noarch 51/107 2025-05-07T19:42:59.0203350Z Verifying : perl-File-Temp-1:0.231.100-2.amzn2023.0.2.noarch 52/107 2025-05-07T19:42:59.0203962Z Verifying : perl-File-stat-1.09-477.amzn2023.0.6.noarch 53/107 2025-05-07T19:42:59.0204542Z Verifying : perl-FileHandle-2.03-477.amzn2023.0.6.noarch 54/107 2025-05-07T19:42:59.0205199Z Verifying : perl-Getopt-Long-1:2.52-2.amzn2023.0.2.noarch 55/107 2025-05-07T19:42:59.0205845Z Verifying : perl-Getopt-Std-1.12-477.amzn2023.0.6.noarch 56/107 2025-05-07T19:42:59.0206405Z Verifying : perl-Git-2.47.1-1.amzn2023.0.2.noarch 57/107 2025-05-07T19:42:59.0206991Z Verifying : perl-HTTP-Tiny-0.078-1.amzn2023.0.3.noarch 58/107 2025-05-07T19:42:59.0207537Z Verifying : perl-IO-1.43-477.amzn2023.0.6.x86_64 59/107 2025-05-07T19:42:59.0208118Z Verifying : perl-IO-Socket-IP-0.41-3.amzn2023.0.2.noarch 60/107 2025-05-07T19:42:59.0208719Z Verifying : perl-IO-Socket-SSL-2.075-1.amzn2023.0.2.noarch 61/107 2025-05-07T19:42:59.0209290Z Verifying : perl-IPC-Open3-1.21-477.amzn2023.0.6.noarch 62/107 2025-05-07T19:42:59.0209899Z Verifying : perl-MIME-Base64-3.16-2.amzn2023.0.2.x86_64 63/107 2025-05-07T19:42:59.0210471Z Verifying : perl-Mozilla-CA-20200520-4.amzn2023.0.2.noarch 64/107 2025-05-07T19:42:59.0211070Z Verifying : perl-NDBM_File-1.15-477.amzn2023.0.6.x86_64 65/107 2025-05-07T19:42:59.0211630Z Verifying : perl-Net-SSLeay-1.94-1.amzn2023.0.1.x86_64 66/107 2025-05-07T19:42:59.0212224Z Verifying : perl-POSIX-1.94-477.amzn2023.0.6.x86_64 67/107 2025-05-07T19:42:59.0212800Z Verifying : perl-PathTools-3.78-459.amzn2023.0.2.x86_64 68/107 2025-05-07T19:42:59.0213360Z Verifying : perl-Pod-Escapes-1:1.07-458.amzn2023.0.2.noarch 69/107 2025-05-07T19:42:59.0214062Z Verifying : perl-Pod-Perldoc-3.28.01-459.amzn2023.0.3.noarch 70/107 2025-05-07T19:42:59.0214749Z Verifying : perl-Pod-Simple-1:3.42-2.amzn2023.0.2.noarch 71/107 
2025-05-07T19:42:59.0215410Z Verifying : perl-Pod-Usage-4:2.01-2.amzn2023.0.2.noarch 72/107 2025-05-07T19:42:59.0216006Z Verifying : perl-Scalar-List-Utils-4:1.56-459.amzn2023.0.2.x 73/107 2025-05-07T19:42:59.0216547Z Verifying : perl-SelectSaver-1.02-477.amzn2023.0.6.noarch 74/107 2025-05-07T19:42:59.0217106Z Verifying : perl-Socket-4:2.032-1.amzn2023.0.2.x86_64 75/107 2025-05-07T19:42:59.0217647Z Verifying : perl-Storable-1:3.21-458.amzn2023.0.2.x86_64 76/107 2025-05-07T19:42:59.0218170Z Verifying : perl-Symbol-1.08-477.amzn2023.0.6.noarch 77/107 2025-05-07T19:42:59.0218716Z Verifying : perl-Term-ANSIColor-5.01-459.amzn2023.0.2.noarch 78/107 2025-05-07T19:42:59.0219230Z Verifying : perl-Term-Cap-1.17-458.amzn2023.0.2.noarch 79/107 2025-05-07T19:42:59.0219763Z Verifying : perl-TermReadKey-2.38-9.amzn2023.0.2.x86_64 80/107 2025-05-07T19:42:59.0220314Z Verifying : perl-Text-ParseWords-3.30-458.amzn2023.0.2.noarc 81/107 2025-05-07T19:42:59.0220884Z Verifying : perl-Text-Tabs+Wrap-2021.0726-1.amzn2023.0.1.noa 82/107 2025-05-07T19:42:59.0221431Z Verifying : perl-Time-Local-2:1.300-5.amzn2023.0.2.noarch 83/107 2025-05-07T19:42:59.0222029Z Verifying : perl-URI-5.09-1.amzn2023.0.2.noarch 84/107 2025-05-07T19:42:59.0222573Z Verifying : perl-base-2.27-477.amzn2023.0.6.noarch 85/107 2025-05-07T19:42:59.0223093Z Verifying : perl-constant-1.33-459.amzn2023.0.2.noarch 86/107 2025-05-07T19:42:59.0223634Z Verifying : perl-if-0.60.800-477.amzn2023.0.6.noarch 87/107 2025-05-07T19:42:59.0224147Z Verifying : perl-interpreter-4:5.32.1-477.amzn2023.0.6.x86_6 88/107 2025-05-07T19:42:59.0224701Z Verifying : perl-lib-0.65-477.amzn2023.0.6.x86_64 89/107 2025-05-07T19:42:59.0225239Z Verifying : perl-libnet-3.13-2.amzn2023.0.2.noarch 90/107 2025-05-07T19:42:59.0225747Z Verifying : perl-libs-4:5.32.1-477.amzn2023.0.6.x86_64 91/107 2025-05-07T19:42:59.0226263Z Verifying : perl-mro-1.23-477.amzn2023.0.6.x86_64 92/107 2025-05-07T19:42:59.0226776Z Verifying : 
perl-overload-1.31-477.amzn2023.0.6.noarch 93/107 2025-05-07T19:42:59.0227338Z Verifying : perl-overloading-0.02-477.amzn2023.0.6.noarch 94/107 2025-05-07T19:42:59.0227886Z Verifying : perl-parent-1:0.238-458.amzn2023.0.2.noarch 95/107 2025-05-07T19:42:59.0228398Z Verifying : perl-podlators-1:4.14-458.amzn2023.0.2.noarch 96/107 2025-05-07T19:42:59.0228935Z Verifying : perl-subs-1.03-477.amzn2023.0.6.noarch 97/107 2025-05-07T19:42:59.0229438Z Verifying : perl-vars-1.05-477.amzn2023.0.6.noarch 98/107 2025-05-07T19:42:59.0229972Z Verifying : shadow-utils-2:4.9-12.amzn2023.0.4.x86_64 99/107 2025-05-07T19:42:59.0230476Z Verifying : sudo-1.9.15-1.p5.amzn2023.0.1.x86_64 100/107 2025-05-07T19:42:59.0231017Z Verifying : sudo-python-plugin-1.9.15-1.p5.amzn2023.0.1.x86_ 101/107 2025-05-07T19:42:59.0231583Z Verifying : systemd-libs-252.23-3.amzn2023.x86_64 102/107 2025-05-07T19:42:59.0232082Z Verifying : tar-2:1.34-1.amzn2023.0.4.x86_64 103/107 2025-05-07T19:42:59.0232607Z Verifying : util-linux-2.37.4-1.amzn2023.0.4.x86_64 104/107 2025-05-07T19:42:59.0233404Z Verifying : util-linux-core-2.37.4-1.amzn2023.0.4.x86_64 105/107 2025-05-07T19:42:59.0233979Z Verifying : wget-1.21.3-1.amzn2023.0.4.x86_64 106/107 2025-05-07T19:42:59.2295454Z Verifying : which-2.21-26.amzn2023.0.2.x86_64 107/107 2025-05-07T19:42:59.2295849Z 2025-05-07T19:42:59.2295948Z Installed: 2025-05-07T19:42:59.2296322Z binutils-2.41-50.amzn2023.0.3.x86_64 2025-05-07T19:42:59.2297475Z cracklib-2.9.6-27.amzn2023.0.2.x86_64 2025-05-07T19:42:59.2298023Z cyrus-sasl-lib-2.1.27-18.amzn2023.0.3.x86_64 2025-05-07T19:42:59.2298648Z elfutils-debuginfod-client-0.188-3.amzn2023.0.2.x86_64 2025-05-07T19:42:59.2299222Z findutils-1:4.8.0-2.amzn2023.0.2.x86_64 2025-05-07T19:42:59.2299738Z git-2.47.1-1.amzn2023.0.2.x86_64 2025-05-07T19:42:59.2300250Z git-core-2.47.1-1.amzn2023.0.2.x86_64 2025-05-07T19:42:59.2300781Z git-core-doc-2.47.1-1.amzn2023.0.2.noarch 2025-05-07T19:42:59.2301322Z gnutls-3.8.3-6.amzn2023.0.1.x86_64 
2025-05-07T19:42:59.2301843Z groff-base-1.22.4-7.amzn2023.0.2.x86_64 2025-05-07T19:42:59.2302556Z gzip-1.12-1.amzn2023.0.1.x86_64 2025-05-07T19:42:59.2303065Z hwdata-0.384-1.amzn2023.0.3.noarch 2025-05-07T19:42:59.2303812Z jansson-2.14-0.amzn2023.x86_64 2025-05-07T19:42:59.2304352Z kmod-libs-29-2.amzn2023.0.5.x86_64 2025-05-07T19:42:59.2304861Z less-608-2.amzn2023.0.2.x86_64 2025-05-07T19:42:59.2305379Z libcbor-0.7.0-3.amzn2023.0.2.x86_64 2025-05-07T19:42:59.2305890Z libdb-5.3.28-49.amzn2023.0.2.x86_64 2025-05-07T19:42:59.2306461Z libeconf-0.4.0-1.amzn2023.0.3.x86_64 2025-05-07T19:42:59.2307023Z libedit-3.1-38.20210714cvs.amzn2023.0.2.x86_64 2025-05-07T19:42:59.2307573Z libfdisk-2.37.4-1.amzn2023.0.4.x86_64 2025-05-07T19:42:59.2308175Z libfido2-1.10.0-2.amzn2023.0.2.x86_64 2025-05-07T19:42:59.2308732Z libmetalink-0.1.3-14.amzn2023.0.2.x86_64 2025-05-07T19:42:59.2309309Z libpwquality-1.4.4-6.amzn2023.0.2.x86_64 2025-05-07T19:42:59.2309866Z libsemanage-3.4-5.amzn2023.0.2.x86_64 2025-05-07T19:42:59.2310438Z libutempter-1.2.1-4.amzn2023.0.2.x86_64 2025-05-07T19:42:59.2310957Z nano-8.3-1.amzn2023.x86_64 2025-05-07T19:42:59.2311505Z nano-default-editor-8.3-1.amzn2023.noarch 2025-05-07T19:42:59.2312085Z ncurses-6.2-4.20200222.amzn2023.0.6.x86_64 2025-05-07T19:42:59.2312615Z nettle-3.10.1-1.amzn2023.0.1.x86_64 2025-05-07T19:42:59.2313247Z openldap-2.4.57-6.amzn2023.0.7.x86_64 2025-05-07T19:42:59.2313780Z openssh-8.7p1-8.amzn2023.0.14.x86_64 2025-05-07T19:42:59.2314359Z openssh-clients-8.7p1-8.amzn2023.0.14.x86_64 2025-05-07T19:42:59.2314889Z pam-1.5.1-8.amzn2023.0.4.x86_64 2025-05-07T19:42:59.2315412Z pciutils-3.7.0-3.amzn2023.0.2.x86_64 2025-05-07T19:42:59.2315969Z pciutils-libs-3.7.0-3.amzn2023.0.2.x86_64 2025-05-07T19:42:59.2316541Z perl-AutoLoader-5.74-477.amzn2023.0.6.noarch 2025-05-07T19:42:59.2317104Z perl-B-1.80-477.amzn2023.0.6.x86_64 2025-05-07T19:42:59.2317635Z perl-Carp-1.50-458.amzn2023.0.2.noarch 2025-05-07T19:42:59.2318325Z 
perl-Class-Struct-0.66-477.amzn2023.0.6.noarch 2025-05-07T19:42:59.2318902Z perl-Data-Dumper-2.174-460.amzn2023.0.2.x86_64 2025-05-07T19:42:59.2319495Z perl-Digest-1.20-1.amzn2023.0.2.noarch 2025-05-07T19:42:59.2320071Z perl-Digest-MD5-2.58-2.amzn2023.0.2.x86_64 2025-05-07T19:42:59.2320641Z perl-DynaLoader-1.47-477.amzn2023.0.6.x86_64 2025-05-07T19:42:59.2321213Z perl-Encode-4:3.15-462.amzn2023.0.2.x86_64 2025-05-07T19:42:59.2321757Z perl-Errno-1.30-477.amzn2023.0.6.x86_64 2025-05-07T19:42:59.2322329Z perl-Error-1:0.17029-5.amzn2023.0.2.noarch 2025-05-07T19:42:59.2322905Z perl-Exporter-5.74-459.amzn2023.0.2.noarch 2025-05-07T19:42:59.2323472Z perl-Fcntl-1.13-477.amzn2023.0.6.x86_64 2025-05-07T19:42:59.2324066Z perl-File-Basename-2.85-477.amzn2023.0.6.noarch 2025-05-07T19:42:59.2324713Z perl-File-Find-1.37-477.amzn2023.0.6.noarch 2025-05-07T19:42:59.2325385Z perl-File-Path-2.18-2.amzn2023.0.2.noarch 2025-05-07T19:42:59.2325899Z perl-File-Temp-1:0.231.100-2.amzn2023.0.2.noarch 2025-05-07T19:42:59.2326423Z perl-File-stat-1.09-477.amzn2023.0.6.noarch 2025-05-07T19:42:59.2326967Z perl-FileHandle-2.03-477.amzn2023.0.6.noarch 2025-05-07T19:42:59.2327495Z perl-Getopt-Long-1:2.52-2.amzn2023.0.2.noarch 2025-05-07T19:42:59.2328037Z perl-Getopt-Std-1.12-477.amzn2023.0.6.noarch 2025-05-07T19:42:59.2328551Z perl-Git-2.47.1-1.amzn2023.0.2.noarch 2025-05-07T19:42:59.2329080Z perl-HTTP-Tiny-0.078-1.amzn2023.0.3.noarch 2025-05-07T19:42:59.2329602Z perl-IO-1.43-477.amzn2023.0.6.x86_64 2025-05-07T19:42:59.2330109Z perl-IO-Socket-IP-0.41-3.amzn2023.0.2.noarch 2025-05-07T19:42:59.2330664Z perl-IO-Socket-SSL-2.075-1.amzn2023.0.2.noarch 2025-05-07T19:42:59.2331194Z perl-IPC-Open3-1.21-477.amzn2023.0.6.noarch 2025-05-07T19:42:59.2331735Z perl-MIME-Base64-3.16-2.amzn2023.0.2.x86_64 2025-05-07T19:42:59.2332275Z perl-Mozilla-CA-20200520-4.amzn2023.0.2.noarch 2025-05-07T19:42:59.2332822Z perl-NDBM_File-1.15-477.amzn2023.0.6.x86_64 2025-05-07T19:42:59.2333345Z 
perl-Net-SSLeay-1.94-1.amzn2023.0.1.x86_64 2025-05-07T19:42:59.2333861Z perl-POSIX-1.94-477.amzn2023.0.6.x86_64 2025-05-07T19:42:59.2334395Z perl-PathTools-3.78-459.amzn2023.0.2.x86_64 2025-05-07T19:42:59.2334923Z perl-Pod-Escapes-1:1.07-458.amzn2023.0.2.noarch 2025-05-07T19:42:59.2335466Z perl-Pod-Perldoc-3.28.01-459.amzn2023.0.3.noarch 2025-05-07T19:42:59.2335988Z perl-Pod-Simple-1:3.42-2.amzn2023.0.2.noarch 2025-05-07T19:42:59.2336511Z perl-Pod-Usage-4:2.01-2.amzn2023.0.2.noarch 2025-05-07T19:42:59.2337043Z perl-Scalar-List-Utils-4:1.56-459.amzn2023.0.2.x86_64 2025-05-07T19:42:59.2337578Z perl-SelectSaver-1.02-477.amzn2023.0.6.noarch 2025-05-07T19:42:59.2338109Z perl-Socket-4:2.032-1.amzn2023.0.2.x86_64 2025-05-07T19:42:59.2338684Z perl-Storable-1:3.21-458.amzn2023.0.2.x86_64 2025-05-07T19:42:59.2339218Z perl-Symbol-1.08-477.amzn2023.0.6.noarch 2025-05-07T19:42:59.2339772Z perl-Term-ANSIColor-5.01-459.amzn2023.0.2.noarch 2025-05-07T19:42:59.2340313Z perl-Term-Cap-1.17-458.amzn2023.0.2.noarch 2025-05-07T19:42:59.2340860Z perl-TermReadKey-2.38-9.amzn2023.0.2.x86_64 2025-05-07T19:42:59.2341406Z perl-Text-ParseWords-3.30-458.amzn2023.0.2.noarch 2025-05-07T19:42:59.2341979Z perl-Text-Tabs+Wrap-2021.0726-1.amzn2023.0.1.noarch 2025-05-07T19:42:59.2342507Z perl-Time-Local-2:1.300-5.amzn2023.0.2.noarch 2025-05-07T19:42:59.2343020Z perl-URI-5.09-1.amzn2023.0.2.noarch 2025-05-07T19:42:59.2343538Z perl-base-2.27-477.amzn2023.0.6.noarch 2025-05-07T19:42:59.2344055Z perl-constant-1.33-459.amzn2023.0.2.noarch 2025-05-07T19:42:59.2344580Z perl-if-0.60.800-477.amzn2023.0.6.noarch 2025-05-07T19:42:59.2345232Z perl-interpreter-4:5.32.1-477.amzn2023.0.6.x86_64 2025-05-07T19:42:59.2345940Z perl-lib-0.65-477.amzn2023.0.6.x86_64 2025-05-07T19:42:59.2346486Z perl-libnet-3.13-2.amzn2023.0.2.noarch 2025-05-07T19:42:59.2347013Z perl-libs-4:5.32.1-477.amzn2023.0.6.x86_64 2025-05-07T19:42:59.2347539Z perl-mro-1.23-477.amzn2023.0.6.x86_64 2025-05-07T19:42:59.2348068Z 
perl-overload-1.31-477.amzn2023.0.6.noarch 2025-05-07T19:42:59.2348649Z perl-overloading-0.02-477.amzn2023.0.6.noarch 2025-05-07T19:42:59.2349204Z perl-parent-1:0.238-458.amzn2023.0.2.noarch 2025-05-07T19:42:59.2349758Z perl-podlators-1:4.14-458.amzn2023.0.2.noarch 2025-05-07T19:42:59.2350311Z perl-subs-1.03-477.amzn2023.0.6.noarch 2025-05-07T19:42:59.2350836Z perl-vars-1.05-477.amzn2023.0.6.noarch 2025-05-07T19:42:59.2351376Z shadow-utils-2:4.9-12.amzn2023.0.4.x86_64 2025-05-07T19:42:59.2351877Z sudo-1.9.15-1.p5.amzn2023.0.1.x86_64 2025-05-07T19:42:59.2352414Z sudo-python-plugin-1.9.15-1.p5.amzn2023.0.1.x86_64 2025-05-07T19:42:59.2353040Z systemd-libs-252.23-3.amzn2023.x86_64 2025-05-07T19:42:59.2353744Z tar-2:1.34-1.amzn2023.0.4.x86_64 2025-05-07T19:42:59.2354321Z util-linux-2.37.4-1.amzn2023.0.4.x86_64 2025-05-07T19:42:59.2354866Z util-linux-core-2.37.4-1.amzn2023.0.4.x86_64 2025-05-07T19:42:59.2355403Z wget-1.21.3-1.amzn2023.0.4.x86_64 2025-05-07T19:42:59.2355897Z which-2.21-26.amzn2023.0.2.x86_64 2025-05-07T19:42:59.2356223Z 2025-05-07T19:42:59.2356317Z Complete! 
2025-05-07T19:42:59.3056320Z ##[group]Run actions/checkout@v4 2025-05-07T19:42:59.3056625Z with: 2025-05-07T19:42:59.3056834Z submodules: true 2025-05-07T19:42:59.3057060Z repository: pytorch/FBGEMM 2025-05-07T19:42:59.3057482Z token: *** 2025-05-07T19:42:59.3057679Z ssh-strict: true 2025-05-07T19:42:59.3057904Z ssh-user: git 2025-05-07T19:42:59.3058115Z persist-credentials: true 2025-05-07T19:42:59.3058373Z clean: true 2025-05-07T19:42:59.3058604Z sparse-checkout-cone-mode: true 2025-05-07T19:42:59.3059038Z fetch-depth: 1 2025-05-07T19:42:59.3059255Z fetch-tags: false 2025-05-07T19:42:59.3059460Z show-progress: true 2025-05-07T19:42:59.3059690Z lfs: false 2025-05-07T19:42:59.3059886Z set-safe-directory: true 2025-05-07T19:42:59.3060160Z env: 2025-05-07T19:42:59.3060360Z PRELUDE: .github/scripts/setup_env.bash 2025-05-07T19:42:59.3060666Z BUILD_ENV: build_binary 2025-05-07T19:42:59.3060894Z BUILD_TARGET: default 2025-05-07T19:42:59.3061126Z BUILD_VARIANT: cuda 2025-05-07T19:42:59.3061394Z BUILD_CUDA_VERSION: 11.8.0 2025-05-07T19:42:59.3061626Z ##[endgroup] 2025-05-07T19:42:59.3101239Z ##[command]/usr/bin/docker exec 2b02554cc61113bb96fb80bbab95670dde250cea5f4d3e11972b04e9d3bcf9fd sh -c "cat /etc/*release | grep ^ID" 2025-05-07T19:42:59.7090137Z Syncing repository: pytorch/FBGEMM 2025-05-07T19:42:59.7091605Z ##[group]Getting Git version info 2025-05-07T19:42:59.7091957Z Working directory is '/__w/FBGEMM/FBGEMM' 2025-05-07T19:42:59.7092546Z [command]/usr/bin/git version 2025-05-07T19:42:59.7092883Z git version 2.47.1 2025-05-07T19:42:59.7093880Z ##[endgroup] 2025-05-07T19:42:59.7098244Z Temporarily overriding HOME='/__w/_temp/557b51b1-2bce-4771-8d43-c4652586962d' before making global git config changes 2025-05-07T19:42:59.7099022Z Adding repository directory to the temporary git global config as a safe directory 2025-05-07T19:42:59.7100087Z [command]/usr/bin/git config --global --add safe.directory /__w/FBGEMM/FBGEMM 2025-05-07T19:42:59.7128555Z [command]/usr/bin/git 
config --local --get remote.origin.url 2025-05-07T19:42:59.7148420Z https://github.com/pytorch/FBGEMM 2025-05-07T19:42:59.7159703Z ##[group]Removing previously created refs, to avoid conflicts 2025-05-07T19:42:59.7162477Z [command]/usr/bin/git rev-parse --symbolic-full-name --verify --quiet HEAD 2025-05-07T19:42:59.7183731Z HEAD 2025-05-07T19:42:59.7213855Z ##[endgroup] 2025-05-07T19:42:59.7214564Z [command]/usr/bin/git submodule status 2025-05-07T19:42:59.7585600Z e5d7c0bd5d9aec44d68830187138149e6a8c4e32 external/asmjit (e5d7c0b) 2025-05-07T19:42:59.7657591Z 4a61bdd4bd4ed730e078aebc7c0fcf046ff29406 external/composable_kernel (remotes/origin/FBGEMM) 2025-05-07T19:42:59.7764865Z 6543fec09b2f04ac4a666882998b534afc9c1349 external/cpuinfo (6543fec) 2025-05-07T19:42:59.7831729Z 3ed8d2ec4ba35ef5d9d8353826209b6f868f63d3 external/cutlass (remotes/origin/FBGEMM) 2025-05-07T19:42:59.8063435Z f8d7d77c06936315286eb55f8de22cd23c188571 external/googletest (release-1.8.0-3335-gf8d7d77c) 2025-05-07T19:42:59.8139551Z 420084499c7c1e1c2d801922f40df202eac5f3a0 external/hipify_torch (remotes/origin/mmelesse-9-g4200844) 2025-05-07T19:42:59.8174710Z 9cca280a4d0ccf0c08f47a99aa71d1b0e52f8d03 external/json (v3.11.2-84-g9cca280a) 2025-05-07T19:42:59.8193808Z ##[group]Cleaning the repository 2025-05-07T19:42:59.8194782Z [command]/usr/bin/git clean -ffdx 2025-05-07T19:42:59.9167423Z Removing build_only/ 2025-05-07T19:42:59.9167819Z Removing collect_env.py 2025-05-07T19:42:59.9168168Z Removing fbgemm_gpu/fbgemm_gpu/docs/version.py 2025-05-07T19:42:59.9173635Z [command]/usr/bin/git reset --hard HEAD 2025-05-07T19:43:00.0278372Z HEAD is now at a5ab0b0 Merge 3e0eb9844c62b4a9cef00aa8fd072a26f76b40ac into fd4df5f456e0cca514bacd98a39efb72990fd9f4 2025-05-07T19:43:00.0282754Z ##[endgroup] 2025-05-07T19:43:00.0286113Z ##[group]Disabling automatic garbage collection 2025-05-07T19:43:00.0290534Z [command]/usr/bin/git config --local gc.auto 0 2025-05-07T19:43:00.0316264Z ##[endgroup] 
2025-05-07T19:43:00.0316693Z ##[group]Setting up auth 2025-05-07T19:43:00.0323688Z [command]/usr/bin/git config --local --name-only --get-regexp core\.sshCommand 2025-05-07T19:43:00.0350464Z [command]/usr/bin/git submodule foreach --recursive sh -c "git config --local --name-only --get-regexp 'core\.sshCommand' && git config --local --unset-all 'core.sshCommand' || :" 2025-05-07T19:43:00.0628537Z Entering 'external/asmjit' 2025-05-07T19:43:00.0677284Z Entering 'external/composable_kernel' 2025-05-07T19:43:00.0737139Z Entering 'external/cpuinfo' 2025-05-07T19:43:00.0796719Z Entering 'external/cutlass' 2025-05-07T19:43:00.0871520Z Entering 'external/googletest' 2025-05-07T19:43:00.0920700Z Entering 'external/hipify_torch' 2025-05-07T19:43:00.0990990Z Entering 'external/json' 2025-05-07T19:43:00.1053829Z [command]/usr/bin/git config --local --name-only --get-regexp http\.https\:\/\/github\.com\/\.extraheader 2025-05-07T19:43:00.1078910Z [command]/usr/bin/git submodule foreach --recursive sh -c "git config --local --name-only --get-regexp 'http\.https\:\/\/github\.com\/\.extraheader' && git config --local --unset-all 'http.https://github.com/.extraheader' || :" 2025-05-07T19:43:00.1352281Z Entering 'external/asmjit' 2025-05-07T19:43:00.1395796Z Entering 'external/composable_kernel' 2025-05-07T19:43:00.1455585Z Entering 'external/cpuinfo' 2025-05-07T19:43:00.1504174Z Entering 'external/cutlass' 2025-05-07T19:43:00.1556991Z Entering 'external/googletest' 2025-05-07T19:43:00.1603280Z Entering 'external/hipify_torch' 2025-05-07T19:43:00.1653851Z Entering 'external/json' 2025-05-07T19:43:00.1713410Z [command]/usr/bin/git config --local http.https://github.com/.extraheader AUTHORIZATION: basic *** 2025-05-07T19:43:00.1752457Z ##[endgroup] 2025-05-07T19:43:00.1753018Z ##[group]Fetching the repository 2025-05-07T19:43:00.1756669Z [command]/usr/bin/git -c protocol.version=2 fetch --no-tags --prune --no-recurse-submodules --depth=1 origin 
+a2f4c52051596e74bc8c16e3d2867a4ecdd271e0:refs/remotes/pull/4066/merge 2025-05-07T19:43:00.3715057Z From https://github.com/pytorch/FBGEMM 2025-05-07T19:43:00.3715793Z + a5ab0b0...a2f4c52 a2f4c52051596e74bc8c16e3d2867a4ecdd271e0 -> pull/4066/merge (forced update) 2025-05-07T19:43:00.3736212Z ##[endgroup] 2025-05-07T19:43:00.3737361Z ##[group]Determining the checkout info 2025-05-07T19:43:00.3738642Z ##[endgroup] 2025-05-07T19:43:00.3739418Z [command]/usr/bin/git sparse-checkout disable 2025-05-07T19:43:00.3874631Z [command]/usr/bin/git config --local --unset-all extensions.worktreeConfig 2025-05-07T19:43:00.3901603Z ##[group]Checking out the ref 2025-05-07T19:43:00.3903041Z [command]/usr/bin/git checkout --progress --force refs/remotes/pull/4066/merge 2025-05-07T19:43:00.4156411Z Warning: you are leaving 1 commit behind, not connected to 2025-05-07T19:43:00.4157588Z any of your branches: 2025-05-07T19:43:00.4158037Z 2025-05-07T19:43:00.4159156Z a5ab0b0 Merge 3e0eb9844c62b4a9cef00aa8fd072a26f76b40ac into fd4df5f456e0cca514bacd98a39efb72990fd9f4 2025-05-07T19:43:00.4160562Z 2025-05-07T19:43:00.4161201Z If you want to keep it by creating a new branch, this may be a good time 2025-05-07T19:43:00.4162371Z to do so with: 2025-05-07T19:43:00.4162742Z 2025-05-07T19:43:00.4163106Z git branch a5ab0b0 2025-05-07T19:43:00.4163341Z 2025-05-07T19:43:00.4163737Z HEAD is now at a2f4c52 Merge 6060cd4b5f971680caecdcc657faccb5720d1c3e into fd4df5f456e0cca514bacd98a39efb72990fd9f4 2025-05-07T19:43:00.4164973Z ##[endgroup] 2025-05-07T19:43:00.4165444Z ##[group]Setting up auth for fetching submodules 2025-05-07T19:43:00.4166075Z [command]/usr/bin/git config --global http.https://github.com/.extraheader AUTHORIZATION: basic *** 2025-05-07T19:43:00.4212646Z [command]/usr/bin/git config --global --unset-all url.https://github.com/.insteadOf 2025-05-07T19:43:00.4235903Z [command]/usr/bin/git config --global --add url.https://github.com/.insteadOf git@github.com: 2025-05-07T19:43:00.4259761Z 
[command]/usr/bin/git config --global --add url.https://github.com/.insteadOf org-21003710@github.com: 2025-05-07T19:43:00.4281168Z ##[endgroup] 2025-05-07T19:43:00.4282921Z ##[group]Fetching submodules 2025-05-07T19:43:00.4283825Z [command]/usr/bin/git submodule sync 2025-05-07T19:43:00.4594799Z Synchronizing submodule url for 'external/asmjit' 2025-05-07T19:43:00.4596507Z Synchronizing submodule url for 'external/composable_kernel' 2025-05-07T19:43:00.4597844Z Synchronizing submodule url for 'external/cpuinfo' 2025-05-07T19:43:00.4599022Z Synchronizing submodule url for 'external/cutlass' 2025-05-07T19:43:00.4599903Z Synchronizing submodule url for 'external/googletest' 2025-05-07T19:43:00.4600647Z Synchronizing submodule url for 'external/hipify_torch' 2025-05-07T19:43:00.4601055Z Synchronizing submodule url for 'external/json' 2025-05-07T19:43:00.4602229Z [command]/usr/bin/git -c protocol.version=2 submodule update --init --force --depth=1 2025-05-07T19:43:00.5358049Z Submodule path 'external/asmjit': checked out 'e5d7c0bd5d9aec44d68830187138149e6a8c4e32' 2025-05-07T19:43:00.8073456Z Submodule path 'external/composable_kernel': checked out '4a61bdd4bd4ed730e078aebc7c0fcf046ff29406' 2025-05-07T19:43:00.9078591Z Submodule path 'external/cpuinfo': checked out '6543fec09b2f04ac4a666882998b534afc9c1349' 2025-05-07T19:43:01.5697809Z Submodule path 'external/cutlass': checked out '3ed8d2ec4ba35ef5d9d8353826209b6f868f63d3' 2025-05-07T19:43:01.6123615Z Submodule path 'external/googletest': checked out 'f8d7d77c06936315286eb55f8de22cd23c188571' 2025-05-07T19:43:01.6202621Z Submodule path 'external/hipify_torch': checked out '420084499c7c1e1c2d801922f40df202eac5f3a0' 2025-05-07T19:43:01.7319596Z Submodule path 'external/json': checked out '9cca280a4d0ccf0c08f47a99aa71d1b0e52f8d03' 2025-05-07T19:43:01.7330578Z [command]/usr/bin/git submodule foreach git config --local gc.auto 0 2025-05-07T19:43:01.7611184Z Entering 'external/asmjit' 2025-05-07T19:43:01.7643556Z Entering 
'external/composable_kernel' 2025-05-07T19:43:01.7672063Z Entering 'external/cpuinfo' 2025-05-07T19:43:01.7703246Z Entering 'external/cutlass' 2025-05-07T19:43:01.7733206Z Entering 'external/googletest' 2025-05-07T19:43:01.7756396Z Entering 'external/hipify_torch' 2025-05-07T19:43:01.7783035Z Entering 'external/json' 2025-05-07T19:43:01.7824734Z ##[endgroup] 2025-05-07T19:43:01.7825183Z ##[group]Persisting credentials for submodules 2025-05-07T19:43:01.7826236Z [command]/usr/bin/git submodule foreach --recursive sh -c "git config --local --name-only --get-regexp 'url\.https\:\/\/github\.com\/\.insteadOf' && git config --local --unset-all 'url.https://github.com/.insteadOf' || :" 2025-05-07T19:43:01.8092920Z Entering 'external/asmjit' 2025-05-07T19:43:01.8121873Z url.https://github.com/.insteadof 2025-05-07T19:43:01.8122252Z url.https://github.com/.insteadof 2025-05-07T19:43:01.8159815Z Entering 'external/composable_kernel' 2025-05-07T19:43:01.8194468Z url.https://github.com/.insteadof 2025-05-07T19:43:01.8194871Z url.https://github.com/.insteadof 2025-05-07T19:43:01.8230338Z Entering 'external/cpuinfo' 2025-05-07T19:43:01.8274921Z url.https://github.com/.insteadof 2025-05-07T19:43:01.8275941Z url.https://github.com/.insteadof 2025-05-07T19:43:01.8305658Z Entering 'external/cutlass' 2025-05-07T19:43:01.8342758Z url.https://github.com/.insteadof 2025-05-07T19:43:01.8343159Z url.https://github.com/.insteadof 2025-05-07T19:43:01.8391436Z Entering 'external/googletest' 2025-05-07T19:43:01.8433962Z url.https://github.com/.insteadof 2025-05-07T19:43:01.8435004Z url.https://github.com/.insteadof 2025-05-07T19:43:01.8472488Z Entering 'external/hipify_torch' 2025-05-07T19:43:01.8503736Z url.https://github.com/.insteadof 2025-05-07T19:43:01.8504194Z url.https://github.com/.insteadof 2025-05-07T19:43:01.8533598Z Entering 'external/json' 2025-05-07T19:43:01.8578483Z url.https://github.com/.insteadof 2025-05-07T19:43:01.8578898Z url.https://github.com/.insteadof 
2025-05-07T19:43:01.8634401Z [command]/usr/bin/git submodule foreach sh -c "git config --local 'http.https://github.com/.extraheader' 'AUTHORIZATION: basic ***' && git config --local --show-origin --name-only --get-regexp remote.origin.url" 2025-05-07T19:43:01.8951947Z Entering 'external/asmjit' 2025-05-07T19:43:01.8997859Z file:/__w/FBGEMM/FBGEMM/.git/modules/external/asmjit/config remote.origin.url 2025-05-07T19:43:01.8998441Z Entering 'external/composable_kernel' 2025-05-07T19:43:01.9046095Z file:/__w/FBGEMM/FBGEMM/.git/modules/external/composable_kernel/config remote.origin.url 2025-05-07T19:43:01.9047673Z Entering 'external/cpuinfo' 2025-05-07T19:43:01.9097018Z file:/__w/FBGEMM/FBGEMM/.git/modules/external/cpuinfo/config remote.origin.url 2025-05-07T19:43:01.9097540Z Entering 'external/cutlass' 2025-05-07T19:43:01.9150979Z file:/__w/FBGEMM/FBGEMM/.git/modules/external/cutlass/config remote.origin.url 2025-05-07T19:43:01.9152627Z Entering 'external/googletest' 2025-05-07T19:43:01.9196635Z file:/__w/FBGEMM/FBGEMM/.git/modules/external/googletest/config remote.origin.url 2025-05-07T19:43:01.9197860Z Entering 'external/hipify_torch' 2025-05-07T19:43:01.9243173Z file:/__w/FBGEMM/FBGEMM/.git/modules/external/hipify_torch/config remote.origin.url 2025-05-07T19:43:01.9244658Z Entering 'external/json' 2025-05-07T19:43:01.9291453Z file:/__w/FBGEMM/FBGEMM/.git/modules/external/json/config remote.origin.url 2025-05-07T19:43:01.9381459Z [command]/usr/bin/git submodule foreach git config --local --add 'url.https://github.com/.insteadOf' 'git@github.com:' 2025-05-07T19:43:01.9658431Z Entering 'external/asmjit' 2025-05-07T19:43:01.9682307Z Entering 'external/composable_kernel' 2025-05-07T19:43:01.9708056Z Entering 'external/cpuinfo' 2025-05-07T19:43:01.9734007Z Entering 'external/cutlass' 2025-05-07T19:43:01.9767183Z Entering 'external/googletest' 2025-05-07T19:43:01.9795653Z Entering 'external/hipify_torch' 2025-05-07T19:43:01.9823824Z Entering 'external/json' 
2025-05-07T19:43:01.9868287Z [command]/usr/bin/git submodule foreach git config --local --add 'url.https://github.com/.insteadOf' 'org-21003710@github.com:' 2025-05-07T19:43:02.0147651Z Entering 'external/asmjit' 2025-05-07T19:43:02.0173128Z Entering 'external/composable_kernel' 2025-05-07T19:43:02.0206170Z Entering 'external/cpuinfo' 2025-05-07T19:43:02.0236865Z Entering 'external/cutlass' 2025-05-07T19:43:02.0269173Z Entering 'external/googletest' 2025-05-07T19:43:02.0297504Z Entering 'external/hipify_torch' 2025-05-07T19:43:02.0322343Z Entering 'external/json' 2025-05-07T19:43:02.0375159Z ##[endgroup] 2025-05-07T19:43:02.0405257Z [command]/usr/bin/git log -1 --format=%H 2025-05-07T19:43:02.0428211Z a2f4c52051596e74bc8c16e3d2867a4ecdd271e0 2025-05-07T19:43:02.0597291Z ##[group]Run . $PRELUDE; print_system_info 2025-05-07T19:43:02.0597904Z . $PRELUDE; print_system_info 2025-05-07T19:43:02.0598730Z shell: bash --noprofile --norc -e -o pipefail {0} 2025-05-07T19:43:02.0599111Z env: 2025-05-07T19:43:02.0599347Z PRELUDE: .github/scripts/setup_env.bash 2025-05-07T19:43:02.0599695Z BUILD_ENV: build_binary 2025-05-07T19:43:02.0599968Z BUILD_TARGET: default 2025-05-07T19:43:02.0600241Z BUILD_VARIANT: cuda 2025-05-07T19:43:02.0600532Z BUILD_CUDA_VERSION: 11.8.0 2025-05-07T19:43:02.0600800Z ##[endgroup] 2025-05-07T19:43:02.5292738Z ################################################################################ 2025-05-07T19:43:02.5293195Z # Print System Info 2025-05-07T19:43:02.5293457Z # 2025-05-07T19:43:02.5304621Z # [2025-05-07T19:43:02.530Z] + print_system_info 2025-05-07T19:43:02.5305024Z ################################################################################ 2025-05-07T19:43:02.5305295Z 2025-05-07T19:43:02.5305533Z ################################################################################ 2025-05-07T19:43:02.5305934Z [INFO] Printing environment variables ... 
2025-05-07T19:43:02.5308021Z + printenv 2025-05-07T19:43:02.5308164Z 2025-05-07T19:43:02.5318180Z GITHUB_WORKSPACE=/__w/FBGEMM/FBGEMM 2025-05-07T19:43:02.5319133Z BUILD_VARIANT=cuda 2025-05-07T19:43:02.5319814Z HOSTNAME=2b02554cc611 2025-05-07T19:43:02.5321072Z GITHUB_PATH=/__w/_temp/_runner_file_commands/add_path_0cfd9d0e-3b49-4008-b4b4-9e78d8e5bcfb 2025-05-07T19:43:02.5321750Z GITHUB_ACTION=__run_2 2025-05-07T19:43:02.5322017Z GITHUB_RUN_NUMBER=10601 2025-05-07T19:43:02.5322273Z RUNNER_NAME=i-032cc121644da911c 2025-05-07T19:43:02.5322586Z GITHUB_REPOSITORY_OWNER_ID=21003710 2025-05-07T19:43:02.5322905Z PLATFORM_NAME_LC=linux-x86_64 2025-05-07T19:43:02.5323201Z MACHINE_NAME_LC=x86_64 2025-05-07T19:43:02.5323450Z GITHUB_TRIGGERING_ACTOR=q10 2025-05-07T19:43:02.5323750Z PRELUDE=.github/scripts/setup_env.bash 2025-05-07T19:43:02.5324055Z GITHUB_REF_TYPE=branch 2025-05-07T19:43:02.5324922Z *** 2025-05-07T19:43:02.5325160Z GITHUB_REPOSITORY_ID=150154628 2025-05-07T19:43:02.5325878Z GITHUB_ACTIONS=true 2025-05-07T19:43:02.5326165Z GITHUB_SHA=a2f4c52051596e74bc8c16e3d2867a4ecdd271e0 2025-05-07T19:43:02.5326737Z GITHUB_WORKFLOW_REF=pytorch/FBGEMM/.github/workflows/fbgemm_gpu_ci_cuda.yml@refs/pull/4066/merge 2025-05-07T19:43:02.5327406Z RUNNER_ENVIRONMENT=self-hosted 2025-05-07T19:43:02.5327673Z GITHUB_REF=refs/pull/4066/merge 2025-05-07T19:43:02.5327937Z RUNNER_OS=Linux 2025-05-07T19:43:02.5328173Z GITHUB_REF_PROTECTED=false 2025-05-07T19:43:02.5328415Z HOME=/github/home 2025-05-07T19:43:02.5328679Z GITHUB_API_URL=https://api.github.com 2025-05-07T19:43:02.5328962Z RUNNER_ARCH=X64 2025-05-07T19:43:02.5329194Z RUNNER_TEMP=/__w/_temp 2025-05-07T19:43:02.5329415Z BUILD_TARGET=default 2025-05-07T19:43:02.5329831Z GITHUB_STATE=/__w/_temp/_runner_file_commands/save_state_0cfd9d0e-3b49-4008-b4b4-9e78d8e5bcfb 2025-05-07T19:43:02.5330454Z GITHUB_ENV=/__w/_temp/_runner_file_commands/set_env_0cfd9d0e-3b49-4008-b4b4-9e78d8e5bcfb 2025-05-07T19:43:02.5330938Z 
GITHUB_EVENT_PATH=/github/workflow/event.json 2025-05-07T19:43:02.5331255Z GITHUB_EVENT_NAME=pull_request 2025-05-07T19:43:02.5331527Z GITHUB_RUN_ID=14891846252 2025-05-07T19:43:02.5331986Z GITHUB_STEP_SUMMARY=/__w/_temp/_runner_file_commands/step_summary_0cfd9d0e-3b49-4008-b4b4-9e78d8e5bcfb 2025-05-07T19:43:02.5332480Z BUILD_ENV=build_binary 2025-05-07T19:43:02.5332737Z GITHUB_ACTOR=q10 2025-05-07T19:43:02.5332948Z GITHUB_RUN_ATTEMPT=1 2025-05-07T19:43:02.5333187Z KERN_NAME_LC=linux 2025-05-07T19:43:02.5333405Z BUILD_CUDA_VERSION=11.8.0 2025-05-07T19:43:02.5333897Z GITHUB_GRAPHQL_URL=https://api.github.com/graphql 2025-05-07T19:43:02.5334247Z PLATFORM_NAME=Linux-x86_64 2025-05-07T19:43:02.5334548Z GITHUB_SERVER_URL=https://github.com 2025-05-07T19:43:02.5334852Z SHLVL=1 2025-05-07T19:43:02.5335053Z GITHUB_ACTOR_ID=255046 2025-05-07T19:43:02.5335309Z RUNNER_TOOL_CACHE=/__w/_tool 2025-05-07T19:43:02.5335842Z GITHUB_WORKFLOW_SHA=6060cd4b5f971680caecdcc657faccb5720d1c3e 2025-05-07T19:43:02.5336433Z GITHUB_REF_NAME=4066/merge 2025-05-07T19:43:02.5336695Z KERN_NAME=Linux 2025-05-07T19:43:02.5336945Z GITHUB_JOB=build_artifact 2025-05-07T19:43:02.5337217Z GITHUB_REPOSITORY=pytorch/FBGEMM 2025-05-07T19:43:02.5337532Z GITHUB_RETENTION_DAYS=90 2025-05-07T19:43:02.5337842Z RUNNER_WORKSPACE=/__w/FBGEMM 2025-05-07T19:43:02.5338123Z GITHUB_ACTION_REPOSITORY= 2025-05-07T19:43:02.5338503Z PATH=/usr/local/sbin:/usr/local/bin:/usr/sbin:/usr/bin:/sbin:/bin 2025-05-07T19:43:02.5338896Z GITHUB_BASE_REF=main 2025-05-07T19:43:02.5339145Z CI=true 2025-05-07T19:43:02.5339365Z GITHUB_REPOSITORY_OWNER=pytorch 2025-05-07T19:43:02.5339690Z GITHUB_HEAD_REF=bm/genai-rocm-oss-6 2025-05-07T19:43:02.5339986Z GITHUB_ACTION_REF= 2025-05-07T19:43:02.5340264Z GITHUB_WORKFLOW=FBGEMM GPU/GenAI CUDA CI 2025-05-07T19:43:02.5340776Z GITHUB_OUTPUT=/__w/_temp/_runner_file_commands/set_output_0cfd9d0e-3b49-4008-b4b4-9e78d8e5bcfb 2025-05-07T19:43:02.5341296Z MACHINE_NAME=x86_64 2025-05-07T19:43:02.5341554Z 
_=/usr/bin/printenv 2025-05-07T19:43:02.5341697Z 2025-05-07T19:43:02.5341815Z ################################################################################ 2025-05-07T19:43:02.5342163Z [INFO] Print ldd version ... 2025-05-07T19:43:02.5342436Z + ldd --version 2025-05-07T19:43:02.5342594Z 2025-05-07T19:43:02.5342682Z ldd (GNU libc) 2.34 2025-05-07T19:43:02.5342960Z Copyright (C) 2021 Free Software Foundation, Inc. 2025-05-07T19:43:02.5343453Z This is free software; see the source for copying conditions. There is NO 2025-05-07T19:43:02.5344027Z warranty; not even for MERCHANTABILITY or FITNESS FOR A PARTICULAR PURPOSE. 2025-05-07T19:43:02.5344532Z Written by Roland McGrath and Ulrich Drepper. 2025-05-07T19:43:02.5344774Z 2025-05-07T19:43:02.5344915Z ################################################################################ 2025-05-07T19:43:02.5345244Z [INFO] Print CPU info ... 2025-05-07T19:43:02.5345532Z + nproc 2025-05-07T19:43:02.5345645Z 2025-05-07T19:43:02.5353176Z 96 2025-05-07T19:43:02.5355821Z 2025-05-07T19:43:02.5356008Z + lscpu 2025-05-07T19:43:02.5356128Z 2025-05-07T19:43:02.5616162Z Architecture: x86_64 2025-05-07T19:43:02.5617296Z CPU op-mode(s): 32-bit, 64-bit 2025-05-07T19:43:02.5618549Z Address sizes: 46 bits physical, 48 bits virtual 2025-05-07T19:43:02.5619337Z Byte Order: Little Endian 2025-05-07T19:43:02.5619899Z CPU(s): 96 2025-05-07T19:43:02.5620214Z On-line CPU(s) list: 0-95 2025-05-07T19:43:02.5620593Z Vendor ID: GenuineIntel 2025-05-07T19:43:02.5621040Z Model name: Intel(R) Xeon(R) Platinum 8275CL CPU @ 3.00GHz 2025-05-07T19:43:02.5621450Z CPU family: 6 2025-05-07T19:43:02.5621766Z Model: 85 2025-05-07T19:43:02.5622065Z Thread(s) per core: 2 2025-05-07T19:43:02.5622504Z Core(s) per socket: 24 2025-05-07T19:43:02.5622814Z Socket(s): 2 2025-05-07T19:43:02.5623095Z Stepping: 7 2025-05-07T19:43:02.5623426Z BogoMIPS: 5999.97 2025-05-07T19:43:02.5625685Z Flags: fpu vme de pse tsc msr pae mce cx8 apic sep mtrr pge mca cmov pat pse36 clflush 
mmx fxsr sse sse2 ss ht syscall nx pdpe1gb rdtscp lm constant_tsc arch_perfmon rep_good nopl xtopology nonstop_tsc cpuid aperfmperf tsc_known_freq pni pclmulqdq monitor ssse3 fma cx16 pcid sse4_1 sse4_2 x2apic movbe popcnt tsc_deadline_timer aes xsave avx f16c rdrand hypervisor lahf_lm abm 3dnowprefetch invpcid_single pti fsgsbase tsc_adjust bmi1 avx2 smep bmi2 erms invpcid mpx avx512f avx512dq rdseed adx smap clflushopt clwb avx512cd avx512bw avx512vl xsaveopt xsavec xgetbv1 xsaves ida arat pku ospke avx512_vnni 2025-05-07T19:43:02.5627955Z Hypervisor vendor: KVM 2025-05-07T19:43:02.5631369Z Virtualization type: full 2025-05-07T19:43:02.5631816Z L1d cache: 1.5 MiB (48 instances) 2025-05-07T19:43:02.5632208Z L1i cache: 1.5 MiB (48 instances) 2025-05-07T19:43:02.5632615Z L2 cache: 48 MiB (48 instances) 2025-05-07T19:43:02.5633314Z L3 cache: 71.5 MiB (2 instances) 2025-05-07T19:43:02.5633698Z NUMA node(s): 2 2025-05-07T19:43:02.5634041Z NUMA node0 CPU(s): 0-23,48-71 2025-05-07T19:43:02.5634478Z NUMA node1 CPU(s): 24-47,72-95 2025-05-07T19:43:02.5634965Z Vulnerability Gather data sampling: Unknown: Dependent on hypervisor status 2025-05-07T19:43:02.5635586Z Vulnerability Itlb multihit: KVM: Mitigation: VMX unsupported 2025-05-07T19:43:02.5636134Z Vulnerability L1tf: Mitigation; PTE Inversion 2025-05-07T19:43:02.5636784Z Vulnerability Mds: Vulnerable: Clear CPU buffers attempted, no microcode; SMT Host state unknown 2025-05-07T19:43:02.5637417Z Vulnerability Meltdown: Mitigation; PTI 2025-05-07T19:43:02.5638070Z Vulnerability Mmio stale data: Vulnerable: Clear CPU buffers attempted, no microcode; SMT Host state unknown 2025-05-07T19:43:02.5638726Z Vulnerability Reg file data sampling: Not affected 2025-05-07T19:43:02.5639144Z Vulnerability Retbleed: Vulnerable 2025-05-07T19:43:02.5639637Z Vulnerability Spec rstack overflow: Not affected 2025-05-07T19:43:02.5640057Z Vulnerability Spec store bypass: Vulnerable 2025-05-07T19:43:02.5640638Z Vulnerability Spectre v1: 
Mitigation; usercopy/swapgs barriers and __user pointer sanitization 2025-05-07T19:43:02.5641530Z Vulnerability Spectre v2: Mitigation; Retpolines; STIBP disabled; RSB filling; PBRSB-eIBRS Not affected; BHI Retpoline 2025-05-07T19:43:02.5642178Z Vulnerability Srbds: Not affected 2025-05-07T19:43:02.5642619Z Vulnerability Tsx async abort: Not affected 2025-05-07T19:43:02.5643475Z 2025-05-07T19:43:02.5643591Z + cat /proc/cpuinfo 2025-05-07T19:43:02.5643878Z 2025-05-07T19:43:02.5643985Z processor : 0 2025-05-07T19:43:02.5644257Z vendor_id : GenuineIntel 2025-05-07T19:43:02.5644558Z cpu family : 6 2025-05-07T19:43:02.5644836Z model : 85 2025-05-07T19:43:02.5645190Z model name : Intel(R) Xeon(R) Platinum 8275CL CPU @ 3.00GHz 2025-05-07T19:43:02.5645573Z stepping : 7 2025-05-07T19:43:02.5645829Z microcode : 0x5003901 2025-05-07T19:43:02.5646085Z cpu MHz : 3300.928 2025-05-07T19:43:02.5646356Z cache size : 36608 KB 2025-05-07T19:43:02.5646624Z physical id : 0 2025-05-07T19:43:02.5646903Z siblings : 48 2025-05-07T19:43:02.5647143Z core id : 0 2025-05-07T19:43:02.5647399Z cpu cores : 24 2025-05-07T19:43:02.5647625Z apicid : 0 2025-05-07T19:43:02.5647888Z initial apicid : 0 2025-05-07T19:43:02.5682685Z fpu : yes 2025-05-07T19:43:02.5683126Z fpu_exception : yes 2025-05-07T19:43:02.5683396Z cpuid level : 13 2025-05-07T19:43:02.5683651Z wp : yes 2025-05-07T19:43:02.5685993Z flags : fpu vme de pse tsc msr pae mce cx8 apic sep mtrr pge mca cmov pat pse36 clflush mmx fxsr sse sse2 ss ht syscall nx pdpe1gb rdtscp lm constant_tsc arch_perfmon rep_good nopl xtopology nonstop_tsc cpuid aperfmperf tsc_known_freq pni pclmulqdq monitor ssse3 fma cx16 pcid sse4_1 sse4_2 x2apic movbe popcnt tsc_deadline_timer aes xsave avx f16c rdrand hypervisor lahf_lm abm 3dnowprefetch invpcid_single pti fsgsbase tsc_adjust bmi1 avx2 smep bmi2 erms invpcid mpx avx512f avx512dq rdseed adx smap clflushopt clwb avx512cd avx512bw avx512vl xsaveopt xsavec xgetbv1 xsaves ida arat pku ospke avx512_vnni 
2025-05-07T19:43:02.5688692Z bugs : cpu_meltdown spectre_v1 spectre_v2 spec_store_bypass l1tf mds swapgs itlb_multihit mmio_stale_data retbleed gds bhi 2025-05-07T19:43:02.5689289Z bogomips : 5999.97 2025-05-07T19:43:02.5689538Z clflush size : 64 2025-05-07T19:43:02.5689769Z cache_alignment : 64 2025-05-07T19:43:02.5690073Z address sizes : 46 bits physical, 48 bits virtual 2025-05-07T19:43:02.5690595Z power management: 2025-05-07T19:43:02.5690763Z 2025-05-07T19:43:02.5690855Z processor : 1 2025-05-07T19:43:02.5691086Z vendor_id : GenuineIntel 2025-05-07T19:43:02.5691354Z cpu family : 6 2025-05-07T19:43:02.5691564Z model : 85 2025-05-07T19:43:02.5691872Z model name : Intel(R) Xeon(R) Platinum 8275CL CPU @ 3.00GHz 2025-05-07T19:43:02.5692248Z stepping : 7 2025-05-07T19:43:02.5692466Z microcode : 0x5003901 2025-05-07T19:43:02.5692835Z cpu MHz : 3213.795 2025-05-07T19:43:02.5693039Z cache size : 36608 KB 2025-05-07T19:43:02.5693277Z physical id : 0 2025-05-07T19:43:02.5693479Z siblings : 48 2025-05-07T19:43:02.5693690Z core id : 1 2025-05-07T19:43:02.5693882Z cpu cores : 24 2025-05-07T19:43:02.5694095Z apicid : 2 2025-05-07T19:43:02.5694288Z initial apicid : 2 2025-05-07T19:43:02.5694509Z fpu : yes 2025-05-07T19:43:02.5694703Z fpu_exception : yes 2025-05-07T19:43:02.5694932Z cpuid level : 13 2025-05-07T19:43:02.5695133Z wp : yes 2025-05-07T19:43:02.5697280Z flags : fpu vme de pse tsc msr pae mce cx8 apic sep mtrr pge mca cmov pat pse36 clflush mmx fxsr sse sse2 ss ht syscall nx pdpe1gb rdtscp lm constant_tsc arch_perfmon rep_good nopl xtopology nonstop_tsc cpuid aperfmperf tsc_known_freq pni pclmulqdq monitor ssse3 fma cx16 pcid sse4_1 sse4_2 x2apic movbe popcnt tsc_deadline_timer aes xsave avx f16c rdrand hypervisor lahf_lm abm 3dnowprefetch invpcid_single pti fsgsbase tsc_adjust bmi1 avx2 smep bmi2 erms invpcid mpx avx512f avx512dq rdseed adx smap clflushopt clwb avx512cd avx512bw avx512vl xsaveopt xsavec xgetbv1 xsaves ida arat pku ospke avx512_vnni 
2025-05-07T19:43:02.5699816Z bugs : cpu_meltdown spectre_v1 spectre_v2 spec_store_bypass l1tf mds swapgs itlb_multihit mmio_stale_data retbleed gds bhi 2025-05-07T19:43:02.5700394Z bogomips : 5999.97 2025-05-07T19:43:02.5700608Z clflush size : 64 2025-05-07T19:43:02.5700840Z cache_alignment : 64 2025-05-07T19:43:02.5701107Z address sizes : 46 bits physical, 48 bits virtual 2025-05-07T19:43:02.5701444Z power management: 2025-05-07T19:43:02.5701577Z 2025-05-07T19:43:02.5701661Z processor : 2 2025-05-07T19:43:02.5701894Z vendor_id : GenuineIntel 2025-05-07T19:43:02.5702582Z cpu family : 6 2025-05-07T19:43:02.5702817Z model : 85 2025-05-07T19:43:02.5703110Z model name : Intel(R) Xeon(R) Platinum 8275CL CPU @ 3.00GHz 2025-05-07T19:43:02.5703498Z stepping : 7 2025-05-07T19:43:02.5703739Z microcode : 0x5003901 2025-05-07T19:43:02.5703973Z cpu MHz : 2640.114 2025-05-07T19:43:02.5704213Z cache size : 36608 KB 2025-05-07T19:43:02.5704444Z physical id : 0 2025-05-07T19:43:02.5704678Z siblings : 48 2025-05-07T19:43:02.5704892Z core id : 2 2025-05-07T19:43:02.5705119Z cpu cores : 24 2025-05-07T19:43:02.5705327Z apicid : 4 2025-05-07T19:43:02.5705556Z initial apicid : 4 2025-05-07T19:43:02.5705779Z fpu : yes 2025-05-07T19:43:02.5706006Z fpu_exception : yes 2025-05-07T19:43:02.5706236Z cpuid level : 13 2025-05-07T19:43:02.5706482Z wp : yes 2025-05-07T19:43:02.5708831Z flags : fpu vme de pse tsc msr pae mce cx8 apic sep mtrr pge mca cmov pat pse36 clflush mmx fxsr sse sse2 ss ht syscall nx pdpe1gb rdtscp lm constant_tsc arch_perfmon rep_good nopl xtopology nonstop_tsc cpuid aperfmperf tsc_known_freq pni pclmulqdq monitor ssse3 fma cx16 pcid sse4_1 sse4_2 x2apic movbe popcnt tsc_deadline_timer aes xsave avx f16c rdrand hypervisor lahf_lm abm 3dnowprefetch invpcid_single pti fsgsbase tsc_adjust bmi1 avx2 smep bmi2 erms invpcid mpx avx512f avx512dq rdseed adx smap clflushopt clwb avx512cd avx512bw avx512vl xsaveopt xsavec xgetbv1 xsaves ida arat pku ospke avx512_vnni 
2025-05-07T19:43:02.5711517Z bugs : cpu_meltdown spectre_v1 spectre_v2 spec_store_bypass l1tf mds swapgs itlb_multihit mmio_stale_data retbleed gds bhi 2025-05-07T19:43:02.5712121Z bogomips : 5999.97 2025-05-07T19:43:02.5712369Z clflush size : 64 2025-05-07T19:43:02.5712601Z cache_alignment : 64 2025-05-07T19:43:02.5712986Z address sizes : 46 bits physical, 48 bits virtual 2025-05-07T19:43:02.5713324Z power management: 2025-05-07T19:43:02.5713488Z 2025-05-07T19:43:02.5713579Z processor : 3 2025-05-07T19:43:02.5714018Z vendor_id : GenuineIntel 2025-05-07T19:43:02.5714284Z cpu family : 6 2025-05-07T19:43:02.5714487Z model : 85 2025-05-07T19:43:02.5714778Z model name : Intel(R) Xeon(R) Platinum 8275CL CPU @ 3.00GHz 2025-05-07T19:43:02.5715143Z stepping : 7 2025-05-07T19:43:02.5715359Z microcode : 0x5003901 2025-05-07T19:43:02.5715609Z cpu MHz : 2999.988 2025-05-07T19:43:02.5715829Z cache size : 36608 KB 2025-05-07T19:43:02.5716080Z physical id : 0 2025-05-07T19:43:02.5716299Z siblings : 48 2025-05-07T19:43:02.5716522Z core id : 3 2025-05-07T19:43:02.5716729Z cpu cores : 24 2025-05-07T19:43:02.5716956Z apicid : 6 2025-05-07T19:43:02.5717158Z initial apicid : 6 2025-05-07T19:43:02.5717400Z fpu : yes 2025-05-07T19:43:02.5717611Z fpu_exception : yes 2025-05-07T19:43:02.5717856Z cpuid level : 13 2025-05-07T19:43:02.5718076Z wp : yes 2025-05-07T19:43:02.5720395Z flags : fpu vme de pse tsc msr pae mce cx8 apic sep mtrr pge mca cmov pat pse36 clflush mmx fxsr sse sse2 ss ht syscall nx pdpe1gb rdtscp lm constant_tsc arch_perfmon rep_good nopl xtopology nonstop_tsc cpuid aperfmperf tsc_known_freq pni pclmulqdq monitor ssse3 fma cx16 pcid sse4_1 sse4_2 x2apic movbe popcnt tsc_deadline_timer aes xsave avx f16c rdrand hypervisor lahf_lm abm 3dnowprefetch invpcid_single pti fsgsbase tsc_adjust bmi1 avx2 smep bmi2 erms invpcid mpx avx512f avx512dq rdseed adx smap clflushopt clwb avx512cd avx512bw avx512vl xsaveopt xsavec xgetbv1 xsaves ida arat pku ospke avx512_vnni 
2025-05-07T19:43:02.5723114Z bugs : cpu_meltdown spectre_v1 spectre_v2 spec_store_bypass l1tf mds swapgs itlb_multihit mmio_stale_data retbleed gds bhi 2025-05-07T19:43:02.5723732Z bogomips : 5999.97 2025-05-07T19:43:02.5723959Z clflush size : 64 2025-05-07T19:43:02.5724205Z cache_alignment : 64 2025-05-07T19:43:02.5724491Z address sizes : 46 bits physical, 48 bits virtual 2025-05-07T19:43:02.5724856Z power management: 2025-05-07T19:43:02.5724997Z 2025-05-07T19:43:02.5725086Z processor : 4 2025-05-07T19:43:02.5725331Z vendor_id : GenuineIntel 2025-05-07T19:43:02.5725688Z cpu family : 6 2025-05-07T19:43:02.5725902Z model : 85 2025-05-07T19:43:02.5726167Z model name : Intel(R) Xeon(R) Platinum 8275CL CPU @ 3.00GHz 2025-05-07T19:43:02.5726615Z stepping : 7 2025-05-07T19:43:02.5726838Z microcode : 0x5003901 2025-05-07T19:43:02.5727062Z cpu MHz : 3286.279 2025-05-07T19:43:02.5727297Z cache size : 36608 KB 2025-05-07T19:43:02.5727518Z physical id : 0 2025-05-07T19:43:02.5727746Z siblings : 48 2025-05-07T19:43:02.5727941Z core id : 4 2025-05-07T19:43:02.5728336Z cpu cores : 24 2025-05-07T19:43:02.5728541Z apicid : 8 2025-05-07T19:43:02.5728757Z initial apicid : 8 2025-05-07T19:43:02.5728973Z fpu : yes 2025-05-07T19:43:02.5729205Z fpu_exception : yes 2025-05-07T19:43:02.5729431Z cpuid level : 13 2025-05-07T19:43:02.5729662Z wp : yes 2025-05-07T19:43:02.5731919Z flags : fpu vme de pse tsc msr pae mce cx8 apic sep mtrr pge mca cmov pat pse36 clflush mmx fxsr sse sse2 ss ht syscall nx pdpe1gb rdtscp lm constant_tsc arch_perfmon rep_good nopl xtopology nonstop_tsc cpuid aperfmperf tsc_known_freq pni pclmulqdq monitor ssse3 fma cx16 pcid sse4_1 sse4_2 x2apic movbe popcnt tsc_deadline_timer aes xsave avx f16c rdrand hypervisor lahf_lm abm 3dnowprefetch invpcid_single pti fsgsbase tsc_adjust bmi1 avx2 smep bmi2 erms invpcid mpx avx512f avx512dq rdseed adx smap clflushopt clwb avx512cd avx512bw avx512vl xsaveopt xsavec xgetbv1 xsaves ida arat pku ospke avx512_vnni 
2025-05-07T19:43:02.5734554Z bugs : cpu_meltdown spectre_v1 spectre_v2 spec_store_bypass l1tf mds swapgs itlb_multihit mmio_stale_data retbleed gds bhi 2025-05-07T19:43:02.5735141Z bogomips : 5999.97 2025-05-07T19:43:02.5735385Z clflush size : 64 2025-05-07T19:43:02.5735613Z cache_alignment : 64 2025-05-07T19:43:02.5735903Z address sizes : 46 bits physical, 48 bits virtual 2025-05-07T19:43:02.5736236Z power management: 2025-05-07T19:43:02.5736396Z 2025-05-07T19:43:02.5736487Z processor : 5 2025-05-07T19:43:02.5736711Z vendor_id : GenuineIntel 2025-05-07T19:43:02.5736972Z cpu family : 6 2025-05-07T19:43:02.5737255Z model : 85 2025-05-07T19:43:02.5737610Z model name : Intel(R) Xeon(R) Platinum 8275CL CPU @ 3.00GHz 2025-05-07T19:43:02.5737989Z stepping : 7 2025-05-07T19:43:02.5738199Z microcode : 0x5003901 2025-05-07T19:43:02.5738448Z cpu MHz : 3297.971 2025-05-07T19:43:02.5738668Z cache size : 36608 KB 2025-05-07T19:43:02.5738912Z physical id : 0 2025-05-07T19:43:02.5739125Z siblings : 48 2025-05-07T19:43:02.5739348Z core id : 5 2025-05-07T19:43:02.5739546Z cpu cores : 24 2025-05-07T19:43:02.5739939Z apicid : 10 2025-05-07T19:43:02.5740178Z initial apicid : 10 2025-05-07T19:43:02.5740416Z fpu : yes 2025-05-07T19:43:02.5740618Z fpu_exception : yes 2025-05-07T19:43:02.5740857Z cpuid level : 13 2025-05-07T19:43:02.5741095Z wp : yes 2025-05-07T19:43:02.5743557Z flags : fpu vme de pse tsc msr pae mce cx8 apic sep mtrr pge mca cmov pat pse36 clflush mmx fxsr sse sse2 ss ht syscall nx pdpe1gb rdtscp lm constant_tsc arch_perfmon rep_good nopl xtopology nonstop_tsc cpuid aperfmperf tsc_known_freq pni pclmulqdq monitor ssse3 fma cx16 pcid sse4_1 sse4_2 x2apic movbe popcnt tsc_deadline_timer aes xsave avx f16c rdrand hypervisor lahf_lm abm 3dnowprefetch invpcid_single pti fsgsbase tsc_adjust bmi1 avx2 smep bmi2 erms invpcid mpx avx512f avx512dq rdseed adx smap clflushopt clwb avx512cd avx512bw avx512vl xsaveopt xsavec xgetbv1 xsaves ida arat pku ospke avx512_vnni 
2025-05-07T19:43:02.5746241Z bugs : cpu_meltdown spectre_v1 spectre_v2 spec_store_bypass l1tf mds swapgs itlb_multihit mmio_stale_data retbleed gds bhi 2025-05-07T19:43:02.5746861Z bogomips : 5999.97 2025-05-07T19:43:02.5747086Z clflush size : 64 2025-05-07T19:43:02.5747331Z cache_alignment : 64 2025-05-07T19:43:02.5747604Z address sizes : 46 bits physical, 48 bits virtual 2025-05-07T19:43:02.5747947Z power management: 2025-05-07T19:43:02.5748084Z 2025-05-07T19:43:02.5748178Z processor : 6 2025-05-07T19:43:02.5748377Z vendor_id : GenuineIntel 2025-05-07T19:43:02.5748625Z cpu family : 6 2025-05-07T19:43:02.5748817Z model : 85 2025-05-07T19:43:02.5749115Z model name : Intel(R) Xeon(R) Platinum 8275CL CPU @ 3.00GHz 2025-05-07T19:43:02.5749454Z stepping : 7 2025-05-07T19:43:02.5749723Z microcode : 0x5003901 2025-05-07T19:43:02.5749937Z cpu MHz : 2999.988 2025-05-07T19:43:02.5750155Z cache size : 36608 KB 2025-05-07T19:43:02.5750380Z physical id : 0 2025-05-07T19:43:02.5750605Z siblings : 48 2025-05-07T19:43:02.5750799Z core id : 6 2025-05-07T19:43:02.5750999Z cpu cores : 24 2025-05-07T19:43:02.5751203Z apicid : 12 2025-05-07T19:43:02.5751389Z initial apicid : 12 2025-05-07T19:43:02.5751623Z fpu : yes 2025-05-07T19:43:02.5751828Z fpu_exception : yes 2025-05-07T19:43:02.5752063Z cpuid level : 13 2025-05-07T19:43:02.5752269Z wp : yes 2025-05-07T19:43:02.5754663Z flags : fpu vme de pse tsc msr pae mce cx8 apic sep mtrr pge mca cmov pat pse36 clflush mmx fxsr sse sse2 ss ht syscall nx pdpe1gb rdtscp lm constant_tsc arch_perfmon rep_good nopl xtopology nonstop_tsc cpuid aperfmperf tsc_known_freq pni pclmulqdq monitor ssse3 fma cx16 pcid sse4_1 sse4_2 x2apic movbe popcnt tsc_deadline_timer aes xsave avx f16c rdrand hypervisor lahf_lm abm 3dnowprefetch invpcid_single pti fsgsbase tsc_adjust bmi1 avx2 smep bmi2 erms invpcid mpx avx512f avx512dq rdseed adx smap clflushopt clwb avx512cd avx512bw avx512vl xsaveopt xsavec xgetbv1 xsaves ida arat pku ospke avx512_vnni 
2025-05-07T19:43:02.5757314Z bugs : cpu_meltdown spectre_v1 spectre_v2 spec_store_bypass l1tf mds swapgs itlb_multihit mmio_stale_data retbleed gds bhi 2025-05-07T19:43:02.5757895Z bogomips : 5999.97 2025-05-07T19:43:02.5758123Z clflush size : 64 2025-05-07T19:43:02.5758341Z cache_alignment : 64 2025-05-07T19:43:02.5758634Z address sizes : 46 bits physical, 48 bits virtual 2025-05-07T19:43:02.5758975Z power management: 2025-05-07T19:43:02.5759106Z 2025-05-07T19:43:02.5759189Z processor : 7 2025-05-07T19:43:02.5759408Z vendor_id : GenuineIntel 2025-05-07T19:43:02.5759647Z cpu family : 6 2025-05-07T19:43:02.5759874Z model : 85 2025-05-07T19:43:02.5760153Z model name : Intel(R) Xeon(R) Platinum 8275CL CPU @ 3.00GHz 2025-05-07T19:43:02.5760608Z stepping : 7 2025-05-07T19:43:02.5760821Z microcode : 0x5003901 2025-05-07T19:43:02.5761072Z cpu MHz : 3181.448 2025-05-07T19:43:02.5761292Z cache size : 36608 KB 2025-05-07T19:43:02.5761539Z physical id : 0 2025-05-07T19:43:02.5761754Z siblings : 48 2025-05-07T19:43:02.5761978Z core id : 7 2025-05-07T19:43:02.5762197Z cpu cores : 24 2025-05-07T19:43:02.5762402Z apicid : 14 2025-05-07T19:43:02.5762628Z initial apicid : 14 2025-05-07T19:43:02.5762850Z fpu : yes 2025-05-07T19:43:02.5763071Z fpu_exception : yes 2025-05-07T19:43:02.5763301Z cpuid level : 13 2025-05-07T19:43:02.5763529Z wp : yes 2025-05-07T19:43:02.5765934Z flags : fpu vme de pse tsc msr pae mce cx8 apic sep mtrr pge mca cmov pat pse36 clflush mmx fxsr sse sse2 ss ht syscall nx pdpe1gb rdtscp lm constant_tsc arch_perfmon rep_good nopl xtopology nonstop_tsc cpuid aperfmperf tsc_known_freq pni pclmulqdq monitor ssse3 fma cx16 pcid sse4_1 sse4_2 x2apic movbe popcnt tsc_deadline_timer aes xsave avx f16c rdrand hypervisor lahf_lm abm 3dnowprefetch invpcid_single pti fsgsbase tsc_adjust bmi1 avx2 smep bmi2 erms invpcid mpx avx512f avx512dq rdseed adx smap clflushopt clwb avx512cd avx512bw avx512vl xsaveopt xsavec xgetbv1 xsaves ida arat pku ospke avx512_vnni 
2025-05-07T19:43:02.5768421Z bugs : cpu_meltdown spectre_v1 spectre_v2 spec_store_bypass l1tf mds swapgs itlb_multihit mmio_stale_data retbleed gds bhi 2025-05-07T19:43:02.5768992Z bogomips : 5999.97 2025-05-07T19:43:02.5769201Z clflush size : 64 2025-05-07T19:43:02.5769427Z cache_alignment : 64 2025-05-07T19:43:02.5769689Z address sizes : 46 bits physical, 48 bits virtual 2025-05-07T19:43:02.5770014Z power management: 2025-05-07T19:43:02.5770142Z 2025-05-07T19:43:02.5770238Z processor : 8 2025-05-07T19:43:02.5770443Z vendor_id : GenuineIntel 2025-05-07T19:43:02.5770685Z cpu family : 6 2025-05-07T19:43:02.5770876Z model : 85 2025-05-07T19:43:02.5771155Z model name : Intel(R) Xeon(R) Platinum 8275CL CPU @ 3.00GHz 2025-05-07T19:43:02.5771486Z stepping : 7 2025-05-07T19:43:02.5771704Z microcode : 0x5003901 2025-05-07T19:43:02.5771920Z cpu MHz : 3207.612 2025-05-07T19:43:02.5772231Z cache size : 36608 KB 2025-05-07T19:43:02.5772447Z physical id : 0 2025-05-07T19:43:02.5772664Z siblings : 48 2025-05-07T19:43:02.5772857Z core id : 8 2025-05-07T19:43:02.5773070Z cpu cores : 24 2025-05-07T19:43:02.5773282Z apicid : 16 2025-05-07T19:43:02.5773478Z initial apicid : 16 2025-05-07T19:43:02.5773698Z fpu : yes 2025-05-07T19:43:02.5773887Z fpu_exception : yes 2025-05-07T19:43:02.5774110Z cpuid level : 13 2025-05-07T19:43:02.5774305Z wp : yes 2025-05-07T19:43:02.5776809Z flags : fpu vme de pse tsc msr pae mce cx8 apic sep mtrr pge mca cmov pat pse36 clflush mmx fxsr sse sse2 ss ht syscall nx pdpe1gb rdtscp lm constant_tsc arch_perfmon rep_good nopl xtopology nonstop_tsc cpuid aperfmperf tsc_known_freq pni pclmulqdq monitor ssse3 fma cx16 pcid sse4_1 sse4_2 x2apic movbe popcnt tsc_deadline_timer aes xsave avx f16c rdrand hypervisor lahf_lm abm 3dnowprefetch invpcid_single pti fsgsbase tsc_adjust bmi1 avx2 smep bmi2 erms invpcid mpx avx512f avx512dq rdseed adx smap clflushopt clwb avx512cd avx512bw avx512vl xsaveopt xsavec xgetbv1 xsaves ida arat pku ospke avx512_vnni 
2025-05-07T19:43:02.5779405Z bugs : cpu_meltdown spectre_v1 spectre_v2 spec_store_bypass l1tf mds swapgs itlb_multihit mmio_stale_data retbleed gds bhi 2025-05-07T19:43:02.5779980Z bogomips : 5999.97 2025-05-07T19:43:02.5780208Z clflush size : 64 2025-05-07T19:43:02.5780425Z cache_alignment : 64 2025-05-07T19:43:02.5780713Z address sizes : 46 bits physical, 48 bits virtual 2025-05-07T19:43:02.5781047Z power management: 2025-05-07T19:43:02.5781180Z 2025-05-07T19:43:02.5781264Z processor : 9 2025-05-07T19:43:02.5781493Z vendor_id : GenuineIntel 2025-05-07T19:43:02.5781733Z cpu family : 6 2025-05-07T19:43:02.5781961Z model : 85 2025-05-07T19:43:02.5782235Z model name : Intel(R) Xeon(R) Platinum 8275CL CPU @ 3.00GHz 2025-05-07T19:43:02.5782601Z stepping : 7 2025-05-07T19:43:02.5782809Z microcode : 0x5003901 2025-05-07T19:43:02.5783114Z cpu MHz : 3264.683 2025-05-07T19:43:02.5783330Z cache size : 36608 KB 2025-05-07T19:43:02.5783572Z physical id : 0 2025-05-07T19:43:02.5783778Z siblings : 48 2025-05-07T19:43:02.5783988Z core id : 9 2025-05-07T19:43:02.5784202Z cpu cores : 24 2025-05-07T19:43:02.5784408Z apicid : 18 2025-05-07T19:43:02.5784633Z initial apicid : 18 2025-05-07T19:43:02.5784849Z fpu : yes 2025-05-07T19:43:02.5785063Z fpu_exception : yes 2025-05-07T19:43:02.5785281Z cpuid level : 13 2025-05-07T19:43:02.5785502Z wp : yes 2025-05-07T19:43:02.5787748Z flags : fpu vme de pse tsc msr pae mce cx8 apic sep mtrr pge mca cmov pat pse36 clflush mmx fxsr sse sse2 ss ht syscall nx pdpe1gb rdtscp lm constant_tsc arch_perfmon rep_good nopl xtopology nonstop_tsc cpuid aperfmperf tsc_known_freq pni pclmulqdq monitor ssse3 fma cx16 pcid sse4_1 sse4_2 x2apic movbe popcnt tsc_deadline_timer aes xsave avx f16c rdrand hypervisor lahf_lm abm 3dnowprefetch invpcid_single pti fsgsbase tsc_adjust bmi1 avx2 smep bmi2 erms invpcid mpx avx512f avx512dq rdseed adx smap clflushopt clwb avx512cd avx512bw avx512vl xsaveopt xsavec xgetbv1 xsaves ida arat pku ospke avx512_vnni 
2025-05-07T19:43:02.5790405Z bugs : cpu_meltdown spectre_v1 spectre_v2 spec_store_bypass l1tf mds swapgs itlb_multihit mmio_stale_data retbleed gds bhi 2025-05-07T19:43:02.5790974Z bogomips : 5999.97 2025-05-07T19:43:02.5791181Z clflush size : 64 2025-05-07T19:43:02.5791407Z cache_alignment : 64 2025-05-07T19:43:02.5791663Z address sizes : 46 bits physical, 48 bits virtual 2025-05-07T19:43:02.5791985Z power management: 2025-05-07T19:43:02.5792112Z 2025-05-07T19:43:02.5792211Z processor : 10 2025-05-07T19:43:02.5792420Z vendor_id : GenuineIntel 2025-05-07T19:43:02.5792665Z cpu family : 6 2025-05-07T19:43:02.5792968Z model : 85 2025-05-07T19:43:02.5793442Z model name : Intel(R) Xeon(R) Platinum 8275CL CPU @ 3.00GHz 2025-05-07T19:43:02.5793801Z stepping : 7 2025-05-07T19:43:02.5794113Z microcode : 0x5003901 2025-05-07T19:43:02.5794347Z cpu MHz : 3770.084 2025-05-07T19:43:02.5794600Z cache size : 36608 KB 2025-05-07T19:43:02.5794833Z physical id : 0 2025-05-07T19:43:02.5795062Z siblings : 48 2025-05-07T19:43:02.5795363Z core id : 10 2025-05-07T19:43:02.5795565Z cpu cores : 24 2025-05-07T19:43:02.5795791Z apicid : 20 2025-05-07T19:43:02.5795997Z initial apicid : 20 2025-05-07T19:43:02.5796234Z fpu : yes 2025-05-07T19:43:02.5796435Z fpu_exception : yes 2025-05-07T19:43:02.5796667Z cpuid level : 13 2025-05-07T19:43:02.5796877Z wp : yes 2025-05-07T19:43:02.5799207Z flags : fpu vme de pse tsc msr pae mce cx8 apic sep mtrr pge mca cmov pat pse36 clflush mmx fxsr sse sse2 ss ht syscall nx pdpe1gb rdtscp lm constant_tsc arch_perfmon rep_good nopl xtopology nonstop_tsc cpuid aperfmperf tsc_known_freq pni pclmulqdq monitor ssse3 fma cx16 pcid sse4_1 sse4_2 x2apic movbe popcnt tsc_deadline_timer aes xsave avx f16c rdrand hypervisor lahf_lm abm 3dnowprefetch invpcid_single pti fsgsbase tsc_adjust bmi1 avx2 smep bmi2 erms invpcid mpx avx512f avx512dq rdseed adx smap clflushopt clwb avx512cd avx512bw avx512vl xsaveopt xsavec xgetbv1 xsaves ida arat pku ospke avx512_vnni 
2025-05-07T19:43:02.5801893Z bugs : cpu_meltdown spectre_v1 spectre_v2 spec_store_bypass l1tf mds swapgs itlb_multihit mmio_stale_data retbleed gds bhi 2025-05-07T19:43:02.5802700Z bogomips : 5999.97 2025-05-07T19:43:02.5802936Z clflush size : 64 2025-05-07T19:43:02.5803158Z cache_alignment : 64 2025-05-07T19:43:02.5803452Z address sizes : 46 bits physical, 48 bits virtual 2025-05-07T19:43:02.5803801Z power management: 2025-05-07T19:43:02.5803939Z 2025-05-07T19:43:02.5804028Z processor : 11 2025-05-07T19:43:02.5804267Z vendor_id : GenuineIntel 2025-05-07T19:43:02.5804511Z cpu family : 6 2025-05-07T19:43:02.5804737Z model : 85 2025-05-07T19:43:02.5805017Z model name : Intel(R) Xeon(R) Platinum 8275CL CPU @ 3.00GHz 2025-05-07T19:43:02.5805385Z stepping : 7 2025-05-07T19:43:02.5805600Z microcode : 0x5003901 2025-05-07T19:43:02.5805850Z cpu MHz : 3324.214 2025-05-07T19:43:02.5806072Z cache size : 36608 KB 2025-05-07T19:43:02.5806439Z physical id : 0 2025-05-07T19:43:02.5806655Z siblings : 48 2025-05-07T19:43:02.5806879Z core id : 11 2025-05-07T19:43:02.5807109Z cpu cores : 24 2025-05-07T19:43:02.5807323Z apicid : 22 2025-05-07T19:43:02.5807555Z initial apicid : 22 2025-05-07T19:43:02.5807779Z fpu : yes 2025-05-07T19:43:02.5808006Z fpu_exception : yes 2025-05-07T19:43:02.5808232Z cpuid level : 13 2025-05-07T19:43:02.5808459Z wp : yes 2025-05-07T19:43:02.5810764Z flags : fpu vme de pse tsc msr pae mce cx8 apic sep mtrr pge mca cmov pat pse36 clflush mmx fxsr sse sse2 ss ht syscall nx pdpe1gb rdtscp lm constant_tsc arch_perfmon rep_good nopl xtopology nonstop_tsc cpuid aperfmperf tsc_known_freq pni pclmulqdq monitor ssse3 fma cx16 pcid sse4_1 sse4_2 x2apic movbe popcnt tsc_deadline_timer aes xsave avx f16c rdrand hypervisor lahf_lm abm 3dnowprefetch invpcid_single pti fsgsbase tsc_adjust bmi1 avx2 smep bmi2 erms invpcid mpx avx512f avx512dq rdseed adx smap clflushopt clwb avx512cd avx512bw avx512vl xsaveopt xsavec xgetbv1 xsaves ida arat pku ospke avx512_vnni 
2025-05-07T19:43:02.5813460Z bugs : cpu_meltdown spectre_v1 spectre_v2 spec_store_bypass l1tf mds swapgs itlb_multihit mmio_stale_data retbleed gds bhi 2025-05-07T19:43:02.5814293Z bogomips : 5999.97 2025-05-07T19:43:02.5814500Z clflush size : 64 2025-05-07T19:43:02.5814725Z cache_alignment : 64 2025-05-07T19:43:02.5814989Z address sizes : 46 bits physical, 48 bits virtual 2025-05-07T19:43:02.5815312Z power management: 2025-05-07T19:43:02.5815440Z 2025-05-07T19:43:02.5815538Z processor : 12 2025-05-07T19:43:02.5815744Z vendor_id : GenuineIntel 2025-05-07T19:43:02.5815989Z cpu family : 6 2025-05-07T19:43:02.5816185Z model : 85 2025-05-07T19:43:02.5816459Z model name : Intel(R) Xeon(R) Platinum 8275CL CPU @ 3.00GHz 2025-05-07T19:43:02.5816790Z stepping : 7 2025-05-07T19:43:02.5817005Z microcode : 0x5003901 2025-05-07T19:43:02.5817220Z cpu MHz : 2999.988 2025-05-07T19:43:02.5817440Z cache size : 36608 KB 2025-05-07T19:43:02.5817654Z physical id : 0 2025-05-07T19:43:02.5817874Z siblings : 48 2025-05-07T19:43:02.5818081Z core id : 12 2025-05-07T19:43:02.5818271Z cpu cores : 24 2025-05-07T19:43:02.5818568Z apicid : 24 2025-05-07T19:43:02.5818761Z initial apicid : 24 2025-05-07T19:43:02.5818984Z fpu : yes 2025-05-07T19:43:02.5819171Z fpu_exception : yes 2025-05-07T19:43:02.5819396Z cpuid level : 13 2025-05-07T19:43:02.5819590Z wp : yes 2025-05-07T19:43:02.5821734Z flags : fpu vme de pse tsc msr pae mce cx8 apic sep mtrr pge mca cmov pat pse36 clflush mmx fxsr sse sse2 ss ht syscall nx pdpe1gb rdtscp lm constant_tsc arch_perfmon rep_good nopl xtopology nonstop_tsc cpuid aperfmperf tsc_known_freq pni pclmulqdq monitor ssse3 fma cx16 pcid sse4_1 sse4_2 x2apic movbe popcnt tsc_deadline_timer aes xsave avx f16c rdrand hypervisor lahf_lm abm 3dnowprefetch invpcid_single pti fsgsbase tsc_adjust bmi1 avx2 smep bmi2 erms invpcid mpx avx512f avx512dq rdseed adx smap clflushopt clwb avx512cd avx512bw avx512vl xsaveopt xsavec xgetbv1 xsaves ida arat pku ospke avx512_vnni 
2025-05-07T19:43:02.5824226Z bugs : cpu_meltdown spectre_v1 spectre_v2 spec_store_bypass l1tf mds swapgs itlb_multihit mmio_stale_data retbleed gds bhi 2025-05-07T19:43:02.5824779Z bogomips : 5999.97 2025-05-07T19:43:02.5825000Z clflush size : 64 2025-05-07T19:43:02.5825208Z cache_alignment : 64 2025-05-07T19:43:02.5825488Z address sizes : 46 bits physical, 48 bits virtual 2025-05-07T19:43:02.5825811Z power management: 2025-05-07T19:43:02.5825938Z 2025-05-07T19:43:02.5826024Z processor : 13 2025-05-07T19:43:02.5826252Z vendor_id : GenuineIntel 2025-05-07T19:43:02.5826484Z cpu family : 6 2025-05-07T19:43:02.5826690Z model : 85 2025-05-07T19:43:02.5826945Z model name : Intel(R) Xeon(R) Platinum 8275CL CPU @ 3.00GHz 2025-05-07T19:43:02.5827295Z stepping : 7 2025-05-07T19:43:02.5827486Z microcode : 0x5003901 2025-05-07T19:43:02.5827730Z cpu MHz : 3189.512 2025-05-07T19:43:02.5827933Z cache size : 36608 KB 2025-05-07T19:43:02.5828160Z physical id : 0 2025-05-07T19:43:02.5828362Z siblings : 48 2025-05-07T19:43:02.5828633Z core id : 13 2025-05-07T19:43:02.5828846Z cpu cores : 24 2025-05-07T19:43:02.5829046Z apicid : 26 2025-05-07T19:43:02.5829265Z initial apicid : 26 2025-05-07T19:43:02.5829468Z fpu : yes 2025-05-07T19:43:02.5829671Z fpu_exception : yes 2025-05-07T19:43:02.5829878Z cpuid level : 13 2025-05-07T19:43:02.5830085Z wp : yes 2025-05-07T19:43:02.5832220Z flags : fpu vme de pse tsc msr pae mce cx8 apic sep mtrr pge mca cmov pat pse36 clflush mmx fxsr sse sse2 ss ht syscall nx pdpe1gb rdtscp lm constant_tsc arch_perfmon rep_good nopl xtopology nonstop_tsc cpuid aperfmperf tsc_known_freq pni pclmulqdq monitor ssse3 fma cx16 pcid sse4_1 sse4_2 x2apic movbe popcnt tsc_deadline_timer aes xsave avx f16c rdrand hypervisor lahf_lm abm 3dnowprefetch invpcid_single pti fsgsbase tsc_adjust bmi1 avx2 smep bmi2 erms invpcid mpx avx512f avx512dq rdseed adx smap clflushopt clwb avx512cd avx512bw avx512vl xsaveopt xsavec xgetbv1 xsaves ida arat pku ospke avx512_vnni 
2025-05-07T19:43:02.5835300Z bugs : cpu_meltdown spectre_v1 spectre_v2 spec_store_bypass l1tf mds swapgs itlb_multihit mmio_stale_data retbleed gds bhi 2025-05-07T19:43:02.5835928Z bogomips : 5999.97 2025-05-07T19:43:02.5836155Z clflush size : 64 2025-05-07T19:43:02.5836404Z cache_alignment : 64 2025-05-07T19:43:02.5836713Z address sizes : 46 bits physical, 48 bits virtual 2025-05-07T19:43:02.5837052Z power management: 2025-05-07T19:43:02.5837188Z 2025-05-07T19:43:02.5837294Z processor : 14 2025-05-07T19:43:02.5837527Z vendor_id : GenuineIntel 2025-05-07T19:43:02.5837815Z cpu family : 6 2025-05-07T19:43:02.5838004Z model : 85 2025-05-07T19:43:02.5838324Z model name : Intel(R) Xeon(R) Platinum 8275CL CPU @ 3.00GHz 2025-05-07T19:43:02.5838692Z stepping : 7 2025-05-07T19:43:02.5838913Z microcode : 0x5003901 2025-05-07T19:43:02.5839128Z cpu MHz : 3213.877 2025-05-07T19:43:02.5839346Z cache size : 36608 KB 2025-05-07T19:43:02.5839578Z physical id : 0 2025-05-07T19:43:02.5839777Z siblings : 48 2025-05-07T19:43:02.5839980Z core id : 14 2025-05-07T19:43:02.5840176Z cpu cores : 24 2025-05-07T19:43:02.5840388Z apicid : 28 2025-05-07T19:43:02.5840586Z initial apicid : 28 2025-05-07T19:43:02.5840878Z fpu : yes 2025-05-07T19:43:02.5841068Z fpu_exception : yes 2025-05-07T19:43:02.5841297Z cpuid level : 13 2025-05-07T19:43:02.5841498Z wp : yes 2025-05-07T19:43:02.5843806Z flags : fpu vme de pse tsc msr pae mce cx8 apic sep mtrr pge mca cmov pat pse36 clflush mmx fxsr sse sse2 ss ht syscall nx pdpe1gb rdtscp lm constant_tsc arch_perfmon rep_good nopl xtopology nonstop_tsc cpuid aperfmperf tsc_known_freq pni pclmulqdq monitor ssse3 fma cx16 pcid sse4_1 sse4_2 x2apic movbe popcnt tsc_deadline_timer aes xsave avx f16c rdrand hypervisor lahf_lm abm 3dnowprefetch invpcid_single pti fsgsbase tsc_adjust bmi1 avx2 smep bmi2 erms invpcid mpx avx512f avx512dq rdseed adx smap clflushopt clwb avx512cd avx512bw avx512vl xsaveopt xsavec xgetbv1 xsaves ida arat pku ospke avx512_vnni 
2025-05-07T19:43:02.5846624Z bugs : cpu_meltdown spectre_v1 spectre_v2 spec_store_bypass l1tf mds swapgs itlb_multihit mmio_stale_data retbleed gds bhi 2025-05-07T19:43:02.5847204Z bogomips : 5999.97 2025-05-07T19:43:02.5847424Z clflush size : 64 2025-05-07T19:43:02.5847647Z cache_alignment : 64 2025-05-07T19:43:02.5847915Z address sizes : 46 bits physical, 48 bits virtual 2025-05-07T19:43:02.5848244Z power management: 2025-05-07T19:43:02.5848372Z 2025-05-07T19:43:02.5848451Z processor : 15 2025-05-07T19:43:02.5848674Z vendor_id : GenuineIntel 2025-05-07T19:43:02.5848901Z cpu family : 6 2025-05-07T19:43:02.5849102Z model : 85 2025-05-07T19:43:02.5849368Z model name : Intel(R) Xeon(R) Platinum 8275CL CPU @ 3.00GHz 2025-05-07T19:43:02.5849721Z stepping : 7 2025-05-07T19:43:02.5849922Z microcode : 0x5003901 2025-05-07T19:43:02.5850151Z cpu MHz : 3274.880 2025-05-07T19:43:02.5850356Z cache size : 36608 KB 2025-05-07T19:43:02.5850581Z physical id : 0 2025-05-07T19:43:02.5850797Z siblings : 48 2025-05-07T19:43:02.5850991Z core id : 15 2025-05-07T19:43:02.5851190Z cpu cores : 24 2025-05-07T19:43:02.5851381Z apicid : 30 2025-05-07T19:43:02.5851672Z initial apicid : 30 2025-05-07T19:43:02.5851903Z fpu : yes 2025-05-07T19:43:02.5852102Z fpu_exception : yes 2025-05-07T19:43:02.5852326Z cpuid level : 13 2025-05-07T19:43:02.5852536Z wp : yes 2025-05-07T19:43:02.5854829Z flags : fpu vme de pse tsc msr pae mce cx8 apic sep mtrr pge mca cmov pat pse36 clflush mmx fxsr sse sse2 ss ht syscall nx pdpe1gb rdtscp lm constant_tsc arch_perfmon rep_good nopl xtopology nonstop_tsc cpuid aperfmperf tsc_known_freq pni pclmulqdq monitor ssse3 fma cx16 pcid sse4_1 sse4_2 x2apic movbe popcnt tsc_deadline_timer aes xsave avx f16c rdrand hypervisor lahf_lm abm 3dnowprefetch invpcid_single pti fsgsbase tsc_adjust bmi1 avx2 smep bmi2 erms invpcid mpx avx512f avx512dq rdseed adx smap clflushopt clwb avx512cd avx512bw avx512vl xsaveopt xsavec xgetbv1 xsaves ida arat pku ospke avx512_vnni 
2025-05-07T19:43:02.5857493Z bugs : cpu_meltdown spectre_v1 spectre_v2 spec_store_bypass l1tf mds swapgs itlb_multihit mmio_stale_data retbleed gds bhi 2025-05-07T19:43:02.5858081Z bogomips : 5999.97 2025-05-07T19:43:02.5858303Z clflush size : 64 2025-05-07T19:43:02.5858509Z cache_alignment : 64 2025-05-07T19:43:02.5858794Z address sizes : 46 bits physical, 48 bits virtual 2025-05-07T19:43:02.5859118Z power management: 2025-05-07T19:43:02.5859263Z 2025-05-07T19:43:02.5859343Z processor : 16 2025-05-07T19:43:02.5859568Z vendor_id : GenuineIntel 2025-05-07T19:43:02.5859800Z cpu family : 6 2025-05-07T19:43:02.5860011Z model : 85 2025-05-07T19:43:02.5860279Z model name : Intel(R) Xeon(R) Platinum 8275CL CPU @ 3.00GHz 2025-05-07T19:43:02.5860638Z stepping : 7 2025-05-07T19:43:02.5860835Z microcode : 0x5003901 2025-05-07T19:43:02.5861070Z cpu MHz : 3276.923 2025-05-07T19:43:02.5861286Z cache size : 36608 KB 2025-05-07T19:43:02.5861512Z physical id : 0 2025-05-07T19:43:02.5861837Z siblings : 48 2025-05-07T19:43:02.5862038Z core id : 16 2025-05-07T19:43:02.5862227Z cpu cores : 24 2025-05-07T19:43:02.5862431Z apicid : 32 2025-05-07T19:43:02.5862623Z initial apicid : 32 2025-05-07T19:43:02.5862837Z fpu : yes 2025-05-07T19:43:02.5863040Z fpu_exception : yes 2025-05-07T19:43:02.5863249Z cpuid level : 13 2025-05-07T19:43:02.5863520Z wp : yes 2025-05-07T19:43:02.5865740Z flags : fpu vme de pse tsc msr pae mce cx8 apic sep mtrr pge mca cmov pat pse36 clflush mmx fxsr sse sse2 ss ht syscall nx pdpe1gb rdtscp lm constant_tsc arch_perfmon rep_good nopl xtopology nonstop_tsc cpuid aperfmperf tsc_known_freq pni pclmulqdq monitor ssse3 fma cx16 pcid sse4_1 sse4_2 x2apic movbe popcnt tsc_deadline_timer aes xsave avx f16c rdrand hypervisor lahf_lm abm 3dnowprefetch invpcid_single pti fsgsbase tsc_adjust bmi1 avx2 smep bmi2 erms invpcid mpx avx512f avx512dq rdseed adx smap clflushopt clwb avx512cd avx512bw avx512vl xsaveopt xsavec xgetbv1 xsaves ida arat pku ospke avx512_vnni 
2025-05-07T19:43:02.5868328Z bugs : cpu_meltdown spectre_v1 spectre_v2 spec_store_bypass l1tf mds swapgs itlb_multihit mmio_stale_data retbleed gds bhi 2025-05-07T19:43:02.5868906Z bogomips : 5999.97 2025-05-07T19:43:02.5869119Z clflush size : 64 2025-05-07T19:43:02.5869335Z cache_alignment : 64 2025-05-07T19:43:02.5869603Z address sizes : 46 bits physical, 48 bits virtual 2025-05-07T19:43:02.5869938Z power management: 2025-05-07T19:43:02.5870065Z 2025-05-07T19:43:02.5870156Z processor : 17 2025-05-07T19:43:02.5870974Z vendor_id : GenuineIntel 2025-05-07T19:43:02.5871327Z cpu family : 6 2025-05-07T19:43:02.5871544Z model : 85 2025-05-07T19:43:02.5871829Z model name : Intel(R) Xeon(R) Platinum 8275CL CPU @ 3.00GHz 2025-05-07T19:43:02.5872181Z stepping : 7 2025-05-07T19:43:02.5872387Z microcode : 0x5003901 2025-05-07T19:43:02.5872603Z cpu MHz : 3296.453 2025-05-07T19:43:02.5872901Z cache size : 36608 KB 2025-05-07T19:43:02.5873125Z physical id : 0 2025-05-07T19:43:02.5873362Z siblings : 48 2025-05-07T19:43:02.5873584Z core id : 17 2025-05-07T19:43:02.5873783Z cpu cores : 24 2025-05-07T19:43:02.5874034Z apicid : 34 2025-05-07T19:43:02.5874237Z initial apicid : 34 2025-05-07T19:43:02.5874465Z fpu : yes 2025-05-07T19:43:02.5874735Z fpu_exception : yes 2025-05-07T19:43:02.5874956Z cpuid level : 13 2025-05-07T19:43:02.5875151Z wp : yes 2025-05-07T19:43:02.5877458Z flags : fpu vme de pse tsc msr pae mce cx8 apic sep mtrr pge mca cmov pat pse36 clflush mmx fxsr sse sse2 ss ht syscall nx pdpe1gb rdtscp lm constant_tsc arch_perfmon rep_good nopl xtopology nonstop_tsc cpuid aperfmperf tsc_known_freq pni pclmulqdq monitor ssse3 fma cx16 pcid sse4_1 sse4_2 x2apic movbe popcnt tsc_deadline_timer aes xsave avx f16c rdrand hypervisor lahf_lm abm 3dnowprefetch invpcid_single pti fsgsbase tsc_adjust bmi1 avx2 smep bmi2 erms invpcid mpx avx512f avx512dq rdseed adx smap clflushopt clwb avx512cd avx512bw avx512vl xsaveopt xsavec xgetbv1 xsaves ida arat pku ospke avx512_vnni 
2025-05-07T19:43:02.5880134Z bugs : cpu_meltdown spectre_v1 spectre_v2 spec_store_bypass l1tf mds swapgs itlb_multihit mmio_stale_data retbleed gds bhi 2025-05-07T19:43:02.5880719Z bogomips : 5999.97 2025-05-07T19:43:02.5880946Z clflush size : 64 2025-05-07T19:43:02.5881164Z cache_alignment : 64 2025-05-07T19:43:02.5881446Z address sizes : 46 bits physical, 48 bits virtual 2025-05-07T19:43:02.5881792Z power management: 2025-05-07T19:43:02.5881922Z 2025-05-07T19:43:02.5882008Z processor : 18 2025-05-07T19:43:02.5882230Z vendor_id : GenuineIntel 2025-05-07T19:43:02.5882461Z cpu family : 6 2025-05-07T19:43:02.5882668Z model : 85 2025-05-07T19:43:02.5882943Z model name : Intel(R) Xeon(R) Platinum 8275CL CPU @ 3.00GHz 2025-05-07T19:43:02.5883306Z stepping : 7 2025-05-07T19:43:02.5883514Z microcode : 0x5003901 2025-05-07T19:43:02.5883751Z cpu MHz : 3329.099 2025-05-07T19:43:02.5883975Z cache size : 36608 KB 2025-05-07T19:43:02.5884219Z physical id : 0 2025-05-07T19:43:02.5884424Z siblings : 48 2025-05-07T19:43:02.5884650Z core id : 18 2025-05-07T19:43:02.5884851Z cpu cores : 24 2025-05-07T19:43:02.5885052Z apicid : 36 2025-05-07T19:43:02.5885261Z initial apicid : 36 2025-05-07T19:43:02.5885587Z fpu : yes 2025-05-07T19:43:02.5885786Z fpu_exception : yes 2025-05-07T19:43:02.5885999Z cpuid level : 13 2025-05-07T19:43:02.5886206Z wp : yes 2025-05-07T19:43:02.5888438Z flags : fpu vme de pse tsc msr pae mce cx8 apic sep mtrr pge mca cmov pat pse36 clflush mmx fxsr sse sse2 ss ht syscall nx pdpe1gb rdtscp lm constant_tsc arch_perfmon rep_good nopl xtopology nonstop_tsc cpuid aperfmperf tsc_known_freq pni pclmulqdq monitor ssse3 fma cx16 pcid sse4_1 sse4_2 x2apic movbe popcnt tsc_deadline_timer aes xsave avx f16c rdrand hypervisor lahf_lm abm 3dnowprefetch invpcid_single pti fsgsbase tsc_adjust bmi1 avx2 smep bmi2 erms invpcid mpx avx512f avx512dq rdseed adx smap clflushopt clwb avx512cd avx512bw avx512vl xsaveopt xsavec xgetbv1 xsaves ida arat pku ospke avx512_vnni 
2025-05-07T19:43:02.5891196Z bugs : cpu_meltdown spectre_v1 spectre_v2 spec_store_bypass l1tf mds swapgs itlb_multihit mmio_stale_data retbleed gds bhi 2025-05-07T19:43:02.5891760Z bogomips : 5999.97 2025-05-07T19:43:02.5891956Z clflush size : 64 2025-05-07T19:43:02.5892165Z cache_alignment : 64 2025-05-07T19:43:02.5892416Z address sizes : 46 bits physical, 48 bits virtual 2025-05-07T19:43:02.5892718Z power management: 2025-05-07T19:43:02.5892844Z 2025-05-07T19:43:02.5892931Z processor : 19 2025-05-07T19:43:02.5893140Z vendor_id : GenuineIntel 2025-05-07T19:43:02.5893363Z cpu family : 6 2025-05-07T19:43:02.5893545Z model : 85 2025-05-07T19:43:02.5893802Z model name : Intel(R) Xeon(R) Platinum 8275CL CPU @ 3.00GHz 2025-05-07T19:43:02.5894124Z stepping : 7 2025-05-07T19:43:02.5894319Z microcode : 0x5003901 2025-05-07T19:43:02.5894510Z cpu MHz : 2999.988 2025-05-07T19:43:02.5894712Z cache size : 36608 KB 2025-05-07T19:43:02.5894913Z physical id : 0 2025-05-07T19:43:02.5895110Z siblings : 48 2025-05-07T19:43:02.5895303Z core id : 19 2025-05-07T19:43:02.5895480Z cpu cores : 24 2025-05-07T19:43:02.5895681Z apicid : 38 2025-05-07T19:43:02.5895876Z initial apicid : 38 2025-05-07T19:43:02.5896096Z fpu : yes 2025-05-07T19:43:02.5896274Z fpu_exception : yes 2025-05-07T19:43:02.5896482Z cpuid level : 13 2025-05-07T19:43:02.5896672Z wp : yes 2025-05-07T19:43:02.5898881Z flags : fpu vme de pse tsc msr pae mce cx8 apic sep mtrr pge mca cmov pat pse36 clflush mmx fxsr sse sse2 ss ht syscall nx pdpe1gb rdtscp lm constant_tsc arch_perfmon rep_good nopl xtopology nonstop_tsc cpuid aperfmperf tsc_known_freq pni pclmulqdq monitor ssse3 fma cx16 pcid sse4_1 sse4_2 x2apic movbe popcnt tsc_deadline_timer aes xsave avx f16c rdrand hypervisor lahf_lm abm 3dnowprefetch invpcid_single pti fsgsbase tsc_adjust bmi1 avx2 smep bmi2 erms invpcid mpx avx512f avx512dq rdseed adx smap clflushopt clwb avx512cd avx512bw avx512vl xsaveopt xsavec xgetbv1 xsaves ida arat pku ospke avx512_vnni 
2025-05-07T19:43:02.5901361Z bugs : cpu_meltdown spectre_v1 spectre_v2 spec_store_bypass l1tf mds swapgs itlb_multihit mmio_stale_data retbleed gds bhi 2025-05-07T19:43:02.5901900Z bogomips : 5999.97 2025-05-07T19:43:02.5902261Z clflush size : 64 2025-05-07T19:43:02.5902648Z cache_alignment : 64 2025-05-07T19:43:02.5902976Z address sizes : 46 bits physical, 48 bits virtual 2025-05-07T19:43:02.5903311Z power management: 2025-05-07T19:43:02.5903454Z 2025-05-07T19:43:02.5903543Z processor : 20 2025-05-07T19:43:02.5903773Z vendor_id : GenuineIntel 2025-05-07T19:43:02.5904012Z cpu family : 6 2025-05-07T19:43:02.5904231Z model : 85 2025-05-07T19:43:02.5904510Z model name : Intel(R) Xeon(R) Platinum 8275CL CPU @ 3.00GHz 2025-05-07T19:43:02.5904862Z stepping : 7 2025-05-07T19:43:02.5905092Z microcode : 0x5003901 2025-05-07T19:43:02.5905347Z cpu MHz : 3258.809 2025-05-07T19:43:02.5905572Z cache size : 36608 KB 2025-05-07T19:43:02.5905843Z physical id : 0 2025-05-07T19:43:02.5906090Z siblings : 48 2025-05-07T19:43:02.5906300Z core id : 20 2025-05-07T19:43:02.5906530Z cpu cores : 24 2025-05-07T19:43:02.5906736Z apicid : 40 2025-05-07T19:43:02.5906980Z initial apicid : 40 2025-05-07T19:43:02.5907213Z fpu : yes 2025-05-07T19:43:02.5907447Z fpu_exception : yes 2025-05-07T19:43:02.5907679Z cpuid level : 13 2025-05-07T19:43:02.5907931Z wp : yes 2025-05-07T19:43:02.5910254Z flags : fpu vme de pse tsc msr pae mce cx8 apic sep mtrr pge mca cmov pat pse36 clflush mmx fxsr sse sse2 ss ht syscall nx pdpe1gb rdtscp lm constant_tsc arch_perfmon rep_good nopl xtopology nonstop_tsc cpuid aperfmperf tsc_known_freq pni pclmulqdq monitor ssse3 fma cx16 pcid sse4_1 sse4_2 x2apic movbe popcnt tsc_deadline_timer aes xsave avx f16c rdrand hypervisor lahf_lm abm 3dnowprefetch invpcid_single pti fsgsbase tsc_adjust bmi1 avx2 smep bmi2 erms invpcid mpx avx512f avx512dq rdseed adx smap clflushopt clwb avx512cd avx512bw avx512vl xsaveopt xsavec xgetbv1 xsaves ida arat pku ospke avx512_vnni 
2025-05-07T19:43:02.5913138Z bugs : cpu_meltdown spectre_v1 spectre_v2 spec_store_bypass l1tf mds swapgs itlb_multihit mmio_stale_data retbleed gds bhi 2025-05-07T19:43:02.5913754Z bogomips : 5999.97 2025-05-07T19:43:02.5914007Z clflush size : 64 2025-05-07T19:43:02.5914262Z cache_alignment : 64 2025-05-07T19:43:02.5914548Z address sizes : 46 bits physical, 48 bits virtual 2025-05-07T19:43:02.5914916Z power management: 2025-05-07T19:43:02.5915066Z 2025-05-07T19:43:02.5915191Z processor : 21 2025-05-07T19:43:02.5915427Z vendor_id : GenuineIntel 2025-05-07T19:43:02.5915710Z cpu family : 6 2025-05-07T19:43:02.5915922Z model : 85 2025-05-07T19:43:02.5916243Z model name : Intel(R) Xeon(R) Platinum 8275CL CPU @ 3.00GHz 2025-05-07T19:43:02.5916611Z stepping : 7 2025-05-07T19:43:02.5916848Z microcode : 0x5003901 2025-05-07T19:43:02.5917078Z cpu MHz : 2999.988 2025-05-07T19:43:02.5917321Z cache size : 36608 KB 2025-05-07T19:43:02.5917570Z physical id : 0 2025-05-07T19:43:02.5917824Z siblings : 48 2025-05-07T19:43:02.5918055Z core id : 21 2025-05-07T19:43:02.5918268Z cpu cores : 24 2025-05-07T19:43:02.5918513Z apicid : 42 2025-05-07T19:43:02.5918735Z initial apicid : 42 2025-05-07T19:43:02.5918985Z fpu : yes 2025-05-07T19:43:02.5919198Z fpu_exception : yes 2025-05-07T19:43:02.5919456Z cpuid level : 13 2025-05-07T19:43:02.5919681Z wp : yes 2025-05-07T19:43:02.5922106Z flags : fpu vme de pse tsc msr pae mce cx8 apic sep mtrr pge mca cmov pat pse36 clflush mmx fxsr sse sse2 ss ht syscall nx pdpe1gb rdtscp lm constant_tsc arch_perfmon rep_good nopl xtopology nonstop_tsc cpuid aperfmperf tsc_known_freq pni pclmulqdq monitor ssse3 fma cx16 pcid sse4_1 sse4_2 x2apic movbe popcnt tsc_deadline_timer aes xsave avx f16c rdrand hypervisor lahf_lm abm 3dnowprefetch invpcid_single pti fsgsbase tsc_adjust bmi1 avx2 smep bmi2 erms invpcid mpx avx512f avx512dq rdseed adx smap clflushopt clwb avx512cd avx512bw avx512vl xsaveopt xsavec xgetbv1 xsaves ida arat pku ospke avx512_vnni 
2025-05-07T19:43:02.5924822Z bugs : cpu_meltdown spectre_v1 spectre_v2 spec_store_bypass l1tf mds swapgs itlb_multihit mmio_stale_data retbleed gds bhi 2025-05-07T19:43:02.5925519Z bogomips : 5999.97 2025-05-07T19:43:02.5925751Z clflush size : 64 2025-05-07T19:43:02.5925974Z cache_alignment : 64 2025-05-07T19:43:02.5926269Z address sizes : 46 bits physical, 48 bits virtual 2025-05-07T19:43:02.5926595Z power management: 2025-05-07T19:43:02.5926729Z 2025-05-07T19:43:02.5926821Z processor : 22 2025-05-07T19:43:02.5927068Z vendor_id : GenuineIntel 2025-05-07T19:43:02.5927297Z cpu family : 6 2025-05-07T19:43:02.5927514Z model : 85 2025-05-07T19:43:02.5927775Z model name : Intel(R) Xeon(R) Platinum 8275CL CPU @ 3.00GHz 2025-05-07T19:43:02.5928119Z stepping : 7 2025-05-07T19:43:02.5928317Z microcode : 0x5003901 2025-05-07T19:43:02.5928541Z cpu MHz : 2999.988 2025-05-07T19:43:02.5928760Z cache size : 36608 KB 2025-05-07T19:43:02.5929008Z physical id : 0 2025-05-07T19:43:02.5929232Z siblings : 48 2025-05-07T19:43:02.5929421Z core id : 22 2025-05-07T19:43:02.5929642Z cpu cores : 24 2025-05-07T19:43:02.5929839Z apicid : 44 2025-05-07T19:43:02.5930045Z initial apicid : 44 2025-05-07T19:43:02.5930252Z fpu : yes 2025-05-07T19:43:02.5930467Z fpu_exception : yes 2025-05-07T19:43:02.5930690Z cpuid level : 13 2025-05-07T19:43:02.5930925Z wp : yes 2025-05-07T19:43:02.5933054Z flags : fpu vme de pse tsc msr pae mce cx8 apic sep mtrr pge mca cmov pat pse36 clflush mmx fxsr sse sse2 ss ht syscall nx pdpe1gb rdtscp lm constant_tsc arch_perfmon rep_good nopl xtopology nonstop_tsc cpuid aperfmperf tsc_known_freq pni pclmulqdq monitor ssse3 fma cx16 pcid sse4_1 sse4_2 x2apic movbe popcnt tsc_deadline_timer aes xsave avx f16c rdrand hypervisor lahf_lm abm 3dnowprefetch invpcid_single pti fsgsbase tsc_adjust bmi1 avx2 smep bmi2 erms invpcid mpx avx512f avx512dq rdseed adx smap clflushopt clwb avx512cd avx512bw avx512vl xsaveopt xsavec xgetbv1 xsaves ida arat pku ospke avx512_vnni 
2025-05-07T19:43:02.5935606Z bugs : cpu_meltdown spectre_v1 spectre_v2 spec_store_bypass l1tf mds swapgs itlb_multihit mmio_stale_data retbleed gds bhi 2025-05-07T19:43:02.5936201Z bogomips : 5999.97 2025-05-07T19:43:02.5936430Z clflush size : 64 2025-05-07T19:43:02.5936676Z cache_alignment : 64 2025-05-07T19:43:02.5936976Z address sizes : 46 bits physical, 48 bits virtual 2025-05-07T19:43:02.5937280Z power management: 2025-05-07T19:43:02.5937414Z 2025-05-07T19:43:02.5937531Z processor : 23 2025-05-07T19:43:02.5937740Z vendor_id : GenuineIntel 2025-05-07T19:43:02.5938002Z cpu family : 6 2025-05-07T19:43:02.5938209Z model : 85 2025-05-07T19:43:02.5938518Z model name : Intel(R) Xeon(R) Platinum 8275CL CPU @ 3.00GHz 2025-05-07T19:43:02.5938883Z stepping : 7 2025-05-07T19:43:02.5939116Z microcode : 0x5003901 2025-05-07T19:43:02.5939352Z cpu MHz : 3323.806 2025-05-07T19:43:02.5939612Z cache size : 36608 KB 2025-05-07T19:43:02.5939847Z physical id : 0 2025-05-07T19:43:02.5940066Z siblings : 48 2025-05-07T19:43:02.5940275Z core id : 23 2025-05-07T19:43:02.5940477Z cpu cores : 24 2025-05-07T19:43:02.5940718Z apicid : 46 2025-05-07T19:43:02.5940934Z initial apicid : 46 2025-05-07T19:43:02.5941153Z fpu : yes 2025-05-07T19:43:02.5941341Z fpu_exception : yes 2025-05-07T19:43:02.5941542Z cpuid level : 13 2025-05-07T19:43:02.5941728Z wp : yes 2025-05-07T19:43:02.5943935Z flags : fpu vme de pse tsc msr pae mce cx8 apic sep mtrr pge mca cmov pat pse36 clflush mmx fxsr sse sse2 ss ht syscall nx pdpe1gb rdtscp lm constant_tsc arch_perfmon rep_good nopl xtopology nonstop_tsc cpuid aperfmperf tsc_known_freq pni pclmulqdq monitor ssse3 fma cx16 pcid sse4_1 sse4_2 x2apic movbe popcnt tsc_deadline_timer aes xsave avx f16c rdrand hypervisor lahf_lm abm 3dnowprefetch invpcid_single pti fsgsbase tsc_adjust bmi1 avx2 smep bmi2 erms invpcid mpx avx512f avx512dq rdseed adx smap clflushopt clwb avx512cd avx512bw avx512vl xsaveopt xsavec xgetbv1 xsaves ida arat pku ospke avx512_vnni 
2025-05-07T19:43:02.5946576Z bugs : cpu_meltdown spectre_v1 spectre_v2 spec_store_bypass l1tf mds swapgs itlb_multihit mmio_stale_data retbleed gds bhi 2025-05-07T19:43:02.5947115Z bogomips : 5999.97 2025-05-07T19:43:02.5947327Z clflush size : 64 2025-05-07T19:43:02.5947555Z cache_alignment : 64 2025-05-07T19:43:02.5947829Z address sizes : 46 bits physical, 48 bits virtual 2025-05-07T19:43:02.5948129Z power management: 2025-05-07T19:43:02.5948250Z 2025-05-07T19:43:02.5948324Z processor : 24 2025-05-07T19:43:02.5948539Z vendor_id : GenuineIntel 2025-05-07T19:43:02.5948751Z cpu family : 6 2025-05-07T19:43:02.5948934Z model : 85 2025-05-07T19:43:02.5949182Z model name : Intel(R) Xeon(R) Platinum 8275CL CPU @ 3.00GHz 2025-05-07T19:43:02.5949537Z stepping : 7 2025-05-07T19:43:02.5949745Z microcode : 0x5003901 2025-05-07T19:43:02.5949993Z cpu MHz : 2999.988 2025-05-07T19:43:02.5950209Z cache size : 36608 KB 2025-05-07T19:43:02.5950449Z physical id : 1 2025-05-07T19:43:02.5950686Z siblings : 48 2025-05-07T19:43:02.5950892Z core id : 0 2025-05-07T19:43:02.5951113Z cpu cores : 24 2025-05-07T19:43:02.5951303Z apicid : 64 2025-05-07T19:43:02.5951506Z initial apicid : 64 2025-05-07T19:43:02.5951709Z fpu : yes 2025-05-07T19:43:02.5951918Z fpu_exception : yes 2025-05-07T19:43:02.5952124Z cpuid level : 13 2025-05-07T19:43:02.5952334Z wp : yes 2025-05-07T19:43:02.5954835Z flags : fpu vme de pse tsc msr pae mce cx8 apic sep mtrr pge mca cmov pat pse36 clflush mmx fxsr sse sse2 ss ht syscall nx pdpe1gb rdtscp lm constant_tsc arch_perfmon rep_good nopl xtopology nonstop_tsc cpuid aperfmperf tsc_known_freq pni pclmulqdq monitor ssse3 fma cx16 pcid sse4_1 sse4_2 x2apic movbe popcnt tsc_deadline_timer aes xsave avx f16c rdrand hypervisor lahf_lm abm 3dnowprefetch invpcid_single pti fsgsbase tsc_adjust bmi1 avx2 smep bmi2 erms invpcid mpx avx512f avx512dq rdseed adx smap clflushopt clwb avx512cd avx512bw avx512vl xsaveopt xsavec xgetbv1 xsaves ida arat pku ospke avx512_vnni 
2025-05-07T19:43:02.5960080Z bugs : cpu_meltdown spectre_v1 spectre_v2 spec_store_bypass l1tf mds swapgs itlb_multihit mmio_stale_data retbleed gds bhi 2025-05-07T19:43:02.5960687Z bogomips : 5999.97 2025-05-07T19:43:02.5960979Z clflush size : 64 2025-05-07T19:43:02.5961251Z cache_alignment : 64 2025-05-07T19:43:02.5961567Z address sizes : 46 bits physical, 48 bits virtual 2025-05-07T19:43:02.5961918Z power management: 2025-05-07T19:43:02.5962062Z 2025-05-07T19:43:02.5962188Z processor : 25 2025-05-07T19:43:02.5962430Z vendor_id : GenuineIntel 2025-05-07T19:43:02.5962707Z cpu family : 6 2025-05-07T19:43:02.5962929Z model : 85 2025-05-07T19:43:02.5963256Z model name : Intel(R) Xeon(R) Platinum 8275CL CPU @ 3.00GHz 2025-05-07T19:43:02.5963632Z stepping : 7 2025-05-07T19:43:02.5963881Z microcode : 0x5003901 2025-05-07T19:43:02.5964110Z cpu MHz : 2999.988 2025-05-07T19:43:02.5964346Z cache size : 36608 KB 2025-05-07T19:43:02.5964615Z physical id : 1 2025-05-07T19:43:02.5964838Z siblings : 48 2025-05-07T19:43:02.5965069Z core id : 1 2025-05-07T19:43:02.5965267Z cpu cores : 24 2025-05-07T19:43:02.5965610Z apicid : 66 2025-05-07T19:43:02.5965817Z initial apicid : 66 2025-05-07T19:43:02.5966057Z fpu : yes 2025-05-07T19:43:02.5966259Z fpu_exception : yes 2025-05-07T19:43:02.5966492Z cpuid level : 13 2025-05-07T19:43:02.5966690Z wp : yes 2025-05-07T19:43:02.5968902Z flags : fpu vme de pse tsc msr pae mce cx8 apic sep mtrr pge mca cmov pat pse36 clflush mmx fxsr sse sse2 ss ht syscall nx pdpe1gb rdtscp lm constant_tsc arch_perfmon rep_good nopl xtopology nonstop_tsc cpuid aperfmperf tsc_known_freq pni pclmulqdq monitor ssse3 fma cx16 pcid sse4_1 sse4_2 x2apic movbe popcnt tsc_deadline_timer aes xsave avx f16c rdrand hypervisor lahf_lm abm 3dnowprefetch invpcid_single pti fsgsbase tsc_adjust bmi1 avx2 smep bmi2 erms invpcid mpx avx512f avx512dq rdseed adx smap clflushopt clwb avx512cd avx512bw avx512vl xsaveopt xsavec xgetbv1 xsaves ida arat pku ospke avx512_vnni 
2025-05-07T19:43:02.5971402Z bugs : cpu_meltdown spectre_v1 spectre_v2 spec_store_bypass l1tf mds swapgs itlb_multihit mmio_stale_data retbleed gds bhi 2025-05-07T19:43:02.5971956Z bogomips : 5999.97 2025-05-07T19:43:02.5972187Z clflush size : 64 2025-05-07T19:43:02.5972391Z cache_alignment : 64 2025-05-07T19:43:02.5972686Z address sizes : 46 bits physical, 48 bits virtual 2025-05-07T19:43:02.5973026Z power management: 2025-05-07T19:43:02.5973159Z 2025-05-07T19:43:02.5973247Z processor : 26 2025-05-07T19:43:02.5973488Z vendor_id : GenuineIntel 2025-05-07T19:43:02.5973729Z cpu family : 6 2025-05-07T19:43:02.5973957Z model : 85 2025-05-07T19:43:02.5974216Z model name : Intel(R) Xeon(R) Platinum 8275CL CPU @ 3.00GHz 2025-05-07T19:43:02.5974555Z stepping : 7 2025-05-07T19:43:02.5974747Z microcode : 0x5003901 2025-05-07T19:43:02.5974981Z cpu MHz : 2999.988 2025-05-07T19:43:02.5975204Z cache size : 36608 KB 2025-05-07T19:43:02.5975439Z physical id : 1 2025-05-07T19:43:02.5975674Z siblings : 48 2025-05-07T19:43:02.5975872Z core id : 2 2025-05-07T19:43:02.5976075Z cpu cores : 24 2025-05-07T19:43:02.5976270Z apicid : 68 2025-05-07T19:43:02.5976482Z initial apicid : 68 2025-05-07T19:43:02.5976701Z fpu : yes 2025-05-07T19:43:02.5976929Z fpu_exception : yes 2025-05-07T19:43:02.5977157Z cpuid level : 13 2025-05-07T19:43:02.5977385Z wp : yes 2025-05-07T19:43:02.5979515Z flags : fpu vme de pse tsc msr pae mce cx8 apic sep mtrr pge mca cmov pat pse36 clflush mmx fxsr sse sse2 ss ht syscall nx pdpe1gb rdtscp lm constant_tsc arch_perfmon rep_good nopl xtopology nonstop_tsc cpuid aperfmperf tsc_known_freq pni pclmulqdq monitor ssse3 fma cx16 pcid sse4_1 sse4_2 x2apic movbe popcnt tsc_deadline_timer aes xsave avx f16c rdrand hypervisor lahf_lm abm 3dnowprefetch invpcid_single pti fsgsbase tsc_adjust bmi1 avx2 smep bmi2 erms invpcid mpx avx512f avx512dq rdseed adx smap clflushopt clwb avx512cd avx512bw avx512vl xsaveopt xsavec xgetbv1 xsaves ida arat pku ospke avx512_vnni 
2025-05-07T19:43:02.5982057Z bugs : cpu_meltdown spectre_v1 spectre_v2 spec_store_bypass l1tf mds swapgs itlb_multihit mmio_stale_data retbleed gds bhi 2025-05-07T19:43:02.5982620Z bogomips : 5999.97 2025-05-07T19:43:02.5982823Z clflush size : 64 2025-05-07T19:43:02.5983047Z cache_alignment : 64 2025-05-07T19:43:02.5983324Z address sizes : 46 bits physical, 48 bits virtual 2025-05-07T19:43:02.5983633Z power management: 2025-05-07T19:43:02.5983761Z 2025-05-07T19:43:02.5983859Z processor : 27 2025-05-07T19:43:02.5984063Z vendor_id : GenuineIntel 2025-05-07T19:43:02.5984300Z cpu family : 6 2025-05-07T19:43:02.5984491Z model : 85 2025-05-07T19:43:02.5984766Z model name : Intel(R) Xeon(R) Platinum 8275CL CPU @ 3.00GHz 2025-05-07T19:43:02.5985098Z stepping : 7 2025-05-07T19:43:02.5985308Z microcode : 0x5003901 2025-05-07T19:43:02.5985522Z cpu MHz : 2999.988 2025-05-07T19:43:02.5985747Z cache size : 36608 KB 2025-05-07T19:43:02.5985977Z physical id : 1 2025-05-07T19:43:02.5986173Z siblings : 48 2025-05-07T19:43:02.5986377Z core id : 3 2025-05-07T19:43:02.5986563Z cpu cores : 24 2025-05-07T19:43:02.5986772Z apicid : 70 2025-05-07T19:43:02.5986963Z initial apicid : 70 2025-05-07T19:43:02.5987188Z fpu : yes 2025-05-07T19:43:02.5987373Z fpu_exception : yes 2025-05-07T19:43:02.5987596Z cpuid level : 13 2025-05-07T19:43:02.5987790Z wp : yes 2025-05-07T19:43:02.5989992Z flags : fpu vme de pse tsc msr pae mce cx8 apic sep mtrr pge mca cmov pat pse36 clflush mmx fxsr sse sse2 ss ht syscall nx pdpe1gb rdtscp lm constant_tsc arch_perfmon rep_good nopl xtopology nonstop_tsc cpuid aperfmperf tsc_known_freq pni pclmulqdq monitor ssse3 fma cx16 pcid sse4_1 sse4_2 x2apic movbe popcnt tsc_deadline_timer aes xsave avx f16c rdrand hypervisor lahf_lm abm 3dnowprefetch invpcid_single pti fsgsbase tsc_adjust bmi1 avx2 smep bmi2 erms invpcid mpx avx512f avx512dq rdseed adx smap clflushopt clwb avx512cd avx512bw avx512vl xsaveopt xsavec xgetbv1 xsaves ida arat pku ospke avx512_vnni 
2025-05-07T19:43:02.5992464Z bugs : cpu_meltdown spectre_v1 spectre_v2 spec_store_bypass l1tf mds swapgs itlb_multihit mmio_stale_data retbleed gds bhi 2025-05-07T19:43:02.5993101Z bogomips : 5999.97 2025-05-07T19:43:02.5993519Z clflush size : 64 2025-05-07T19:43:02.5993758Z cache_alignment : 64 2025-05-07T19:43:02.5994078Z address sizes : 46 bits physical, 48 bits virtual 2025-05-07T19:43:02.5994422Z power management: 2025-05-07T19:43:02.5994557Z 2025-05-07T19:43:02.5994642Z processor : 28 2025-05-07T19:43:02.5994876Z vendor_id : GenuineIntel 2025-05-07T19:43:02.5995118Z cpu family : 6 2025-05-07T19:43:02.5995336Z model : 85 2025-05-07T19:43:02.5995611Z model name : Intel(R) Xeon(R) Platinum 8275CL CPU @ 3.00GHz 2025-05-07T19:43:02.5995981Z stepping : 7 2025-05-07T19:43:02.5996193Z microcode : 0x5003901 2025-05-07T19:43:02.5996433Z cpu MHz : 2999.988 2025-05-07T19:43:02.5996648Z cache size : 36608 KB 2025-05-07T19:43:02.5996890Z physical id : 1 2025-05-07T19:43:02.5997121Z siblings : 48 2025-05-07T19:43:02.5997321Z core id : 4 2025-05-07T19:43:02.5997537Z cpu cores : 24 2025-05-07T19:43:02.5997743Z apicid : 72 2025-05-07T19:43:02.5997962Z initial apicid : 72 2025-05-07T19:43:02.5998180Z fpu : yes 2025-05-07T19:43:02.5998401Z fpu_exception : yes 2025-05-07T19:43:02.5998615Z cpuid level : 13 2025-05-07T19:43:02.5998837Z wp : yes 2025-05-07T19:43:02.6001135Z flags : fpu vme de pse tsc msr pae mce cx8 apic sep mtrr pge mca cmov pat pse36 clflush mmx fxsr sse sse2 ss ht syscall nx pdpe1gb rdtscp lm constant_tsc arch_perfmon rep_good nopl xtopology nonstop_tsc cpuid aperfmperf tsc_known_freq pni pclmulqdq monitor ssse3 fma cx16 pcid sse4_1 sse4_2 x2apic movbe popcnt tsc_deadline_timer aes xsave avx f16c rdrand hypervisor lahf_lm abm 3dnowprefetch invpcid_single pti fsgsbase tsc_adjust bmi1 avx2 smep bmi2 erms invpcid mpx avx512f avx512dq rdseed adx smap clflushopt clwb avx512cd avx512bw avx512vl xsaveopt xsavec xgetbv1 xsaves ida arat pku ospke avx512_vnni 
2025-05-07T19:43:02.6004052Z bugs : cpu_meltdown spectre_v1 spectre_v2 spec_store_bypass l1tf mds swapgs itlb_multihit mmio_stale_data retbleed gds bhi 2025-05-07T19:43:02.6004663Z bogomips : 5999.97 2025-05-07T19:43:02.6004885Z clflush size : 64 2025-05-07T19:43:02.6005127Z cache_alignment : 64 2025-05-07T19:43:02.6005428Z address sizes : 46 bits physical, 48 bits virtual 2025-05-07T19:43:02.6005758Z power management: 2025-05-07T19:43:02.6005897Z 2025-05-07T19:43:02.6006001Z processor : 29 2025-05-07T19:43:02.6006221Z vendor_id : GenuineIntel 2025-05-07T19:43:02.6006479Z cpu family : 6 2025-05-07T19:43:02.6006685Z model : 85 2025-05-07T19:43:02.6006976Z model name : Intel(R) Xeon(R) Platinum 8275CL CPU @ 3.00GHz 2025-05-07T19:43:02.6007327Z stepping : 7 2025-05-07T19:43:02.6007555Z microcode : 0x5003901 2025-05-07T19:43:02.6007787Z cpu MHz : 2999.988 2025-05-07T19:43:02.6008017Z cache size : 36608 KB 2025-05-07T19:43:02.6008260Z physical id : 1 2025-05-07T19:43:02.6008473Z siblings : 48 2025-05-07T19:43:02.6008688Z core id : 5 2025-05-07T19:43:02.6008885Z cpu cores : 24 2025-05-07T19:43:02.6009103Z apicid : 74 2025-05-07T19:43:02.6009306Z initial apicid : 74 2025-05-07T19:43:02.6009536Z fpu : yes 2025-05-07T19:43:02.6009737Z fpu_exception : yes 2025-05-07T19:43:02.6009969Z cpuid level : 13 2025-05-07T19:43:02.6010179Z wp : yes 2025-05-07T19:43:02.6012642Z flags : fpu vme de pse tsc msr pae mce cx8 apic sep mtrr pge mca cmov pat pse36 clflush mmx fxsr sse sse2 ss ht syscall nx pdpe1gb rdtscp lm constant_tsc arch_perfmon rep_good nopl xtopology nonstop_tsc cpuid aperfmperf tsc_known_freq pni pclmulqdq monitor ssse3 fma cx16 pcid sse4_1 sse4_2 x2apic movbe popcnt tsc_deadline_timer aes xsave avx f16c rdrand hypervisor lahf_lm abm 3dnowprefetch invpcid_single pti fsgsbase tsc_adjust bmi1 avx2 smep bmi2 erms invpcid mpx avx512f avx512dq rdseed adx smap clflushopt clwb avx512cd avx512bw avx512vl xsaveopt xsavec xgetbv1 xsaves ida arat pku ospke avx512_vnni 
2025-05-07T19:43:02.6015437Z bugs : cpu_meltdown spectre_v1 spectre_v2 spec_store_bypass l1tf mds swapgs itlb_multihit mmio_stale_data retbleed gds bhi 2025-05-07T19:43:02.6015987Z bogomips : 5999.97 2025-05-07T19:43:02.6016211Z clflush size : 64 2025-05-07T19:43:02.6016433Z cache_alignment : 64 2025-05-07T19:43:02.6016691Z address sizes : 46 bits physical, 48 bits virtual 2025-05-07T19:43:02.6017013Z power management: 2025-05-07T19:43:02.6017139Z 2025-05-07T19:43:02.6017218Z processor : 30 2025-05-07T19:43:02.6017432Z vendor_id : GenuineIntel 2025-05-07T19:43:02.6017656Z cpu family : 6 2025-05-07T19:43:02.6017859Z model : 85 2025-05-07T19:43:02.6018116Z model name : Intel(R) Xeon(R) Platinum 8275CL CPU @ 3.00GHz 2025-05-07T19:43:02.6018456Z stepping : 7 2025-05-07T19:43:02.6018650Z microcode : 0x5003901 2025-05-07T19:43:02.6018875Z cpu MHz : 2999.988 2025-05-07T19:43:02.6019080Z cache size : 36608 KB 2025-05-07T19:43:02.6019308Z physical id : 1 2025-05-07T19:43:02.6019521Z siblings : 48 2025-05-07T19:43:02.6019720Z core id : 6 2025-05-07T19:43:02.6019931Z cpu cores : 24 2025-05-07T19:43:02.6020126Z apicid : 76 2025-05-07T19:43:02.6020333Z initial apicid : 76 2025-05-07T19:43:02.6020558Z fpu : yes 2025-05-07T19:43:02.6020755Z fpu_exception : yes 2025-05-07T19:43:02.6020958Z cpuid level : 13 2025-05-07T19:43:02.6021170Z wp : yes 2025-05-07T19:43:02.6023295Z flags : fpu vme de pse tsc msr pae mce cx8 apic sep mtrr pge mca cmov pat pse36 clflush mmx fxsr sse sse2 ss ht syscall nx pdpe1gb rdtscp lm constant_tsc arch_perfmon rep_good nopl xtopology nonstop_tsc cpuid aperfmperf tsc_known_freq pni pclmulqdq monitor ssse3 fma cx16 pcid sse4_1 sse4_2 x2apic movbe popcnt tsc_deadline_timer aes xsave avx f16c rdrand hypervisor lahf_lm abm 3dnowprefetch invpcid_single pti fsgsbase tsc_adjust bmi1 avx2 smep bmi2 erms invpcid mpx avx512f avx512dq rdseed adx smap clflushopt clwb avx512cd avx512bw avx512vl xsaveopt xsavec xgetbv1 xsaves ida arat pku ospke avx512_vnni 
2025-05-07T19:43:02.6025795Z bugs : cpu_meltdown spectre_v1 spectre_v2 spec_store_bypass l1tf mds swapgs itlb_multihit mmio_stale_data retbleed gds bhi 2025-05-07T19:43:02.6026478Z bogomips : 5999.97 2025-05-07T19:43:02.6026687Z clflush size : 64 2025-05-07T19:43:02.6026918Z cache_alignment : 64 2025-05-07T19:43:02.6027197Z address sizes : 46 bits physical, 48 bits virtual 2025-05-07T19:43:02.6027505Z power management: 2025-05-07T19:43:02.6027634Z 2025-05-07T19:43:02.6027735Z processor : 31 2025-05-07T19:43:02.6027945Z vendor_id : GenuineIntel 2025-05-07T19:43:02.6028190Z cpu family : 6 2025-05-07T19:43:02.6028386Z model : 85 2025-05-07T19:43:02.6028663Z model name : Intel(R) Xeon(R) Platinum 8275CL CPU @ 3.00GHz 2025-05-07T19:43:02.6029000Z stepping : 7 2025-05-07T19:43:02.6029223Z microcode : 0x5003901 2025-05-07T19:43:02.6029443Z cpu MHz : 2999.988 2025-05-07T19:43:02.6029674Z cache size : 36608 KB 2025-05-07T19:43:02.6029910Z physical id : 1 2025-05-07T19:43:02.6030119Z siblings : 48 2025-05-07T19:43:02.6030332Z core id : 7 2025-05-07T19:43:02.6030531Z cpu cores : 24 2025-05-07T19:43:02.6030744Z apicid : 78 2025-05-07T19:43:02.6030941Z initial apicid : 78 2025-05-07T19:43:02.6031169Z fpu : yes 2025-05-07T19:43:02.6031360Z fpu_exception : yes 2025-05-07T19:43:02.6031593Z cpuid level : 13 2025-05-07T19:43:02.6031795Z wp : yes 2025-05-07T19:43:02.6034331Z flags : fpu vme de pse tsc msr pae mce cx8 apic sep mtrr pge mca cmov pat pse36 clflush mmx fxsr sse sse2 ss ht syscall nx pdpe1gb rdtscp lm constant_tsc arch_perfmon rep_good nopl xtopology nonstop_tsc cpuid aperfmperf tsc_known_freq pni pclmulqdq monitor ssse3 fma cx16 pcid sse4_1 sse4_2 x2apic movbe popcnt tsc_deadline_timer aes xsave avx f16c rdrand hypervisor lahf_lm abm 3dnowprefetch invpcid_single pti fsgsbase tsc_adjust bmi1 avx2 smep bmi2 erms invpcid mpx avx512f avx512dq rdseed adx smap clflushopt clwb avx512cd avx512bw avx512vl xsaveopt xsavec xgetbv1 xsaves ida arat pku ospke avx512_vnni 
2025-05-07T19:43:02.6037010Z bugs : cpu_meltdown spectre_v1 spectre_v2 spec_store_bypass l1tf mds swapgs itlb_multihit mmio_stale_data retbleed gds bhi 2025-05-07T19:43:02.6037610Z bogomips : 5999.97 2025-05-07T19:43:02.6037845Z clflush size : 64 2025-05-07T19:43:02.6038087Z cache_alignment : 64 2025-05-07T19:43:02.6038363Z address sizes : 46 bits physical, 48 bits virtual 2025-05-07T19:43:02.6038707Z power management: 2025-05-07T19:43:02.6038841Z 2025-05-07T19:43:02.6038925Z processor : 32 2025-05-07T19:43:02.6039156Z vendor_id : GenuineIntel 2025-05-07T19:43:02.6039396Z cpu family : 6 2025-05-07T19:43:02.6039616Z model : 85 2025-05-07T19:43:02.6039897Z model name : Intel(R) Xeon(R) Platinum 8275CL CPU @ 3.00GHz 2025-05-07T19:43:02.6040261Z stepping : 7 2025-05-07T19:43:02.6040470Z microcode : 0x5003901 2025-05-07T19:43:02.6040717Z cpu MHz : 2999.988 2025-05-07T19:43:02.6040934Z cache size : 36608 KB 2025-05-07T19:43:02.6041187Z physical id : 1 2025-05-07T19:43:02.6041417Z siblings : 48 2025-05-07T19:43:02.6041628Z core id : 8 2025-05-07T19:43:02.6041841Z cpu cores : 24 2025-05-07T19:43:02.6042046Z apicid : 80 2025-05-07T19:43:02.6042274Z initial apicid : 80 2025-05-07T19:43:02.6042489Z fpu : yes 2025-05-07T19:43:02.6042705Z fpu_exception : yes 2025-05-07T19:43:02.6042924Z cpuid level : 13 2025-05-07T19:43:02.6043146Z wp : yes 2025-05-07T19:43:02.6045554Z flags : fpu vme de pse tsc msr pae mce cx8 apic sep mtrr pge mca cmov pat pse36 clflush mmx fxsr sse sse2 ss ht syscall nx pdpe1gb rdtscp lm constant_tsc arch_perfmon rep_good nopl xtopology nonstop_tsc cpuid aperfmperf tsc_known_freq pni pclmulqdq monitor ssse3 fma cx16 pcid sse4_1 sse4_2 x2apic movbe popcnt tsc_deadline_timer aes xsave avx f16c rdrand hypervisor lahf_lm abm 3dnowprefetch invpcid_single pti fsgsbase tsc_adjust bmi1 avx2 smep bmi2 erms invpcid mpx avx512f avx512dq rdseed adx smap clflushopt clwb avx512cd avx512bw avx512vl xsaveopt xsavec xgetbv1 xsaves ida arat pku ospke avx512_vnni 
2025-05-07T19:43:02.6048180Z bugs : cpu_meltdown spectre_v1 spectre_v2 spec_store_bypass l1tf mds swapgs itlb_multihit mmio_stale_data retbleed gds bhi 2025-05-07T19:43:02.6048797Z bogomips : 5999.97 2025-05-07T19:43:02.6049001Z clflush size : 64 2025-05-07T19:43:02.6049219Z cache_alignment : 64 2025-05-07T19:43:02.6049491Z address sizes : 46 bits physical, 48 bits virtual 2025-05-07T19:43:02.6049794Z power management: 2025-05-07T19:43:02.6049921Z 2025-05-07T19:43:02.6050021Z processor : 33 2025-05-07T19:43:02.6050225Z vendor_id : GenuineIntel 2025-05-07T19:43:02.6050469Z cpu family : 6 2025-05-07T19:43:02.6050741Z model : 85 2025-05-07T19:43:02.6051012Z model name : Intel(R) Xeon(R) Platinum 8275CL CPU @ 3.00GHz 2025-05-07T19:43:02.6051341Z stepping : 7 2025-05-07T19:43:02.6051551Z microcode : 0x5003901 2025-05-07T19:43:02.6051765Z cpu MHz : 2999.988 2025-05-07T19:43:02.6051981Z cache size : 36608 KB 2025-05-07T19:43:02.6052207Z physical id : 1 2025-05-07T19:43:02.6052404Z siblings : 48 2025-05-07T19:43:02.6052606Z core id : 9 2025-05-07T19:43:02.6052798Z cpu cores : 24 2025-05-07T19:43:02.6053001Z apicid : 82 2025-05-07T19:43:02.6053190Z initial apicid : 82 2025-05-07T19:43:02.6053408Z fpu : yes 2025-05-07T19:43:02.6053588Z fpu_exception : yes 2025-05-07T19:43:02.6053806Z cpuid level : 13 2025-05-07T19:43:02.6053996Z wp : yes 2025-05-07T19:43:02.6056118Z flags : fpu vme de pse tsc msr pae mce cx8 apic sep mtrr pge mca cmov pat pse36 clflush mmx fxsr sse sse2 ss ht syscall nx pdpe1gb rdtscp lm constant_tsc arch_perfmon rep_good nopl xtopology nonstop_tsc cpuid aperfmperf tsc_known_freq pni pclmulqdq monitor ssse3 fma cx16 pcid sse4_1 sse4_2 x2apic movbe popcnt tsc_deadline_timer aes xsave avx f16c rdrand hypervisor lahf_lm abm 3dnowprefetch invpcid_single pti fsgsbase tsc_adjust bmi1 avx2 smep bmi2 erms invpcid mpx avx512f avx512dq rdseed adx smap clflushopt clwb avx512cd avx512bw avx512vl xsaveopt xsavec xgetbv1 xsaves ida arat pku ospke avx512_vnni 
2025-05-07T19:43:02.6058716Z bugs : cpu_meltdown spectre_v1 spectre_v2 spec_store_bypass l1tf mds swapgs itlb_multihit mmio_stale_data retbleed gds bhi 2025-05-07T19:43:02.6059266Z bogomips : 5999.97 2025-05-07T19:43:02.6059483Z clflush size : 64 2025-05-07T19:43:02.6059702Z cache_alignment : 64 2025-05-07T19:43:02.6059957Z address sizes : 46 bits physical, 48 bits virtual 2025-05-07T19:43:02.6060276Z power management: 2025-05-07T19:43:02.6060401Z 2025-05-07T19:43:02.6060482Z processor : 34 2025-05-07T19:43:02.6060700Z vendor_id : GenuineIntel 2025-05-07T19:43:02.6060921Z cpu family : 6 2025-05-07T19:43:02.6061127Z model : 85 2025-05-07T19:43:02.6061385Z model name : Intel(R) Xeon(R) Platinum 8275CL CPU @ 3.00GHz 2025-05-07T19:43:02.6061730Z stepping : 7 2025-05-07T19:43:02.6061926Z microcode : 0x5003901 2025-05-07T19:43:02.6062158Z cpu MHz : 2999.988 2025-05-07T19:43:02.6062364Z cache size : 36608 KB 2025-05-07T19:43:02.6062591Z physical id : 1 2025-05-07T19:43:02.6062803Z siblings : 48 2025-05-07T19:43:02.6126078Z core id : 10 2025-05-07T19:43:02.6126547Z cpu cores : 24 2025-05-07T19:43:02.6126764Z apicid : 84 2025-05-07T19:43:02.6126981Z initial apicid : 84 2025-05-07T19:43:02.6127207Z fpu : yes 2025-05-07T19:43:02.6127433Z fpu_exception : yes 2025-05-07T19:43:02.6127660Z cpuid level : 13 2025-05-07T19:43:02.6127748Z wp : yes 2025-05-07T19:43:02.6129927Z flags : fpu vme de pse tsc msr pae mce cx8 apic sep mtrr pge mca cmov pat pse36 clflush mmx fxsr sse sse2 ss ht syscall nx pdpe1gb rdtscp lm constant_tsc arch_perfmon rep_good nopl xtopology nonstop_tsc cpuid aperfmperf tsc_known_freq pni pclmulqdq monitor ssse3 fma cx16 pcid sse4_1 sse4_2 x2apic movbe popcnt tsc_deadline_timer aes xsave avx f16c rdrand hypervisor lahf_lm abm 3dnowprefetch invpcid_single pti fsgsbase tsc_adjust bmi1 avx2 smep bmi2 erms invpcid mpx avx512f avx512dq rdseed adx smap clflushopt clwb avx512cd avx512bw avx512vl xsaveopt xsavec xgetbv1 xsaves ida arat pku ospke avx512_vnni 
2025-05-07T19:43:02.6130318Z bugs : cpu_meltdown spectre_v1 spectre_v2 spec_store_bypass l1tf mds swapgs itlb_multihit mmio_stale_data retbleed gds bhi 2025-05-07T19:43:02.6130400Z bogomips : 5999.97 2025-05-07T19:43:02.6130477Z clflush size : 64 2025-05-07T19:43:02.6130742Z cache_alignment : 64 2025-05-07T19:43:02.6130867Z address sizes : 46 bits physical, 48 bits virtual 2025-05-07T19:43:02.6130946Z power management: 2025-05-07T19:43:02.6130954Z 2025-05-07T19:43:02.6131039Z processor : 35 2025-05-07T19:43:02.6131124Z vendor_id : GenuineIntel 2025-05-07T19:43:02.6131197Z cpu family : 6 2025-05-07T19:43:02.6131267Z model : 85 2025-05-07T19:43:02.6131433Z model name : Intel(R) Xeon(R) Platinum 8275CL CPU @ 3.00GHz 2025-05-07T19:43:02.6131507Z stepping : 7 2025-05-07T19:43:02.6131585Z microcode : 0x5003901 2025-05-07T19:43:02.6131666Z cpu MHz : 3623.058 2025-05-07T19:43:02.6131742Z cache size : 36608 KB 2025-05-07T19:43:02.6131816Z physical id : 1 2025-05-07T19:43:02.6131888Z siblings : 48 2025-05-07T19:43:02.6131969Z core id : 11 2025-05-07T19:43:02.6132042Z cpu cores : 24 2025-05-07T19:43:02.6132117Z apicid : 86 2025-05-07T19:43:02.6132205Z initial apicid : 86 2025-05-07T19:43:02.6132275Z fpu : yes 2025-05-07T19:43:02.6132356Z fpu_exception : yes 2025-05-07T19:43:02.6132435Z cpuid level : 13 2025-05-07T19:43:02.6132514Z wp : yes 2025-05-07T19:43:02.6134629Z flags : fpu vme de pse tsc msr pae mce cx8 apic sep mtrr pge mca cmov pat pse36 clflush mmx fxsr sse sse2 ss ht syscall nx pdpe1gb rdtscp lm constant_tsc arch_perfmon rep_good nopl xtopology nonstop_tsc cpuid aperfmperf tsc_known_freq pni pclmulqdq monitor ssse3 fma cx16 pcid sse4_1 sse4_2 x2apic movbe popcnt tsc_deadline_timer aes xsave avx f16c rdrand hypervisor lahf_lm abm 3dnowprefetch invpcid_single pti fsgsbase tsc_adjust bmi1 avx2 smep bmi2 erms invpcid mpx avx512f avx512dq rdseed adx smap clflushopt clwb avx512cd avx512bw avx512vl xsaveopt xsavec xgetbv1 xsaves ida arat pku ospke avx512_vnni 
2025-05-07T19:43:02.6135014Z bugs : cpu_meltdown spectre_v1 spectre_v2 spec_store_bypass l1tf mds swapgs itlb_multihit mmio_stale_data retbleed gds bhi 2025-05-07T19:43:02.6135090Z bogomips : 5999.97 2025-05-07T19:43:02.6135239Z clflush size : 64 2025-05-07T19:43:02.6135319Z cache_alignment : 64 2025-05-07T19:43:02.6135457Z address sizes : 46 bits physical, 48 bits virtual 2025-05-07T19:43:02.6135535Z power management: 2025-05-07T19:43:02.6135540Z 2025-05-07T19:43:02.6135615Z processor : 36 2025-05-07T19:43:02.6135702Z vendor_id : GenuineIntel 2025-05-07T19:43:02.6135773Z cpu family : 6 2025-05-07T19:43:02.6135842Z model : 85 2025-05-07T19:43:02.6135996Z model name : Intel(R) Xeon(R) Platinum 8275CL CPU @ 3.00GHz 2025-05-07T19:43:02.6136075Z stepping : 7 2025-05-07T19:43:02.6136152Z microcode : 0x5003901 2025-05-07T19:43:02.6136224Z cpu MHz : 2999.988 2025-05-07T19:43:02.6136305Z cache size : 36608 KB 2025-05-07T19:43:02.6136380Z physical id : 1 2025-05-07T19:43:02.6136454Z siblings : 48 2025-05-07T19:43:02.6136524Z core id : 12 2025-05-07T19:43:02.6136601Z cpu cores : 24 2025-05-07T19:43:02.6136673Z apicid : 88 2025-05-07T19:43:02.6136751Z initial apicid : 88 2025-05-07T19:43:02.6136825Z fpu : yes 2025-05-07T19:43:02.6136905Z fpu_exception : yes 2025-05-07T19:43:02.6136980Z cpuid level : 13 2025-05-07T19:43:02.6137054Z wp : yes 2025-05-07T19:43:02.6139174Z flags : fpu vme de pse tsc msr pae mce cx8 apic sep mtrr pge mca cmov pat pse36 clflush mmx fxsr sse sse2 ss ht syscall nx pdpe1gb rdtscp lm constant_tsc arch_perfmon rep_good nopl xtopology nonstop_tsc cpuid aperfmperf tsc_known_freq pni pclmulqdq monitor ssse3 fma cx16 pcid sse4_1 sse4_2 x2apic movbe popcnt tsc_deadline_timer aes xsave avx f16c rdrand hypervisor lahf_lm abm 3dnowprefetch invpcid_single pti fsgsbase tsc_adjust bmi1 avx2 smep bmi2 erms invpcid mpx avx512f avx512dq rdseed adx smap clflushopt clwb avx512cd avx512bw avx512vl xsaveopt xsavec xgetbv1 xsaves ida arat pku ospke avx512_vnni 
2025-05-07T19:43:02.6139551Z bugs : cpu_meltdown spectre_v1 spectre_v2 spec_store_bypass l1tf mds swapgs itlb_multihit mmio_stale_data retbleed gds bhi 2025-05-07T19:43:02.6139636Z bogomips : 5999.97 2025-05-07T19:43:02.6139825Z clflush size : 64 2025-05-07T19:43:02.6139903Z cache_alignment : 64 2025-05-07T19:43:02.6140021Z address sizes : 46 bits physical, 48 bits virtual 2025-05-07T19:43:02.6140153Z power management: 2025-05-07T19:43:02.6140158Z 2025-05-07T19:43:02.6140228Z processor : 37 2025-05-07T19:43:02.6140307Z vendor_id : GenuineIntel 2025-05-07T19:43:02.6140383Z cpu family : 6 2025-05-07T19:43:02.6140450Z model : 85 2025-05-07T19:43:02.6140595Z model name : Intel(R) Xeon(R) Platinum 8275CL CPU @ 3.00GHz 2025-05-07T19:43:02.6140663Z stepping : 7 2025-05-07T19:43:02.6140747Z microcode : 0x5003901 2025-05-07T19:43:02.6140818Z cpu MHz : 2999.988 2025-05-07T19:43:02.6140893Z cache size : 36608 KB 2025-05-07T19:43:02.6140971Z physical id : 1 2025-05-07T19:43:02.6141039Z siblings : 48 2025-05-07T19:43:02.6141107Z core id : 13 2025-05-07T19:43:02.6141175Z cpu cores : 24 2025-05-07T19:43:02.6141251Z apicid : 90 2025-05-07T19:43:02.6141325Z initial apicid : 90 2025-05-07T19:43:02.6141391Z fpu : yes 2025-05-07T19:43:02.6141468Z fpu_exception : yes 2025-05-07T19:43:02.6141553Z cpuid level : 13 2025-05-07T19:43:02.6141621Z wp : yes 2025-05-07T19:43:02.6143646Z flags : fpu vme de pse tsc msr pae mce cx8 apic sep mtrr pge mca cmov pat pse36 clflush mmx fxsr sse sse2 ss ht syscall nx pdpe1gb rdtscp lm constant_tsc arch_perfmon rep_good nopl xtopology nonstop_tsc cpuid aperfmperf tsc_known_freq pni pclmulqdq monitor ssse3 fma cx16 pcid sse4_1 sse4_2 x2apic movbe popcnt tsc_deadline_timer aes xsave avx f16c rdrand hypervisor lahf_lm abm 3dnowprefetch invpcid_single pti fsgsbase tsc_adjust bmi1 avx2 smep bmi2 erms invpcid mpx avx512f avx512dq rdseed adx smap clflushopt clwb avx512cd avx512bw avx512vl xsaveopt xsavec xgetbv1 xsaves ida arat pku ospke avx512_vnni 
2025-05-07T19:43:02.6144015Z bugs : cpu_meltdown spectre_v1 spectre_v2 spec_store_bypass l1tf mds swapgs itlb_multihit mmio_stale_data retbleed gds bhi 2025-05-07T19:43:02.6144089Z bogomips : 5999.97 2025-05-07T19:43:02.6144166Z clflush size : 64 2025-05-07T19:43:02.6144244Z cache_alignment : 64 2025-05-07T19:43:02.6144481Z address sizes : 46 bits physical, 48 bits virtual 2025-05-07T19:43:02.6144560Z power management: 2025-05-07T19:43:02.6144568Z 2025-05-07T19:43:02.6144651Z processor : 38 2025-05-07T19:43:02.6144730Z vendor_id : GenuineIntel 2025-05-07T19:43:02.6144800Z cpu family : 6 2025-05-07T19:43:02.6144879Z model : 85 2025-05-07T19:43:02.6145026Z model name : Intel(R) Xeon(R) Platinum 8275CL CPU @ 3.00GHz 2025-05-07T19:43:02.6145099Z stepping : 7 2025-05-07T19:43:02.6145175Z microcode : 0x5003901 2025-05-07T19:43:02.6145255Z cpu MHz : 2999.988 2025-05-07T19:43:02.6145334Z cache size : 36608 KB 2025-05-07T19:43:02.6145407Z physical id : 1 2025-05-07T19:43:02.6145483Z siblings : 48 2025-05-07T19:43:02.6145552Z core id : 14 2025-05-07T19:43:02.6145623Z cpu cores : 24 2025-05-07T19:43:02.6145694Z apicid : 92 2025-05-07T19:43:02.6145775Z initial apicid : 92 2025-05-07T19:43:02.6145842Z fpu : yes 2025-05-07T19:43:02.6145919Z fpu_exception : yes 2025-05-07T19:43:02.6145991Z cpuid level : 13 2025-05-07T19:43:02.6146070Z wp : yes 2025-05-07T19:43:02.6148068Z flags : fpu vme de pse tsc msr pae mce cx8 apic sep mtrr pge mca cmov pat pse36 clflush mmx fxsr sse sse2 ss ht syscall nx pdpe1gb rdtscp lm constant_tsc arch_perfmon rep_good nopl xtopology nonstop_tsc cpuid aperfmperf tsc_known_freq pni pclmulqdq monitor ssse3 fma cx16 pcid sse4_1 sse4_2 x2apic movbe popcnt tsc_deadline_timer aes xsave avx f16c rdrand hypervisor lahf_lm abm 3dnowprefetch invpcid_single pti fsgsbase tsc_adjust bmi1 avx2 smep bmi2 erms invpcid mpx avx512f avx512dq rdseed adx smap clflushopt clwb avx512cd avx512bw avx512vl xsaveopt xsavec xgetbv1 xsaves ida arat pku ospke avx512_vnni 
2025-05-07T19:43:02.6148439Z bugs : cpu_meltdown spectre_v1 spectre_v2 spec_store_bypass l1tf mds swapgs itlb_multihit mmio_stale_data retbleed gds bhi 2025-05-07T19:43:02.6148509Z bogomips : 5999.97 2025-05-07T19:43:02.6148585Z clflush size : 64 2025-05-07T19:43:02.6148662Z cache_alignment : 64 2025-05-07T19:43:02.6148789Z address sizes : 46 bits physical, 48 bits virtual 2025-05-07T19:43:02.6148862Z power management: 2025-05-07T19:43:02.6148867Z 2025-05-07T19:43:02.6148985Z processor : 39 2025-05-07T19:43:02.6149073Z vendor_id : GenuineIntel 2025-05-07T19:43:02.6149141Z cpu family : 6 2025-05-07T19:43:02.6149209Z model : 85 2025-05-07T19:43:02.6149360Z model name : Intel(R) Xeon(R) Platinum 8275CL CPU @ 3.00GHz 2025-05-07T19:43:02.6149432Z stepping : 7 2025-05-07T19:43:02.6149505Z microcode : 0x5003901 2025-05-07T19:43:02.6149575Z cpu MHz : 1792.785 2025-05-07T19:43:02.6149659Z cache size : 36608 KB 2025-05-07T19:43:02.6149733Z physical id : 1 2025-05-07T19:43:02.6149802Z siblings : 48 2025-05-07T19:43:02.6149870Z core id : 15 2025-05-07T19:43:02.6150097Z cpu cores : 24 2025-05-07T19:43:02.6150164Z apicid : 94 2025-05-07T19:43:02.6150239Z initial apicid : 94 2025-05-07T19:43:02.6150313Z fpu : yes 2025-05-07T19:43:02.6150386Z fpu_exception : yes 2025-05-07T19:43:02.6150455Z cpuid level : 13 2025-05-07T19:43:02.6150519Z wp : yes 2025-05-07T19:43:02.6152532Z flags : fpu vme de pse tsc msr pae mce cx8 apic sep mtrr pge mca cmov pat pse36 clflush mmx fxsr sse sse2 ss ht syscall nx pdpe1gb rdtscp lm constant_tsc arch_perfmon rep_good nopl xtopology nonstop_tsc cpuid aperfmperf tsc_known_freq pni pclmulqdq monitor ssse3 fma cx16 pcid sse4_1 sse4_2 x2apic movbe popcnt tsc_deadline_timer aes xsave avx f16c rdrand hypervisor lahf_lm abm 3dnowprefetch invpcid_single pti fsgsbase tsc_adjust bmi1 avx2 smep bmi2 erms invpcid mpx avx512f avx512dq rdseed adx smap clflushopt clwb avx512cd avx512bw avx512vl xsaveopt xsavec xgetbv1 xsaves ida arat pku ospke avx512_vnni 
2025-05-07T19:43:02.6152988Z bugs : cpu_meltdown spectre_v1 spectre_v2 spec_store_bypass l1tf mds swapgs itlb_multihit mmio_stale_data retbleed gds bhi 2025-05-07T19:43:02.6153071Z bogomips : 5999.97 2025-05-07T19:43:02.6153316Z clflush size : 64 2025-05-07T19:43:02.6153397Z cache_alignment : 64 2025-05-07T19:43:02.6153524Z address sizes : 46 bits physical, 48 bits virtual 2025-05-07T19:43:02.6153687Z power management: 2025-05-07T19:43:02.6153693Z 2025-05-07T19:43:02.6153768Z processor : 40 2025-05-07T19:43:02.6153858Z vendor_id : GenuineIntel 2025-05-07T19:43:02.6153941Z cpu family : 6 2025-05-07T19:43:02.6154012Z model : 85 2025-05-07T19:43:02.6154169Z model name : Intel(R) Xeon(R) Platinum 8275CL CPU @ 3.00GHz 2025-05-07T19:43:02.6154245Z stepping : 7 2025-05-07T19:43:02.6154329Z microcode : 0x5003901 2025-05-07T19:43:02.6154402Z cpu MHz : 2999.988 2025-05-07T19:43:02.6154479Z cache size : 36608 KB 2025-05-07T19:43:02.6154560Z physical id : 1 2025-05-07T19:43:02.6154699Z siblings : 48 2025-05-07T19:43:02.6154771Z core id : 16 2025-05-07T19:43:02.6154844Z cpu cores : 24 2025-05-07T19:43:02.6154924Z apicid : 96 2025-05-07T19:43:02.6155006Z initial apicid : 96 2025-05-07T19:43:02.6155076Z fpu : yes 2025-05-07T19:43:02.6155163Z fpu_exception : yes 2025-05-07T19:43:02.6155238Z cpuid level : 13 2025-05-07T19:43:02.6155309Z wp : yes 2025-05-07T19:43:02.6155726Z lspci: Unable to load libkmod resources: error -2 2025-05-07T19:43:02.6157923Z flags : fpu vme de pse tsc msr pae mce cx8 apic sep mtrr pge mca cmov pat pse36 clflush mmx fxsr sse sse2 ss ht syscall nx pdpe1gb rdtscp lm constant_tsc arch_perfmon rep_good nopl xtopology nonstop_tsc cpuid aperfmperf tsc_known_freq pni pclmulqdq monitor ssse3 fma cx16 pcid sse4_1 sse4_2 x2apic movbe popcnt tsc_deadline_timer aes xsave avx f16c rdrand hypervisor lahf_lm abm 3dnowprefetch invpcid_single pti fsgsbase tsc_adjust bmi1 avx2 smep bmi2 erms invpcid mpx avx512f avx512dq rdseed adx smap clflushopt clwb avx512cd avx512bw 
avx512vl xsaveopt xsavec xgetbv1 xsaves ida arat pku ospke avx512_vnni 2025-05-07T19:43:02.6158315Z bugs : cpu_meltdown spectre_v1 spectre_v2 spec_store_bypass l1tf mds swapgs itlb_multihit mmio_stale_data retbleed gds bhi 2025-05-07T19:43:02.6158396Z bogomips : 5999.97 2025-05-07T19:43:02.6158472Z clflush size : 64 2025-05-07T19:43:02.6158550Z cache_alignment : 64 2025-05-07T19:43:02.6158674Z address sizes : 46 bits physical, 48 bits virtual 2025-05-07T19:43:02.6158759Z power management: 2025-05-07T19:43:02.6158764Z 2025-05-07T19:43:02.6158838Z processor : 41 2025-05-07T19:43:02.6158974Z vendor_id : GenuineIntel 2025-05-07T19:43:02.6159054Z cpu family : 6 2025-05-07T19:43:02.6159134Z model : 85 2025-05-07T19:43:02.6159297Z model name : Intel(R) Xeon(R) Platinum 8275CL CPU @ 3.00GHz 2025-05-07T19:43:02.6159393Z stepping : 7 2025-05-07T19:43:02.6159480Z microcode : 0x5003901 2025-05-07T19:43:02.6159564Z cpu MHz : 2999.988 2025-05-07T19:43:02.6159650Z cache size : 36608 KB 2025-05-07T19:43:02.6159749Z physical id : 1 2025-05-07T19:43:02.6159831Z siblings : 48 2025-05-07T19:43:02.6159913Z core id : 17 2025-05-07T19:43:02.6159995Z cpu cores : 24 2025-05-07T19:43:02.6160089Z apicid : 98 2025-05-07T19:43:02.6160176Z initial apicid : 98 2025-05-07T19:43:02.6160258Z fpu : yes 2025-05-07T19:43:02.6160361Z fpu_exception : yes 2025-05-07T19:43:02.6160443Z cpuid level : 13 2025-05-07T19:43:02.6160522Z wp : yes 2025-05-07T19:43:02.6162716Z flags : fpu vme de pse tsc msr pae mce cx8 apic sep mtrr pge mca cmov pat pse36 clflush mmx fxsr sse sse2 ss ht syscall nx pdpe1gb rdtscp lm constant_tsc arch_perfmon rep_good nopl xtopology nonstop_tsc cpuid aperfmperf tsc_known_freq pni pclmulqdq monitor ssse3 fma cx16 pcid sse4_1 sse4_2 x2apic movbe popcnt tsc_deadline_timer aes xsave avx f16c rdrand hypervisor lahf_lm abm 3dnowprefetch invpcid_single pti fsgsbase tsc_adjust bmi1 avx2 smep bmi2 erms invpcid mpx avx512f avx512dq rdseed adx smap clflushopt clwb avx512cd avx512bw avx512vl 
xsaveopt xsavec xgetbv1 xsaves ida arat pku ospke avx512_vnni 2025-05-07T19:43:02.6163116Z bugs : cpu_meltdown spectre_v1 spectre_v2 spec_store_bypass l1tf mds swapgs itlb_multihit mmio_stale_data retbleed gds bhi 2025-05-07T19:43:02.6163202Z bogomips : 5999.97 2025-05-07T19:43:02.6163305Z clflush size : 64 2025-05-07T19:43:02.6163396Z cache_alignment : 64 2025-05-07T19:43:02.6163530Z address sizes : 46 bits physical, 48 bits virtual 2025-05-07T19:43:02.6163620Z power management: 2025-05-07T19:43:02.6163696Z 2025-05-07T19:43:02.6163783Z processor : 42 2025-05-07T19:43:02.6163876Z vendor_id : GenuineIntel 2025-05-07T19:43:02.6163965Z cpu family : 6 2025-05-07T19:43:02.6164059Z model : 85 2025-05-07T19:43:02.6164225Z model name : Intel(R) Xeon(R) Platinum 8275CL CPU @ 3.00GHz 2025-05-07T19:43:02.6164309Z stepping : 7 2025-05-07T19:43:02.6164399Z microcode : 0x5003901 2025-05-07T19:43:02.6164501Z cpu MHz : 2999.988 2025-05-07T19:43:02.6164590Z cache size : 36608 KB 2025-05-07T19:43:02.6164676Z physical id : 1 2025-05-07T19:43:02.6164778Z siblings : 48 2025-05-07T19:43:02.6164861Z core id : 18 2025-05-07T19:43:02.6164945Z cpu cores : 24 2025-05-07T19:43:02.6165031Z apicid : 100 2025-05-07T19:43:02.6165141Z initial apicid : 100 2025-05-07T19:43:02.6165224Z fpu : yes 2025-05-07T19:43:02.6165316Z fpu_exception : yes 2025-05-07T19:43:02.6165533Z cpuid level : 13 2025-05-07T19:43:02.6165608Z wp : yes 2025-05-07T19:43:02.6167629Z flags : fpu vme de pse tsc msr pae mce cx8 apic sep mtrr pge mca cmov pat pse36 clflush mmx fxsr sse sse2 ss ht syscall nx pdpe1gb rdtscp lm constant_tsc arch_perfmon rep_good nopl xtopology nonstop_tsc cpuid aperfmperf tsc_known_freq pni pclmulqdq monitor ssse3 fma cx16 pcid sse4_1 sse4_2 x2apic movbe popcnt tsc_deadline_timer aes xsave avx f16c rdrand hypervisor lahf_lm abm 3dnowprefetch invpcid_single pti fsgsbase tsc_adjust bmi1 avx2 smep bmi2 erms invpcid mpx avx512f avx512dq rdseed adx smap clflushopt clwb avx512cd avx512bw avx512vl xsaveopt 
xsavec xgetbv1 xsaves ida arat pku ospke avx512_vnni 2025-05-07T19:43:02.6168019Z bugs : cpu_meltdown spectre_v1 spectre_v2 spec_store_bypass l1tf mds swapgs itlb_multihit mmio_stale_data retbleed gds bhi 2025-05-07T19:43:02.6168099Z bogomips : 5999.97 2025-05-07T19:43:02.6168177Z clflush size : 64 2025-05-07T19:43:02.6168281Z cache_alignment : 64 2025-05-07T19:43:02.6168409Z address sizes : 46 bits physical, 48 bits virtual 2025-05-07T19:43:02.6168493Z power management: 2025-05-07T19:43:02.6168497Z 2025-05-07T19:43:02.6168586Z processor : 43 2025-05-07T19:43:02.6168689Z vendor_id : GenuineIntel 2025-05-07T19:43:02.6168771Z cpu family : 6 2025-05-07T19:43:02.6168899Z model : 85 2025-05-07T19:43:02.6169069Z model name : Intel(R) Xeon(R) Platinum 8275CL CPU @ 3.00GHz 2025-05-07T19:43:02.6169150Z stepping : 7 2025-05-07T19:43:02.6169239Z microcode : 0x5003901 2025-05-07T19:43:02.6169321Z cpu MHz : 2999.988 2025-05-07T19:43:02.6169419Z cache size : 36608 KB 2025-05-07T19:43:02.6169502Z physical id : 1 2025-05-07T19:43:02.6169582Z siblings : 48 2025-05-07T19:43:02.6169678Z core id : 19 2025-05-07T19:43:02.6169757Z cpu cores : 24 2025-05-07T19:43:02.6169832Z apicid : 102 2025-05-07T19:43:02.6169919Z initial apicid : 102 2025-05-07T19:43:02.6170010Z fpu : yes 2025-05-07T19:43:02.6170091Z fpu_exception : yes 2025-05-07T19:43:02.6170167Z cpuid level : 13 2025-05-07T19:43:02.6170256Z wp : yes 2025-05-07T19:43:02.6172258Z flags : fpu vme de pse tsc msr pae mce cx8 apic sep mtrr pge mca cmov pat pse36 clflush mmx fxsr sse sse2 ss ht syscall nx pdpe1gb rdtscp lm constant_tsc arch_perfmon rep_good nopl xtopology nonstop_tsc cpuid aperfmperf tsc_known_freq pni pclmulqdq monitor ssse3 fma cx16 pcid sse4_1 sse4_2 x2apic movbe popcnt tsc_deadline_timer aes xsave avx f16c rdrand hypervisor lahf_lm abm 3dnowprefetch invpcid_single pti fsgsbase tsc_adjust bmi1 avx2 smep bmi2 erms invpcid mpx avx512f avx512dq rdseed adx smap clflushopt clwb avx512cd avx512bw avx512vl xsaveopt xsavec 
xgetbv1 xsaves ida arat pku ospke avx512_vnni 2025-05-07T19:43:02.6172626Z bugs : cpu_meltdown spectre_v1 spectre_v2 spec_store_bypass l1tf mds swapgs itlb_multihit mmio_stale_data retbleed gds bhi 2025-05-07T19:43:02.6172720Z bogomips : 5999.97 2025-05-07T19:43:02.6172798Z clflush size : 64 2025-05-07T19:43:02.6172881Z cache_alignment : 64 2025-05-07T19:43:02.6173016Z address sizes : 46 bits physical, 48 bits virtual 2025-05-07T19:43:02.6173098Z power management: 2025-05-07T19:43:02.6173102Z 2025-05-07T19:43:02.6173179Z processor : 44 2025-05-07T19:43:02.6173311Z vendor_id : GenuineIntel 2025-05-07T19:43:02.6173403Z cpu family : 6 2025-05-07T19:43:02.6173477Z model : 85 2025-05-07T19:43:02.6173631Z model name : Intel(R) Xeon(R) Platinum 8275CL CPU @ 3.00GHz 2025-05-07T19:43:02.6173729Z stepping : 7 2025-05-07T19:43:02.6173808Z microcode : 0x5003901 2025-05-07T19:43:02.6173886Z cpu MHz : 2999.988 2025-05-07T19:43:02.6173966Z cache size : 36608 KB 2025-05-07T19:43:02.6174058Z physical id : 1 2025-05-07T19:43:02.6174135Z siblings : 48 2025-05-07T19:43:02.6174212Z core id : 20 2025-05-07T19:43:02.6174303Z cpu cores : 24 2025-05-07T19:43:02.6174379Z apicid : 104 2025-05-07T19:43:02.6174466Z initial apicid : 104 2025-05-07T19:43:02.6174542Z fpu : yes 2025-05-07T19:43:02.6174646Z fpu_exception : yes 2025-05-07T19:43:02.6174729Z cpuid level : 13 2025-05-07T19:43:02.6174804Z wp : yes 2025-05-07T19:43:02.6176851Z flags : fpu vme de pse tsc msr pae mce cx8 apic sep mtrr pge mca cmov pat pse36 clflush mmx fxsr sse sse2 ss ht syscall nx pdpe1gb rdtscp lm constant_tsc arch_perfmon rep_good nopl xtopology nonstop_tsc cpuid aperfmperf tsc_known_freq pni pclmulqdq monitor ssse3 fma cx16 pcid sse4_1 sse4_2 x2apic movbe popcnt tsc_deadline_timer aes xsave avx f16c rdrand hypervisor lahf_lm abm 3dnowprefetch invpcid_single pti fsgsbase tsc_adjust bmi1 avx2 smep bmi2 erms invpcid mpx avx512f avx512dq rdseed adx smap clflushopt clwb avx512cd avx512bw avx512vl xsaveopt xsavec xgetbv1 
xsaves ida arat pku ospke avx512_vnni 2025-05-07T19:43:02.6177222Z bugs : cpu_meltdown spectre_v1 spectre_v2 spec_store_bypass l1tf mds swapgs itlb_multihit mmio_stale_data retbleed gds bhi 2025-05-07T19:43:02.6177301Z bogomips : 5999.97 2025-05-07T19:43:02.6177395Z clflush size : 64 2025-05-07T19:43:02.6177478Z cache_alignment : 64 2025-05-07T19:43:02.6177604Z address sizes : 46 bits physical, 48 bits virtual 2025-05-07T19:43:02.6177701Z power management: 2025-05-07T19:43:02.6177705Z 2025-05-07T19:43:02.6177784Z processor : 45 2025-05-07T19:43:02.6177873Z vendor_id : GenuineIntel 2025-05-07T19:43:02.6177952Z cpu family : 6 2025-05-07T19:43:02.6178044Z model : 85 2025-05-07T19:43:02.6178197Z model name : Intel(R) Xeon(R) Platinum 8275CL CPU @ 3.00GHz 2025-05-07T19:43:02.6178327Z stepping : 7 2025-05-07T19:43:02.6178421Z microcode : 0x5003901 2025-05-07T19:43:02.6178499Z cpu MHz : 2999.988 2025-05-07T19:43:02.6178578Z cache size : 36608 KB 2025-05-07T19:43:02.6178658Z physical id : 1 2025-05-07T19:43:02.6178749Z siblings : 48 2025-05-07T19:43:02.6178825Z core id : 21 2025-05-07T19:43:02.6178904Z cpu cores : 24 2025-05-07T19:43:02.6178996Z apicid : 106 2025-05-07T19:43:02.6179079Z initial apicid : 106 2025-05-07T19:43:02.6179155Z fpu : yes 2025-05-07T19:43:02.6179240Z fpu_exception : yes 2025-05-07T19:43:02.6179337Z cpuid level : 13 2025-05-07T19:43:02.6179413Z wp : yes 2025-05-07T19:43:02.6181424Z flags : fpu vme de pse tsc msr pae mce cx8 apic sep mtrr pge mca cmov pat pse36 clflush mmx fxsr sse sse2 ss ht syscall nx pdpe1gb rdtscp lm constant_tsc arch_perfmon rep_good nopl xtopology nonstop_tsc cpuid aperfmperf tsc_known_freq pni pclmulqdq monitor ssse3 fma cx16 pcid sse4_1 sse4_2 x2apic movbe popcnt tsc_deadline_timer aes xsave avx f16c rdrand hypervisor lahf_lm abm 3dnowprefetch invpcid_single pti fsgsbase tsc_adjust bmi1 avx2 smep bmi2 erms invpcid mpx avx512f avx512dq rdseed adx smap clflushopt clwb avx512cd avx512bw avx512vl xsaveopt xsavec xgetbv1 xsaves ida 
arat pku ospke avx512_vnni 2025-05-07T19:43:02.6181816Z bugs : cpu_meltdown spectre_v1 spectre_v2 spec_store_bypass l1tf mds swapgs itlb_multihit mmio_stale_data retbleed gds bhi 2025-05-07T19:43:02.6181897Z bogomips : 5999.97 2025-05-07T19:43:02.6181977Z clflush size : 64 2025-05-07T19:43:02.6182073Z cache_alignment : 64 2025-05-07T19:43:02.6182198Z address sizes : 46 bits physical, 48 bits virtual 2025-05-07T19:43:02.6182280Z power management: 2025-05-07T19:43:02.6182284Z 2025-05-07T19:43:02.6182376Z processor : 46 2025-05-07T19:43:02.6182463Z vendor_id : GenuineIntel 2025-05-07T19:43:02.6182539Z cpu family : 6 2025-05-07T19:43:02.6182613Z model : 85 2025-05-07T19:43:02.6182828Z model name : Intel(R) Xeon(R) Platinum 8275CL CPU @ 3.00GHz 2025-05-07T19:43:02.6182908Z stepping : 7 2025-05-07T19:43:02.6182991Z microcode : 0x5003901 2025-05-07T19:43:02.6183082Z cpu MHz : 2999.988 2025-05-07T19:43:02.6183161Z cache size : 36608 KB 2025-05-07T19:43:02.6183240Z physical id : 1 2025-05-07T19:43:02.6183315Z siblings : 48 2025-05-07T19:43:02.6183405Z core id : 22 2025-05-07T19:43:02.6183481Z cpu cores : 24 2025-05-07T19:43:02.6183558Z apicid : 108 2025-05-07T19:43:02.6183657Z initial apicid : 108 2025-05-07T19:43:02.6183733Z fpu : yes 2025-05-07T19:43:02.6183813Z fpu_exception : yes 2025-05-07T19:43:02.6183892Z cpuid level : 13 2025-05-07T19:43:02.6183980Z wp : yes 2025-05-07T19:43:02.6185996Z flags : fpu vme de pse tsc msr pae mce cx8 apic sep mtrr pge mca cmov pat pse36 clflush mmx fxsr sse sse2 ss ht syscall nx pdpe1gb rdtscp lm constant_tsc arch_perfmon rep_good nopl xtopology nonstop_tsc cpuid aperfmperf tsc_known_freq pni pclmulqdq monitor ssse3 fma cx16 pcid sse4_1 sse4_2 x2apic movbe popcnt tsc_deadline_timer aes xsave avx f16c rdrand hypervisor lahf_lm abm 3dnowprefetch invpcid_single pti fsgsbase tsc_adjust bmi1 avx2 smep bmi2 erms invpcid mpx avx512f avx512dq rdseed adx smap clflushopt clwb avx512cd avx512bw avx512vl xsaveopt xsavec xgetbv1 xsaves ida arat pku 
ospke avx512_vnni 2025-05-07T19:43:02.6186381Z bugs : cpu_meltdown spectre_v1 spectre_v2 spec_store_bypass l1tf mds swapgs itlb_multihit mmio_stale_data retbleed gds bhi 2025-05-07T19:43:02.6186462Z bogomips : 5999.97 2025-05-07T19:43:02.6186541Z clflush size : 64 2025-05-07T19:43:02.6186626Z cache_alignment : 64 2025-05-07T19:43:02.6186759Z address sizes : 46 bits physical, 48 bits virtual 2025-05-07T19:43:02.6186840Z power management: 2025-05-07T19:43:02.6186844Z 2025-05-07T19:43:02.6186922Z processor : 47 2025-05-07T19:43:02.6187021Z vendor_id : GenuineIntel 2025-05-07T19:43:02.6187096Z cpu family : 6 2025-05-07T19:43:02.6187171Z model : 85 2025-05-07T19:43:02.6187324Z model name : Intel(R) Xeon(R) Platinum 8275CL CPU @ 3.00GHz 2025-05-07T19:43:02.6187414Z stepping : 7 2025-05-07T19:43:02.6187555Z microcode : 0x5003901 2025-05-07T19:43:02.6187632Z cpu MHz : 2999.988 2025-05-07T19:43:02.6187728Z cache size : 36608 KB 2025-05-07T19:43:02.6187806Z physical id : 1 2025-05-07T19:43:02.6187881Z siblings : 48 2025-05-07T19:43:02.6187955Z core id : 23 2025-05-07T19:43:02.6188044Z cpu cores : 24 2025-05-07T19:43:02.6188120Z apicid : 110 2025-05-07T19:43:02.6188202Z initial apicid : 110 2025-05-07T19:43:02.6188275Z fpu : yes 2025-05-07T19:43:02.6188372Z fpu_exception : yes 2025-05-07T19:43:02.6188449Z cpuid level : 13 2025-05-07T19:43:02.6188522Z wp : yes 2025-05-07T19:43:02.6190560Z flags : fpu vme de pse tsc msr pae mce cx8 apic sep mtrr pge mca cmov pat pse36 clflush mmx fxsr sse sse2 ss ht syscall nx pdpe1gb rdtscp lm constant_tsc arch_perfmon rep_good nopl xtopology nonstop_tsc cpuid aperfmperf tsc_known_freq pni pclmulqdq monitor ssse3 fma cx16 pcid sse4_1 sse4_2 x2apic movbe popcnt tsc_deadline_timer aes xsave avx f16c rdrand hypervisor lahf_lm abm 3dnowprefetch invpcid_single pti fsgsbase tsc_adjust bmi1 avx2 smep bmi2 erms invpcid mpx avx512f avx512dq rdseed adx smap clflushopt clwb avx512cd avx512bw avx512vl xsaveopt xsavec xgetbv1 xsaves ida arat pku ospke 
avx512_vnni 2025-05-07T19:43:02.6190931Z bugs : cpu_meltdown spectre_v1 spectre_v2 spec_store_bypass l1tf mds swapgs itlb_multihit mmio_stale_data retbleed gds bhi 2025-05-07T19:43:02.6191026Z bogomips : 5999.97 2025-05-07T19:43:02.6191105Z clflush size : 64 2025-05-07T19:43:02.6191188Z cache_alignment : 64 2025-05-07T19:43:02.6191313Z address sizes : 46 bits physical, 48 bits virtual 2025-05-07T19:43:02.6191408Z power management: 2025-05-07T19:43:02.6191413Z 2025-05-07T19:43:02.6191490Z processor : 48 2025-05-07T19:43:02.6191576Z vendor_id : GenuineIntel 2025-05-07T19:43:02.6191672Z cpu family : 6 2025-05-07T19:43:02.6191756Z model : 85 2025-05-07T19:43:02.6191911Z model name : Intel(R) Xeon(R) Platinum 8275CL CPU @ 3.00GHz 2025-05-07T19:43:02.6192036Z stepping : 7 2025-05-07T19:43:02.6192135Z microcode : 0x5003901 2025-05-07T19:43:02.6192220Z cpu MHz : 3133.375 2025-05-07T19:43:02.6192300Z cache size : 36608 KB 2025-05-07T19:43:02.6192397Z physical id : 0 2025-05-07T19:43:02.6192479Z siblings : 48 2025-05-07T19:43:02.6192555Z core id : 0 2025-05-07T19:43:02.6192635Z cpu cores : 24 2025-05-07T19:43:02.6192802Z apicid : 1 2025-05-07T19:43:02.6192886Z initial apicid : 1 2025-05-07T19:43:02.6192961Z fpu : yes 2025-05-07T19:43:02.6193042Z fpu_exception : yes 2025-05-07T19:43:02.6193304Z cpuid level : 13 2025-05-07T19:43:02.6193385Z wp : yes 2025-05-07T19:43:02.6195559Z flags : fpu vme de pse tsc msr pae mce cx8 apic sep mtrr pge mca cmov pat pse36 clflush mmx fxsr sse sse2 ss ht syscall nx pdpe1gb rdtscp lm constant_tsc arch_perfmon rep_good nopl xtopology nonstop_tsc cpuid aperfmperf tsc_known_freq pni pclmulqdq monitor ssse3 fma cx16 pcid sse4_1 sse4_2 x2apic movbe popcnt tsc_deadline_timer aes xsave avx f16c rdrand hypervisor lahf_lm abm 3dnowprefetch invpcid_single pti fsgsbase tsc_adjust bmi1 avx2 smep bmi2 erms invpcid mpx avx512f avx512dq rdseed adx smap clflushopt clwb avx512cd avx512bw avx512vl xsaveopt xsavec xgetbv1 xsaves ida arat pku ospke avx512_vnni 
2025-05-07T19:43:02.6195977Z bugs : cpu_meltdown spectre_v1 spectre_v2 spec_store_bypass l1tf mds swapgs itlb_multihit mmio_stale_data retbleed gds bhi 2025-05-07T19:43:02.6196062Z bogomips : 5999.97 2025-05-07T19:43:02.6196147Z clflush size : 64 2025-05-07T19:43:02.6196254Z cache_alignment : 64 2025-05-07T19:43:02.6196388Z address sizes : 46 bits physical, 48 bits virtual 2025-05-07T19:43:02.6196474Z power management: 2025-05-07T19:43:02.6196479Z 2025-05-07T19:43:02.6196576Z processor : 49 2025-05-07T19:43:02.6196668Z vendor_id : GenuineIntel 2025-05-07T19:43:02.6196750Z cpu family : 6 2025-05-07T19:43:02.6196848Z model : 85 2025-05-07T19:43:02.6197012Z model name : Intel(R) Xeon(R) Platinum 8275CL CPU @ 3.00GHz 2025-05-07T19:43:02.6197095Z stepping : 7 2025-05-07T19:43:02.6197178Z microcode : 0x5003901 2025-05-07T19:43:02.6197271Z cpu MHz : 2999.988 2025-05-07T19:43:02.6197355Z cache size : 36608 KB 2025-05-07T19:43:02.6197502Z physical id : 0 2025-05-07T19:43:02.6197575Z siblings : 48 2025-05-07T19:43:02.6197658Z core id : 1 2025-05-07T19:43:02.6197737Z cpu cores : 24 2025-05-07T19:43:02.6197810Z apicid : 3 2025-05-07T19:43:02.6197896Z initial apicid : 3 2025-05-07T19:43:02.6197970Z fpu : yes 2025-05-07T19:43:02.6198052Z fpu_exception : yes 2025-05-07T19:43:02.6198128Z cpuid level : 13 2025-05-07T19:43:02.6198217Z wp : yes 2025-05-07T19:43:02.6200397Z flags : fpu vme de pse tsc msr pae mce cx8 apic sep mtrr pge mca cmov pat pse36 clflush mmx fxsr sse sse2 ss ht syscall nx pdpe1gb rdtscp lm constant_tsc arch_perfmon rep_good nopl xtopology nonstop_tsc cpuid aperfmperf tsc_known_freq pni pclmulqdq monitor ssse3 fma cx16 pcid sse4_1 sse4_2 x2apic movbe popcnt tsc_deadline_timer aes xsave avx f16c rdrand hypervisor lahf_lm abm 3dnowprefetch invpcid_single pti fsgsbase tsc_adjust bmi1 avx2 smep bmi2 erms invpcid mpx avx512f avx512dq rdseed adx smap clflushopt clwb avx512cd avx512bw avx512vl xsaveopt xsavec xgetbv1 xsaves ida arat pku ospke avx512_vnni 
2025-05-07T19:43:02.6200807Z bugs : cpu_meltdown spectre_v1 spectre_v2 spec_store_bypass l1tf mds swapgs itlb_multihit mmio_stale_data retbleed gds bhi 2025-05-07T19:43:02.6200888Z bogomips : 5999.97 2025-05-07T19:43:02.6200969Z clflush size : 64 2025-05-07T19:43:02.6201051Z cache_alignment : 64 2025-05-07T19:43:02.6201187Z address sizes : 46 bits physical, 48 bits virtual 2025-05-07T19:43:02.6201267Z power management: 2025-05-07T19:43:02.6201272Z 2025-05-07T19:43:02.6201350Z processor : 50 2025-05-07T19:43:02.6201436Z vendor_id : GenuineIntel 2025-05-07T19:43:02.6201510Z cpu family : 6 2025-05-07T19:43:02.6201580Z model : 85 2025-05-07T19:43:02.6201736Z model name : Intel(R) Xeon(R) Platinum 8275CL CPU @ 3.00GHz 2025-05-07T19:43:02.6201815Z stepping : 7 2025-05-07T19:43:02.6201893Z microcode : 0x5003901 2025-05-07T19:43:02.6202727Z cpu MHz : 3247.517 2025-05-07T19:43:02.6202818Z cache size : 36608 KB 2025-05-07T19:43:02.6202897Z physical id : 0 2025-05-07T19:43:02.6202979Z siblings : 48 2025-05-07T19:43:02.6203052Z core id : 2 2025-05-07T19:43:02.6203136Z cpu cores : 24 2025-05-07T19:43:02.6203210Z apicid : 5 2025-05-07T19:43:02.6203291Z initial apicid : 5 2025-05-07T19:43:02.6203372Z fpu : yes 2025-05-07T19:43:02.6203452Z fpu_exception : yes 2025-05-07T19:43:02.6203531Z cpuid level : 13 2025-05-07T19:43:02.6203602Z wp : yes 2025-05-07T19:43:02.6205779Z flags : fpu vme de pse tsc msr pae mce cx8 apic sep mtrr pge mca cmov pat pse36 clflush mmx fxsr sse sse2 ss ht syscall nx pdpe1gb rdtscp lm constant_tsc arch_perfmon rep_good nopl xtopology nonstop_tsc cpuid aperfmperf tsc_known_freq pni pclmulqdq monitor ssse3 fma cx16 pcid sse4_1 sse4_2 x2apic movbe popcnt tsc_deadline_timer aes xsave avx f16c rdrand hypervisor lahf_lm abm 3dnowprefetch invpcid_single pti fsgsbase tsc_adjust bmi1 avx2 smep bmi2 erms invpcid mpx avx512f avx512dq rdseed adx smap clflushopt clwb avx512cd avx512bw avx512vl xsaveopt xsavec xgetbv1 xsaves ida arat pku ospke avx512_vnni 
2025-05-07T19:43:02.6206171Z bugs : cpu_meltdown spectre_v1 spectre_v2 spec_store_bypass l1tf mds swapgs itlb_multihit mmio_stale_data retbleed gds bhi 2025-05-07T19:43:02.6206256Z bogomips : 5999.97 2025-05-07T19:43:02.6206332Z clflush size : 64 2025-05-07T19:43:02.6206412Z cache_alignment : 64 2025-05-07T19:43:02.6206537Z address sizes : 46 bits physical, 48 bits virtual 2025-05-07T19:43:02.6206621Z power management: 2025-05-07T19:43:02.6206626Z 2025-05-07T19:43:02.6206703Z processor : 51 2025-05-07T19:43:02.6206787Z vendor_id : GenuineIntel 2025-05-07T19:43:02.6206869Z cpu family : 6 2025-05-07T19:43:02.6206940Z model : 85 2025-05-07T19:43:02.6207100Z model name : Intel(R) Xeon(R) Platinum 8275CL CPU @ 3.00GHz 2025-05-07T19:43:02.6207180Z stepping : 7 2025-05-07T19:43:02.6207275Z microcode : 0x5003901 2025-05-07T19:43:02.6207352Z cpu MHz : 3270.277 2025-05-07T19:43:02.6207436Z cache size : 36608 KB 2025-05-07T19:43:02.6207529Z physical id : 0 2025-05-07T19:43:02.6207602Z siblings : 48 2025-05-07T19:43:02.6207770Z core id : 3 2025-05-07T19:43:02.6207847Z cpu cores : 24 2025-05-07T19:43:02.6207935Z apicid : 7 2025-05-07T19:43:02.6208015Z initial apicid : 7 2025-05-07T19:43:02.6208091Z fpu : yes 2025-05-07T19:43:02.6208191Z fpu_exception : yes 2025-05-07T19:43:02.6208267Z cpuid level : 13 2025-05-07T19:43:02.6208337Z wp : yes 2025-05-07T19:43:02.6210507Z flags : fpu vme de pse tsc msr pae mce cx8 apic sep mtrr pge mca cmov pat pse36 clflush mmx fxsr sse sse2 ss ht syscall nx pdpe1gb rdtscp lm constant_tsc arch_perfmon rep_good nopl xtopology nonstop_tsc cpuid aperfmperf tsc_known_freq pni pclmulqdq monitor ssse3 fma cx16 pcid sse4_1 sse4_2 x2apic movbe popcnt tsc_deadline_timer aes xsave avx f16c rdrand hypervisor lahf_lm abm 3dnowprefetch invpcid_single pti fsgsbase tsc_adjust bmi1 avx2 smep bmi2 erms invpcid mpx avx512f avx512dq rdseed adx smap clflushopt clwb avx512cd avx512bw avx512vl xsaveopt xsavec xgetbv1 xsaves ida arat pku ospke avx512_vnni 
2025-05-07T19:43:02.6210899Z bugs : cpu_meltdown spectre_v1 spectre_v2 spec_store_bypass l1tf mds swapgs itlb_multihit mmio_stale_data retbleed gds bhi 2025-05-07T19:43:02.6210982Z bogomips : 5999.97 2025-05-07T19:43:02.6211066Z clflush size : 64 2025-05-07T19:43:02.6211153Z cache_alignment : 64 2025-05-07T19:43:02.6211280Z address sizes : 46 bits physical, 48 bits virtual 2025-05-07T19:43:02.6211361Z power management: 2025-05-07T19:43:02.6211366Z 2025-05-07T19:43:02.6211449Z processor : 52 2025-05-07T19:43:02.6211533Z vendor_id : GenuineIntel 2025-05-07T19:43:02.6211609Z cpu family : 6 2025-05-07T19:43:02.6211688Z model : 85 2025-05-07T19:43:02.6211844Z model name : Intel(R) Xeon(R) Platinum 8275CL CPU @ 3.00GHz 2025-05-07T19:43:02.6211922Z stepping : 7 2025-05-07T19:43:02.6211999Z microcode : 0x5003901 2025-05-07T19:43:02.6212081Z cpu MHz : 2999.988 2025-05-07T19:43:02.6212158Z cache size : 36608 KB 2025-05-07T19:43:02.6212298Z physical id : 0 2025-05-07T19:43:02.6212378Z siblings : 48 2025-05-07T19:43:02.6212449Z core id : 4 2025-05-07T19:43:02.6212526Z cpu cores : 24 2025-05-07T19:43:02.6212599Z apicid : 9 2025-05-07T19:43:02.6212691Z initial apicid : 9 2025-05-07T19:43:02.6212768Z fpu : yes 2025-05-07T19:43:02.6212849Z fpu_exception : yes 2025-05-07T19:43:02.6212937Z cpuid level : 13 2025-05-07T19:43:02.6213011Z wp : yes 2025-05-07T19:43:02.6215306Z flags : fpu vme de pse tsc msr pae mce cx8 apic sep mtrr pge mca cmov pat pse36 clflush mmx fxsr sse sse2 ss ht syscall nx pdpe1gb rdtscp lm constant_tsc arch_perfmon rep_good nopl xtopology nonstop_tsc cpuid aperfmperf tsc_known_freq pni pclmulqdq monitor ssse3 fma cx16 pcid sse4_1 sse4_2 x2apic movbe popcnt tsc_deadline_timer aes xsave avx f16c rdrand hypervisor lahf_lm abm 3dnowprefetch invpcid_single pti fsgsbase tsc_adjust bmi1 avx2 smep bmi2 erms invpcid mpx avx512f avx512dq rdseed adx smap clflushopt clwb avx512cd avx512bw avx512vl xsaveopt xsavec xgetbv1 xsaves ida arat pku ospke avx512_vnni 
2025-05-07T19:43:02.6215674Z bugs : cpu_meltdown spectre_v1 spectre_v2 spec_store_bypass l1tf mds swapgs itlb_multihit mmio_stale_data retbleed gds bhi 2025-05-07T19:43:02.6215752Z bogomips : 5999.97 2025-05-07T19:43:02.6215823Z clflush size : 64 2025-05-07T19:43:02.6215906Z cache_alignment : 64 2025-05-07T19:43:02.6216023Z address sizes : 46 bits physical, 48 bits virtual 2025-05-07T19:43:02.6216101Z power management: 2025-05-07T19:43:02.6216105Z 2025-05-07T19:43:02.6216181Z processor : 53 2025-05-07T19:43:02.6216269Z vendor_id : GenuineIntel 2025-05-07T19:43:02.6216343Z cpu family : 6 2025-05-07T19:43:02.6216413Z model : 85 2025-05-07T19:43:02.6216566Z model name : Intel(R) Xeon(R) Platinum 8275CL CPU @ 3.00GHz 2025-05-07T19:43:02.6216642Z stepping : 7 2025-05-07T19:43:02.6216718Z microcode : 0x5003901 2025-05-07T19:43:02.6216789Z cpu MHz : 3245.245 2025-05-07T19:43:02.6216870Z cache size : 36608 KB 2025-05-07T19:43:02.6216944Z physical id : 0 2025-05-07T19:43:02.6217014Z siblings : 48 2025-05-07T19:43:02.6217095Z core id : 5 2025-05-07T19:43:02.6217164Z cpu cores : 24 2025-05-07T19:43:02.6217297Z apicid : 11 2025-05-07T19:43:02.6217376Z initial apicid : 11 2025-05-07T19:43:02.6217455Z fpu : yes 2025-05-07T19:43:02.6217532Z fpu_exception : yes 2025-05-07T19:43:02.6217603Z cpuid level : 13 2025-05-07T19:43:02.6217669Z wp : yes 2025-05-07T19:43:02.6219676Z flags : fpu vme de pse tsc msr pae mce cx8 apic sep mtrr pge mca cmov pat pse36 clflush mmx fxsr sse sse2 ss ht syscall nx pdpe1gb rdtscp lm constant_tsc arch_perfmon rep_good nopl xtopology nonstop_tsc cpuid aperfmperf tsc_known_freq pni pclmulqdq monitor ssse3 fma cx16 pcid sse4_1 sse4_2 x2apic movbe popcnt tsc_deadline_timer aes xsave avx f16c rdrand hypervisor lahf_lm abm 3dnowprefetch invpcid_single pti fsgsbase tsc_adjust bmi1 avx2 smep bmi2 erms invpcid mpx avx512f avx512dq rdseed adx smap clflushopt clwb avx512cd avx512bw avx512vl xsaveopt xsavec xgetbv1 xsaves ida arat pku ospke avx512_vnni 
2025-05-07T19:43:02.6220041Z bugs : cpu_meltdown spectre_v1 spectre_v2 spec_store_bypass l1tf mds swapgs itlb_multihit mmio_stale_data retbleed gds bhi 2025-05-07T19:43:02.6220129Z bogomips : 5999.97 2025-05-07T19:43:02.6220202Z clflush size : 64 2025-05-07T19:43:02.6220282Z cache_alignment : 64 2025-05-07T19:43:02.6220419Z address sizes : 46 bits physical, 48 bits virtual 2025-05-07T19:43:02.6220502Z power management: 2025-05-07T19:43:02.6220506Z 2025-05-07T19:43:02.6220583Z processor : 54 2025-05-07T19:43:02.6220663Z vendor_id : GenuineIntel 2025-05-07T19:43:02.6220743Z cpu family : 6 2025-05-07T19:43:02.6220811Z model : 85 2025-05-07T19:43:02.6220960Z model name : Intel(R) Xeon(R) Platinum 8275CL CPU @ 3.00GHz 2025-05-07T19:43:02.6221037Z stepping : 7 2025-05-07T19:43:02.6221119Z microcode : 0x5003901 2025-05-07T19:43:02.6221194Z cpu MHz : 3311.659 2025-05-07T19:43:02.6221266Z cache size : 36608 KB 2025-05-07T19:43:02.6221353Z physical id : 0 2025-05-07T19:43:02.6221420Z siblings : 48 2025-05-07T19:43:02.6221488Z core id : 6 2025-05-07T19:43:02.6221618Z cpu cores : 24 2025-05-07T19:43:02.6221687Z apicid : 13 2025-05-07T19:43:02.6221759Z initial apicid : 13 2025-05-07T19:43:02.6221831Z fpu : yes 2025-05-07T19:43:02.6221925Z fpu_exception : yes 2025-05-07T19:43:02.6221999Z cpuid level : 13 2025-05-07T19:43:02.6222072Z wp : yes 2025-05-07T19:43:02.6224089Z flags : fpu vme de pse tsc msr pae mce cx8 apic sep mtrr pge mca cmov pat pse36 clflush mmx fxsr sse sse2 ss ht syscall nx pdpe1gb rdtscp lm constant_tsc arch_perfmon rep_good nopl xtopology nonstop_tsc cpuid aperfmperf tsc_known_freq pni pclmulqdq monitor ssse3 fma cx16 pcid sse4_1 sse4_2 x2apic movbe popcnt tsc_deadline_timer aes xsave avx f16c rdrand hypervisor lahf_lm abm 3dnowprefetch invpcid_single pti fsgsbase tsc_adjust bmi1 avx2 smep bmi2 erms invpcid mpx avx512f avx512dq rdseed adx smap clflushopt clwb avx512cd avx512bw avx512vl xsaveopt xsavec xgetbv1 xsaves ida arat pku ospke avx512_vnni 
2025-05-07T19:43:02.6224455Z bugs : cpu_meltdown spectre_v1 spectre_v2 spec_store_bypass l1tf mds swapgs itlb_multihit mmio_stale_data retbleed gds bhi 2025-05-07T19:43:02.6224534Z bogomips : 5999.97 2025-05-07T19:43:02.6224631Z clflush size : 64 2025-05-07T19:43:02.6224715Z cache_alignment : 64 2025-05-07T19:43:02.6224840Z address sizes : 46 bits physical, 48 bits virtual 2025-05-07T19:43:02.6224920Z power management: 2025-05-07T19:43:02.6224930Z 2025-05-07T19:43:02.6225002Z processor : 55 2025-05-07T19:43:02.6225079Z vendor_id : GenuineIntel 2025-05-07T19:43:02.6225151Z cpu family : 6 2025-05-07T19:43:02.6225230Z model : 85 2025-05-07T19:43:02.6225376Z model name : Intel(R) Xeon(R) Platinum 8275CL CPU @ 3.00GHz 2025-05-07T19:43:02.6225446Z stepping : 7 2025-05-07T19:43:02.6225532Z microcode : 0x5003901 2025-05-07T19:43:02.6225602Z cpu MHz : 3265.960 2025-05-07T19:43:02.6225675Z cache size : 36608 KB 2025-05-07T19:43:02.6225755Z physical id : 0 2025-05-07T19:43:02.6225834Z siblings : 48 2025-05-07T19:43:02.6225904Z core id : 7 2025-05-07T19:43:02.6225983Z cpu cores : 24 2025-05-07T19:43:02.6226062Z apicid : 15 2025-05-07T19:43:02.6226148Z initial apicid : 15 2025-05-07T19:43:02.6226219Z fpu : yes 2025-05-07T19:43:02.6226360Z fpu_exception : yes 2025-05-07T19:43:02.6226446Z cpuid level : 13 2025-05-07T19:43:02.6226512Z wp : yes 2025-05-07T19:43:02.6228543Z flags : fpu vme de pse tsc msr pae mce cx8 apic sep mtrr pge mca cmov pat pse36 clflush mmx fxsr sse sse2 ss ht syscall nx pdpe1gb rdtscp lm constant_tsc arch_perfmon rep_good nopl xtopology nonstop_tsc cpuid aperfmperf tsc_known_freq pni pclmulqdq monitor ssse3 fma cx16 pcid sse4_1 sse4_2 x2apic movbe popcnt tsc_deadline_timer aes xsave avx f16c rdrand hypervisor lahf_lm abm 3dnowprefetch invpcid_single pti fsgsbase tsc_adjust bmi1 avx2 smep bmi2 erms invpcid mpx avx512f avx512dq rdseed adx smap clflushopt clwb avx512cd avx512bw avx512vl xsaveopt xsavec xgetbv1 xsaves ida arat pku ospke avx512_vnni 
2025-05-07T19:43:02.6228918Z bugs : cpu_meltdown spectre_v1 spectre_v2 spec_store_bypass l1tf mds swapgs itlb_multihit mmio_stale_data retbleed gds bhi 2025-05-07T19:43:02.6228992Z bogomips : 5999.97 2025-05-07T19:43:02.6229065Z clflush size : 64 2025-05-07T19:43:02.6229161Z cache_alignment : 64 2025-05-07T19:43:02.6229282Z address sizes : 46 bits physical, 48 bits virtual 2025-05-07T19:43:02.6229358Z power management: 2025-05-07T19:43:02.6229362Z 2025-05-07T19:43:02.6229453Z processor : 56 2025-05-07T19:43:02.6229534Z vendor_id : GenuineIntel 2025-05-07T19:43:02.6229606Z cpu family : 6 2025-05-07T19:43:02.6229674Z model : 85 2025-05-07T19:43:02.6229835Z model name : Intel(R) Xeon(R) Platinum 8275CL CPU @ 3.00GHz 2025-05-07T19:43:02.6229909Z stepping : 7 2025-05-07T19:43:02.6229980Z microcode : 0x5003901 2025-05-07T19:43:02.6230051Z cpu MHz : 3261.583 2025-05-07T19:43:02.6230130Z cache size : 36608 KB 2025-05-07T19:43:02.6230202Z physical id : 0 2025-05-07T19:43:02.6230270Z siblings : 48 2025-05-07T19:43:02.6230343Z core id : 8 2025-05-07T19:43:02.6230412Z cpu cores : 24 2025-05-07T19:43:02.6230480Z apicid : 17 2025-05-07T19:43:02.6230601Z initial apicid : 17 2025-05-07T19:43:02.6230675Z fpu : yes 2025-05-07T19:43:02.6230748Z fpu_exception : yes 2025-05-07T19:43:02.6230820Z cpuid level : 13 2025-05-07T19:43:02.6230903Z wp : yes 2025-05-07T19:43:02.6233009Z flags : fpu vme de pse tsc msr pae mce cx8 apic sep mtrr pge mca cmov pat pse36 clflush mmx fxsr sse sse2 ss ht syscall nx pdpe1gb rdtscp lm constant_tsc arch_perfmon rep_good nopl xtopology nonstop_tsc cpuid aperfmperf tsc_known_freq pni pclmulqdq monitor ssse3 fma cx16 pcid sse4_1 sse4_2 x2apic movbe popcnt tsc_deadline_timer aes xsave avx f16c rdrand hypervisor lahf_lm abm 3dnowprefetch invpcid_single pti fsgsbase tsc_adjust bmi1 avx2 smep bmi2 erms invpcid mpx avx512f avx512dq rdseed adx smap clflushopt clwb avx512cd avx512bw avx512vl xsaveopt xsavec xgetbv1 xsaves ida arat pku ospke avx512_vnni 
2025-05-07T19:43:02.6233560Z bugs : cpu_meltdown spectre_v1 spectre_v2 spec_store_bypass l1tf mds swapgs itlb_multihit mmio_stale_data retbleed gds bhi 2025-05-07T19:43:02.6233649Z bogomips : 5999.97 2025-05-07T19:43:02.6233733Z clflush size : 64 2025-05-07T19:43:02.6233816Z cache_alignment : 64 2025-05-07T19:43:02.6233957Z address sizes : 46 bits physical, 48 bits virtual 2025-05-07T19:43:02.6234128Z power management: 2025-05-07T19:43:02.6234133Z 2025-05-07T19:43:02.6234215Z processor : 57 2025-05-07T19:43:02.6234300Z vendor_id : GenuineIntel 2025-05-07T19:43:02.6234388Z cpu family : 6 2025-05-07T19:43:02.6234465Z model : 85 2025-05-07T19:43:02.6234625Z model name : Intel(R) Xeon(R) Platinum 8275CL CPU @ 3.00GHz 2025-05-07T19:43:02.6234707Z stepping : 7 2025-05-07T19:43:02.6234789Z microcode : 0x5003901 2025-05-07T19:43:02.6234869Z cpu MHz : 3276.223 2025-05-07T19:43:02.6234946Z cache size : 36608 KB 2025-05-07T19:43:02.6235034Z physical id : 0 2025-05-07T19:43:02.6235110Z siblings : 48 2025-05-07T19:43:02.6235183Z core id : 9 2025-05-07T19:43:02.6235260Z cpu cores : 24 2025-05-07T19:43:02.6235332Z apicid : 19 2025-05-07T19:43:02.6235411Z initial apicid : 19 2025-05-07T19:43:02.6235486Z fpu : yes 2025-05-07T19:43:02.6235578Z fpu_exception : yes 2025-05-07T19:43:02.6235654Z cpuid level : 13 2025-05-07T19:43:02.6235791Z wp : yes 2025-05-07T19:43:02.6237974Z flags : fpu vme de pse tsc msr pae mce cx8 apic sep mtrr pge mca cmov pat pse36 clflush mmx fxsr sse sse2 ss ht syscall nx pdpe1gb rdtscp lm constant_tsc arch_perfmon rep_good nopl xtopology nonstop_tsc cpuid aperfmperf tsc_known_freq pni pclmulqdq monitor ssse3 fma cx16 pcid sse4_1 sse4_2 x2apic movbe popcnt tsc_deadline_timer aes xsave avx f16c rdrand hypervisor lahf_lm abm 3dnowprefetch invpcid_single pti fsgsbase tsc_adjust bmi1 avx2 smep bmi2 erms invpcid mpx avx512f avx512dq rdseed adx smap clflushopt clwb avx512cd avx512bw avx512vl xsaveopt xsavec xgetbv1 xsaves ida arat pku ospke avx512_vnni 
2025-05-07T19:43:02.6238362Z bugs : cpu_meltdown spectre_v1 spectre_v2 spec_store_bypass l1tf mds swapgs itlb_multihit mmio_stale_data retbleed gds bhi 2025-05-07T19:43:02.6238444Z bogomips : 5999.97 2025-05-07T19:43:02.6238530Z clflush size : 64 2025-05-07T19:43:02.6238615Z cache_alignment : 64 2025-05-07T19:43:02.6238743Z address sizes : 46 bits physical, 48 bits virtual 2025-05-07T19:43:02.6238840Z power management: 2025-05-07T19:43:02.6238844Z 2025-05-07T19:43:02.6238921Z processor : 58 2025-05-07T19:43:02.6239007Z vendor_id : GenuineIntel 2025-05-07T19:43:02.6239084Z cpu family : 6 2025-05-07T19:43:02.6239166Z model : 85 2025-05-07T19:43:02.6239324Z model name : Intel(R) Xeon(R) Platinum 8275CL CPU @ 3.00GHz 2025-05-07T19:43:02.6239401Z stepping : 7 2025-05-07T19:43:02.6239493Z microcode : 0x5003901 2025-05-07T19:43:02.6239569Z cpu MHz : 2999.988 2025-05-07T19:43:02.6239646Z cache size : 36608 KB 2025-05-07T19:43:02.6239723Z physical id : 0 2025-05-07T19:43:02.6239805Z siblings : 48 2025-05-07T19:43:02.6239876Z core id : 10 2025-05-07T19:43:02.6239949Z cpu cores : 24 2025-05-07T19:43:02.6240030Z apicid : 21 2025-05-07T19:43:02.6240108Z initial apicid : 21 2025-05-07T19:43:02.6240180Z fpu : yes 2025-05-07T19:43:02.6240260Z fpu_exception : yes 2025-05-07T19:43:02.6240392Z cpuid level : 13 2025-05-07T19:43:02.6240463Z wp : yes 2025-05-07T19:43:02.6242623Z flags : fpu vme de pse tsc msr pae mce cx8 apic sep mtrr pge mca cmov pat pse36 clflush mmx fxsr sse sse2 ss ht syscall nx pdpe1gb rdtscp lm constant_tsc arch_perfmon rep_good nopl xtopology nonstop_tsc cpuid aperfmperf tsc_known_freq pni pclmulqdq monitor ssse3 fma cx16 pcid sse4_1 sse4_2 x2apic movbe popcnt tsc_deadline_timer aes xsave avx f16c rdrand hypervisor lahf_lm abm 3dnowprefetch invpcid_single pti fsgsbase tsc_adjust bmi1 avx2 smep bmi2 erms invpcid mpx avx512f avx512dq rdseed adx smap clflushopt clwb avx512cd avx512bw avx512vl xsaveopt xsavec xgetbv1 xsaves ida arat pku ospke avx512_vnni 
2025-05-07T19:43:02.6243018Z bugs : cpu_meltdown spectre_v1 spectre_v2 spec_store_bypass l1tf mds swapgs itlb_multihit mmio_stale_data retbleed gds bhi 2025-05-07T19:43:02.6243095Z bogomips : 5999.97 2025-05-07T19:43:02.6243174Z clflush size : 64 2025-05-07T19:43:02.6243264Z cache_alignment : 64 2025-05-07T19:43:02.6243394Z address sizes : 46 bits physical, 48 bits virtual 2025-05-07T19:43:02.6243473Z power management: 2025-05-07T19:43:02.6243481Z 2025-05-07T19:43:02.6243567Z processor : 59 2025-05-07T19:43:02.6243652Z vendor_id : GenuineIntel 2025-05-07T19:43:02.6243730Z cpu family : 6 2025-05-07T19:43:02.6243800Z model : 85 2025-05-07T19:43:02.6243964Z model name : Intel(R) Xeon(R) Platinum 8275CL CPU @ 3.00GHz 2025-05-07T19:43:02.6244040Z stepping : 7 2025-05-07T19:43:02.6244119Z microcode : 0x5003901 2025-05-07T19:43:02.6244202Z cpu MHz : 3309.180 2025-05-07T19:43:02.6244282Z cache size : 36608 KB 2025-05-07T19:43:02.6244360Z physical id : 0 2025-05-07T19:43:02.6244436Z siblings : 48 2025-05-07T19:43:02.6244523Z core id : 11 2025-05-07T19:43:02.6244600Z cpu cores : 24 2025-05-07T19:43:02.6244674Z apicid : 23 2025-05-07T19:43:02.6244763Z initial apicid : 23 2025-05-07T19:43:02.6244837Z fpu : yes 2025-05-07T19:43:02.6244915Z fpu_exception : yes 2025-05-07T19:43:02.6244991Z cpuid level : 13 2025-05-07T19:43:02.6245076Z wp : yes 2025-05-07T19:43:02.6247375Z flags : fpu vme de pse tsc msr pae mce cx8 apic sep mtrr pge mca cmov pat pse36 clflush mmx fxsr sse sse2 ss ht syscall nx pdpe1gb rdtscp lm constant_tsc arch_perfmon rep_good nopl xtopology nonstop_tsc cpuid aperfmperf tsc_known_freq pni pclmulqdq monitor ssse3 fma cx16 pcid sse4_1 sse4_2 x2apic movbe popcnt tsc_deadline_timer aes xsave avx f16c rdrand hypervisor lahf_lm abm 3dnowprefetch invpcid_single pti fsgsbase tsc_adjust bmi1 avx2 smep bmi2 erms invpcid mpx avx512f avx512dq rdseed adx smap clflushopt clwb avx512cd avx512bw avx512vl xsaveopt xsavec xgetbv1 xsaves ida arat pku ospke avx512_vnni 
2025-05-07T19:43:02.6247796Z bugs : cpu_meltdown spectre_v1 spectre_v2 spec_store_bypass l1tf mds swapgs itlb_multihit mmio_stale_data retbleed gds bhi 2025-05-07T19:43:02.6247869Z bogomips : 5999.97 2025-05-07T19:43:02.6247943Z clflush size : 64 2025-05-07T19:43:02.6248018Z cache_alignment : 64 2025-05-07T19:43:02.6248144Z address sizes : 46 bits physical, 48 bits virtual 2025-05-07T19:43:02.6248217Z power management: 2025-05-07T19:43:02.6248221Z 2025-05-07T19:43:02.6248298Z processor : 60 2025-05-07T19:43:02.6248386Z vendor_id : GenuineIntel 2025-05-07T19:43:02.6248457Z cpu family : 6 2025-05-07T19:43:02.6248527Z model : 85 2025-05-07T19:43:02.6248675Z model name : Intel(R) Xeon(R) Platinum 8275CL CPU @ 3.00GHz 2025-05-07T19:43:02.6248748Z stepping : 7 2025-05-07T19:43:02.6248821Z microcode : 0x5003901 2025-05-07T19:43:02.6248893Z cpu MHz : 3321.182 2025-05-07T19:43:02.6248974Z cache size : 36608 KB 2025-05-07T19:43:02.6249044Z physical id : 0 2025-05-07T19:43:02.6249113Z siblings : 48 2025-05-07T19:43:02.6249181Z core id : 12 2025-05-07T19:43:02.6249256Z cpu cores : 24 2025-05-07T19:43:02.6249324Z apicid : 25 2025-05-07T19:43:02.6249397Z initial apicid : 25 2025-05-07T19:43:02.6249463Z fpu : yes 2025-05-07T19:43:02.6249545Z fpu_exception : yes 2025-05-07T19:43:02.6249615Z cpuid level : 13 2025-05-07T19:43:02.6249683Z wp : yes 2025-05-07T19:43:02.6251736Z flags : fpu vme de pse tsc msr pae mce cx8 apic sep mtrr pge mca cmov pat pse36 clflush mmx fxsr sse sse2 ss ht syscall nx pdpe1gb rdtscp lm constant_tsc arch_perfmon rep_good nopl xtopology nonstop_tsc cpuid aperfmperf tsc_known_freq pni pclmulqdq monitor ssse3 fma cx16 pcid sse4_1 sse4_2 x2apic movbe popcnt tsc_deadline_timer aes xsave avx f16c rdrand hypervisor lahf_lm abm 3dnowprefetch invpcid_single pti fsgsbase tsc_adjust bmi1 avx2 smep bmi2 erms invpcid mpx avx512f avx512dq rdseed adx smap clflushopt clwb avx512cd avx512bw avx512vl xsaveopt xsavec xgetbv1 xsaves ida arat pku ospke avx512_vnni 
2025-05-07T19:43:02.6252102Z bugs : cpu_meltdown spectre_v1 spectre_v2 spec_store_bypass l1tf mds swapgs itlb_multihit mmio_stale_data retbleed gds bhi 2025-05-07T19:43:02.6252187Z bogomips : 5999.97 2025-05-07T19:43:02.6252261Z clflush size : 64 2025-05-07T19:43:02.6252336Z cache_alignment : 64 2025-05-07T19:43:02.6252464Z address sizes : 46 bits physical, 48 bits virtual 2025-05-07T19:43:02.6252551Z power management: 2025-05-07T19:43:02.6252555Z 2025-05-07T19:43:02.6252627Z processor : 61 2025-05-07T19:43:02.6252711Z vendor_id : GenuineIntel 2025-05-07T19:43:02.6252791Z cpu family : 6 2025-05-07T19:43:02.6252857Z model : 85 2025-05-07T19:43:02.6253005Z model name : Intel(R) Xeon(R) Platinum 8275CL CPU @ 3.00GHz 2025-05-07T19:43:02.6253077Z stepping : 7 2025-05-07T19:43:02.6253162Z microcode : 0x5003901 2025-05-07T19:43:02.6253232Z cpu MHz : 3253.053 2025-05-07T19:43:02.6253308Z cache size : 36608 KB 2025-05-07T19:43:02.6253391Z physical id : 0 2025-05-07T19:43:02.6253459Z siblings : 48 2025-05-07T19:43:02.6253531Z core id : 13 2025-05-07T19:43:02.6253601Z cpu cores : 24 2025-05-07T19:43:02.6253682Z apicid : 27 2025-05-07T19:43:02.6253755Z initial apicid : 27 2025-05-07T19:43:02.6253825Z fpu : yes 2025-05-07T19:43:02.6253900Z fpu_exception : yes 2025-05-07T19:43:02.6253985Z cpuid level : 13 2025-05-07T19:43:02.6254052Z wp : yes 2025-05-07T19:43:02.6256061Z flags : fpu vme de pse tsc msr pae mce cx8 apic sep mtrr pge mca cmov pat pse36 clflush mmx fxsr sse sse2 ss ht syscall nx pdpe1gb rdtscp lm constant_tsc arch_perfmon rep_good nopl xtopology nonstop_tsc cpuid aperfmperf tsc_known_freq pni pclmulqdq monitor ssse3 fma cx16 pcid sse4_1 sse4_2 x2apic movbe popcnt tsc_deadline_timer aes xsave avx f16c rdrand hypervisor lahf_lm abm 3dnowprefetch invpcid_single pti fsgsbase tsc_adjust bmi1 avx2 smep bmi2 erms invpcid mpx avx512f avx512dq rdseed adx smap clflushopt clwb avx512cd avx512bw avx512vl xsaveopt xsavec xgetbv1 xsaves ida arat pku ospke avx512_vnni 
2025-05-07T19:43:02.6256480Z bugs : cpu_meltdown spectre_v1 spectre_v2 spec_store_bypass l1tf mds swapgs itlb_multihit mmio_stale_data retbleed gds bhi 2025-05-07T19:43:02.6256557Z bogomips : 5999.97 2025-05-07T19:43:02.6256629Z clflush size : 64 2025-05-07T19:43:02.6256716Z cache_alignment : 64 2025-05-07T19:43:02.6256834Z address sizes : 46 bits physical, 48 bits virtual 2025-05-07T19:43:02.6256909Z power management: 2025-05-07T19:43:02.6256913Z 2025-05-07T19:43:02.6256998Z processor : 62 2025-05-07T19:43:02.6257078Z vendor_id : GenuineIntel 2025-05-07T19:43:02.6257151Z cpu family : 6 2025-05-07T19:43:02.6257217Z model : 85 2025-05-07T19:43:02.6257369Z model name : Intel(R) Xeon(R) Platinum 8275CL CPU @ 3.00GHz 2025-05-07T19:43:02.6257439Z stepping : 7 2025-05-07T19:43:02.6257512Z microcode : 0x5003901 2025-05-07T19:43:02.6257589Z cpu MHz : 3255.370 2025-05-07T19:43:02.6257661Z cache size : 36608 KB 2025-05-07T19:43:02.6257734Z physical id : 0 2025-05-07T19:43:02.6257803Z siblings : 48 2025-05-07T19:43:02.6257879Z core id : 14 2025-05-07T19:43:02.6257948Z cpu cores : 24 2025-05-07T19:43:02.6258016Z apicid : 29 2025-05-07T19:43:02.6258100Z initial apicid : 29 2025-05-07T19:43:02.6258170Z fpu : yes 2025-05-07T19:43:02.6258245Z fpu_exception : yes 2025-05-07T19:43:02.6258320Z cpuid level : 13 2025-05-07T19:43:02.6258397Z wp : yes 2025-05-07T19:43:02.6260477Z flags : fpu vme de pse tsc msr pae mce cx8 apic sep mtrr pge mca cmov pat pse36 clflush mmx fxsr sse sse2 ss ht syscall nx pdpe1gb rdtscp lm constant_tsc arch_perfmon rep_good nopl xtopology nonstop_tsc cpuid aperfmperf tsc_known_freq pni pclmulqdq monitor ssse3 fma cx16 pcid sse4_1 sse4_2 x2apic movbe popcnt tsc_deadline_timer aes xsave avx f16c rdrand hypervisor lahf_lm abm 3dnowprefetch invpcid_single pti fsgsbase tsc_adjust bmi1 avx2 smep bmi2 erms invpcid mpx avx512f avx512dq rdseed adx smap clflushopt clwb avx512cd avx512bw avx512vl xsaveopt xsavec xgetbv1 xsaves ida arat pku ospke avx512_vnni 
2025-05-07T19:43:02.6260848Z bugs : cpu_meltdown spectre_v1 spectre_v2 spec_store_bypass l1tf mds swapgs itlb_multihit mmio_stale_data retbleed gds bhi 2025-05-07T19:43:02.6260921Z bogomips : 5999.97 2025-05-07T19:43:02.6260995Z clflush size : 64 2025-05-07T19:43:02.6261071Z cache_alignment : 64 2025-05-07T19:43:02.6261195Z address sizes : 46 bits physical, 48 bits virtual 2025-05-07T19:43:02.6261269Z power management: 2025-05-07T19:43:02.6261273Z 2025-05-07T19:43:02.6261345Z processor : 63 2025-05-07T19:43:02.6261433Z vendor_id : GenuineIntel 2025-05-07T19:43:02.6261502Z cpu family : 6 2025-05-07T19:43:02.6261573Z model : 85 2025-05-07T19:43:02.6261717Z model name : Intel(R) Xeon(R) Platinum 8275CL CPU @ 3.00GHz 2025-05-07T19:43:02.6261797Z stepping : 7 2025-05-07T19:43:02.6261872Z microcode : 0x5003901 2025-05-07T19:43:02.6261943Z cpu MHz : 2999.988 2025-05-07T19:43:02.6262022Z cache size : 36608 KB 2025-05-07T19:43:02.6262093Z physical id : 0 2025-05-07T19:43:02.6262162Z siblings : 48 2025-05-07T19:43:02.6262229Z core id : 15 2025-05-07T19:43:02.6262303Z cpu cores : 24 2025-05-07T19:43:02.6262371Z apicid : 31 2025-05-07T19:43:02.6262446Z initial apicid : 31 2025-05-07T19:43:02.6262519Z fpu : yes 2025-05-07T19:43:02.6262594Z fpu_exception : yes 2025-05-07T19:43:02.6262665Z cpuid level : 13 2025-05-07T19:43:02.6262733Z wp : yes 2025-05-07T19:43:02.6264760Z flags : fpu vme de pse tsc msr pae mce cx8 apic sep mtrr pge mca cmov pat pse36 clflush mmx fxsr sse sse2 ss ht syscall nx pdpe1gb rdtscp lm constant_tsc arch_perfmon rep_good nopl xtopology nonstop_tsc cpuid aperfmperf tsc_known_freq pni pclmulqdq monitor ssse3 fma cx16 pcid sse4_1 sse4_2 x2apic movbe popcnt tsc_deadline_timer aes xsave avx f16c rdrand hypervisor lahf_lm abm 3dnowprefetch invpcid_single pti fsgsbase tsc_adjust bmi1 avx2 smep bmi2 erms invpcid mpx avx512f avx512dq rdseed adx smap clflushopt clwb avx512cd avx512bw avx512vl xsaveopt xsavec xgetbv1 xsaves ida arat pku ospke avx512_vnni 
2025-05-07T19:43:02.6265166Z bugs : cpu_meltdown spectre_v1 spectre_v2 spec_store_bypass l1tf mds swapgs itlb_multihit mmio_stale_data retbleed gds bhi 2025-05-07T19:43:02.6265248Z bogomips : 5999.97 2025-05-07T19:43:02.6265318Z clflush size : 64 2025-05-07T19:43:02.6265393Z cache_alignment : 64 2025-05-07T19:43:02.6265508Z address sizes : 46 bits physical, 48 bits virtual 2025-05-07T19:43:02.6265591Z power management: 2025-05-07T19:43:02.6265595Z 2025-05-07T19:43:02.6265665Z processor : 64 2025-05-07T19:43:02.6265746Z vendor_id : GenuineIntel 2025-05-07T19:43:02.6265828Z cpu family : 6 2025-05-07T19:43:02.6265897Z model : 85 2025-05-07T19:43:02.6266041Z model name : Intel(R) Xeon(R) Platinum 8275CL CPU @ 3.00GHz 2025-05-07T19:43:02.6266116Z stepping : 7 2025-05-07T19:43:02.6266198Z microcode : 0x5003901 2025-05-07T19:43:02.6266273Z cpu MHz : 2999.988 2025-05-07T19:43:02.6266346Z cache size : 36608 KB 2025-05-07T19:43:02.6266422Z physical id : 0 2025-05-07T19:43:02.6266491Z siblings : 48 2025-05-07T19:43:02.6266557Z core id : 16 2025-05-07T19:43:02.6266625Z cpu cores : 24 2025-05-07T19:43:02.6266702Z apicid : 33 2025-05-07T19:43:02.6266776Z initial apicid : 33 2025-05-07T19:43:02.6266847Z fpu : yes 2025-05-07T19:43:02.6266932Z fpu_exception : yes 2025-05-07T19:43:02.6267005Z cpuid level : 13 2025-05-07T19:43:02.6267072Z wp : yes 2025-05-07T19:43:02.6269142Z flags : fpu vme de pse tsc msr pae mce cx8 apic sep mtrr pge mca cmov pat pse36 clflush mmx fxsr sse sse2 ss ht syscall nx pdpe1gb rdtscp lm constant_tsc arch_perfmon rep_good nopl xtopology nonstop_tsc cpuid aperfmperf tsc_known_freq pni pclmulqdq monitor ssse3 fma cx16 pcid sse4_1 sse4_2 x2apic movbe popcnt tsc_deadline_timer aes xsave avx f16c rdrand hypervisor lahf_lm abm 3dnowprefetch invpcid_single pti fsgsbase tsc_adjust bmi1 avx2 smep bmi2 erms invpcid mpx avx512f avx512dq rdseed adx smap clflushopt clwb avx512cd avx512bw avx512vl xsaveopt xsavec xgetbv1 xsaves ida arat pku ospke avx512_vnni 
2025-05-07T19:43:02.6269505Z bugs : cpu_meltdown spectre_v1 spectre_v2 spec_store_bypass l1tf mds swapgs itlb_multihit mmio_stale_data retbleed gds bhi 2025-05-07T19:43:02.6269579Z bogomips : 5999.97 2025-05-07T19:43:02.6269659Z clflush size : 64 2025-05-07T19:43:02.6269735Z cache_alignment : 64 2025-05-07T19:43:02.6269852Z address sizes : 46 bits physical, 48 bits virtual 2025-05-07T19:43:02.6269927Z power management: 2025-05-07T19:43:02.6269932Z 2025-05-07T19:43:02.6270007Z processor : 65 2025-05-07T19:43:02.6270087Z vendor_id : GenuineIntel 2025-05-07T19:43:02.6270158Z cpu family : 6 2025-05-07T19:43:02.6270232Z model : 85 2025-05-07T19:43:02.6270374Z model name : Intel(R) Xeon(R) Platinum 8275CL CPU @ 3.00GHz 2025-05-07T19:43:02.6270448Z stepping : 7 2025-05-07T19:43:02.6270525Z microcode : 0x5003901 2025-05-07T19:43:02.6270600Z cpu MHz : 2999.988 2025-05-07T19:43:02.6270674Z cache size : 36608 KB 2025-05-07T19:43:02.6270747Z physical id : 0 2025-05-07T19:43:02.6270823Z siblings : 48 2025-05-07T19:43:02.6270890Z core id : 17 2025-05-07T19:43:02.6270959Z cpu cores : 24 2025-05-07T19:43:02.6271028Z apicid : 35 2025-05-07T19:43:02.6271113Z initial apicid : 35 2025-05-07T19:43:02.6271181Z fpu : yes 2025-05-07T19:43:02.6271255Z fpu_exception : yes 2025-05-07T19:43:02.6271333Z cpuid level : 13 2025-05-07T19:43:02.6271399Z wp : yes 2025-05-07T19:43:02.6273692Z flags : fpu vme de pse tsc msr pae mce cx8 apic sep mtrr pge mca cmov pat pse36 clflush mmx fxsr sse sse2 ss ht syscall nx pdpe1gb rdtscp lm constant_tsc arch_perfmon rep_good nopl xtopology nonstop_tsc cpuid aperfmperf tsc_known_freq pni pclmulqdq monitor ssse3 fma cx16 pcid sse4_1 sse4_2 x2apic movbe popcnt tsc_deadline_timer aes xsave avx f16c rdrand hypervisor lahf_lm abm 3dnowprefetch invpcid_single pti fsgsbase tsc_adjust bmi1 avx2 smep bmi2 erms invpcid mpx avx512f avx512dq rdseed adx smap clflushopt clwb avx512cd avx512bw avx512vl xsaveopt xsavec xgetbv1 xsaves ida arat pku ospke avx512_vnni 
2025-05-07T19:43:02.6274150Z bugs : cpu_meltdown spectre_v1 spectre_v2 spec_store_bypass l1tf mds swapgs itlb_multihit mmio_stale_data retbleed gds bhi 2025-05-07T19:43:02.6274227Z bogomips : 5999.97 2025-05-07T19:43:02.6274302Z clflush size : 64 2025-05-07T19:43:02.6274392Z cache_alignment : 64 2025-05-07T19:43:02.6274519Z address sizes : 46 bits physical, 48 bits virtual 2025-05-07T19:43:02.6274600Z power management: 2025-05-07T19:43:02.6274604Z 2025-05-07T19:43:02.6274680Z processor : 66 2025-05-07T19:43:02.6274772Z vendor_id : GenuineIntel 2025-05-07T19:43:02.6274847Z cpu family : 6 2025-05-07T19:43:02.6274918Z model : 85 2025-05-07T19:43:02.6275086Z model name : Intel(R) Xeon(R) Platinum 8275CL CPU @ 3.00GHz 2025-05-07T19:43:02.6275161Z stepping : 7 2025-05-07T19:43:02.6275243Z microcode : 0x5003901 2025-05-07T19:43:02.6275320Z cpu MHz : 2999.988 2025-05-07T19:43:02.6275409Z cache size : 36608 KB 2025-05-07T19:43:02.6275486Z physical id : 0 2025-05-07T19:43:02.6275559Z siblings : 48 2025-05-07T19:43:02.6275640Z core id : 18 2025-05-07T19:43:02.6275714Z cpu cores : 24 2025-05-07T19:43:02.6275786Z apicid : 37 2025-05-07T19:43:02.6275864Z initial apicid : 37 2025-05-07T19:43:02.6275941Z fpu : yes 2025-05-07T19:43:02.6276020Z fpu_exception : yes 2025-05-07T19:43:02.6276093Z cpuid level : 13 2025-05-07T19:43:02.6276162Z wp : yes 2025-05-07T19:43:02.6278371Z flags : fpu vme de pse tsc msr pae mce cx8 apic sep mtrr pge mca cmov pat pse36 clflush mmx fxsr sse sse2 ss ht syscall nx pdpe1gb rdtscp lm constant_tsc arch_perfmon rep_good nopl xtopology nonstop_tsc cpuid aperfmperf tsc_known_freq pni pclmulqdq monitor ssse3 fma cx16 pcid sse4_1 sse4_2 x2apic movbe popcnt tsc_deadline_timer aes xsave avx f16c rdrand hypervisor lahf_lm abm 3dnowprefetch invpcid_single pti fsgsbase tsc_adjust bmi1 avx2 smep bmi2 erms invpcid mpx avx512f avx512dq rdseed adx smap clflushopt clwb avx512cd avx512bw avx512vl xsaveopt xsavec xgetbv1 xsaves ida arat pku ospke avx512_vnni 
2025-05-07T19:43:02.6278762Z bugs : cpu_meltdown spectre_v1 spectre_v2 spec_store_bypass l1tf mds swapgs itlb_multihit mmio_stale_data retbleed gds bhi 2025-05-07T19:43:02.6278847Z bogomips : 5999.97 2025-05-07T19:43:02.6278923Z clflush size : 64 2025-05-07T19:43:02.6279002Z cache_alignment : 64 2025-05-07T19:43:02.6279136Z address sizes : 46 bits physical, 48 bits virtual 2025-05-07T19:43:02.6279219Z power management: 2025-05-07T19:43:02.6279223Z 2025-05-07T19:43:02.6279302Z processor : 67 2025-05-07T19:43:02.6279386Z vendor_id : GenuineIntel 2025-05-07T19:43:02.6279467Z cpu family : 6 2025-05-07T19:43:02.6279540Z model : 85 2025-05-07T19:43:02.6279699Z model name : Intel(R) Xeon(R) Platinum 8275CL CPU @ 3.00GHz 2025-05-07T19:43:02.6279805Z stepping : 7 2025-05-07T19:43:02.6279884Z microcode : 0x5003901 2025-05-07T19:43:02.6279965Z cpu MHz : 3256.790 2025-05-07T19:43:02.6280042Z cache size : 36608 KB 2025-05-07T19:43:02.6280124Z physical id : 0 2025-05-07T19:43:02.6280202Z siblings : 48 2025-05-07T19:43:02.6280276Z core id : 19 2025-05-07T19:43:02.6280357Z cpu cores : 24 2025-05-07T19:43:02.6280428Z apicid : 39 2025-05-07T19:43:02.6280517Z initial apicid : 39 2025-05-07T19:43:02.6280589Z fpu : yes 2025-05-07T19:43:02.6280677Z fpu_exception : yes 2025-05-07T19:43:02.6280751Z cpuid level : 13 2025-05-07T19:43:02.6280824Z wp : yes 2025-05-07T19:43:02.6283013Z flags : fpu vme de pse tsc msr pae mce cx8 apic sep mtrr pge mca cmov pat pse36 clflush mmx fxsr sse sse2 ss ht syscall nx pdpe1gb rdtscp lm constant_tsc arch_perfmon rep_good nopl xtopology nonstop_tsc cpuid aperfmperf tsc_known_freq pni pclmulqdq monitor ssse3 fma cx16 pcid sse4_1 sse4_2 x2apic movbe popcnt tsc_deadline_timer aes xsave avx f16c rdrand hypervisor lahf_lm abm 3dnowprefetch invpcid_single pti fsgsbase tsc_adjust bmi1 avx2 smep bmi2 erms invpcid mpx avx512f avx512dq rdseed adx smap clflushopt clwb avx512cd avx512bw avx512vl xsaveopt xsavec xgetbv1 xsaves ida arat pku ospke avx512_vnni 
2025-05-07T19:43:02.6283451Z bugs : cpu_meltdown spectre_v1 spectre_v2 spec_store_bypass l1tf mds swapgs itlb_multihit mmio_stale_data retbleed gds bhi 2025-05-07T19:43:02.6283526Z bogomips : 5999.97 2025-05-07T19:43:02.6283608Z clflush size : 64 2025-05-07T19:43:02.6283695Z cache_alignment : 64 2025-05-07T19:43:02.6283822Z address sizes : 46 bits physical, 48 bits virtual 2025-05-07T19:43:02.6283910Z power management: 2025-05-07T19:43:02.6283915Z 2025-05-07T19:43:02.6283995Z processor : 68 2025-05-07T19:43:02.6284083Z vendor_id : GenuineIntel 2025-05-07T19:43:02.6284156Z cpu family : 6 2025-05-07T19:43:02.6284234Z model : 85 2025-05-07T19:43:02.6284394Z model name : Intel(R) Xeon(R) Platinum 8275CL CPU @ 3.00GHz 2025-05-07T19:43:02.6284469Z stepping : 7 2025-05-07T19:43:02.6284562Z microcode : 0x5003901 2025-05-07T19:43:02.6284639Z cpu MHz : 3263.992 2025-05-07T19:43:02.6284720Z cache size : 36608 KB 2025-05-07T19:43:02.6284799Z physical id : 0 2025-05-07T19:43:02.6284880Z siblings : 48 2025-05-07T19:43:02.6284958Z core id : 20 2025-05-07T19:43:02.6285033Z cpu cores : 24 2025-05-07T19:43:02.6285104Z apicid : 41 2025-05-07T19:43:02.6285187Z initial apicid : 41 2025-05-07T19:43:02.6285258Z fpu : yes 2025-05-07T19:43:02.6285339Z fpu_exception : yes 2025-05-07T19:43:02.6285529Z cpuid level : 13 2025-05-07T19:43:02.6285596Z wp : yes 2025-05-07T19:43:02.6287651Z flags : fpu vme de pse tsc msr pae mce cx8 apic sep mtrr pge mca cmov pat pse36 clflush mmx fxsr sse sse2 ss ht syscall nx pdpe1gb rdtscp lm constant_tsc arch_perfmon rep_good nopl xtopology nonstop_tsc cpuid aperfmperf tsc_known_freq pni pclmulqdq monitor ssse3 fma cx16 pcid sse4_1 sse4_2 x2apic movbe popcnt tsc_deadline_timer aes xsave avx f16c rdrand hypervisor lahf_lm abm 3dnowprefetch invpcid_single pti fsgsbase tsc_adjust bmi1 avx2 smep bmi2 erms invpcid mpx avx512f avx512dq rdseed adx smap clflushopt clwb avx512cd avx512bw avx512vl xsaveopt xsavec xgetbv1 xsaves ida arat pku ospke avx512_vnni 
2025-05-07T19:43:02.6288023Z bugs : cpu_meltdown spectre_v1 spectre_v2 spec_store_bypass l1tf mds swapgs itlb_multihit mmio_stale_data retbleed gds bhi 2025-05-07T19:43:02.6288097Z bogomips : 5999.97 2025-05-07T19:43:02.6288167Z clflush size : 64 2025-05-07T19:43:02.6288246Z cache_alignment : 64 2025-05-07T19:43:02.6288365Z address sizes : 46 bits physical, 48 bits virtual 2025-05-07T19:43:02.6288439Z power management: 2025-05-07T19:43:02.6288443Z 2025-05-07T19:43:02.6288520Z processor : 69 2025-05-07T19:43:02.6288598Z vendor_id : GenuineIntel 2025-05-07T19:43:02.6288672Z cpu family : 6 2025-05-07T19:43:02.6288740Z model : 85 2025-05-07T19:43:02.6288893Z model name : Intel(R) Xeon(R) Platinum 8275CL CPU @ 3.00GHz 2025-05-07T19:43:02.6288962Z stepping : 7 2025-05-07T19:43:02.6289035Z microcode : 0x5003901 2025-05-07T19:43:02.6289113Z cpu MHz : 3286.994 2025-05-07T19:43:02.6289190Z cache size : 36608 KB 2025-05-07T19:43:02.6289267Z physical id : 0 2025-05-07T19:43:02.6289336Z siblings : 48 2025-05-07T19:43:02.6289412Z core id : 21 2025-05-07T19:43:02.6289481Z cpu cores : 24 2025-05-07T19:43:02.6289552Z apicid : 43 2025-05-07T19:43:02.6289627Z initial apicid : 43 2025-05-07T19:43:02.6289704Z fpu : yes 2025-05-07T19:43:02.6289781Z fpu_exception : yes 2025-05-07T19:43:02.6289853Z cpuid level : 13 2025-05-07T19:43:02.6289926Z wp : yes 2025-05-07T19:43:02.6291935Z flags : fpu vme de pse tsc msr pae mce cx8 apic sep mtrr pge mca cmov pat pse36 clflush mmx fxsr sse sse2 ss ht syscall nx pdpe1gb rdtscp lm constant_tsc arch_perfmon rep_good nopl xtopology nonstop_tsc cpuid aperfmperf tsc_known_freq pni pclmulqdq monitor ssse3 fma cx16 pcid sse4_1 sse4_2 x2apic movbe popcnt tsc_deadline_timer aes xsave avx f16c rdrand hypervisor lahf_lm abm 3dnowprefetch invpcid_single pti fsgsbase tsc_adjust bmi1 avx2 smep bmi2 erms invpcid mpx avx512f avx512dq rdseed adx smap clflushopt clwb avx512cd avx512bw avx512vl xsaveopt xsavec xgetbv1 xsaves ida arat pku ospke avx512_vnni 
2025-05-07T19:43:02.6292341Z bugs : cpu_meltdown spectre_v1 spectre_v2 spec_store_bypass l1tf mds swapgs itlb_multihit mmio_stale_data retbleed gds bhi 2025-05-07T19:43:02.6292430Z bogomips : 5999.97 2025-05-07T19:43:02.6292503Z clflush size : 64 2025-05-07T19:43:02.6292578Z cache_alignment : 64 2025-05-07T19:43:02.6292705Z address sizes : 46 bits physical, 48 bits virtual 2025-05-07T19:43:02.6292776Z power management: 2025-05-07T19:43:02.6292780Z 2025-05-07T19:43:02.6292851Z processor : 70 2025-05-07T19:43:02.6292936Z vendor_id : GenuineIntel 2025-05-07T19:43:02.6293012Z cpu family : 6 2025-05-07T19:43:02.6293081Z model : 85 2025-05-07T19:43:02.6293226Z model name : Intel(R) Xeon(R) Platinum 8275CL CPU @ 3.00GHz 2025-05-07T19:43:02.6293310Z stepping : 7 2025-05-07T19:43:02.6293382Z microcode : 0x5003901 2025-05-07T19:43:02.6293451Z cpu MHz : 3286.323 2025-05-07T19:43:02.6293533Z cache size : 36608 KB 2025-05-07T19:43:02.6293614Z physical id : 0 2025-05-07T19:43:02.6293687Z siblings : 48 2025-05-07T19:43:02.6293754Z core id : 22 2025-05-07T19:43:02.6293834Z cpu cores : 24 2025-05-07T19:43:02.6293903Z apicid : 45 2025-05-07T19:43:02.6293980Z initial apicid : 45 2025-05-07T19:43:02.6294046Z fpu : yes 2025-05-07T19:43:02.6294137Z fpu_exception : yes 2025-05-07T19:43:02.6294206Z cpuid level : 13 2025-05-07T19:43:02.6294271Z wp : yes 2025-05-07T19:43:02.6296411Z flags : fpu vme de pse tsc msr pae mce cx8 apic sep mtrr pge mca cmov pat pse36 clflush mmx fxsr sse sse2 ss ht syscall nx pdpe1gb rdtscp lm constant_tsc arch_perfmon rep_good nopl xtopology nonstop_tsc cpuid aperfmperf tsc_known_freq pni pclmulqdq monitor ssse3 fma cx16 pcid sse4_1 sse4_2 x2apic movbe popcnt tsc_deadline_timer aes xsave avx f16c rdrand hypervisor lahf_lm abm 3dnowprefetch invpcid_single pti fsgsbase tsc_adjust bmi1 avx2 smep bmi2 erms invpcid mpx avx512f avx512dq rdseed adx smap clflushopt clwb avx512cd avx512bw avx512vl xsaveopt xsavec xgetbv1 xsaves ida arat pku ospke avx512_vnni 
2025-05-07T19:43:02.6296772Z bugs : cpu_meltdown spectre_v1 spectre_v2 spec_store_bypass l1tf mds swapgs itlb_multihit mmio_stale_data retbleed gds bhi 2025-05-07T19:43:02.6296847Z bogomips : 5999.97 2025-05-07T19:43:02.6296933Z clflush size : 64 2025-05-07T19:43:02.6297009Z cache_alignment : 64 2025-05-07T19:43:02.6297127Z address sizes : 46 bits physical, 48 bits virtual 2025-05-07T19:43:02.6297208Z power management: 2025-05-07T19:43:02.6297213Z 2025-05-07T19:43:02.6297285Z processor : 71 2025-05-07T19:43:02.6297362Z vendor_id : GenuineIntel 2025-05-07T19:43:02.6297430Z cpu family : 6 2025-05-07T19:43:02.6297505Z model : 85 2025-05-07T19:43:02.6297651Z model name : Intel(R) Xeon(R) Platinum 8275CL CPU @ 3.00GHz 2025-05-07T19:43:02.6297721Z stepping : 7 2025-05-07T19:43:02.6297809Z microcode : 0x5003901 2025-05-07T19:43:02.6297884Z cpu MHz : 2999.988 2025-05-07T19:43:02.6297959Z cache size : 36608 KB 2025-05-07T19:43:02.6298031Z physical id : 0 2025-05-07T19:43:02.6298105Z siblings : 48 2025-05-07T19:43:02.6298175Z core id : 23 2025-05-07T19:43:02.6298249Z cpu cores : 24 2025-05-07T19:43:02.6298329Z apicid : 47 2025-05-07T19:43:02.6298405Z initial apicid : 47 2025-05-07T19:43:02.6298473Z fpu : yes 2025-05-07T19:43:02.6298547Z fpu_exception : yes 2025-05-07T19:43:02.6298625Z cpuid level : 13 2025-05-07T19:43:02.6298694Z wp : yes 2025-05-07T19:43:02.6300690Z flags : fpu vme de pse tsc msr pae mce cx8 apic sep mtrr pge mca cmov pat pse36 clflush mmx fxsr sse sse2 ss ht syscall nx pdpe1gb rdtscp lm constant_tsc arch_perfmon rep_good nopl xtopology nonstop_tsc cpuid aperfmperf tsc_known_freq pni pclmulqdq monitor ssse3 fma cx16 pcid sse4_1 sse4_2 x2apic movbe popcnt tsc_deadline_timer aes xsave avx f16c rdrand hypervisor lahf_lm abm 3dnowprefetch invpcid_single pti fsgsbase tsc_adjust bmi1 avx2 smep bmi2 erms invpcid mpx avx512f avx512dq rdseed adx smap clflushopt clwb avx512cd avx512bw avx512vl xsaveopt xsavec xgetbv1 xsaves ida arat pku ospke avx512_vnni 
2025-05-07T19:43:02.6301057Z bugs : cpu_meltdown spectre_v1 spectre_v2 spec_store_bypass l1tf mds swapgs itlb_multihit mmio_stale_data retbleed gds bhi 2025-05-07T19:43:02.6301180Z bogomips : 5999.97 2025-05-07T19:43:02.6301254Z clflush size : 64 2025-05-07T19:43:02.6301338Z cache_alignment : 64 2025-05-07T19:43:02.6301456Z address sizes : 46 bits physical, 48 bits virtual 2025-05-07T19:43:02.6301531Z power management: 2025-05-07T19:43:02.6301535Z 2025-05-07T19:43:02.6301615Z processor : 72 2025-05-07T19:43:02.6301697Z vendor_id : GenuineIntel 2025-05-07T19:43:02.6301765Z cpu family : 6 2025-05-07T19:43:02.6301836Z model : 85 2025-05-07T19:43:02.6302139Z model name : Intel(R) Xeon(R) Platinum 8275CL CPU @ 3.00GHz 2025-05-07T19:43:02.6302211Z stepping : 7 2025-05-07T19:43:02.6302460Z microcode : 0x5003901 2025-05-07T19:43:02.6302553Z cpu MHz : 2999.988 2025-05-07T19:43:02.6302630Z cache size : 36608 KB 2025-05-07T19:43:02.6302710Z physical id : 1 2025-05-07T19:43:02.6302787Z siblings : 48 2025-05-07T19:43:02.6302869Z core id : 0 2025-05-07T19:43:02.6302944Z cpu cores : 24 2025-05-07T19:43:02.6303024Z apicid : 65 2025-05-07T19:43:02.6303117Z initial apicid : 65 2025-05-07T19:43:02.6303191Z fpu : yes 2025-05-07T19:43:02.6303270Z fpu_exception : yes 2025-05-07T19:43:02.6303345Z cpuid level : 13 2025-05-07T19:43:02.6303424Z wp : yes 2025-05-07T19:43:02.6305593Z flags : fpu vme de pse tsc msr pae mce cx8 apic sep mtrr pge mca cmov pat pse36 clflush mmx fxsr sse sse2 ss ht syscall nx pdpe1gb rdtscp lm constant_tsc arch_perfmon rep_good nopl xtopology nonstop_tsc cpuid aperfmperf tsc_known_freq pni pclmulqdq monitor ssse3 fma cx16 pcid sse4_1 sse4_2 x2apic movbe popcnt tsc_deadline_timer aes xsave avx f16c rdrand hypervisor lahf_lm abm 3dnowprefetch invpcid_single pti fsgsbase tsc_adjust bmi1 avx2 smep bmi2 erms invpcid mpx avx512f avx512dq rdseed adx smap clflushopt clwb avx512cd avx512bw avx512vl xsaveopt xsavec xgetbv1 xsaves ida arat pku ospke avx512_vnni 
2025-05-07T19:43:02.6306080Z bugs : cpu_meltdown spectre_v1 spectre_v2 spec_store_bypass l1tf mds swapgs itlb_multihit mmio_stale_data retbleed gds bhi 2025-05-07T19:43:02.6306164Z bogomips : 5999.97 2025-05-07T19:43:02.6306240Z clflush size : 64 2025-05-07T19:43:02.6306324Z cache_alignment : 64 2025-05-07T19:43:02.6306460Z address sizes : 46 bits physical, 48 bits virtual 2025-05-07T19:43:02.6306540Z power management: 2025-05-07T19:43:02.6306545Z 2025-05-07T19:43:02.6306626Z processor : 73 2025-05-07T19:43:02.6306718Z vendor_id : GenuineIntel 2025-05-07T19:43:02.6306795Z cpu family : 6 2025-05-07T19:43:02.6306872Z model : 85 2025-05-07T19:43:02.6307030Z model name : Intel(R) Xeon(R) Platinum 8275CL CPU @ 3.00GHz 2025-05-07T19:43:02.6307119Z stepping : 7 2025-05-07T19:43:02.6307199Z microcode : 0x5003901 2025-05-07T19:43:02.6307276Z cpu MHz : 1687.814 2025-05-07T19:43:02.6307365Z cache size : 36608 KB 2025-05-07T19:43:02.6307442Z physical id : 1 2025-05-07T19:43:02.6307519Z siblings : 48 2025-05-07T19:43:02.6307594Z core id : 1 2025-05-07T19:43:02.6307678Z cpu cores : 24 2025-05-07T19:43:02.6307752Z apicid : 67 2025-05-07T19:43:02.6307837Z initial apicid : 67 2025-05-07T19:43:02.6307914Z fpu : yes 2025-05-07T19:43:02.6308015Z fpu_exception : yes 2025-05-07T19:43:02.6308090Z cpuid level : 13 2025-05-07T19:43:02.6308163Z wp : yes 2025-05-07T19:43:02.6310343Z flags : fpu vme de pse tsc msr pae mce cx8 apic sep mtrr pge mca cmov pat pse36 clflush mmx fxsr sse sse2 ss ht syscall nx pdpe1gb rdtscp lm constant_tsc arch_perfmon rep_good nopl xtopology nonstop_tsc cpuid aperfmperf tsc_known_freq pni pclmulqdq monitor ssse3 fma cx16 pcid sse4_1 sse4_2 x2apic movbe popcnt tsc_deadline_timer aes xsave avx f16c rdrand hypervisor lahf_lm abm 3dnowprefetch invpcid_single pti fsgsbase tsc_adjust bmi1 avx2 smep bmi2 erms invpcid mpx avx512f avx512dq rdseed adx smap clflushopt clwb avx512cd avx512bw avx512vl xsaveopt xsavec xgetbv1 xsaves ida arat pku ospke avx512_vnni 
2025-05-07T19:43:02.6310741Z bugs : cpu_meltdown spectre_v1 spectre_v2 spec_store_bypass l1tf mds swapgs itlb_multihit mmio_stale_data retbleed gds bhi 2025-05-07T19:43:02.6310908Z bogomips : 5999.97 2025-05-07T19:43:02.6310989Z clflush size : 64 2025-05-07T19:43:02.6311079Z cache_alignment : 64 2025-05-07T19:43:02.6311207Z address sizes : 46 bits physical, 48 bits virtual 2025-05-07T19:43:02.6311305Z power management: 2025-05-07T19:43:02.6311309Z 2025-05-07T19:43:02.6311384Z processor : 74 2025-05-07T19:43:02.6311473Z vendor_id : GenuineIntel 2025-05-07T19:43:02.6311557Z cpu family : 6 2025-05-07T19:43:02.6311632Z model : 85 2025-05-07T19:43:02.6311797Z model name : Intel(R) Xeon(R) Platinum 8275CL CPU @ 3.00GHz 2025-05-07T19:43:02.6311879Z stepping : 7 2025-05-07T19:43:02.6311973Z microcode : 0x5003901 2025-05-07T19:43:02.6312056Z cpu MHz : 2999.988 2025-05-07T19:43:02.6312136Z cache size : 36608 KB 2025-05-07T19:43:02.6312231Z physical id : 1 2025-05-07T19:43:02.6312305Z siblings : 48 2025-05-07T19:43:02.6312392Z core id : 2 2025-05-07T19:43:02.6312479Z cpu cores : 24 2025-05-07T19:43:02.6312565Z apicid : 69 2025-05-07T19:43:02.6312647Z initial apicid : 69 2025-05-07T19:43:02.6312796Z fpu : yes 2025-05-07T19:43:02.6312881Z fpu_exception : yes 2025-05-07T19:43:02.6312968Z cpuid level : 13 2025-05-07T19:43:02.6313038Z wp : yes 2025-05-07T19:43:02.6315206Z flags : fpu vme de pse tsc msr pae mce cx8 apic sep mtrr pge mca cmov pat pse36 clflush mmx fxsr sse sse2 ss ht syscall nx pdpe1gb rdtscp lm constant_tsc arch_perfmon rep_good nopl xtopology nonstop_tsc cpuid aperfmperf tsc_known_freq pni pclmulqdq monitor ssse3 fma cx16 pcid sse4_1 sse4_2 x2apic movbe popcnt tsc_deadline_timer aes xsave avx f16c rdrand hypervisor lahf_lm abm 3dnowprefetch invpcid_single pti fsgsbase tsc_adjust bmi1 avx2 smep bmi2 erms invpcid mpx avx512f avx512dq rdseed adx smap clflushopt clwb avx512cd avx512bw avx512vl xsaveopt xsavec xgetbv1 xsaves ida arat pku ospke avx512_vnni 
2025-05-07T19:43:02.6315670Z bugs : cpu_meltdown spectre_v1 spectre_v2 spec_store_bypass l1tf mds swapgs itlb_multihit mmio_stale_data retbleed gds bhi 2025-05-07T19:43:02.6315758Z bogomips : 5999.97 2025-05-07T19:43:02.6315842Z clflush size : 64 2025-05-07T19:43:02.6315933Z cache_alignment : 64 2025-05-07T19:43:02.6316061Z address sizes : 46 bits physical, 48 bits virtual 2025-05-07T19:43:02.6316142Z power management: 2025-05-07T19:43:02.6316147Z 2025-05-07T19:43:02.6316232Z processor : 75 2025-05-07T19:43:02.6316318Z vendor_id : GenuineIntel 2025-05-07T19:43:02.6316396Z cpu family : 6 2025-05-07T19:43:02.6316480Z model : 85 2025-05-07T19:43:02.6316642Z model name : Intel(R) Xeon(R) Platinum 8275CL CPU @ 3.00GHz 2025-05-07T19:43:02.6316720Z stepping : 7 2025-05-07T19:43:02.6316805Z microcode : 0x5003901 2025-05-07T19:43:02.6316900Z cpu MHz : 2999.988 2025-05-07T19:43:02.6316979Z cache size : 36608 KB 2025-05-07T19:43:02.6317060Z physical id : 1 2025-05-07T19:43:02.6317135Z siblings : 48 2025-05-07T19:43:02.6317223Z core id : 3 2025-05-07T19:43:02.6317300Z cpu cores : 24 2025-05-07T19:43:02.6317376Z apicid : 71 2025-05-07T19:43:02.6317472Z initial apicid : 71 2025-05-07T19:43:02.6317544Z fpu : yes 2025-05-07T19:43:02.6317630Z fpu_exception : yes 2025-05-07T19:43:02.6317710Z cpuid level : 13 2025-05-07T19:43:02.6317790Z wp : yes 2025-05-07T19:43:02.6319981Z flags : fpu vme de pse tsc msr pae mce cx8 apic sep mtrr pge mca cmov pat pse36 clflush mmx fxsr sse sse2 ss ht syscall nx pdpe1gb rdtscp lm constant_tsc arch_perfmon rep_good nopl xtopology nonstop_tsc cpuid aperfmperf tsc_known_freq pni pclmulqdq monitor ssse3 fma cx16 pcid sse4_1 sse4_2 x2apic movbe popcnt tsc_deadline_timer aes xsave avx f16c rdrand hypervisor lahf_lm abm 3dnowprefetch invpcid_single pti fsgsbase tsc_adjust bmi1 avx2 smep bmi2 erms invpcid mpx avx512f avx512dq rdseed adx smap clflushopt clwb avx512cd avx512bw avx512vl xsaveopt xsavec xgetbv1 xsaves ida arat pku ospke avx512_vnni 
2025-05-07T19:43:02.6320391Z bugs : cpu_meltdown spectre_v1 spectre_v2 spec_store_bypass l1tf mds swapgs itlb_multihit mmio_stale_data retbleed gds bhi 2025-05-07T19:43:02.6320476Z bogomips : 5999.97 2025-05-07T19:43:02.6320560Z clflush size : 64 2025-05-07T19:43:02.6321313Z cache_alignment : 64 2025-05-07T19:43:02.6321461Z address sizes : 46 bits physical, 48 bits virtual 2025-05-07T19:43:02.6321549Z power management: 2025-05-07T19:43:02.6321554Z 2025-05-07T19:43:02.6321641Z processor : 76 2025-05-07T19:43:02.6321744Z vendor_id : GenuineIntel 2025-05-07T19:43:02.6321825Z cpu family : 6 2025-05-07T19:43:02.6321904Z model : 85 2025-05-07T19:43:02.6322067Z model name : Intel(R) Xeon(R) Platinum 8275CL CPU @ 3.00GHz 2025-05-07T19:43:02.6322168Z stepping : 7 2025-05-07T19:43:02.6322256Z microcode : 0x5003901 2025-05-07T19:43:02.6322338Z cpu MHz : 2999.988 2025-05-07T19:43:02.6322438Z cache size : 36608 KB 2025-05-07T19:43:02.6322522Z physical id : 1 2025-05-07T19:43:02.6322600Z siblings : 48 2025-05-07T19:43:02.6322675Z core id : 4 2025-05-07T19:43:02.6322772Z cpu cores : 24 2025-05-07T19:43:02.6322849Z apicid : 73 2025-05-07T19:43:02.6322930Z initial apicid : 73 2025-05-07T19:43:02.6323018Z fpu : yes 2025-05-07T19:43:02.6323099Z fpu_exception : yes 2025-05-07T19:43:02.6323184Z cpuid level : 13 2025-05-07T19:43:02.6323255Z wp : yes 2025-05-07T19:43:02.6325508Z flags : fpu vme de pse tsc msr pae mce cx8 apic sep mtrr pge mca cmov pat pse36 clflush mmx fxsr sse sse2 ss ht syscall nx pdpe1gb rdtscp lm constant_tsc arch_perfmon rep_good nopl xtopology nonstop_tsc cpuid aperfmperf tsc_known_freq pni pclmulqdq monitor ssse3 fma cx16 pcid sse4_1 sse4_2 x2apic movbe popcnt tsc_deadline_timer aes xsave avx f16c rdrand hypervisor lahf_lm abm 3dnowprefetch invpcid_single pti fsgsbase tsc_adjust bmi1 avx2 smep bmi2 erms invpcid mpx avx512f avx512dq rdseed adx smap clflushopt clwb avx512cd avx512bw avx512vl xsaveopt xsavec xgetbv1 xsaves ida arat pku ospke avx512_vnni 
2025-05-07T19:43:02.6325870Z bugs : cpu_meltdown spectre_v1 spectre_v2 spec_store_bypass l1tf mds swapgs itlb_multihit mmio_stale_data retbleed gds bhi 2025-05-07T19:43:02.6325955Z bogomips : 5999.97 2025-05-07T19:43:02.6326076Z clflush size : 64 2025-05-07T19:43:02.6326152Z cache_alignment : 64 2025-05-07T19:43:02.6326275Z address sizes : 46 bits physical, 48 bits virtual 2025-05-07T19:43:02.6326368Z power management: 2025-05-07T19:43:02.6326373Z 2025-05-07T19:43:02.6326446Z processor : 77 2025-05-07T19:43:02.6326525Z vendor_id : GenuineIntel 2025-05-07T19:43:02.6326610Z cpu family : 6 2025-05-07T19:43:02.6326681Z model : 85 2025-05-07T19:43:02.6326834Z model name : Intel(R) Xeon(R) Platinum 8275CL CPU @ 3.00GHz 2025-05-07T19:43:02.6326907Z stepping : 7 2025-05-07T19:43:02.6326993Z microcode : 0x5003901 2025-05-07T19:43:02.6327065Z cpu MHz : 1479.458 2025-05-07T19:43:02.6327138Z cache size : 36608 KB 2025-05-07T19:43:02.6327219Z physical id : 1 2025-05-07T19:43:02.6327291Z siblings : 48 2025-05-07T19:43:02.6327363Z core id : 5 2025-05-07T19:43:02.6327434Z cpu cores : 24 2025-05-07T19:43:02.6327512Z apicid : 75 2025-05-07T19:43:02.6327590Z initial apicid : 75 2025-05-07T19:43:02.6327657Z fpu : yes 2025-05-07T19:43:02.6327745Z fpu_exception : yes 2025-05-07T19:43:02.6327820Z cpuid level : 13 2025-05-07T19:43:02.6327894Z wp : yes 2025-05-07T19:43:02.6329907Z flags : fpu vme de pse tsc msr pae mce cx8 apic sep mtrr pge mca cmov pat pse36 clflush mmx fxsr sse sse2 ss ht syscall nx pdpe1gb rdtscp lm constant_tsc arch_perfmon rep_good nopl xtopology nonstop_tsc cpuid aperfmperf tsc_known_freq pni pclmulqdq monitor ssse3 fma cx16 pcid sse4_1 sse4_2 x2apic movbe popcnt tsc_deadline_timer aes xsave avx f16c rdrand hypervisor lahf_lm abm 3dnowprefetch invpcid_single pti fsgsbase tsc_adjust bmi1 avx2 smep bmi2 erms invpcid mpx avx512f avx512dq rdseed adx smap clflushopt clwb avx512cd avx512bw avx512vl xsaveopt xsavec xgetbv1 xsaves ida arat pku ospke avx512_vnni 
2025-05-07T19:43:02.6330271Z bugs : cpu_meltdown spectre_v1 spectre_v2 spec_store_bypass l1tf mds swapgs itlb_multihit mmio_stale_data retbleed gds bhi 2025-05-07T19:43:02.6330352Z bogomips : 5999.97 2025-05-07T19:43:02.6330437Z clflush size : 64 2025-05-07T19:43:02.6330516Z cache_alignment : 64 2025-05-07T19:43:02.6330635Z address sizes : 46 bits physical, 48 bits virtual 2025-05-07T19:43:02.6330774Z power management: 2025-05-07T19:43:02.6330779Z 2025-05-07T19:43:02.6330865Z processor : 78 2025-05-07T19:43:02.6330945Z vendor_id : GenuineIntel 2025-05-07T19:43:02.6331014Z cpu family : 6 2025-05-07T19:43:02.6331090Z model : 85 2025-05-07T19:43:02.6331235Z model name : Intel(R) Xeon(R) Platinum 8275CL CPU @ 3.00GHz 2025-05-07T19:43:02.6331308Z stepping : 7 2025-05-07T19:43:02.6331388Z microcode : 0x5003901 2025-05-07T19:43:02.6331470Z cpu MHz : 2999.988 2025-05-07T19:43:02.6331546Z cache size : 36608 KB 2025-05-07T19:43:02.6331621Z physical id : 1 2025-05-07T19:43:02.6331709Z siblings : 48 2025-05-07T19:43:02.6331780Z core id : 6 2025-05-07T19:43:02.6331853Z cpu cores : 24 2025-05-07T19:43:02.6331925Z apicid : 77 2025-05-07T19:43:02.6332006Z initial apicid : 77 2025-05-07T19:43:02.6332073Z fpu : yes 2025-05-07T19:43:02.6332155Z fpu_exception : yes 2025-05-07T19:43:02.6332238Z cpuid level : 13 2025-05-07T19:43:02.6332305Z wp : yes 2025-05-07T19:43:02.6334305Z flags : fpu vme de pse tsc msr pae mce cx8 apic sep mtrr pge mca cmov pat pse36 clflush mmx fxsr sse sse2 ss ht syscall nx pdpe1gb rdtscp lm constant_tsc arch_perfmon rep_good nopl xtopology nonstop_tsc cpuid aperfmperf tsc_known_freq pni pclmulqdq monitor ssse3 fma cx16 pcid sse4_1 sse4_2 x2apic movbe popcnt tsc_deadline_timer aes xsave avx f16c rdrand hypervisor lahf_lm abm 3dnowprefetch invpcid_single pti fsgsbase tsc_adjust bmi1 avx2 smep bmi2 erms invpcid mpx avx512f avx512dq rdseed adx smap clflushopt clwb avx512cd avx512bw avx512vl xsaveopt xsavec xgetbv1 xsaves ida arat pku ospke avx512_vnni 
2025-05-07T19:43:02.6334677Z bugs : cpu_meltdown spectre_v1 spectre_v2 spec_store_bypass l1tf mds swapgs itlb_multihit mmio_stale_data retbleed gds bhi 2025-05-07T19:43:02.6334751Z bogomips : 5999.97 2025-05-07T19:43:02.6334823Z clflush size : 64 2025-05-07T19:43:02.6334909Z cache_alignment : 64 2025-05-07T19:43:02.6335074Z address sizes : 46 bits physical, 48 bits virtual 2025-05-07T19:43:02.6335148Z power management: 2025-05-07T19:43:02.6335155Z 2025-05-07T19:43:02.6335225Z processor : 79 2025-05-07T19:43:02.6335321Z vendor_id : GenuineIntel 2025-05-07T19:43:02.6335392Z cpu family : 6 2025-05-07T19:43:02.6335461Z model : 85 2025-05-07T19:43:02.6335618Z model name : Intel(R) Xeon(R) Platinum 8275CL CPU @ 3.00GHz 2025-05-07T19:43:02.6335692Z stepping : 7 2025-05-07T19:43:02.6335770Z microcode : 0x5003901 2025-05-07T19:43:02.6335841Z cpu MHz : 2999.988 2025-05-07T19:43:02.6335939Z cache size : 36608 KB 2025-05-07T19:43:02.6336013Z physical id : 1 2025-05-07T19:43:02.6336085Z siblings : 48 2025-05-07T19:43:02.6336167Z core id : 7 2025-05-07T19:43:02.6336238Z cpu cores : 24 2025-05-07T19:43:02.6336307Z apicid : 79 2025-05-07T19:43:02.6336381Z initial apicid : 79 2025-05-07T19:43:02.6336461Z fpu : yes 2025-05-07T19:43:02.6336537Z fpu_exception : yes 2025-05-07T19:43:02.6336610Z cpuid level : 13 2025-05-07T19:43:02.6336679Z wp : yes 2025-05-07T19:43:02.6338707Z flags : fpu vme de pse tsc msr pae mce cx8 apic sep mtrr pge mca cmov pat pse36 clflush mmx fxsr sse sse2 ss ht syscall nx pdpe1gb rdtscp lm constant_tsc arch_perfmon rep_good nopl xtopology nonstop_tsc cpuid aperfmperf tsc_known_freq pni pclmulqdq monitor ssse3 fma cx16 pcid sse4_1 sse4_2 x2apic movbe popcnt tsc_deadline_timer aes xsave avx f16c rdrand hypervisor lahf_lm abm 3dnowprefetch invpcid_single pti fsgsbase tsc_adjust bmi1 avx2 smep bmi2 erms invpcid mpx avx512f avx512dq rdseed adx smap clflushopt clwb avx512cd avx512bw avx512vl xsaveopt xsavec xgetbv1 xsaves ida arat pku ospke avx512_vnni 
2025-05-07T19:43:02.6339072Z bugs : cpu_meltdown spectre_v1 spectre_v2 spec_store_bypass l1tf mds swapgs itlb_multihit mmio_stale_data retbleed gds bhi 2025-05-07T19:43:02.6339151Z bogomips : 5999.97 2025-05-07T19:43:02.6339222Z clflush size : 64 2025-05-07T19:43:02.6339303Z cache_alignment : 64 2025-05-07T19:43:02.6339427Z address sizes : 46 bits physical, 48 bits virtual 2025-05-07T19:43:02.6339507Z power management: 2025-05-07T19:43:02.6339511Z 2025-05-07T19:43:02.6339586Z processor : 80 2025-05-07T19:43:02.6339718Z vendor_id : GenuineIntel 2025-05-07T19:43:02.6339793Z cpu family : 6 2025-05-07T19:43:02.6339861Z model : 85 2025-05-07T19:43:02.6340007Z model name : Intel(R) Xeon(R) Platinum 8275CL CPU @ 3.00GHz 2025-05-07T19:43:02.6340089Z stepping : 7 2025-05-07T19:43:02.6340165Z microcode : 0x5003901 2025-05-07T19:43:02.6340238Z cpu MHz : 2999.988 2025-05-07T19:43:02.6340311Z cache size : 36608 KB 2025-05-07T19:43:02.6340395Z physical id : 1 2025-05-07T19:43:02.6340469Z siblings : 48 2025-05-07T19:43:02.6340538Z core id : 8 2025-05-07T19:43:02.6340611Z cpu cores : 24 2025-05-07T19:43:02.6340699Z apicid : 81 2025-05-07T19:43:02.6340773Z initial apicid : 81 2025-05-07T19:43:02.6340843Z fpu : yes 2025-05-07T19:43:02.6340927Z fpu_exception : yes 2025-05-07T19:43:02.6341002Z cpuid level : 13 2025-05-07T19:43:02.6341072Z wp : yes 2025-05-07T19:43:02.6343097Z flags : fpu vme de pse tsc msr pae mce cx8 apic sep mtrr pge mca cmov pat pse36 clflush mmx fxsr sse sse2 ss ht syscall nx pdpe1gb rdtscp lm constant_tsc arch_perfmon rep_good nopl xtopology nonstop_tsc cpuid aperfmperf tsc_known_freq pni pclmulqdq monitor ssse3 fma cx16 pcid sse4_1 sse4_2 x2apic movbe popcnt tsc_deadline_timer aes xsave avx f16c rdrand hypervisor lahf_lm abm 3dnowprefetch invpcid_single pti fsgsbase tsc_adjust bmi1 avx2 smep bmi2 erms invpcid mpx avx512f avx512dq rdseed adx smap clflushopt clwb avx512cd avx512bw avx512vl xsaveopt xsavec xgetbv1 xsaves ida arat pku ospke avx512_vnni 
2025-05-07T19:43:02.6343467Z bugs : cpu_meltdown spectre_v1 spectre_v2 spec_store_bypass l1tf mds swapgs itlb_multihit mmio_stale_data retbleed gds bhi 2025-05-07T19:43:02.6343543Z bogomips : 5999.97 2025-05-07T19:43:02.6343623Z clflush size : 64 2025-05-07T19:43:02.6343703Z cache_alignment : 64 2025-05-07T19:43:02.6343826Z address sizes : 46 bits physical, 48 bits virtual 2025-05-07T19:43:02.6343901Z power management: 2025-05-07T19:43:02.6343960Z 2025-05-07T19:43:02.6344036Z processor : 81 2025-05-07T19:43:02.6344116Z vendor_id : GenuineIntel 2025-05-07T19:43:02.6344191Z cpu family : 6 2025-05-07T19:43:02.6344269Z model : 85 2025-05-07T19:43:02.6344417Z model name : Intel(R) Xeon(R) Platinum 8275CL CPU @ 3.00GHz 2025-05-07T19:43:02.6344489Z stepping : 7 2025-05-07T19:43:02.6344720Z microcode : 0x5003901 2025-05-07T19:43:02.6344792Z cpu MHz : 2999.988 2025-05-07T19:43:02.6344867Z cache size : 36608 KB 2025-05-07T19:43:02.6344940Z physical id : 1 2025-05-07T19:43:02.6345013Z siblings : 48 2025-05-07T19:43:02.6345083Z core id : 9 2025-05-07T19:43:02.6345158Z cpu cores : 24 2025-05-07T19:43:02.6345228Z apicid : 83 2025-05-07T19:43:02.6345315Z initial apicid : 83 2025-05-07T19:43:02.6345385Z fpu : yes 2025-05-07T19:43:02.6345463Z fpu_exception : yes 2025-05-07T19:43:02.6345543Z cpuid level : 13 2025-05-07T19:43:02.6345612Z wp : yes 2025-05-07T19:43:02.6347630Z flags : fpu vme de pse tsc msr pae mce cx8 apic sep mtrr pge mca cmov pat pse36 clflush mmx fxsr sse sse2 ss ht syscall nx pdpe1gb rdtscp lm constant_tsc arch_perfmon rep_good nopl xtopology nonstop_tsc cpuid aperfmperf tsc_known_freq pni pclmulqdq monitor ssse3 fma cx16 pcid sse4_1 sse4_2 x2apic movbe popcnt tsc_deadline_timer aes xsave avx f16c rdrand hypervisor lahf_lm abm 3dnowprefetch invpcid_single pti fsgsbase tsc_adjust bmi1 avx2 smep bmi2 erms invpcid mpx avx512f avx512dq rdseed adx smap clflushopt clwb avx512cd avx512bw avx512vl xsaveopt xsavec xgetbv1 xsaves ida arat pku ospke avx512_vnni 
2025-05-07T19:43:02.6348004Z bugs : cpu_meltdown spectre_v1 spectre_v2 spec_store_bypass l1tf mds swapgs itlb_multihit mmio_stale_data retbleed gds bhi 2025-05-07T19:43:02.6348078Z bogomips : 5999.97 2025-05-07T19:43:02.6348152Z clflush size : 64 2025-05-07T19:43:02.6348240Z cache_alignment : 64 2025-05-07T19:43:02.6348360Z address sizes : 46 bits physical, 48 bits virtual 2025-05-07T19:43:02.6348433Z power management: 2025-05-07T19:43:02.6348437Z 2025-05-07T19:43:02.6348520Z processor : 82 2025-05-07T19:43:02.6348597Z vendor_id : GenuineIntel 2025-05-07T19:43:02.6348668Z cpu family : 6 2025-05-07T19:43:02.6348786Z model : 85 2025-05-07T19:43:02.6348942Z model name : Intel(R) Xeon(R) Platinum 8275CL CPU @ 3.00GHz 2025-05-07T19:43:02.6349013Z stepping : 7 2025-05-07T19:43:02.6349085Z microcode : 0x5003901 2025-05-07T19:43:02.6349158Z cpu MHz : 2999.988 2025-05-07T19:43:02.6349236Z cache size : 36608 KB 2025-05-07T19:43:02.6349307Z physical id : 1 2025-05-07T19:43:02.6349377Z siblings : 48 2025-05-07T19:43:02.6349451Z core id : 10 2025-05-07T19:43:02.6349522Z cpu cores : 24 2025-05-07T19:43:02.6349589Z apicid : 85 2025-05-07T19:43:02.6349668Z initial apicid : 85 2025-05-07T19:43:02.6349744Z fpu : yes 2025-05-07T19:43:02.6349818Z fpu_exception : yes 2025-05-07T19:43:02.6349892Z cpuid level : 13 2025-05-07T19:43:02.6349964Z wp : yes 2025-05-07T19:43:02.6351959Z flags : fpu vme de pse tsc msr pae mce cx8 apic sep mtrr pge mca cmov pat pse36 clflush mmx fxsr sse sse2 ss ht syscall nx pdpe1gb rdtscp lm constant_tsc arch_perfmon rep_good nopl xtopology nonstop_tsc cpuid aperfmperf tsc_known_freq pni pclmulqdq monitor ssse3 fma cx16 pcid sse4_1 sse4_2 x2apic movbe popcnt tsc_deadline_timer aes xsave avx f16c rdrand hypervisor lahf_lm abm 3dnowprefetch invpcid_single pti fsgsbase tsc_adjust bmi1 avx2 smep bmi2 erms invpcid mpx avx512f avx512dq rdseed adx smap clflushopt clwb avx512cd avx512bw avx512vl xsaveopt xsavec xgetbv1 xsaves ida arat pku ospke avx512_vnni 
2025-05-07T19:43:02.6352321Z bugs : cpu_meltdown spectre_v1 spectre_v2 spec_store_bypass l1tf mds swapgs itlb_multihit mmio_stale_data retbleed gds bhi 2025-05-07T19:43:02.6352404Z bogomips : 5999.97 2025-05-07T19:43:02.6352479Z clflush size : 64 2025-05-07T19:43:02.6352552Z cache_alignment : 64 2025-05-07T19:43:02.6352681Z address sizes : 46 bits physical, 48 bits virtual 2025-05-07T19:43:02.6352832Z power management: 2025-05-07T19:43:02.6352837Z 2025-05-07T19:43:02.6352909Z processor : 83 2025-05-07T19:43:02.6353040Z vendor_id : GenuineIntel 2025-05-07T19:43:02.6353291Z cpu family : 6 2025-05-07T19:43:02.6353369Z model : 85 2025-05-07T19:43:02.6353529Z model name : Intel(R) Xeon(R) Platinum 8275CL CPU @ 3.00GHz 2025-05-07T19:43:02.6353619Z stepping : 7 2025-05-07T19:43:02.6353701Z microcode : 0x5003901 2025-05-07T19:43:02.6353776Z cpu MHz : 2999.988 2025-05-07T19:43:02.6353859Z cache size : 36608 KB 2025-05-07T19:43:02.6354007Z physical id : 1 2025-05-07T19:43:02.6354085Z siblings : 48 2025-05-07T19:43:02.6354163Z core id : 11 2025-05-07T19:43:02.6354248Z cpu cores : 24 2025-05-07T19:43:02.6354323Z apicid : 87 2025-05-07T19:43:02.6354405Z initial apicid : 87 2025-05-07T19:43:02.6354479Z fpu : yes 2025-05-07T19:43:02.6354572Z fpu_exception : yes 2025-05-07T19:43:02.6354652Z cpuid level : 13 2025-05-07T19:43:02.6354723Z wp : yes 2025-05-07T19:43:02.6356896Z flags : fpu vme de pse tsc msr pae mce cx8 apic sep mtrr pge mca cmov pat pse36 clflush mmx fxsr sse sse2 ss ht syscall nx pdpe1gb rdtscp lm constant_tsc arch_perfmon rep_good nopl xtopology nonstop_tsc cpuid aperfmperf tsc_known_freq pni pclmulqdq monitor ssse3 fma cx16 pcid sse4_1 sse4_2 x2apic movbe popcnt tsc_deadline_timer aes xsave avx f16c rdrand hypervisor lahf_lm abm 3dnowprefetch invpcid_single pti fsgsbase tsc_adjust bmi1 avx2 smep bmi2 erms invpcid mpx avx512f avx512dq rdseed adx smap clflushopt clwb avx512cd avx512bw avx512vl xsaveopt xsavec xgetbv1 xsaves ida arat pku ospke avx512_vnni 
2025-05-07T19:43:02.6357291Z bugs : cpu_meltdown spectre_v1 spectre_v2 spec_store_bypass l1tf mds swapgs itlb_multihit mmio_stale_data retbleed gds bhi 2025-05-07T19:43:02.6357375Z bogomips : 5999.97 2025-05-07T19:43:02.6357468Z clflush size : 64 2025-05-07T19:43:02.6357549Z cache_alignment : 64 2025-05-07T19:43:02.6357680Z address sizes : 46 bits physical, 48 bits virtual 2025-05-07T19:43:02.6357770Z power management: 2025-05-07T19:43:02.6357774Z 2025-05-07T19:43:02.6357852Z processor : 84 2025-05-07T19:43:02.6357937Z vendor_id : GenuineIntel 2025-05-07T19:43:02.6358015Z cpu family : 6 2025-05-07T19:43:02.6358095Z model : 85 2025-05-07T19:43:02.6358252Z model name : Intel(R) Xeon(R) Platinum 8275CL CPU @ 3.00GHz 2025-05-07T19:43:02.6358380Z stepping : 7 2025-05-07T19:43:02.6358471Z microcode : 0x5003901 2025-05-07T19:43:02.6358551Z cpu MHz : 1590.269 2025-05-07T19:43:02.6358630Z cache size : 36608 KB 2025-05-07T19:43:02.6358707Z physical id : 1 2025-05-07T19:43:02.6358790Z siblings : 48 2025-05-07T19:43:02.6358866Z core id : 12 2025-05-07T19:43:02.6358941Z cpu cores : 24 2025-05-07T19:43:02.6359023Z apicid : 89 2025-05-07T19:43:02.6359102Z initial apicid : 89 2025-05-07T19:43:02.6359177Z fpu : yes 2025-05-07T19:43:02.6359260Z fpu_exception : yes 2025-05-07T19:43:02.6359346Z cpuid level : 13 2025-05-07T19:43:02.6359420Z wp : yes 2025-05-07T19:43:02.6361597Z flags : fpu vme de pse tsc msr pae mce cx8 apic sep mtrr pge mca cmov pat pse36 clflush mmx fxsr sse sse2 ss ht syscall nx pdpe1gb rdtscp lm constant_tsc arch_perfmon rep_good nopl xtopology nonstop_tsc cpuid aperfmperf tsc_known_freq pni pclmulqdq monitor ssse3 fma cx16 pcid sse4_1 sse4_2 x2apic movbe popcnt tsc_deadline_timer aes xsave avx f16c rdrand hypervisor lahf_lm abm 3dnowprefetch invpcid_single pti fsgsbase tsc_adjust bmi1 avx2 smep bmi2 erms invpcid mpx avx512f avx512dq rdseed adx smap clflushopt clwb avx512cd avx512bw avx512vl xsaveopt xsavec xgetbv1 xsaves ida arat pku ospke avx512_vnni 
2025-05-07T19:43:02.6361993Z bugs : cpu_meltdown spectre_v1 spectre_v2 spec_store_bypass l1tf mds swapgs itlb_multihit mmio_stale_data retbleed gds bhi 2025-05-07T19:43:02.6362075Z bogomips : 5999.97 2025-05-07T19:43:02.6362152Z clflush size : 64 2025-05-07T19:43:02.6362242Z cache_alignment : 64 2025-05-07T19:43:02.6362366Z address sizes : 46 bits physical, 48 bits virtual 2025-05-07T19:43:02.6362446Z power management: 2025-05-07T19:43:02.6362451Z 2025-05-07T19:43:02.6362534Z processor : 85 2025-05-07T19:43:02.6362618Z vendor_id : GenuineIntel 2025-05-07T19:43:02.6362695Z cpu family : 6 2025-05-07T19:43:02.6362770Z model : 85 2025-05-07T19:43:02.6362986Z model name : Intel(R) Xeon(R) Platinum 8275CL CPU @ 3.00GHz 2025-05-07T19:43:02.6363067Z stepping : 7 2025-05-07T19:43:02.6363146Z microcode : 0x5003901 2025-05-07T19:43:02.6363235Z cpu MHz : 1517.575 2025-05-07T19:43:02.6363315Z cache size : 36608 KB 2025-05-07T19:43:02.6363393Z physical id : 1 2025-05-07T19:43:02.6363467Z siblings : 48 2025-05-07T19:43:02.6363553Z core id : 13 2025-05-07T19:43:02.6363628Z cpu cores : 24 2025-05-07T19:43:02.6363706Z apicid : 91 2025-05-07T19:43:02.6363798Z initial apicid : 91 2025-05-07T19:43:02.6363871Z fpu : yes 2025-05-07T19:43:02.6363951Z fpu_exception : yes 2025-05-07T19:43:02.6364029Z cpuid level : 13 2025-05-07T19:43:02.6364113Z wp : yes 2025-05-07T19:43:02.6366344Z flags : fpu vme de pse tsc msr pae mce cx8 apic sep mtrr pge mca cmov pat pse36 clflush mmx fxsr sse sse2 ss ht syscall nx pdpe1gb rdtscp lm constant_tsc arch_perfmon rep_good nopl xtopology nonstop_tsc cpuid aperfmperf tsc_known_freq pni pclmulqdq monitor ssse3 fma cx16 pcid sse4_1 sse4_2 x2apic movbe popcnt tsc_deadline_timer aes xsave avx f16c rdrand hypervisor lahf_lm abm 3dnowprefetch invpcid_single pti fsgsbase tsc_adjust bmi1 avx2 smep bmi2 erms invpcid mpx avx512f avx512dq rdseed adx smap clflushopt clwb avx512cd avx512bw avx512vl xsaveopt xsavec xgetbv1 xsaves ida arat pku ospke avx512_vnni 
2025-05-07T19:43:02.6366713Z bugs : cpu_meltdown spectre_v1 spectre_v2 spec_store_bypass l1tf mds swapgs itlb_multihit mmio_stale_data retbleed gds bhi 2025-05-07T19:43:02.6366790Z bogomips : 5999.97 2025-05-07T19:43:02.6366868Z clflush size : 64 2025-05-07T19:43:02.6366946Z cache_alignment : 64 2025-05-07T19:43:02.6367074Z address sizes : 46 bits physical, 48 bits virtual 2025-05-07T19:43:02.6367153Z power management: 2025-05-07T19:43:02.6367158Z 2025-05-07T19:43:02.6367229Z processor : 86 2025-05-07T19:43:02.6367319Z vendor_id : GenuineIntel 2025-05-07T19:43:02.6367394Z cpu family : 6 2025-05-07T19:43:02.6367467Z model : 85 2025-05-07T19:43:02.6367617Z model name : Intel(R) Xeon(R) Platinum 8275CL CPU @ 3.00GHz 2025-05-07T19:43:02.6367698Z stepping : 7 2025-05-07T19:43:02.6367773Z microcode : 0x5003901 2025-05-07T19:43:02.6367954Z cpu MHz : 1476.984 2025-05-07T19:43:02.6368041Z cache size : 36608 KB 2025-05-07T19:43:02.6368115Z physical id : 1 2025-05-07T19:43:02.6368184Z siblings : 48 2025-05-07T19:43:02.6368253Z core id : 14 2025-05-07T19:43:02.6368337Z cpu cores : 24 2025-05-07T19:43:02.6368406Z apicid : 93 2025-05-07T19:43:02.6368480Z initial apicid : 93 2025-05-07T19:43:02.6368546Z fpu : yes 2025-05-07T19:43:02.6368629Z fpu_exception : yes 2025-05-07T19:43:02.6368698Z cpuid level : 13 2025-05-07T19:43:02.6368767Z wp : yes 2025-05-07T19:43:02.6370824Z flags : fpu vme de pse tsc msr pae mce cx8 apic sep mtrr pge mca cmov pat pse36 clflush mmx fxsr sse sse2 ss ht syscall nx pdpe1gb rdtscp lm constant_tsc arch_perfmon rep_good nopl xtopology nonstop_tsc cpuid aperfmperf tsc_known_freq pni pclmulqdq monitor ssse3 fma cx16 pcid sse4_1 sse4_2 x2apic movbe popcnt tsc_deadline_timer aes xsave avx f16c rdrand hypervisor lahf_lm abm 3dnowprefetch invpcid_single pti fsgsbase tsc_adjust bmi1 avx2 smep bmi2 erms invpcid mpx avx512f avx512dq rdseed adx smap clflushopt clwb avx512cd avx512bw avx512vl xsaveopt xsavec xgetbv1 xsaves ida arat pku ospke avx512_vnni 
2025-05-07T19:43:02.6371191Z bugs : cpu_meltdown spectre_v1 spectre_v2 spec_store_bypass l1tf mds swapgs itlb_multihit mmio_stale_data retbleed gds bhi 2025-05-07T19:43:02.6371273Z bogomips : 5999.97 2025-05-07T19:43:02.6371347Z clflush size : 64 2025-05-07T19:43:02.6371425Z cache_alignment : 64 2025-05-07T19:43:02.6371544Z address sizes : 46 bits physical, 48 bits virtual 2025-05-07T19:43:02.6371630Z power management: 2025-05-07T19:43:02.6371634Z 2025-05-07T19:43:02.6371708Z processor : 87 2025-05-07T19:43:02.6371790Z vendor_id : GenuineIntel 2025-05-07T19:43:02.6371869Z cpu family : 6 2025-05-07T19:43:02.6371938Z model : 85 2025-05-07T19:43:02.6372085Z model name : Intel(R) Xeon(R) Platinum 8275CL CPU @ 3.00GHz 2025-05-07T19:43:02.6372204Z stepping : 7 2025-05-07T19:43:02.6372287Z microcode : 0x5003901 2025-05-07T19:43:02.6372359Z cpu MHz : 2999.988 2025-05-07T19:43:02.6372437Z cache size : 36608 KB 2025-05-07T19:43:02.6372521Z physical id : 1 2025-05-07T19:43:02.6372595Z siblings : 48 2025-05-07T19:43:02.6372667Z core id : 15 2025-05-07T19:43:02.6372740Z cpu cores : 24 2025-05-07T19:43:02.6372817Z apicid : 95 2025-05-07T19:43:02.6372893Z initial apicid : 95 2025-05-07T19:43:02.6372965Z fpu : yes 2025-05-07T19:43:02.6373046Z fpu_exception : yes 2025-05-07T19:43:02.6373127Z cpuid level : 13 2025-05-07T19:43:02.6373195Z wp : yes 2025-05-07T19:43:02.6375211Z flags : fpu vme de pse tsc msr pae mce cx8 apic sep mtrr pge mca cmov pat pse36 clflush mmx fxsr sse sse2 ss ht syscall nx pdpe1gb rdtscp lm constant_tsc arch_perfmon rep_good nopl xtopology nonstop_tsc cpuid aperfmperf tsc_known_freq pni pclmulqdq monitor ssse3 fma cx16 pcid sse4_1 sse4_2 x2apic movbe popcnt tsc_deadline_timer aes xsave avx f16c rdrand hypervisor lahf_lm abm 3dnowprefetch invpcid_single pti fsgsbase tsc_adjust bmi1 avx2 smep bmi2 erms invpcid mpx avx512f avx512dq rdseed adx smap clflushopt clwb avx512cd avx512bw avx512vl xsaveopt xsavec xgetbv1 xsaves ida arat pku ospke avx512_vnni 
2025-05-07T19:43:02.6375587Z bugs : cpu_meltdown spectre_v1 spectre_v2 spec_store_bypass l1tf mds swapgs itlb_multihit mmio_stale_data retbleed gds bhi 2025-05-07T19:43:02.6375662Z bogomips : 5999.97 2025-05-07T19:43:02.6375739Z clflush size : 64 2025-05-07T19:43:02.6375831Z cache_alignment : 64 2025-05-07T19:43:02.6375948Z address sizes : 46 bits physical, 48 bits virtual 2025-05-07T19:43:02.6376022Z power management: 2025-05-07T19:43:02.6376027Z 2025-05-07T19:43:02.6376112Z processor : 88 2025-05-07T19:43:02.6376194Z vendor_id : GenuineIntel 2025-05-07T19:43:02.6376266Z cpu family : 6 2025-05-07T19:43:02.6376336Z model : 85 2025-05-07T19:43:02.6376499Z model name : Intel(R) Xeon(R) Platinum 8275CL CPU @ 3.00GHz 2025-05-07T19:43:02.6376569Z stepping : 7 2025-05-07T19:43:02.6376651Z microcode : 0x5003901 2025-05-07T19:43:02.6376734Z cpu MHz : 2999.988 2025-05-07T19:43:02.6376807Z cache size : 36608 KB 2025-05-07T19:43:02.6376937Z physical id : 1 2025-05-07T19:43:02.6377014Z siblings : 48 2025-05-07T19:43:02.6377099Z core id : 16 2025-05-07T19:43:02.6377168Z cpu cores : 24 2025-05-07T19:43:02.6377241Z apicid : 97 2025-05-07T19:43:02.6377328Z initial apicid : 97 2025-05-07T19:43:02.6377394Z fpu : yes 2025-05-07T19:43:02.6377469Z fpu_exception : yes 2025-05-07T19:43:02.6377542Z cpuid level : 13 2025-05-07T19:43:02.6377621Z wp : yes 2025-05-07T19:43:02.6379631Z flags : fpu vme de pse tsc msr pae mce cx8 apic sep mtrr pge mca cmov pat pse36 clflush mmx fxsr sse sse2 ss ht syscall nx pdpe1gb rdtscp lm constant_tsc arch_perfmon rep_good nopl xtopology nonstop_tsc cpuid aperfmperf tsc_known_freq pni pclmulqdq monitor ssse3 fma cx16 pcid sse4_1 sse4_2 x2apic movbe popcnt tsc_deadline_timer aes xsave avx f16c rdrand hypervisor lahf_lm abm 3dnowprefetch invpcid_single pti fsgsbase tsc_adjust bmi1 avx2 smep bmi2 erms invpcid mpx avx512f avx512dq rdseed adx smap clflushopt clwb avx512cd avx512bw avx512vl xsaveopt xsavec xgetbv1 xsaves ida arat pku ospke avx512_vnni 
2025-05-07T19:43:02.6380007Z bugs : cpu_meltdown spectre_v1 spectre_v2 spec_store_bypass l1tf mds swapgs itlb_multihit mmio_stale_data retbleed gds bhi 2025-05-07T19:43:02.6380083Z bogomips : 5999.97 2025-05-07T19:43:02.6380157Z clflush size : 64 2025-05-07T19:43:02.6380234Z cache_alignment : 64 2025-05-07T19:43:02.6380362Z address sizes : 46 bits physical, 48 bits virtual 2025-05-07T19:43:02.6380436Z power management: 2025-05-07T19:43:02.6380440Z 2025-05-07T19:43:02.6380508Z processor : 89 2025-05-07T19:43:02.6380593Z vendor_id : GenuineIntel 2025-05-07T19:43:02.6380665Z cpu family : 6 2025-05-07T19:43:02.6380735Z model : 85 2025-05-07T19:43:02.6380878Z model name : Intel(R) Xeon(R) Platinum 8275CL CPU @ 3.00GHz 2025-05-07T19:43:02.6380956Z stepping : 7 2025-05-07T19:43:02.6381034Z microcode : 0x5003901 2025-05-07T19:43:02.6381153Z cpu MHz : 2999.988 2025-05-07T19:43:02.6381233Z cache size : 36608 KB 2025-05-07T19:43:02.6381307Z physical id : 1 2025-05-07T19:43:02.6381383Z siblings : 48 2025-05-07T19:43:02.6381452Z core id : 17 2025-05-07T19:43:02.6381530Z cpu cores : 24 2025-05-07T19:43:02.6381600Z apicid : 99 2025-05-07T19:43:02.6381676Z initial apicid : 99 2025-05-07T19:43:02.6381751Z fpu : yes 2025-05-07T19:43:02.6381827Z fpu_exception : yes 2025-05-07T19:43:02.6381899Z cpuid level : 13 2025-05-07T19:43:02.6381967Z wp : yes 2025-05-07T19:43:02.6383995Z flags : fpu vme de pse tsc msr pae mce cx8 apic sep mtrr pge mca cmov pat pse36 clflush mmx fxsr sse sse2 ss ht syscall nx pdpe1gb rdtscp lm constant_tsc arch_perfmon rep_good nopl xtopology nonstop_tsc cpuid aperfmperf tsc_known_freq pni pclmulqdq monitor ssse3 fma cx16 pcid sse4_1 sse4_2 x2apic movbe popcnt tsc_deadline_timer aes xsave avx f16c rdrand hypervisor lahf_lm abm 3dnowprefetch invpcid_single pti fsgsbase tsc_adjust bmi1 avx2 smep bmi2 erms invpcid mpx avx512f avx512dq rdseed adx smap clflushopt clwb avx512cd avx512bw avx512vl xsaveopt xsavec xgetbv1 xsaves ida arat pku ospke avx512_vnni 
2025-05-07T19:43:02.6384360Z bugs : cpu_meltdown spectre_v1 spectre_v2 spec_store_bypass l1tf mds swapgs itlb_multihit mmio_stale_data retbleed gds bhi 2025-05-07T19:43:02.6384446Z bogomips : 5999.97 2025-05-07T19:43:02.6384519Z clflush size : 64 2025-05-07T19:43:02.6384599Z cache_alignment : 64 2025-05-07T19:43:02.6384718Z address sizes : 46 bits physical, 48 bits virtual 2025-05-07T19:43:02.6384804Z power management: 2025-05-07T19:43:02.6384808Z 2025-05-07T19:43:02.6384880Z processor : 90 2025-05-07T19:43:02.6384960Z vendor_id : GenuineIntel 2025-05-07T19:43:02.6385041Z cpu family : 6 2025-05-07T19:43:02.6385109Z model : 85 2025-05-07T19:43:02.6385254Z model name : Intel(R) Xeon(R) Platinum 8275CL CPU @ 3.00GHz 2025-05-07T19:43:02.6385326Z stepping : 7 2025-05-07T19:43:02.6385408Z microcode : 0x5003901 2025-05-07T19:43:02.6385480Z cpu MHz : 2034.881 2025-05-07T19:43:02.6385560Z cache size : 36608 KB 2025-05-07T19:43:02.6385644Z physical id : 1 2025-05-07T19:43:02.6385713Z siblings : 48 2025-05-07T19:43:02.6385834Z core id : 18 2025-05-07T19:43:02.6385904Z cpu cores : 24 2025-05-07T19:43:02.6385980Z apicid : 101 2025-05-07T19:43:02.6386057Z initial apicid : 101 2025-05-07T19:43:02.6386125Z fpu : yes 2025-05-07T19:43:02.6386208Z fpu_exception : yes 2025-05-07T19:43:02.6386280Z cpuid level : 13 2025-05-07T19:43:02.6386348Z wp : yes 2025-05-07T19:43:02.6388366Z flags : fpu vme de pse tsc msr pae mce cx8 apic sep mtrr pge mca cmov pat pse36 clflush mmx fxsr sse sse2 ss ht syscall nx pdpe1gb rdtscp lm constant_tsc arch_perfmon rep_good nopl xtopology nonstop_tsc cpuid aperfmperf tsc_known_freq pni pclmulqdq monitor ssse3 fma cx16 pcid sse4_1 sse4_2 x2apic movbe popcnt tsc_deadline_timer aes xsave avx f16c rdrand hypervisor lahf_lm abm 3dnowprefetch invpcid_single pti fsgsbase tsc_adjust bmi1 avx2 smep bmi2 erms invpcid mpx avx512f avx512dq rdseed adx smap clflushopt clwb avx512cd avx512bw avx512vl xsaveopt xsavec xgetbv1 xsaves ida arat pku ospke avx512_vnni 
2025-05-07T19:43:02.6388727Z bugs : cpu_meltdown spectre_v1 spectre_v2 spec_store_bypass l1tf mds swapgs itlb_multihit mmio_stale_data retbleed gds bhi 2025-05-07T19:43:02.6388805Z bogomips : 5999.97 2025-05-07T19:43:02.6388887Z clflush size : 64 2025-05-07T19:43:02.6388965Z cache_alignment : 64 2025-05-07T19:43:02.6389083Z address sizes : 46 bits physical, 48 bits virtual 2025-05-07T19:43:02.6389159Z power management: 2025-05-07T19:43:02.6389163Z 2025-05-07T19:43:02.6389242Z processor : 91 2025-05-07T19:43:02.6389321Z vendor_id : GenuineIntel 2025-05-07T19:43:02.6389392Z cpu family : 6 2025-05-07T19:43:02.6389468Z model : 85 2025-05-07T19:43:02.6389612Z model name : Intel(R) Xeon(R) Platinum 8275CL CPU @ 3.00GHz 2025-05-07T19:43:02.6389681Z stepping : 7 2025-05-07T19:43:02.6389755Z microcode : 0x5003901 2025-05-07T19:43:02.6389835Z cpu MHz : 1922.307 2025-05-07T19:43:02.6389908Z cache size : 36608 KB 2025-05-07T19:43:02.6390027Z physical id : 1 2025-05-07T19:43:02.6390104Z siblings : 48 2025-05-07T19:43:02.6390173Z core id : 19 2025-05-07T19:43:02.6390244Z cpu cores : 24 2025-05-07T19:43:02.6390315Z apicid : 103 2025-05-07T19:43:02.6390400Z initial apicid : 103 2025-05-07T19:43:02.6390469Z fpu : yes 2025-05-07T19:43:02.6390544Z fpu_exception : yes 2025-05-07T19:43:02.6390623Z cpuid level : 13 2025-05-07T19:43:02.6390689Z wp : yes 2025-05-07T19:43:02.6392767Z flags : fpu vme de pse tsc msr pae mce cx8 apic sep mtrr pge mca cmov pat pse36 clflush mmx fxsr sse sse2 ss ht syscall nx pdpe1gb rdtscp lm constant_tsc arch_perfmon rep_good nopl xtopology nonstop_tsc cpuid aperfmperf tsc_known_freq pni pclmulqdq monitor ssse3 fma cx16 pcid sse4_1 sse4_2 x2apic movbe popcnt tsc_deadline_timer aes xsave avx f16c rdrand hypervisor lahf_lm abm 3dnowprefetch invpcid_single pti fsgsbase tsc_adjust bmi1 avx2 smep bmi2 erms invpcid mpx avx512f avx512dq rdseed adx smap clflushopt clwb avx512cd avx512bw avx512vl xsaveopt xsavec xgetbv1 xsaves ida arat pku ospke avx512_vnni 
2025-05-07T19:43:02.6393322Z bugs : cpu_meltdown spectre_v1 spectre_v2 spec_store_bypass l1tf mds swapgs itlb_multihit mmio_stale_data retbleed gds bhi 2025-05-07T19:43:02.6393406Z bogomips : 5999.97 2025-05-07T19:43:02.6393488Z clflush size : 64 2025-05-07T19:43:02.6393582Z cache_alignment : 64 2025-05-07T19:43:02.6393709Z address sizes : 46 bits physical, 48 bits virtual 2025-05-07T19:43:02.6393790Z power management: 2025-05-07T19:43:02.6393795Z 2025-05-07T19:43:02.6393873Z processor : 92 2025-05-07T19:43:02.6393965Z vendor_id : GenuineIntel 2025-05-07T19:43:02.6394040Z cpu family : 6 2025-05-07T19:43:02.6394133Z model : 85 2025-05-07T19:43:02.6394296Z model name : Intel(R) Xeon(R) Platinum 8275CL CPU @ 3.00GHz 2025-05-07T19:43:02.6394372Z stepping : 7 2025-05-07T19:43:02.6394450Z microcode : 0x5003901 2025-05-07T19:43:02.6394526Z cpu MHz : 1311.408 2025-05-07T19:43:02.6394612Z cache size : 36608 KB 2025-05-07T19:43:02.6394688Z physical id : 1 2025-05-07T19:43:02.6394769Z siblings : 48 2025-05-07T19:43:02.6394850Z core id : 20 2025-05-07T19:43:02.6394923Z cpu cores : 24 2025-05-07T19:43:02.6395055Z apicid : 105 2025-05-07T19:43:02.6395137Z initial apicid : 105 2025-05-07T19:43:02.6395221Z fpu : yes 2025-05-07T19:43:02.6395300Z fpu_exception : yes 2025-05-07T19:43:02.6395377Z cpuid level : 13 2025-05-07T19:43:02.6395449Z wp : yes 2025-05-07T19:43:02.6397638Z flags : fpu vme de pse tsc msr pae mce cx8 apic sep mtrr pge mca cmov pat pse36 clflush mmx fxsr sse sse2 ss ht syscall nx pdpe1gb rdtscp lm constant_tsc arch_perfmon rep_good nopl xtopology nonstop_tsc cpuid aperfmperf tsc_known_freq pni pclmulqdq monitor ssse3 fma cx16 pcid sse4_1 sse4_2 x2apic movbe popcnt tsc_deadline_timer aes xsave avx f16c rdrand hypervisor lahf_lm abm 3dnowprefetch invpcid_single pti fsgsbase tsc_adjust bmi1 avx2 smep bmi2 erms invpcid mpx avx512f avx512dq rdseed adx smap clflushopt clwb avx512cd avx512bw avx512vl xsaveopt xsavec xgetbv1 xsaves ida arat pku ospke avx512_vnni 
2025-05-07T19:43:02.6398031Z bugs : cpu_meltdown spectre_v1 spectre_v2 spec_store_bypass l1tf mds swapgs itlb_multihit mmio_stale_data retbleed gds bhi 2025-05-07T19:43:02.6398121Z bogomips : 5999.97 2025-05-07T19:43:02.6398197Z clflush size : 64 2025-05-07T19:43:02.6398276Z cache_alignment : 64 2025-05-07T19:43:02.6398413Z address sizes : 46 bits physical, 48 bits virtual 2025-05-07T19:43:02.6398490Z power management: 2025-05-07T19:43:02.6398495Z 2025-05-07T19:43:02.6398571Z processor : 93 2025-05-07T19:43:02.6398655Z vendor_id : GenuineIntel 2025-05-07T19:43:02.6398737Z cpu family : 6 2025-05-07T19:43:02.6398809Z model : 85 2025-05-07T19:43:02.6398963Z model name : Intel(R) Xeon(R) Platinum 8275CL CPU @ 3.00GHz 2025-05-07T19:43:02.6399046Z stepping : 7 2025-05-07T19:43:02.6399124Z microcode : 0x5003901 2025-05-07T19:43:02.6399199Z cpu MHz : 2999.988 2025-05-07T19:43:02.6399278Z cache size : 36608 KB 2025-05-07T19:43:02.6399362Z physical id : 1 2025-05-07T19:43:02.6399438Z siblings : 48 2025-05-07T19:43:02.6399510Z core id : 21 2025-05-07T19:43:02.6399657Z cpu cores : 24 2025-05-07T19:43:02.6399733Z apicid : 107 2025-05-07T19:43:02.6399816Z initial apicid : 107 2025-05-07T19:43:02.6399889Z fpu : yes 2025-05-07T19:43:02.6399975Z fpu_exception : yes 2025-05-07T19:43:02.6400052Z cpuid level : 13 2025-05-07T19:43:02.6400124Z wp : yes 2025-05-07T19:43:02.6402446Z flags : fpu vme de pse tsc msr pae mce cx8 apic sep mtrr pge mca cmov pat pse36 clflush mmx fxsr sse sse2 ss ht syscall nx pdpe1gb rdtscp lm constant_tsc arch_perfmon rep_good nopl xtopology nonstop_tsc cpuid aperfmperf tsc_known_freq pni pclmulqdq monitor ssse3 fma cx16 pcid sse4_1 sse4_2 x2apic movbe popcnt tsc_deadline_timer aes xsave avx f16c rdrand hypervisor lahf_lm abm 3dnowprefetch invpcid_single pti fsgsbase tsc_adjust bmi1 avx2 smep bmi2 erms invpcid mpx avx512f avx512dq rdseed adx smap clflushopt clwb avx512cd avx512bw avx512vl xsaveopt xsavec xgetbv1 xsaves ida arat pku ospke avx512_vnni 
2025-05-07T19:43:02.6402843Z bugs : cpu_meltdown spectre_v1 spectre_v2 spec_store_bypass l1tf mds swapgs itlb_multihit mmio_stale_data retbleed gds bhi 2025-05-07T19:43:02.6402922Z bogomips : 5999.97 2025-05-07T19:43:02.6403009Z clflush size : 64 2025-05-07T19:43:02.6403091Z cache_alignment : 64 2025-05-07T19:43:02.6403218Z address sizes : 46 bits physical, 48 bits virtual 2025-05-07T19:43:02.6403297Z power management: 2025-05-07T19:43:02.6403308Z 2025-05-07T19:43:02.6403384Z processor : 94 2025-05-07T19:43:02.6403467Z vendor_id : GenuineIntel 2025-05-07T19:43:02.6403542Z cpu family : 6 2025-05-07T19:43:02.6403621Z model : 85 2025-05-07T19:43:02.6403778Z model name : Intel(R) Xeon(R) Platinum 8275CL CPU @ 3.00GHz 2025-05-07T19:43:02.6403854Z stepping : 7 2025-05-07T19:43:02.6403942Z microcode : 0x5003901 2025-05-07T19:43:02.6404016Z cpu MHz : 2999.988 2025-05-07T19:43:02.6404095Z cache size : 36608 KB 2025-05-07T19:43:02.6404173Z physical id : 1 2025-05-07T19:43:02.6404254Z siblings : 48 2025-05-07T19:43:02.6404326Z core id : 22 2025-05-07T19:43:02.6404400Z cpu cores : 24 2025-05-07T19:43:02.6404476Z apicid : 109 2025-05-07T19:43:02.6404596Z initial apicid : 109 2025-05-07T19:43:02.6404972Z fpu : yes 2025-05-07T19:43:02.6405098Z fpu_exception : yes 2025-05-07T19:43:02.6405218Z cpuid level : 13 2025-05-07T19:43:02.6405330Z wp : yes 2025-05-07T19:43:02.6407667Z flags : fpu vme de pse tsc msr pae mce cx8 apic sep mtrr pge mca cmov pat pse36 clflush mmx fxsr sse sse2 ss ht syscall nx pdpe1gb rdtscp lm constant_tsc arch_perfmon rep_good nopl xtopology nonstop_tsc cpuid aperfmperf tsc_known_freq pni pclmulqdq monitor ssse3 fma cx16 pcid sse4_1 sse4_2 x2apic movbe popcnt tsc_deadline_timer aes xsave avx f16c rdrand hypervisor lahf_lm abm 3dnowprefetch invpcid_single pti fsgsbase tsc_adjust bmi1 avx2 smep bmi2 erms invpcid mpx avx512f avx512dq rdseed adx smap clflushopt clwb avx512cd avx512bw avx512vl xsaveopt xsavec xgetbv1 xsaves ida arat pku ospke avx512_vnni 
2025-05-07T19:43:02.6408079Z bugs : cpu_meltdown spectre_v1 spectre_v2 spec_store_bypass l1tf mds swapgs itlb_multihit mmio_stale_data retbleed gds bhi 2025-05-07T19:43:02.6408294Z bogomips : 5999.97 2025-05-07T19:43:02.6408412Z clflush size : 64 2025-05-07T19:43:02.6408542Z cache_alignment : 64 2025-05-07T19:43:02.6408719Z address sizes : 46 bits physical, 48 bits virtual 2025-05-07T19:43:02.6408883Z power management: 2025-05-07T19:43:02.6408888Z 2025-05-07T19:43:02.6408982Z processor : 95 2025-05-07T19:43:02.6409143Z vendor_id : GenuineIntel 2025-05-07T19:43:02.6409319Z cpu family : 6 2025-05-07T19:43:02.6409433Z model : 85 2025-05-07T19:43:02.6409643Z model name : Intel(R) Xeon(R) Platinum 8275CL CPU @ 3.00GHz 2025-05-07T19:43:02.6409804Z stepping : 7 2025-05-07T19:43:02.6409903Z microcode : 0x5003901 2025-05-07T19:43:02.6410044Z cpu MHz : 2999.988 2025-05-07T19:43:02.6410179Z cache size : 36608 KB 2025-05-07T19:43:02.6410348Z physical id : 1 2025-05-07T19:43:02.6410460Z siblings : 48 2025-05-07T19:43:02.6410572Z core id : 23 2025-05-07T19:43:02.6410708Z cpu cores : 24 2025-05-07T19:43:02.6410849Z apicid : 111 2025-05-07T19:43:02.6411063Z initial apicid : 111 2025-05-07T19:43:02.6411186Z fpu : yes 2025-05-07T19:43:02.6411349Z fpu_exception : yes 2025-05-07T19:43:02.6411531Z cpuid level : 13 2025-05-07T19:43:02.6411642Z wp : yes 2025-05-07T19:43:02.6414002Z flags : fpu vme de pse tsc msr pae mce cx8 apic sep mtrr pge mca cmov pat pse36 clflush mmx fxsr sse sse2 ss ht syscall nx pdpe1gb rdtscp lm constant_tsc arch_perfmon rep_good nopl xtopology nonstop_tsc cpuid aperfmperf tsc_known_freq pni pclmulqdq monitor ssse3 fma cx16 pcid sse4_1 sse4_2 x2apic movbe popcnt tsc_deadline_timer aes xsave avx f16c rdrand hypervisor lahf_lm abm 3dnowprefetch invpcid_single pti fsgsbase tsc_adjust bmi1 avx2 smep bmi2 erms invpcid mpx avx512f avx512dq rdseed adx smap clflushopt clwb avx512cd avx512bw avx512vl xsaveopt xsavec xgetbv1 xsaves ida arat pku ospke avx512_vnni 
2025-05-07T19:43:02.6414424Z bugs : cpu_meltdown spectre_v1 spectre_v2 spec_store_bypass l1tf mds swapgs itlb_multihit mmio_stale_data retbleed gds bhi 2025-05-07T19:43:02.6414539Z bogomips : 5999.97 2025-05-07T19:43:02.6414693Z clflush size : 64 2025-05-07T19:43:02.6414806Z cache_alignment : 64 2025-05-07T19:43:02.6414966Z address sizes : 46 bits physical, 48 bits virtual 2025-05-07T19:43:02.6415133Z power management: 2025-05-07T19:43:02.6415138Z 2025-05-07T19:43:02.6415142Z 2025-05-07T19:43:02.6415307Z ################################################################################ 2025-05-07T19:43:02.6415431Z [INFO] Print PCI info ... 2025-05-07T19:43:02.6415583Z + lspci -v 2025-05-07T19:43:02.6415587Z 2025-05-07T19:43:02.6415793Z 00:00.0 Host bridge: Intel Corporation 440FX - 82441FX PMC [Natoma] 2025-05-07T19:43:02.6415927Z Subsystem: Amazon.com, Inc. Device 1237 2025-05-07T19:43:02.6416124Z Flags: bus master, medium devsel, latency 0 2025-05-07T19:43:02.6416129Z 2025-05-07T19:43:02.6416375Z 00:01.0 ISA bridge: Intel Corporation 82371SB PIIX3 ISA [Natoma/Triton II] 2025-05-07T19:43:02.6416486Z Physical Slot: 1 2025-05-07T19:43:02.6416625Z Flags: bus master, fast devsel, latency 0 2025-05-07T19:43:02.6416630Z 2025-05-07T19:43:02.6416950Z 00:01.3 Non-VGA unclassified device: Intel Corporation 82371AB/EB/MB PIIX4 ACPI (rev 08) 2025-05-07T19:43:02.6417111Z Physical Slot: 1 2025-05-07T19:43:02.6417251Z Flags: bus master, fast devsel, latency 0, IRQ 9 2025-05-07T19:43:02.6417255Z 2025-05-07T19:43:02.6417640Z 00:03.0 VGA compatible controller: Amazon.com, Inc. 
Device 1111 (prog-if 00 [VGA controller]) 2025-05-07T19:43:02.6417751Z Physical Slot: 3 2025-05-07T19:43:02.6417957Z Flags: bus master, fast devsel, latency 0 2025-05-07T19:43:02.6418158Z Memory at c0000000 (32-bit, prefetchable) [size=4M] 2025-05-07T19:43:02.6418319Z Expansion ROM at 000c0000 [disabled] [size=128K] 2025-05-07T19:43:02.6418323Z 2025-05-07T19:43:02.6418638Z 00:04.0 Non-Volatile memory controller: Amazon.com, Inc. NVMe EBS Controller (prog-if 02 [NVM Express]) 2025-05-07T19:43:02.6418863Z Subsystem: Amazon.com, Inc. Device 0000 2025-05-07T19:43:02.6418975Z Physical Slot: 4 2025-05-07T19:43:02.6419133Z Flags: bus master, fast devsel, latency 0, IRQ 11 2025-05-07T19:43:02.6419312Z Memory at c0514000 (32-bit, non-prefetchable) [size=16K] 2025-05-07T19:43:02.6419490Z Capabilities: 2025-05-07T19:43:02.6419589Z Kernel driver in use: nvme 2025-05-07T19:43:02.6419593Z 2025-05-07T19:43:02.6419862Z 00:05.0 Ethernet controller: Amazon.com, Inc. Elastic Network Adapter (ENA) 2025-05-07T19:43:02.6420030Z Physical Slot: 5 2025-05-07T19:43:02.6420167Z Flags: bus master, fast devsel, latency 0 2025-05-07T19:43:02.6420345Z Memory at c0510000 (32-bit, non-prefetchable) [size=16K] 2025-05-07T19:43:02.6420552Z Memory at c0400000 (32-bit, prefetchable) [size=1M] 2025-05-07T19:43:02.6420705Z Memory at c0500000 (32-bit, non-prefetchable) [size=64K] 2025-05-07T19:43:02.6420861Z Capabilities: 2025-05-07T19:43:02.6421037Z Kernel driver in use: ena 2025-05-07T19:43:02.6421042Z 2025-05-07T19:43:02.6421045Z 2025-05-07T19:43:02.6421234Z ################################################################################ 2025-05-07T19:43:02.6421378Z [INFO] Print Linux distribution info ... 
2025-05-07T19:43:02.6421490Z + uname -a 2025-05-07T19:43:02.6421537Z 2025-05-07T19:43:02.6421917Z Linux 2b02554cc611 6.1.130-139.222.amzn2023.x86_64 #1 SMP PREEMPT_DYNAMIC Tue Mar 11 01:10:58 UTC 2025 x86_64 x86_64 x86_64 GNU/Linux 2025-05-07T19:43:02.6421922Z 2025-05-07T19:43:02.6422068Z + uname -m 2025-05-07T19:43:02.6422073Z 2025-05-07T19:43:02.6422303Z x86_64 2025-05-07T19:43:02.6422308Z 2025-05-07T19:43:02.6422425Z + cat /proc/version 2025-05-07T19:43:02.6422430Z 2025-05-07T19:43:02.6423021Z Linux version 6.1.130-139.222.amzn2023.x86_64 (mockbuild@ip-10-0-55-76) (gcc (GCC) 11.5.0 20240719 (Red Hat 11.5.0-5), GNU ld version 2.39-6.amzn2023.0.11) #1 SMP PREEMPT_DYNAMIC Tue Mar 11 01:10:58 UTC 2025 2025-05-07T19:43:02.6423026Z 2025-05-07T19:43:02.6423181Z + cat /etc/os-release 2025-05-07T19:43:02.6423185Z 2025-05-07T19:43:02.6423275Z NAME="Amazon Linux" 2025-05-07T19:43:02.6423410Z VERSION="2023" 2025-05-07T19:43:02.6423578Z ID="amzn" 2025-05-07T19:43:02.6423695Z ID_LIKE="fedora" 2025-05-07T19:43:02.6423807Z VERSION_ID="2023" 2025-05-07T19:43:02.6423935Z PLATFORM_ID="platform:al2023" 2025-05-07T19:43:02.6424093Z PRETTY_NAME="Amazon Linux 2023.7.20250428" 2025-05-07T19:43:02.6424226Z ANSI_COLOR="0;33" 2025-05-07T19:43:02.6424392Z CPE_NAME="cpe:2.3:o:amazon:amazon_linux:2023" 2025-05-07T19:43:02.6424650Z HOME_URL="https://aws.amazon.com/linux/amazon-linux-2023/" 2025-05-07T19:43:02.6424840Z DOCUMENTATION_URL="https://docs.aws.amazon.com/linux/" 2025-05-07T19:43:02.6425019Z SUPPORT_URL="https://aws.amazon.com/premiumsupport/" 2025-05-07T19:43:02.6425235Z BUG_REPORT_URL="https://github.com/amazonlinux/amazon-linux-2023" 2025-05-07T19:43:02.6425394Z VENDOR_NAME="AWS" 2025-05-07T19:43:02.6425545Z VENDOR_URL="https://aws.amazon.com/" 2025-05-07T19:43:02.6425670Z SUPPORT_END="2029-06-30" 2025-05-07T19:43:02.6425674Z 2025-05-07T19:43:02.6465545Z ##[group]Run . $PRELUDE; print_gpu_info 2025-05-07T19:43:02.6465730Z . 
$PRELUDE; print_gpu_info 2025-05-07T19:43:02.6466063Z shell: bash --noprofile --norc -e -o pipefail {0} 2025-05-07T19:43:02.6474274Z env: 2025-05-07T19:43:02.6474465Z PRELUDE: .github/scripts/setup_env.bash 2025-05-07T19:43:02.6474558Z BUILD_ENV: build_binary 2025-05-07T19:43:02.6474641Z BUILD_TARGET: default 2025-05-07T19:43:02.6474729Z BUILD_VARIANT: cuda 2025-05-07T19:43:02.6474830Z BUILD_CUDA_VERSION: 11.8.0 2025-05-07T19:43:02.6474908Z ##[endgroup] 2025-05-07T19:43:03.0779610Z ################################################################################ 2025-05-07T19:43:03.0780257Z [INFO] Printing general display info ... 2025-05-07T19:43:03.0794620Z [EXEC] [ATTEMPT 0/3] + wget -q --timeout 1 pypi.org -O /dev/null 2025-05-07T19:43:03.1706242Z [CHECK] Network does not appear to be blocked. 2025-05-07T19:43:03.1711951Z /usr/bin/sudo 2025-05-07T19:43:03.1722553Z which: no apt-get in (/usr/local/sbin:/usr/local/bin:/usr/sbin:/usr/bin:/sbin:/bin) 2025-05-07T19:43:03.1727669Z /usr/bin/yum 2025-05-07T19:43:03.1728581Z [INSTALL] Updating system repositories ... 2025-05-07T19:43:03.1751060Z [EXEC] [ATTEMPT 0/3] + sudo yum update -y 2025-05-07T19:43:03.3922673Z Last metadata expiration check: 0:00:18 ago on Wed May 7 19:42:45 2025. 2025-05-07T19:43:03.4875728Z Dependencies resolved. 2025-05-07T19:43:03.5086244Z Nothing to do. 2025-05-07T19:43:03.5781125Z Complete! 2025-05-07T19:43:03.5782087Z [INSTALL] Installing system package(s): hostname lshw ... 2025-05-07T19:43:03.5801036Z [EXEC] [ATTEMPT 0/3] + sudo yum install -y hostname lshw 2025-05-07T19:43:03.7931414Z Last metadata expiration check: 0:00:18 ago on Wed May 7 19:42:45 2025. 2025-05-07T19:43:03.8439194Z Dependencies resolved. 
2025-05-07T19:43:03.8603736Z ================================================================================ 2025-05-07T19:43:03.8605188Z Package Arch Version Repository Size 2025-05-07T19:43:03.8605632Z ================================================================================ 2025-05-07T19:43:03.8606006Z Installing: 2025-05-07T19:43:03.8606378Z hostname x86_64 3.23-4.amzn2023.0.3 amazonlinux 28 k 2025-05-07T19:43:03.8606891Z lshw x86_64 B.02.19.2-7.amzn2023.0.3 amazonlinux 319 k 2025-05-07T19:43:03.8607191Z 2025-05-07T19:43:03.8607322Z Transaction Summary 2025-05-07T19:43:03.8607600Z ================================================================================ 2025-05-07T19:43:03.8607957Z Install 2 Packages 2025-05-07T19:43:03.8608112Z 2025-05-07T19:43:03.8608220Z Total download size: 347 k 2025-05-07T19:43:03.8608534Z Installed size: 883 k 2025-05-07T19:43:03.8608827Z Downloading Packages: 2025-05-07T19:43:04.1555357Z (1/2): lshw-B.02.19.2-7.amzn2023.0.3.x86_64.rpm 13 MB/s | 319 kB 00:00 2025-05-07T19:43:04.1573280Z (2/2): hostname-3.23-4.amzn2023.0.3.x86_64.rpm 1.0 MB/s | 28 kB 00:00 2025-05-07T19:43:04.1577353Z -------------------------------------------------------------------------------- 2025-05-07T19:43:04.1580289Z Total 1.1 MB/s | 347 kB 00:00 2025-05-07T19:43:04.1796168Z Running transaction check 2025-05-07T19:43:04.1850148Z Transaction check succeeded. 2025-05-07T19:43:04.1850726Z Running transaction test 2025-05-07T19:43:04.2000119Z Transaction test succeeded. 
2025-05-07T19:43:04.2001718Z Running transaction 2025-05-07T19:43:04.2270642Z Preparing : 1/1 2025-05-07T19:43:04.2339220Z Installing : lshw-B.02.19.2-7.amzn2023.0.3.x86_64 1/2 2025-05-07T19:43:04.2365753Z Installing : hostname-3.23-4.amzn2023.0.3.x86_64 2/2 2025-05-07T19:43:05.2770019Z Running scriptlet: hostname-3.23-4.amzn2023.0.3.x86_64 2/2 2025-05-07T19:43:05.2772473Z Verifying : hostname-3.23-4.amzn2023.0.3.x86_64 1/2 2025-05-07T19:43:05.3132640Z Verifying : lshw-B.02.19.2-7.amzn2023.0.3.x86_64 2/2 2025-05-07T19:43:05.3133648Z 2025-05-07T19:43:05.3133895Z Installed: 2025-05-07T19:43:05.3135419Z hostname-3.23-4.amzn2023.0.3.x86_64 lshw-B.02.19.2-7.amzn2023.0.3.x86_64 2025-05-07T19:43:05.3136417Z 2025-05-07T19:43:05.3136738Z Complete! 2025-05-07T19:43:05.3467853Z + hostname 2025-05-07T19:43:05.3468213Z 2025-05-07T19:43:05.3471197Z 2b02554cc611 2025-05-07T19:43:05.3471382Z 2025-05-07T19:43:05.3471530Z + sudo lshw -C display 2025-05-07T19:43:05.3471702Z 2025-05-07T19:43:05.5432296Z *-display UNCLAIMED 2025-05-07T19:43:05.5433399Z description: VGA compatible controller 2025-05-07T19:43:05.5434444Z product: Amazon.com, Inc. 2025-05-07T19:43:05.5435254Z vendor: Amazon.com, Inc. 2025-05-07T19:43:05.5436048Z physical id: 3 2025-05-07T19:43:05.5436737Z bus info: pci@0000:00:03.0 2025-05-07T19:43:05.5437610Z version: 00 2025-05-07T19:43:05.5437855Z width: 32 bits 2025-05-07T19:43:05.5438117Z clock: 33MHz 2025-05-07T19:43:05.5438400Z capabilities: vga_controller bus_master 2025-05-07T19:43:05.5438721Z configuration: latency=0 2025-05-07T19:43:05.5439090Z resources: memory:c0000000-c03fffff memory:c0000-dffff 2025-05-07T19:43:05.5452512Z 2025-05-07T19:43:05.5453103Z ################################################################################ 2025-05-07T19:43:05.5454216Z [INFO] Printing NVIDIA GPU info ... 
2025-05-07T19:43:05.5556224Z lspci: Unable to load libkmod resources: error -2 2025-05-07T19:43:05.5581128Z which: no nvidia-smi in (/usr/local/sbin:/usr/local/bin:/usr/sbin:/usr/bin:/sbin:/bin) 2025-05-07T19:43:05.5581822Z [CHECK] nvidia-smi not found 2025-05-07T19:43:05.5582146Z ################################################################################ 2025-05-07T19:43:05.5582521Z [INFO] Printing AMD GPU info ... 2025-05-07T19:43:05.5687983Z lspci: Unable to load libkmod resources: error -2 2025-05-07T19:43:05.5711111Z which: no rocminfo in (/usr/local/sbin:/usr/local/bin:/usr/sbin:/usr/bin:/sbin:/bin) 2025-05-07T19:43:05.5712556Z [CHECK] rocminfo not found 2025-05-07T19:43:05.5716488Z which: no rocm-smi in (/usr/local/sbin:/usr/local/bin:/usr/sbin:/usr/bin:/sbin:/bin) 2025-05-07T19:43:05.5717715Z [CHECK] rocm-smi not found 2025-05-07T19:43:05.5781119Z ##[group]Run . $PRELUDE; setup_miniconda $HOME/miniconda 2025-05-07T19:43:05.5781592Z . $PRELUDE; setup_miniconda $HOME/miniconda 2025-05-07T19:43:05.5782100Z shell: bash --noprofile --norc -e -o pipefail {0} 2025-05-07T19:43:05.5782440Z env: 2025-05-07T19:43:05.5782659Z PRELUDE: .github/scripts/setup_env.bash 2025-05-07T19:43:05.5782987Z BUILD_ENV: build_binary 2025-05-07T19:43:05.5783230Z BUILD_TARGET: default 2025-05-07T19:43:05.5783476Z BUILD_VARIANT: cuda 2025-05-07T19:43:05.5783707Z BUILD_CUDA_VERSION: 11.8.0 2025-05-07T19:43:05.5783973Z ##[endgroup] 2025-05-07T19:43:05.9575524Z ################################################################################ 2025-05-07T19:43:05.9576539Z # Setup Miniconda 2025-05-07T19:43:05.9577168Z # 2025-05-07T19:43:05.9591151Z # [2025-05-07T19:43:05.958Z] + setup_miniconda /github/home/miniconda 2025-05-07T19:43:05.9592517Z ################################################################################ 2025-05-07T19:43:05.9593586Z 2025-05-07T19:43:05.9618238Z [EXEC] [ATTEMPT 0/3] + wget -q --timeout 1 pypi.org -O /dev/null 2025-05-07T19:43:06.0508887Z [CHECK] 
Network does not appear to be blocked. 2025-05-07T19:43:06.0509282Z + mkdir -p /github/home/miniconda 2025-05-07T19:43:06.0509502Z 2025-05-07T19:43:06.0522597Z 2025-05-07T19:43:06.0522814Z [SETUP] Downloading the Miniconda installer ... 2025-05-07T19:43:06.0547266Z [EXEC] [ATTEMPT 0/3] + wget -q https://repo.anaconda.com/miniconda/Miniconda3-latest-Linux-x86_64.sh -O miniconda.sh 2025-05-07T19:43:07.3602529Z [SETUP] Installing Miniconda ... 2025-05-07T19:43:07.3602953Z + bash miniconda.sh -b -p /github/home/miniconda -u 2025-05-07T19:43:07.3603240Z 2025-05-07T19:43:07.3748230Z PREFIX=/github/home/miniconda 2025-05-07T19:43:07.7319075Z Unpacking payload ... 2025-05-07T19:43:08.2131366Z entry_point.py:256: DeprecationWarning: Python 3.14 will, by default, filter extracted tar archives and reject files or modify their metadata. Use the filter argument to control this behavior. 2025-05-07T19:43:08.8795486Z entry_point.py:256: DeprecationWarning: Python 3.14 will, by default, filter extracted tar archives and reject files or modify their metadata. Use the filter argument to control this behavior. 2025-05-07T19:43:10.7260474Z 2025-05-07T19:43:10.7261058Z Installing base environment... 2025-05-07T19:43:10.7261335Z 2025-05-07T19:43:11.7137219Z Preparing transaction: ...working... done 2025-05-07T19:43:14.5481562Z Executing transaction: ...working... done 2025-05-07T19:43:15.0973126Z entry_point.py:256: DeprecationWarning: Python 3.14 will, by default, filter extracted tar archives and reject files or modify their metadata. Use the filter argument to control this behavior. 2025-05-07T19:43:15.1658300Z installation finished. 2025-05-07T19:43:15.1659014Z 2025-05-07T19:43:15.1660577Z + rm -f miniconda.sh 2025-05-07T19:43:15.1660898Z 2025-05-07T19:43:15.1794862Z 2025-05-07T19:43:15.1795252Z [SETUP] Reloading the bash configuration ... 
2025-05-07T19:43:15.5476186Z + /github/home/miniconda/bin/conda init bash 2025-05-07T19:43:15.5476498Z 2025-05-07T19:43:15.5476663Z no change /github/home/miniconda/condabin/conda 2025-05-07T19:43:15.5477099Z no change /github/home/miniconda/bin/conda 2025-05-07T19:43:15.5477587Z no change /github/home/miniconda/bin/conda-env 2025-05-07T19:43:15.5477977Z no change /github/home/miniconda/bin/activate 2025-05-07T19:43:15.5478352Z no change /github/home/miniconda/bin/deactivate 2025-05-07T19:43:15.5478925Z no change /github/home/miniconda/etc/profile.d/conda.sh 2025-05-07T19:43:15.5479358Z no change /github/home/miniconda/etc/fish/conf.d/conda.fish 2025-05-07T19:43:15.5479837Z no change /github/home/miniconda/shell/condabin/Conda.psm1 2025-05-07T19:43:15.5480330Z no change /github/home/miniconda/shell/condabin/conda-hook.ps1 2025-05-07T19:43:15.5480877Z no change /github/home/miniconda/lib/python3.13/site-packages/xontrib/conda.xsh 2025-05-07T19:43:15.5481792Z no change /github/home/miniconda/etc/profile.d/conda.csh 2025-05-07T19:43:15.5482160Z modified /github/home/.bashrc 2025-05-07T19:43:15.5482367Z 2025-05-07T19:43:15.5482576Z ==> For changes to take effect, close and re-open your current shell. <== 2025-05-07T19:43:15.5482888Z 2025-05-07T19:43:15.6009321Z 2025-05-07T19:43:15.6009600Z + . /github/home/.bashrc 2025-05-07T19:43:15.6009808Z 2025-05-07T19:43:16.3897786Z 2025-05-07T19:43:16.3899218Z [SETUP] Installing libmamba-solver (required since Anaconda 2024.02-1) and libarchive ... 
2025-05-07T19:43:16.3925353Z [EXEC] [ATTEMPT 0/3] + conda install --solver=classic -c conda-forge --override-channels -y conda-libmamba-solver libmamba libmambapy libarchive 2025-05-07T19:43:28.0359573Z Collecting package metadata (current_repodata.json): - \ | / - \ | / - \ | / - \ | / - \ | / - \ | / - \ | / - \ | / - \ | / - \ | / - \ | / - \ | / - \ | / - \ | / - \ | / - \ | / - \ | / - \ | / - \ | / - \ | / - \ | / - \ | / - \ | / done 2025-05-07T19:43:29.4871108Z Solving environment: \ | / - \ | / - \ | / done 2025-05-07T19:43:29.5773015Z 2025-05-07T19:43:29.5773697Z ## Package Plan ## 2025-05-07T19:43:29.5774260Z 2025-05-07T19:43:29.5774792Z environment location: /github/home/miniconda 2025-05-07T19:43:29.5775671Z 2025-05-07T19:43:29.5775947Z added / updated specs: 2025-05-07T19:43:29.5776736Z - conda-libmamba-solver 2025-05-07T19:43:29.5777463Z - libarchive 2025-05-07T19:43:29.5778070Z - libmamba 2025-05-07T19:43:29.5778635Z - libmambapy 2025-05-07T19:43:29.5779028Z 2025-05-07T19:43:29.5779039Z 2025-05-07T19:43:29.5779384Z The following packages will be downloaded: 2025-05-07T19:43:29.5780035Z 2025-05-07T19:43:29.5780471Z package | build 2025-05-07T19:43:29.5781255Z ---------------------------|----------------- 2025-05-07T19:43:29.5781803Z ca-certificates-2025.4.26 | hbd8a1cb_0 149 KB conda-forge 2025-05-07T19:43:29.5782325Z certifi-2025.4.26 | pyhd8ed1ab_0 154 KB conda-forge 2025-05-07T19:43:29.5782824Z conda-25.3.1 | py313h78bf25f_1 1.1 MB conda-forge 2025-05-07T19:43:29.5783332Z conda-libmamba-solver-25.4.0| pyhd8ed1ab_0 41 KB conda-forge 2025-05-07T19:43:29.5783833Z ------------------------------------------------------------ 2025-05-07T19:43:29.5784187Z Total: 1.4 MB 2025-05-07T19:43:29.5784428Z 2025-05-07T19:43:29.5784546Z The following packages will be UPDATED: 2025-05-07T19:43:29.5784764Z 2025-05-07T19:43:29.5789434Z ca-certificates pkgs/main/linux-64::ca-certificates-2~ --> conda-forge/noarch::ca-certificates-2025.4.26-hbd8a1cb_0 
2025-05-07T19:43:29.5790284Z conda pkgs/main::conda-25.3.1-py313h06a4308~ --> conda-forge::conda-25.3.1-py313h78bf25f_1 2025-05-07T19:43:29.5790705Z 2025-05-07T19:43:29.5790927Z The following packages will be SUPERSEDED by a higher-priority channel: 2025-05-07T19:43:29.5791248Z 2025-05-07T19:43:29.5791593Z certifi pkgs/main/linux-64::certifi-2025.4.26~ --> conda-forge/noarch::certifi-2025.4.26-pyhd8ed1ab_0 2025-05-07T19:43:29.5792418Z conda-libmamba-so~ pkgs/main::conda-libmamba-solver-25.4~ --> conda-forge::conda-libmamba-solver-25.4.0-pyhd8ed1ab_0 2025-05-07T19:43:29.5793533Z 2025-05-07T19:43:29.5793537Z 2025-05-07T19:43:29.5793541Z 2025-05-07T19:43:29.5793778Z Downloading and Extracting Packages: ...working... 2025-05-07T19:43:29.5794273Z conda-25.3.1 | 1.1 MB | | 0% 2025-05-07T19:43:29.5794513Z 2025-05-07T19:43:29.5794855Z certifi-2025.4.26 | 154 KB | | 0%  2025-05-07T19:43:29.5795141Z 2025-05-07T19:43:29.5795147Z 2025-05-07T19:43:29.5795446Z ca-certificates-2025 | 149 KB | | 0%  2025-05-07T19:43:29.5795725Z 2025-05-07T19:43:29.5795729Z 2025-05-07T19:43:29.5796060Z 2025-05-07T19:43:29.6276570Z conda-libmamba-solve | 41 KB | | 0%  2025-05-07T19:43:29.6277506Z 2025-05-07T19:43:29.6374740Z certifi-2025.4.26 | 154 KB | ########## | 100%  2025-05-07T19:43:29.6375592Z 2025-05-07T19:43:29.6396672Z certifi-2025.4.26 | 154 KB | ########## | 100%  2025-05-07T19:43:29.6506467Z conda-25.3.1 | 1.1 MB | ########## | 100% 2025-05-07T19:43:29.6506759Z 2025-05-07T19:43:29.6506764Z 2025-05-07T19:43:29.6583384Z ca-certificates-2025 | 149 KB | ########## | 100%  2025-05-07T19:43:29.6584284Z 2025-05-07T19:43:29.6584313Z 2025-05-07T19:43:29.6584325Z 2025-05-07T19:43:29.6635537Z conda-libmamba-solve | 41 KB | ########## | 100%  2025-05-07T19:43:29.6636119Z 2025-05-07T19:43:29.6636125Z 2025-05-07T19:43:29.6749387Z ca-certificates-2025 | 149 KB | ########## | 100%  2025-05-07T19:43:29.6750048Z 2025-05-07T19:43:29.6750061Z 2025-05-07T19:43:29.6750065Z 2025-05-07T19:43:29.7476448Z 
conda-libmamba-solve | 41 KB | ########## | 100%  2025-05-07T19:43:29.7476924Z conda-25.3.1 | 1.1 MB | ########## | 100% 2025-05-07T19:43:29.7477608Z conda-25.3.1 | 1.1 MB | ########## | 100% 2025-05-07T19:43:29.7477956Z 2025-05-07T19:43:29.7478250Z 2025-05-07T19:43:29.7478455Z  2025-05-07T19:43:29.7478674Z 2025-05-07T19:43:29.7478678Z 2025-05-07T19:43:29.7478939Z  2025-05-07T19:43:29.7479174Z 2025-05-07T19:43:29.7479178Z 2025-05-07T19:43:29.7479181Z 2025-05-07T19:43:29.7479377Z  done 2025-05-07T19:43:29.8492534Z Preparing transaction: \ done 2025-05-07T19:43:29.9500747Z Verifying transaction: / done 2025-05-07T19:43:31.2528705Z Executing transaction: \ | / - \ | / - \ | / - \ done 2025-05-07T19:43:32.8124016Z [SETUP] Updating Miniconda base packages ... 2025-05-07T19:43:32.8149465Z [EXEC] [ATTEMPT 0/3] + conda update -n base -c defaults --update-deps -y conda 2025-05-07T19:43:33.5436294Z Channels: 2025-05-07T19:43:33.5436980Z - defaults 2025-05-07T19:43:33.5437591Z Platform: linux-64 2025-05-07T19:43:34.5949820Z Collecting package metadata (repodata.json): - \ | / - \ done 2025-05-07T19:43:34.7248660Z Solving environment: / - Channels: 2025-05-07T19:43:34.7249024Z - defaults 2025-05-07T19:43:34.7249262Z Platform: linux-64 2025-05-07T19:43:35.0018401Z Collecting package metadata (repodata.json): | / - \ done 2025-05-07T19:43:35.2205064Z Solving environment: / - \ | done 2025-05-07T19:43:35.3104512Z done 2025-05-07T19:43:35.3735964Z 2025-05-07T19:43:35.3736423Z ## Package Plan ## 2025-05-07T19:43:35.3736812Z 2025-05-07T19:43:35.3737134Z environment location: /github/home/miniconda 2025-05-07T19:43:35.3737489Z 2025-05-07T19:43:35.3737639Z added / updated specs: 2025-05-07T19:43:35.3737919Z - conda 2025-05-07T19:43:35.3738059Z 2025-05-07T19:43:35.3738063Z 2025-05-07T19:43:35.3738188Z The following packages will be downloaded: 2025-05-07T19:43:35.3738416Z 2025-05-07T19:43:35.3738554Z package | build 2025-05-07T19:43:35.3738915Z 
---------------------------|----------------- 2025-05-07T19:43:35.3739274Z pip-25.1 | pyhc872135_2 1.3 MB 2025-05-07T19:43:35.3739698Z tzdata-2025b | h04d1e81_0 116 KB 2025-05-07T19:43:35.3740116Z ------------------------------------------------------------ 2025-05-07T19:43:35.3740473Z Total: 1.4 MB 2025-05-07T19:43:35.3740705Z 2025-05-07T19:43:35.3740862Z The following packages will be UPDATED: 2025-05-07T19:43:35.3741090Z 2025-05-07T19:43:35.3741421Z pip pkgs/main/linux-64::pip-25.0-py313h06~ --> pkgs/main/noarch::pip-25.1-pyhc872135_2 2025-05-07T19:43:35.3742239Z tzdata 2025a-h04d1e81_0 --> 2025b-h04d1e81_0 2025-05-07T19:43:35.3742520Z 2025-05-07T19:43:35.3742524Z 2025-05-07T19:43:35.3742528Z 2025-05-07T19:43:35.3742701Z Downloading and Extracting Packages: ...working... 2025-05-07T19:43:35.3743206Z pip-25.1 | 1.3 MB | | 0% 2025-05-07T19:43:35.3743452Z 2025-05-07T19:43:35.4204852Z tzdata-2025b | 116 KB | | 0%  2025-05-07T19:43:35.4205189Z 2025-05-07T19:43:35.4338382Z tzdata-2025b | 116 KB | ########## | 100%  2025-05-07T19:43:35.6174834Z pip-25.1 | 1.3 MB | ########## | 100% 2025-05-07T19:43:35.6175601Z 2025-05-07T19:43:35.6176156Z tzdata-2025b | 116 KB | ########## | 100%  2025-05-07T19:43:35.6176422Z 2025-05-07T19:43:35.6234321Z tzdata-2025b | 116 KB | ########## | 100%  2025-05-07T19:43:35.6235597Z pip-25.1 | 1.3 MB | ########## | 100% 2025-05-07T19:43:35.6236867Z pip-25.1 | 1.3 MB | ########## | 100% 2025-05-07T19:43:35.6237897Z 2025-05-07T19:43:35.6238642Z 2025-05-07T19:43:35.6239041Z  done 2025-05-07T19:43:35.7245552Z Preparing transaction: - done 2025-05-07T19:43:35.8254514Z Verifying transaction: | done 2025-05-07T19:43:37.7287811Z Executing transaction: - \ | / - \ | / - \ | / - \ | / - \ | done 2025-05-07T19:43:38.2645313Z [SETUP] Cleaning up Conda packages ... 2025-05-07T19:43:38.2646515Z + conda clean --packages --tarball -y 2025-05-07T19:43:38.2647127Z 2025-05-07T19:43:38.6962673Z Will remove 99 (117.8 MB) tarball(s). 
2025-05-07T19:43:38.6963695Z Will remove 11 (16.0 MB) package(s). 2025-05-07T19:43:38.7518509Z 2025-05-07T19:43:38.7523527Z + conda clean --all -y 2025-05-07T19:43:38.7524363Z 2025-05-07T19:43:39.1932254Z There are no unused tarball(s) to remove. 2025-05-07T19:43:39.1933236Z Will remove 1 index cache(s). 2025-05-07T19:43:39.1934142Z There are no unused package(s) to remove. 2025-05-07T19:43:39.1935118Z There are no tempfile(s) to remove. 2025-05-07T19:43:39.1935977Z There are no logfile(s) to remove. 2025-05-07T19:43:39.2480716Z 2025-05-07T19:43:39.2482057Z + conda info 2025-05-07T19:43:39.2482624Z 2025-05-07T19:43:39.8071001Z 2025-05-07T19:43:39.8071636Z active environment : base 2025-05-07T19:43:39.8072585Z active env location : /github/home/miniconda 2025-05-07T19:43:39.8073760Z shell level : 1 2025-05-07T19:43:39.8074573Z user config file : /github/home/.condarc 2025-05-07T19:43:39.8075787Z populated config files : /github/home/miniconda/.condarc 2025-05-07T19:43:39.8076858Z conda version : 25.3.1 2025-05-07T19:43:39.8077700Z conda-build version : not installed 2025-05-07T19:43:39.8078425Z python version : 3.13.2.final.0 2025-05-07T19:43:39.8078777Z solver : libmamba (default) 2025-05-07T19:43:39.8079138Z virtual packages : __archspec=1=cascadelake 2025-05-07T19:43:39.8079583Z __conda=25.3.1=0 2025-05-07T19:43:39.8079875Z __glibc=2.34=0 2025-05-07T19:43:39.8080150Z __linux=6.1.130=0 2025-05-07T19:43:39.8080446Z __unix=0=0 2025-05-07T19:43:39.8080767Z base environment : /github/home/miniconda (writable) 2025-05-07T19:43:39.8081196Z conda av data dir : /github/home/miniconda/etc/conda 2025-05-07T19:43:39.8081550Z conda av metadata url : None 2025-05-07T19:43:39.8081933Z channel URLs : https://repo.anaconda.com/pkgs/main/linux-64 2025-05-07T19:43:39.8082371Z https://repo.anaconda.com/pkgs/main/noarch 2025-05-07T19:43:39.8082751Z https://repo.anaconda.com/pkgs/r/linux-64 2025-05-07T19:43:39.8083151Z https://repo.anaconda.com/pkgs/r/noarch 
2025-05-07T19:43:39.8083515Z package cache : /github/home/miniconda/pkgs 2025-05-07T19:43:39.8084107Z /github/home/.conda/pkgs 2025-05-07T19:43:39.8084444Z envs directories : /github/home/miniconda/envs 2025-05-07T19:43:39.8084788Z /github/home/.conda/envs 2025-05-07T19:43:39.8085116Z platform : linux-64 2025-05-07T19:43:39.8085968Z user-agent : conda/25.3.1 requests/2.32.3 CPython/3.13.2 Linux/6.1.130-139.222.amzn2023.x86_64 amzn/2023.7.20250428 glibc/2.34 solver/libmamba conda-libmamba-solver/25.4.0 libmambapy/2.0.5 aau/0.7.0 c/. s/. e/. 2025-05-07T19:43:39.8086845Z UID:GID : 0:0 2025-05-07T19:43:39.8087091Z netrc file : None 2025-05-07T19:43:39.8087369Z offline mode : False 2025-05-07T19:43:39.8087537Z 2025-05-07T19:43:39.8665203Z 2025-05-07T19:43:39.8666288Z [SETUP] Exporting Miniconda variables ... 2025-05-07T19:43:39.8667009Z [SETUP] Saving Miniconda variables to /__w/_temp/_runner_file_commands/add_path_3f3a1a7d-bdde-47e7-ae64-a74d9b73f00d ... 2025-05-07T19:43:39.8667768Z [SETUP] Successfully set up Miniconda at /github/home/miniconda 2025-05-07T19:43:39.8794525Z ##[group]Run . $PRELUDE; create_conda_environment $BUILD_ENV 3.13 2025-05-07T19:43:39.8795067Z . 
$PRELUDE; create_conda_environment $BUILD_ENV 3.13 2025-05-07T19:43:39.8795762Z shell: bash --noprofile --norc -e -o pipefail {0} 2025-05-07T19:43:39.8796104Z env: 2025-05-07T19:43:39.8796329Z PRELUDE: .github/scripts/setup_env.bash 2025-05-07T19:43:39.8796656Z BUILD_ENV: build_binary 2025-05-07T19:43:39.8796905Z BUILD_TARGET: default 2025-05-07T19:43:39.8797159Z BUILD_VARIANT: cuda 2025-05-07T19:43:39.8797398Z BUILD_CUDA_VERSION: 11.8.0 2025-05-07T19:43:39.8797668Z ##[endgroup] 2025-05-07T19:43:40.2874148Z ################################################################################ 2025-05-07T19:43:40.2875241Z # Create Conda Environment 2025-05-07T19:43:40.2876424Z # 2025-05-07T19:43:40.2884665Z # [2025-05-07T19:43:40.288Z] + create_conda_environment build_binary 3.13 2025-05-07T19:43:40.2886092Z ################################################################################ 2025-05-07T19:43:40.2886766Z 2025-05-07T19:43:40.2903666Z [EXEC] [ATTEMPT 0/3] + wget -q --timeout 1 pypi.org -O /dev/null 2025-05-07T19:43:40.3759207Z [CHECK] Network does not appear to be blocked. 2025-05-07T19:43:40.3759649Z [SETUP] Listing existing Conda environments ... 2025-05-07T19:43:40.3760038Z + conda info --envs 2025-05-07T19:43:40.3760197Z 2025-05-07T19:43:40.9386631Z 2025-05-07T19:43:40.9387351Z # conda environments: 2025-05-07T19:43:40.9387782Z # 2025-05-07T19:43:40.9388045Z base /github/home/miniconda 2025-05-07T19:43:40.9388320Z 2025-05-07T19:43:40.9977595Z 2025-05-07T19:43:40.9978693Z [SETUP] Deleting the prefix directory if it exists ... 2025-05-07T19:43:42.5938456Z + rm -rf /github/home/miniconda/envs/build_binary 2025-05-07T19:43:42.5938890Z 2025-05-07T19:43:42.5954798Z 2025-05-07T19:43:42.5963045Z [SETUP] Creating new Conda environment (Python 3.13) ... 
2025-05-07T19:43:42.5990444Z [EXEC] [ATTEMPT 0/3] + conda create -y -n build_binary python=3.13 2025-05-07T19:43:43.1804303Z Channels: 2025-05-07T19:43:43.1805004Z - defaults 2025-05-07T19:43:43.1805600Z Platform: linux-64 2025-05-07T19:43:44.4916702Z Collecting package metadata (repodata.json): - \ | / - \ | / done 2025-05-07T19:43:44.5920949Z Solving environment: \ done 2025-05-07T19:43:44.6213291Z 2025-05-07T19:43:44.6213868Z ## Package Plan ## 2025-05-07T19:43:44.6214048Z 2025-05-07T19:43:44.6214288Z environment location: /github/home/miniconda/envs/build_binary 2025-05-07T19:43:44.6214705Z 2025-05-07T19:43:44.6214816Z added / updated specs: 2025-05-07T19:43:44.6215119Z - python=3.13 2025-05-07T19:43:44.6215270Z 2025-05-07T19:43:44.6215274Z 2025-05-07T19:43:44.6215412Z The following packages will be downloaded: 2025-05-07T19:43:44.6215685Z 2025-05-07T19:43:44.6215852Z package | build 2025-05-07T19:43:44.6216249Z ---------------------------|----------------- 2025-05-07T19:43:44.6216667Z _libgcc_mutex-0.1 | main 3 KB 2025-05-07T19:43:44.6217136Z _openmp_mutex-5.1 | 1_gnu 21 KB 2025-05-07T19:43:44.6217597Z ca-certificates-2025.2.25 | h06a4308_0 129 KB 2025-05-07T19:43:44.6218085Z python_abi-3.13 | 0_cp313 6 KB 2025-05-07T19:43:44.6218501Z ------------------------------------------------------------ 2025-05-07T19:43:44.6218911Z Total: 159 KB 2025-05-07T19:43:44.6219143Z 2025-05-07T19:43:44.6219320Z The following NEW packages will be INSTALLED: 2025-05-07T19:43:44.6219566Z 2025-05-07T19:43:44.6219803Z _libgcc_mutex pkgs/main/linux-64::_libgcc_mutex-0.1-main 2025-05-07T19:43:44.6220437Z _openmp_mutex pkgs/main/linux-64::_openmp_mutex-5.1-1_gnu 2025-05-07T19:43:44.6220904Z bzip2 pkgs/main/linux-64::bzip2-1.0.8-h5eee18b_6 2025-05-07T19:43:44.6221462Z ca-certificates pkgs/main/linux-64::ca-certificates-2025.2.25-h06a4308_0 2025-05-07T19:43:44.6222338Z expat pkgs/main/linux-64::expat-2.7.1-h6a678d5_0 2025-05-07T19:43:44.6222834Z ld_impl_linux-64 
pkgs/main/linux-64::ld_impl_linux-64-2.40-h12ee557_0 2025-05-07T19:43:44.6223363Z libffi pkgs/main/linux-64::libffi-3.4.4-h6a678d5_1 2025-05-07T19:43:44.6223827Z libgcc-ng pkgs/main/linux-64::libgcc-ng-11.2.0-h1234567_1 2025-05-07T19:43:44.6224329Z libgomp pkgs/main/linux-64::libgomp-11.2.0-h1234567_1 2025-05-07T19:43:44.6224828Z libmpdec pkgs/main/linux-64::libmpdec-4.0.0-h5eee18b_0 2025-05-07T19:43:44.6225330Z libstdcxx-ng pkgs/main/linux-64::libstdcxx-ng-11.2.0-h1234567_1 2025-05-07T19:43:44.6225855Z libuuid pkgs/main/linux-64::libuuid-1.41.5-h5eee18b_0 2025-05-07T19:43:44.6226446Z ncurses pkgs/main/linux-64::ncurses-6.4-h6a678d5_0 2025-05-07T19:43:44.6226933Z openssl pkgs/main/linux-64::openssl-3.0.16-h5eee18b_0 2025-05-07T19:43:44.6227406Z pip pkgs/main/noarch::pip-25.1-pyhc872135_2 2025-05-07T19:43:44.6227958Z python pkgs/main/linux-64::python-3.13.2-hf623796_100_cp313 2025-05-07T19:43:44.6228452Z python_abi pkgs/main/linux-64::python_abi-3.13-0_cp313 2025-05-07T19:43:44.6228887Z readline pkgs/main/linux-64::readline-8.2-h5eee18b_0 2025-05-07T19:43:44.6229389Z setuptools pkgs/main/linux-64::setuptools-78.1.1-py313h06a4308_0 2025-05-07T19:43:44.6229895Z sqlite pkgs/main/linux-64::sqlite-3.45.3-h5eee18b_0 2025-05-07T19:43:44.6230288Z tk pkgs/main/linux-64::tk-8.6.14-h39e8969_0 2025-05-07T19:43:44.6230696Z tzdata pkgs/main/noarch::tzdata-2025b-h04d1e81_0 2025-05-07T19:43:44.6231123Z wheel pkgs/main/linux-64::wheel-0.45.1-py313h06a4308_0 2025-05-07T19:43:44.6231549Z xz pkgs/main/linux-64::xz-5.6.4-h5eee18b_1 2025-05-07T19:43:44.6231919Z zlib pkgs/main/linux-64::zlib-1.2.13-h5eee18b_1 2025-05-07T19:43:44.6232194Z 2025-05-07T19:43:44.6232198Z 2025-05-07T19:43:44.6232202Z 2025-05-07T19:43:44.6232363Z Downloading and Extracting Packages: ...working... 
2025-05-07T19:43:44.6606947Z ca-certificates-2025 | 129 KB | ########## | 100% 2025-05-07T19:43:44.6685796Z _openmp_mutex-5.1 | 21 KB | ########## | 100% 2025-05-07T19:43:44.6721134Z python_abi-3.13 | 6 KB | ########## | 100% 2025-05-07T19:43:44.6780029Z _libgcc_mutex-0.1 | 3 KB | ########## | 100% 2025-05-07T19:43:44.6833447Z done 2025-05-07T19:43:44.8945530Z Preparing transaction: / - done 2025-05-07T19:43:46.4436172Z Verifying transaction: | / - \ | / - \ | / - \ | / done 2025-05-07T19:43:48.6581380Z Executing transaction: \ | / - \ | / - \ | / - \ | / - \ | / - \ | done 2025-05-07T19:43:48.6620007Z # 2025-05-07T19:43:48.6620705Z # To activate
this environment, use 2025-05-07T19:43:48.6621606Z # 2025-05-07T19:43:48.6622222Z # $ conda activate build_binary 2025-05-07T19:43:48.6622998Z # 2025-05-07T19:43:48.6623653Z # To deactivate an active environment, use 2025-05-07T19:43:48.6624973Z # 2025-05-07T19:43:48.6625546Z # $ conda deactivate 2025-05-07T19:43:48.6626017Z 2025-05-07T19:43:48.7461263Z [SETUP] Upgrading PIP to latest ... 2025-05-07T19:43:48.7492895Z [EXEC] [ATTEMPT 0/3] + conda run -n build_binary pip install --upgrade pip 2025-05-07T19:43:51.6760522Z WARNING: Running pip as the 'root' user can result in broken permissions and conflicting behaviour with the system package manager, possibly rendering your system unusable. It is recommended to use a virtual environment instead: https://pip.pypa.io/warnings/venv. Use the --root-user-action option if you know what you are doing and want to suppress this warning. 2025-05-07T19:43:51.6762436Z 2025-05-07T19:43:51.6762896Z Requirement already satisfied: pip in /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages (25.1) 2025-05-07T19:43:51.6763587Z Collecting pip 2025-05-07T19:43:51.6763938Z Downloading pip-25.1.1-py3-none-any.whl.metadata (3.6 kB) 2025-05-07T19:43:51.6764480Z Downloading pip-25.1.1-py3-none-any.whl (1.8 MB) 2025-05-07T19:43:51.6765376Z ━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━ 1.8/1.8 MB 93.3 MB/s eta 0:00:00 2025-05-07T19:43:51.6765848Z Installing collected packages: pip 2025-05-07T19:43:51.6766219Z Attempting uninstall: pip 2025-05-07T19:43:51.6766541Z Found existing installation: pip 25.1 2025-05-07T19:43:51.6766911Z Uninstalling pip-25.1: 2025-05-07T19:43:51.6767218Z Successfully uninstalled pip-25.1 2025-05-07T19:43:51.6767602Z Successfully installed pip-25.1.1 2025-05-07T19:43:51.6767815Z 2025-05-07T19:43:51.7352589Z [SETUP] Upgrading pyOpenSSL ... 
2025-05-07T19:43:51.7375578Z [EXEC] [ATTEMPT 0/3] + conda install -n build_binary -c conda-forge --override-channels -y pyOpenSSL>22.1.0 2025-05-07T19:43:52.3982300Z Channels: 2025-05-07T19:43:52.3982961Z - conda-forge 2025-05-07T19:43:52.3983661Z Platform: linux-64 2025-05-07T19:44:02.0299292Z Collecting package metadata (repodata.json): - \ | / - \ | / - \ | / - \ | / - done 2025-05-07T19:44:03.9401474Z Solving environment: | / - \ | done 2025-05-07T19:44:03.9855829Z 2025-05-07T19:44:03.9856230Z ## Package Plan ## 2025-05-07T19:44:03.9856626Z 2025-05-07T19:44:03.9857086Z environment location: /github/home/miniconda/envs/build_binary 2025-05-07T19:44:03.9857619Z 2025-05-07T19:44:03.9857765Z added / updated specs: 2025-05-07T19:44:03.9858100Z - pyopenssl[version='>22.1.0'] 2025-05-07T19:44:03.9858312Z 2025-05-07T19:44:03.9858316Z 2025-05-07T19:44:03.9858469Z The following packages will be downloaded: 2025-05-07T19:44:03.9858809Z 2025-05-07T19:44:03.9858987Z package | build 2025-05-07T19:44:03.9859474Z ---------------------------|----------------- 2025-05-07T19:44:03.9860042Z cffi-1.17.1 | py313hfab6e84_0 289 KB conda-forge 2025-05-07T19:44:03.9860729Z cryptography-44.0.3 | py313h6556f6e_0 1.5 MB conda-forge 2025-05-07T19:44:03.9861539Z libgcc-15.1.0 | h767d61c_2 810 KB conda-forge 2025-05-07T19:44:03.9862204Z libgcc-ng-15.1.0 | h69a702a_2 34 KB conda-forge 2025-05-07T19:44:03.9863226Z libgomp-15.1.0 | h767d61c_2 442 KB conda-forge 2025-05-07T19:44:03.9864095Z openssl-3.5.0 | h7b32b05_1 3.0 MB conda-forge 2025-05-07T19:44:03.9864863Z pycparser-2.22 | pyh29332c3_1 108 KB conda-forge 2025-05-07T19:44:03.9865500Z pyopenssl-25.0.0 | pyhd8ed1ab_0 120 KB conda-forge 2025-05-07T19:44:03.9866002Z typing-extensions-4.13.2 | h0e9735f_0 88 KB conda-forge 2025-05-07T19:44:03.9866498Z typing_extensions-4.13.2 | pyh29332c3_0 51 KB conda-forge 2025-05-07T19:44:03.9867065Z ------------------------------------------------------------ 2025-05-07T19:44:03.9867538Z Total: 6.4 MB 
2025-05-07T19:44:03.9868117Z 2025-05-07T19:44:03.9868254Z The following NEW packages will be INSTALLED: 2025-05-07T19:44:03.9868483Z 2025-05-07T19:44:03.9868739Z cffi conda-forge/linux-64::cffi-1.17.1-py313hfab6e84_0 2025-05-07T19:44:03.9869250Z cryptography conda-forge/linux-64::cryptography-44.0.3-py313h6556f6e_0 2025-05-07T19:44:03.9869828Z libgcc conda-forge/linux-64::libgcc-15.1.0-h767d61c_2 2025-05-07T19:44:03.9870374Z pycparser conda-forge/noarch::pycparser-2.22-pyh29332c3_1 2025-05-07T19:44:03.9870886Z pyopenssl conda-forge/noarch::pyopenssl-25.0.0-pyhd8ed1ab_0 2025-05-07T19:44:03.9871507Z typing-extensions conda-forge/noarch::typing-extensions-4.13.2-h0e9735f_0 2025-05-07T19:44:03.9872287Z typing_extensions conda-forge/noarch::typing_extensions-4.13.2-pyh29332c3_0 2025-05-07T19:44:03.9872681Z 2025-05-07T19:44:03.9872939Z The following packages will be UPDATED: 2025-05-07T19:44:03.9873352Z 2025-05-07T19:44:03.9873909Z ca-certificates pkgs/main/linux-64::ca-certificates-2~ --> conda-forge/noarch::ca-certificates-2025.4.26-hbd8a1cb_0 2025-05-07T19:44:03.9874870Z libgcc-ng pkgs/main::libgcc-ng-11.2.0-h1234567_1 --> conda-forge::libgcc-ng-15.1.0-h69a702a_2 2025-05-07T19:44:03.9875681Z libgomp pkgs/main::libgomp-11.2.0-h1234567_1 --> conda-forge::libgomp-15.1.0-h767d61c_2 2025-05-07T19:44:03.9876386Z openssl pkgs/main::openssl-3.0.16-h5eee18b_0 --> conda-forge::openssl-3.5.0-h7b32b05_1 2025-05-07T19:44:03.9876827Z 2025-05-07T19:44:03.9876831Z 2025-05-07T19:44:03.9876835Z 2025-05-07T19:44:03.9876997Z Downloading and Extracting Packages: ...working... 
2025-05-07T19:44:04.0744236Z cffi-1.17.1 | 289 KB | ########## | 100% 2025-05-07T19:44:04.0858938Z libgomp-15.1.0 | 442 KB | ########## | 100% 2025-05-07T19:44:04.0892893Z cryptography-44.0.3 | 1.5 MB | ########## | 100% 2025-05-07T19:44:04.0996964Z libgcc-15.1.0 | 810 KB | ########## | 100% 2025-05-07T19:44:04.1274961Z pyopenssl-25.0.0 | 120 KB | ########## | 100% 2025-05-07T19:44:04.1370816Z pycparser-2.22 | 108 KB | ########## | 100% 2025-05-07T19:44:04.1435342Z typing_extensions-4. | 51 KB | ########## | 100% 2025-05-07T19:44:04.1555376Z typing-extensions-4. | 88 KB | ########## | 100% 2025-05-07T19:44:04.1702397Z openssl-3.5.0 | 3.0 MB | ########## | 100% 2025-05-07T19:44:04.1995823Z libgcc-ng-15.1.0 | 34 KB | ########## | 100%
2025-05-07T19:44:04.3914607Z 2025-05-07T19:44:04.3914816Z  done 2025-05-07T19:44:04.4915837Z Preparing transaction: - done 2025-05-07T19:44:04.5923584Z Verifying transaction: | done 2025-05-07T19:44:05.9956599Z Executing transaction: - \ | / - \ | / - \ | / - \ done 2025-05-07T19:44:06.0913090Z [SETUP] Testing pyOpenSSL import ... 2025-05-07T19:44:07.7587235Z [CHECK] Python (sub-)package 'OpenSSL' found ... 2025-05-07T19:44:07.7594039Z [SETUP] Installing libxcrypt ... 2025-05-07T19:44:07.7630347Z [EXEC] [ATTEMPT 0/3] + conda install -n build_binary -c conda-forge --override-channels -y libxcrypt 2025-05-07T19:44:08.4294206Z Channels: 2025-05-07T19:44:08.4294867Z - conda-forge 2025-05-07T19:44:08.4295623Z Platform: linux-64 2025-05-07T19:44:11.6012987Z Collecting package metadata (repodata.json): - \ | / done 2025-05-07T19:44:12.0233831Z Solving environment: \ done 2025-05-07T19:44:12.0683570Z 2025-05-07T19:44:12.0684702Z ## Package Plan ## 2025-05-07T19:44:12.0685221Z 2025-05-07T19:44:12.0685829Z environment location: /github/home/miniconda/envs/build_binary 2025-05-07T19:44:12.0686755Z 2025-05-07T19:44:12.0687039Z added / updated specs: 2025-05-07T19:44:12.0687786Z - libxcrypt 2025-05-07T19:44:12.0688177Z 2025-05-07T19:44:12.0688190Z 2025-05-07T19:44:12.0688576Z The following packages will be downloaded: 2025-05-07T19:44:12.0689248Z 2025-05-07T19:44:12.0689586Z package | build 2025-05-07T19:44:12.0690588Z ---------------------------|----------------- 2025-05-07T19:44:12.0691395Z libxcrypt-4.4.36 | hd590300_1 98 KB conda-forge 2025-05-07T19:44:12.0691836Z ------------------------------------------------------------ 2025-05-07T19:44:12.0692504Z Total: 98 KB 2025-05-07T19:44:12.0692723Z 2025-05-07T19:44:12.0692862Z The following NEW packages will be INSTALLED: 2025-05-07T19:44:12.0693087Z 2025-05-07T19:44:12.0693357Z libxcrypt conda-forge/linux-64::libxcrypt-4.4.36-hd590300_1 2025-05-07T19:44:12.0693651Z 2025-05-07T19:44:12.0693655Z 2025-05-07T19:44:12.0693658Z 
2025-05-07T19:44:12.0693802Z Downloading and Extracting Packages: ...working... 2025-05-07T19:44:12.2474008Z libxcrypt-4.4.36 | 98 KB | ########## | 100% 2025-05-07T19:44:12.2477121Z done 2025-05-07T19:44:12.3482695Z Preparing transaction: / done 2025-05-07T19:44:12.4493558Z Verifying transaction: \ done 2025-05-07T19:44:12.5502712Z Executing transaction: / done 2025-05-07T19:44:15.8141820Z [SETUP] Copying over ... 2025-05-07T19:44:15.8143966Z + cp /github/home/miniconda/envs/build_binary/include/crypt.h /github/home/miniconda/envs/build_binary/include/python3.13/crypt.h 2025-05-07T19:44:15.8144569Z 2025-05-07T19:44:15.8176827Z 2025-05-07T19:44:17.4165660Z [SETUP] Installed Python version: Python 3.13.2 2025-05-07T19:44:17.4167027Z [SETUP] Successfully created Conda environment: build_binary 2025-05-07T19:44:17.4248199Z ##[group]Run . $PRELUDE; install_cxx_compiler $BUILD_ENV clang 2025-05-07T19:44:17.4248708Z .
$PRELUDE; install_cxx_compiler $BUILD_ENV clang 2025-05-07T19:44:17.4249241Z shell: bash --noprofile --norc -e -o pipefail {0} 2025-05-07T19:44:17.4249583Z env: 2025-05-07T19:44:17.4249821Z PRELUDE: .github/scripts/setup_env.bash 2025-05-07T19:44:17.4250129Z BUILD_ENV: build_binary 2025-05-07T19:44:17.4250415Z BUILD_TARGET: default 2025-05-07T19:44:17.4250647Z BUILD_VARIANT: cuda 2025-05-07T19:44:17.4250898Z BUILD_CUDA_VERSION: 11.8.0 2025-05-07T19:44:17.4251147Z ##[endgroup] 2025-05-07T19:44:17.8605219Z ################################################################################ 2025-05-07T19:44:17.8606315Z # Install C/C++ Compilers 2025-05-07T19:44:17.8607033Z # 2025-05-07T19:44:17.8622989Z # [2025-05-07T19:44:17.861Z] + install_cxx_compiler build_binary clang 2025-05-07T19:44:17.8623835Z ################################################################################ 2025-05-07T19:44:17.8624212Z 2025-05-07T19:44:17.8643648Z [EXEC] [ATTEMPT 0/3] + wget -q --timeout 1 pypi.org -O /dev/null 2025-05-07T19:44:17.9546720Z [CHECK] Network does not appear to be blocked. 2025-05-07T19:44:17.9553174Z [INSTALL] Installing GLIBC (architecture = 64) ... 
2025-05-07T19:44:17.9579462Z [EXEC] [ATTEMPT 0/3] + conda install -n build_binary -c conda-forge --override-channels -y sysroot_linux-64=2.17 2025-05-07T19:44:18.6238283Z Channels: 2025-05-07T19:44:18.6238635Z - conda-forge 2025-05-07T19:44:18.6238942Z Platform: linux-64 2025-05-07T19:44:21.6867468Z Collecting package metadata (repodata.json): - \ | / done 2025-05-07T19:44:22.1079093Z Solving environment: \ done 2025-05-07T19:44:22.1532961Z 2025-05-07T19:44:22.1533650Z ## Package Plan ## 2025-05-07T19:44:22.1533892Z 2025-05-07T19:44:22.1534165Z environment location: /github/home/miniconda/envs/build_binary 2025-05-07T19:44:22.1534510Z 2025-05-07T19:44:22.1534625Z added / updated specs: 2025-05-07T19:44:22.1535004Z - sysroot_linux-64=2.17 2025-05-07T19:44:22.1535198Z 2025-05-07T19:44:22.1535202Z 2025-05-07T19:44:22.1535345Z The following packages will be downloaded: 2025-05-07T19:44:22.1535618Z 2025-05-07T19:44:22.1535747Z package | build 2025-05-07T19:44:22.1536134Z ---------------------------|----------------- 2025-05-07T19:44:22.1536603Z kernel-headers_linux-64-3.10.0| he073ed8_18 921 KB conda-forge 2025-05-07T19:44:22.1537492Z sysroot_linux-64-2.17 | h0157908_18 14.5 MB conda-forge 2025-05-07T19:44:22.1537953Z ------------------------------------------------------------ 2025-05-07T19:44:22.1538364Z Total: 15.4 MB 2025-05-07T19:44:22.1538595Z 2025-05-07T19:44:22.1538738Z The following NEW packages will be INSTALLED: 2025-05-07T19:44:22.1539012Z 2025-05-07T19:44:22.1539340Z kernel-headers_li~ conda-forge/noarch::kernel-headers_linux-64-3.10.0-he073ed8_18 2025-05-07T19:44:22.1540002Z sysroot_linux-64 conda-forge/noarch::sysroot_linux-64-2.17-h0157908_18 2025-05-07T19:44:22.1540347Z 2025-05-07T19:44:22.1540351Z 2025-05-07T19:44:22.1540354Z 2025-05-07T19:44:22.1540512Z Downloading and Extracting Packages: ...working... 
2025-05-07T19:44:22.4684809Z kernel-headers_linux | 921 KB | ########## | 100% 2025-05-07T19:44:23.1116543Z sysroot_linux-64-2.1 | 14.5 MB | ########## | 100% 2025-05-07T19:44:23.1119668Z done 2025-05-07T19:44:23.2130105Z Preparing transaction: / done 2025-05-07T19:44:23.4141808Z Verifying transaction: \ | done 2025-05-07T19:44:23.5152808Z Executing transaction: - done 2025-05-07T19:44:23.5981350Z [CHECK] LD_LIBRARY_PATH = 2025-05-07T19:44:23.5981833Z [CHECK] CONDA_PREFIX is not set. 2025-05-07T19:44:25.2272065Z [CHECK] libstdc++.so.6 found in CONDA_PREFIX PATH (symbolic link): /github/home/miniconda/envs/build_binary/lib/libstdc++.so.6 2025-05-07T19:44:25.2288544Z [INSTALL] Installing GCC (11.4.0, 64) through Conda ...
2025-05-07T19:44:25.2314938Z [EXEC] [ATTEMPT 0/3] + conda install -n build_binary -c conda-forge --override-channels -y gxx_linux-64=11.4.0 2025-05-07T19:44:25.9364187Z Channels: 2025-05-07T19:44:25.9364652Z - conda-forge 2025-05-07T19:44:25.9364956Z Platform: linux-64 2025-05-07T19:44:29.0891067Z Collecting package metadata (repodata.json): - \ | / done 2025-05-07T19:44:30.2270561Z Solving environment: \ | / done 2025-05-07T19:44:30.2752621Z 2025-05-07T19:44:30.2753465Z ## Package Plan ## 2025-05-07T19:44:30.2753938Z 2025-05-07T19:44:30.2754665Z environment location: /github/home/miniconda/envs/build_binary 2025-05-07T19:44:30.2755604Z 2025-05-07T19:44:30.2755894Z added / updated specs: 2025-05-07T19:44:30.2756681Z - gxx_linux-64=11.4.0 2025-05-07T19:44:30.2757155Z 2025-05-07T19:44:30.2757167Z 2025-05-07T19:44:30.2757559Z The following packages will be downloaded: 2025-05-07T19:44:30.2758225Z 2025-05-07T19:44:30.2758593Z package | build 2025-05-07T19:44:30.2759581Z ---------------------------|----------------- 2025-05-07T19:44:30.2760802Z binutils_impl_linux-64-2.40| ha1999f0_7 6.0 MB conda-forge 2025-05-07T19:44:30.2762102Z binutils_linux-64-2.40 | hb3c18ed_4 28 KB conda-forge 2025-05-07T19:44:30.2762609Z gcc_impl_linux-64-11.4.0 | h00c12a0_13 53.0 MB conda-forge 2025-05-07T19:44:30.2763419Z gcc_linux-64-11.4.0 | ha077dfb_4 31 KB conda-forge 2025-05-07T19:44:30.2763927Z gxx_impl_linux-64-11.4.0 | h634f3ee_13 11.2 MB conda-forge 2025-05-07T19:44:30.2764406Z gxx_linux-64-11.4.0 | h35bfe5d_4 29 KB conda-forge 2025-05-07T19:44:30.2764904Z ld_impl_linux-64-2.40 | hf3520f5_7 691 KB conda-forge 2025-05-07T19:44:30.2765414Z libgcc-devel_linux-64-11.4.0| h8f596e0_113 2.3 MB conda-forge 2025-05-07T19:44:30.2765959Z libsanitizer-11.4.0 | h5763a12_13 3.5 MB conda-forge 2025-05-07T19:44:30.2766443Z libstdcxx-15.1.0 | h8f9b012_2 3.7 MB conda-forge 2025-05-07T19:44:30.2766985Z libstdcxx-devel_linux-64-11.4.0| h8f596e0_113 11.1 MB conda-forge 2025-05-07T19:44:30.2767533Z 
libstdcxx-ng-15.1.0 | h4852527_2 34 KB conda-forge 2025-05-07T19:44:30.2767976Z ------------------------------------------------------------ 2025-05-07T19:44:30.2768376Z Total: 91.6 MB 2025-05-07T19:44:30.2768728Z 2025-05-07T19:44:30.2768867Z The following NEW packages will be INSTALLED: 2025-05-07T19:44:30.2769133Z 2025-05-07T19:44:30.2769629Z binutils_impl_lin~ conda-forge/linux-64::binutils_impl_linux-64-2.40-ha1999f0_7 2025-05-07T19:44:30.2770284Z binutils_linux-64 conda-forge/linux-64::binutils_linux-64-2.40-hb3c18ed_4 2025-05-07T19:44:30.2771051Z gcc_impl_linux-64 conda-forge/linux-64::gcc_impl_linux-64-11.4.0-h00c12a0_13 2025-05-07T19:44:30.2771656Z gcc_linux-64 conda-forge/linux-64::gcc_linux-64-11.4.0-ha077dfb_4 2025-05-07T19:44:30.2772217Z gxx_impl_linux-64 conda-forge/linux-64::gxx_impl_linux-64-11.4.0-h634f3ee_13 2025-05-07T19:44:30.2772809Z gxx_linux-64 conda-forge/linux-64::gxx_linux-64-11.4.0-h35bfe5d_4 2025-05-07T19:44:30.2773428Z libgcc-devel_linu~ conda-forge/noarch::libgcc-devel_linux-64-11.4.0-h8f596e0_113 2025-05-07T19:44:30.2774068Z libsanitizer conda-forge/linux-64::libsanitizer-11.4.0-h5763a12_13 2025-05-07T19:44:30.2774635Z libstdcxx conda-forge/linux-64::libstdcxx-15.1.0-h8f9b012_2 2025-05-07T19:44:30.2775262Z libstdcxx-devel_l~ conda-forge/noarch::libstdcxx-devel_linux-64-11.4.0-h8f596e0_113 2025-05-07T19:44:30.2775670Z 2025-05-07T19:44:30.2775799Z The following packages will be UPDATED: 2025-05-07T19:44:30.2776032Z 2025-05-07T19:44:30.2776412Z ld_impl_linux-64 pkgs/main::ld_impl_linux-64-2.40-h12e~ --> conda-forge::ld_impl_linux-64-2.40-hf3520f5_7 2025-05-07T19:44:30.2777206Z libstdcxx-ng pkgs/main::libstdcxx-ng-11.2.0-h12345~ --> conda-forge::libstdcxx-ng-15.1.0-h4852527_2 2025-05-07T19:44:30.2777692Z 2025-05-07T19:44:30.2777696Z 2025-05-07T19:44:30.2777699Z 2025-05-07T19:44:30.2777859Z Downloading and Extracting Packages: ...working... 
2025-05-07T19:44:32.0684908Z done 2025-05-07T19:44:32.1693999Z Preparing transaction: \ done 2025-05-07T19:44:32.4715208Z Verifying transaction: / - \ done 2025-05-07T19:44:32.5731456Z Executing transaction: / done 2025-05-07T19:44:32.6918670Z [INSTALL] Setting the C/C++ compiler symlinks ... 
2025-05-07T19:44:36.3877134Z + ln -sf /github/home/miniconda/envs/build_binary/bin/x86_64-conda-linux-gnu-cc /github/home/miniconda/envs/build_binary/bin/cc 2025-05-07T19:44:36.3878943Z 2025-05-07T19:44:36.3890230Z 2025-05-07T19:44:36.3905499Z + ln -sf /github/home/miniconda/envs/build_binary/bin/x86_64-conda-linux-gnu-cc /github/home/miniconda/envs/build_binary/bin/gcc 2025-05-07T19:44:36.3906120Z 2025-05-07T19:44:36.3924338Z 2025-05-07T19:44:36.3948470Z + ln -sf /github/home/miniconda/envs/build_binary/bin/x86_64-conda-linux-gnu-c++ /github/home/miniconda/envs/build_binary/bin/c++ 2025-05-07T19:44:36.3950289Z 2025-05-07T19:44:36.3957073Z 2025-05-07T19:44:36.3978495Z + ln -sf /github/home/miniconda/envs/build_binary/bin/x86_64-conda-linux-gnu-c++ /github/home/miniconda/envs/build_binary/bin/g++ 2025-05-07T19:44:36.3980306Z 2025-05-07T19:44:36.3989850Z 2025-05-07T19:44:36.3995797Z [INSTALL] Installing Clang (16.0.6, 64) and relevant libraries through Conda ... 2025-05-07T19:44:36.4020991Z [EXEC] [ATTEMPT 0/3] + conda install -n build_binary -c conda-forge --override-channels -y clangxx=16.0.6 libcxx llvm-openmp=16.0.6 compiler-rt=16.0.6 2025-05-07T19:44:37.1091134Z Channels: 2025-05-07T19:44:37.1091467Z - conda-forge 2025-05-07T19:44:37.1091742Z Platform: linux-64 2025-05-07T19:44:40.1529573Z Collecting package metadata (repodata.json): - \ | / done 2025-05-07T19:44:41.4971529Z Solving environment: \ | / done 2025-05-07T19:44:41.5509696Z 2025-05-07T19:44:41.5510014Z ## Package Plan ## 2025-05-07T19:44:41.5510571Z 2025-05-07T19:44:41.5510787Z environment location: /github/home/miniconda/envs/build_binary 2025-05-07T19:44:41.5511111Z 2025-05-07T19:44:41.5511241Z added / updated specs: 2025-05-07T19:44:41.5511517Z - clangxx=16.0.6 2025-05-07T19:44:41.5511771Z - compiler-rt=16.0.6 2025-05-07T19:44:41.5512034Z - libcxx 2025-05-07T19:44:41.5512291Z - llvm-openmp=16.0.6 2025-05-07T19:44:41.5512461Z 2025-05-07T19:44:41.5512465Z 2025-05-07T19:44:41.5512601Z The following 
packages will be downloaded: 2025-05-07T19:44:41.5512948Z 2025-05-07T19:44:41.5513095Z package | build 2025-05-07T19:44:41.5513439Z ---------------------------|----------------- 2025-05-07T19:44:41.5513847Z clang-16.0.6 |default_h9e3a008_14 110 KB conda-forge 2025-05-07T19:44:41.5514340Z clang-16-16.0.6 |default_hb5137d0_14 780 KB conda-forge 2025-05-07T19:44:41.5514823Z clangxx-16.0.6 |default_ha78316a_14 110 KB conda-forge 2025-05-07T19:44:41.5515344Z compiler-rt-16.0.6 | h00ab1b0_2 107 KB conda-forge 2025-05-07T19:44:41.5515854Z compiler-rt_linux-64-16.0.6| h00ab1b0_2 36.0 MB conda-forge 2025-05-07T19:44:41.5516352Z icu-73.2 | h59595ed_0 11.5 MB conda-forge 2025-05-07T19:44:41.5516859Z libclang-cpp16-16.0.6 |default_hb5137d0_14 17.3 MB conda-forge 2025-05-07T19:44:41.5517354Z libcxx-19.1.7 | h2713693_1 1000 KB conda-forge 2025-05-07T19:44:41.5518092Z libcxxabi-19.1.7 | hd85fd95_1 158 KB conda-forge 2025-05-07T19:44:41.5518561Z libiconv-1.18 | h4ce23a2_1 696 KB conda-forge 2025-05-07T19:44:41.5519061Z libllvm16-16.0.6 | hb3ce162_3 33.7 MB conda-forge 2025-05-07T19:44:41.5519516Z libxml2-2.12.7 | hc051c1a_1 688 KB conda-forge 2025-05-07T19:44:41.5520008Z libzlib-1.2.13 | h4ab18f5_6 60 KB conda-forge 2025-05-07T19:44:41.5520512Z llvm-openmp-16.0.6 | h4dfa4b3_0 39.9 MB conda-forge 2025-05-07T19:44:41.5520969Z zlib-1.2.13 | h4ab18f5_6 91 KB conda-forge 2025-05-07T19:44:41.5521421Z zstd-1.5.6 | ha6fb4c9_0 542 KB conda-forge 2025-05-07T19:44:41.5521831Z ------------------------------------------------------------ 2025-05-07T19:44:41.5522241Z Total: 142.6 MB 2025-05-07T19:44:41.5522473Z 2025-05-07T19:44:41.5522649Z The following NEW packages will be INSTALLED: 2025-05-07T19:44:41.5522896Z 2025-05-07T19:44:41.5523152Z clang conda-forge/linux-64::clang-16.0.6-default_h9e3a008_14 2025-05-07T19:44:41.5523719Z clang-16 conda-forge/linux-64::clang-16-16.0.6-default_hb5137d0_14 2025-05-07T19:44:41.5524264Z clangxx conda-forge/linux-64::clangxx-16.0.6-default_ha78316a_14 
2025-05-07T19:44:41.5524844Z compiler-rt conda-forge/linux-64::compiler-rt-16.0.6-h00ab1b0_2 2025-05-07T19:44:41.5525460Z compiler-rt_linux~ conda-forge/noarch::compiler-rt_linux-64-16.0.6-h00ab1b0_2 2025-05-07T19:44:41.5525994Z icu conda-forge/linux-64::icu-73.2-h59595ed_0 2025-05-07T19:44:41.5526562Z libclang-cpp16 conda-forge/linux-64::libclang-cpp16-16.0.6-default_hb5137d0_14 2025-05-07T19:44:41.5527132Z libcxx conda-forge/linux-64::libcxx-19.1.7-h2713693_1 2025-05-07T19:44:41.5527664Z libcxxabi conda-forge/linux-64::libcxxabi-19.1.7-hd85fd95_1 2025-05-07T19:44:41.5528203Z libiconv conda-forge/linux-64::libiconv-1.18-h4ce23a2_1 2025-05-07T19:44:41.5528707Z libllvm16 conda-forge/linux-64::libllvm16-16.0.6-hb3ce162_3 2025-05-07T19:44:41.5529236Z libxml2 conda-forge/linux-64::libxml2-2.12.7-hc051c1a_1 2025-05-07T19:44:41.5532175Z libzlib conda-forge/linux-64::libzlib-1.2.13-h4ab18f5_6 2025-05-07T19:44:41.5532852Z llvm-openmp conda-forge/linux-64::llvm-openmp-16.0.6-h4dfa4b3_0 2025-05-07T19:44:41.5533362Z zstd conda-forge/linux-64::zstd-1.5.6-ha6fb4c9_0 2025-05-07T19:44:41.5533675Z 2025-05-07T19:44:41.5533811Z The following packages will be UPDATED: 2025-05-07T19:44:41.5534045Z 2025-05-07T19:44:41.5534341Z zlib pkgs/main::zlib-1.2.13-h5eee18b_1 --> conda-forge::zlib-1.2.13-h4ab18f5_6 2025-05-07T19:44:41.5534712Z 2025-05-07T19:44:41.5534716Z 2025-05-07T19:44:41.5534719Z 2025-05-07T19:44:41.5534891Z Downloading and Extracting Packages: ...working... 2025-05-07T19:44:41.5535352Z llvm-openmp-16.0.6 | 39.9 MB | | 0% 2025-05-07T19:44:41.5535633Z 2025-05-07T19:44:41.5536088Z compiler-rt_linux-64 | 36.0 MB | | 0%  2025-05-07T19:44:41.5536367Z 2025-05-07T19:44:41.5536370Z 2025-05-07T19:44:41.5536606Z libllvm16-16.0.6 | 33.7 MB | | 0%  2025-05-07T19:44:41.5536921Z 2025-05-07T19:44:41.5536924Z 2025-05-07T19:44:41.5536928Z 2025-05-07T19:44:41.5537180Z libclang-cpp16-16.0. 
| 17.3 MB | | 0%  2025-05-07T19:44:41.5537473Z 2025-05-07T19:44:41.5537477Z 2025-05-07T19:44:41.5537480Z 2025-05-07T19:44:41.5537484Z 2025-05-07T19:44:41.5559445Z icu-73.2 | 11.5 MB | | 0%  2025-05-07T19:44:41.5559751Z 2025-05-07T19:44:41.5559756Z 2025-05-07T19:44:41.5559760Z 2025-05-07T19:44:41.5559764Z 2025-05-07T19:44:41.5559768Z 2025-05-07T19:44:41.5561203Z libcxx-19.1.7 | 1000 KB | | 0%  2025-05-07T19:44:41.5561501Z 2025-05-07T19:44:41.5561505Z 2025-05-07T19:44:41.5561509Z 2025-05-07T19:44:41.5561513Z 2025-05-07T19:44:41.5561516Z 2025-05-07T19:44:41.5561523Z 2025-05-07T19:44:41.5577768Z clang-16-16.0.6 | 780 KB | | 0%  2025-05-07T19:44:41.5578112Z 2025-05-07T19:44:41.5578325Z 2025-05-07T19:44:41.5578336Z 2025-05-07T19:44:41.5578342Z 2025-05-07T19:44:41.5578362Z 2025-05-07T19:44:41.5578366Z 2025-05-07T19:44:41.5578373Z 2025-05-07T19:44:41.5578973Z libiconv-1.18 | 696 KB | | 0%  2025-05-07T19:44:41.5579342Z 2025-05-07T19:44:41.5579346Z 2025-05-07T19:44:41.5579349Z 2025-05-07T19:44:41.5579352Z 2025-05-07T19:44:41.5579356Z 2025-05-07T19:44:41.5579359Z 2025-05-07T19:44:41.5579363Z 2025-05-07T19:44:41.5579366Z 2025-05-07T19:44:41.5579648Z libxml2-2.12.7 | 688 KB | | 0%  2025-05-07T19:44:41.5579978Z 2025-05-07T19:44:41.5579989Z 2025-05-07T19:44:41.5579992Z 2025-05-07T19:44:41.5579995Z 2025-05-07T19:44:41.5579999Z 2025-05-07T19:44:41.5580002Z 2025-05-07T19:44:41.5580006Z 2025-05-07T19:44:41.5580009Z 2025-05-07T19:44:41.5580013Z 2025-05-07T19:44:41.5580259Z zstd-1.5.6 | 542 KB | | 0%  2025-05-07T19:44:41.5580572Z 2025-05-07T19:44:41.5580576Z 2025-05-07T19:44:41.5580579Z 2025-05-07T19:44:41.5580586Z 2025-05-07T19:44:41.5580590Z 2025-05-07T19:44:41.5580593Z 2025-05-07T19:44:41.5580596Z 2025-05-07T19:44:41.5580600Z 2025-05-07T19:44:41.5580603Z 2025-05-07T19:44:41.5580607Z 2025-05-07T19:44:41.5582793Z libcxxabi-19.1.7 | 158 KB | | 0%  2025-05-07T19:44:41.5583179Z 2025-05-07T19:44:41.5583183Z 2025-05-07T19:44:41.5583218Z 2025-05-07T19:44:41.5583221Z 
2025-05-07T19:44:41.5583225Z 2025-05-07T19:44:41.5583229Z 2025-05-07T19:44:41.5583233Z 2025-05-07T19:44:41.5583236Z 2025-05-07T19:44:41.5583240Z 2025-05-07T19:44:41.5583262Z 2025-05-07T19:44:41.5583266Z 2025-05-07T19:44:41.5583541Z clang-16.0.6 | 110 KB | | 0%  2025-05-07T19:44:41.5583842Z 2025-05-07T19:44:41.5583877Z 2025-05-07T19:44:41.5583881Z 2025-05-07T19:44:41.5583885Z 2025-05-07T19:44:41.5583888Z 2025-05-07T19:44:41.5583892Z 2025-05-07T19:44:41.5583895Z 2025-05-07T19:44:41.5583899Z 2025-05-07T19:44:41.5583903Z 2025-05-07T19:44:41.5584033Z 2025-05-07T19:44:41.5584037Z 2025-05-07T19:44:41.5584044Z 2025-05-07T19:44:41.5585495Z clangxx-16.0.6 | 110 KB | | 0%  2025-05-07T19:44:41.5585852Z 2025-05-07T19:44:41.5585857Z 2025-05-07T19:44:41.5585895Z 2025-05-07T19:44:41.5585899Z 2025-05-07T19:44:41.5585903Z 2025-05-07T19:44:41.5585907Z 2025-05-07T19:44:41.5585911Z 2025-05-07T19:44:41.5585914Z 2025-05-07T19:44:41.5585918Z 2025-05-07T19:44:41.5585922Z 2025-05-07T19:44:41.5585926Z 2025-05-07T19:44:41.5585929Z 2025-05-07T19:44:41.5585947Z 2025-05-07T19:44:41.5586248Z compiler-rt-16.0.6 | 107 KB | | 0%  2025-05-07T19:44:41.5586607Z 2025-05-07T19:44:41.5586611Z 2025-05-07T19:44:41.5586614Z 2025-05-07T19:44:41.5586618Z 2025-05-07T19:44:41.5586621Z 2025-05-07T19:44:41.5586625Z 2025-05-07T19:44:41.5586628Z 2025-05-07T19:44:41.5586632Z 2025-05-07T19:44:41.5586635Z 2025-05-07T19:44:41.5586638Z 2025-05-07T19:44:41.5586651Z 2025-05-07T19:44:41.5586655Z 2025-05-07T19:44:41.5586659Z 2025-05-07T19:44:41.5586665Z 2025-05-07T19:44:41.5587120Z zlib-1.2.13 | 91 KB | | 0%  2025-05-07T19:44:41.5587415Z 2025-05-07T19:44:41.5587419Z 2025-05-07T19:44:41.5587422Z 2025-05-07T19:44:41.5587426Z 2025-05-07T19:44:41.5587429Z 2025-05-07T19:44:41.5587433Z 2025-05-07T19:44:41.5587436Z 2025-05-07T19:44:41.5587439Z 2025-05-07T19:44:41.5587443Z 2025-05-07T19:44:41.5587447Z 2025-05-07T19:44:41.5587450Z 2025-05-07T19:44:41.5587587Z 2025-05-07T19:44:41.5587592Z 2025-05-07T19:44:41.5587595Z 
2025-05-07T19:44:41.5587602Z 2025-05-07T19:44:41.6855296Z libzlib-1.2.13 | 60 KB | | 0%  2025-05-07T19:44:41.6855656Z 2025-05-07T19:44:41.6855661Z 2025-05-07T19:44:41.6855665Z 2025-05-07T19:44:41.6904750Z 2025-05-07T19:44:41.6905611Z icu-73.2 | 11.5 MB | | 0%  2025-05-07T19:44:41.6905911Z 2025-05-07T19:44:41.6905916Z 2025-05-07T19:44:41.7123436Z 2025-05-07T19:44:41.7907662Z libclang-cpp16-16.0. | 17.3 MB | | 0%  2025-05-07T19:44:41.7908000Z 2025-05-07T19:44:41.7908499Z 2025-05-07T19:44:41.7908509Z 2025-05-07T19:44:41.7908574Z 2025-05-07T19:44:41.7919354Z icu-73.2 | 11.5 MB | | 0%  2025-05-07T19:44:41.7919651Z 2025-05-07T19:44:41.7919663Z 2025-05-07T19:44:41.7919667Z 2025-05-07T19:44:41.8358158Z libclang-cpp16-16.0. | 17.3 MB | | 0%  2025-05-07T19:44:41.8448522Z llvm-openmp-16.0.6 | 39.9 MB | | 0% 2025-05-07T19:44:41.8448827Z 2025-05-07T19:44:41.8907546Z compiler-rt_linux-64 | 36.0 MB | | 0%  2025-05-07T19:44:41.8907850Z 2025-05-07T19:44:41.8907856Z 2025-05-07T19:44:41.8907860Z 2025-05-07T19:44:41.8907864Z 2025-05-07T19:44:41.8920251Z icu-73.2 | 11.5 MB | ###4 | 34%  2025-05-07T19:44:41.8920555Z 2025-05-07T19:44:41.8920565Z 2025-05-07T19:44:41.8921856Z 2025-05-07T19:44:41.9357981Z libclang-cpp16-16.0. | 17.3 MB | ##8 | 29%  2025-05-07T19:44:41.9500788Z llvm-openmp-16.0.6 | 39.9 MB | ##1 | 21% 2025-05-07T19:44:41.9501121Z 2025-05-07T19:44:41.9501127Z 2025-05-07T19:44:41.9595162Z libllvm16-16.0.6 | 33.7 MB | | 0%  2025-05-07T19:44:41.9595459Z 2025-05-07T19:44:41.9921637Z compiler-rt_linux-64 | 36.0 MB | #8 | 18%  2025-05-07T19:44:41.9921967Z 2025-05-07T19:44:41.9921973Z 2025-05-07T19:44:41.9922006Z 2025-05-07T19:44:42.0438846Z libclang-cpp16-16.0. 
| 17.3 MB | #######7 | 77%  2025-05-07T19:44:42.0555145Z llvm-openmp-16.0.6 | 39.9 MB | ###3 | 34% 2025-05-07T19:44:42.0555433Z 2025-05-07T19:44:42.0555439Z 2025-05-07T19:44:42.0594861Z libllvm16-16.0.6 | 33.7 MB | 9 | 10%  2025-05-07T19:44:42.0595164Z 2025-05-07T19:44:42.0718776Z compiler-rt_linux-64 | 36.0 MB | ####6 | 47%  2025-05-07T19:44:42.0719350Z 2025-05-07T19:44:42.0719355Z 2025-05-07T19:44:42.0719364Z 2025-05-07T19:44:42.0720407Z 2025-05-07T19:44:42.0720947Z icu-73.2 | 11.5 MB | ########## | 100%  2025-05-07T19:44:42.0721260Z 2025-05-07T19:44:42.0721265Z 2025-05-07T19:44:42.0721269Z 2025-05-07T19:44:42.0721281Z 2025-05-07T19:44:42.1135857Z icu-73.2 | 11.5 MB | ########## | 100%  2025-05-07T19:44:42.1136668Z 2025-05-07T19:44:42.1136682Z 2025-05-07T19:44:42.1136721Z 2025-05-07T19:44:42.1136733Z 2025-05-07T19:44:42.1136777Z 2025-05-07T19:44:42.1439452Z libcxx-19.1.7 | 1000 KB | 1 | 2%  2025-05-07T19:44:42.1447105Z llvm-openmp-16.0.6 | 39.9 MB | #####3 | 54% 2025-05-07T19:44:42.1447881Z 2025-05-07T19:44:42.1447896Z 2025-05-07T19:44:42.1447907Z 2025-05-07T19:44:42.1447919Z 2025-05-07T19:44:42.1447930Z 2025-05-07T19:44:42.1555230Z libcxx-19.1.7 | 1000 KB | ########## | 100%  2025-05-07T19:44:42.1555586Z 2025-05-07T19:44:42.1555590Z 2025-05-07T19:44:42.1647140Z libllvm16-16.0.6 | 33.7 MB | ##4 | 25%  2025-05-07T19:44:42.1647884Z 2025-05-07T19:44:42.1801401Z compiler-rt_linux-64 | 36.0 MB | ######4 | 65%  2025-05-07T19:44:42.1801728Z 2025-05-07T19:44:42.1801734Z 2025-05-07T19:44:42.1801759Z 2025-05-07T19:44:42.1801763Z 2025-05-07T19:44:42.1801767Z 2025-05-07T19:44:42.1801779Z 2025-05-07T19:44:42.2110724Z clang-16-16.0.6 | 780 KB | 2 | 2%  2025-05-07T19:44:42.2111052Z 2025-05-07T19:44:42.2111306Z 2025-05-07T19:44:42.2111314Z 2025-05-07T19:44:42.2111317Z 2025-05-07T19:44:42.2111343Z 2025-05-07T19:44:42.2113393Z 2025-05-07T19:44:42.2560431Z clang-16-16.0.6 | 780 KB | ########## | 100%  2025-05-07T19:44:42.2561304Z 2025-05-07T19:44:42.2561318Z 
2025-05-07T19:44:42.2561329Z 2025-05-07T19:44:42.2561339Z 2025-05-07T19:44:42.2561349Z 2025-05-07T19:44:42.2561360Z 2025-05-07T19:44:42.2561405Z 2025-05-07T19:44:42.2588771Z libiconv-1.18 | 696 KB | 2 | 2%  2025-05-07T19:44:42.2589094Z 2025-05-07T19:44:42.2589098Z 2025-05-07T19:44:42.2596788Z libllvm16-16.0.6 | 33.7 MB | ###8 | 39%  2025-05-07T19:44:42.2597094Z 2025-05-07T19:44:42.2597098Z 2025-05-07T19:44:42.2600669Z 2025-05-07T19:44:42.2609528Z libclang-cpp16-16.0. | 17.3 MB | ########## | 100%  2025-05-07T19:44:42.2927385Z llvm-openmp-16.0.6 | 39.9 MB | ######8 | 69% 2025-05-07T19:44:42.2928227Z 2025-05-07T19:44:42.2928275Z 2025-05-07T19:44:42.2928288Z 2025-05-07T19:44:42.2928299Z 2025-05-07T19:44:42.2928309Z 2025-05-07T19:44:42.2928319Z 2025-05-07T19:44:42.2928329Z 2025-05-07T19:44:42.3082854Z libiconv-1.18 | 696 KB | ########## | 100%  2025-05-07T19:44:42.3083487Z 2025-05-07T19:44:42.3200178Z compiler-rt_linux-64 | 36.0 MB | ########2 | 82%  2025-05-07T19:44:42.3200504Z 2025-05-07T19:44:42.3200510Z 2025-05-07T19:44:42.3200532Z 2025-05-07T19:44:42.3200536Z 2025-05-07T19:44:42.3201452Z 2025-05-07T19:44:42.3205667Z libcxx-19.1.7 | 1000 KB | ########## | 100%  2025-05-07T19:44:42.3205976Z 2025-05-07T19:44:42.3205982Z 2025-05-07T19:44:42.3205985Z 2025-05-07T19:44:42.3206012Z 2025-05-07T19:44:42.3206024Z 2025-05-07T19:44:42.3414943Z libcxx-19.1.7 | 1000 KB | ########## | 100%  2025-05-07T19:44:42.3415263Z 2025-05-07T19:44:42.3415268Z 2025-05-07T19:44:42.3415273Z 2025-05-07T19:44:42.3415278Z 2025-05-07T19:44:42.3415281Z 2025-05-07T19:44:42.3415327Z 2025-05-07T19:44:42.3415331Z 2025-05-07T19:44:42.3415335Z 2025-05-07T19:44:42.3416232Z 2025-05-07T19:44:42.3497634Z zstd-1.5.6 | 542 KB | 2 | 3%  2025-05-07T19:44:42.3498516Z 2025-05-07T19:44:42.3498529Z 2025-05-07T19:44:42.3498540Z 2025-05-07T19:44:42.3498551Z 2025-05-07T19:44:42.3498562Z 2025-05-07T19:44:42.3498572Z 2025-05-07T19:44:42.3498583Z 2025-05-07T19:44:42.3499023Z 2025-05-07T19:44:42.3774567Z libxml2-2.12.7 | 
688 KB | 2 | 2%  2025-05-07T19:44:42.3774900Z 2025-05-07T19:44:42.3774906Z 2025-05-07T19:44:42.3774910Z 2025-05-07T19:44:42.3774913Z 2025-05-07T19:44:42.3774916Z 2025-05-07T19:44:42.3774921Z 2025-05-07T19:44:42.3774924Z 2025-05-07T19:44:42.3774928Z 2025-05-07T19:44:42.3775004Z 2025-05-07T19:44:42.3915131Z zstd-1.5.6 | 542 KB | ########## | 100%  2025-05-07T19:44:42.3915440Z 2025-05-07T19:44:42.3915444Z 2025-05-07T19:44:42.3958796Z libllvm16-16.0.6 | 33.7 MB | ####9 | 50%  2025-05-07T19:44:42.3959094Z 2025-05-07T19:44:42.3959113Z 2025-05-07T19:44:42.3959117Z 2025-05-07T19:44:42.3959121Z 2025-05-07T19:44:42.3959124Z 2025-05-07T19:44:42.3959129Z 2025-05-07T19:44:42.3959132Z 2025-05-07T19:44:42.3959136Z 2025-05-07T19:44:42.4111932Z libxml2-2.12.7 | 688 KB | ########## | 100%  2025-05-07T19:44:42.4113073Z 2025-05-07T19:44:42.4113087Z 2025-05-07T19:44:42.4113099Z 2025-05-07T19:44:42.4113110Z 2025-05-07T19:44:42.4113121Z 2025-05-07T19:44:42.4113131Z 2025-05-07T19:44:42.4113141Z 2025-05-07T19:44:42.4113152Z 2025-05-07T19:44:42.4113162Z 2025-05-07T19:44:42.4113203Z 2025-05-07T19:44:42.4147088Z libcxxabi-19.1.7 | 158 KB | # | 10%  2025-05-07T19:44:42.4222949Z llvm-openmp-16.0.6 | 39.9 MB | ########3 | 83% 2025-05-07T19:44:42.4223417Z 2025-05-07T19:44:42.4223487Z 2025-05-07T19:44:42.4223492Z 2025-05-07T19:44:42.4223724Z 2025-05-07T19:44:42.4223747Z 2025-05-07T19:44:42.4223752Z 2025-05-07T19:44:42.4223758Z 2025-05-07T19:44:42.4223764Z 2025-05-07T19:44:42.4223769Z 2025-05-07T19:44:42.4223773Z 2025-05-07T19:44:42.4531045Z libcxxabi-19.1.7 | 158 KB | ########## | 100%  2025-05-07T19:44:42.4531405Z 2025-05-07T19:44:42.4536246Z compiler-rt_linux-64 | 36.0 MB | #########8 | 98%  2025-05-07T19:44:42.4536531Z 2025-05-07T19:44:42.4536549Z 2025-05-07T19:44:42.4536553Z 2025-05-07T19:44:42.4536557Z 2025-05-07T19:44:42.4536560Z 2025-05-07T19:44:42.4536564Z 2025-05-07T19:44:42.4536567Z 2025-05-07T19:44:42.4536591Z 2025-05-07T19:44:42.4536594Z 2025-05-07T19:44:42.4536598Z 
2025-05-07T19:44:42.4604675Z clang-16.0.6 | 110 KB | #4 | 15% 2025-05-07T19:44:42.4647855Z clangxx-16.0.6 | 110 KB | #4 | 15% 2025-05-07T19:44:42.4681927Z clang-16.0.6 | 110 KB | ########## | 100% 2025-05-07T19:44:42.4915367Z clangxx-16.0.6 | 110 KB | ########## | 100% 2025-05-07T19:44:42.5035750Z libllvm16-16.0.6 | 33.7 MB | ###### | 61% 2025-05-07T19:44:42.5092759Z zlib-1.2.13 | 91 KB | #7 | 18%
2025-05-07T19:44:42.5189857Z zlib-1.2.13 | 91 KB | ########## | 100% 2025-05-07T19:44:42.5252926Z compiler-rt-16.0.6 | 107 KB | #4 | 15% 2025-05-07T19:44:42.5392060Z compiler-rt-16.0.6 | 107 KB | ########## | 100% 2025-05-07T19:44:42.5477109Z llvm-openmp-16.0.6 | 39.9 MB | #########6 | 96% 2025-05-07T19:44:42.5507014Z libzlib-1.2.13 | 60 KB | ##6 | 27%
2025-05-07T19:44:42.5915828Z libzlib-1.2.13 | 60 KB | ########## | 100% 2025-05-07T19:44:42.7222685Z libllvm16-16.0.6 | 33.7 MB | #######7 | 78% 2025-05-07T19:44:42.7223479Z clang-16-16.0.6 | 780 KB | ########## | 100% 2025-05-07T19:44:42.7922882Z clang-16-16.0.6 | 780 KB | ########## | 100% 2025-05-07T19:44:42.7923837Z libiconv-1.18 | 696 KB | ########## | 100% 2025-05-07T19:44:42.8310747Z libiconv-1.18 | 696 KB | ########## | 100% 2025-05-07T19:44:42.8311431Z zstd-1.5.6 | 542 KB | ########## | 100%
2025-05-07T19:44:42.8857329Z zstd-1.5.6 | 542 KB | ########## | 100% 2025-05-07T19:44:42.8917406Z icu-73.2 | 11.5 MB | ########## | 100% 2025-05-07T19:44:42.9024591Z compiler-rt_linux-64 | 36.0 MB | ########## | 100% 2025-05-07T19:44:42.9025253Z libxml2-2.12.7 | 688 KB | ########## | 100% 2025-05-07T19:44:42.9219227Z libxml2-2.12.7 | 688 KB | ########## | 100% 2025-05-07T19:44:42.9220310Z libcxxabi-19.1.7 | 158 KB | ########## | 100% 2025-05-07T19:44:43.0074980Z libcxxabi-19.1.7 | 158 KB | ########## | 100% 2025-05-07T19:44:43.0118360Z llvm-openmp-16.0.6 | 39.9 MB | ########## | 100%
2025-05-07T19:44:43.0120478Z clang-16.0.6 | 110 KB | ########## | 100% 2025-05-07T19:44:43.0261868Z clang-16.0.6 | 110 KB | ########## | 100% 2025-05-07T19:44:43.0262753Z clangxx-16.0.6 | 110 KB | ########## | 100% 2025-05-07T19:44:43.0357345Z clangxx-16.0.6 | 110 KB | ########## | 100%
2025-05-07T19:44:43.0359386Z zlib-1.2.13 | 91 KB | ########## | 100% 2025-05-07T19:44:43.0496608Z zlib-1.2.13 | 91 KB | ########## | 100% 2025-05-07T19:44:43.0497204Z libllvm16-16.0.6 | 33.7 MB | ########## | 100% 2025-05-07T19:44:43.0518285Z libllvm16-16.0.6 | 33.7 MB | ########## | 100% 2025-05-07T19:44:43.0521757Z libzlib-1.2.13 | 60 KB | ########## | 100% 2025-05-07T19:44:43.0611132Z libzlib-1.2.13 | 60 KB | ########## | 100%
2025-05-07T19:44:43.0613601Z compiler-rt-16.0.6 | 107 KB | ########## | 100% 2025-05-07T19:44:43.1236674Z compiler-rt-16.0.6 | 107 KB | ########## | 100% 2025-05-07T19:44:43.5572240Z libclang-cpp16-16.0. | 17.3 MB | ########## | 100% 2025-05-07T19:44:43.6285063Z compiler-rt_linux-64 | 36.0 MB | ########## | 100% 2025-05-07T19:44:43.6646214Z libllvm16-16.0.6 | 33.7 MB | ########## | 100% 2025-05-07T19:44:43.6652201Z llvm-openmp-16.0.6 | 39.9 MB | ########## | 100%
2025-05-07T19:44:43.6660718Z done 2025-05-07T19:44:43.7666552Z Preparing transaction: \ done 2025-05-07T19:44:43.8675345Z Verifying transaction: / done 2025-05-07T19:44:43.9693946Z Executing transaction: \ done 2025-05-07T19:44:44.0594207Z [INSTALL] Setting the C/C++ compiler symlinks ... 
2025-05-07T19:44:47.7369843Z + ln -sf /github/home/miniconda/envs/build_binary/bin/clang /github/home/miniconda/envs/build_binary/bin/cc 2025-05-07T19:44:47.7370459Z 2025-05-07T19:44:47.7383702Z 2025-05-07T19:44:47.7396904Z + ln -sf /github/home/miniconda/envs/build_binary/bin/clang /github/home/miniconda/envs/build_binary/bin/gcc 2025-05-07T19:44:47.7397419Z 2025-05-07T19:44:47.7420777Z 2025-05-07T19:44:47.7450370Z + ln -sf /github/home/miniconda/envs/build_binary/bin/clang++ /github/home/miniconda/envs/build_binary/bin/c++ 2025-05-07T19:44:47.7451904Z 2025-05-07T19:44:47.7463225Z 2025-05-07T19:44:47.7479793Z + ln -sf /github/home/miniconda/envs/build_binary/bin/clang++ /github/home/miniconda/envs/build_binary/bin/g++ 2025-05-07T19:44:47.7481348Z 2025-05-07T19:44:47.7497205Z 2025-05-07T19:44:47.7497854Z + conda env config vars set -n build_binary CC= 2025-05-07T19:44:47.7498600Z 2025-05-07T19:44:48.1618767Z 2025-05-07T19:44:48.1619819Z + conda env config vars set -n build_binary CXX= 2025-05-07T19:44:48.1620626Z 2025-05-07T19:44:48.5759432Z 2025-05-07T19:44:48.5760311Z + conda run -n build_binary printenv CC 2025-05-07T19:44:48.5761011Z 2025-05-07T19:44:50.3460750Z /github/home/miniconda/envs/build_binary/bin/x86_64-conda-linux-gnu-cc 2025-05-07T19:44:50.3461323Z 2025-05-07T19:44:50.4018530Z 2025-05-07T19:44:50.4019359Z + conda run -n build_binary printenv CXX 2025-05-07T19:44:50.4020104Z 2025-05-07T19:44:52.1588395Z /github/home/miniconda/envs/build_binary/bin/x86_64-conda-linux-gnu-c++ 2025-05-07T19:44:52.1589493Z 2025-05-07T19:44:52.2147166Z 2025-05-07T19:44:54.0604804Z [ENV] Appending to LD_LIBRARY_PATH: /github/home/miniconda/envs/build_binary/lib ... 2025-05-07T19:44:55.8574681Z ERROR conda.cli.main_run:execute(125): `conda run printenv LD_LIBRARY_PATH` failed. 
(See above for error) 2025-05-07T19:44:55.9150645Z + conda env config vars set -n build_binary LD_LIBRARY_PATH=/github/home/miniconda/envs/build_binary/lib 2025-05-07T19:44:55.9151177Z 2025-05-07T19:44:56.3223094Z 2025-05-07T19:44:58.1223039Z /github/home/miniconda/envs/build_binary/bin/cc 2025-05-07T19:44:58.1223405Z 2025-05-07T19:44:58.1789547Z [CHECK] Binary cc found in PATH 2025-05-07T19:44:59.9683154Z /github/home/miniconda/envs/build_binary/bin/gcc 2025-05-07T19:44:59.9683479Z 2025-05-07T19:45:00.0469516Z [CHECK] Binary gcc found in PATH 2025-05-07T19:45:01.8334091Z /github/home/miniconda/envs/build_binary/bin/c++ 2025-05-07T19:45:01.8334422Z 2025-05-07T19:45:01.9114532Z [CHECK] Binary c++ found in PATH 2025-05-07T19:45:03.7044910Z /github/home/miniconda/envs/build_binary/bin/g++ 2025-05-07T19:45:03.7045457Z 2025-05-07T19:45:03.7795100Z [CHECK] Binary g++ found in PATH 2025-05-07T19:45:03.7801484Z [INFO] Printing out all preprocessor defines in the C compiler ... 2025-05-07T19:45:03.7802195Z + conda run -n build_binary cc -dM -E - 2025-05-07T19:45:03.7802438Z 2025-05-07T19:45:05.6043328Z #define _LP64 1 2025-05-07T19:45:05.6043692Z #define __ATOMIC_ACQUIRE 2 2025-05-07T19:45:05.6044015Z #define __ATOMIC_ACQ_REL 4 2025-05-07T19:45:05.6044314Z #define __ATOMIC_CONSUME 1 2025-05-07T19:45:05.6044604Z #define __ATOMIC_RELAXED 0 2025-05-07T19:45:05.6044860Z #define __ATOMIC_RELEASE 3 2025-05-07T19:45:05.6045498Z #define __ATOMIC_SEQ_CST 5 2025-05-07T19:45:05.6045843Z #define __BIGGEST_ALIGNMENT__ 16 2025-05-07T19:45:05.6046152Z #define __BITINT_MAXWIDTH__ 8388608 2025-05-07T19:45:05.6046452Z #define __BOOL_WIDTH__ 8 2025-05-07T19:45:05.6046734Z #define __BYTE_ORDER__ __ORDER_LITTLE_ENDIAN__ 2025-05-07T19:45:05.6047089Z #define __CHAR16_TYPE__ unsigned short 2025-05-07T19:45:05.6047410Z #define __CHAR32_TYPE__ unsigned int 2025-05-07T19:45:05.6047745Z #define __CHAR_BIT__ 8 2025-05-07T19:45:05.6048021Z #define __CLANG_ATOMIC_BOOL_LOCK_FREE 2 
2025-05-07T19:45:05.6048377Z #define __CLANG_ATOMIC_CHAR16_T_LOCK_FREE 2 2025-05-07T19:45:05.6048722Z #define __CLANG_ATOMIC_CHAR32_T_LOCK_FREE 2 2025-05-07T19:45:05.6049072Z #define __CLANG_ATOMIC_CHAR_LOCK_FREE 2 2025-05-07T19:45:05.6049409Z #define __CLANG_ATOMIC_INT_LOCK_FREE 2 2025-05-07T19:45:05.6049722Z #define __CLANG_ATOMIC_LLONG_LOCK_FREE 2 2025-05-07T19:45:05.6050047Z #define __CLANG_ATOMIC_LONG_LOCK_FREE 2 2025-05-07T19:45:05.6050385Z #define __CLANG_ATOMIC_POINTER_LOCK_FREE 2 2025-05-07T19:45:05.6050725Z #define __CLANG_ATOMIC_SHORT_LOCK_FREE 2 2025-05-07T19:45:05.6051045Z #define __CLANG_ATOMIC_WCHAR_T_LOCK_FREE 2 2025-05-07T19:45:05.6051403Z #define __CONSTANT_CFSTRINGS__ 1 2025-05-07T19:45:05.6051719Z #define __DBL_DECIMAL_DIG__ 17 2025-05-07T19:45:05.6052059Z #define __DBL_DENORM_MIN__ 4.9406564584124654e-324 2025-05-07T19:45:05.6052500Z #define __DBL_DIG__ 15 2025-05-07T19:45:05.6052785Z #define __DBL_EPSILON__ 2.2204460492503131e-16 2025-05-07T19:45:05.6053132Z #define __DBL_HAS_DENORM__ 1 2025-05-07T19:45:05.6053410Z #define __DBL_HAS_INFINITY__ 1 2025-05-07T19:45:05.6053698Z #define __DBL_HAS_QUIET_NAN__ 1 2025-05-07T19:45:05.6053965Z #define __DBL_MANT_DIG__ 53 2025-05-07T19:45:05.6054248Z #define __DBL_MAX_10_EXP__ 308 2025-05-07T19:45:05.6054514Z #define __DBL_MAX_EXP__ 1024 2025-05-07T19:45:05.6054802Z #define __DBL_MAX__ 1.7976931348623157e+308 2025-05-07T19:45:05.6055138Z #define __DBL_MIN_10_EXP__ (-307) 2025-05-07T19:45:05.6055423Z #define __DBL_MIN_EXP__ (-1021) 2025-05-07T19:45:05.6055733Z #define __DBL_MIN__ 2.2250738585072014e-308 2025-05-07T19:45:05.6056079Z #define __DECIMAL_DIG__ __LDBL_DECIMAL_DIG__ 2025-05-07T19:45:05.6056401Z #define __ELF__ 1 2025-05-07T19:45:05.6056626Z #define __FINITE_MATH_ONLY__ 0 2025-05-07T19:45:05.6056938Z #define __FLOAT128__ 1 2025-05-07T19:45:05.6057195Z #define __FLT16_DECIMAL_DIG__ 5 2025-05-07T19:45:05.6057516Z #define __FLT16_DENORM_MIN__ 5.9604644775390625e-8F16 2025-05-07T19:45:05.6058017Z 
#define __FLT16_DIG__ 3 2025-05-07T19:45:05.6058328Z #define __FLT16_EPSILON__ 9.765625e-4F16 2025-05-07T19:45:05.6058693Z #define __FLT16_HAS_DENORM__ 1 2025-05-07T19:45:05.6058992Z #define __FLT16_HAS_INFINITY__ 1 2025-05-07T19:45:05.6059322Z #define __FLT16_HAS_QUIET_NAN__ 1 2025-05-07T19:45:05.6059630Z #define __FLT16_MANT_DIG__ 11 2025-05-07T19:45:05.6059936Z #define __FLT16_MAX_10_EXP__ 4 2025-05-07T19:45:05.6060201Z #define __FLT16_MAX_EXP__ 16 2025-05-07T19:45:05.6060484Z #define __FLT16_MAX__ 6.5504e+4F16 2025-05-07T19:45:05.6060786Z #define __FLT16_MIN_10_EXP__ (-4) 2025-05-07T19:45:05.6061113Z #define __FLT16_MIN_EXP__ (-13) 2025-05-07T19:45:05.6061385Z #define __FLT16_MIN__ 6.103515625e-5F16 2025-05-07T19:45:05.6061731Z #define __FLT_DECIMAL_DIG__ 9 2025-05-07T19:45:05.6062177Z #define __FLT_DENORM_MIN__ 1.40129846e-45F 2025-05-07T19:45:05.6062479Z #define __FLT_DIG__ 6 2025-05-07T19:45:05.6062736Z #define __FLT_EPSILON__ 1.19209290e-7F 2025-05-07T19:45:05.6063126Z #define __FLT_HAS_DENORM__ 1 2025-05-07T19:45:05.6063403Z #define __FLT_HAS_INFINITY__ 1 2025-05-07T19:45:05.6063664Z #define __FLT_HAS_QUIET_NAN__ 1 2025-05-07T19:45:05.6063937Z #define __FLT_MANT_DIG__ 24 2025-05-07T19:45:05.6064180Z #define __FLT_MAX_10_EXP__ 38 2025-05-07T19:45:05.6064449Z #define __FLT_MAX_EXP__ 128 2025-05-07T19:45:05.6064691Z #define __FLT_MAX__ 3.40282347e+38F 2025-05-07T19:45:05.6064987Z #define __FLT_MIN_10_EXP__ (-37) 2025-05-07T19:45:05.6065272Z #define __FLT_MIN_EXP__ (-125) 2025-05-07T19:45:05.6065618Z #define __FLT_MIN__ 1.17549435e-38F 2025-05-07T19:45:05.6065921Z #define __FLT_RADIX__ 2 2025-05-07T19:45:05.6066143Z #define __FXSR__ 1 2025-05-07T19:45:05.6066379Z #define __GCC_ASM_FLAG_OUTPUTS__ 1 2025-05-07T19:45:05.6066698Z #define __GCC_ATOMIC_BOOL_LOCK_FREE 2 2025-05-07T19:45:05.6067042Z #define __GCC_ATOMIC_CHAR16_T_LOCK_FREE 2 2025-05-07T19:45:05.6067370Z #define __GCC_ATOMIC_CHAR32_T_LOCK_FREE 2 2025-05-07T19:45:05.6067719Z #define 
__GCC_ATOMIC_CHAR_LOCK_FREE 2 2025-05-07T19:45:05.6068028Z #define __GCC_ATOMIC_INT_LOCK_FREE 2 2025-05-07T19:45:05.6068369Z #define __GCC_ATOMIC_LLONG_LOCK_FREE 2 2025-05-07T19:45:05.6068685Z #define __GCC_ATOMIC_LONG_LOCK_FREE 2 2025-05-07T19:45:05.6069020Z #define __GCC_ATOMIC_POINTER_LOCK_FREE 2 2025-05-07T19:45:05.6069341Z #define __GCC_ATOMIC_SHORT_LOCK_FREE 2 2025-05-07T19:45:05.6069706Z #define __GCC_ATOMIC_TEST_AND_SET_TRUEVAL 1 2025-05-07T19:45:05.6070045Z #define __GCC_ATOMIC_WCHAR_T_LOCK_FREE 2 2025-05-07T19:45:05.6070402Z #define __GCC_HAVE_DWARF2_CFI_ASM 1 2025-05-07T19:45:05.6070763Z #define __GCC_HAVE_SYNC_COMPARE_AND_SWAP_1 1 2025-05-07T19:45:05.6071103Z #define __GCC_HAVE_SYNC_COMPARE_AND_SWAP_2 1 2025-05-07T19:45:05.6071480Z #define __GCC_HAVE_SYNC_COMPARE_AND_SWAP_4 1 2025-05-07T19:45:05.6071826Z #define __GCC_HAVE_SYNC_COMPARE_AND_SWAP_8 1 2025-05-07T19:45:05.6072176Z #define __GNUC_MINOR__ 2 2025-05-07T19:45:05.6072451Z #define __GNUC_PATCHLEVEL__ 1 2025-05-07T19:45:05.6072846Z #define __GNUC_STDC_INLINE__ 1 2025-05-07T19:45:05.6073132Z #define __GNUC__ 4 2025-05-07T19:45:05.6073593Z #define __GXX_ABI_VERSION 1002 2025-05-07T19:45:05.6073888Z #define __INT16_C_SUFFIX__ 2025-05-07T19:45:05.6074277Z #define __INT16_FMTd__ "hd" 2025-05-07T19:45:05.6074598Z #define __INT16_FMTi__ "hi" 2025-05-07T19:45:05.6074882Z #define __INT16_MAX__ 32767 2025-05-07T19:45:05.6075198Z #define __INT16_TYPE__ short 2025-05-07T19:45:05.6075481Z #define __INT32_C_SUFFIX__ 2025-05-07T19:45:05.6075795Z #define __INT32_FMTd__ "d" 2025-05-07T19:45:05.6076065Z #define __INT32_FMTi__ "i" 2025-05-07T19:45:05.6076382Z #define __INT32_MAX__ 2147483647 2025-05-07T19:45:05.6076678Z #define __INT32_TYPE__ int 2025-05-07T19:45:05.6076988Z #define __INT64_C_SUFFIX__ L 2025-05-07T19:45:05.6077270Z #define __INT64_FMTd__ "ld" 2025-05-07T19:45:05.6077578Z #define __INT64_FMTi__ "li" 2025-05-07T19:45:05.6077900Z #define __INT64_MAX__ 9223372036854775807L 2025-05-07T19:45:05.6078222Z 
#define __INT64_TYPE__ long int 2025-05-07T19:45:05.6078536Z #define __INT8_C_SUFFIX__ 2025-05-07T19:45:05.6078901Z #define __INT8_FMTd__ "hhd" 2025-05-07T19:45:05.6079210Z #define __INT8_FMTi__ "hhi" 2025-05-07T19:45:05.6079486Z #define __INT8_MAX__ 127 2025-05-07T19:45:05.6079801Z #define __INT8_TYPE__ signed char 2025-05-07T19:45:05.6080114Z #define __INTMAX_C_SUFFIX__ L 2025-05-07T19:45:05.6080434Z #define __INTMAX_FMTd__ "ld" 2025-05-07T19:45:05.6080728Z #define __INTMAX_FMTi__ "li" 2025-05-07T19:45:05.6081055Z #define __INTMAX_MAX__ 9223372036854775807L 2025-05-07T19:45:05.6081414Z #define __INTMAX_TYPE__ long int 2025-05-07T19:45:05.6081722Z #define __INTMAX_WIDTH__ 64 2025-05-07T19:45:05.6082038Z #define __INTPTR_FMTd__ "ld" 2025-05-07T19:45:05.6082328Z #define __INTPTR_FMTi__ "li" 2025-05-07T19:45:05.6082659Z #define __INTPTR_MAX__ 9223372036854775807L 2025-05-07T19:45:05.6082990Z #define __INTPTR_TYPE__ long int 2025-05-07T19:45:05.6083323Z #define __INTPTR_WIDTH__ 64 2025-05-07T19:45:05.6083613Z #define __INT_FAST16_FMTd__ "hd" 2025-05-07T19:45:05.6083946Z #define __INT_FAST16_FMTi__ "hi" 2025-05-07T19:45:05.6084249Z #define __INT_FAST16_MAX__ 32767 2025-05-07T19:45:05.6084582Z #define __INT_FAST16_TYPE__ short 2025-05-07T19:45:05.6084917Z #define __INT_FAST16_WIDTH__ 16 2025-05-07T19:45:05.6085208Z #define __INT_FAST32_FMTd__ "d" 2025-05-07T19:45:05.6085632Z #define __INT_FAST32_FMTi__ "i" 2025-05-07T19:45:05.6085910Z #define __INT_FAST32_MAX__ 2147483647 2025-05-07T19:45:05.6086232Z #define __INT_FAST32_TYPE__ int 2025-05-07T19:45:05.6086504Z #define __INT_FAST32_WIDTH__ 32 2025-05-07T19:45:05.6086804Z #define __INT_FAST64_FMTd__ "ld" 2025-05-07T19:45:05.6087153Z #define __INT_FAST64_FMTi__ "li" 2025-05-07T19:45:05.6087483Z #define __INT_FAST64_MAX__ 9223372036854775807L 2025-05-07T19:45:05.6087816Z #define __INT_FAST64_TYPE__ long int 2025-05-07T19:45:05.6088144Z #define __INT_FAST64_WIDTH__ 64 2025-05-07T19:45:05.6088453Z #define __INT_FAST8_FMTd__ 
"hhd" 2025-05-07T19:45:05.6088736Z #define __INT_FAST8_FMTi__ "hhi" 2025-05-07T19:45:05.6089043Z #define __INT_FAST8_MAX__ 127 2025-05-07T19:45:05.6089329Z #define __INT_FAST8_TYPE__ signed char 2025-05-07T19:45:05.6089656Z #define __INT_FAST8_WIDTH__ 8 2025-05-07T19:45:05.6089932Z #define __INT_LEAST16_FMTd__ "hd" 2025-05-07T19:45:05.6090248Z #define __INT_LEAST16_FMTi__ "hi" 2025-05-07T19:45:05.6090533Z #define __INT_LEAST16_MAX__ 32767 2025-05-07T19:45:05.6090843Z #define __INT_LEAST16_TYPE__ short 2025-05-07T19:45:05.6091127Z #define __INT_LEAST16_WIDTH__ 16 2025-05-07T19:45:05.6091431Z #define __INT_LEAST32_FMTd__ "d" 2025-05-07T19:45:05.6091736Z #define __INT_LEAST32_FMTi__ "i" 2025-05-07T19:45:05.6092023Z #define __INT_LEAST32_MAX__ 2147483647 2025-05-07T19:45:05.6092348Z #define __INT_LEAST32_TYPE__ int 2025-05-07T19:45:05.6092630Z #define __INT_LEAST32_WIDTH__ 32 2025-05-07T19:45:05.6092934Z #define __INT_LEAST64_FMTd__ "ld" 2025-05-07T19:45:05.6093215Z #define __INT_LEAST64_FMTi__ "li" 2025-05-07T19:45:05.6093548Z #define __INT_LEAST64_MAX__ 9223372036854775807L 2025-05-07T19:45:05.6093883Z #define __INT_LEAST64_TYPE__ long int 2025-05-07T19:45:05.6094211Z #define __INT_LEAST64_WIDTH__ 64 2025-05-07T19:45:05.6094521Z #define __INT_LEAST8_FMTd__ "hhd" 2025-05-07T19:45:05.6094807Z #define __INT_LEAST8_FMTi__ "hhi" 2025-05-07T19:45:05.6095124Z #define __INT_LEAST8_MAX__ 127 2025-05-07T19:45:05.6095410Z #define __INT_LEAST8_TYPE__ signed char 2025-05-07T19:45:05.6095751Z #define __INT_LEAST8_WIDTH__ 8 2025-05-07T19:45:05.6096027Z #define __INT_MAX__ 2147483647 2025-05-07T19:45:05.6096314Z #define __INT_WIDTH__ 32 2025-05-07T19:45:05.6096575Z #define __LDBL_DECIMAL_DIG__ 21 2025-05-07T19:45:05.6096926Z #define __LDBL_DENORM_MIN__ 3.64519953188247460253e-4951L 2025-05-07T19:45:05.6097282Z #define __LDBL_DIG__ 18 2025-05-07T19:45:05.6097598Z #define __LDBL_EPSILON__ 1.08420217248550443401e-19L 2025-05-07T19:45:05.6097968Z #define __LDBL_HAS_DENORM__ 1 
2025-05-07T19:45:05.6098251Z #define __LDBL_HAS_INFINITY__ 1 2025-05-07T19:45:05.6098566Z #define __LDBL_HAS_QUIET_NAN__ 1 2025-05-07T19:45:05.6098851Z #define __LDBL_MANT_DIG__ 64 2025-05-07T19:45:05.6099242Z #define __LDBL_MAX_10_EXP__ 4932 2025-05-07T19:45:05.6099532Z #define __LDBL_MAX_EXP__ 16384 2025-05-07T19:45:05.6099872Z #define __LDBL_MAX__ 1.18973149535723176502e+4932L 2025-05-07T19:45:05.6100214Z #define __LDBL_MIN_10_EXP__ (-4931) 2025-05-07T19:45:05.6100552Z #define __LDBL_MIN_EXP__ (-16381) 2025-05-07T19:45:05.6100870Z #define __LDBL_MIN__ 3.36210314311209350626e-4932L 2025-05-07T19:45:05.6101240Z #define __LITTLE_ENDIAN__ 1 2025-05-07T19:45:05.6101549Z #define __LLONG_WIDTH__ 64 2025-05-07T19:45:05.6101850Z #define __LONG_LONG_MAX__ 9223372036854775807LL 2025-05-07T19:45:05.6102915Z #define __LONG_MAX__ 9223372036854775807L 2025-05-07T19:45:05.6103245Z #define __LONG_WIDTH__ 64 2025-05-07T19:45:05.6103638Z #define __LP64__ 1 2025-05-07T19:45:05.6103889Z #define __MMX__ 1 2025-05-07T19:45:05.6104174Z #define __NO_INLINE__ 1 2025-05-07T19:45:05.6104456Z #define __NO_MATH_INLINES 1 2025-05-07T19:45:05.6104785Z #define __OBJC_BOOL_IS_BOOL 0 2025-05-07T19:45:05.6105122Z #define __OPENCL_MEMORY_SCOPE_ALL_SVM_DEVICES 3 2025-05-07T19:45:05.6105542Z #define __OPENCL_MEMORY_SCOPE_DEVICE 2 2025-05-07T19:45:05.6105935Z #define __OPENCL_MEMORY_SCOPE_SUB_GROUP 4 2025-05-07T19:45:05.6106300Z #define __OPENCL_MEMORY_SCOPE_WORK_GROUP 1 2025-05-07T19:45:05.6106692Z #define __OPENCL_MEMORY_SCOPE_WORK_ITEM 0 2025-05-07T19:45:05.6107040Z #define __ORDER_BIG_ENDIAN__ 4321 2025-05-07T19:45:05.6107392Z #define __ORDER_LITTLE_ENDIAN__ 1234 2025-05-07T19:45:05.6107724Z #define __ORDER_PDP_ENDIAN__ 3412 2025-05-07T19:45:05.6108056Z #define __PIC__ 2 2025-05-07T19:45:05.6108434Z #define __PIE__ 2 2025-05-07T19:45:05.6108719Z #define __POINTER_WIDTH__ 64 2025-05-07T19:45:05.6109026Z #define __PRAGMA_REDEFINE_EXTNAME 1 2025-05-07T19:45:05.6109371Z #define __PTRDIFF_FMTd__ "ld" 
2025-05-07T19:45:05.6109701Z #define __PTRDIFF_FMTi__ "li" 2025-05-07T19:45:05.6110008Z #define __PTRDIFF_MAX__ 9223372036854775807L 2025-05-07T19:45:05.6110380Z #define __PTRDIFF_TYPE__ long int 2025-05-07T19:45:05.6110686Z #define __PTRDIFF_WIDTH__ 64 2025-05-07T19:45:05.6111016Z #define __REGISTER_PREFIX__ 2025-05-07T19:45:05.6111299Z #define __SCHAR_MAX__ 127 2025-05-07T19:45:05.6111600Z #define __SEG_FS 1 2025-05-07T19:45:05.6111842Z #define __SEG_GS 1 2025-05-07T19:45:05.6112127Z #define __SHRT_MAX__ 32767 2025-05-07T19:45:05.6112417Z #define __SHRT_WIDTH__ 16 2025-05-07T19:45:05.6112740Z #define __SIG_ATOMIC_MAX__ 2147483647 2025-05-07T19:45:05.6113171Z #define __SIG_ATOMIC_WIDTH__ 32 2025-05-07T19:45:05.6113469Z #define __SIZEOF_DOUBLE__ 8 2025-05-07T19:45:05.6113790Z #define __SIZEOF_FLOAT128__ 16 2025-05-07T19:45:05.6114087Z #define __SIZEOF_FLOAT__ 4 2025-05-07T19:45:05.6114402Z #define __SIZEOF_INT128__ 16 2025-05-07T19:45:05.6114691Z #define __SIZEOF_INT__ 4 2025-05-07T19:45:05.6115007Z #define __SIZEOF_LONG_DOUBLE__ 16 2025-05-07T19:45:05.6115316Z #define __SIZEOF_LONG_LONG__ 8 2025-05-07T19:45:05.6115635Z #define __SIZEOF_LONG__ 8 2025-05-07T19:45:05.6115915Z #define __SIZEOF_POINTER__ 8 2025-05-07T19:45:05.6116241Z #define __SIZEOF_PTRDIFF_T__ 8 2025-05-07T19:45:05.6116571Z #define __SIZEOF_SHORT__ 2 2025-05-07T19:45:05.6116849Z #define __SIZEOF_SIZE_T__ 8 2025-05-07T19:45:05.6117163Z #define __SIZEOF_WCHAR_T__ 4 2025-05-07T19:45:05.6117452Z #define __SIZEOF_WINT_T__ 4 2025-05-07T19:45:05.6117761Z #define __SIZE_FMTX__ "lX" 2025-05-07T19:45:05.6118038Z #define __SIZE_FMTo__ "lo" 2025-05-07T19:45:05.6118343Z #define __SIZE_FMTu__ "lu" 2025-05-07T19:45:05.6118622Z #define __SIZE_FMTx__ "lx" 2025-05-07T19:45:05.6118936Z #define __SIZE_MAX__ 18446744073709551615UL 2025-05-07T19:45:05.6119279Z #define __SIZE_TYPE__ long unsigned int 2025-05-07T19:45:05.6119628Z #define __SIZE_WIDTH__ 64 2025-05-07T19:45:05.6119922Z #define __SSE2_MATH__ 1 
2025-05-07T19:45:05.6120175Z #define __SSE2__ 1 2025-05-07T19:45:05.6120447Z #define __SSE_MATH__ 1 2025-05-07T19:45:05.6120703Z #define __SSE__ 1 2025-05-07T19:45:05.6120979Z #define __STDC_HOSTED__ 1 2025-05-07T19:45:05.6121251Z #define __STDC_UTF_16__ 1 2025-05-07T19:45:05.6121550Z #define __STDC_UTF_32__ 1 2025-05-07T19:45:05.6121980Z #define __STDC_VERSION__ 201710L 2025-05-07T19:45:05.6122307Z #define __STDC__ 1 2025-05-07T19:45:05.6122557Z #define __UINT16_C_SUFFIX__ 2025-05-07T19:45:05.6122883Z #define __UINT16_FMTX__ "hX" 2025-05-07T19:45:05.6123168Z #define __UINT16_FMTo__ "ho" 2025-05-07T19:45:05.6123481Z #define __UINT16_FMTu__ "hu" 2025-05-07T19:45:05.6123801Z #define __UINT16_FMTx__ "hx" 2025-05-07T19:45:05.6124083Z #define __UINT16_MAX__ 65535 2025-05-07T19:45:05.6124515Z #define __UINT16_TYPE__ unsigned short 2025-05-07T19:45:05.6124836Z #define __UINT32_C_SUFFIX__ U 2025-05-07T19:45:05.6125144Z #define __UINT32_FMTX__ "X" 2025-05-07T19:45:05.6125415Z #define __UINT32_FMTo__ "o" 2025-05-07T19:45:05.6125711Z #define __UINT32_FMTu__ "u" 2025-05-07T19:45:05.6125982Z #define __UINT32_FMTx__ "x" 2025-05-07T19:45:05.6126283Z #define __UINT32_MAX__ 4294967295U 2025-05-07T19:45:05.6126591Z #define __UINT32_TYPE__ unsigned int 2025-05-07T19:45:05.6126926Z #define __UINT64_C_SUFFIX__ UL 2025-05-07T19:45:05.6127249Z #define __UINT64_FMTX__ "lX" 2025-05-07T19:45:05.6127535Z #define __UINT64_FMTo__ "lo" 2025-05-07T19:45:05.6127855Z #define __UINT64_FMTu__ "lu" 2025-05-07T19:45:05.6128138Z #define __UINT64_FMTx__ "lx" 2025-05-07T19:45:05.6128466Z #define __UINT64_MAX__ 18446744073709551615UL 2025-05-07T19:45:05.6128810Z #define __UINT64_TYPE__ long unsigned int 2025-05-07T19:45:05.6129160Z #define __UINT8_C_SUFFIX__ 2025-05-07T19:45:05.6129437Z #define __UINT8_FMTX__ "hhX" 2025-05-07T19:45:05.6129750Z #define __UINT8_FMTo__ "hho" 2025-05-07T19:45:05.6130104Z #define __UINT8_FMTu__ "hhu" 2025-05-07T19:45:05.6130418Z #define __UINT8_FMTx__ "hhx" 
2025-05-07T19:45:05.6130728Z #define __UINT8_MAX__ 255 2025-05-07T19:45:05.6131017Z #define __UINT8_TYPE__ unsigned char 2025-05-07T19:45:05.6131389Z #define __UINTMAX_C_SUFFIX__ UL 2025-05-07T19:45:05.6131692Z #define __UINTMAX_FMTX__ "lX" 2025-05-07T19:45:05.6132015Z #define __UINTMAX_FMTo__ "lo" 2025-05-07T19:45:05.6132300Z #define __UINTMAX_FMTu__ "lu" 2025-05-07T19:45:05.6132630Z #define __UINTMAX_FMTx__ "lx" 2025-05-07T19:45:05.6132947Z #define __UINTMAX_MAX__ 18446744073709551615UL 2025-05-07T19:45:05.6133334Z #define __UINTMAX_TYPE__ long unsigned int 2025-05-07T19:45:05.6133658Z #define __UINTMAX_WIDTH__ 64 2025-05-07T19:45:05.6133981Z #define __UINTPTR_FMTX__ "lX" 2025-05-07T19:45:05.6134314Z #define __UINTPTR_FMTo__ "lo" 2025-05-07T19:45:05.6134611Z #define __UINTPTR_FMTu__ "lu" 2025-05-07T19:45:05.6134924Z #define __UINTPTR_FMTx__ "lx" 2025-05-07T19:45:05.6135242Z #define __UINTPTR_MAX__ 18446744073709551615UL 2025-05-07T19:45:05.6135641Z #define __UINTPTR_TYPE__ long unsigned int 2025-05-07T19:45:05.6136150Z #define __UINTPTR_WIDTH__ 64 2025-05-07T19:45:05.6136491Z #define __UINT_FAST16_FMTX__ "hX" 2025-05-07T19:45:05.6136806Z #define __UINT_FAST16_FMTo__ "ho" 2025-05-07T19:45:05.6137157Z #define __UINT_FAST16_FMTu__ "hu" 2025-05-07T19:45:05.6137473Z #define __UINT_FAST16_FMTx__ "hx" 2025-05-07T19:45:05.6137823Z #define __UINT_FAST16_MAX__ 65535 2025-05-07T19:45:05.6138184Z #define __UINT_FAST16_TYPE__ unsigned short 2025-05-07T19:45:05.6138531Z #define __UINT_FAST32_FMTX__ "X" 2025-05-07T19:45:05.6138877Z #define __UINT_FAST32_FMTo__ "o" 2025-05-07T19:45:05.6139186Z #define __UINT_FAST32_FMTu__ "u" 2025-05-07T19:45:05.6139525Z #define __UINT_FAST32_FMTx__ "x" 2025-05-07T19:45:05.6139870Z #define __UINT_FAST32_MAX__ 4294967295U 2025-05-07T19:45:05.6140243Z #define __UINT_FAST32_TYPE__ unsigned int 2025-05-07T19:45:05.6140577Z #define __UINT_FAST64_FMTX__ "lX" 2025-05-07T19:45:05.6140920Z #define __UINT_FAST64_FMTo__ "lo" 2025-05-07T19:45:05.6141265Z 
#define __UINT_FAST64_FMTu__ "lu" 2025-05-07T19:45:05.6141570Z #define __UINT_FAST64_FMTx__ "lx" 2025-05-07T19:45:05.6141927Z #define __UINT_FAST64_MAX__ 18446744073709551615UL 2025-05-07T19:45:05.6142307Z #define __UINT_FAST64_TYPE__ long unsigned int 2025-05-07T19:45:05.6142681Z #define __UINT_FAST8_FMTX__ "hhX" 2025-05-07T19:45:05.6142985Z #define __UINT_FAST8_FMTo__ "hho" 2025-05-07T19:45:05.6143331Z #define __UINT_FAST8_FMTu__ "hhu" 2025-05-07T19:45:05.6143710Z #define __UINT_FAST8_FMTx__ "hhx" 2025-05-07T19:45:05.6144043Z #define __UINT_FAST8_MAX__ 255 2025-05-07T19:45:05.6144356Z #define __UINT_FAST8_TYPE__ unsigned char 2025-05-07T19:45:05.6144729Z #define __UINT_LEAST16_FMTX__ "hX" 2025-05-07T19:45:05.6145073Z #define __UINT_LEAST16_FMTo__ "ho" 2025-05-07T19:45:05.6145384Z #define __UINT_LEAST16_FMTu__ "hu" 2025-05-07T19:45:05.6145727Z #define __UINT_LEAST16_FMTx__ "hx" 2025-05-07T19:45:05.6146040Z #define __UINT_LEAST16_MAX__ 65535 2025-05-07T19:45:05.6146411Z #define __UINT_LEAST16_TYPE__ unsigned short 2025-05-07T19:45:05.6146757Z #define __UINT_LEAST32_FMTX__ "X" 2025-05-07T19:45:05.6147088Z #define __UINT_LEAST32_FMTo__ "o" 2025-05-07T19:45:05.6147400Z #define __UINT_LEAST32_FMTu__ "u" 2025-05-07T19:45:05.6147730Z #define __UINT_LEAST32_FMTx__ "x" 2025-05-07T19:45:05.6148039Z #define __UINT_LEAST32_MAX__ 4294967295U 2025-05-07T19:45:05.6148411Z #define __UINT_LEAST32_TYPE__ unsigned int 2025-05-07T19:45:05.6148784Z #define __UINT_LEAST64_FMTX__ "lX" 2025-05-07T19:45:05.6149093Z #define __UINT_LEAST64_FMTo__ "lo" 2025-05-07T19:45:05.6149433Z #define __UINT_LEAST64_FMTu__ "lu" 2025-05-07T19:45:05.6149744Z #define __UINT_LEAST64_FMTx__ "lx" 2025-05-07T19:45:05.6150105Z #define __UINT_LEAST64_MAX__ 18446744073709551615UL 2025-05-07T19:45:05.6150493Z #define __UINT_LEAST64_TYPE__ long unsigned int 2025-05-07T19:45:05.6150870Z #define __UINT_LEAST8_FMTX__ "hhX" 2025-05-07T19:45:05.6151179Z #define __UINT_LEAST8_FMTo__ "hho" 2025-05-07T19:45:05.6151583Z 
#define __UINT_LEAST8_FMTu__ "hhu" 2025-05-07T19:45:05.6151917Z #define __UINT_LEAST8_FMTx__ "hhx" 2025-05-07T19:45:05.6152221Z #define __UINT_LEAST8_MAX__ 255 2025-05-07T19:45:05.6152559Z #define __UINT_LEAST8_TYPE__ unsigned char 2025-05-07T19:45:05.6152975Z #define __USER_LABEL_PREFIX__ 2025-05-07T19:45:05.6153690Z #define __VERSION__ "Clang 16.0.6 (https://github.com/conda-forge/clangdev-feedstock db6970f6bb85e49860ed8bab43ebf165b5c55cc4)" 2025-05-07T19:45:05.6154361Z #define __WCHAR_MAX__ 2147483647 2025-05-07T19:45:05.6154691Z #define __WCHAR_TYPE__ int 2025-05-07T19:45:05.6154974Z #define __WCHAR_WIDTH__ 32 2025-05-07T19:45:05.6155280Z #define __WINT_MAX__ 4294967295U 2025-05-07T19:45:05.6155609Z #define __WINT_TYPE__ unsigned int 2025-05-07T19:45:05.6155912Z #define __WINT_UNSIGNED__ 1 2025-05-07T19:45:05.6156224Z #define __WINT_WIDTH__ 32 2025-05-07T19:45:05.6156488Z #define __amd64 1 2025-05-07T19:45:05.6156750Z #define __amd64__ 1 2025-05-07T19:45:05.6156994Z #define __clang__ 1 2025-05-07T19:45:05.6157312Z #define __clang_literal_encoding__ "UTF-8" 2025-05-07T19:45:05.6157643Z #define __clang_major__ 16 2025-05-07T19:45:05.6157944Z #define __clang_minor__ 0 2025-05-07T19:45:05.6158223Z #define __clang_patchlevel__ 6 2025-05-07T19:45:05.6158881Z #define __clang_version__ "16.0.6 (https://github.com/conda-forge/clangdev-feedstock db6970f6bb85e49860ed8bab43ebf165b5c55cc4)" 2025-05-07T19:45:05.6159597Z #define __clang_wide_literal_encoding__ "UTF-32" 2025-05-07T19:45:05.6159957Z #define __code_model_small__ 1 2025-05-07T19:45:05.6160269Z #define __gnu_linux__ 1 2025-05-07T19:45:05.6160523Z #define __k8 1 2025-05-07T19:45:05.6160789Z #define __k8__ 1 2025-05-07T19:45:05.6161027Z #define __linux 1 2025-05-07T19:45:05.6161295Z #define __linux__ 1 2025-05-07T19:45:05.6161536Z #define __llvm__ 1 2025-05-07T19:45:05.6161803Z #define __pic__ 2 2025-05-07T19:45:05.6162043Z #define __pie__ 2 2025-05-07T19:45:05.6162374Z #define __seg_fs 
__attribute__((address_space(257))) 2025-05-07T19:45:05.6162819Z #define __seg_gs __attribute__((address_space(256))) 2025-05-07T19:45:05.6163175Z #define __tune_k8__ 1 2025-05-07T19:45:05.6163428Z #define __unix 1 2025-05-07T19:45:05.6175754Z #define __unix__ 1 2025-05-07T19:45:05.6176026Z #define __x86_64 1 2025-05-07T19:45:05.6176383Z #define __x86_64__ 1 2025-05-07T19:45:05.6176836Z #define linux 1 2025-05-07T19:45:05.6177074Z #define unix 1 2025-05-07T19:45:05.6177221Z 2025-05-07T19:45:05.6784033Z 2025-05-07T19:45:05.6784729Z [INFO] Printing out all preprocessor defines in the C++ compiler ... 2025-05-07T19:45:05.6785746Z + conda run -n build_binary c++ -dM -E -x c++ - 2025-05-07T19:45:05.6786042Z 2025-05-07T19:45:07.5106261Z #define _GNU_SOURCE 1 2025-05-07T19:45:07.5106724Z #define _LP64 1 2025-05-07T19:45:07.5107199Z #define __ATOMIC_ACQUIRE 2 2025-05-07T19:45:07.5107499Z #define __ATOMIC_ACQ_REL 4 2025-05-07T19:45:07.5107812Z #define __ATOMIC_CONSUME 1 2025-05-07T19:45:07.5108107Z #define __ATOMIC_RELAXED 0 2025-05-07T19:45:07.5108427Z #define __ATOMIC_RELEASE 3 2025-05-07T19:45:07.5108765Z #define __ATOMIC_SEQ_CST 5 2025-05-07T19:45:07.5109066Z #define __BIGGEST_ALIGNMENT__ 16 2025-05-07T19:45:07.5109410Z #define __BITINT_MAXWIDTH__ 8388608 2025-05-07T19:45:07.5109721Z #define __BOOL_WIDTH__ 8 2025-05-07T19:45:07.5110072Z #define __BYTE_ORDER__ __ORDER_LITTLE_ENDIAN__ 2025-05-07T19:45:07.5110440Z #define __CHAR16_TYPE__ unsigned short 2025-05-07T19:45:07.5110807Z #define __CHAR32_TYPE__ unsigned int 2025-05-07T19:45:07.5111122Z #define __CHAR_BIT__ 8 2025-05-07T19:45:07.5111432Z #define __CLANG_ATOMIC_BOOL_LOCK_FREE 2 2025-05-07T19:45:07.5111768Z #define __CLANG_ATOMIC_CHAR16_T_LOCK_FREE 2 2025-05-07T19:45:07.5112149Z #define __CLANG_ATOMIC_CHAR32_T_LOCK_FREE 2 2025-05-07T19:45:07.5112496Z #define __CLANG_ATOMIC_CHAR_LOCK_FREE 2 2025-05-07T19:45:07.5112992Z #define __CLANG_ATOMIC_INT_LOCK_FREE 2 2025-05-07T19:45:07.5113359Z #define 
__CLANG_ATOMIC_LLONG_LOCK_FREE 2 2025-05-07T19:45:07.5113747Z #define __CLANG_ATOMIC_LONG_LOCK_FREE 2 2025-05-07T19:45:07.5114437Z #define __CLANG_ATOMIC_POINTER_LOCK_FREE 2 2025-05-07T19:45:07.5114790Z #define __CLANG_ATOMIC_SHORT_LOCK_FREE 2 2025-05-07T19:45:07.5115164Z #define __CLANG_ATOMIC_WCHAR_T_LOCK_FREE 2 2025-05-07T19:45:07.5115511Z #define __CONSTANT_CFSTRINGS__ 1 2025-05-07T19:45:07.5115862Z #define __DBL_DECIMAL_DIG__ 17 2025-05-07T19:45:07.5116195Z #define __DBL_DENORM_MIN__ 4.9406564584124654e-324 2025-05-07T19:45:07.5116572Z #define __DBL_DIG__ 15 2025-05-07T19:45:07.5116860Z #define __DBL_EPSILON__ 2.2204460492503131e-16 2025-05-07T19:45:07.5117229Z #define __DBL_HAS_DENORM__ 1 2025-05-07T19:45:07.5117545Z #define __DBL_HAS_INFINITY__ 1 2025-05-07T19:45:07.5117835Z #define __DBL_HAS_QUIET_NAN__ 1 2025-05-07T19:45:07.5118151Z #define __DBL_MANT_DIG__ 53 2025-05-07T19:45:07.5118433Z #define __DBL_MAX_10_EXP__ 308 2025-05-07T19:45:07.5118747Z #define __DBL_MAX_EXP__ 1024 2025-05-07T19:45:07.5119046Z #define __DBL_MAX__ 1.7976931348623157e+308 2025-05-07T19:45:07.5119393Z #define __DBL_MIN_10_EXP__ (-307) 2025-05-07T19:45:07.5119806Z #define __DBL_MIN_EXP__ (-1021) 2025-05-07T19:45:07.5120109Z #define __DBL_MIN__ 2.2250738585072014e-308 2025-05-07T19:45:07.5120423Z #define __DECIMAL_DIG__ __LDBL_DECIMAL_DIG__ 2025-05-07T19:45:07.5120762Z #define __DEPRECATED 1 2025-05-07T19:45:07.5121029Z #define __ELF__ 1 2025-05-07T19:45:07.5121258Z #define __EXCEPTIONS 1 2025-05-07T19:45:07.5121538Z #define __FINITE_MATH_ONLY__ 0 2025-05-07T19:45:07.5121927Z #define __FLOAT128__ 1 2025-05-07T19:45:07.5122209Z #define __FLT16_DECIMAL_DIG__ 5 2025-05-07T19:45:07.5122521Z #define __FLT16_DENORM_MIN__ 5.9604644775390625e-8F16 2025-05-07T19:45:07.5122875Z #define __FLT16_DIG__ 3 2025-05-07T19:45:07.5123128Z #define __FLT16_EPSILON__ 9.765625e-4F16 2025-05-07T19:45:07.5123462Z #define __FLT16_HAS_DENORM__ 1 2025-05-07T19:45:07.5123768Z #define __FLT16_HAS_INFINITY__ 1 
2025-05-07T19:45:07.5124051Z #define __FLT16_HAS_QUIET_NAN__ 1 2025-05-07T19:45:07.5124353Z #define __FLT16_MANT_DIG__ 11 2025-05-07T19:45:07.5124617Z #define __FLT16_MAX_10_EXP__ 4 2025-05-07T19:45:07.5124912Z #define __FLT16_MAX_EXP__ 16 2025-05-07T19:45:07.5125178Z #define __FLT16_MAX__ 6.5504e+4F16 2025-05-07T19:45:07.5125487Z #define __FLT16_MIN_10_EXP__ (-4) 2025-05-07T19:45:07.5125767Z #define __FLT16_MIN_EXP__ (-13) 2025-05-07T19:45:07.5126068Z #define __FLT16_MIN__ 6.103515625e-5F16 2025-05-07T19:45:07.5126364Z #define __FLT_DECIMAL_DIG__ 9 2025-05-07T19:45:07.5126678Z #define __FLT_DENORM_MIN__ 1.40129846e-45F 2025-05-07T19:45:07.5127011Z #define __FLT_DIG__ 6 2025-05-07T19:45:07.5127407Z #define __FLT_EPSILON__ 1.19209290e-7F 2025-05-07T19:45:07.5127735Z #define __FLT_HAS_DENORM__ 1 2025-05-07T19:45:07.5128004Z #define __FLT_HAS_INFINITY__ 1 2025-05-07T19:45:07.5128312Z #define __FLT_HAS_QUIET_NAN__ 1 2025-05-07T19:45:07.5128586Z #define __FLT_MANT_DIG__ 24 2025-05-07T19:45:07.5128870Z #define __FLT_MAX_10_EXP__ 38 2025-05-07T19:45:07.5129141Z #define __FLT_MAX_EXP__ 128 2025-05-07T19:45:07.5129435Z #define __FLT_MAX__ 3.40282347e+38F 2025-05-07T19:45:07.5129721Z #define __FLT_MIN_10_EXP__ (-37) 2025-05-07T19:45:07.5130031Z #define __FLT_MIN_EXP__ (-125) 2025-05-07T19:45:07.5130334Z #define __FLT_MIN__ 1.17549435e-38F 2025-05-07T19:45:07.5130613Z #define __FLT_RADIX__ 2 2025-05-07T19:45:07.5130880Z #define __FXSR__ 1 2025-05-07T19:45:07.5131123Z #define __GCC_ASM_FLAG_OUTPUTS__ 1 2025-05-07T19:45:07.5131453Z #define __GCC_ATOMIC_BOOL_LOCK_FREE 2 2025-05-07T19:45:07.5131765Z #define __GCC_ATOMIC_CHAR16_T_LOCK_FREE 2 2025-05-07T19:45:07.5132111Z #define __GCC_ATOMIC_CHAR32_T_LOCK_FREE 2 2025-05-07T19:45:07.5132431Z #define __GCC_ATOMIC_CHAR_LOCK_FREE 2 2025-05-07T19:45:07.5132765Z #define __GCC_ATOMIC_INT_LOCK_FREE 2 2025-05-07T19:45:07.5133253Z #define __GCC_ATOMIC_LLONG_LOCK_FREE 2 2025-05-07T19:45:07.5133608Z #define __GCC_ATOMIC_LONG_LOCK_FREE 2 
2025-05-07T19:45:07.5134057Z #define __GCC_ATOMIC_POINTER_LOCK_FREE 2 2025-05-07T19:45:07.5134512Z #define __GCC_ATOMIC_SHORT_LOCK_FREE 2 2025-05-07T19:45:07.5134866Z #define __GCC_ATOMIC_TEST_AND_SET_TRUEVAL 1 2025-05-07T19:45:07.5137509Z #define __GCC_ATOMIC_WCHAR_T_LOCK_FREE 2 2025-05-07T19:45:07.5137971Z #define __GCC_HAVE_DWARF2_CFI_ASM 1 2025-05-07T19:45:07.5138290Z #define __GCC_HAVE_SYNC_COMPARE_AND_SWAP_1 1 2025-05-07T19:45:07.5138660Z #define __GCC_HAVE_SYNC_COMPARE_AND_SWAP_2 1 2025-05-07T19:45:07.5139000Z #define __GCC_HAVE_SYNC_COMPARE_AND_SWAP_4 1 2025-05-07T19:45:07.5139362Z #define __GCC_HAVE_SYNC_COMPARE_AND_SWAP_8 1 2025-05-07T19:45:07.5139906Z #define __GLIBCXX_BITSIZE_INT_N_0 128 2025-05-07T19:45:07.5140241Z #define __GLIBCXX_TYPE_INT_N_0 __int128 2025-05-07T19:45:07.5140592Z #define __GNUC_GNU_INLINE__ 1 2025-05-07T19:45:07.5140876Z #define __GNUC_MINOR__ 2 2025-05-07T19:45:07.5141345Z #define __GNUC_PATCHLEVEL__ 1 2025-05-07T19:45:07.5141624Z #define __GNUC__ 4 2025-05-07T19:45:07.5141884Z #define __GNUG__ 4 2025-05-07T19:45:07.5142138Z #define __GXX_ABI_VERSION 1002 2025-05-07T19:45:07.5142468Z #define __GXX_EXPERIMENTAL_CXX0X__ 1 2025-05-07T19:45:07.5142773Z #define __GXX_RTTI 1 2025-05-07T19:45:07.5143055Z #define __GXX_WEAK__ 1 2025-05-07T19:45:07.5143348Z #define __INT16_C_SUFFIX__ 2025-05-07T19:45:07.5143744Z #define __INT16_FMTd__ "hd" 2025-05-07T19:45:07.5144039Z #define __INT16_FMTi__ "hi" 2025-05-07T19:45:07.5144306Z #define __INT16_MAX__ 32767 2025-05-07T19:45:07.5144605Z #define __INT16_TYPE__ short 2025-05-07T19:45:07.5144882Z #define __INT32_C_SUFFIX__ 2025-05-07T19:45:07.5145178Z #define __INT32_FMTd__ "d" 2025-05-07T19:45:07.5145445Z #define __INT32_FMTi__ "i" 2025-05-07T19:45:07.5145744Z #define __INT32_MAX__ 2147483647 2025-05-07T19:45:07.5146031Z #define __INT32_TYPE__ int 2025-05-07T19:45:07.5146334Z #define __INT64_C_SUFFIX__ L 2025-05-07T19:45:07.5146635Z #define __INT64_FMTd__ "ld" 2025-05-07T19:45:07.5147023Z #define 
__INT64_FMTi__ "li" 2025-05-07T19:45:07.5147323Z #define __INT64_MAX__ 9223372036854775807L 2025-05-07T19:45:07.5147625Z #define __INT64_TYPE__ long int 2025-05-07T19:45:07.5147924Z #define __INT8_C_SUFFIX__ 2025-05-07T19:45:07.5148180Z #define __INT8_FMTd__ "hhd" 2025-05-07T19:45:07.5148467Z #define __INT8_FMTi__ "hhi" 2025-05-07T19:45:07.5148729Z #define __INT8_MAX__ 127 2025-05-07T19:45:07.5149014Z #define __INT8_TYPE__ signed char 2025-05-07T19:45:07.5149304Z #define __INTMAX_C_SUFFIX__ L 2025-05-07T19:45:07.5149611Z #define __INTMAX_FMTd__ "ld" 2025-05-07T19:45:07.5149884Z #define __INTMAX_FMTi__ "li" 2025-05-07T19:45:07.5150191Z #define __INTMAX_MAX__ 9223372036854775807L 2025-05-07T19:45:07.5150527Z #define __INTMAX_TYPE__ long int 2025-05-07T19:45:07.5150920Z #define __INTMAX_WIDTH__ 64 2025-05-07T19:45:07.5151226Z #define __INTPTR_FMTd__ "ld" 2025-05-07T19:45:07.5151499Z #define __INTPTR_FMTi__ "li" 2025-05-07T19:45:07.5151811Z #define __INTPTR_MAX__ 9223372036854775807L 2025-05-07T19:45:07.5152120Z #define __INTPTR_TYPE__ long int 2025-05-07T19:45:07.5152430Z #define __INTPTR_WIDTH__ 64 2025-05-07T19:45:07.5152698Z #define __INT_FAST16_FMTd__ "hd" 2025-05-07T19:45:07.5153128Z #define __INT_FAST16_FMTi__ "hi" 2025-05-07T19:45:07.5153610Z #define __INT_FAST16_MAX__ 32767 2025-05-07T19:45:07.5153946Z #define __INT_FAST16_TYPE__ short 2025-05-07T19:45:07.5154283Z #define __INT_FAST16_WIDTH__ 16 2025-05-07T19:45:07.5154579Z #define __INT_FAST32_FMTd__ "d" 2025-05-07T19:45:07.5154902Z #define __INT_FAST32_FMTi__ "i" 2025-05-07T19:45:07.5155196Z #define __INT_FAST32_MAX__ 2147483647 2025-05-07T19:45:07.5155542Z #define __INT_FAST32_TYPE__ int 2025-05-07T19:45:07.5155829Z #define __INT_FAST32_WIDTH__ 32 2025-05-07T19:45:07.5156146Z #define __INT_FAST64_FMTd__ "ld" 2025-05-07T19:45:07.5156444Z #define __INT_FAST64_FMTi__ "li" 2025-05-07T19:45:07.5156789Z #define __INT_FAST64_MAX__ 9223372036854775807L 2025-05-07T19:45:07.5157162Z #define __INT_FAST64_TYPE__ long 
int 2025-05-07T19:45:07.5157472Z #define __INT_FAST64_WIDTH__ 64 2025-05-07T19:45:07.5157787Z #define __INT_FAST8_FMTd__ "hhd" 2025-05-07T19:45:07.5158077Z #define __INT_FAST8_FMTi__ "hhi" 2025-05-07T19:45:07.5158401Z #define __INT_FAST8_MAX__ 127 2025-05-07T19:45:07.5158701Z #define __INT_FAST8_TYPE__ signed char 2025-05-07T19:45:07.5159128Z #define __INT_FAST8_WIDTH__ 8 2025-05-07T19:45:07.5159420Z #define __INT_LEAST16_FMTd__ "hd" 2025-05-07T19:45:07.5159753Z #define __INT_LEAST16_FMTi__ "hi" 2025-05-07T19:45:07.5160053Z #define __INT_LEAST16_MAX__ 32767 2025-05-07T19:45:07.5160388Z #define __INT_LEAST16_TYPE__ short 2025-05-07T19:45:07.5160731Z #define __INT_LEAST16_WIDTH__ 16 2025-05-07T19:45:07.5161033Z #define __INT_LEAST32_FMTd__ "d" 2025-05-07T19:45:07.5161366Z #define __INT_LEAST32_FMTi__ "i" 2025-05-07T19:45:07.5161677Z #define __INT_LEAST32_MAX__ 2147483647 2025-05-07T19:45:07.5162024Z #define __INT_LEAST32_TYPE__ int 2025-05-07T19:45:07.5162304Z #define __INT_LEAST32_WIDTH__ 32 2025-05-07T19:45:07.5162635Z #define __INT_LEAST64_FMTd__ "ld" 2025-05-07T19:45:07.5162953Z #define __INT_LEAST64_FMTi__ "li" 2025-05-07T19:45:07.5163308Z #define __INT_LEAST64_MAX__ 9223372036854775807L 2025-05-07T19:45:07.5163657Z #define __INT_LEAST64_TYPE__ long int 2025-05-07T19:45:07.5164003Z #define __INT_LEAST64_WIDTH__ 64 2025-05-07T19:45:07.5164335Z #define __INT_LEAST8_FMTd__ "hhd" 2025-05-07T19:45:07.5164639Z #define __INT_LEAST8_FMTi__ "hhi" 2025-05-07T19:45:07.5164967Z #define __INT_LEAST8_MAX__ 127 2025-05-07T19:45:07.5165271Z #define __INT_LEAST8_TYPE__ signed char 2025-05-07T19:45:07.5165745Z #define __INT_LEAST8_WIDTH__ 8 2025-05-07T19:45:07.5166017Z #define __INT_MAX__ 2147483647 2025-05-07T19:45:07.5166310Z #define __INT_WIDTH__ 32 2025-05-07T19:45:07.5166596Z #define __LDBL_DECIMAL_DIG__ 21 2025-05-07T19:45:07.5166928Z #define __LDBL_DENORM_MIN__ 3.64519953188247460253e-4951L 2025-05-07T19:45:07.5167281Z #define __LDBL_DIG__ 18 2025-05-07T19:45:07.5167585Z 
#define __LDBL_EPSILON__ 1.08420217248550443401e-19L 2025-05-07T19:45:07.5167914Z #define __LDBL_HAS_DENORM__ 1 2025-05-07T19:45:07.5168214Z #define __LDBL_HAS_INFINITY__ 1 2025-05-07T19:45:07.5168493Z #define __LDBL_HAS_QUIET_NAN__ 1 2025-05-07T19:45:07.5168796Z #define __LDBL_MANT_DIG__ 64 2025-05-07T19:45:07.5169069Z #define __LDBL_MAX_10_EXP__ 4932 2025-05-07T19:45:07.5169372Z #define __LDBL_MAX_EXP__ 16384 2025-05-07T19:45:07.5169677Z #define __LDBL_MAX__ 1.18973149535723176502e+4932L 2025-05-07T19:45:07.5170038Z #define __LDBL_MIN_10_EXP__ (-4931) 2025-05-07T19:45:07.5170354Z #define __LDBL_MIN_EXP__ (-16381) 2025-05-07T19:45:07.5170665Z #define __LDBL_MIN__ 3.36210314311209350626e-4932L 2025-05-07T19:45:07.5171023Z #define __LITTLE_ENDIAN__ 1 2025-05-07T19:45:07.5171297Z #define __LLONG_WIDTH__ 64 2025-05-07T19:45:07.5171609Z #define __LONG_LONG_MAX__ 9223372036854775807LL 2025-05-07T19:45:07.5172014Z #define __LONG_MAX__ 9223372036854775807L 2025-05-07T19:45:07.5172329Z #define __LONG_WIDTH__ 64 2025-05-07T19:45:07.5172582Z #define __LP64__ 1 2025-05-07T19:45:07.5172837Z #define __MMX__ 1 2025-05-07T19:45:07.5173066Z #define __NO_INLINE__ 1 2025-05-07T19:45:07.5173345Z #define __NO_MATH_INLINES 1 2025-05-07T19:45:07.5173638Z #define __OBJC_BOOL_IS_BOOL 0 2025-05-07T19:45:07.5173942Z #define __OPENCL_MEMORY_SCOPE_ALL_SVM_DEVICES 3 2025-05-07T19:45:07.5174313Z #define __OPENCL_MEMORY_SCOPE_DEVICE 2 2025-05-07T19:45:07.5174641Z #define __OPENCL_MEMORY_SCOPE_SUB_GROUP 4 2025-05-07T19:45:07.5174984Z #define __OPENCL_MEMORY_SCOPE_WORK_GROUP 1 2025-05-07T19:45:07.5175306Z #define __OPENCL_MEMORY_SCOPE_WORK_ITEM 0 2025-05-07T19:45:07.5175635Z #define __ORDER_BIG_ENDIAN__ 4321 2025-05-07T19:45:07.5175918Z #define __ORDER_LITTLE_ENDIAN__ 1234 2025-05-07T19:45:07.5176228Z #define __ORDER_PDP_ENDIAN__ 3412 2025-05-07T19:45:07.5176493Z #define __PIC__ 2 2025-05-07T19:45:07.5176728Z #define __PIE__ 2 2025-05-07T19:45:07.5176975Z #define __POINTER_WIDTH__ 64 
2025-05-07T19:45:07.5177253Z #define __PRAGMA_REDEFINE_EXTNAME 1 2025-05-07T19:45:07.5177570Z #define __PTRDIFF_FMTd__ "ld" 2025-05-07T19:45:07.5177844Z #define __PTRDIFF_FMTi__ "li" 2025-05-07T19:45:07.5178149Z #define __PTRDIFF_MAX__ 9223372036854775807L 2025-05-07T19:45:07.5178462Z #define __PTRDIFF_TYPE__ long int 2025-05-07T19:45:07.5178766Z #define __PTRDIFF_WIDTH__ 64 2025-05-07T19:45:07.5179041Z #define __REGISTER_PREFIX__ 2025-05-07T19:45:07.5179392Z #define __SCHAR_MAX__ 127 2025-05-07T19:45:07.5179642Z #define __SEG_FS 1 2025-05-07T19:45:07.5179887Z #define __SEG_GS 1 2025-05-07T19:45:07.5180137Z #define __SHRT_MAX__ 32767 2025-05-07T19:45:07.5180384Z #define __SHRT_WIDTH__ 16 2025-05-07T19:45:07.5180668Z #define __SIG_ATOMIC_MAX__ 2147483647 2025-05-07T19:45:07.5180962Z #define __SIG_ATOMIC_WIDTH__ 32 2025-05-07T19:45:07.5181263Z #define __SIZEOF_DOUBLE__ 8 2025-05-07T19:45:07.5181531Z #define __SIZEOF_FLOAT128__ 16 2025-05-07T19:45:07.5181823Z #define __SIZEOF_FLOAT__ 4 2025-05-07T19:45:07.5182080Z #define __SIZEOF_INT128__ 16 2025-05-07T19:45:07.5182368Z #define __SIZEOF_INT__ 4 2025-05-07T19:45:07.5182627Z #define __SIZEOF_LONG_DOUBLE__ 16 2025-05-07T19:45:07.5182930Z #define __SIZEOF_LONG_LONG__ 8 2025-05-07T19:45:07.5183220Z #define __SIZEOF_LONG__ 8 2025-05-07T19:45:07.5183472Z #define __SIZEOF_POINTER__ 8 2025-05-07T19:45:07.5183765Z #define __SIZEOF_PTRDIFF_T__ 8 2025-05-07T19:45:07.5184031Z #define __SIZEOF_SHORT__ 2 2025-05-07T19:45:07.5184311Z #define __SIZEOF_SIZE_T__ 8 2025-05-07T19:45:07.5184566Z #define __SIZEOF_WCHAR_T__ 4 2025-05-07T19:45:07.5184857Z #define __SIZEOF_WINT_T__ 4 2025-05-07T19:45:07.5185115Z #define __SIZE_FMTX__ "lX" 2025-05-07T19:45:07.5185390Z #define __SIZE_FMTo__ "lo" 2025-05-07T19:45:07.5185644Z #define __SIZE_FMTu__ "lu" 2025-05-07T19:45:07.5185914Z #define __SIZE_FMTx__ "lx" 2025-05-07T19:45:07.5186176Z #define __SIZE_MAX__ 18446744073709551615UL 2025-05-07T19:45:07.5186511Z #define __SIZE_TYPE__ long unsigned int 
2025-05-07T19:45:07.5186824Z #define __SIZE_WIDTH__ 64 2025-05-07T19:45:07.5187070Z #define __SSE2_MATH__ 1 2025-05-07T19:45:07.5187332Z #define __SSE2__ 1 2025-05-07T19:45:07.5187553Z #define __SSE_MATH__ 1 2025-05-07T19:45:07.5187818Z #define __SSE__ 1 2025-05-07T19:45:07.5188072Z #define __STDCPP_DEFAULT_NEW_ALIGNMENT__ 16UL 2025-05-07T19:45:07.5188412Z #define __STDCPP_THREADS__ 1 2025-05-07T19:45:07.5188674Z #define __STDC_HOSTED__ 1 2025-05-07T19:45:07.5188935Z #define __STDC_UTF_16__ 1 2025-05-07T19:45:07.5189186Z #define __STDC_UTF_32__ 1 2025-05-07T19:45:07.5189450Z #define __STDC__ 1 2025-05-07T19:45:07.5189694Z #define __UINT16_C_SUFFIX__ 2025-05-07T19:45:07.5189950Z #define __UINT16_FMTX__ "hX" 2025-05-07T19:45:07.5190229Z #define __UINT16_FMTo__ "ho" 2025-05-07T19:45:07.5190486Z #define __UINT16_FMTu__ "hu" 2025-05-07T19:45:07.5190762Z #define __UINT16_FMTx__ "hx" 2025-05-07T19:45:07.5191025Z #define __UINT16_MAX__ 65535 2025-05-07T19:45:07.5191397Z #define __UINT16_TYPE__ unsigned short 2025-05-07T19:45:07.5191692Z #define __UINT32_C_SUFFIX__ U 2025-05-07T19:45:07.5191980Z #define __UINT32_FMTX__ "X" 2025-05-07T19:45:07.5192240Z #define __UINT32_FMTo__ "o" 2025-05-07T19:45:07.5192511Z #define __UINT32_FMTu__ "u" 2025-05-07T19:45:07.5192895Z #define __UINT32_FMTx__ "x" 2025-05-07T19:45:07.5193347Z #define __UINT32_MAX__ 4294967295U 2025-05-07T19:45:07.5193679Z #define __UINT32_TYPE__ unsigned int 2025-05-07T19:45:07.5193990Z #define __UINT64_C_SUFFIX__ UL 2025-05-07T19:45:07.5194299Z #define __UINT64_FMTX__ "lX" 2025-05-07T19:45:07.5194584Z #define __UINT64_FMTo__ "lo" 2025-05-07T19:45:07.5194879Z #define __UINT64_FMTu__ "lu" 2025-05-07T19:45:07.5195160Z #define __UINT64_FMTx__ "lx" 2025-05-07T19:45:07.5195478Z #define __UINT64_MAX__ 18446744073709551615UL 2025-05-07T19:45:07.5195819Z #define __UINT64_TYPE__ long unsigned int 2025-05-07T19:45:07.5196163Z #define __UINT8_C_SUFFIX__ 2025-05-07T19:45:07.5196460Z #define __UINT8_FMTX__ "hhX" 
2025-05-07T19:45:07.5196743Z #define __UINT8_FMTo__ "hho" 2025-05-07T19:45:07.5197052Z #define __UINT8_FMTu__ "hhu" 2025-05-07T19:45:07.5197329Z #define __UINT8_FMTx__ "hhx" 2025-05-07T19:45:07.5197630Z #define __UINT8_MAX__ 255 2025-05-07T19:45:07.5197915Z #define __UINT8_TYPE__ unsigned char 2025-05-07T19:45:07.5198257Z #define __UINTMAX_C_SUFFIX__ UL 2025-05-07T19:45:07.5198552Z #define __UINTMAX_FMTX__ "lX" 2025-05-07T19:45:07.5198866Z #define __UINTMAX_FMTo__ "lo" 2025-05-07T19:45:07.5199155Z #define __UINTMAX_FMTu__ "lu" 2025-05-07T19:45:07.5199469Z #define __UINTMAX_FMTx__ "lx" 2025-05-07T19:45:07.5199908Z #define __UINTMAX_MAX__ 18446744073709551615UL 2025-05-07T19:45:07.5200267Z #define __UINTMAX_TYPE__ long unsigned int 2025-05-07T19:45:07.5200625Z #define __UINTMAX_WIDTH__ 64 2025-05-07T19:45:07.5200918Z #define __UINTPTR_FMTX__ "lX" 2025-05-07T19:45:07.5201235Z #define __UINTPTR_FMTo__ "lo" 2025-05-07T19:45:07.5201527Z #define __UINTPTR_FMTu__ "lu" 2025-05-07T19:45:07.5201837Z #define __UINTPTR_FMTx__ "lx" 2025-05-07T19:45:07.5202386Z #define __UINTPTR_MAX__ 18446744073709551615UL 2025-05-07T19:45:07.5202770Z #define __UINTPTR_TYPE__ long unsigned int 2025-05-07T19:45:07.5203107Z #define __UINTPTR_WIDTH__ 64 2025-05-07T19:45:07.5203421Z #define __UINT_FAST16_FMTX__ "hX" 2025-05-07T19:45:07.5203744Z #define __UINT_FAST16_FMTo__ "ho" 2025-05-07T19:45:07.5204051Z #define __UINT_FAST16_FMTu__ "hu" 2025-05-07T19:45:07.5204375Z #define __UINT_FAST16_FMTx__ "hx" 2025-05-07T19:45:07.5204679Z #define __UINT_FAST16_MAX__ 65535 2025-05-07T19:45:07.5205029Z #define __UINT_FAST16_TYPE__ unsigned short 2025-05-07T19:45:07.5205357Z #define __UINT_FAST32_FMTX__ "X" 2025-05-07T19:45:07.5205668Z #define __UINT_FAST32_FMTo__ "o" 2025-05-07T19:45:07.5205962Z #define __UINT_FAST32_FMTu__ "u" 2025-05-07T19:45:07.5206260Z #define __UINT_FAST32_FMTx__ "x" 2025-05-07T19:45:07.5206556Z #define __UINT_FAST32_MAX__ 4294967295U 2025-05-07T19:45:07.5206909Z #define 
__UINT_FAST32_TYPE__ unsigned int 2025-05-07T19:45:07.5207252Z #define __UINT_FAST64_FMTX__ "lX" 2025-05-07T19:45:07.5207547Z #define __UINT_FAST64_FMTo__ "lo" 2025-05-07T19:45:07.5207860Z #define __UINT_FAST64_FMTu__ "lu" 2025-05-07T19:45:07.5208156Z #define __UINT_FAST64_FMTx__ "lx" 2025-05-07T19:45:07.5208506Z #define __UINT_FAST64_MAX__ 18446744073709551615UL 2025-05-07T19:45:07.5208881Z #define __UINT_FAST64_TYPE__ long unsigned int 2025-05-07T19:45:07.5209243Z #define __UINT_FAST8_FMTX__ "hhX" 2025-05-07T19:45:07.5209536Z #define __UINT_FAST8_FMTo__ "hho" 2025-05-07T19:45:07.5209855Z #define __UINT_FAST8_FMTu__ "hhu" 2025-05-07T19:45:07.5210172Z #define __UINT_FAST8_FMTx__ "hhx" 2025-05-07T19:45:07.5210465Z #define __UINT_FAST8_MAX__ 255 2025-05-07T19:45:07.5210788Z #define __UINT_FAST8_TYPE__ unsigned char 2025-05-07T19:45:07.5211121Z #define __UINT_LEAST16_FMTX__ "hX" 2025-05-07T19:45:07.5211441Z #define __UINT_LEAST16_FMTo__ "ho" 2025-05-07T19:45:07.5211744Z #define __UINT_LEAST16_FMTu__ "hu" 2025-05-07T19:45:07.5212063Z #define __UINT_LEAST16_FMTx__ "hx" 2025-05-07T19:45:07.5212529Z #define __UINT_LEAST16_MAX__ 65535 2025-05-07T19:45:07.5212877Z #define __UINT_LEAST16_TYPE__ unsigned short 2025-05-07T19:45:07.5213225Z #define __UINT_LEAST32_FMTX__ "X" 2025-05-07T19:45:07.5213556Z #define __UINT_LEAST32_FMTo__ "o" 2025-05-07T19:45:07.5213882Z #define __UINT_LEAST32_FMTu__ "u" 2025-05-07T19:45:07.5214424Z #define __UINT_LEAST32_FMTx__ "x" 2025-05-07T19:45:07.5214736Z #define __UINT_LEAST32_MAX__ 4294967295U 2025-05-07T19:45:07.5215049Z #define __UINT_LEAST32_TYPE__ unsigned int 2025-05-07T19:45:07.5215373Z #define __UINT_LEAST64_FMTX__ "lX" 2025-05-07T19:45:07.5215662Z #define __UINT_LEAST64_FMTo__ "lo" 2025-05-07T19:45:07.5215963Z #define __UINT_LEAST64_FMTu__ "lu" 2025-05-07T19:45:07.5216247Z #define __UINT_LEAST64_FMTx__ "lx" 2025-05-07T19:45:07.5216564Z #define __UINT_LEAST64_MAX__ 18446744073709551615UL 2025-05-07T19:45:07.5216934Z #define 
__UINT_LEAST64_TYPE__ long unsigned int 2025-05-07T19:45:07.5217254Z #define __UINT_LEAST8_FMTX__ "hhX" 2025-05-07T19:45:07.5217559Z #define __UINT_LEAST8_FMTo__ "hho" 2025-05-07T19:45:07.5217843Z #define __UINT_LEAST8_FMTu__ "hhu" 2025-05-07T19:45:07.5218148Z #define __UINT_LEAST8_FMTx__ "hhx" 2025-05-07T19:45:07.5218426Z #define __UINT_LEAST8_MAX__ 255 2025-05-07T19:45:07.5218725Z #define __UINT_LEAST8_TYPE__ unsigned char 2025-05-07T19:45:07.5219035Z #define __USER_LABEL_PREFIX__ 2025-05-07T19:45:07.5219654Z #define __VERSION__ "Clang 16.0.6 (https://github.com/conda-forge/clangdev-feedstock db6970f6bb85e49860ed8bab43ebf165b5c55cc4)" 2025-05-07T19:45:07.5220283Z #define __WCHAR_MAX__ 2147483647 2025-05-07T19:45:07.5220631Z #define __WCHAR_TYPE__ int 2025-05-07T19:45:07.5220907Z #define __WCHAR_WIDTH__ 32 2025-05-07T19:45:07.5221159Z #define __WINT_MAX__ 4294967295U 2025-05-07T19:45:07.5221456Z #define __WINT_TYPE__ unsigned int 2025-05-07T19:45:07.5221733Z #define __WINT_UNSIGNED__ 1 2025-05-07T19:45:07.5222014Z #define __WINT_WIDTH__ 32 2025-05-07T19:45:07.5222251Z #define __amd64 1 2025-05-07T19:45:07.5222495Z #define __amd64__ 1 2025-05-07T19:45:07.5222718Z #define __clang__ 1 2025-05-07T19:45:07.5222990Z #define __clang_literal_encoding__ "UTF-8" 2025-05-07T19:45:07.5223291Z #define __clang_major__ 16 2025-05-07T19:45:07.5223563Z #define __clang_minor__ 0 2025-05-07T19:45:07.5223836Z #define __clang_patchlevel__ 6 2025-05-07T19:45:07.5224416Z #define __clang_version__ "16.0.6 (https://github.com/conda-forge/clangdev-feedstock db6970f6bb85e49860ed8bab43ebf165b5c55cc4)" 2025-05-07T19:45:07.5225076Z #define __clang_wide_literal_encoding__ "UTF-32" 2025-05-07T19:45:07.5225397Z #define __code_model_small__ 1 2025-05-07T19:45:07.5225685Z #define __cplusplus 201703L 2025-05-07T19:45:07.5225963Z #define __cpp_aggregate_bases 201603L 2025-05-07T19:45:07.5226304Z #define __cpp_aggregate_nsdmi 201304L 2025-05-07T19:45:07.5226609Z #define __cpp_alias_templates 200704L 
2025-05-07T19:45:07.5226922Z #define __cpp_aligned_new 201606L 2025-05-07T19:45:07.5227229Z #define __cpp_attributes 200809L 2025-05-07T19:45:07.5227510Z #define __cpp_binary_literals 201304L 2025-05-07T19:45:07.5227838Z #define __cpp_capture_star_this 201603L 2025-05-07T19:45:07.5228137Z #define __cpp_constexpr 201603L 2025-05-07T19:45:07.5228463Z #define __cpp_constexpr_in_decltype 201711L 2025-05-07T19:45:07.5228773Z #define __cpp_decltype 200707L 2025-05-07T19:45:07.5229097Z #define __cpp_decltype_auto 201304L 2025-05-07T19:45:07.5229408Z #define __cpp_deduction_guides 201703L 2025-05-07T19:45:07.5229766Z #define __cpp_delegating_constructors 200604L 2025-05-07T19:45:07.5230127Z #define __cpp_digit_separators 201309L 2025-05-07T19:45:07.5230447Z #define __cpp_enumerator_attributes 201411L 2025-05-07T19:45:07.5230796Z #define __cpp_exceptions 199711L 2025-05-07T19:45:07.5231096Z #define __cpp_fold_expressions 201603L 2025-05-07T19:45:07.5231448Z #define __cpp_generic_lambdas 201304L 2025-05-07T19:45:07.5231784Z #define __cpp_guaranteed_copy_elision 201606L 2025-05-07T19:45:07.5232142Z #define __cpp_hex_float 201603L 2025-05-07T19:45:07.5232436Z #define __cpp_if_constexpr 201606L 2025-05-07T19:45:07.5232966Z #define __cpp_impl_destroying_delete 201806L 2025-05-07T19:45:07.5233511Z #define __cpp_inheriting_constructors 201511L 2025-05-07T19:45:07.5233904Z #define __cpp_init_captures 201304L 2025-05-07T19:45:07.5234475Z #define __cpp_initializer_lists 200806L 2025-05-07T19:45:07.5234815Z #define __cpp_inline_variables 201606L 2025-05-07T19:45:07.5235170Z #define __cpp_lambdas 200907L 2025-05-07T19:45:07.5235497Z #define __cpp_named_character_escapes 202207L 2025-05-07T19:45:07.5235906Z #define __cpp_namespace_attributes 201411L 2025-05-07T19:45:07.5236302Z #define __cpp_nested_namespace_definitions 201411L 2025-05-07T19:45:07.5236731Z #define __cpp_noexcept_function_type 201510L 2025-05-07T19:45:07.5237097Z #define __cpp_nontype_template_args 201411L 
2025-05-07T19:45:07.5237517Z #define __cpp_nontype_template_parameter_auto 201606L 2025-05-07T19:45:07.5237928Z #define __cpp_nsdmi 200809L 2025-05-07T19:45:07.5238230Z #define __cpp_range_based_for 201603L 2025-05-07T19:45:07.5238577Z #define __cpp_raw_strings 200710L 2025-05-07T19:45:07.5238881Z #define __cpp_ref_qualifiers 200710L 2025-05-07T19:45:07.5239235Z #define __cpp_return_type_deduction 201304L 2025-05-07T19:45:07.5239567Z #define __cpp_rtti 199711L 2025-05-07T19:45:07.5239881Z #define __cpp_rvalue_references 200610L 2025-05-07T19:45:07.5240199Z #define __cpp_static_assert 201411L 2025-05-07T19:45:07.5240552Z #define __cpp_static_call_operator 202207L 2025-05-07T19:45:07.5240918Z #define __cpp_structured_bindings 201606L 2025-05-07T19:45:07.5241241Z #define __cpp_template_auto 201606L 2025-05-07T19:45:07.5241660Z #define __cpp_threadsafe_static_init 200806L 2025-05-07T19:45:07.5241998Z #define __cpp_unicode_characters 200704L 2025-05-07T19:45:07.5242345Z #define __cpp_unicode_literals 200710L 2025-05-07T19:45:07.5242677Z #define __cpp_user_defined_literals 200809L 2025-05-07T19:45:07.5243039Z #define __cpp_variable_templates 201304L 2025-05-07T19:45:07.5243378Z #define __cpp_variadic_templates 200704L 2025-05-07T19:45:07.5243721Z #define __cpp_variadic_using 201611L 2025-05-07T19:45:07.5244017Z #define __gnu_linux__ 1 2025-05-07T19:45:07.5244298Z #define __k8 1 2025-05-07T19:45:07.5244538Z #define __k8__ 1 2025-05-07T19:45:07.5244759Z #define __linux 1 2025-05-07T19:45:07.5245001Z #define __linux__ 1 2025-05-07T19:45:07.5245233Z #define __llvm__ 1 2025-05-07T19:45:07.5245480Z #define __pic__ 2 2025-05-07T19:45:07.5245809Z #define __pie__ 2 2025-05-07T19:45:07.5246042Z #define __private_extern__ extern 2025-05-07T19:45:07.5246361Z #define __seg_fs __attribute__((address_space(257))) 2025-05-07T19:45:07.5246759Z #define __seg_gs __attribute__((address_space(256))) 2025-05-07T19:45:07.5247075Z #define __tune_k8__ 1 2025-05-07T19:45:07.5247331Z #define 
__unix 1 2025-05-07T19:45:07.5247569Z #define __unix__ 1 2025-05-07T19:45:07.5247786Z #define __x86_64 1 2025-05-07T19:45:07.5248029Z #define __x86_64__ 1 2025-05-07T19:45:07.5248244Z #define linux 1 2025-05-07T19:45:07.5248470Z #define unix 1 2025-05-07T19:45:07.5248592Z 2025-05-07T19:45:07.5687643Z 2025-05-07T19:45:07.5688081Z + conda run -n build_binary c++ --version 2025-05-07T19:45:07.5688336Z 2025-05-07T19:45:09.3756462Z clang version 16.0.6 (https://github.com/conda-forge/clangdev-feedstock db6970f6bb85e49860ed8bab43ebf165b5c55cc4) 2025-05-07T19:45:09.3757137Z Target: x86_64-conda-linux-gnu 2025-05-07T19:45:09.3757418Z Thread model: posix 2025-05-07T19:45:09.3757780Z InstalledDir: /github/home/miniconda/envs/build_binary/bin 2025-05-07T19:45:09.3758442Z Configuration file: /github/home/miniconda/envs/build_binary/bin/x86_64-conda-linux-gnu-clang++.cfg 2025-05-07T19:45:09.3758955Z 2025-05-07T19:45:09.4483275Z 2025-05-07T19:45:09.4484024Z [INFO] Printing the default version of the C standard used by the compiler ... 2025-05-07T19:45:09.4484734Z + conda run -n build_binary cc -dM -E - < /dev/null | grep __STDC_VERSION__ 2025-05-07T19:45:09.4485069Z 2025-05-07T19:45:11.3309739Z #define __STDC_VERSION__ 201710L 2025-05-07T19:45:11.3314176Z 2025-05-07T19:45:11.3315007Z [INFO] Printing the default version of the C++ standard used by the compiler ... 2025-05-07T19:45:11.3316263Z + conda run -n build_binary c++ -dM -E -x c++ - < /dev/null | grep __cplusplus 2025-05-07T19:45:11.3316615Z 2025-05-07T19:45:13.1934255Z #define __cplusplus 201703L 2025-05-07T19:45:13.1936461Z 2025-05-07T19:45:13.1936664Z [INSTALL] Successfully installed C/C++ compilers 2025-05-07T19:45:13.2019583Z ##[group]Run . $PRELUDE; install_build_tools $BUILD_ENV 2025-05-07T19:45:13.2020094Z . 
$PRELUDE; install_build_tools $BUILD_ENV 2025-05-07T19:45:13.2020685Z shell: bash --noprofile --norc -e -o pipefail {0} 2025-05-07T19:45:13.2021035Z env: 2025-05-07T19:45:13.2021288Z PRELUDE: .github/scripts/setup_env.bash 2025-05-07T19:45:13.2021602Z BUILD_ENV: build_binary 2025-05-07T19:45:13.2021874Z BUILD_TARGET: default 2025-05-07T19:45:13.2022114Z BUILD_VARIANT: cuda 2025-05-07T19:45:13.2022373Z BUILD_CUDA_VERSION: 11.8.0 2025-05-07T19:45:13.2022630Z ##[endgroup] 2025-05-07T19:45:13.5993776Z ################################################################################ 2025-05-07T19:45:13.5994876Z # Install Build Tools 2025-05-07T19:45:13.5995673Z # 2025-05-07T19:45:13.6005866Z # [2025-05-07T19:45:13.600Z] + install_build_tools build_binary 2025-05-07T19:45:13.6006300Z ################################################################################ 2025-05-07T19:45:13.6006633Z 2025-05-07T19:45:13.6024245Z [EXEC] [ATTEMPT 0/3] + wget -q --timeout 1 pypi.org -O /dev/null 2025-05-07T19:45:13.6860028Z [CHECK] Network does not appear to be blocked. 2025-05-07T19:45:13.6862592Z [INSTALL] Installing build tools ... 
2025-05-07T19:45:13.6887716Z [EXEC] [ATTEMPT 0/3] + conda install -n build_binary -c conda-forge --override-channels -y auditwheel bazel cmake>=3.30 hypothesis jinja2 make ncurses ninja openblas patchelf rhash scikit-build wheel pyyaml 2025-05-07T19:45:14.4115581Z Channels: 2025-05-07T19:45:14.4116278Z - conda-forge 2025-05-07T19:45:14.4116926Z Platform: linux-64 2025-05-07T19:45:17.5407594Z Collecting package metadata (repodata.json): - \ | / done 2025-05-07T19:45:21.1897287Z Solving environment: \ | / - done 2025-05-07T19:45:21.2481889Z 2025-05-07T19:45:21.2482900Z ## Package Plan ## 2025-05-07T19:45:21.2483229Z 2025-05-07T19:45:21.2483583Z environment location: /github/home/miniconda/envs/build_binary 2025-05-07T19:45:21.2484112Z 2025-05-07T19:45:21.2484284Z added / updated specs: 2025-05-07T19:45:21.2484616Z - auditwheel 2025-05-07T19:45:21.2485009Z - bazel 2025-05-07T19:45:21.2485431Z - cmake[version='>=3.30'] 2025-05-07T19:45:21.2485800Z - hypothesis 2025-05-07T19:45:21.2486075Z - jinja2 2025-05-07T19:45:21.2486314Z - make 2025-05-07T19:45:21.2486569Z - ncurses 2025-05-07T19:45:21.2486767Z - ninja 2025-05-07T19:45:21.2486972Z - openblas 2025-05-07T19:45:21.2487177Z - patchelf 2025-05-07T19:45:21.2487399Z - pyyaml 2025-05-07T19:45:21.2487634Z - rhash 2025-05-07T19:45:21.2487897Z - scikit-build 2025-05-07T19:45:21.2488175Z - wheel 2025-05-07T19:45:21.2488330Z 2025-05-07T19:45:21.2488335Z 2025-05-07T19:45:21.2488479Z The following packages will be downloaded: 2025-05-07T19:45:21.2488724Z 2025-05-07T19:45:21.2488894Z package | build 2025-05-07T19:45:21.2489257Z ---------------------------|----------------- 2025-05-07T19:45:21.2489682Z alsa-lib-1.2.14 | hb9d3cd8_0 553 KB conda-forge 2025-05-07T19:45:21.2490128Z attrs-25.3.0 | pyh71513ae_0 56 KB conda-forge 2025-05-07T19:45:21.2490586Z auditwheel-6.2.0 | pyha804496_1 40 KB conda-forge 2025-05-07T19:45:21.2491066Z bazel-7.5.0 | h96810dc_2 47.4 MB conda-forge 2025-05-07T19:45:21.2491485Z c-ares-1.34.5 | hb9d3cd8_0 
202 KB conda-forge 2025-05-07T19:45:21.2491915Z cairo-1.18.0 | hbb29018_2 961 KB conda-forge 2025-05-07T19:45:21.2492325Z click-8.1.8 | pyh707e725_0 83 KB conda-forge 2025-05-07T19:45:21.2493048Z cmake-4.0.2 | h74e3db0_0 19.4 MB conda-forge 2025-05-07T19:45:21.2493516Z distro-1.9.0 | pyhd8ed1ab_1 41 KB conda-forge 2025-05-07T19:45:21.2494170Z exceptiongroup-1.2.2 | pyhd8ed1ab_1 20 KB conda-forge 2025-05-07T19:45:21.2494667Z expat-2.7.0 | h5888daf_0 137 KB conda-forge 2025-05-07T19:45:21.2495179Z font-ttf-dejavu-sans-mono-2.37| hab24e00_0 388 KB conda-forge 2025-05-07T19:45:21.2495793Z font-ttf-inconsolata-3.000 | h77eed37_0 94 KB conda-forge 2025-05-07T19:45:21.2496364Z font-ttf-source-code-pro-2.038| h77eed37_0 684 KB conda-forge 2025-05-07T19:45:21.2497022Z font-ttf-ubuntu-0.83 | h77eed37_3 1.5 MB conda-forge 2025-05-07T19:45:21.2497522Z fontconfig-2.15.0 | h7e30c49_1 259 KB conda-forge 2025-05-07T19:45:21.2498279Z fonts-conda-ecosystem-1 | 0 4 KB conda-forge 2025-05-07T19:45:21.2498739Z fonts-conda-forge-1 | 0 4 KB conda-forge 2025-05-07T19:45:21.2499181Z freetype-2.13.3 | ha770c72_1 168 KB conda-forge 2025-05-07T19:45:21.2499596Z giflib-5.2.2 | hd590300_0 75 KB conda-forge 2025-05-07T19:45:21.2500012Z graphite2-1.3.13 | h59595ed_1003 95 KB conda-forge 2025-05-07T19:45:21.2500433Z harfbuzz-9.0.0 | hfac3d4d_0 1.5 MB conda-forge 2025-05-07T19:45:21.2500858Z hypothesis-6.131.14 | pyha770c72_0 348 KB conda-forge 2025-05-07T19:45:21.2501302Z ijar-7.5.0 | h5888daf_0 114 KB conda-forge 2025-05-07T19:45:21.2501685Z jinja2-3.1.6 | pyhd8ed1ab_0 110 KB conda-forge 2025-05-07T19:45:21.2502463Z keyutils-1.6.1 | h166bdaf_0 115 KB conda-forge 2025-05-07T19:45:21.2503058Z krb5-1.21.3 | h659f571_0 1.3 MB conda-forge 2025-05-07T19:45:21.2503483Z lcms2-2.17 | h717163a_0 242 KB conda-forge 2025-05-07T19:45:21.2503945Z lerc-4.0.0 | h0aef613_1 258 KB conda-forge 2025-05-07T19:45:21.2504431Z libabseil-20250127.1 | cxx17_hbbce691_0 1.3 MB conda-forge 2025-05-07T19:45:21.2504917Z 
libcups-2.3.3 | h4637d8d_4 4.3 MB conda-forge 2025-05-07T19:45:21.2505354Z libcurl-8.13.0 | h332b0f4_0 428 KB conda-forge 2025-05-07T19:45:21.2505834Z libdeflate-1.23 | h86f0d12_0 71 KB conda-forge 2025-05-07T19:45:21.2506345Z libedit-3.1.20250104 | pl5321h7949ede_0 132 KB conda-forge 2025-05-07T19:45:21.2506802Z libev-4.33 | hd590300_2 110 KB conda-forge 2025-05-07T19:45:21.2507237Z libexpat-2.7.0 | h5888daf_0 73 KB conda-forge 2025-05-07T19:45:21.2507682Z libfreetype-2.13.3 | ha770c72_1 8 KB conda-forge 2025-05-07T19:45:21.2508170Z libfreetype6-2.13.3 | h48d6fc4_1 371 KB conda-forge 2025-05-07T19:45:21.2508657Z libgfortran-15.1.0 | h69a702a_2 34 KB conda-forge 2025-05-07T19:45:21.2509165Z libgfortran5-15.1.0 | hcea5267_2 1.5 MB conda-forge 2025-05-07T19:45:21.2509642Z libglib-2.84.0 | h2ff4ddf_0 3.8 MB conda-forge 2025-05-07T19:45:21.2510078Z libgrpc-1.71.0 | h8e591d7_1 7.6 MB conda-forge 2025-05-07T19:45:21.2510563Z libjpeg-turbo-3.1.0 | hb9d3cd8_0 614 KB conda-forge 2025-05-07T19:45:21.2511038Z liblzma-5.8.1 | hb9d3cd8_1 110 KB conda-forge 2025-05-07T19:45:21.2511534Z liblzma-devel-5.8.1 | hb9d3cd8_1 431 KB conda-forge 2025-05-07T19:45:21.2512145Z libnghttp2-1.64.0 | h161d5f1_0 632 KB conda-forge 2025-05-07T19:45:21.2512645Z libopenblas-0.3.29 |pthreads_h94d23a6_0 5.6 MB conda-forge 2025-05-07T19:45:21.2513202Z libpng-1.6.47 | h943b412_0 282 KB conda-forge 2025-05-07T19:45:21.2513767Z libprotobuf-5.29.3 | h501fc15_1 3.2 MB conda-forge 2025-05-07T19:45:21.2514244Z libre2-11-2024.07.02 | hba17884_3 205 KB conda-forge 2025-05-07T19:45:21.2514702Z libsqlite-3.49.2 | hee588c1_0 895 KB conda-forge 2025-05-07T19:45:21.2515192Z libssh2-1.11.1 | hcf80075_0 298 KB conda-forge 2025-05-07T19:45:21.2515672Z libtiff-4.7.0 | hd9ff511_4 419 KB conda-forge 2025-05-07T19:45:21.2516120Z libuuid-2.38.1 | h0b41bf4_0 33 KB conda-forge 2025-05-07T19:45:21.2516559Z libuv-1.50.0 | hb9d3cd8_0 870 KB conda-forge 2025-05-07T19:45:21.2517008Z libwebp-base-1.5.0 | h851e524_0 420 KB 
conda-forge 2025-05-07T19:45:21.2517474Z libxcb-1.17.0 | h8a09558_0 387 KB conda-forge 2025-05-07T19:45:21.2517895Z libzlib-1.3.1 | hb9d3cd8_2 60 KB conda-forge 2025-05-07T19:45:21.2518352Z make-4.4.1 | hb9d3cd8_2 501 KB conda-forge 2025-05-07T19:45:21.2518839Z markupsafe-3.0.2 | py313h8060acc_1 24 KB conda-forge 2025-05-07T19:45:21.2519307Z ncurses-6.5 | h2d0b736_3 871 KB conda-forge 2025-05-07T19:45:21.2519750Z ninja-1.12.1 | hff21bea_1 158 KB conda-forge 2025-05-07T19:45:21.2520214Z openblas-0.3.29 |pthreads_h6ec200e_0 5.8 MB conda-forge 2025-05-07T19:45:21.2520687Z openjdk-23.0.1 | h4c11d01_0 181.3 MB conda-forge 2025-05-07T19:45:21.2521172Z packaging-25.0 | pyh29332c3_1 61 KB conda-forge 2025-05-07T19:45:21.2521631Z patchelf-0.18.0 | h3f2d84a_2 133 KB conda-forge 2025-05-07T19:45:21.2522107Z pcre2-10.44 | hc749103_2 934 KB conda-forge 2025-05-07T19:45:21.2522553Z pixman-0.46.0 | h29eaf8c_0 389 KB conda-forge 2025-05-07T19:45:21.2523063Z pthread-stubs-0.4 | hb9d3cd8_1002 8 KB conda-forge 2025-05-07T19:45:21.2523551Z pyelftools-0.32 | pyh707e725_1 146 KB conda-forge 2025-05-07T19:45:21.2524065Z python-3.13.2 |hf636f53_101_cp313 31.7 MB conda-forge 2025-05-07T19:45:21.2524572Z pyyaml-6.0.2 | py313h8060acc_2 201 KB conda-forge 2025-05-07T19:45:21.2525140Z re2-2024.07.02 | h9925aae_3 26 KB conda-forge 2025-05-07T19:45:21.2525586Z rhash-1.4.5 | hb9d3cd8_0 183 KB conda-forge 2025-05-07T19:45:21.2526031Z scikit-build-0.18.1 | pyhae55e72_2 114 KB conda-forge 2025-05-07T19:45:21.2526519Z singlejar-7.5.0 | h0e684df_1 122 KB conda-forge 2025-05-07T19:45:21.2526999Z sortedcontainers-2.4.0 | pyhd8ed1ab_1 28 KB conda-forge 2025-05-07T19:45:21.2527495Z sqlite-3.49.2 | h9eae976_0 840 KB conda-forge 2025-05-07T19:45:21.2527950Z tk-8.6.13 |noxft_h4845f30_101 3.2 MB conda-forge 2025-05-07T19:45:21.2528364Z tomli-2.2.1 | pyhd8ed1ab_1 19 KB conda-forge 2025-05-07T19:45:21.2528815Z wheel-0.45.1 | pyhd8ed1ab_1 61 KB conda-forge 2025-05-07T19:45:21.2529255Z xorg-libice-1.1.2 | 
hb9d3cd8_0 57 KB conda-forge 2025-05-07T19:45:21.2529731Z xorg-libsm-1.2.6 | he73a12e_0 27 KB conda-forge 2025-05-07T19:45:21.2530289Z xorg-libx11-1.8.12 | h4f16b4b_0 816 KB conda-forge 2025-05-07T19:45:21.2530743Z xorg-libxau-1.0.12 | hb9d3cd8_0 14 KB conda-forge 2025-05-07T19:45:21.2531247Z xorg-libxdmcp-1.1.5 | hb9d3cd8_0 19 KB conda-forge 2025-05-07T19:45:21.2531815Z xorg-libxext-1.3.6 | hb9d3cd8_0 49 KB conda-forge 2025-05-07T19:45:21.2532318Z xorg-libxfixes-6.0.1 | hb9d3cd8_0 19 KB conda-forge 2025-05-07T19:45:21.2532785Z xorg-libxi-1.8.2 | hb9d3cd8_0 46 KB conda-forge 2025-05-07T19:45:21.2533274Z xorg-libxrandr-1.5.4 | hb9d3cd8_0 29 KB conda-forge 2025-05-07T19:45:21.2533795Z xorg-libxrender-0.9.12 | hb9d3cd8_0 32 KB conda-forge 2025-05-07T19:45:21.2534262Z xorg-libxt-1.3.1 | hb9d3cd8_0 371 KB conda-forge 2025-05-07T19:45:21.2534755Z xorg-libxtst-1.2.5 | hb9d3cd8_3 32 KB conda-forge 2025-05-07T19:45:21.2535177Z xz-5.8.1 | hbcc6ac9_1 23 KB conda-forge 2025-05-07T19:45:21.2535638Z xz-gpl-tools-5.8.1 | hbcc6ac9_1 33 KB conda-forge 2025-05-07T19:45:21.2536087Z xz-tools-5.8.1 | hb9d3cd8_1 94 KB conda-forge 2025-05-07T19:45:21.2536538Z yaml-0.2.5 | h7f98852_2 87 KB conda-forge 2025-05-07T19:45:21.2536972Z zlib-1.3.1 | hb9d3cd8_2 90 KB conda-forge 2025-05-07T19:45:21.2537362Z zstd-1.5.7 | hb8e6e7a_2 554 KB conda-forge 2025-05-07T19:45:21.2537784Z ------------------------------------------------------------ 2025-05-07T19:45:21.2538136Z Total: 339.1 MB 2025-05-07T19:45:21.2538384Z 2025-05-07T19:45:21.2538517Z The following NEW packages will be INSTALLED: 2025-05-07T19:45:21.2538745Z 2025-05-07T19:45:21.2539008Z alsa-lib conda-forge/linux-64::alsa-lib-1.2.14-hb9d3cd8_0 2025-05-07T19:45:21.2539446Z attrs conda-forge/noarch::attrs-25.3.0-pyh71513ae_0 2025-05-07T19:45:21.2539936Z auditwheel conda-forge/noarch::auditwheel-6.2.0-pyha804496_1 2025-05-07T19:45:21.2540395Z bazel conda-forge/linux-64::bazel-7.5.0-h96810dc_2 2025-05-07T19:45:21.2540842Z c-ares 
conda-forge/linux-64::c-ares-1.34.5-hb9d3cd8_0 2025-05-07T19:45:21.2541300Z cairo conda-forge/linux-64::cairo-1.18.0-hbb29018_2 2025-05-07T19:45:21.2541724Z click conda-forge/noarch::click-8.1.8-pyh707e725_0 2025-05-07T19:45:21.2542172Z cmake conda-forge/linux-64::cmake-4.0.2-h74e3db0_0 2025-05-07T19:45:21.2542602Z distro conda-forge/noarch::distro-1.9.0-pyhd8ed1ab_1 2025-05-07T19:45:21.2543141Z exceptiongroup conda-forge/noarch::exceptiongroup-1.2.2-pyhd8ed1ab_1 2025-05-07T19:45:21.2543779Z font-ttf-dejavu-s~ conda-forge/noarch::font-ttf-dejavu-sans-mono-2.37-hab24e00_0 2025-05-07T19:45:21.2544420Z font-ttf-inconsol~ conda-forge/noarch::font-ttf-inconsolata-3.000-h77eed37_0 2025-05-07T19:45:21.2545062Z font-ttf-source-c~ conda-forge/noarch::font-ttf-source-code-pro-2.038-h77eed37_0 2025-05-07T19:45:21.2545651Z font-ttf-ubuntu conda-forge/noarch::font-ttf-ubuntu-0.83-h77eed37_3 2025-05-07T19:45:21.2546204Z fontconfig conda-forge/linux-64::fontconfig-2.15.0-h7e30c49_1 2025-05-07T19:45:21.2546746Z fonts-conda-ecosy~ conda-forge/noarch::fonts-conda-ecosystem-1-0 2025-05-07T19:45:21.2547250Z fonts-conda-forge conda-forge/noarch::fonts-conda-forge-1-0 2025-05-07T19:45:21.2547761Z freetype conda-forge/linux-64::freetype-2.13.3-ha770c72_1 2025-05-07T19:45:21.2548212Z giflib conda-forge/linux-64::giflib-5.2.2-hd590300_0 2025-05-07T19:45:21.2548695Z graphite2 conda-forge/linux-64::graphite2-1.3.13-h59595ed_1003 2025-05-07T19:45:21.2549177Z harfbuzz conda-forge/linux-64::harfbuzz-9.0.0-hfac3d4d_0 2025-05-07T19:45:21.2551028Z hypothesis conda-forge/noarch::hypothesis-6.131.14-pyha770c72_0 2025-05-07T19:45:21.2551535Z ijar conda-forge/linux-64::ijar-7.5.0-h5888daf_0 2025-05-07T19:45:21.2551964Z jinja2 conda-forge/noarch::jinja2-3.1.6-pyhd8ed1ab_0 2025-05-07T19:45:21.2552520Z keyutils conda-forge/linux-64::keyutils-1.6.1-h166bdaf_0 2025-05-07T19:45:21.2553034Z krb5 conda-forge/linux-64::krb5-1.21.3-h659f571_0 2025-05-07T19:45:21.2553671Z lcms2 
conda-forge/linux-64::lcms2-2.17-h717163a_0 2025-05-07T19:45:21.2554149Z lerc conda-forge/linux-64::lerc-4.0.0-h0aef613_1 2025-05-07T19:45:21.2554665Z libabseil conda-forge/linux-64::libabseil-20250127.1-cxx17_hbbce691_0 2025-05-07T19:45:21.2555230Z libcups conda-forge/linux-64::libcups-2.3.3-h4637d8d_4 2025-05-07T19:45:21.2555708Z libcurl conda-forge/linux-64::libcurl-8.13.0-h332b0f4_0 2025-05-07T19:45:21.2556236Z libdeflate conda-forge/linux-64::libdeflate-1.23-h86f0d12_0 2025-05-07T19:45:21.2556806Z libedit conda-forge/linux-64::libedit-3.1.20250104-pl5321h7949ede_0 2025-05-07T19:45:21.2557307Z libev conda-forge/linux-64::libev-4.33-hd590300_2 2025-05-07T19:45:21.2557798Z libexpat conda-forge/linux-64::libexpat-2.7.0-h5888daf_0 2025-05-07T19:45:21.2558319Z libfreetype conda-forge/linux-64::libfreetype-2.13.3-ha770c72_1 2025-05-07T19:45:21.2558901Z libfreetype6 conda-forge/linux-64::libfreetype6-2.13.3-h48d6fc4_1 2025-05-07T19:45:21.2559482Z libgfortran conda-forge/linux-64::libgfortran-15.1.0-h69a702a_2 2025-05-07T19:45:21.2560036Z libgfortran5 conda-forge/linux-64::libgfortran5-15.1.0-hcea5267_2 2025-05-07T19:45:21.2560580Z libglib conda-forge/linux-64::libglib-2.84.0-h2ff4ddf_0 2025-05-07T19:45:21.2561062Z libgrpc conda-forge/linux-64::libgrpc-1.71.0-h8e591d7_1 2025-05-07T19:45:21.2561617Z libjpeg-turbo conda-forge/linux-64::libjpeg-turbo-3.1.0-hb9d3cd8_0 2025-05-07T19:45:21.2562166Z liblzma conda-forge/linux-64::liblzma-5.8.1-hb9d3cd8_1 2025-05-07T19:45:21.2562684Z liblzma-devel conda-forge/linux-64::liblzma-devel-5.8.1-hb9d3cd8_1 2025-05-07T19:45:21.2563260Z libnghttp2 conda-forge/linux-64::libnghttp2-1.64.0-h161d5f1_0 2025-05-07T19:45:21.2563834Z libopenblas conda-forge/linux-64::libopenblas-0.3.29-pthreads_h94d23a6_0 2025-05-07T19:45:21.2564405Z libpng conda-forge/linux-64::libpng-1.6.47-h943b412_0 2025-05-07T19:45:21.2564932Z libprotobuf conda-forge/linux-64::libprotobuf-5.29.3-h501fc15_1 2025-05-07T19:45:21.2565570Z libre2-11 
conda-forge/linux-64::libre2-11-2024.07.02-hba17884_3 2025-05-07T19:45:21.2566082Z libsqlite conda-forge/linux-64::libsqlite-3.49.2-hee588c1_0 2025-05-07T19:45:21.2566541Z libssh2 conda-forge/linux-64::libssh2-1.11.1-hcf80075_0 2025-05-07T19:45:21.2567023Z libtiff conda-forge/linux-64::libtiff-4.7.0-hd9ff511_4 2025-05-07T19:45:21.2567481Z libuv conda-forge/linux-64::libuv-1.50.0-hb9d3cd8_0 2025-05-07T19:45:21.2567950Z libwebp-base conda-forge/linux-64::libwebp-base-1.5.0-h851e524_0 2025-05-07T19:45:21.2568456Z libxcb conda-forge/linux-64::libxcb-1.17.0-h8a09558_0 2025-05-07T19:45:21.2568878Z make conda-forge/linux-64::make-4.4.1-hb9d3cd8_2 2025-05-07T19:45:21.2569367Z markupsafe conda-forge/linux-64::markupsafe-3.0.2-py313h8060acc_1 2025-05-07T19:45:21.2569846Z ninja conda-forge/linux-64::ninja-1.12.1-hff21bea_1 2025-05-07T19:45:21.2570366Z openblas conda-forge/linux-64::openblas-0.3.29-pthreads_h6ec200e_0 2025-05-07T19:45:21.2570893Z openjdk conda-forge/linux-64::openjdk-23.0.1-h4c11d01_0 2025-05-07T19:45:21.2571364Z packaging conda-forge/noarch::packaging-25.0-pyh29332c3_1 2025-05-07T19:45:21.2572107Z patchelf conda-forge/linux-64::patchelf-0.18.0-h3f2d84a_2 2025-05-07T19:45:21.2572577Z pcre2 conda-forge/linux-64::pcre2-10.44-hc749103_2 2025-05-07T19:45:21.2573059Z pixman conda-forge/linux-64::pixman-0.46.0-h29eaf8c_0 2025-05-07T19:45:21.2573671Z pthread-stubs conda-forge/linux-64::pthread-stubs-0.4-hb9d3cd8_1002 2025-05-07T19:45:21.2574214Z pyelftools conda-forge/noarch::pyelftools-0.32-pyh707e725_1 2025-05-07T19:45:21.2574740Z pyyaml conda-forge/linux-64::pyyaml-6.0.2-py313h8060acc_2 2025-05-07T19:45:21.2575360Z re2 conda-forge/linux-64::re2-2024.07.02-h9925aae_3 2025-05-07T19:45:21.2575825Z rhash conda-forge/linux-64::rhash-1.4.5-hb9d3cd8_0 2025-05-07T19:45:21.2576388Z scikit-build conda-forge/noarch::scikit-build-0.18.1-pyhae55e72_2 2025-05-07T19:45:21.2576984Z singlejar conda-forge/linux-64::singlejar-7.5.0-h0e684df_1 2025-05-07T19:45:21.2577720Z 
sortedcontainers conda-forge/noarch::sortedcontainers-2.4.0-pyhd8ed1ab_1 2025-05-07T19:45:21.2578342Z tomli conda-forge/noarch::tomli-2.2.1-pyhd8ed1ab_1 2025-05-07T19:45:21.2578881Z xorg-libice conda-forge/linux-64::xorg-libice-1.1.2-hb9d3cd8_0 2025-05-07T19:45:21.2579442Z xorg-libsm conda-forge/linux-64::xorg-libsm-1.2.6-he73a12e_0 2025-05-07T19:45:21.2579973Z xorg-libx11 conda-forge/linux-64::xorg-libx11-1.8.12-h4f16b4b_0 2025-05-07T19:45:21.2580537Z xorg-libxau conda-forge/linux-64::xorg-libxau-1.0.12-hb9d3cd8_0 2025-05-07T19:45:21.2581181Z xorg-libxdmcp conda-forge/linux-64::xorg-libxdmcp-1.1.5-hb9d3cd8_0 2025-05-07T19:45:21.2581788Z xorg-libxext conda-forge/linux-64::xorg-libxext-1.3.6-hb9d3cd8_0 2025-05-07T19:45:21.2582397Z xorg-libxfixes conda-forge/linux-64::xorg-libxfixes-6.0.1-hb9d3cd8_0 2025-05-07T19:45:21.2582950Z xorg-libxi conda-forge/linux-64::xorg-libxi-1.8.2-hb9d3cd8_0 2025-05-07T19:45:21.2583545Z xorg-libxrandr conda-forge/linux-64::xorg-libxrandr-1.5.4-hb9d3cd8_0 2025-05-07T19:45:21.2584163Z xorg-libxrender conda-forge/linux-64::xorg-libxrender-0.9.12-hb9d3cd8_0 2025-05-07T19:45:21.2584768Z xorg-libxt conda-forge/linux-64::xorg-libxt-1.3.1-hb9d3cd8_0 2025-05-07T19:45:21.2585345Z xorg-libxtst conda-forge/linux-64::xorg-libxtst-1.2.5-hb9d3cd8_3 2025-05-07T19:45:21.2585898Z xz-gpl-tools conda-forge/linux-64::xz-gpl-tools-5.8.1-hbcc6ac9_1 2025-05-07T19:45:21.2586447Z xz-tools conda-forge/linux-64::xz-tools-5.8.1-hb9d3cd8_1 2025-05-07T19:45:21.2586911Z yaml conda-forge/linux-64::yaml-0.2.5-h7f98852_2 2025-05-07T19:45:21.2587221Z 2025-05-07T19:45:21.2587357Z The following packages will be UPDATED: 2025-05-07T19:45:21.2587588Z 2025-05-07T19:45:21.2587931Z libuuid pkgs/main::libuuid-1.41.5-h5eee18b_0 --> conda-forge::libuuid-2.38.1-h0b41bf4_0 2025-05-07T19:45:21.2588517Z libzlib 1.2.13-h4ab18f5_6 --> 1.3.1-hb9d3cd8_2 2025-05-07T19:45:21.2589122Z ncurses pkgs/main::ncurses-6.4-h6a678d5_0 --> conda-forge::ncurses-6.5-h2d0b736_3 2025-05-07T19:45:21.2589933Z 
python pkgs/main::python-3.13.2-hf623796_100~ --> conda-forge::python-3.13.2-hf636f53_101_cp313 2025-05-07T19:45:21.2590683Z sqlite pkgs/main::sqlite-3.45.3-h5eee18b_0 --> conda-forge::sqlite-3.49.2-h9eae976_0 2025-05-07T19:45:21.2591426Z wheel pkgs/main/linux-64::wheel-0.45.1-py31~ --> conda-forge/noarch::wheel-0.45.1-pyhd8ed1ab_1 2025-05-07T19:45:21.2592074Z xz pkgs/main::xz-5.6.4-h5eee18b_1 --> conda-forge::xz-5.8.1-hbcc6ac9_1 2025-05-07T19:45:21.2592589Z zlib 1.2.13-h4ab18f5_6 --> 1.3.1-hb9d3cd8_2 2025-05-07T19:45:21.2593081Z zstd 1.5.6-ha6fb4c9_0 --> 1.5.7-hb8e6e7a_2 2025-05-07T19:45:21.2593380Z 2025-05-07T19:45:21.2593619Z The following packages will be SUPERSEDED by a higher-priority channel: 2025-05-07T19:45:21.2594061Z 2025-05-07T19:45:21.2594359Z expat pkgs/main::expat-2.7.1-h6a678d5_0 --> conda-forge::expat-2.7.0-h5888daf_0 2025-05-07T19:45:21.2594991Z tk pkgs/main::tk-8.6.14-h39e8969_0 --> conda-forge::tk-8.6.13-noxft_h4845f30_101 2025-05-07T19:45:21.2595386Z 2025-05-07T19:45:21.2595475Z 2025-05-07T19:45:21.2595480Z 2025-05-07T19:45:21.2595646Z Downloading and Extracting Packages: ...working... 
2025-05-07T19:45:21.2596089Z openjdk-23.0.1 | 181.3 MB | | 0% 2025-05-07T19:45:21.2596778Z bazel-7.5.0 | 47.4 MB | | 0% 2025-05-07T19:45:21.2600560Z python-3.13.2 | 31.7 MB | | 0% 2025-05-07T19:45:21.2636240Z cmake-4.0.2 | 19.4 MB | | 0% 2025-05-07T19:45:21.2639406Z libgrpc-1.71.0 | 7.6 MB | | 0% 2025-05-07T19:45:21.2641107Z openblas-0.3.29 | 5.8 MB | | 0% 2025-05-07T19:45:21.2641680Z libopenblas-0.3.29 | 5.6 MB | | 0% 2025-05-07T19:45:21.2642572Z libcups-2.3.3 | 4.3 MB | | 0% 2025-05-07T19:45:21.2644572Z libglib-2.84.0 | 3.8 MB | | 0% 2025-05-07T19:45:21.2652788Z libprotobuf-5.29.3 | 3.2 MB | | 0%
2025-05-07T19:45:21.2653839Z tk-8.6.13 | 3.2 MB | | 0% 2025-05-07T19:45:21.2654998Z font-ttf-ubuntu-0.83 | 1.5 MB | | 0% 2025-05-07T19:45:21.2656203Z harfbuzz-9.0.0 | 1.5 MB | | 0% 2025-05-07T19:45:21.2679893Z libgfortran5-15.1.0 | 1.5 MB | | 0% 2025-05-07T19:45:21.2681154Z krb5-1.21.3 | 1.3 MB | | 0%
2025-05-07T19:45:21.2684216Z libabseil-20250127.1 | 1.3 MB | | 0% 2025-05-07T19:45:21.2685478Z cairo-1.18.0 | 961 KB | | 0% 2025-05-07T19:45:21.2686737Z pcre2-10.44 | 934 KB | | 0%
2025-05-07T19:45:21.2687958Z libsqlite-3.49.2 | 895 KB | | 0% 2025-05-07T19:45:21.4965097Z ... (more hidden) ... 2025-05-07T19:45:21.4974926Z openjdk-23.0.1 | 181.3 MB | | 0% 2025-05-07T19:45:21.5659446Z libgrpc-1.71.0 | 7.6 MB | | 0% 2025-05-07T19:45:21.5704681Z python-3.13.2 | 31.7 MB | | 0% 2025-05-07T19:45:21.5742868Z bazel-7.5.0 | 47.4 MB | | 0% 2025-05-07T19:45:21.5965105Z cmake-4.0.2 | 19.4 MB | | 0% 2025-05-07T19:45:21.5978931Z openjdk-23.0.1 | 181.3 MB | 4 | 5% 2025-05-07T19:45:21.6660446Z libgrpc-1.71.0 | 7.6 MB | ########3 | 83% 2025-05-07T19:45:21.6705434Z python-3.13.2 | 31.7 MB | ## | 21% 2025-05-07T19:45:21.6743479Z bazel-7.5.0 | 47.4 MB | ## | 20% 2025-05-07T19:45:21.6839236Z cmake-4.0.2 | 19.4 MB | ###2 | 33%
2025-05-07T19:45:21.6839699Z 2025-05-07T19:45:21.6839704Z 2025-05-07T19:45:21.6839708Z 2025-05-07T19:45:21.6840316Z 2025-05-07T19:45:21.6971456Z libgrpc-1.71.0 | 7.6 MB | ########## | 100%  2025-05-07T19:45:21.7293824Z openjdk-23.0.1 | 181.3 MB | 7 | 8% 2025-05-07T19:45:21.7295274Z 2025-05-07T19:45:21.7295316Z 2025-05-07T19:45:21.7295402Z 2025-05-07T19:45:21.7295425Z 2025-05-07T19:45:21.7295449Z 2025-05-07T19:45:21.7662678Z openblas-0.3.29 | 5.8 MB | | 0%  2025-05-07T19:45:21.7663036Z 2025-05-07T19:45:21.7663043Z 2025-05-07T19:45:21.7708079Z python-3.13.2 | 31.7 MB | ###7 | 38%  2025-05-07T19:45:21.7708552Z 2025-05-07T19:45:21.7747928Z bazel-7.5.0 | 47.4 MB | ###5 | 35%  2025-05-07T19:45:21.7748394Z 2025-05-07T19:45:21.7748400Z 2025-05-07T19:45:21.7748431Z 2025-05-07T19:45:21.7971903Z cmake-4.0.2 | 19.4 MB | ######8 | 69%  2025-05-07T19:45:21.8295253Z openjdk-23.0.1 | 181.3 MB | #1 | 11% 2025-05-07T19:45:21.8295825Z 2025-05-07T19:45:21.8295832Z 2025-05-07T19:45:21.8295838Z 2025-05-07T19:45:21.8295843Z 2025-05-07T19:45:21.8295850Z 2025-05-07T19:45:21.8664450Z openblas-0.3.29 | 5.8 MB | ######6 | 66%  2025-05-07T19:45:21.8666114Z 2025-05-07T19:45:21.8666155Z 2025-05-07T19:45:21.8750441Z python-3.13.2 | 31.7 MB | #####4 | 55%  2025-05-07T19:45:21.8750918Z 2025-05-07T19:45:21.8750923Z 2025-05-07T19:45:21.8750928Z 2025-05-07T19:45:21.8856442Z cmake-4.0.2 | 19.4 MB | #########6 | 97%  2025-05-07T19:45:21.8856948Z 2025-05-07T19:45:21.8974062Z bazel-7.5.0 | 47.4 MB | ####8 | 49%  2025-05-07T19:45:21.9482786Z openjdk-23.0.1 | 181.3 MB | #4 | 14% 2025-05-07T19:45:21.9483253Z 2025-05-07T19:45:21.9483258Z 2025-05-07T19:45:21.9483286Z 2025-05-07T19:45:21.9483291Z 2025-05-07T19:45:21.9483294Z 2025-05-07T19:45:21.9775320Z openblas-0.3.29 | 5.8 MB | ########## | 100%  2025-05-07T19:45:21.9775671Z 2025-05-07T19:45:21.9775676Z 2025-05-07T19:45:21.9856670Z python-3.13.2 | 31.7 MB | ####### | 70%  2025-05-07T19:45:21.9857038Z 2025-05-07T19:45:21.9857045Z 2025-05-07T19:45:21.9857052Z 
2025-05-07T19:45:21.9857058Z 2025-05-07T19:45:21.9857064Z 2025-05-07T19:45:21.9857424Z 2025-05-07T19:45:21.9857974Z libopenblas-0.3.29 | 5.6 MB | | 0%  2025-05-07T19:45:21.9858849Z 2025-05-07T19:45:21.9984776Z bazel-7.5.0 | 47.4 MB | ######8 | 69%  2025-05-07T19:45:22.0776941Z openjdk-23.0.1 | 181.3 MB | #9 | 19% 2025-05-07T19:45:22.0777374Z 2025-05-07T19:45:22.0777624Z 2025-05-07T19:45:22.0860648Z python-3.13.2 | 31.7 MB | ########9 | 89%  2025-05-07T19:45:22.0861974Z 2025-05-07T19:45:22.0861989Z 2025-05-07T19:45:22.0861999Z 2025-05-07T19:45:22.0862010Z 2025-05-07T19:45:22.0862021Z 2025-05-07T19:45:22.0862032Z 2025-05-07T19:45:22.0995355Z libopenblas-0.3.29 | 5.6 MB | ########5 | 86%  2025-05-07T19:45:22.0995758Z 2025-05-07T19:45:22.1147905Z bazel-7.5.0 | 47.4 MB | ########3 | 84%  2025-05-07T19:45:22.1493679Z openjdk-23.0.1 | 181.3 MB | ##3 | 23% 2025-05-07T19:45:22.1494112Z 2025-05-07T19:45:22.1494119Z 2025-05-07T19:45:22.1494161Z 2025-05-07T19:45:22.1749159Z cmake-4.0.2 | 19.4 MB | ########## | 100%  2025-05-07T19:45:22.1749513Z 2025-05-07T19:45:22.1749519Z 2025-05-07T19:45:22.1749526Z 2025-05-07T19:45:22.1749529Z 2025-05-07T19:45:22.1749534Z 2025-05-07T19:45:22.1749538Z 2025-05-07T19:45:22.2039851Z libopenblas-0.3.29 | 5.6 MB | ########## | 100%  2025-05-07T19:45:22.2040396Z 2025-05-07T19:45:22.2040402Z 2025-05-07T19:45:22.2040407Z 2025-05-07T19:45:22.2040410Z 2025-05-07T19:45:22.2040414Z 2025-05-07T19:45:22.2040418Z 2025-05-07T19:45:22.2040421Z 2025-05-07T19:45:22.2100526Z libcups-2.3.3 | 4.3 MB | | 0%  2025-05-07T19:45:22.2101011Z 2025-05-07T19:45:22.2101016Z 2025-05-07T19:45:22.2101020Z 2025-05-07T19:45:22.2101024Z 2025-05-07T19:45:22.2101029Z 2025-05-07T19:45:22.2101033Z 2025-05-07T19:45:22.2101037Z 2025-05-07T19:45:22.2101040Z 2025-05-07T19:45:22.2150291Z libglib-2.84.0 | 3.8 MB | | 0%  2025-05-07T19:45:22.3187174Z openjdk-23.0.1 | 181.3 MB | ##8 | 29% 2025-05-07T19:45:22.3315882Z openjdk-23.0.1 | 181.3 MB | ###2 | 33% 2025-05-07T19:45:22.3316412Z 
2025-05-07T19:45:22.3316420Z 2025-05-07T19:45:22.3316428Z 2025-05-07T19:45:22.3316435Z 2025-05-07T19:45:22.3316440Z 2025-05-07T19:45:22.3316479Z 2025-05-07T19:45:22.3316483Z 2025-05-07T19:45:22.3316486Z 2025-05-07T19:45:22.3316959Z libglib-2.84.0 | 3.8 MB | ########## | 100%  2025-05-07T19:45:22.3317260Z 2025-05-07T19:45:22.3317264Z 2025-05-07T19:45:22.3317268Z 2025-05-07T19:45:22.3317272Z 2025-05-07T19:45:22.3317276Z 2025-05-07T19:45:22.3317281Z 2025-05-07T19:45:22.3317284Z 2025-05-07T19:45:22.3317290Z 2025-05-07T19:45:22.3379770Z libglib-2.84.0 | 3.8 MB | ########## | 100%  2025-05-07T19:45:22.3380102Z 2025-05-07T19:45:22.3380106Z 2025-05-07T19:45:22.3380110Z 2025-05-07T19:45:22.3380114Z 2025-05-07T19:45:22.3380143Z 2025-05-07T19:45:22.3380146Z 2025-05-07T19:45:22.3380150Z 2025-05-07T19:45:22.3380437Z libcups-2.3.3 | 4.3 MB | ########## | 100%  2025-05-07T19:45:22.3380727Z 2025-05-07T19:45:22.3380731Z 2025-05-07T19:45:22.3380734Z 2025-05-07T19:45:22.3380738Z 2025-05-07T19:45:22.3380741Z 2025-05-07T19:45:22.3380744Z 2025-05-07T19:45:22.3380759Z 2025-05-07T19:45:22.3819876Z libcups-2.3.3 | 4.3 MB | ########## | 100%  2025-05-07T19:45:22.3820288Z 2025-05-07T19:45:22.3820309Z 2025-05-07T19:45:22.3820313Z 2025-05-07T19:45:22.3820319Z 2025-05-07T19:45:22.3820324Z 2025-05-07T19:45:22.3820329Z 2025-05-07T19:45:22.3820333Z 2025-05-07T19:45:22.3820339Z 2025-05-07T19:45:22.3820344Z 2025-05-07T19:45:22.3854613Z libprotobuf-5.29.3 | 3.2 MB | | 0%  2025-05-07T19:45:22.3855063Z 2025-05-07T19:45:22.3855068Z 2025-05-07T19:45:22.3855072Z 2025-05-07T19:45:22.3855077Z 2025-05-07T19:45:22.3855080Z 2025-05-07T19:45:22.3855375Z 2025-05-07T19:45:22.3855380Z 2025-05-07T19:45:22.3855385Z 2025-05-07T19:45:22.3855388Z 2025-05-07T19:45:22.3855393Z 2025-05-07T19:45:22.4187090Z tk-8.6.13 | 3.2 MB | | 0%  2025-05-07T19:45:22.4704815Z openjdk-23.0.1 | 181.3 MB | ###7 | 37% 2025-05-07T19:45:22.4705553Z 2025-05-07T19:45:22.4705562Z 2025-05-07T19:45:22.4705567Z 2025-05-07T19:45:22.4705572Z 
2025-05-07T19:45:22.4705576Z 2025-05-07T19:45:22.4705579Z 2025-05-07T19:45:22.4705584Z 2025-05-07T19:45:22.4705587Z 2025-05-07T19:45:22.4705591Z 2025-05-07T19:45:22.4705594Z 2025-05-07T19:45:22.4774858Z tk-8.6.13 | 3.2 MB | ########## | 100%  2025-05-07T19:45:22.4775449Z 2025-05-07T19:45:22.4775455Z 2025-05-07T19:45:22.4775463Z 2025-05-07T19:45:22.4775469Z 2025-05-07T19:45:22.4775476Z 2025-05-07T19:45:22.4775485Z 2025-05-07T19:45:22.4775492Z 2025-05-07T19:45:22.4775499Z 2025-05-07T19:45:22.4775504Z 2025-05-07T19:45:22.4937743Z libprotobuf-5.29.3 | 3.2 MB | ########## | 100%  2025-05-07T19:45:22.4938166Z 2025-05-07T19:45:22.4938172Z 2025-05-07T19:45:22.4938176Z 2025-05-07T19:45:22.4938180Z 2025-05-07T19:45:22.5187402Z libgrpc-1.71.0 | 7.6 MB | ########## | 100%  2025-05-07T19:45:22.5193150Z openjdk-23.0.1 | 181.3 MB | ####2 | 42% 2025-05-07T19:45:22.5193655Z 2025-05-07T19:45:22.5193664Z 2025-05-07T19:45:22.5193671Z 2025-05-07T19:45:22.5193708Z 2025-05-07T19:45:22.5193714Z 2025-05-07T19:45:22.5193720Z 2025-05-07T19:45:22.5193728Z 2025-05-07T19:45:22.5193735Z 2025-05-07T19:45:22.5193740Z 2025-05-07T19:45:22.5193744Z 2025-05-07T19:45:22.5193919Z 2025-05-07T19:45:22.5228360Z font-ttf-ubuntu-0.83 | 1.5 MB | 1 | 1%  2025-05-07T19:45:22.5228822Z 2025-05-07T19:45:22.5228830Z 2025-05-07T19:45:22.5228962Z 2025-05-07T19:45:22.5228971Z 2025-05-07T19:45:22.5228980Z 2025-05-07T19:45:22.5229017Z 2025-05-07T19:45:22.5229024Z 2025-05-07T19:45:22.5229029Z 2025-05-07T19:45:22.5229037Z 2025-05-07T19:45:22.5229043Z 2025-05-07T19:45:22.5229050Z 2025-05-07T19:45:22.5229058Z 2025-05-07T19:45:22.5360276Z harfbuzz-9.0.0 | 1.5 MB | 1 | 1%  2025-05-07T19:45:22.5360694Z 2025-05-07T19:45:22.5360880Z 2025-05-07T19:45:22.5607999Z python-3.13.2 | 31.7 MB | ########## | 100%  2025-05-07T19:45:22.5608393Z 2025-05-07T19:45:22.5608399Z 2025-05-07T19:45:22.5608404Z 2025-05-07T19:45:22.5608409Z 2025-05-07T19:45:22.5608414Z 2025-05-07T19:45:22.5608420Z 2025-05-07T19:45:22.5608424Z 
2025-05-07T19:45:22.5608429Z 2025-05-07T19:45:22.5608432Z 2025-05-07T19:45:22.5608437Z 2025-05-07T19:45:22.5608442Z 2025-05-07T19:45:22.5608446Z 2025-05-07T19:45:22.5612144Z harfbuzz-9.0.0 | 1.5 MB | ########## | 100%  2025-05-07T19:45:22.5612757Z 2025-05-07T19:45:22.5612763Z 2025-05-07T19:45:22.5612812Z 2025-05-07T19:45:22.5612820Z 2025-05-07T19:45:22.5612829Z 2025-05-07T19:45:22.5612834Z 2025-05-07T19:45:22.5612851Z 2025-05-07T19:45:22.5612857Z 2025-05-07T19:45:22.5612864Z 2025-05-07T19:45:22.5612872Z 2025-05-07T19:45:22.5612879Z 2025-05-07T19:45:22.5879618Z font-ttf-ubuntu-0.83 | 1.5 MB | ########## | 100%  2025-05-07T19:45:22.5880287Z 2025-05-07T19:45:22.5880323Z 2025-05-07T19:45:22.5880329Z 2025-05-07T19:45:22.5880338Z 2025-05-07T19:45:22.5880346Z 2025-05-07T19:45:22.5880351Z 2025-05-07T19:45:22.5880360Z 2025-05-07T19:45:22.5880369Z 2025-05-07T19:45:22.5880374Z 2025-05-07T19:45:22.5880381Z 2025-05-07T19:45:22.5880392Z 2025-05-07T19:45:22.5880398Z 2025-05-07T19:45:22.5880405Z 2025-05-07T19:45:22.5955928Z libgfortran5-15.1.0 | 1.5 MB | 1 | 1%  2025-05-07T19:45:22.5956323Z 2025-05-07T19:45:22.5956331Z 2025-05-07T19:45:22.5956334Z 2025-05-07T19:45:22.5956339Z 2025-05-07T19:45:22.5956343Z 2025-05-07T19:45:22.6037906Z openblas-0.3.29 | 5.8 MB | ########## | 100%  2025-05-07T19:45:22.6038481Z 2025-05-07T19:45:22.6038489Z 2025-05-07T19:45:22.6038499Z 2025-05-07T19:45:22.6038506Z 2025-05-07T19:45:22.6038512Z 2025-05-07T19:45:22.6038519Z 2025-05-07T19:45:22.6038527Z 2025-05-07T19:45:22.6038533Z 2025-05-07T19:45:22.6038825Z 2025-05-07T19:45:22.6038833Z 2025-05-07T19:45:22.6038839Z 2025-05-07T19:45:22.6038877Z 2025-05-07T19:45:22.6038881Z 2025-05-07T19:45:22.6038884Z 2025-05-07T19:45:22.6038887Z 2025-05-07T19:45:22.6039242Z libabseil-20250127.1 | 1.3 MB | 1 | 1%  2025-05-07T19:45:22.6039586Z 2025-05-07T19:45:22.6039590Z 2025-05-07T19:45:22.6039594Z 2025-05-07T19:45:22.6039597Z 2025-05-07T19:45:22.6039600Z 2025-05-07T19:45:22.6039604Z 2025-05-07T19:45:22.6039621Z 
2025-05-07T19:45:22.6039624Z 2025-05-07T19:45:22.6039628Z 2025-05-07T19:45:22.6039631Z 2025-05-07T19:45:22.6039635Z 2025-05-07T19:45:22.6039645Z 2025-05-07T19:45:22.6039648Z 2025-05-07T19:45:22.6042672Z 2025-05-07T19:45:22.6403689Z krb5-1.21.3 | 1.3 MB | 1 | 1%  2025-05-07T19:45:22.6404027Z 2025-05-07T19:45:22.6404033Z 2025-05-07T19:45:22.6404037Z 2025-05-07T19:45:22.6404042Z 2025-05-07T19:45:22.6404046Z 2025-05-07T19:45:22.6404076Z 2025-05-07T19:45:22.6404080Z 2025-05-07T19:45:22.6404084Z 2025-05-07T19:45:22.6404110Z 2025-05-07T19:45:22.6404113Z 2025-05-07T19:45:22.6404117Z 2025-05-07T19:45:22.6404120Z 2025-05-07T19:45:22.6405251Z 2025-05-07T19:45:22.6427151Z libgfortran5-15.1.0 | 1.5 MB | ########## | 100%  2025-05-07T19:45:22.6449777Z openjdk-23.0.1 | 181.3 MB | ####6 | 47% 2025-05-07T19:45:22.6450286Z 2025-05-07T19:45:22.6450297Z 2025-05-07T19:45:22.6450305Z 2025-05-07T19:45:22.6450312Z 2025-05-07T19:45:22.6450319Z 2025-05-07T19:45:22.6450326Z 2025-05-07T19:45:22.6450335Z 2025-05-07T19:45:22.6450366Z 2025-05-07T19:45:22.6450402Z 2025-05-07T19:45:22.6450410Z 2025-05-07T19:45:22.6450416Z 2025-05-07T19:45:22.6450423Z 2025-05-07T19:45:22.6450430Z 2025-05-07T19:45:22.6450436Z 2025-05-07T19:45:22.6450443Z 2025-05-07T19:45:22.6547284Z libabseil-20250127.1 | 1.3 MB | ########## | 100%  2025-05-07T19:45:22.6547779Z 2025-05-07T19:45:22.6547784Z 2025-05-07T19:45:22.6547788Z 2025-05-07T19:45:22.6547791Z 2025-05-07T19:45:22.6547795Z 2025-05-07T19:45:22.6547799Z 2025-05-07T19:45:22.6547802Z 2025-05-07T19:45:22.6547806Z 2025-05-07T19:45:22.6547809Z 2025-05-07T19:45:22.6547813Z 2025-05-07T19:45:22.6547816Z 2025-05-07T19:45:22.6547820Z 2025-05-07T19:45:22.6547824Z 2025-05-07T19:45:22.6547830Z 2025-05-07T19:45:22.6908397Z krb5-1.21.3 | 1.3 MB | ########## | 100%  2025-05-07T19:45:22.6908791Z 2025-05-07T19:45:22.6908815Z 2025-05-07T19:45:22.6908821Z 2025-05-07T19:45:22.6908825Z 2025-05-07T19:45:22.6908861Z 2025-05-07T19:45:22.6908866Z 2025-05-07T19:45:22.6908870Z 
2025-05-07T19:45:22.6908876Z 2025-05-07T19:45:22.6908879Z 2025-05-07T19:45:22.6908884Z 2025-05-07T19:45:22.6908888Z 2025-05-07T19:45:22.6908892Z 2025-05-07T19:45:22.6908897Z 2025-05-07T19:45:22.6908902Z 2025-05-07T19:45:22.6908905Z 2025-05-07T19:45:22.6908910Z 2025-05-07T19:45:22.6908958Z 2025-05-07T19:45:22.6908962Z 2025-05-07T19:45:22.6934008Z libsqlite-3.49.2 | 895 KB | 1 | 2%  2025-05-07T19:45:22.6934514Z 2025-05-07T19:45:22.6934519Z 2025-05-07T19:45:22.6934523Z 2025-05-07T19:45:22.6934526Z 2025-05-07T19:45:22.6934545Z 2025-05-07T19:45:22.6934549Z 2025-05-07T19:45:22.6934552Z 2025-05-07T19:45:22.6934556Z 2025-05-07T19:45:22.6934559Z 2025-05-07T19:45:22.6934563Z 2025-05-07T19:45:22.6934567Z 2025-05-07T19:45:22.6934570Z 2025-05-07T19:45:22.6934574Z 2025-05-07T19:45:22.6934577Z 2025-05-07T19:45:22.6934581Z 2025-05-07T19:45:22.6934584Z 2025-05-07T19:45:22.6934819Z 2025-05-07T19:45:22.6983114Z pcre2-10.44 | 934 KB | 1 | 2%  2025-05-07T19:45:22.6983561Z 2025-05-07T19:45:22.6983569Z 2025-05-07T19:45:22.6983576Z 2025-05-07T19:45:22.6983581Z 2025-05-07T19:45:22.6983585Z 2025-05-07T19:45:22.6983591Z 2025-05-07T19:45:22.6983873Z 2025-05-07T19:45:22.6983882Z 2025-05-07T19:45:22.6983887Z 2025-05-07T19:45:22.6983895Z 2025-05-07T19:45:22.6983902Z 2025-05-07T19:45:22.6983908Z 2025-05-07T19:45:22.6983915Z 2025-05-07T19:45:22.6983923Z 2025-05-07T19:45:22.6983928Z 2025-05-07T19:45:22.6983935Z 2025-05-07T19:45:22.7228056Z cairo-1.18.0 | 961 KB | 1 | 2%  2025-05-07T19:45:22.7229085Z 2025-05-07T19:45:22.7229101Z 2025-05-07T19:45:22.7229115Z 2025-05-07T19:45:22.7229128Z 2025-05-07T19:45:22.7229141Z 2025-05-07T19:45:22.7229153Z 2025-05-07T19:45:22.7229166Z 2025-05-07T19:45:22.7229205Z 2025-05-07T19:45:22.7229275Z 2025-05-07T19:45:22.7229287Z 2025-05-07T19:45:22.7229297Z 2025-05-07T19:45:22.7229309Z 2025-05-07T19:45:22.7229320Z 2025-05-07T19:45:22.7229332Z 2025-05-07T19:45:22.7229343Z 2025-05-07T19:45:22.7229355Z 2025-05-07T19:45:22.7229366Z 2025-05-07T19:45:22.7243883Z pcre2-10.44 | 
934 KB | ########## | 100%  2025-05-07T19:45:22.7244241Z 2025-05-07T19:45:22.7244245Z 2025-05-07T19:45:22.7244248Z 2025-05-07T19:45:22.7244252Z 2025-05-07T19:45:22.7244255Z 2025-05-07T19:45:22.7244259Z 2025-05-07T19:45:22.7244262Z 2025-05-07T19:45:22.7244266Z 2025-05-07T19:45:22.7244269Z 2025-05-07T19:45:22.7244273Z 2025-05-07T19:45:22.7244277Z 2025-05-07T19:45:22.7244280Z 2025-05-07T19:45:22.7244295Z 2025-05-07T19:45:22.7244299Z 2025-05-07T19:45:22.7244302Z 2025-05-07T19:45:22.7244306Z 2025-05-07T19:45:22.7244309Z 2025-05-07T19:45:22.7244312Z 2025-05-07T19:45:22.7284861Z libsqlite-3.49.2 | 895 KB | ########## | 100%  2025-05-07T19:45:22.7285312Z 2025-05-07T19:45:22.7285318Z 2025-05-07T19:45:22.7285322Z 2025-05-07T19:45:22.7285326Z 2025-05-07T19:45:22.7285331Z 2025-05-07T19:45:22.7285335Z 2025-05-07T19:45:22.7285340Z 2025-05-07T19:45:22.7285344Z 2025-05-07T19:45:22.7285349Z 2025-05-07T19:45:22.7285367Z 2025-05-07T19:45:22.7285382Z 2025-05-07T19:45:22.7285386Z 2025-05-07T19:45:22.7285390Z 2025-05-07T19:45:22.7285394Z 2025-05-07T19:45:22.7285399Z 2025-05-07T19:45:22.7285601Z 2025-05-07T19:45:22.7427812Z cairo-1.18.0 | 961 KB | ########## | 100%  2025-05-07T19:45:22.7763152Z openjdk-23.0.1 | 181.3 MB | #####2 | 52% 2025-05-07T19:45:22.7763623Z 2025-05-07T19:45:22.7763633Z 2025-05-07T19:45:22.7763643Z 2025-05-07T19:45:22.7763653Z 2025-05-07T19:45:22.7763662Z 2025-05-07T19:45:22.7763672Z 2025-05-07T19:45:22.7763681Z 2025-05-07T19:45:22.7763688Z 2025-05-07T19:45:22.7763696Z 2025-05-07T19:45:22.7763750Z 2025-05-07T19:45:22.7763757Z 2025-05-07T19:45:22.7763766Z 2025-05-07T19:45:22.7763774Z 2025-05-07T19:45:22.7763781Z 2025-05-07T19:45:22.7763788Z 2025-05-07T19:45:22.7763797Z 2025-05-07T19:45:22.7763805Z 2025-05-07T19:45:22.7763811Z 2025-05-07T19:45:22.7763844Z 2025-05-07T19:45:22.8021088Z ... (more hidden) ... 
2025-05-07T19:45:22.8022027Z 2025-05-07T19:45:22.8022680Z bazel-7.5.0 | 47.4 MB | ########## | 100%  2025-05-07T19:45:22.8023397Z 2025-05-07T19:45:22.8053757Z bazel-7.5.0 | 47.4 MB | ########## | 100%  2025-05-07T19:45:22.8054595Z 2025-05-07T19:45:22.8054609Z 2025-05-07T19:45:22.8054621Z 2025-05-07T19:45:22.8054632Z 2025-05-07T19:45:22.8054644Z 2025-05-07T19:45:22.8054654Z 2025-05-07T19:45:22.8054666Z 2025-05-07T19:45:22.8054676Z 2025-05-07T19:45:22.8054687Z 2025-05-07T19:45:22.8054698Z 2025-05-07T19:45:22.8054708Z 2025-05-07T19:45:22.8054722Z 2025-05-07T19:45:22.8054733Z 2025-05-07T19:45:22.8055179Z 2025-05-07T19:45:22.8055189Z 2025-05-07T19:45:22.8055200Z 2025-05-07T19:45:22.8055238Z 2025-05-07T19:45:22.8055248Z 2025-05-07T19:45:22.8055259Z 2025-05-07T19:45:22.8380408Z ... (more hidden) ... 2025-05-07T19:45:22.8380742Z 2025-05-07T19:45:22.8381020Z 2025-05-07T19:45:22.8381378Z 2025-05-07T19:45:22.8381393Z 2025-05-07T19:45:22.8381401Z 2025-05-07T19:45:22.8381411Z 2025-05-07T19:45:22.8429861Z libopenblas-0.3.29 | 5.6 MB | ########## | 100%  2025-05-07T19:45:22.9542457Z openjdk-23.0.1 | 181.3 MB | #####7 | 57% 2025-05-07T19:45:22.9982888Z openjdk-23.0.1 | 181.3 MB | ######2 | 62% 2025-05-07T19:45:22.9983349Z 2025-05-07T19:45:22.9983355Z 2025-05-07T19:45:22.9983359Z 2025-05-07T19:45:22.9983364Z 2025-05-07T19:45:22.9983369Z 2025-05-07T19:45:22.9983374Z 2025-05-07T19:45:22.9983380Z 2025-05-07T19:45:22.9983386Z 2025-05-07T19:45:23.0277026Z libglib-2.84.0 | 3.8 MB | ########## | 100%  2025-05-07T19:45:23.0277566Z 2025-05-07T19:45:23.0277572Z 2025-05-07T19:45:23.0277577Z 2025-05-07T19:45:23.0277582Z 2025-05-07T19:45:23.0277586Z 2025-05-07T19:45:23.0277591Z 2025-05-07T19:45:23.0277596Z 2025-05-07T19:45:23.1355744Z libcups-2.3.3 | 4.3 MB | ########## | 100%  2025-05-07T19:45:23.2356881Z openjdk-23.0.1 | 181.3 MB | ######6 | 67% 2025-05-07T19:45:23.3357546Z openjdk-23.0.1 | 181.3 MB | #######1 | 72% 2025-05-07T19:45:23.4359461Z openjdk-23.0.1 | 181.3 MB | #######6 | 77% 
2025-05-07T19:45:23.4959953Z openjdk-23.0.1 | 181.3 MB | ########2 | 83% 2025-05-07T19:45:23.4960502Z 2025-05-07T19:45:23.4960509Z 2025-05-07T19:45:23.4960515Z 2025-05-07T19:45:23.4960522Z 2025-05-07T19:45:23.4960528Z 2025-05-07T19:45:23.4960534Z 2025-05-07T19:45:23.4960544Z 2025-05-07T19:45:23.4960553Z 2025-05-07T19:45:23.4960564Z 2025-05-07T19:45:23.4960586Z 2025-05-07T19:45:23.4962791Z tk-8.6.13 | 3.2 MB | ########## | 100%  2025-05-07T19:45:23.4963271Z 2025-05-07T19:45:23.4963276Z 2025-05-07T19:45:23.4963280Z 2025-05-07T19:45:23.4963285Z 2025-05-07T19:45:23.4963291Z 2025-05-07T19:45:23.4963299Z 2025-05-07T19:45:23.4963308Z 2025-05-07T19:45:23.4963313Z 2025-05-07T19:45:23.4963318Z 2025-05-07T19:45:23.4963343Z 2025-05-07T19:45:23.5222154Z tk-8.6.13 | 3.2 MB | ########## | 100%  2025-05-07T19:45:23.5222623Z 2025-05-07T19:45:23.5222628Z 2025-05-07T19:45:23.5222633Z 2025-05-07T19:45:23.5222636Z 2025-05-07T19:45:23.5222641Z 2025-05-07T19:45:23.5222663Z 2025-05-07T19:45:23.5222667Z 2025-05-07T19:45:23.5222670Z 2025-05-07T19:45:23.5222674Z 2025-05-07T19:45:23.5225715Z libprotobuf-5.29.3 | 3.2 MB | ########## | 100%  2025-05-07T19:45:23.5226316Z 2025-05-07T19:45:23.5226322Z 2025-05-07T19:45:23.5226327Z 2025-05-07T19:45:23.5226331Z 2025-05-07T19:45:23.5226374Z 2025-05-07T19:45:23.5226380Z 2025-05-07T19:45:23.5226387Z 2025-05-07T19:45:23.5226395Z 2025-05-07T19:45:23.5226406Z 2025-05-07T19:45:23.5359909Z libprotobuf-5.29.3 | 3.2 MB | ########## | 100%  2025-05-07T19:45:23.6399453Z openjdk-23.0.1 | 181.3 MB | ########8 | 88% 2025-05-07T19:45:23.6399756Z 2025-05-07T19:45:23.6399807Z 2025-05-07T19:45:23.6399812Z 2025-05-07T19:45:23.6399816Z 2025-05-07T19:45:23.6399822Z 2025-05-07T19:45:23.6399826Z 2025-05-07T19:45:23.6399830Z 2025-05-07T19:45:23.6399835Z 2025-05-07T19:45:23.6399839Z 2025-05-07T19:45:23.6399842Z 2025-05-07T19:45:23.6399846Z 2025-05-07T19:45:23.6399849Z 2025-05-07T19:45:23.6401180Z harfbuzz-9.0.0 | 1.5 MB | ########## | 100%  2025-05-07T19:45:23.6401697Z 
2025-05-07T19:45:23.6401703Z 2025-05-07T19:45:23.6401707Z 2025-05-07T19:45:23.6401712Z 2025-05-07T19:45:23.6401717Z 2025-05-07T19:45:23.6401721Z 2025-05-07T19:45:23.6401726Z 2025-05-07T19:45:23.6401992Z 2025-05-07T19:45:23.6402162Z 2025-05-07T19:45:23.6402168Z 2025-05-07T19:45:23.6402175Z 2025-05-07T19:45:23.6402194Z 2025-05-07T19:45:23.6833711Z harfbuzz-9.0.0 | 1.5 MB | ########## | 100%  2025-05-07T19:45:23.7063248Z openjdk-23.0.1 | 181.3 MB | #########3 | 93% 2025-05-07T19:45:23.7064161Z 2025-05-07T19:45:23.7064175Z 2025-05-07T19:45:23.7064182Z 2025-05-07T19:45:23.7064188Z 2025-05-07T19:45:23.7064192Z 2025-05-07T19:45:23.7064197Z 2025-05-07T19:45:23.7064206Z 2025-05-07T19:45:23.7064214Z 2025-05-07T19:45:23.7064219Z 2025-05-07T19:45:23.7064228Z 2025-05-07T19:45:23.7064257Z 2025-05-07T19:45:23.7064894Z font-ttf-ubuntu-0.83 | 1.5 MB | ########## | 100%  2025-05-07T19:45:23.7065244Z 2025-05-07T19:45:23.7065250Z 2025-05-07T19:45:23.7065253Z 2025-05-07T19:45:23.7065259Z 2025-05-07T19:45:23.7065264Z 2025-05-07T19:45:23.7065269Z 2025-05-07T19:45:23.7065274Z 2025-05-07T19:45:23.7065280Z 2025-05-07T19:45:23.7065299Z 2025-05-07T19:45:23.7065321Z 2025-05-07T19:45:23.7065329Z 2025-05-07T19:45:23.8135005Z font-ttf-ubuntu-0.83 | 1.5 MB | ########## | 100%  2025-05-07T19:45:23.8355825Z openjdk-23.0.1 | 181.3 MB | #########8 | 98% 2025-05-07T19:45:23.8356304Z 2025-05-07T19:45:23.8356313Z 2025-05-07T19:45:23.8356351Z 2025-05-07T19:45:23.8356355Z 2025-05-07T19:45:23.8356360Z 2025-05-07T19:45:23.8356364Z 2025-05-07T19:45:23.8356367Z 2025-05-07T19:45:23.8356372Z 2025-05-07T19:45:23.8356376Z 2025-05-07T19:45:23.8356379Z 2025-05-07T19:45:23.8356383Z 2025-05-07T19:45:23.8356386Z 2025-05-07T19:45:23.8356399Z 2025-05-07T19:45:23.8358888Z libgfortran5-15.1.0 | 1.5 MB | ########## | 100%  2025-05-07T19:45:23.8359362Z 2025-05-07T19:45:23.8359368Z 2025-05-07T19:45:23.8359373Z 2025-05-07T19:45:23.8359377Z 2025-05-07T19:45:23.8359382Z 2025-05-07T19:45:23.8359386Z 2025-05-07T19:45:23.8359396Z 
2025-05-07T19:45:23.8359415Z 2025-05-07T19:45:23.8359419Z 2025-05-07T19:45:23.8359435Z 2025-05-07T19:45:23.8359438Z 2025-05-07T19:45:23.8359441Z 2025-05-07T19:45:23.8359445Z 2025-05-07T19:45:24.3433688Z libgfortran5-15.1.0 | 1.5 MB | ########## | 100%  2025-05-07T19:45:24.3434290Z 2025-05-07T19:45:24.3434303Z 2025-05-07T19:45:24.3434355Z 2025-05-07T19:45:24.3434360Z 2025-05-07T19:45:24.3434364Z 2025-05-07T19:45:24.3434367Z 2025-05-07T19:45:24.3434372Z 2025-05-07T19:45:24.3434377Z 2025-05-07T19:45:24.3434381Z 2025-05-07T19:45:24.3434384Z 2025-05-07T19:45:24.3434389Z 2025-05-07T19:45:24.3434392Z 2025-05-07T19:45:24.3434396Z 2025-05-07T19:45:24.3434399Z 2025-05-07T19:45:24.3434403Z 2025-05-07T19:45:24.3434823Z libabseil-20250127.1 | 1.3 MB | ########## | 100%  2025-05-07T19:45:24.3435222Z 2025-05-07T19:45:24.3435227Z 2025-05-07T19:45:24.3435231Z 2025-05-07T19:45:24.3435235Z 2025-05-07T19:45:24.3435239Z 2025-05-07T19:45:24.3435257Z 2025-05-07T19:45:24.3435261Z 2025-05-07T19:45:24.3435265Z 2025-05-07T19:45:24.3435283Z 2025-05-07T19:45:24.3435286Z 2025-05-07T19:45:24.3435289Z 2025-05-07T19:45:24.3435293Z 2025-05-07T19:45:24.3435296Z 2025-05-07T19:45:24.3435300Z 2025-05-07T19:45:24.3435303Z 2025-05-07T19:45:24.4943868Z libabseil-20250127.1 | 1.3 MB | ########## | 100%  2025-05-07T19:45:24.4944255Z 2025-05-07T19:45:24.4944275Z 2025-05-07T19:45:24.4944280Z 2025-05-07T19:45:24.4944284Z 2025-05-07T19:45:24.4944289Z 2025-05-07T19:45:24.4944292Z 2025-05-07T19:45:24.4944296Z 2025-05-07T19:45:24.4944300Z 2025-05-07T19:45:24.4944304Z 2025-05-07T19:45:24.4944307Z 2025-05-07T19:45:24.4944311Z 2025-05-07T19:45:24.4944314Z 2025-05-07T19:45:24.4944317Z 2025-05-07T19:45:24.4944321Z 2025-05-07T19:45:24.4946346Z krb5-1.21.3 | 1.3 MB | ########## | 100%  2025-05-07T19:45:24.4946866Z 2025-05-07T19:45:24.4946875Z 2025-05-07T19:45:24.4947149Z 2025-05-07T19:45:24.4947159Z 2025-05-07T19:45:24.4947166Z 2025-05-07T19:45:24.4947172Z 2025-05-07T19:45:24.4947178Z 2025-05-07T19:45:24.4947186Z 
2025-05-07T19:45:24.4947194Z 2025-05-07T19:45:24.4947201Z 2025-05-07T19:45:24.4947206Z 2025-05-07T19:45:24.4947210Z 2025-05-07T19:45:24.4947215Z 2025-05-07T19:45:24.4947921Z 2025-05-07T19:45:24.6943379Z krb5-1.21.3 | 1.3 MB | ########## | 100%  2025-05-07T19:45:24.6943893Z 2025-05-07T19:45:24.6943903Z 2025-05-07T19:45:24.6943910Z 2025-05-07T19:45:24.6943918Z 2025-05-07T19:45:24.6943943Z 2025-05-07T19:45:24.6943947Z 2025-05-07T19:45:24.6943953Z 2025-05-07T19:45:24.6943957Z 2025-05-07T19:45:24.6943962Z 2025-05-07T19:45:24.6943966Z 2025-05-07T19:45:24.6943972Z 2025-05-07T19:45:24.6943979Z 2025-05-07T19:45:24.6943985Z 2025-05-07T19:45:24.6943997Z 2025-05-07T19:45:24.6944029Z 2025-05-07T19:45:24.6944037Z 2025-05-07T19:45:24.6944043Z 2025-05-07T19:45:24.6944575Z pcre2-10.44 | 934 KB | ########## | 100%  2025-05-07T19:45:24.6945100Z 2025-05-07T19:45:24.6945107Z 2025-05-07T19:45:24.6945113Z 2025-05-07T19:45:24.6945119Z 2025-05-07T19:45:24.6945126Z 2025-05-07T19:45:24.6945132Z 2025-05-07T19:45:24.6945137Z 2025-05-07T19:45:24.6945144Z 2025-05-07T19:45:24.6945174Z 2025-05-07T19:45:24.6945181Z 2025-05-07T19:45:24.6945187Z 2025-05-07T19:45:24.6945192Z 2025-05-07T19:45:24.6945197Z 2025-05-07T19:45:24.6945201Z 2025-05-07T19:45:24.6945208Z 2025-05-07T19:45:24.6945213Z 2025-05-07T19:45:24.6945218Z 2025-05-07T19:45:24.7285442Z pcre2-10.44 | 934 KB | ########## | 100%  2025-05-07T19:45:24.7285997Z 2025-05-07T19:45:24.7286028Z 2025-05-07T19:45:24.7286035Z 2025-05-07T19:45:24.7286044Z 2025-05-07T19:45:24.7286052Z 2025-05-07T19:45:24.7286059Z 2025-05-07T19:45:24.7286068Z 2025-05-07T19:45:24.7286078Z 2025-05-07T19:45:24.7286101Z 2025-05-07T19:45:24.7286136Z 2025-05-07T19:45:24.7286141Z 2025-05-07T19:45:24.7286145Z 2025-05-07T19:45:24.7286149Z 2025-05-07T19:45:24.7286154Z 2025-05-07T19:45:24.7286159Z 2025-05-07T19:45:24.7286164Z 2025-05-07T19:45:24.7286167Z 2025-05-07T19:45:24.7286172Z 2025-05-07T19:45:24.7286845Z libsqlite-3.49.2 | 895 KB | ########## | 100%  2025-05-07T19:45:24.7287280Z 
2025-05-07T19:45:24.7287286Z 2025-05-07T19:45:24.7287293Z 2025-05-07T19:45:24.7287299Z 2025-05-07T19:45:24.7287305Z 2025-05-07T19:45:24.7287321Z 2025-05-07T19:45:24.7287327Z 2025-05-07T19:45:24.7287332Z 2025-05-07T19:45:24.7287338Z 2025-05-07T19:45:24.7287345Z 2025-05-07T19:45:24.7287350Z 2025-05-07T19:45:24.7287357Z 2025-05-07T19:45:24.7287364Z 2025-05-07T19:45:24.7287371Z 2025-05-07T19:45:24.7287379Z 2025-05-07T19:45:24.7287384Z 2025-05-07T19:45:24.7287388Z 2025-05-07T19:45:24.7288684Z 2025-05-07T19:45:24.8067894Z libsqlite-3.49.2 | 895 KB | ########## | 100%  2025-05-07T19:45:24.8068272Z 2025-05-07T19:45:24.8068277Z 2025-05-07T19:45:24.8068281Z 2025-05-07T19:45:24.8068301Z 2025-05-07T19:45:24.8068305Z 2025-05-07T19:45:24.8068309Z 2025-05-07T19:45:24.8068313Z 2025-05-07T19:45:24.8068317Z 2025-05-07T19:45:24.8068336Z 2025-05-07T19:45:24.8068341Z 2025-05-07T19:45:24.8068364Z 2025-05-07T19:45:24.8068368Z 2025-05-07T19:45:24.8068371Z 2025-05-07T19:45:24.8068375Z 2025-05-07T19:45:24.8068378Z 2025-05-07T19:45:24.8068382Z 2025-05-07T19:45:24.8068782Z cairo-1.18.0 | 961 KB | ########## | 100%  2025-05-07T19:45:24.8069193Z 2025-05-07T19:45:24.8069214Z 2025-05-07T19:45:24.8069220Z 2025-05-07T19:45:24.8069227Z 2025-05-07T19:45:24.8069233Z 2025-05-07T19:45:24.8069239Z 2025-05-07T19:45:24.8069246Z 2025-05-07T19:45:24.8069252Z 2025-05-07T19:45:24.8069257Z 2025-05-07T19:45:24.8069262Z 2025-05-07T19:45:24.8069268Z 2025-05-07T19:45:24.8069273Z 2025-05-07T19:45:24.8069521Z 2025-05-07T19:45:24.8069528Z 2025-05-07T19:45:24.8069533Z 2025-05-07T19:45:24.8069539Z 2025-05-07T19:45:25.3336889Z cairo-1.18.0 | 961 KB | ########## | 100%  2025-05-07T19:45:25.3337349Z 2025-05-07T19:45:25.3337359Z 2025-05-07T19:45:25.3337366Z 2025-05-07T19:45:25.5788794Z cmake-4.0.2 | 19.4 MB | ########## | 100%  2025-05-07T19:45:25.5789174Z 2025-05-07T19:45:25.5789180Z 2025-05-07T19:45:25.8862903Z python-3.13.2 | 31.7 MB | ########## | 100%  2025-05-07T19:45:25.9631032Z openjdk-23.0.1 | 181.3 MB | ########## 
| 100% 2025-05-07T19:45:26.6290371Z 2025-05-07T19:45:26.6291897Z bazel-7.5.0 | 47.4 MB | ########## | 100%  2025-05-07T19:45:26.6292778Z 2025-05-07T19:45:26.6292795Z 2025-05-07T19:45:26.6292808Z 2025-05-07T19:45:26.6292821Z 2025-05-07T19:45:26.6292834Z 2025-05-07T19:45:26.6292847Z 2025-05-07T19:45:26.6292861Z 2025-05-07T19:45:26.6292874Z 2025-05-07T19:45:26.6292980Z 2025-05-07T19:45:26.6292994Z 2025-05-07T19:45:26.6293005Z 2025-05-07T19:45:26.6293017Z 2025-05-07T19:45:26.6293030Z 2025-05-07T19:45:26.6293042Z 2025-05-07T19:45:26.6293054Z 2025-05-07T19:45:26.6293066Z 2025-05-07T19:45:26.6293078Z 2025-05-07T19:45:26.6293090Z 2025-05-07T19:45:26.6293102Z 2025-05-07T19:45:26.6293907Z ... (more hidden) ... 2025-05-07T19:45:26.6294809Z 2025-05-07T19:45:26.6294819Z 2025-05-07T19:45:26.6294830Z 2025-05-07T19:45:26.6294840Z 2025-05-07T19:45:26.6294850Z 2025-05-07T19:45:26.6294860Z 2025-05-07T19:45:26.6294870Z 2025-05-07T19:45:26.6294880Z 2025-05-07T19:45:26.6294890Z 2025-05-07T19:45:26.6294900Z 2025-05-07T19:45:26.6294911Z 2025-05-07T19:45:26.6294921Z 2025-05-07T19:45:26.6294932Z 2025-05-07T19:45:26.6294942Z 2025-05-07T19:45:26.6294952Z 2025-05-07T19:45:26.6294963Z 2025-05-07T19:45:26.6294973Z 2025-05-07T19:45:26.6294983Z 2025-05-07T19:45:26.6294993Z 2025-05-07T19:45:27.8241867Z ... (more hidden) ... 
2025-05-07T19:45:27.8250146Z openjdk-23.0.1 | 181.3 MB | ########## | 100% 2025-05-07T19:45:27.8250805Z 2025-05-07T19:45:27.8250810Z 2025-05-07T19:45:27.8250815Z 2025-05-07T19:45:27.8250819Z 2025-05-07T19:45:27.8250823Z 2025-05-07T19:45:27.8250827Z 2025-05-07T19:45:27.8250894Z 2025-05-07T19:45:27.8250898Z 2025-05-07T19:45:27.8250902Z 2025-05-07T19:45:27.8250905Z 2025-05-07T19:45:27.8250909Z 2025-05-07T19:45:27.8250912Z 2025-05-07T19:45:27.8250916Z 2025-05-07T19:45:27.8250919Z 2025-05-07T19:45:27.8250923Z 2025-05-07T19:45:27.8250926Z 2025-05-07T19:45:27.8250929Z 2025-05-07T19:45:27.8250933Z 2025-05-07T19:45:27.8250937Z 2025-05-07T19:45:27.8251111Z 2025-05-07T19:45:27.8251712Z  2025-05-07T19:45:27.8252083Z 2025-05-07T19:45:27.8252315Z 2025-05-07T19:45:27.8252514Z  2025-05-07T19:45:27.8252728Z 2025-05-07T19:45:27.8252732Z 2025-05-07T19:45:27.8252938Z  2025-05-07T19:45:27.8253163Z 2025-05-07T19:45:27.8253166Z 2025-05-07T19:45:27.8253170Z 2025-05-07T19:45:27.8253365Z  2025-05-07T19:45:27.8253617Z 2025-05-07T19:45:27.8253620Z 2025-05-07T19:45:27.8253624Z 2025-05-07T19:45:27.8253627Z 2025-05-07T19:45:27.8253811Z  2025-05-07T19:45:27.8254217Z 2025-05-07T19:45:27.8254221Z 2025-05-07T19:45:27.8254225Z 2025-05-07T19:45:27.8254248Z 2025-05-07T19:45:27.8254252Z 2025-05-07T19:45:27.8254472Z  2025-05-07T19:45:27.8254711Z 2025-05-07T19:45:27.8254742Z 2025-05-07T19:45:27.8254745Z 2025-05-07T19:45:27.8254749Z 2025-05-07T19:45:27.8254752Z 2025-05-07T19:45:27.8255054Z 2025-05-07T19:45:27.8255274Z  2025-05-07T19:45:27.8255522Z 2025-05-07T19:45:27.8255526Z 2025-05-07T19:45:27.8255529Z 2025-05-07T19:45:27.8255533Z 2025-05-07T19:45:27.8255537Z 2025-05-07T19:45:27.8255572Z 2025-05-07T19:45:27.8255576Z 2025-05-07T19:45:27.8255979Z  2025-05-07T19:45:27.8256225Z 2025-05-07T19:45:27.8256229Z 2025-05-07T19:45:27.8256233Z 2025-05-07T19:45:27.8256237Z 2025-05-07T19:45:27.8256240Z 2025-05-07T19:45:27.8256244Z 2025-05-07T19:45:27.8256247Z 2025-05-07T19:45:27.8256251Z 2025-05-07T19:45:27.8256483Z  
2025-05-07T19:45:27.8291274Z done 2025-05-07T19:45:28.1381301Z Preparing transaction: | / - done 2025-05-07T19:45:31.9677492Z Verifying transaction: | / - \ | / - \ | / - \ | / - \ | / - \ | / - \ | / - \ | / - done 2025-05-07T19:45:34.7807086Z Executing transaction: | / - \ | / - \ | / - \ | / - \ | / - \ | / - \ | / - \ done 2025-05-07T19:45:35.1941532Z [INSTALL] Adding symlink librhash.so.0, which is needed by CMake ... 2025-05-07T19:45:37.0756269Z + ln -s /github/home/miniconda/envs/build_binary/lib/librhash.so /github/home/miniconda/envs/build_binary/lib/librhash.so.0 2025-05-07T19:45:37.0756948Z 2025-05-07T19:45:37.0772268Z 2025-05-07T19:45:37.0804164Z [EXEC] [ATTEMPT 0/3] + conda run -n build_binary pip install build 2025-05-07T19:45:39.4476617Z WARNING: Running pip as the 'root' user can result in broken permissions and conflicting behaviour with the system package manager, possibly rendering your system unusable. It is recommended to use a virtual environment instead: https://pip.pypa.io/warnings/venv.
Use the --root-user-action option if you know what you are doing and want to suppress this warning. 2025-05-07T19:45:39.4478179Z 2025-05-07T19:45:39.4478293Z Collecting build 2025-05-07T19:45:39.4478695Z Downloading build-1.2.2.post1-py3-none-any.whl.metadata (6.5 kB) 2025-05-07T19:45:39.4479651Z Requirement already satisfied: packaging>=19.1 in /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages (from build) (25.0) 2025-05-07T19:45:39.4480727Z Collecting pyproject_hooks (from build) 2025-05-07T19:45:39.4481181Z Downloading pyproject_hooks-1.2.0-py3-none-any.whl.metadata (1.3 kB) 2025-05-07T19:45:39.4481664Z Downloading build-1.2.2.post1-py3-none-any.whl (22 kB) 2025-05-07T19:45:39.4482256Z Downloading pyproject_hooks-1.2.0-py3-none-any.whl (10 kB) 2025-05-07T19:45:39.4482682Z Installing collected packages: pyproject_hooks, build 2025-05-07T19:45:39.4482948Z 2025-05-07T19:45:39.4483137Z Successfully installed build-1.2.2.post1 pyproject_hooks-1.2.0 2025-05-07T19:45:39.4483425Z 2025-05-07T19:45:41.3392037Z /github/home/miniconda/envs/build_binary/bin/make 2025-05-07T19:45:41.3393161Z 2025-05-07T19:45:41.3966506Z [CHECK] Binary make found in PATH 2025-05-07T19:45:43.1954040Z /github/home/miniconda/envs/build_binary/bin/cmake 2025-05-07T19:45:43.1954894Z 2025-05-07T19:45:43.2536238Z [CHECK] Binary cmake found in PATH 2025-05-07T19:45:45.0435372Z /github/home/miniconda/envs/build_binary/bin/ninja 2025-05-07T19:45:45.0435718Z 2025-05-07T19:45:45.0996146Z [CHECK] Binary ninja found in PATH 2025-05-07T19:45:46.9979151Z [CHECK] Python (sub-)package 'click' found ... 2025-05-07T19:45:49.0346789Z [CHECK] Python (sub-)package 'hypothesis' found ... 2025-05-07T19:45:50.9544694Z [CHECK] Python (sub-)package 'jinja2' found ... 2025-05-07T19:45:52.9748282Z [CHECK] Python (sub-)package 'skbuild' found ... 2025-05-07T19:45:54.8574224Z [CHECK] Python (sub-)package 'wheel' found ... 
2025-05-07T19:45:54.8575497Z [INSTALL] Successfully installed all the build tools 2025-05-07T19:45:54.8641599Z ##[group]Run . $PRELUDE; install_cuda $BUILD_ENV 11.8.0 2025-05-07T19:45:54.8642047Z . $PRELUDE; install_cuda $BUILD_ENV 11.8.0 2025-05-07T19:45:54.8642655Z shell: bash --noprofile --norc -e -o pipefail {0} 2025-05-07T19:45:54.8642989Z env: 2025-05-07T19:45:54.8643204Z PRELUDE: .github/scripts/setup_env.bash 2025-05-07T19:45:54.8643519Z BUILD_ENV: build_binary 2025-05-07T19:45:54.8643759Z BUILD_TARGET: default 2025-05-07T19:45:54.8644024Z BUILD_VARIANT: cuda 2025-05-07T19:45:54.8644278Z BUILD_CUDA_VERSION: 11.8.0 2025-05-07T19:45:54.8644519Z ##[endgroup] 2025-05-07T19:45:55.2746856Z ################################################################################ 2025-05-07T19:45:55.2747400Z # Install CUDA 2025-05-07T19:45:55.2747658Z # 2025-05-07T19:45:55.2757606Z # [2025-05-07T19:45:55.275Z] + install_cuda build_binary 11.8.0 2025-05-07T19:45:55.2758004Z ################################################################################ 2025-05-07T19:45:55.2758294Z 2025-05-07T19:45:55.2776671Z [EXEC] [ATTEMPT 0/3] + wget -q --timeout 1 pypi.org -O /dev/null 2025-05-07T19:45:55.3620019Z [CHECK] Network does not appear to be blocked. 2025-05-07T19:45:55.3621106Z [SETUP] Cleaning up Conda packages ... 2025-05-07T19:45:55.3630142Z + conda clean --packages --tarball -y 2025-05-07T19:45:55.3630861Z 2025-05-07T19:45:55.8502765Z Will remove 144 (595.4 MB) tarball(s). 2025-05-07T19:45:55.8503748Z Will remove 19 (4.7 MB) package(s). 2025-05-07T19:45:55.9066250Z 2025-05-07T19:45:55.9077269Z + conda clean --all -y 2025-05-07T19:45:55.9077471Z 2025-05-07T19:45:56.4994377Z There are no unused tarball(s) to remove. 2025-05-07T19:45:56.4995380Z Will remove 1 index cache(s). 2025-05-07T19:45:56.4996212Z There are no unused package(s) to remove. 2025-05-07T19:45:56.4997140Z There are no tempfile(s) to remove. 2025-05-07T19:45:56.4997989Z There are no logfile(s) to remove. 
2025-05-07T19:45:56.5544493Z 2025-05-07T19:45:56.5559740Z [INSTALL] Installing CUDA 11.8.0 ... 2025-05-07T19:45:56.5587940Z [EXEC] [ATTEMPT 0/3] + conda install --force-reinstall -n build_binary -c nvidia/label/cuda-11.8.0 -y cuda 2025-05-07T19:45:57.6003177Z Channels: 2025-05-07T19:45:57.6003500Z - nvidia/label/cuda-11.8.0 2025-05-07T19:45:57.6003797Z - defaults 2025-05-07T19:45:57.6004023Z Platform: linux-64 2025-05-07T19:45:58.7765311Z Collecting package metadata (repodata.json): - \ | / - \ | done 2025-05-07T19:45:58.9925301Z Solving environment: - \ done 2025-05-07T19:45:59.1116501Z 2025-05-07T19:45:59.1117703Z ## Package Plan ## 2025-05-07T19:45:59.1118212Z 2025-05-07T19:45:59.1118808Z environment location: /github/home/miniconda/envs/build_binary 2025-05-07T19:45:59.1119739Z 2025-05-07T19:45:59.1120219Z added / updated specs: 2025-05-07T19:45:59.1120927Z - cuda 2025-05-07T19:45:59.1121284Z 2025-05-07T19:45:59.1121296Z 2025-05-07T19:45:59.1121642Z The following packages will be downloaded: 2025-05-07T19:45:59.1122303Z 2025-05-07T19:45:59.1122692Z package | build 2025-05-07T19:45:59.1123630Z ---------------------------|----------------- 2025-05-07T19:45:59.1124746Z cuda-11.8.0 | 0 1 KB nvidia/label/cuda-11.8.0 2025-05-07T19:45:59.1126069Z cuda-cccl-11.8.89 | 0 1.2 MB nvidia/label/cuda-11.8.0 2025-05-07T19:45:59.1127128Z cuda-command-line-tools-11.8.0| 0 1 KB nvidia/label/cuda-11.8.0 2025-05-07T19:45:59.1127824Z cuda-compiler-11.8.0 | 0 1 KB nvidia/label/cuda-11.8.0 2025-05-07T19:45:59.1128331Z cuda-cudart-11.8.89 | 0 197 KB nvidia/label/cuda-11.8.0 2025-05-07T19:45:59.1128837Z cuda-cudart-dev-11.8.89 | 0 1.1 MB nvidia/label/cuda-11.8.0 2025-05-07T19:45:59.1129345Z cuda-cuobjdump-11.8.86 | 0 229 KB nvidia/label/cuda-11.8.0 2025-05-07T19:45:59.1130375Z cuda-cupti-11.8.87 | 0 25.3 MB nvidia/label/cuda-11.8.0 2025-05-07T19:45:59.1130885Z cuda-cuxxfilt-11.8.86 | 0 291 KB nvidia/label/cuda-11.8.0 2025-05-07T19:45:59.1131425Z cuda-demo-suite-11.8.86 | 0 5.0 MB 
nvidia/label/cuda-11.8.0 2025-05-07T19:45:59.1131979Z cuda-documentation-11.8.86 | 0 89 KB nvidia/label/cuda-11.8.0 2025-05-07T19:45:59.1132515Z cuda-driver-dev-11.8.89 | 0 16 KB nvidia/label/cuda-11.8.0 2025-05-07T19:45:59.1133032Z cuda-gdb-11.8.86 | 0 4.8 MB nvidia/label/cuda-11.8.0 2025-05-07T19:45:59.1133645Z cuda-libraries-11.8.0 | 0 1 KB nvidia/label/cuda-11.8.0 2025-05-07T19:45:59.1134187Z cuda-libraries-dev-11.8.0 | 0 2 KB nvidia/label/cuda-11.8.0 2025-05-07T19:45:59.1134730Z cuda-memcheck-11.8.86 | 0 168 KB nvidia/label/cuda-11.8.0 2025-05-07T19:45:59.1135219Z cuda-nsight-11.8.86 | 0 113.6 MB nvidia/label/cuda-11.8.0 2025-05-07T19:45:59.1135752Z cuda-nsight-compute-11.8.0 | 0 1 KB nvidia/label/cuda-11.8.0 2025-05-07T19:45:59.1136254Z cuda-nvcc-11.8.89 | 0 50.8 MB nvidia/label/cuda-11.8.0 2025-05-07T19:45:59.1136748Z cuda-nvdisasm-11.8.86 | 0 48.7 MB nvidia/label/cuda-11.8.0 2025-05-07T19:45:59.1137260Z cuda-nvml-dev-11.8.86 | 0 83 KB nvidia/label/cuda-11.8.0 2025-05-07T19:45:59.1137742Z cuda-nvprof-11.8.87 | 0 4.4 MB nvidia/label/cuda-11.8.0 2025-05-07T19:45:59.1138237Z cuda-nvprune-11.8.86 | 0 65 KB nvidia/label/cuda-11.8.0 2025-05-07T19:45:59.1138717Z cuda-nvrtc-11.8.89 | 0 19.1 MB nvidia/label/cuda-11.8.0 2025-05-07T19:45:59.1139220Z cuda-nvrtc-dev-11.8.89 | 0 17.0 MB nvidia/label/cuda-11.8.0 2025-05-07T19:45:59.1139701Z cuda-nvtx-11.8.86 | 0 57 KB nvidia/label/cuda-11.8.0 2025-05-07T19:45:59.1140176Z cuda-nvvp-11.8.87 | 0 114.4 MB nvidia/label/cuda-11.8.0 2025-05-07T19:45:59.1140688Z cuda-profiler-api-11.8.86 | 0 18 KB nvidia/label/cuda-11.8.0 2025-05-07T19:45:59.1141194Z cuda-runtime-11.8.0 | 0 1 KB nvidia/label/cuda-11.8.0 2025-05-07T19:45:59.1141854Z cuda-sanitizer-api-11.8.86 | 0 16.6 MB nvidia/label/cuda-11.8.0 2025-05-07T19:45:59.1142362Z cuda-toolkit-11.8.0 | 0 1 KB nvidia/label/cuda-11.8.0 2025-05-07T19:45:59.1142859Z cuda-tools-11.8.0 | 0 1 KB nvidia/label/cuda-11.8.0 2025-05-07T19:45:59.1143376Z cuda-visual-tools-11.8.0 | 0 1 KB 
nvidia/label/cuda-11.8.0 2025-05-07T19:45:59.1143878Z gds-tools-1.4.0.31 | 0 2 KB nvidia/label/cuda-11.8.0 2025-05-07T19:45:59.1144369Z libcublas-11.11.3.6 | 0 364.0 MB nvidia/label/cuda-11.8.0 2025-05-07T19:45:59.1144859Z libcublas-dev-11.11.3.6 | 0 394.1 MB nvidia/label/cuda-11.8.0 2025-05-07T19:45:59.1145361Z libcufft-10.9.0.58 | 0 142.8 MB nvidia/label/cuda-11.8.0 2025-05-07T19:45:59.1145865Z libcufft-dev-10.9.0.58 | 0 275.8 MB nvidia/label/cuda-11.8.0 2025-05-07T19:45:59.1146346Z libcufile-1.4.0.31 | 0 548 KB nvidia/label/cuda-11.8.0 2025-05-07T19:45:59.1146847Z libcufile-dev-1.4.0.31 | 0 1.6 MB nvidia/label/cuda-11.8.0 2025-05-07T19:45:59.1147334Z libcurand-10.3.0.86 | 0 53.2 MB nvidia/label/cuda-11.8.0 2025-05-07T19:45:59.1147962Z libcurand-dev-10.3.0.86 | 0 53.7 MB nvidia/label/cuda-11.8.0 2025-05-07T19:45:59.1148461Z libcusolver-11.4.1.48 | 0 96.5 MB nvidia/label/cuda-11.8.0 2025-05-07T19:45:59.1148989Z libcusolver-dev-11.4.1.48 | 0 66.3 MB nvidia/label/cuda-11.8.0 2025-05-07T19:45:59.1149514Z libcusparse-11.7.5.86 | 0 176.3 MB nvidia/label/cuda-11.8.0 2025-05-07T19:45:59.1150023Z libcusparse-dev-11.7.5.86 | 0 359.7 MB nvidia/label/cuda-11.8.0 2025-05-07T19:45:59.1150639Z libnpp-11.8.0.86 | 0 147.8 MB nvidia/label/cuda-11.8.0 2025-05-07T19:45:59.1151271Z libnpp-dev-11.8.0.86 | 0 144.5 MB nvidia/label/cuda-11.8.0 2025-05-07T19:45:59.1151762Z libnvjpeg-11.9.0.86 | 0 2.4 MB nvidia/label/cuda-11.8.0 2025-05-07T19:45:59.1152263Z libnvjpeg-dev-11.9.0.86 | 0 2.1 MB nvidia/label/cuda-11.8.0 2025-05-07T19:45:59.1152886Z nsight-compute-2022.3.0.22 | 0 610.0 MB nvidia/label/cuda-11.8.0 2025-05-07T19:45:59.1153550Z ------------------------------------------------------------ 2025-05-07T19:45:59.1153908Z Total: 3.24 GB 2025-05-07T19:45:59.1154225Z 2025-05-07T19:45:59.1154359Z The following NEW packages will be INSTALLED: 2025-05-07T19:45:59.1154630Z 2025-05-07T19:45:59.1154854Z cuda nvidia/label/cuda-11.8.0/linux-64::cuda-11.8.0-0 2025-05-07T19:45:59.1155340Z cuda-cccl 
nvidia/label/cuda-11.8.0/linux-64::cuda-cccl-11.8.89-0 2025-05-07T19:45:59.1155977Z cuda-command-line~ nvidia/label/cuda-11.8.0/linux-64::cuda-command-line-tools-11.8.0-0 2025-05-07T19:45:59.1156649Z cuda-compiler nvidia/label/cuda-11.8.0/linux-64::cuda-compiler-11.8.0-0 2025-05-07T19:45:59.1157225Z cuda-cudart nvidia/label/cuda-11.8.0/linux-64::cuda-cudart-11.8.89-0 2025-05-07T19:45:59.1157847Z cuda-cudart-dev nvidia/label/cuda-11.8.0/linux-64::cuda-cudart-dev-11.8.89-0 2025-05-07T19:45:59.1158474Z cuda-cuobjdump nvidia/label/cuda-11.8.0/linux-64::cuda-cuobjdump-11.8.86-0 2025-05-07T19:45:59.1159071Z cuda-cupti nvidia/label/cuda-11.8.0/linux-64::cuda-cupti-11.8.87-0 2025-05-07T19:45:59.1160104Z cuda-cuxxfilt nvidia/label/cuda-11.8.0/linux-64::cuda-cuxxfilt-11.8.86-0 2025-05-07T19:45:59.1160708Z cuda-demo-suite nvidia/label/cuda-11.8.0/linux-64::cuda-demo-suite-11.8.86-0 2025-05-07T19:45:59.1161438Z cuda-documentation nvidia/label/cuda-11.8.0/linux-64::cuda-documentation-11.8.86-0 2025-05-07T19:45:59.1162076Z cuda-driver-dev nvidia/label/cuda-11.8.0/linux-64::cuda-driver-dev-11.8.89-0 2025-05-07T19:45:59.1162641Z cuda-gdb nvidia/label/cuda-11.8.0/linux-64::cuda-gdb-11.8.86-0 2025-05-07T19:45:59.1163203Z cuda-libraries nvidia/label/cuda-11.8.0/linux-64::cuda-libraries-11.8.0-0 2025-05-07T19:45:59.1163827Z cuda-libraries-dev nvidia/label/cuda-11.8.0/linux-64::cuda-libraries-dev-11.8.0-0 2025-05-07T19:45:59.1164469Z cuda-memcheck nvidia/label/cuda-11.8.0/linux-64::cuda-memcheck-11.8.86-0 2025-05-07T19:45:59.1165031Z cuda-nsight nvidia/label/cuda-11.8.0/linux-64::cuda-nsight-11.8.86-0 2025-05-07T19:45:59.1165655Z cuda-nsight-compu~ nvidia/label/cuda-11.8.0/linux-64::cuda-nsight-compute-11.8.0-0 2025-05-07T19:45:59.1166257Z cuda-nvcc nvidia/label/cuda-11.8.0/linux-64::cuda-nvcc-11.8.89-0 2025-05-07T19:45:59.1166802Z cuda-nvdisasm nvidia/label/cuda-11.8.0/linux-64::cuda-nvdisasm-11.8.86-0 2025-05-07T19:45:59.1167390Z cuda-nvml-dev 
nvidia/label/cuda-11.8.0/linux-64::cuda-nvml-dev-11.8.86-0 2025-05-07T19:45:59.1167950Z cuda-nvprof nvidia/label/cuda-11.8.0/linux-64::cuda-nvprof-11.8.87-0 2025-05-07T19:45:59.1168524Z cuda-nvprune nvidia/label/cuda-11.8.0/linux-64::cuda-nvprune-11.8.86-0 2025-05-07T19:45:59.1169090Z cuda-nvrtc nvidia/label/cuda-11.8.0/linux-64::cuda-nvrtc-11.8.89-0 2025-05-07T19:45:59.1169723Z cuda-nvrtc-dev nvidia/label/cuda-11.8.0/linux-64::cuda-nvrtc-dev-11.8.89-0 2025-05-07T19:45:59.1170292Z cuda-nvtx nvidia/label/cuda-11.8.0/linux-64::cuda-nvtx-11.8.86-0 2025-05-07T19:45:59.1170803Z cuda-nvvp nvidia/label/cuda-11.8.0/linux-64::cuda-nvvp-11.8.87-0 2025-05-07T19:45:59.1171410Z cuda-profiler-api nvidia/label/cuda-11.8.0/linux-64::cuda-profiler-api-11.8.86-0 2025-05-07T19:45:59.1172040Z cuda-runtime nvidia/label/cuda-11.8.0/linux-64::cuda-runtime-11.8.0-0 2025-05-07T19:45:59.1172652Z cuda-sanitizer-api nvidia/label/cuda-11.8.0/linux-64::cuda-sanitizer-api-11.8.86-0 2025-05-07T19:45:59.1173281Z cuda-toolkit nvidia/label/cuda-11.8.0/linux-64::cuda-toolkit-11.8.0-0 2025-05-07T19:45:59.1173818Z cuda-tools nvidia/label/cuda-11.8.0/linux-64::cuda-tools-11.8.0-0 2025-05-07T19:45:59.1174423Z cuda-visual-tools nvidia/label/cuda-11.8.0/linux-64::cuda-visual-tools-11.8.0-0 2025-05-07T19:45:59.1175030Z gds-tools nvidia/label/cuda-11.8.0/linux-64::gds-tools-1.4.0.31-0 2025-05-07T19:45:59.1175562Z libcublas nvidia/label/cuda-11.8.0/linux-64::libcublas-11.11.3.6-0 2025-05-07T19:45:59.1176150Z libcublas-dev nvidia/label/cuda-11.8.0/linux-64::libcublas-dev-11.11.3.6-0 2025-05-07T19:45:59.1176710Z libcufft nvidia/label/cuda-11.8.0/linux-64::libcufft-10.9.0.58-0 2025-05-07T19:45:59.1177288Z libcufft-dev nvidia/label/cuda-11.8.0/linux-64::libcufft-dev-10.9.0.58-0 2025-05-07T19:45:59.1177869Z libcufile nvidia/label/cuda-11.8.0/linux-64::libcufile-1.4.0.31-0 2025-05-07T19:45:59.1178427Z libcufile-dev nvidia/label/cuda-11.8.0/linux-64::libcufile-dev-1.4.0.31-0 2025-05-07T19:45:59.1179017Z libcurand 
nvidia/label/cuda-11.8.0/linux-64::libcurand-10.3.0.86-0 2025-05-07T19:45:59.1179582Z libcurand-dev nvidia/label/cuda-11.8.0/linux-64::libcurand-dev-10.3.0.86-0 2025-05-07T19:45:59.1180199Z libcusolver nvidia/label/cuda-11.8.0/linux-64::libcusolver-11.4.1.48-0 2025-05-07T19:45:59.1180814Z libcusolver-dev nvidia/label/cuda-11.8.0/linux-64::libcusolver-dev-11.4.1.48-0 2025-05-07T19:45:59.1181411Z libcusparse nvidia/label/cuda-11.8.0/linux-64::libcusparse-11.7.5.86-0 2025-05-07T19:45:59.1182022Z libcusparse-dev nvidia/label/cuda-11.8.0/linux-64::libcusparse-dev-11.7.5.86-0 2025-05-07T19:45:59.1182582Z libnpp nvidia/label/cuda-11.8.0/linux-64::libnpp-11.8.0.86-0 2025-05-07T19:45:59.1183178Z libnpp-dev nvidia/label/cuda-11.8.0/linux-64::libnpp-dev-11.8.0.86-0 2025-05-07T19:45:59.1183733Z libnvjpeg nvidia/label/cuda-11.8.0/linux-64::libnvjpeg-11.9.0.86-0 2025-05-07T19:45:59.1184296Z libnvjpeg-dev nvidia/label/cuda-11.8.0/linux-64::libnvjpeg-dev-11.9.0.86-0 2025-05-07T19:45:59.1184929Z nsight-compute nvidia/label/cuda-11.8.0/linux-64::nsight-compute-2022.3.0.22-0 2025-05-07T19:45:59.1185482Z 2025-05-07T19:45:59.1220556Z 2025-05-07T19:45:59.1220807Z 2025-05-07T19:45:59.1221530Z Downloading and Extracting Packages: ...working... 2025-05-07T19:45:59.1223356Z nsight-compute-2022. | 610.0 MB | | 0% 2025-05-07T19:45:59.1224134Z 2025-05-07T19:45:59.1242123Z libcublas-dev-11.11. | 394.1 MB | | 0%  2025-05-07T19:45:59.1242900Z 2025-05-07T19:45:59.1242931Z 2025-05-07T19:45:59.1253191Z libcublas-11.11.3.6 | 364.0 MB | | 0%  2025-05-07T19:45:59.1253985Z 2025-05-07T19:45:59.1253996Z 2025-05-07T19:45:59.1254031Z 2025-05-07T19:45:59.1272301Z libcusparse-dev-11.7 | 359.7 MB | | 0%  2025-05-07T19:45:59.1273501Z 2025-05-07T19:45:59.1273515Z 2025-05-07T19:45:59.1273526Z 2025-05-07T19:45:59.1273537Z 2025-05-07T19:45:59.1291946Z libcufft-dev-10.9.0. 
| 275.8 MB | | 0%  2025-05-07T19:45:59.1292476Z 2025-05-07T19:45:59.1292492Z 2025-05-07T19:45:59.1292497Z 2025-05-07T19:45:59.1292563Z 2025-05-07T19:45:59.1292616Z 2025-05-07T19:45:59.1293526Z libcusparse-11.7.5.8 | 176.3 MB | | 0%  2025-05-07T19:45:59.1293887Z 2025-05-07T19:45:59.1293891Z 2025-05-07T19:45:59.1293895Z 2025-05-07T19:45:59.1293899Z 2025-05-07T19:45:59.1293902Z 2025-05-07T19:45:59.1293912Z 2025-05-07T19:45:59.1294194Z libnpp-11.8.0.86 | 147.8 MB | | 0%  2025-05-07T19:45:59.1295544Z 2025-05-07T19:45:59.1295549Z 2025-05-07T19:45:59.1295552Z 2025-05-07T19:45:59.1295556Z 2025-05-07T19:45:59.1295559Z 2025-05-07T19:45:59.1295572Z 2025-05-07T19:45:59.1295576Z 2025-05-07T19:45:59.1295922Z libnpp-dev-11.8.0.86 | 144.5 MB | | 0%  2025-05-07T19:45:59.1296223Z 2025-05-07T19:45:59.1296239Z 2025-05-07T19:45:59.1296243Z 2025-05-07T19:45:59.1296247Z 2025-05-07T19:45:59.1296250Z 2025-05-07T19:45:59.1296253Z 2025-05-07T19:45:59.1296257Z 2025-05-07T19:45:59.1296260Z 2025-05-07T19:45:59.1296538Z libcufft-10.9.0.58 | 142.8 MB | | 0%  2025-05-07T19:45:59.1296836Z 2025-05-07T19:45:59.1296840Z 2025-05-07T19:45:59.1296848Z 2025-05-07T19:45:59.1296852Z 2025-05-07T19:45:59.1296855Z 2025-05-07T19:45:59.1296858Z 2025-05-07T19:45:59.1296862Z 2025-05-07T19:45:59.1296865Z 2025-05-07T19:45:59.1296902Z 2025-05-07T19:45:59.1297666Z cuda-nvvp-11.8.87 | 114.4 MB | | 0%  2025-05-07T19:45:59.1297998Z 2025-05-07T19:45:59.1298020Z 2025-05-07T19:45:59.1298025Z 2025-05-07T19:45:59.1298046Z 2025-05-07T19:45:59.1298051Z 2025-05-07T19:45:59.1298085Z 2025-05-07T19:45:59.1298089Z 2025-05-07T19:45:59.1298093Z 2025-05-07T19:45:59.1298097Z 2025-05-07T19:45:59.1298342Z 2025-05-07T19:45:59.1310366Z cuda-nsight-11.8.86 | 113.6 MB | | 0%  2025-05-07T19:45:59.1311449Z 2025-05-07T19:45:59.1311453Z 2025-05-07T19:45:59.1311457Z 2025-05-07T19:45:59.1311461Z 2025-05-07T19:45:59.1311464Z 2025-05-07T19:45:59.1311468Z 2025-05-07T19:45:59.1311471Z 2025-05-07T19:45:59.1311475Z 2025-05-07T19:45:59.1311478Z 
2025-05-07T19:45:59.1311481Z 2025-05-07T19:45:59.1311499Z 2025-05-07T19:45:59.1311790Z libcusolver-11.4.1.4 | 96.5 MB | | 0%  2025-05-07T19:45:59.1312121Z 2025-05-07T19:45:59.1312125Z 2025-05-07T19:45:59.1312129Z 2025-05-07T19:45:59.1312132Z 2025-05-07T19:45:59.1312136Z 2025-05-07T19:45:59.1312139Z 2025-05-07T19:45:59.1312143Z 2025-05-07T19:45:59.1312146Z 2025-05-07T19:45:59.1312149Z 2025-05-07T19:45:59.1312153Z 2025-05-07T19:45:59.1314283Z 2025-05-07T19:45:59.1314287Z 2025-05-07T19:45:59.1314678Z libcusolver-dev-11.4 | 66.3 MB | | 0%  2025-05-07T19:45:59.1315017Z 2025-05-07T19:45:59.1315020Z 2025-05-07T19:45:59.1315024Z 2025-05-07T19:45:59.1315027Z 2025-05-07T19:45:59.1315031Z 2025-05-07T19:45:59.1315035Z 2025-05-07T19:45:59.1315038Z 2025-05-07T19:45:59.1315041Z 2025-05-07T19:45:59.1315045Z 2025-05-07T19:45:59.1315048Z 2025-05-07T19:45:59.1315052Z 2025-05-07T19:45:59.1315055Z 2025-05-07T19:45:59.1315058Z 2025-05-07T19:45:59.1315377Z libcurand-dev-10.3.0 | 53.7 MB | | 0%  2025-05-07T19:45:59.1315700Z 2025-05-07T19:45:59.1315704Z 2025-05-07T19:45:59.1315707Z 2025-05-07T19:45:59.1315711Z 2025-05-07T19:45:59.1315714Z 2025-05-07T19:45:59.1315718Z 2025-05-07T19:45:59.1315721Z 2025-05-07T19:45:59.1315725Z 2025-05-07T19:45:59.1315728Z 2025-05-07T19:45:59.1315731Z 2025-05-07T19:45:59.1315735Z 2025-05-07T19:45:59.1315742Z 2025-05-07T19:45:59.1315762Z 2025-05-07T19:45:59.1315765Z 2025-05-07T19:45:59.1316062Z libcurand-10.3.0.86 | 53.2 MB | | 0%  2025-05-07T19:45:59.1316379Z 2025-05-07T19:45:59.1316383Z 2025-05-07T19:45:59.1316387Z 2025-05-07T19:45:59.1316390Z 2025-05-07T19:45:59.1316393Z 2025-05-07T19:45:59.1316397Z 2025-05-07T19:45:59.1316400Z 2025-05-07T19:45:59.1316404Z 2025-05-07T19:45:59.1316423Z 2025-05-07T19:45:59.1316426Z 2025-05-07T19:45:59.1316429Z 2025-05-07T19:45:59.1316433Z 2025-05-07T19:45:59.1316548Z 2025-05-07T19:45:59.1316553Z 2025-05-07T19:45:59.1316556Z 2025-05-07T19:45:59.1316883Z cuda-nvcc-11.8.89 | 50.8 MB | | 0%  2025-05-07T19:45:59.1317209Z 
2025-05-07T19:45:59.1317576Z cuda-nvdisasm-11.8.8 | 48.7 MB | | 0% 2025-05-07T19:45:59.1318331Z cuda-cupti-11.8.87 | 25.3 MB | | 0% 2025-05-07T19:45:59.1325860Z cuda-nvrtc-11.8.89 | 19.1 MB | | 0%
2025-05-07T19:45:59.1326331Z 2025-05-07T19:45:59.1326335Z 2025-05-07T19:45:59.1326338Z 2025-05-07T19:45:59.1326341Z 2025-05-07T19:45:59.1326345Z 2025-05-07T19:45:59.1326349Z 2025-05-07T19:45:59.1326352Z 2025-05-07T19:46:03.8577459Z ... (more hidden) ... 2025-05-07T19:46:03.8578406Z 2025-05-07T19:46:03.8578420Z 2025-05-07T19:46:03.8578431Z 2025-05-07T19:46:03.8578442Z 2025-05-07T19:46:03.8579245Z libcufft-dev-10.9.0. | 275.8 MB | ########## | 100%  2025-05-07T19:46:03.8580099Z 2025-05-07T19:46:03.8580158Z 2025-05-07T19:46:03.8580169Z 2025-05-07T19:46:03.8580179Z 2025-05-07T19:46:05.4522750Z libcufft-dev-10.9.0. | 275.8 MB | ########## | 100%  2025-05-07T19:46:05.4523107Z 2025-05-07T19:46:05.4523350Z libcublas-dev-11.11. | 394.1 MB | ########## | 100%  2025-05-07T19:46:05.4523615Z 2025-05-07T19:46:05.5958286Z libcublas-dev-11.11. | 394.1 MB | ########## | 100%  2025-05-07T19:46:05.5958605Z 2025-05-07T19:46:05.5958641Z 2025-05-07T19:46:05.5958645Z 2025-05-07T19:46:05.5958900Z libcusparse-dev-11.7 | 359.7 MB | ########## | 100%  2025-05-07T19:46:05.5959186Z 2025-05-07T19:46:05.5959191Z 2025-05-07T19:46:05.5959213Z 2025-05-07T19:46:07.7788899Z libcusparse-dev-11.7 | 359.7 MB | ########## | 100%  2025-05-07T19:46:07.7789295Z 2025-05-07T19:46:07.7789301Z 2025-05-07T19:46:07.7789571Z libcublas-11.11.3.6 | 364.0 MB | ########## | 100%  2025-05-07T19:46:07.7789843Z 2025-05-07T19:46:07.7789849Z 2025-05-07T19:46:07.8844599Z libcublas-11.11.3.6 | 364.0 MB | ########## | 100%  2025-05-07T19:46:07.8844930Z 2025-05-07T19:46:07.8844935Z 2025-05-07T19:46:07.8844939Z 2025-05-07T19:46:07.8844944Z 2025-05-07T19:46:07.8844948Z 2025-05-07T19:46:07.8844952Z 2025-05-07T19:46:07.8845236Z libnpp-11.8.0.86 | 147.8 MB | ########## | 100%  2025-05-07T19:46:07.8845520Z 2025-05-07T19:46:07.8845525Z 2025-05-07T19:46:07.8845529Z 2025-05-07T19:46:07.8845536Z 2025-05-07T19:46:07.8845559Z 2025-05-07T19:46:07.8845562Z 2025-05-07T19:46:08.2904504Z libnpp-11.8.0.86 | 147.8 MB | ########## | 100%  
2025-05-07T19:46:08.2904845Z 2025-05-07T19:46:08.2904849Z 2025-05-07T19:46:08.2904857Z 2025-05-07T19:46:08.2904861Z 2025-05-07T19:46:08.2904865Z 2025-05-07T19:46:08.2905143Z libcusparse-11.7.5.8 | 176.3 MB | ########## | 100%  2025-05-07T19:46:08.2905456Z 2025-05-07T19:46:08.2905461Z 2025-05-07T19:46:08.2905464Z 2025-05-07T19:46:08.2905467Z 2025-05-07T19:46:08.2905472Z 2025-05-07T19:46:08.7670779Z libcusparse-11.7.5.8 | 176.3 MB | ########## | 100%  2025-05-07T19:46:08.7671184Z 2025-05-07T19:46:08.7671190Z 2025-05-07T19:46:08.7671194Z 2025-05-07T19:46:08.7671198Z 2025-05-07T19:46:08.7671202Z 2025-05-07T19:46:08.7671206Z 2025-05-07T19:46:08.7671213Z 2025-05-07T19:46:08.7671496Z libnpp-dev-11.8.0.86 | 144.5 MB | ########## | 100%  2025-05-07T19:46:08.7671795Z 2025-05-07T19:46:08.7671799Z 2025-05-07T19:46:08.7671823Z 2025-05-07T19:46:08.7671839Z 2025-05-07T19:46:08.7671843Z 2025-05-07T19:46:08.7671846Z 2025-05-07T19:46:08.7671849Z 2025-05-07T19:46:09.7202358Z libnpp-dev-11.8.0.86 | 144.5 MB | ########## | 100%  2025-05-07T19:46:09.7202710Z 2025-05-07T19:46:09.7202715Z 2025-05-07T19:46:09.7202719Z 2025-05-07T19:46:09.7202725Z 2025-05-07T19:46:09.7202729Z 2025-05-07T19:46:09.7202734Z 2025-05-07T19:46:09.7202751Z 2025-05-07T19:46:09.7202755Z 2025-05-07T19:46:09.7202773Z 2025-05-07T19:46:09.7203100Z cuda-nvvp-11.8.87 | 114.4 MB | ########## | 100%  2025-05-07T19:46:09.7203393Z 2025-05-07T19:46:09.7203397Z 2025-05-07T19:46:09.7203401Z 2025-05-07T19:46:09.7203405Z 2025-05-07T19:46:09.7203408Z 2025-05-07T19:46:09.7203411Z 2025-05-07T19:46:09.7203427Z 2025-05-07T19:46:09.7203432Z 2025-05-07T19:46:09.7203435Z 2025-05-07T19:46:10.3773066Z cuda-nvvp-11.8.87 | 114.4 MB | ########## | 100%  2025-05-07T19:46:10.3773775Z 2025-05-07T19:46:10.3773780Z 2025-05-07T19:46:10.3773800Z 2025-05-07T19:46:10.3773805Z 2025-05-07T19:46:10.3773810Z 2025-05-07T19:46:10.3773814Z 2025-05-07T19:46:10.3773818Z 2025-05-07T19:46:10.3773821Z 2025-05-07T19:46:10.3773825Z 2025-05-07T19:46:10.3773828Z 
2025-05-07T19:46:10.3774147Z cuda-nsight-11.8.86 | 113.6 MB | ########## | 100%  2025-05-07T19:46:10.3774459Z 2025-05-07T19:46:10.3774463Z 2025-05-07T19:46:10.3774482Z 2025-05-07T19:46:10.3774485Z 2025-05-07T19:46:10.3774489Z 2025-05-07T19:46:10.3774505Z 2025-05-07T19:46:10.3774509Z 2025-05-07T19:46:10.3774513Z 2025-05-07T19:46:10.3774516Z 2025-05-07T19:46:10.3774519Z 2025-05-07T19:46:10.5226319Z cuda-nsight-11.8.86 | 113.6 MB | ########## | 100%  2025-05-07T19:46:10.5226842Z nsight-compute-2022. | 610.0 MB | ########## | 100% 2025-05-07T19:46:10.5762080Z nsight-compute-2022. | 610.0 MB | ########## | 100% 2025-05-07T19:46:10.5762375Z 2025-05-07T19:46:10.5762410Z 2025-05-07T19:46:10.5762414Z 2025-05-07T19:46:10.5762418Z 2025-05-07T19:46:10.5762422Z 2025-05-07T19:46:10.5762426Z 2025-05-07T19:46:10.5762429Z 2025-05-07T19:46:10.5762433Z 2025-05-07T19:46:10.5762436Z 2025-05-07T19:46:10.5762439Z 2025-05-07T19:46:10.5762455Z 2025-05-07T19:46:10.5762875Z libcusolver-11.4.1.4 | 96.5 MB | ########## | 100%  2025-05-07T19:46:10.5763206Z 2025-05-07T19:46:10.5763210Z 2025-05-07T19:46:10.5763214Z 2025-05-07T19:46:10.5763218Z 2025-05-07T19:46:10.5763222Z 2025-05-07T19:46:10.5763444Z 2025-05-07T19:46:10.5763449Z 2025-05-07T19:46:10.5763453Z 2025-05-07T19:46:10.5763457Z 2025-05-07T19:46:10.5763461Z 2025-05-07T19:46:10.5763468Z 2025-05-07T19:46:10.7652185Z libcusolver-11.4.1.4 | 96.5 MB | ########## | 100%  2025-05-07T19:46:10.7652577Z 2025-05-07T19:46:10.7652583Z 2025-05-07T19:46:10.7652589Z 2025-05-07T19:46:10.7652616Z 2025-05-07T19:46:10.7652622Z 2025-05-07T19:46:10.7652628Z 2025-05-07T19:46:10.7652682Z 2025-05-07T19:46:10.7652685Z 2025-05-07T19:46:10.7652689Z 2025-05-07T19:46:10.7652692Z 2025-05-07T19:46:10.7652696Z 2025-05-07T19:46:10.7652699Z 2025-05-07T19:46:10.7653017Z libcusolver-dev-11.4 | 66.3 MB | ########## | 100%  2025-05-07T19:46:10.7653600Z 2025-05-07T19:46:10.7653604Z 2025-05-07T19:46:10.7653608Z 2025-05-07T19:46:10.7653614Z 2025-05-07T19:46:10.7653619Z 
2025-05-07T19:46:10.7653623Z 2025-05-07T19:46:10.7653652Z 2025-05-07T19:46:10.7653656Z 2025-05-07T19:46:10.7653662Z 2025-05-07T19:46:10.7653694Z 2025-05-07T19:46:10.7653697Z 2025-05-07T19:46:10.7653717Z 2025-05-07T19:46:11.5379106Z libcusolver-dev-11.4 | 66.3 MB | ########## | 100%  2025-05-07T19:46:11.5379522Z 2025-05-07T19:46:11.5379528Z 2025-05-07T19:46:11.5379534Z 2025-05-07T19:46:11.5379538Z 2025-05-07T19:46:11.5379542Z 2025-05-07T19:46:11.5379546Z 2025-05-07T19:46:11.5379552Z 2025-05-07T19:46:11.5379555Z 2025-05-07T19:46:11.5379600Z 2025-05-07T19:46:11.5379605Z 2025-05-07T19:46:11.5379637Z 2025-05-07T19:46:11.5379640Z 2025-05-07T19:46:11.5379664Z 2025-05-07T19:46:11.5380019Z libcurand-dev-10.3.0 | 53.7 MB | ########## | 100%  2025-05-07T19:46:11.5380363Z 2025-05-07T19:46:11.5380368Z 2025-05-07T19:46:11.5380373Z 2025-05-07T19:46:11.5380378Z 2025-05-07T19:46:11.5380383Z 2025-05-07T19:46:11.5380416Z 2025-05-07T19:46:11.5380420Z 2025-05-07T19:46:11.5380425Z 2025-05-07T19:46:11.5380428Z 2025-05-07T19:46:11.5380432Z 2025-05-07T19:46:11.5380454Z 2025-05-07T19:46:11.5380457Z 2025-05-07T19:46:11.5380461Z 2025-05-07T19:46:11.5554309Z libcurand-dev-10.3.0 | 53.7 MB | ########## | 100%  2025-05-07T19:46:11.5554711Z 2025-05-07T19:46:11.5554716Z 2025-05-07T19:46:11.5554720Z 2025-05-07T19:46:11.5554724Z 2025-05-07T19:46:11.5554728Z 2025-05-07T19:46:11.5554731Z 2025-05-07T19:46:11.5554736Z 2025-05-07T19:46:11.5554739Z 2025-05-07T19:46:11.5555009Z 2025-05-07T19:46:11.5555013Z 2025-05-07T19:46:11.5555016Z 2025-05-07T19:46:11.5555020Z 2025-05-07T19:46:11.5555023Z 2025-05-07T19:46:11.5555026Z 2025-05-07T19:46:11.5555375Z libcurand-10.3.0.86 | 53.2 MB | ########## | 100%  2025-05-07T19:46:11.5555730Z 2025-05-07T19:46:11.5555734Z 2025-05-07T19:46:11.5555737Z 2025-05-07T19:46:11.5555740Z 2025-05-07T19:46:11.5555744Z 2025-05-07T19:46:11.5555747Z 2025-05-07T19:46:11.5555751Z 2025-05-07T19:46:11.5555754Z 2025-05-07T19:46:11.5555758Z 2025-05-07T19:46:11.5555771Z 
2025-05-07T19:46:11.5555775Z 2025-05-07T19:46:11.5555778Z 2025-05-07T19:46:11.5555782Z 2025-05-07T19:46:11.5555785Z 2025-05-07T19:46:11.5871369Z libcurand-10.3.0.86 | 53.2 MB | ########## | 100%  2025-05-07T19:46:11.5871775Z 2025-05-07T19:46:11.5871780Z 2025-05-07T19:46:11.5871784Z 2025-05-07T19:46:11.5871787Z 2025-05-07T19:46:11.5871791Z 2025-05-07T19:46:11.5871795Z 2025-05-07T19:46:11.5871813Z 2025-05-07T19:46:11.5871817Z 2025-05-07T19:46:11.5871821Z 2025-05-07T19:46:11.5871848Z 2025-05-07T19:46:11.5871851Z 2025-05-07T19:46:11.5871855Z 2025-05-07T19:46:11.5871858Z 2025-05-07T19:46:11.5871862Z 2025-05-07T19:46:11.5871865Z 2025-05-07T19:46:11.5871870Z 2025-05-07T19:46:11.5872205Z cuda-nvdisasm-11.8.8 | 48.7 MB | ########## | 100%  2025-05-07T19:46:11.5872558Z 2025-05-07T19:46:11.5872562Z 2025-05-07T19:46:11.5872588Z 2025-05-07T19:46:11.5872592Z 2025-05-07T19:46:11.5872898Z 2025-05-07T19:46:11.5872905Z 2025-05-07T19:46:11.5872908Z 2025-05-07T19:46:11.5872911Z 2025-05-07T19:46:11.5872915Z 2025-05-07T19:46:11.5872918Z 2025-05-07T19:46:11.5872921Z 2025-05-07T19:46:11.5872925Z 2025-05-07T19:46:11.5872928Z 2025-05-07T19:46:11.5872931Z 2025-05-07T19:46:11.5872935Z 2025-05-07T19:46:11.5872938Z 2025-05-07T19:46:11.7230372Z cuda-nvdisasm-11.8.8 | 48.7 MB | ########## | 100%  2025-05-07T19:46:11.7230816Z 2025-05-07T19:46:11.7230821Z 2025-05-07T19:46:11.7230825Z 2025-05-07T19:46:11.7230828Z 2025-05-07T19:46:11.7230832Z 2025-05-07T19:46:11.7230835Z 2025-05-07T19:46:11.7230840Z 2025-05-07T19:46:11.7230845Z 2025-05-07T19:46:11.7231133Z libcufft-10.9.0.58 | 142.8 MB | ########## | 100%  2025-05-07T19:46:11.7231468Z 2025-05-07T19:46:11.7231471Z 2025-05-07T19:46:11.7231476Z 2025-05-07T19:46:11.7231481Z 2025-05-07T19:46:11.7231485Z 2025-05-07T19:46:11.7231489Z 2025-05-07T19:46:11.7231514Z 2025-05-07T19:46:11.7231517Z 2025-05-07T19:46:12.0076850Z libcufft-10.9.0.58 | 142.8 MB | ########## | 100%  2025-05-07T19:46:12.0077269Z 2025-05-07T19:46:12.0077277Z 2025-05-07T19:46:12.0077281Z 
2025-05-07T19:46:12.0077821Z cuda-nvrtc-11.8.89 | 19.1 MB | ########## | 100% 2025-05-07T19:46:12.0696638Z cuda-nvrtc-11.8.89 | 19.1 MB | ########## | 100% 2025-05-07T19:46:12.0697801Z cuda-cupti-11.8.87 | 25.3 MB | ########## | 100%
2025-05-07T19:46:12.1882716Z cuda-cupti-11.8.87 | 25.3 MB | ########## | 100% 2025-05-07T19:46:12.1883829Z ... (more hidden) ... 2025-05-07T19:46:12.2349005Z ... (more hidden) ...
2025-05-07T19:46:12.2349398Z 2025-05-07T19:46:12.2349403Z 2025-05-07T19:46:12.2349407Z 2025-05-07T19:46:12.2349411Z 2025-05-07T19:46:12.2349414Z 2025-05-07T19:46:12.2349419Z 2025-05-07T19:46:12.2349423Z 2025-05-07T19:46:12.2349429Z 2025-05-07T19:46:12.2349434Z 2025-05-07T19:46:12.2349438Z 2025-05-07T19:46:12.2349443Z 2025-05-07T19:46:12.2349448Z 2025-05-07T19:46:12.2349452Z 2025-05-07T19:46:12.2349468Z 2025-05-07T19:46:12.2349472Z 2025-05-07T19:46:12.2349824Z cuda-nvcc-11.8.89 | 50.8 MB | ########## | 100%  2025-05-07T19:46:12.2350147Z 2025-05-07T19:46:12.2350150Z 2025-05-07T19:46:12.2350154Z 2025-05-07T19:46:12.2350157Z 2025-05-07T19:46:12.2350161Z 2025-05-07T19:46:12.2350164Z 2025-05-07T19:46:12.2350167Z 2025-05-07T19:46:12.2350171Z 2025-05-07T19:46:12.2350174Z 2025-05-07T19:46:12.2350178Z 2025-05-07T19:46:12.2350182Z 2025-05-07T19:46:12.2350186Z 2025-05-07T19:46:12.2350189Z 2025-05-07T19:46:12.2350225Z 2025-05-07T19:46:12.2350229Z 2025-05-07T19:46:34.7144268Z cuda-nvcc-11.8.89 | 50.8 MB | ########## | 100%  2025-05-07T19:46:34.7144676Z 2025-05-07T19:46:34.7144683Z 2025-05-07T19:46:34.7144687Z 2025-05-07T19:46:34.7144693Z 2025-05-07T19:46:40.2027669Z libcufft-dev-10.9.0. | 275.8 MB | ########## | 100%  2025-05-07T19:46:40.2028055Z 2025-05-07T19:46:40.2028061Z 2025-05-07T19:46:40.2028350Z 2025-05-07T19:46:47.2777210Z libcusparse-dev-11.7 | 359.7 MB | ########## | 100%  2025-05-07T19:46:47.2777622Z 2025-05-07T19:46:54.0313232Z libcublas-dev-11.11. 
| 394.1 MB | ########## | 100%  2025-05-07T19:46:54.0313565Z 2025-05-07T19:46:54.0313571Z 2025-05-07T19:46:54.0313578Z 2025-05-07T19:46:54.0313584Z 2025-05-07T19:46:54.0313588Z 2025-05-07T19:46:54.0313593Z 2025-05-07T19:47:03.5710415Z libnpp-11.8.0.86 | 147.8 MB | ########## | 100%  2025-05-07T19:47:03.5710783Z 2025-05-07T19:47:03.5710833Z 2025-05-07T19:47:03.5710840Z 2025-05-07T19:47:03.5710844Z 2025-05-07T19:47:03.5710848Z 2025-05-07T19:47:07.5435790Z libcusparse-11.7.5.8 | 176.3 MB | ########## | 100%  2025-05-07T19:47:07.5436234Z 2025-05-07T19:47:07.5436241Z 2025-05-07T19:47:07.5436247Z 2025-05-07T19:47:07.5436253Z 2025-05-07T19:47:07.5436259Z 2025-05-07T19:47:07.5436265Z 2025-05-07T19:47:07.5436274Z 2025-05-07T19:47:11.1319999Z libnpp-dev-11.8.0.86 | 144.5 MB | ########## | 100%  2025-05-07T19:47:11.1320463Z 2025-05-07T19:47:11.1320480Z 2025-05-07T19:47:11.6391201Z libcublas-11.11.3.6 | 364.0 MB | ########## | 100%  2025-05-07T19:47:11.6391523Z 2025-05-07T19:47:11.6391820Z 2025-05-07T19:47:11.6391840Z 2025-05-07T19:47:11.6391847Z 2025-05-07T19:47:11.6391854Z 2025-05-07T19:47:11.6391859Z 2025-05-07T19:47:11.6391863Z 2025-05-07T19:47:11.6391869Z 2025-05-07T19:47:11.6391874Z 2025-05-07T19:47:15.1039366Z cuda-nvvp-11.8.87 | 114.4 MB | ########## | 100%  2025-05-07T19:47:15.1039813Z 2025-05-07T19:47:15.1039819Z 2025-05-07T19:47:15.1039824Z 2025-05-07T19:47:15.1039828Z 2025-05-07T19:47:15.1039831Z 2025-05-07T19:47:15.1039834Z 2025-05-07T19:47:15.1039839Z 2025-05-07T19:47:15.1039843Z 2025-05-07T19:47:15.1039847Z 2025-05-07T19:47:15.1039851Z 2025-05-07T19:47:24.5306434Z cuda-nsight-11.8.86 | 113.6 MB | ########## | 100%  2025-05-07T19:47:24.5306924Z 2025-05-07T19:47:24.5306930Z 2025-05-07T19:47:24.5306936Z 2025-05-07T19:47:24.5306941Z 2025-05-07T19:47:24.5306946Z 2025-05-07T19:47:24.5306950Z 2025-05-07T19:47:24.5306955Z 2025-05-07T19:47:24.5306959Z 2025-05-07T19:47:24.5306964Z 2025-05-07T19:47:24.5306969Z 2025-05-07T19:47:24.5306973Z 2025-05-07T19:47:24.5306977Z 
2025-05-07T19:47:25.0516210Z libcusolver-dev-11.4 | 66.3 MB | ########## | 100%  2025-05-07T19:47:25.0516580Z 2025-05-07T19:47:25.0516586Z 2025-05-07T19:47:25.0516594Z 2025-05-07T19:47:25.0516654Z 2025-05-07T19:47:25.0516659Z 2025-05-07T19:47:25.0516663Z 2025-05-07T19:47:25.0516669Z 2025-05-07T19:47:25.0516673Z 2025-05-07T19:47:25.0516680Z 2025-05-07T19:47:25.0516684Z 2025-05-07T19:47:25.0516688Z 2025-05-07T19:47:29.0222707Z libcusolver-11.4.1.4 | 96.5 MB | ########## | 100%  2025-05-07T19:47:29.0223158Z 2025-05-07T19:47:29.0223165Z 2025-05-07T19:47:29.0223170Z 2025-05-07T19:47:29.0223175Z 2025-05-07T19:47:29.0223222Z 2025-05-07T19:47:29.0223227Z 2025-05-07T19:47:29.0223232Z 2025-05-07T19:47:29.0223237Z 2025-05-07T19:47:29.0223241Z 2025-05-07T19:47:29.0223248Z 2025-05-07T19:47:29.0223253Z 2025-05-07T19:47:29.0223258Z 2025-05-07T19:47:29.0223262Z 2025-05-07T19:47:29.4856230Z libcurand-dev-10.3.0 | 53.7 MB | ########## | 100%  2025-05-07T19:47:29.4856672Z 2025-05-07T19:47:29.4856679Z 2025-05-07T19:47:29.4856685Z 2025-05-07T19:47:29.4856690Z 2025-05-07T19:47:29.4856694Z 2025-05-07T19:47:29.4856699Z 2025-05-07T19:47:29.4856745Z 2025-05-07T19:47:29.4856748Z 2025-05-07T19:47:29.4856752Z 2025-05-07T19:47:29.4856757Z 2025-05-07T19:47:29.4856761Z 2025-05-07T19:47:29.4856764Z 2025-05-07T19:47:29.4856767Z 2025-05-07T19:47:29.4856771Z 2025-05-07T19:47:32.2596151Z libcurand-10.3.0.86 | 53.2 MB | ########## | 100%  2025-05-07T19:47:32.2596543Z 2025-05-07T19:47:32.2596548Z 2025-05-07T19:47:32.2596553Z 2025-05-07T19:47:32.2596862Z 2025-05-07T19:47:32.2596867Z 2025-05-07T19:47:32.2596871Z 2025-05-07T19:47:32.2596875Z 2025-05-07T19:47:32.2596879Z 2025-05-07T19:47:32.2596883Z 2025-05-07T19:47:32.2596887Z 2025-05-07T19:47:32.2596894Z 2025-05-07T19:47:32.2596898Z 2025-05-07T19:47:32.2596919Z 2025-05-07T19:47:32.2596923Z 2025-05-07T19:47:32.2596927Z 2025-05-07T19:47:32.2596931Z 2025-05-07T19:47:34.4928960Z cuda-nvdisasm-11.8.8 | 48.7 MB | ########## | 100%  2025-05-07T19:47:34.4929402Z 
2025-05-07T19:47:37.3832519Z cuda-nvrtc-11.8.89 | 19.1 MB | ########## | 100% 2025-05-07T19:47:39.5960278Z cuda-cupti-11.8.87 | 25.3 MB | ########## | 100% 2025-05-07T19:47:43.9867163Z ... (more hidden) ...
2025-05-07T19:47:47.2707565Z libcufft-10.9.0.58 | 142.8 MB | ########## | 100% 2025-05-07T19:47:59.1034624Z cuda-nvcc-11.8.89 | 50.8 MB | ########## | 100% 2025-05-07T19:47:59.1050554Z nsight-compute-2022. | 610.0 MB | ########## | 100%
2025-05-07T19:47:59.1077418Z done 2025-05-07T19:47:59.2069698Z Preparing transaction: / done 2025-05-07T19:47:59.4080966Z Verifying transaction: \ | done 2025-05-07T19:47:59.6111467Z Executing transaction: - \ done 2025-05-07T19:48:01.6246913Z [INSTALL] Appending libcuda.so path to LD_LIBRARY_PATH ... 2025-05-07T19:48:01.6630754Z [ENV] Appending to LD_LIBRARY_PATH: /github/home/miniconda/envs/build_binary/lib/stubs ...
2025-05-07T19:48:03.5432037Z + conda env config vars set -n build_binary LD_LIBRARY_PATH=/github/home/miniconda/envs/build_binary/lib:/github/home/miniconda/envs/build_binary/lib/stubs 2025-05-07T19:48:03.5432879Z 2025-05-07T19:48:03.9533025Z 2025-05-07T19:48:03.9539124Z [INSTALL] Setting environment variable NVML_LIB_PATH ... 2025-05-07T19:48:03.9905910Z + conda env config vars set -n build_binary NVML_LIB_PATH=/github/home/miniconda/envs/build_binary/lib/stubs/libnvidia-ml.so 2025-05-07T19:48:03.9906795Z 2025-05-07T19:48:04.3986167Z 2025-05-07T19:48:04.3987154Z [INSTALL] Setting environment variable CUDA_INCLUDE_DIRS ... 2025-05-07T19:48:04.3990411Z + conda env config vars set -n build_binary CUDA_INCLUDE_DIRS="/github/home/miniconda/envs/build_binary/include/:/github/home/miniconda/envs/build_binary/targets/x86_64-linux/include/" 2025-05-07T19:48:04.3993016Z 2025-05-07T19:48:04.8090401Z 2025-05-07T19:48:06.7358056Z [CHECK] cuda_runtime.h found in CONDA_PREFIX PATH (file): /github/home/miniconda/envs/build_binary/include/cuda_runtime.h 2025-05-07T19:48:08.7010055Z [CHECK] libcuda.so found in CONDA_PREFIX PATH (file): /github/home/miniconda/envs/build_binary/lib/stubs/libcuda.so 2025-05-07T19:48:10.6678782Z [CHECK] libnvToolsExt.so found in CONDA_PREFIX PATH (symbolic link): /github/home/miniconda/envs/build_binary/lib/libnvToolsExt.so 2025-05-07T19:48:12.6378184Z [CHECK] libnvidia-ml.so found in CONDA_PREFIX PATH (file): /github/home/miniconda/envs/build_binary/lib/stubs/libnvidia-ml.so 2025-05-07T19:48:14.4733178Z /github/home/miniconda/envs/build_binary/bin/nvcc 2025-05-07T19:48:14.4734099Z 2025-05-07T19:48:14.5430120Z [CHECK] Binary nvcc found in PATH 2025-05-07T19:48:18.2869702Z clang version 16.0.6 (https://github.com/conda-forge/clangdev-feedstock db6970f6bb85e49860ed8bab43ebf165b5c55cc4) 2025-05-07T19:48:18.2870424Z Target: x86_64-conda-linux-gnu 2025-05-07T19:48:18.2870704Z Thread model: posix 2025-05-07T19:48:18.2871035Z InstalledDir: 
/github/home/miniconda/envs/build_binary/bin 2025-05-07T19:48:18.2871682Z Configuration file: /github/home/miniconda/envs/build_binary/bin/x86_64-conda-linux-gnu-clang.cfg 2025-05-07T19:48:18.2872146Z 2025-05-07T19:48:18.3430609Z [INSTALL] Resetting compiler symlinks to clang ... 2025-05-07T19:48:22.1142875Z + ln -sf /github/home/miniconda/envs/build_binary/bin/clang /github/home/miniconda/envs/build_binary/bin/cc 2025-05-07T19:48:22.1144436Z 2025-05-07T19:48:22.1165206Z 2025-05-07T19:48:22.1185915Z + ln -sf /github/home/miniconda/envs/build_binary/bin/clang /github/home/miniconda/envs/build_binary/bin/gcc 2025-05-07T19:48:22.1186607Z 2025-05-07T19:48:22.1203871Z 2025-05-07T19:48:22.1223403Z + ln -sf /github/home/miniconda/envs/build_binary/bin/clang++ /github/home/miniconda/envs/build_binary/bin/c++ 2025-05-07T19:48:22.1223937Z 2025-05-07T19:48:22.1236212Z 2025-05-07T19:48:22.1254816Z + ln -sf /github/home/miniconda/envs/build_binary/bin/clang++ /github/home/miniconda/envs/build_binary/bin/g++ 2025-05-07T19:48:22.1255429Z 2025-05-07T19:48:22.1275213Z 2025-05-07T19:48:22.1276181Z + ls -la /github/home/miniconda/envs/build_binary/etc/conda/activate.d 2025-05-07T19:48:22.1277192Z 2025-05-07T19:48:22.1296806Z total 36 2025-05-07T19:48:22.1297583Z drwxr-xr-x. 2 root root 188 May 7 19:45 . 2025-05-07T19:48:22.1298020Z drwxr-xr-x. 5 root root 62 May 7 19:44 .. 2025-05-07T19:48:22.1298523Z -rw-r--r--. 2 root root 3778 Jun 10 2024 activate-binutils_linux-64.sh 2025-05-07T19:48:22.1299061Z -rw-r--r--. 2 root root 11630 Jun 10 2024 activate-gcc_linux-64.sh 2025-05-07T19:48:22.1299901Z -rw-r--r--. 2 root root 5190 Jun 10 2024 activate-gxx_linux-64.sh 2025-05-07T19:48:22.1300374Z -rw-r--r--. 2 root root 136 Mar 27 01:27 libglib_activate.sh 2025-05-07T19:48:22.1300855Z -rw-r--r--. 2 root root 873 Jun 5 2024 libxml2_activate.sh 2025-05-07T19:48:22.1301343Z -rw-r--r--. 
2 root root 499 Nov 30 04:26 openjdk_activate.sh 2025-05-07T19:48:22.1301631Z 2025-05-07T19:48:22.1301809Z + conda run -n build_binary c++ --version | grep -i clang 2025-05-07T19:48:22.1302299Z 2025-05-07T19:48:24.0153770Z clang version 16.0.6 (https://github.com/conda-forge/clangdev-feedstock db6970f6bb85e49860ed8bab43ebf165b5c55cc4) 2025-05-07T19:48:24.0155931Z Configuration file: /github/home/miniconda/envs/build_binary/bin/x86_64-conda-linux-gnu-clang++.cfg 2025-05-07T19:48:24.0156419Z 2025-05-07T19:48:24.0156575Z [BUILD] Setting Clang as the NVCC host compiler: 2025-05-07T19:48:25.9018761Z [BUILD] Setting prepend flags for NVCC ... 2025-05-07T19:48:25.9019835Z + conda env config vars set -n build_binary NVCC_PREPEND_FLAGS="-allow-unsupported-compiler -Xcompiler -stdlib=libstdc++ -ccbin /github/home/miniconda/envs/build_binary/bin/c++" 2025-05-07T19:48:25.9020633Z 2025-05-07T19:48:26.3267095Z 2025-05-07T19:48:26.3267426Z + conda run -n build_binary printenv NVCC_PREPEND_FLAGS 2025-05-07T19:48:26.3267728Z 2025-05-07T19:48:28.1417613Z -allow-unsupported-compiler -Xcompiler -stdlib=libstdc++ -ccbin /github/home/miniconda/envs/build_binary/bin/c++ 2025-05-07T19:48:28.1418232Z 2025-05-07T19:48:28.2146570Z 2025-05-07T19:48:28.2147343Z [INFO] Printing out all preprocessor defines in nvcc ... 
2025-05-07T19:48:28.2147945Z + conda run -n build_binary nvcc --compiler-options -dM -E -x cu - < /dev/null 2025-05-07T19:48:28.2148340Z 2025-05-07T19:48:30.1111902Z #define ADJ_ESTERROR 0x0008 2025-05-07T19:48:30.1112384Z #define ADJ_FREQUENCY 0x0002 2025-05-07T19:48:30.1112816Z #define ADJ_MAXERROR 0x0004 2025-05-07T19:48:30.1113146Z #define ADJ_MICRO 0x1000 2025-05-07T19:48:30.1113478Z #define ADJ_NANO 0x2000 2025-05-07T19:48:30.1113776Z #define ADJ_OFFSET 0x0001 2025-05-07T19:48:30.1114110Z #define ADJ_OFFSET_SINGLESHOT 0x8001 2025-05-07T19:48:30.1114444Z #define ADJ_OFFSET_SS_READ 0xa001 2025-05-07T19:48:30.1114787Z #define ADJ_STATUS 0x0010 2025-05-07T19:48:30.1115064Z #define ADJ_TAI 0x0080 2025-05-07T19:48:30.1115356Z #define ADJ_TICK 0x4000 2025-05-07T19:48:30.1115630Z #define ADJ_TIMECONST 0x0020 2025-05-07T19:48:30.1115950Z #define AIO_PRIO_DELTA_MAX 20 2025-05-07T19:48:30.1116275Z #define BC_BASE_MAX _POSIX2_BC_BASE_MAX 2025-05-07T19:48:30.1116652Z #define BC_DIM_MAX _POSIX2_BC_DIM_MAX 2025-05-07T19:48:30.1116982Z #define BC_SCALE_MAX _POSIX2_BC_SCALE_MAX 2025-05-07T19:48:30.1117353Z #define BC_STRING_MAX _POSIX2_BC_STRING_MAX 2025-05-07T19:48:30.1117712Z #define BIG_ENDIAN __BIG_ENDIAN 2025-05-07T19:48:30.1118001Z #define BUFSIZ _IO_BUFSIZ 2025-05-07T19:48:30.1118302Z #define BYTE_ORDER __BYTE_ORDER 2025-05-07T19:48:30.1118598Z #define CHARCLASS_NAME_MAX 2048 2025-05-07T19:48:30.1118922Z #define CHAR_BIT __CHAR_BIT__ 2025-05-07T19:48:30.1119214Z #define CHAR_MAX __SCHAR_MAX__ 2025-05-07T19:48:30.1119525Z #define CHAR_MIN SCHAR_MIN 2025-05-07T19:48:30.1119802Z #define CLOCKS_PER_SEC 1000000l 2025-05-07T19:48:30.1120136Z #define CLOCK_BOOTTIME 7 2025-05-07T19:48:30.1120424Z #define CLOCK_BOOTTIME_ALARM 9 2025-05-07T19:48:30.1120758Z #define CLOCK_MONOTONIC 1 2025-05-07T19:48:30.1121079Z #define CLOCK_MONOTONIC_COARSE 6 2025-05-07T19:48:30.1121387Z #define CLOCK_MONOTONIC_RAW 4 2025-05-07T19:48:30.1121728Z #define CLOCK_PROCESS_CPUTIME_ID 2 
2025-05-07T19:48:30.1122046Z #define CLOCK_REALTIME 0 2025-05-07T19:48:30.1122358Z #define CLOCK_REALTIME_ALARM 8 2025-05-07T19:48:30.1123049Z #define CLOCK_REALTIME_COARSE 5 2025-05-07T19:48:30.1123342Z #define CLOCK_TAI 11 2025-05-07T19:48:30.1123652Z #define CLOCK_THREAD_CPUTIME_ID 3 2025-05-07T19:48:30.1123968Z #define COLL_WEIGHTS_MAX 255 2025-05-07T19:48:30.1124283Z #define CUDARTAPI 2025-05-07T19:48:30.1124565Z #define CUDARTAPI_CDECL 2025-05-07T19:48:30.1125123Z #define CUDART_CB 2025-05-07T19:48:30.1125422Z #define CUDART_DEVICE __device__ 2025-05-07T19:48:30.1125730Z #define CUDART_VERSION 11080 2025-05-07T19:48:30.1141119Z #define CUDA_DOUBLE_MATH_FUNCTIONS 1 2025-05-07T19:48:30.1141618Z #define CUDA_IPC_HANDLE_SIZE 64 2025-05-07T19:48:30.1142062Z #define CU_UUID_HAS_BEEN_DEFINED 2025-05-07T19:48:30.1142365Z #define DELAYTIMER_MAX 2147483647 2025-05-07T19:48:30.1142689Z #define DOMAIN 1 2025-05-07T19:48:30.1142934Z #define EOF (-1) 2025-05-07T19:48:30.1143228Z #define EXIT_FAILURE 1 2025-05-07T19:48:30.1143492Z #define EXIT_SUCCESS 0 2025-05-07T19:48:30.1143780Z #define EXPR_NEST_MAX _POSIX2_EXPR_NEST_MAX 2025-05-07T19:48:30.1144165Z #define FD_CLR(fd,fdsetp) __FD_CLR (fd, fdsetp) 2025-05-07T19:48:30.1144544Z #define FD_ISSET(fd,fdsetp) __FD_ISSET (fd, fdsetp) 2025-05-07T19:48:30.1144974Z #define FD_SET(fd,fdsetp) __FD_SET (fd, fdsetp) 2025-05-07T19:48:30.1145330Z #define FD_SETSIZE __FD_SETSIZE 2025-05-07T19:48:30.1145698Z #define FD_ZERO(fdsetp) __FD_ZERO (fdsetp) 2025-05-07T19:48:30.1146039Z #define FILENAME_MAX 4096 2025-05-07T19:48:30.1146343Z #define FOPEN_MAX 16 2025-05-07T19:48:30.1146599Z #define FP_ILOGB0 (-2147483647 - 1) 2025-05-07T19:48:30.1146911Z #define FP_ILOGBNAN (-2147483647 - 1) 2025-05-07T19:48:30.1147216Z #define FP_INFINITE 1 2025-05-07T19:48:30.1147454Z #define FP_NAN 0 2025-05-07T19:48:30.1147728Z #define FP_NORMAL 4 2025-05-07T19:48:30.1147983Z #define FP_SUBNORMAL 3 2025-05-07T19:48:30.1148278Z #define FP_ZERO 2 
2025-05-07T19:48:30.1148837Z #define HOST_NAME_MAX 64 2025-05-07T19:48:30.1149148Z #define HUGE 3.40282347e+38F 2025-05-07T19:48:30.1149436Z #define HUGE_VAL (__builtin_huge_val()) 2025-05-07T19:48:30.1149773Z #define HUGE_VALF (__builtin_huge_valf()) 2025-05-07T19:48:30.1150094Z #define HUGE_VALL (__builtin_huge_vall()) 2025-05-07T19:48:30.1150420Z #define INFINITY (__builtin_inff()) 2025-05-07T19:48:30.1150722Z #define INT_MAX __INT_MAX__ 2025-05-07T19:48:30.1151040Z #define INT_MIN (-__INT_MAX__ -1) 2025-05-07T19:48:30.1151325Z #define IOV_MAX 1024 2025-05-07T19:48:30.1151564Z #define LINE_MAX _POSIX2_LINE_MAX 2025-05-07T19:48:30.1151875Z #define LITTLE_ENDIAN __LITTLE_ENDIAN 2025-05-07T19:48:30.1152194Z #define LLONG_MAX __LONG_LONG_MAX__ 2025-05-07T19:48:30.1152547Z #define LLONG_MIN (-__LONG_LONG_MAX__-1LL) 2025-05-07T19:48:30.1152960Z #define LOGIN_NAME_MAX 256 2025-05-07T19:48:30.1153436Z #define LONG_BIT 64 2025-05-07T19:48:30.1153705Z #define LONG_LONG_MAX __LONG_LONG_MAX__ 2025-05-07T19:48:30.1154133Z #define LONG_LONG_MIN (-__LONG_LONG_MAX__-1LL) 2025-05-07T19:48:30.1154667Z #define LONG_MAX __LONG_MAX__ 2025-05-07T19:48:30.1154966Z #define LONG_MIN (-__LONG_MAX__ -1L) 2025-05-07T19:48:30.1155298Z #define L_ctermid 9 2025-05-07T19:48:30.1155531Z #define L_cuserid 9 2025-05-07T19:48:30.1155765Z #define L_tmpnam 20 2025-05-07T19:48:30.1156000Z #define MATH_ERREXCEPT 2 2025-05-07T19:48:30.1156268Z #define MATH_ERRNO 1 2025-05-07T19:48:30.1156497Z #define MAX_CANON 255 2025-05-07T19:48:30.1156746Z #define MAX_INPUT 255 2025-05-07T19:48:30.1157025Z #define MB_CUR_MAX (__ctype_get_mb_cur_max ()) 2025-05-07T19:48:30.1157364Z #define MB_LEN_MAX 16 2025-05-07T19:48:30.1157615Z #define MOD_CLKA ADJ_OFFSET_SINGLESHOT 2025-05-07T19:48:30.1157944Z #define MOD_CLKB ADJ_TICK 2025-05-07T19:48:30.1158220Z #define MOD_ESTERROR ADJ_ESTERROR 2025-05-07T19:48:30.1158531Z #define MOD_FREQUENCY ADJ_FREQUENCY 2025-05-07T19:48:30.1158881Z #define MOD_MAXERROR ADJ_MAXERROR 
2025-05-07T19:48:30.1159276Z #define MOD_MICRO ADJ_MICRO 2025-05-07T19:48:30.1159550Z #define MOD_NANO ADJ_NANO 2025-05-07T19:48:30.1159809Z #define MOD_OFFSET ADJ_OFFSET 2025-05-07T19:48:30.1160126Z #define MOD_STATUS ADJ_STATUS 2025-05-07T19:48:30.1160402Z #define MOD_TAI ADJ_TAI 2025-05-07T19:48:30.1160680Z #define MOD_TIMECONST ADJ_TIMECONST 2025-05-07T19:48:30.1160965Z #define MQ_PRIO_MAX 32768 2025-05-07T19:48:30.1161228Z #define M_1_PI 0.31830988618379067154 2025-05-07T19:48:30.1161570Z #define M_1_PIl 0.318309886183790671537767526745028724L 2025-05-07T19:48:30.1162003Z #define M_2_PI 0.63661977236758134308 2025-05-07T19:48:30.1162371Z #define M_2_PIl 0.636619772367581343075535053490057448L 2025-05-07T19:48:30.1162705Z #define M_2_SQRTPI 1.12837916709551257390 2025-05-07T19:48:30.1163077Z #define M_2_SQRTPIl 1.128379167095512573896158903121545172L 2025-05-07T19:48:30.1163460Z #define M_E 2.7182818284590452354 2025-05-07T19:48:30.1163808Z #define M_El 2.718281828459045235360287471352662498L 2025-05-07T19:48:30.1164147Z #define M_LN10 2.30258509299404568402 2025-05-07T19:48:30.1164482Z #define M_LN10l 2.302585092994045684017991454684364208L 2025-05-07T19:48:30.1164834Z #define M_LN2 0.69314718055994530942 2025-05-07T19:48:30.1165144Z #define M_LN2l 0.693147180559945309417232121458176568L 2025-05-07T19:48:30.1165485Z #define M_LOG10E 0.43429448190325182765 2025-05-07T19:48:30.1165812Z #define M_LOG10El 0.434294481903251827651128918916605082L 2025-05-07T19:48:30.1166184Z #define M_LOG2E 1.4426950408889634074 2025-05-07T19:48:30.1166526Z #define M_LOG2El 1.442695040888963407359924681001892137L 2025-05-07T19:48:30.1166912Z #define M_PI 3.14159265358979323846 2025-05-07T19:48:30.1167205Z #define M_PI_2 1.57079632679489661923 2025-05-07T19:48:30.1167539Z #define M_PI_2l 1.570796326794896619231321691639751442L 2025-05-07T19:48:30.1167885Z #define M_PI_4 0.78539816339744830962 2025-05-07T19:48:30.1168206Z #define M_PI_4l 0.785398163397448309615660845819875721L 
2025-05-07T19:48:30.1168573Z #define M_PIl 3.141592653589793238462643383279502884L 2025-05-07T19:48:30.1168992Z #define M_SQRT1_2 0.70710678118654752440 2025-05-07T19:48:30.1169382Z #define M_SQRT1_2l 0.707106781186547524400844362104849039L 2025-05-07T19:48:30.1169738Z #define M_SQRT2 1.41421356237309504880 2025-05-07T19:48:30.1170096Z #define M_SQRT2l 1.414213562373095048801688724209698079L 2025-05-07T19:48:30.1170427Z #define NAME_MAX 255 2025-05-07T19:48:30.1170682Z #define NAN (__builtin_nanf ("")) 2025-05-07T19:48:30.1170983Z #define NFDBITS __NFDBITS 2025-05-07T19:48:30.1171242Z #define NGROUPS_MAX 65536 2025-05-07T19:48:30.1171548Z #define NL_ARGMAX _POSIX_ARG_MAX 2025-05-07T19:48:30.1171852Z #define NL_LANGMAX _POSIX2_LINE_MAX 2025-05-07T19:48:30.1172184Z #define NL_MSGMAX INT_MAX 2025-05-07T19:48:30.1172453Z #define NL_NMAX INT_MAX 2025-05-07T19:48:30.1172739Z #define NL_SETMAX INT_MAX 2025-05-07T19:48:30.1173006Z #define NL_TEXTMAX INT_MAX 2025-05-07T19:48:30.1173300Z #define NULL __null 2025-05-07T19:48:30.1173538Z #define NZERO 20 2025-05-07T19:48:30.1173803Z #define OVERFLOW 3 2025-05-07T19:48:30.1174076Z #define PATH_MAX 4096 2025-05-07T19:48:30.1174352Z #define PDP_ENDIAN __PDP_ENDIAN 2025-05-07T19:48:30.1174671Z #define PIPE_BUF 4096 2025-05-07T19:48:30.1174921Z #define PLOSS 6 2025-05-07T19:48:30.1175332Z #define PTHREAD_DESTRUCTOR_ITERATIONS _POSIX_THREAD_DESTRUCTOR_ITERATIONS 2025-05-07T19:48:30.1175796Z #define PTHREAD_KEYS_MAX 1024 2025-05-07T19:48:30.1176115Z #define PTHREAD_STACK_MIN 16384 2025-05-07T19:48:30.1176413Z #define P_tmpdir "/tmp" 2025-05-07T19:48:30.1176710Z #define RAND_MAX 2147483647 2025-05-07T19:48:30.1176990Z #define RE_DUP_MAX (0x7fff) 2025-05-07T19:48:30.1177291Z #define RTSIG_MAX 32 2025-05-07T19:48:30.1177551Z #define SCHAR_MAX __SCHAR_MAX__ 2025-05-07T19:48:30.1177885Z #define SCHAR_MIN (-__SCHAR_MAX__-1) 2025-05-07T19:48:30.1178218Z #define SEEK_CUR 1 2025-05-07T19:48:30.1178458Z #define SEEK_DATA 3 
2025-05-07T19:48:30.1178726Z #define SEEK_END 2 2025-05-07T19:48:30.1178968Z #define SEEK_HOLE 4 2025-05-07T19:48:30.1179237Z #define SEEK_SET 0 2025-05-07T19:48:30.1179486Z #define SEM_VALUE_MAX (2147483647) 2025-05-07T19:48:30.1179824Z #define SHRT_MAX __SHRT_MAX__ 2025-05-07T19:48:30.1180119Z #define SHRT_MIN (-__SHRT_MAX__ -1) 2025-05-07T19:48:30.1180443Z #define SING 2 2025-05-07T19:48:30.1180681Z #define SSIZE_MAX LONG_MAX 2025-05-07T19:48:30.1180984Z #define STA_CLK 0x8000 2025-05-07T19:48:30.1181276Z #define STA_CLOCKERR 0x1000 2025-05-07T19:48:30.1181546Z #define STA_DEL 0x0020 2025-05-07T19:48:30.1181819Z #define STA_FLL 0x0008 2025-05-07T19:48:30.1182158Z #define STA_FREQHOLD 0x0080 2025-05-07T19:48:30.1182458Z #define STA_INS 0x0010 2025-05-07T19:48:30.1182710Z #define STA_MODE 0x4000 2025-05-07T19:48:30.1182987Z #define STA_NANO 0x2000 2025-05-07T19:48:30.1183247Z #define STA_PLL 0x0001 2025-05-07T19:48:30.1183535Z #define STA_PPSERROR 0x0800 2025-05-07T19:48:30.1183814Z #define STA_PPSFREQ 0x0002 2025-05-07T19:48:30.1184125Z #define STA_PPSJITTER 0x0200 2025-05-07T19:48:30.1184413Z #define STA_PPSSIGNAL 0x0100 2025-05-07T19:48:30.1184726Z #define STA_PPSTIME 0x0004 2025-05-07T19:48:30.1185039Z #define STA_PPSWANDER 0x0400 2025-05-07T19:48:30.1185626Z #define STA_RONLY (STA_PPSSIGNAL | STA_PPSJITTER | STA_PPSWANDER | STA_PPSERROR | STA_CLOCKERR | STA_NANO | STA_MODE | STA_CLK) 2025-05-07T19:48:30.1186272Z #define STA_UNSYNC 0x0040 2025-05-07T19:48:30.1186549Z #define TIMER_ABSTIME 1 2025-05-07T19:48:30.1186835Z #define TIME_UTC 1 2025-05-07T19:48:30.1187075Z #define TLOSS 5 2025-05-07T19:48:30.1187340Z #define TMP_MAX 238328 2025-05-07T19:48:30.1187590Z #define TTY_NAME_MAX 32 2025-05-07T19:48:30.1187890Z #define UCHAR_MAX (__SCHAR_MAX__*2 +1) 2025-05-07T19:48:30.1188206Z #define UINT_MAX (__INT_MAX__ *2U +1U) 2025-05-07T19:48:30.1188572Z #define ULLONG_MAX (__LONG_LONG_MAX__*2ULL+1ULL) 2025-05-07T19:48:30.1188978Z #define ULONG_LONG_MAX 
(__LONG_LONG_MAX__*2ULL+1ULL) 2025-05-07T19:48:30.1189346Z #define ULONG_MAX (__LONG_MAX__ *2UL+1UL) 2025-05-07T19:48:30.1189684Z #define UNDERFLOW 4 2025-05-07T19:48:30.1189944Z #define USHRT_MAX (__SHRT_MAX__ *2 +1) 2025-05-07T19:48:30.1190275Z #define WCONTINUED 8 2025-05-07T19:48:30.1190621Z #define WEXITED 4 2025-05-07T19:48:30.1190992Z #define WEXITSTATUS(status) __WEXITSTATUS (__WAIT_INT (status)) 2025-05-07T19:48:30.1191506Z #define WIFCONTINUED(status) __WIFCONTINUED (__WAIT_INT (status)) 2025-05-07T19:48:30.1192015Z #define WIFEXITED(status) __WIFEXITED (__WAIT_INT (status)) 2025-05-07T19:48:30.1192515Z #define WIFSIGNALED(status) __WIFSIGNALED (__WAIT_INT (status)) 2025-05-07T19:48:30.1193223Z #define WIFSTOPPED(status) __WIFSTOPPED (__WAIT_INT (status)) 2025-05-07T19:48:30.1193672Z #define WNOHANG 1 2025-05-07T19:48:30.1193985Z #define WNOWAIT 0x01000000 2025-05-07T19:48:30.1194286Z #define WORD_BIT 32 2025-05-07T19:48:30.1194552Z #define WSTOPPED 2 2025-05-07T19:48:30.1194869Z #define WSTOPSIG(status) __WSTOPSIG (__WAIT_INT (status)) 2025-05-07T19:48:30.1195345Z #define WTERMSIG(status) __WTERMSIG (__WAIT_INT (status)) 2025-05-07T19:48:30.1195722Z #define WUNTRACED 2 2025-05-07T19:48:30.1196002Z #define XATTR_LIST_MAX 65536 2025-05-07T19:48:30.1196279Z #define XATTR_NAME_MAX 255 2025-05-07T19:48:30.1196589Z #define XATTR_SIZE_MAX 65536 2025-05-07T19:48:30.1196885Z #define X_TLOSS 1.41484755040568800000e+16 2025-05-07T19:48:30.1197219Z #define _ACRTIMP 2025-05-07T19:48:30.1197443Z #define _ALLOCA_H 1 2025-05-07T19:48:30.1197712Z #define _ASSERT_H 1 2025-05-07T19:48:30.1197974Z #define _ATFILE_SOURCE 1 2025-05-07T19:48:30.1198237Z #define _BITS_BYTESWAP_H 1 2025-05-07T19:48:30.1198535Z #define _BITS_POSIX1_LIM_H 1 2025-05-07T19:48:30.1198803Z #define _BITS_POSIX2_LIM_H 1 2025-05-07T19:48:30.1199097Z #define _BITS_PTHREADTYPES_H 1 2025-05-07T19:48:30.1199378Z #define _BITS_TIMEX_H 1 2025-05-07T19:48:30.1199636Z #define _BITS_TIME_H 1 
2025-05-07T19:48:30.1199900Z #define _BITS_TYPESIZES_H 1 2025-05-07T19:48:30.1200176Z #define _BITS_TYPES_H 1 2025-05-07T19:48:30.1200424Z #define _BSD_SOURCE 1 2025-05-07T19:48:30.1200712Z #define _CONCEPT_CHECK_H 1 2025-05-07T19:48:30.1200980Z #define _CPP_TYPE_TRAITS_H 1 2025-05-07T19:48:30.1201255Z #define _CRTIMP 2025-05-07T19:48:30.1201490Z #define _ENDIAN_H 1 2025-05-07T19:48:30.1201730Z #define _EXCEPTION_DEFINES_H 1 2025-05-07T19:48:30.1202219Z #define _EXT_NUMERIC_TRAITS 1 2025-05-07T19:48:30.1202485Z #define _EXT_TYPE_TRAITS 1 2025-05-07T19:48:30.1202749Z #define _FEATURES_H 1 2025-05-07T19:48:30.1202998Z #define _FUNCTEXCEPT_H 1 2025-05-07T19:48:30.1203282Z #define _GCC_LIMITS_H_ 2025-05-07T19:48:30.1203571Z #define _GLIBCXX11_DEPRECATED _GLIBCXX_DEPRECATED 2025-05-07T19:48:30.1204222Z #define _GLIBCXX11_DEPRECATED_SUGGEST(ALT) _GLIBCXX_DEPRECATED_SUGGEST(ALT) 2025-05-07T19:48:30.1204693Z #define _GLIBCXX11_USE_C99_COMPLEX 1 2025-05-07T19:48:30.1205030Z #define _GLIBCXX11_USE_C99_MATH 1 2025-05-07T19:48:30.1205369Z #define _GLIBCXX11_USE_C99_STDIO 1 2025-05-07T19:48:30.1205672Z #define _GLIBCXX11_USE_C99_STDLIB 1 2025-05-07T19:48:30.1206005Z #define _GLIBCXX11_USE_C99_WCHAR 1 2025-05-07T19:48:30.1206306Z #define _GLIBCXX14_CONSTEXPR constexpr 2025-05-07T19:48:30.1206645Z #define _GLIBCXX17_CONSTEXPR constexpr 2025-05-07T19:48:30.1206994Z #define _GLIBCXX17_DEPRECATED [[__deprecated__]] 2025-05-07T19:48:30.1207490Z #define _GLIBCXX17_DEPRECATED_SUGGEST(ALT) _GLIBCXX_DEPRECATED_SUGGEST(ALT) 2025-05-07T19:48:30.1207935Z #define _GLIBCXX17_INLINE inline 2025-05-07T19:48:30.1208241Z #define _GLIBCXX20_CONSTEXPR 2025-05-07T19:48:30.1208532Z #define _GLIBCXX20_DEPRECATED(MSG) 2025-05-07T19:48:30.1208845Z #define _GLIBCXX20_DEPRECATED_SUGGEST(ALT) 2025-05-07T19:48:30.1209192Z #define _GLIBCXX98_USE_C99_COMPLEX 1 2025-05-07T19:48:30.1209492Z #define _GLIBCXX98_USE_C99_MATH 1 2025-05-07T19:48:30.1209788Z #define _GLIBCXX98_USE_C99_STDIO 1 
2025-05-07T19:48:30.1210077Z #define _GLIBCXX98_USE_C99_STDLIB 1 2025-05-07T19:48:30.1210392Z #define _GLIBCXX98_USE_C99_WCHAR 1 2025-05-07T19:48:30.1210772Z #define _GLIBCXX_ABI_TAG_CXX11 __attribute ((__abi_tag__ ("cxx11"))) 2025-05-07T19:48:30.1211192Z #define _GLIBCXX_ATOMIC_BUILTINS 1 2025-05-07T19:48:30.1211522Z #define _GLIBCXX_BEGIN_EXTERN_C extern "C" { 2025-05-07T19:48:30.1211947Z #define _GLIBCXX_BEGIN_NAMESPACE_ALGO 2025-05-07T19:48:30.1212305Z #define _GLIBCXX_BEGIN_NAMESPACE_CONTAINER 2025-05-07T19:48:30.1212703Z #define _GLIBCXX_BEGIN_NAMESPACE_CXX11 namespace __cxx11 { 2025-05-07T19:48:30.1213116Z #define _GLIBCXX_BEGIN_NAMESPACE_LDBL 2025-05-07T19:48:30.1213568Z #define _GLIBCXX_BEGIN_NAMESPACE_LDBL_OR_CXX11 _GLIBCXX_BEGIN_NAMESPACE_CXX11 2025-05-07T19:48:30.1214154Z #define _GLIBCXX_BEGIN_NAMESPACE_VERSION 2025-05-07T19:48:30.1214455Z #define _GLIBCXX_BITS_SPECFUN_H 1 2025-05-07T19:48:30.1214736Z #define _GLIBCXX_BITS_STD_ABS_H 2025-05-07T19:48:30.1215003Z #define _GLIBCXX_CMATH 1 2025-05-07T19:48:30.1215270Z #define _GLIBCXX_CONST __attribute__ ((__const__)) 2025-05-07T19:48:30.1215603Z #define _GLIBCXX_CONSTEXPR constexpr 2025-05-07T19:48:30.1215878Z #define _GLIBCXX_CPU_DEFINES 1 2025-05-07T19:48:30.1216143Z #define _GLIBCXX_CSTDLIB 1 2025-05-07T19:48:30.1216383Z #define _GLIBCXX_CXX_CONFIG_H 1 2025-05-07T19:48:30.1216670Z #define _GLIBCXX_DARWIN_USE_64_BIT_INODE 1 2025-05-07T19:48:30.1216981Z #define _GLIBCXX_DEBUG_ASSERT(_Condition) 2025-05-07T19:48:30.1217292Z #define _GLIBCXX_DEBUG_ASSERTIONS_H 1 2025-05-07T19:48:30.1217575Z #define _GLIBCXX_DEBUG_MACRO_SWITCH_H 1 2025-05-07T19:48:30.1217879Z #define _GLIBCXX_DEBUG_ONLY(_Statement) 2025-05-07T19:48:30.1218200Z #define _GLIBCXX_DEBUG_PEDASSERT(_Condition) 2025-05-07T19:48:30.1218562Z #define _GLIBCXX_DEFAULT_ABI_TAG _GLIBCXX_ABI_TAG_CXX11 2025-05-07T19:48:30.1218988Z #define _GLIBCXX_DEPRECATED __attribute__ ((__deprecated__)) 2025-05-07T19:48:30.1219535Z #define 
_GLIBCXX_DEPRECATED_SUGGEST(ALT) __attribute__ ((__deprecated__ ("use '" ALT "' instead"))) 2025-05-07T19:48:30.1220049Z #define _GLIBCXX_DOUBLE_IS_IEEE_BINARY64 1 2025-05-07T19:48:30.1220338Z #define _GLIBCXX_END_EXTERN_C } 2025-05-07T19:48:30.1220614Z #define _GLIBCXX_END_NAMESPACE_ALGO 2025-05-07T19:48:30.1220932Z #define _GLIBCXX_END_NAMESPACE_CONTAINER 2025-05-07T19:48:30.1221229Z #define _GLIBCXX_END_NAMESPACE_CXX11 } 2025-05-07T19:48:30.1221531Z #define _GLIBCXX_END_NAMESPACE_LDBL 2025-05-07T19:48:30.1221914Z #define _GLIBCXX_END_NAMESPACE_LDBL_OR_CXX11 _GLIBCXX_END_NAMESPACE_CXX11 2025-05-07T19:48:30.1222332Z #define _GLIBCXX_END_NAMESPACE_VERSION 2025-05-07T19:48:30.1222620Z #define _GLIBCXX_EXTERN_TEMPLATE 1 2025-05-07T19:48:30.1222920Z #define _GLIBCXX_FAST_MATH 0 2025-05-07T19:48:30.1223189Z #define _GLIBCXX_FLOAT_IS_IEEE_BINARY32 1 2025-05-07T19:48:30.1223808Z #define _GLIBCXX_FORWARD(_Tp,__val) std::forward<_Tp>(__val) 2025-05-07T19:48:30.1224275Z #define _GLIBCXX_FULLY_DYNAMIC_STRING 0 2025-05-07T19:48:30.1224625Z #define _GLIBCXX_FWDREF(_Tp) _Tp&& 2025-05-07T19:48:30.1224962Z #define _GLIBCXX_HAS_GTHREADS 1 2025-05-07T19:48:30.1225876Z #define _GLIBCXX_HAS_NESTED_TYPE(_NTYPE) template<typename _Tp, typename = __void_t<>> struct __has_##_NTYPE : false_type { }; template<typename _Tp> struct __has_##_NTYPE<_Tp, __void_t<typename _Tp::_NTYPE>> : true_type { }; 2025-05-07T19:48:30.1226819Z #define _GLIBCXX_HAVE_ACOSF 1 2025-05-07T19:48:30.1227094Z #define _GLIBCXX_HAVE_ACOSL 1 2025-05-07T19:48:30.1227393Z #define _GLIBCXX_HAVE_ALIGNED_ALLOC 1 2025-05-07T19:48:30.1227708Z #define _GLIBCXX_HAVE_ARPA_INET_H 1 2025-05-07T19:48:30.1228005Z #define _GLIBCXX_HAVE_ASINF 1 2025-05-07T19:48:30.1228283Z #define _GLIBCXX_HAVE_ASINL 1 2025-05-07T19:48:30.1228564Z #define _GLIBCXX_HAVE_AS_SYMVER_DIRECTIVE 1 2025-05-07T19:48:30.1228887Z #define _GLIBCXX_HAVE_ATAN2F 1 2025-05-07T19:48:30.1229150Z #define _GLIBCXX_HAVE_ATAN2L 1 2025-05-07T19:48:30.1229437Z #define _GLIBCXX_HAVE_ATANF 1 2025-05-07T19:48:30.1229720Z #define
_GLIBCXX_HAVE_ATANL 1 2025-05-07T19:48:30.1230032Z #define _GLIBCXX_HAVE_ATOMIC_LOCK_POLICY 1 2025-05-07T19:48:30.1230373Z #define _GLIBCXX_HAVE_ATTRIBUTE_VISIBILITY 1 2025-05-07T19:48:30.1230691Z #define _GLIBCXX_HAVE_AT_QUICK_EXIT 1 2025-05-07T19:48:30.1231049Z #define _GLIBCXX_HAVE_BUILTIN_HAS_UNIQ_OBJ_REP 1 2025-05-07T19:48:30.1231395Z #define _GLIBCXX_HAVE_BUILTIN_IS_AGGREGATE 1 2025-05-07T19:48:30.1231773Z #define _GLIBCXX_HAVE_BUILTIN_IS_CONSTANT_EVALUATED 1 2025-05-07T19:48:30.1232185Z #define _GLIBCXX_HAVE_BUILTIN_IS_SAME 1 2025-05-07T19:48:30.1232504Z #define _GLIBCXX_HAVE_BUILTIN_LAUNDER 1 2025-05-07T19:48:30.1232917Z #define _GLIBCXX_HAVE_CEILF 1 2025-05-07T19:48:30.1233371Z #define _GLIBCXX_HAVE_CEILL 1 2025-05-07T19:48:30.1233652Z #define _GLIBCXX_HAVE_COMPLEX_H 1 2025-05-07T19:48:30.1234026Z #define _GLIBCXX_HAVE_COSF 1 2025-05-07T19:48:30.1234323Z #define _GLIBCXX_HAVE_COSHF 1 2025-05-07T19:48:30.1234614Z #define _GLIBCXX_HAVE_COSHL 1 2025-05-07T19:48:30.1234909Z #define _GLIBCXX_HAVE_COSL 1 2025-05-07T19:48:30.1235198Z #define _GLIBCXX_HAVE_DIRENT_H 1 2025-05-07T19:48:30.1235507Z #define _GLIBCXX_HAVE_DLFCN_H 1 2025-05-07T19:48:30.1235781Z #define _GLIBCXX_HAVE_ENDIAN_H 1 2025-05-07T19:48:30.1236107Z #define _GLIBCXX_HAVE_EXCEPTION_PTR_SINCE_GCC46 1 2025-05-07T19:48:30.1236451Z #define _GLIBCXX_HAVE_EXECINFO_H 1 2025-05-07T19:48:30.1236750Z #define _GLIBCXX_HAVE_EXPF 1 2025-05-07T19:48:30.1237023Z #define _GLIBCXX_HAVE_EXPL 1 2025-05-07T19:48:30.1237304Z #define _GLIBCXX_HAVE_FABSF 1 2025-05-07T19:48:30.1237580Z #define _GLIBCXX_HAVE_FABSL 1 2025-05-07T19:48:30.1237866Z #define _GLIBCXX_HAVE_FCNTL_H 1 2025-05-07T19:48:30.1238183Z #define _GLIBCXX_HAVE_FENV_H 1 2025-05-07T19:48:30.1238463Z #define _GLIBCXX_HAVE_FINITE 1 2025-05-07T19:48:30.1238743Z #define _GLIBCXX_HAVE_FINITEF 1 2025-05-07T19:48:30.1239021Z #define _GLIBCXX_HAVE_FINITEL 1 2025-05-07T19:48:30.1239316Z #define _GLIBCXX_HAVE_FLOAT_H 1 2025-05-07T19:48:30.1239601Z #define 
_GLIBCXX_HAVE_FLOORF 1 2025-05-07T19:48:30.1239879Z #define _GLIBCXX_HAVE_FLOORL 1 2025-05-07T19:48:30.1240146Z #define _GLIBCXX_HAVE_FMODF 1 2025-05-07T19:48:30.1240442Z #define _GLIBCXX_HAVE_FMODL 1 2025-05-07T19:48:30.1240730Z #define _GLIBCXX_HAVE_FREXPF 1 2025-05-07T19:48:30.1241004Z #define _GLIBCXX_HAVE_FREXPL 1 2025-05-07T19:48:30.1241292Z #define _GLIBCXX_HAVE_GETIPINFO 1 2025-05-07T19:48:30.1241572Z #define _GLIBCXX_HAVE_GETS 1 2025-05-07T19:48:30.1241847Z #define _GLIBCXX_HAVE_HYPOT 1 2025-05-07T19:48:30.1242138Z #define _GLIBCXX_HAVE_HYPOTF 1 2025-05-07T19:48:30.1242443Z #define _GLIBCXX_HAVE_HYPOTL 1 2025-05-07T19:48:30.1242708Z #define _GLIBCXX_HAVE_ICONV 1 2025-05-07T19:48:30.1243002Z #define _GLIBCXX_HAVE_INT64_T 1 2025-05-07T19:48:30.1243298Z #define _GLIBCXX_HAVE_INT64_T_LONG 1 2025-05-07T19:48:30.1243617Z #define _GLIBCXX_HAVE_INTTYPES_H 1 2025-05-07T19:48:30.1243919Z #define _GLIBCXX_HAVE_ISINF 1 2025-05-07T19:48:30.1244207Z #define _GLIBCXX_HAVE_ISINFF 1 2025-05-07T19:48:30.1244616Z #define _GLIBCXX_HAVE_ISINFL 1 2025-05-07T19:48:30.1244916Z #define _GLIBCXX_HAVE_ISNAN 1 2025-05-07T19:48:30.1245349Z #define _GLIBCXX_HAVE_ISNANF 1 2025-05-07T19:48:30.1245642Z #define _GLIBCXX_HAVE_ISNANL 1 2025-05-07T19:48:30.1245967Z #define _GLIBCXX_HAVE_ISWBLANK 1 2025-05-07T19:48:30.1246246Z #define _GLIBCXX_HAVE_LC_MESSAGES 1 2025-05-07T19:48:30.1246545Z #define _GLIBCXX_HAVE_LDEXPF 1 2025-05-07T19:48:30.1246818Z #define _GLIBCXX_HAVE_LDEXPL 1 2025-05-07T19:48:30.1247120Z #define _GLIBCXX_HAVE_LIMIT_AS 1 2025-05-07T19:48:30.1247423Z #define _GLIBCXX_HAVE_LIMIT_DATA 1 2025-05-07T19:48:30.1247721Z #define _GLIBCXX_HAVE_LIMIT_FSIZE 1 2025-05-07T19:48:30.1248056Z #define _GLIBCXX_HAVE_LIMIT_RSS 1 2025-05-07T19:48:30.1248348Z #define _GLIBCXX_HAVE_LIMIT_VMEM 0 2025-05-07T19:48:30.1248655Z #define _GLIBCXX_HAVE_LINK 1 2025-05-07T19:48:30.1248919Z #define _GLIBCXX_HAVE_LINUX_FUTEX 1 2025-05-07T19:48:30.1249224Z #define _GLIBCXX_HAVE_LINUX_RANDOM_H 1 
2025-05-07T19:48:30.1249543Z #define _GLIBCXX_HAVE_LINUX_TYPES_H 1 2025-05-07T19:48:30.1249858Z #define _GLIBCXX_HAVE_LOCALE_H 1 2025-05-07T19:48:30.1250154Z #define _GLIBCXX_HAVE_LOG10F 1 2025-05-07T19:48:30.1250419Z #define _GLIBCXX_HAVE_LOG10L 1 2025-05-07T19:48:30.1250704Z #define _GLIBCXX_HAVE_LOGF 1 2025-05-07T19:48:30.1250959Z #define _GLIBCXX_HAVE_LOGL 1 2025-05-07T19:48:30.1251263Z #define _GLIBCXX_HAVE_MBSTATE_T 1 2025-05-07T19:48:30.1251538Z #define _GLIBCXX_HAVE_MEMALIGN 1 2025-05-07T19:48:30.1251850Z #define _GLIBCXX_HAVE_MEMORY_H 1 2025-05-07T19:48:30.1252188Z #define _GLIBCXX_HAVE_MODF 1 2025-05-07T19:48:30.1252462Z #define _GLIBCXX_HAVE_MODFF 1 2025-05-07T19:48:30.1252721Z #define _GLIBCXX_HAVE_MODFL 1 2025-05-07T19:48:30.1253162Z #define _GLIBCXX_HAVE_NETDB_H 1 2025-05-07T19:48:30.1253484Z #define _GLIBCXX_HAVE_NETINET_IN_H 1 2025-05-07T19:48:30.1253802Z #define _GLIBCXX_HAVE_NETINET_TCP_H 1 2025-05-07T19:48:30.1254150Z #define _GLIBCXX_HAVE_OBSOLETE_ISINF 1 2025-05-07T19:48:30.1254468Z #define _GLIBCXX_HAVE_OBSOLETE_ISNAN 1 2025-05-07T19:48:30.1254809Z #define _GLIBCXX_HAVE_POLL 1 2025-05-07T19:48:30.1255119Z #define _GLIBCXX_HAVE_POLL_H 1 2025-05-07T19:48:30.1255419Z #define _GLIBCXX_HAVE_POSIX_MEMALIGN 1 2025-05-07T19:48:30.1255765Z #define _GLIBCXX_HAVE_POSIX_SEMAPHORE 1 2025-05-07T19:48:30.1256080Z #define _GLIBCXX_HAVE_POWF 1 2025-05-07T19:48:30.1256389Z #define _GLIBCXX_HAVE_POWL 1 2025-05-07T19:48:30.1256677Z #define _GLIBCXX_HAVE_QUICK_EXIT 1 2025-05-07T19:48:30.1257007Z #define _GLIBCXX_HAVE_READLINK 1 2025-05-07T19:48:30.1257404Z #define _GLIBCXX_HAVE_SETENV 1 2025-05-07T19:48:30.1257707Z #define _GLIBCXX_HAVE_SINCOS 1 2025-05-07T19:48:30.1257983Z #define _GLIBCXX_HAVE_SINCOSF 1 2025-05-07T19:48:30.1258272Z #define _GLIBCXX_HAVE_SINCOSL 1 2025-05-07T19:48:30.1258569Z #define _GLIBCXX_HAVE_SINF 1 2025-05-07T19:48:30.1258838Z #define _GLIBCXX_HAVE_SINHF 1 2025-05-07T19:48:30.1259132Z #define _GLIBCXX_HAVE_SINHL 1 
2025-05-07T19:48:30.1259403Z #define _GLIBCXX_HAVE_SINL 1 2025-05-07T19:48:30.1259696Z #define _GLIBCXX_HAVE_SOCKATMARK 1 2025-05-07T19:48:30.1259985Z #define _GLIBCXX_HAVE_SQRTF 1 2025-05-07T19:48:30.1260281Z #define _GLIBCXX_HAVE_SQRTL 1 2025-05-07T19:48:30.1260557Z #define _GLIBCXX_HAVE_STDALIGN_H 1 2025-05-07T19:48:30.1260867Z #define _GLIBCXX_HAVE_STDBOOL_H 1 2025-05-07T19:48:30.1261152Z #define _GLIBCXX_HAVE_STDINT_H 1 2025-05-07T19:48:30.1261455Z #define _GLIBCXX_HAVE_STDLIB_H 1 2025-05-07T19:48:30.1261760Z #define _GLIBCXX_HAVE_STRERROR_L 1 2025-05-07T19:48:30.1262048Z #define _GLIBCXX_HAVE_STRERROR_R 1 2025-05-07T19:48:30.1262364Z #define _GLIBCXX_HAVE_STRINGS_H 1 2025-05-07T19:48:30.1262649Z #define _GLIBCXX_HAVE_STRING_H 1 2025-05-07T19:48:30.1262956Z #define _GLIBCXX_HAVE_STRTOF 1 2025-05-07T19:48:30.1263232Z #define _GLIBCXX_HAVE_STRTOLD 1 2025-05-07T19:48:30.1263556Z #define _GLIBCXX_HAVE_STRUCT_DIRENT_D_TYPE 1 2025-05-07T19:48:30.1263876Z #define _GLIBCXX_HAVE_STRXFRM_L 1 2025-05-07T19:48:30.1264181Z #define _GLIBCXX_HAVE_SYMLINK 1 2025-05-07T19:48:30.1264527Z #define _GLIBCXX_HAVE_SYMVER_SYMBOL_RENAMING_RUNTIME_SUPPORT 1 2025-05-07T19:48:30.1266885Z #define _GLIBCXX_HAVE_SYS_IOCTL_H 1 2025-05-07T19:48:30.1267227Z #define _GLIBCXX_HAVE_SYS_IPC_H 1 2025-05-07T19:48:30.1267519Z #define _GLIBCXX_HAVE_SYS_PARAM_H 1 2025-05-07T19:48:30.1267854Z #define _GLIBCXX_HAVE_SYS_RESOURCE_H 1 2025-05-07T19:48:30.1268194Z #define _GLIBCXX_HAVE_SYS_SEM_H 1 2025-05-07T19:48:30.1268494Z #define _GLIBCXX_HAVE_SYS_SOCKET_H 1 2025-05-07T19:48:30.1268831Z #define _GLIBCXX_HAVE_SYS_STATVFS_H 1 2025-05-07T19:48:30.1269138Z #define _GLIBCXX_HAVE_SYS_STAT_H 1 2025-05-07T19:48:30.1269472Z #define _GLIBCXX_HAVE_SYS_SYSINFO_H 1 2025-05-07T19:48:30.1269779Z #define _GLIBCXX_HAVE_SYS_TIME_H 1 2025-05-07T19:48:30.1270118Z #define _GLIBCXX_HAVE_SYS_TYPES_H 1 2025-05-07T19:48:30.1270422Z #define _GLIBCXX_HAVE_SYS_UIO_H 1 2025-05-07T19:48:30.1270742Z #define _GLIBCXX_HAVE_S_ISREG 1 
2025-05-07T19:48:30.1271027Z #define _GLIBCXX_HAVE_TANF 1 2025-05-07T19:48:30.1271334Z #define _GLIBCXX_HAVE_TANHF 1 2025-05-07T19:48:30.1271640Z #define _GLIBCXX_HAVE_TANHL 1 2025-05-07T19:48:30.1271927Z #define _GLIBCXX_HAVE_TANL 1 2025-05-07T19:48:30.1272240Z #define _GLIBCXX_HAVE_TGMATH_H 1 2025-05-07T19:48:30.1272527Z #define _GLIBCXX_HAVE_TLS 1 2025-05-07T19:48:30.1272912Z #define _GLIBCXX_HAVE_TRUNCATE 1 2025-05-07T19:48:30.1273380Z #define _GLIBCXX_HAVE_UNISTD_H 1 2025-05-07T19:48:30.1273713Z #define _GLIBCXX_HAVE_USELOCALE 1 2025-05-07T19:48:30.1274063Z #define _GLIBCXX_HAVE_UTIME_H 1 2025-05-07T19:48:30.1274386Z #define _GLIBCXX_HAVE_VFWSCANF 1 2025-05-07T19:48:30.1274792Z #define _GLIBCXX_HAVE_VSWSCANF 1 2025-05-07T19:48:30.1275132Z #define _GLIBCXX_HAVE_VWSCANF 1 2025-05-07T19:48:30.1275458Z #define _GLIBCXX_HAVE_WCHAR_H 1 2025-05-07T19:48:30.1275751Z #define _GLIBCXX_HAVE_WCSTOF 1 2025-05-07T19:48:30.1276069Z #define _GLIBCXX_HAVE_WCTYPE_H 1 2025-05-07T19:48:30.1276365Z #define _GLIBCXX_HAVE_WRITEV 1 2025-05-07T19:48:30.1276690Z #define _GLIBCXX_HAVE_XLOCALE_H 1 2025-05-07T19:48:30.1276984Z #define _GLIBCXX_HOSTED 1 2025-05-07T19:48:30.1277284Z #define _GLIBCXX_ICONV_CONST 2025-05-07T19:48:30.1277579Z #define _GLIBCXX_INLINE_VERSION 0 2025-05-07T19:48:30.1277910Z #define _GLIBCXX_LT_OBJDIR ".libs/" 2025-05-07T19:48:30.1278457Z #define _GLIBCXX_MAKE_MOVE_IF_NOEXCEPT_ITERATOR(_Iter) std::__make_move_if_noexcept_iterator(_Iter) 2025-05-07T19:48:30.1279150Z #define _GLIBCXX_MAKE_MOVE_ITERATOR(_Iter) std::make_move_iterator(_Iter) 2025-05-07T19:48:30.1279626Z #define _GLIBCXX_MANGLE_SIZE_T m 2025-05-07T19:48:30.1279925Z #define _GLIBCXX_MATH_H 1 2025-05-07T19:48:30.1280253Z #define _GLIBCXX_MOVE(__val) std::move(__val) 2025-05-07T19:48:30.1280676Z #define _GLIBCXX_MOVE3(_Tp,_Up,_Vp) std::move(_Tp, _Up, _Vp) 2025-05-07T19:48:30.1281237Z #define _GLIBCXX_MOVE_BACKWARD3(_Tp,_Up,_Vp) std::move_backward(_Tp, _Up, _Vp) 2025-05-07T19:48:30.1281721Z #define 
_GLIBCXX_NAMESPACE_CXX11 __cxx11:: 2025-05-07T19:48:30.1282094Z #define _GLIBCXX_NAMESPACE_LDBL 2025-05-07T19:48:30.1282518Z #define _GLIBCXX_NAMESPACE_LDBL_OR_CXX11 _GLIBCXX_NAMESPACE_CXX11 2025-05-07T19:48:30.1283123Z #define _GLIBCXX_NATIVE_THREAD_ID (__gthread_active_p() ? __gthread_self() : (__gthread_t)1) 2025-05-07T19:48:30.1283685Z #define _GLIBCXX_NODISCARD [[__nodiscard__]] 2025-05-07T19:48:30.1284039Z #define _GLIBCXX_NOEXCEPT noexcept 2025-05-07T19:48:30.1284437Z #define _GLIBCXX_NOEXCEPT_IF(...) noexcept(__VA_ARGS__) 2025-05-07T19:48:30.1284831Z #define _GLIBCXX_NOEXCEPT_PARM , bool _NE 2025-05-07T19:48:30.1285325Z #define _GLIBCXX_NOEXCEPT_QUAL noexcept (_NE) 2025-05-07T19:48:30.1285704Z #define _GLIBCXX_NORETURN __attribute__ ((__noreturn__)) 2025-05-07T19:48:30.1286122Z #define _GLIBCXX_NOTHROW _GLIBCXX_USE_NOEXCEPT 2025-05-07T19:48:30.1286576Z #define _GLIBCXX_NO_OBSOLETE_ISINF_ISNAN_DYNAMIC __GLIBC_PREREQ(2,23) 2025-05-07T19:48:30.1286988Z #define _GLIBCXX_NUMERIC_LIMITS 1 2025-05-07T19:48:30.1287300Z #define _GLIBCXX_OS_DEFINES 1 2025-05-07T19:48:30.1287582Z #define _GLIBCXX_PACKAGE_BUGREPORT "" 2025-05-07T19:48:30.1287943Z #define _GLIBCXX_PACKAGE_NAME "package-unused" 2025-05-07T19:48:30.1288429Z #define _GLIBCXX_PACKAGE_STRING "package-unused version-unused" 2025-05-07T19:48:30.1288866Z #define _GLIBCXX_PACKAGE_TARNAME "libstdc++" 2025-05-07T19:48:30.1289193Z #define _GLIBCXX_PACKAGE_URL "" 2025-05-07T19:48:30.1289571Z #define _GLIBCXX_PACKAGE__GLIBCXX_VERSION "version-unused" 2025-05-07T19:48:30.1289980Z #define _GLIBCXX_PREDEFINED_OPS_H 1 2025-05-07T19:48:30.1290290Z #define _GLIBCXX_PSEUDO_VISIBILITY(V) 2025-05-07T19:48:30.1290653Z #define _GLIBCXX_PURE __attribute__ ((__pure__)) 2025-05-07T19:48:30.1290989Z #define _GLIBCXX_RELEASE 11 2025-05-07T19:48:30.1291289Z #define _GLIBCXX_RES_LIMITS 1 2025-05-07T19:48:30.1291560Z #define _GLIBCXX_STDC_HEADERS 1 2025-05-07T19:48:30.1291862Z #define _GLIBCXX_STDIO_EOF -1 2025-05-07T19:48:30.1292140Z 
#define _GLIBCXX_STDIO_SEEK_CUR 1 2025-05-07T19:48:30.1292455Z #define _GLIBCXX_STDIO_SEEK_END 2 2025-05-07T19:48:30.1292764Z #define _GLIBCXX_STDLIB_H 1 2025-05-07T19:48:30.1293026Z #define _GLIBCXX_STD_A std 2025-05-07T19:48:30.1293309Z #define _GLIBCXX_STD_C std 2025-05-07T19:48:30.1293568Z #define _GLIBCXX_SYMVER 1 2025-05-07T19:48:30.1293856Z #define _GLIBCXX_SYMVER_GNU 1 2025-05-07T19:48:30.1294167Z #define _GLIBCXX_SYNCHRONIZATION_HAPPENS_AFTER(A) 2025-05-07T19:48:30.1294580Z #define _GLIBCXX_SYNCHRONIZATION_HAPPENS_BEFORE(A) 2025-05-07T19:48:30.1294920Z #define _GLIBCXX_THROW(_EXC) 2025-05-07T19:48:30.1295254Z #define _GLIBCXX_THROW_OR_ABORT(_EXC) (throw (_EXC)) 2025-05-07T19:48:30.1295607Z #define _GLIBCXX_TR1_BESSEL_FUNCTION_TCC 1 2025-05-07T19:48:30.1295961Z #define _GLIBCXX_TR1_BETA_FUNCTION_TCC 1 2025-05-07T19:48:30.1296365Z #define _GLIBCXX_TR1_ELL_INTEGRAL_TCC 1 2025-05-07T19:48:30.1296677Z #define _GLIBCXX_TR1_EXP_INTEGRAL_TCC 1 2025-05-07T19:48:30.1297012Z #define _GLIBCXX_TR1_GAMMA_TCC 1 2025-05-07T19:48:30.1297311Z #define _GLIBCXX_TR1_HYPERGEOMETRIC_TCC 1 2025-05-07T19:48:30.1297670Z #define _GLIBCXX_TR1_LEGENDRE_FUNCTION_TCC 1 2025-05-07T19:48:30.1298015Z #define _GLIBCXX_TR1_MODIFIED_BESSEL_FUNC_TCC 1 2025-05-07T19:48:30.1298380Z #define _GLIBCXX_TR1_POLY_HERMITE_TCC 1 2025-05-07T19:48:30.1298703Z #define _GLIBCXX_TR1_POLY_LAGUERRE_TCC 1 2025-05-07T19:48:30.1299044Z #define _GLIBCXX_TR1_RIEMANN_ZETA_TCC 1 2025-05-07T19:48:30.1299409Z #define _GLIBCXX_TR1_SPECIAL_FUNCTION_UTIL_H 1 2025-05-07T19:48:30.1299732Z #define _GLIBCXX_TXN_SAFE 2025-05-07T19:48:30.1300020Z #define _GLIBCXX_TXN_SAFE_DYN 2025-05-07T19:48:30.1300295Z #define _GLIBCXX_TYPE_TRAITS 1 2025-05-07T19:48:30.1300601Z #define _GLIBCXX_USE_ALLOCATOR_NEW 1 2025-05-07T19:48:30.1300894Z #define _GLIBCXX_USE_C99 1 2025-05-07T19:48:30.1301254Z #define _GLIBCXX_USE_C99_COMPLEX _GLIBCXX11_USE_C99_COMPLEX 2025-05-07T19:48:30.1301631Z #define _GLIBCXX_USE_C99_COMPLEX_TR1 1 
2025-05-07T19:48:30.1301962Z #define _GLIBCXX_USE_C99_CTYPE_TR1 1 2025-05-07T19:48:30.1302564Z #define _GLIBCXX_USE_C99_FENV_TR1 1 2025-05-07T19:48:30.1302926Z #define _GLIBCXX_USE_C99_INTTYPES_TR1 1 2025-05-07T19:48:30.1303399Z #define _GLIBCXX_USE_C99_INTTYPES_WCHAR_T_TR1 1 2025-05-07T19:48:30.1303805Z #define _GLIBCXX_USE_C99_MATH _GLIBCXX11_USE_C99_MATH 2025-05-07T19:48:30.1304219Z #define _GLIBCXX_USE_C99_MATH_TR1 1 2025-05-07T19:48:30.1304548Z #define _GLIBCXX_USE_C99_STDINT_TR1 1 2025-05-07T19:48:30.1304956Z #define _GLIBCXX_USE_C99_STDIO _GLIBCXX11_USE_C99_STDIO 2025-05-07T19:48:30.1305399Z #define _GLIBCXX_USE_C99_STDLIB _GLIBCXX11_USE_C99_STDLIB 2025-05-07T19:48:30.1305869Z #define _GLIBCXX_USE_C99_WCHAR _GLIBCXX11_USE_C99_WCHAR 2025-05-07T19:48:30.1306264Z #define _GLIBCXX_USE_CLOCK_MONOTONIC 1 2025-05-07T19:48:30.1306630Z #define _GLIBCXX_USE_CLOCK_REALTIME 1 2025-05-07T19:48:30.1306992Z #define _GLIBCXX_USE_CONSTEXPR constexpr 2025-05-07T19:48:30.1307329Z #define _GLIBCXX_USE_CXX11_ABI 1 2025-05-07T19:48:30.1307677Z #define _GLIBCXX_USE_DECIMAL_FLOAT 1 2025-05-07T19:48:30.1308004Z #define _GLIBCXX_USE_DEPRECATED 1 2025-05-07T19:48:30.1308348Z #define _GLIBCXX_USE_DEV_RANDOM 1 2025-05-07T19:48:30.1308658Z #define _GLIBCXX_USE_DUAL_ABI 1 2025-05-07T19:48:30.1308996Z #define _GLIBCXX_USE_FCHMOD 1 2025-05-07T19:48:30.1309410Z #define _GLIBCXX_USE_FCHMODAT 1 2025-05-07T19:48:30.1309747Z #define _GLIBCXX_USE_FLOAT128 1 2025-05-07T19:48:30.1310090Z #define _GLIBCXX_USE_GETTIMEOFDAY 1 2025-05-07T19:48:30.1310415Z #define _GLIBCXX_USE_GET_NPROCS 1 2025-05-07T19:48:30.1310758Z #define _GLIBCXX_USE_INT128 1 2025-05-07T19:48:30.1311055Z #define _GLIBCXX_USE_LFS 1 2025-05-07T19:48:30.1311379Z #define _GLIBCXX_USE_LONG_LONG 1 2025-05-07T19:48:30.1311689Z #define _GLIBCXX_USE_LSTAT 1 2025-05-07T19:48:30.1312020Z #define _GLIBCXX_USE_NANOSLEEP 1 2025-05-07T19:48:30.1312332Z #define _GLIBCXX_USE_NOEXCEPT noexcept 2025-05-07T19:48:30.1312758Z #define 
_GLIBCXX_USE_PTHREAD_RWLOCK_T 1 2025-05-07T19:48:30.1313090Z #define _GLIBCXX_USE_RANDOM_TR1 1 2025-05-07T19:48:30.1313427Z #define _GLIBCXX_USE_REALPATH 1 2025-05-07T19:48:30.1313757Z #define _GLIBCXX_USE_SCHED_YIELD 1 2025-05-07T19:48:30.1314088Z #define _GLIBCXX_USE_SC_NPROCESSORS_ONLN 1 2025-05-07T19:48:30.1314450Z #define _GLIBCXX_USE_SENDFILE 1 2025-05-07T19:48:30.1314749Z #define _GLIBCXX_USE_STD_SPEC_FUNCS 1 2025-05-07T19:48:30.1315096Z #define _GLIBCXX_USE_ST_MTIM 1 2025-05-07T19:48:30.1315468Z #define _GLIBCXX_USE_TBB_PAR_BACKEND __has_include(<tbb/tbb.h>) 2025-05-07T19:48:30.1315907Z #define _GLIBCXX_USE_TMPNAM 1 2025-05-07T19:48:30.1316202Z #define _GLIBCXX_USE_UTIME 1 2025-05-07T19:48:30.1316522Z #define _GLIBCXX_USE_UTIMENSAT 1 2025-05-07T19:48:30.1316829Z #define _GLIBCXX_USE_WCHAR_T 1 2025-05-07T19:48:30.1317168Z #define _GLIBCXX_USE_WEAK_REF __GXX_WEAK__ 2025-05-07T19:48:30.1317528Z #define _GLIBCXX_UTILITY 1 2025-05-07T19:48:30.1317902Z #define _GLIBCXX_VERBOSE 1 2025-05-07T19:48:30.1318321Z #define _GLIBCXX_VISIBILITY(V) __attribute__ ((__visibility__ (#V))) 2025-05-07T19:48:30.1318761Z #define _GLIBCXX_WEAK_DEFINITION 2025-05-07T19:48:30.1319100Z #define _GLIBCXX_X86_RDRAND 1 2025-05-07T19:48:30.1319391Z #define _GLIBCXX_X86_RDSEED 1 2025-05-07T19:48:30.1319698Z #define _GNU_SOURCE 1 2025-05-07T19:48:30.1319977Z #define _GTHREAD_USE_MUTEX_TIMEDLOCK 1 2025-05-07T19:48:30.1320321Z #define _G_BUFSIZ 8192 2025-05-07T19:48:30.1320584Z #define _G_HAVE_MMAP 1 2025-05-07T19:48:30.1320869Z #define _G_HAVE_MREMAP 1 2025-05-07T19:48:30.1321229Z #define _G_HAVE_ST_BLKSIZE defined (_STATBUF_ST_BLKSIZE) 2025-05-07T19:48:30.1321619Z #define _G_IO_IO_FILE_VERSION 0x20001 2025-05-07T19:48:30.1321954Z #define _G_config_h 1 2025-05-07T19:48:30.1322219Z #define _G_va_list __gnuc_va_list 2025-05-07T19:48:30.1322545Z #define _INITIALIZER_LIST 2025-05-07T19:48:30.1322814Z #define _IOFBF 0 2025-05-07T19:48:30.1323079Z #define _IOLBF 1 2025-05-07T19:48:30.1323316Z #define _IONBF 2
2025-05-07T19:48:30.1323579Z #define _IOS_APPEND 8 2025-05-07T19:48:30.1323829Z #define _IOS_ATEND 4 2025-05-07T19:48:30.1324104Z #define _IOS_BIN 128 2025-05-07T19:48:30.1324379Z #define _IOS_INPUT 1 2025-05-07T19:48:30.1324628Z #define _IOS_NOCREATE 32 2025-05-07T19:48:30.1325027Z #define _IOS_NOREPLACE 64 2025-05-07T19:48:30.1325284Z #define _IOS_OUTPUT 2 2025-05-07T19:48:30.1325545Z #define _IOS_TRUNC 16 2025-05-07T19:48:30.1325789Z #define _IO_BAD_SEEN 0x4000 2025-05-07T19:48:30.1326130Z #define _IO_BE(expr,res) __builtin_expect ((expr), res) 2025-05-07T19:48:30.1326481Z #define _IO_BOOLALPHA 0200000 2025-05-07T19:48:30.1326782Z #define _IO_BUFSIZ _G_BUFSIZ 2025-05-07T19:48:30.1327055Z #define _IO_CURRENTLY_PUTTING 0x800 2025-05-07T19:48:30.1327362Z #define _IO_DEC 020 2025-05-07T19:48:30.1327600Z #define _IO_DELETE_DONT_CLOSE 0x40 2025-05-07T19:48:30.1327912Z #define _IO_DONT_CLOSE 0100000 2025-05-07T19:48:30.1328203Z #define _IO_EOF_SEEN 0x10 2025-05-07T19:48:30.1328456Z #define _IO_ERR_SEEN 0x20 2025-05-07T19:48:30.1328732Z #define _IO_FIXED 010000 2025-05-07T19:48:30.1328980Z #define _IO_FLAGS2_MMAP 1 2025-05-07T19:48:30.1329261Z #define _IO_FLAGS2_NOTCANCEL 2 2025-05-07T19:48:30.1329534Z #define _IO_FLAGS2_USER_WBUF 8 2025-05-07T19:48:30.1329857Z #define _IO_HAVE_ST_BLKSIZE _G_HAVE_ST_BLKSIZE 2025-05-07T19:48:30.1330168Z #define _IO_HEX 0100 2025-05-07T19:48:30.1330434Z #define _IO_INTERNAL 010 2025-05-07T19:48:30.1330757Z #define _IO_IN_BACKUP 0x100 2025-05-07T19:48:30.1331044Z #define _IO_IS_APPENDING 0x1000 2025-05-07T19:48:30.1331351Z #define _IO_IS_FILEBUF 0x2000 2025-05-07T19:48:30.1331613Z #define _IO_LEFT 02 2025-05-07T19:48:30.1331887Z #define _IO_LINE_BUF 0x200 2025-05-07T19:48:30.1332150Z #define _IO_LINKED 0x80 2025-05-07T19:48:30.1332440Z #define _IO_MAGIC 0xFBAD0000 2025-05-07T19:48:30.1332720Z #define _IO_MAGIC_MASK 0xFFFF0000 2025-05-07T19:48:30.1333024Z #define _IO_NO_READS 4 2025-05-07T19:48:30.1333270Z #define _IO_NO_WRITES 8 
2025-05-07T19:48:30.1333544Z #define _IO_OCT 040 2025-05-07T19:48:30.1333921Z #define _IO_PENDING_OUTPUT_COUNT(_fp) ((_fp)->_IO_write_ptr - (_fp)->_IO_write_base) 2025-05-07T19:48:30.1334386Z #define _IO_RIGHT 04 2025-05-07T19:48:30.1334658Z #define _IO_SCIENTIFIC 04000 2025-05-07T19:48:30.1334923Z #define _IO_SHOWBASE 0200 2025-05-07T19:48:30.1335232Z #define _IO_SHOWPOINT 0400 2025-05-07T19:48:30.1335499Z #define _IO_SHOWPOS 02000 2025-05-07T19:48:30.1335781Z #define _IO_SKIPWS 01 2025-05-07T19:48:30.1336031Z #define _IO_STDIO 040000 2025-05-07T19:48:30.1336317Z #define _IO_STDIO_H 2025-05-07T19:48:30.1336567Z #define _IO_TIED_PUT_GET 0x400 2025-05-07T19:48:30.1336864Z #define _IO_UNBUFFERED 2 2025-05-07T19:48:30.1337133Z #define _IO_UNIFIED_JUMPTABLES 1 2025-05-07T19:48:30.1337455Z #define _IO_UNITBUF 020000 2025-05-07T19:48:30.1337762Z #define _IO_UPPERCASE 01000 2025-05-07T19:48:30.1338032Z #define _IO_USER_BUF 1 2025-05-07T19:48:30.1338319Z #define _IO_USER_LOCK 0x8000 2025-05-07T19:48:30.1338669Z #define _IO_cleanup_region_end(_Doit) 2025-05-07T19:48:30.1339028Z #define _IO_cleanup_region_start(_fct,_fp) 2025-05-07T19:48:30.1339441Z #define _IO_feof_unlocked(__fp) (((__fp)->_flags & _IO_EOF_SEEN) != 0) 2025-05-07T19:48:30.1339965Z #define _IO_ferror_unlocked(__fp) (((__fp)->_flags & _IO_ERR_SEEN) != 0) 2025-05-07T19:48:30.1340376Z #define _IO_file_flags _flags 2025-05-07T19:48:30.1340686Z #define _IO_flockfile(_fp) 2025-05-07T19:48:30.1340969Z #define _IO_fpos64_t _G_fpos64_t 2025-05-07T19:48:30.1341292Z #define _IO_fpos_t _G_fpos_t 2025-05-07T19:48:30.1341607Z #define _IO_ftrylockfile(_fp) 2025-05-07T19:48:30.1341900Z #define _IO_funlockfile(_fp) 2025-05-07T19:48:30.1342477Z #define _IO_getc_unlocked(_fp) (_IO_BE ((_fp)->_IO_read_ptr >= (_fp)->_IO_read_end, 0) ? 
__uflow (_fp) : *(unsigned char *) (_fp)->_IO_read_ptr++) 2025-05-07T19:48:30.1343041Z #define _IO_iconv_t _G_iconv_t 2025-05-07T19:48:30.1343343Z #define _IO_off64_t __off64_t 2025-05-07T19:48:30.1343606Z #define _IO_off_t __off_t 2025-05-07T19:48:30.1343933Z #define _IO_peekc(_fp) _IO_peekc_unlocked (_fp) 2025-05-07T19:48:30.1344557Z #define _IO_peekc_unlocked(_fp) (_IO_BE ((_fp)->_IO_read_ptr >= (_fp)->_IO_read_end, 0) && __underflow (_fp) == EOF ? EOF : *(unsigned char *) (_fp)->_IO_read_ptr) 2025-05-07T19:48:30.1345184Z #define _IO_pid_t __pid_t 2025-05-07T19:48:30.1345841Z #define _IO_putc_unlocked(_ch,_fp) (_IO_BE ((_fp)->_IO_write_ptr >= (_fp)->_IO_write_end, 0) ? __overflow (_fp, (unsigned char) (_ch)) : (unsigned char) (*(_fp)->_IO_write_ptr++ = (_ch))) 2025-05-07T19:48:30.1346507Z #define _IO_size_t size_t 2025-05-07T19:48:30.1346788Z #define _IO_ssize_t __ssize_t 2025-05-07T19:48:30.1347095Z #define _IO_stderr ((_IO_FILE*)(&_IO_2_1_stderr_)) 2025-05-07T19:48:30.1347478Z #define _IO_stdin ((_IO_FILE*)(&_IO_2_1_stdin_)) 2025-05-07T19:48:30.1347859Z #define _IO_stdout ((_IO_FILE*)(&_IO_2_1_stdout_)) 2025-05-07T19:48:30.1348182Z #define _IO_uid_t __uid_t 2025-05-07T19:48:30.1348477Z #define _IO_va_list __gnuc_va_list 2025-05-07T19:48:30.1348764Z #define _IO_wint_t wint_t 2025-05-07T19:48:30.1349045Z #define _ISOC11_SOURCE 1 2025-05-07T19:48:30.1349451Z #define _ISOC95_SOURCE 1 2025-05-07T19:48:30.1349725Z #define _ISOC99_SOURCE 1 2025-05-07T19:48:30.1349986Z #define _LARGEFILE64_SOURCE 1 2025-05-07T19:48:30.1350287Z #define _LARGEFILE_SOURCE 1 2025-05-07T19:48:30.1350553Z #define _LIBC_LIMITS_H_ 1 2025-05-07T19:48:30.1350829Z #define _LINUX_LIMITS_H 2025-05-07T19:48:30.1351069Z #define _LP64 1 2025-05-07T19:48:30.1351403Z #define _MATH_H 1 2025-05-07T19:48:30.1351667Z #define _MATH_H_MATHDEF 1 2025-05-07T19:48:30.1351923Z #define _MOVE_H 1 2025-05-07T19:48:30.1352188Z #define _Mfloat_ float 2025-05-07T19:48:30.1352454Z #define _Mlong_double_ long double 
2025-05-07T19:48:30.1352837Z #define _NEW 2025-05-07T19:48:30.1353244Z #define _OLD_STDIO_MAGIC 0xFABC0000 2025-05-07T19:48:30.1353594Z #define _POSIX2_BC_BASE_MAX 99 2025-05-07T19:48:30.1353897Z #define _POSIX2_BC_DIM_MAX 2048 2025-05-07T19:48:30.1354232Z #define _POSIX2_BC_SCALE_MAX 99 2025-05-07T19:48:30.1354538Z #define _POSIX2_BC_STRING_MAX 1000 2025-05-07T19:48:30.1354882Z #define _POSIX2_CHARCLASS_NAME_MAX 14 2025-05-07T19:48:30.1355235Z #define _POSIX2_COLL_WEIGHTS_MAX 2 2025-05-07T19:48:30.1355543Z #define _POSIX2_EXPR_NEST_MAX 32 2025-05-07T19:48:30.1355864Z #define _POSIX2_LINE_MAX 2048 2025-05-07T19:48:30.1356157Z #define _POSIX2_RE_DUP_MAX 255 2025-05-07T19:48:30.1356474Z #define _POSIX_AIO_LISTIO_MAX 2 2025-05-07T19:48:30.1356764Z #define _POSIX_AIO_MAX 1 2025-05-07T19:48:30.1357065Z #define _POSIX_ARG_MAX 4096 2025-05-07T19:48:30.1357347Z #define _POSIX_CHILD_MAX 25 2025-05-07T19:48:30.1357667Z #define _POSIX_CLOCKRES_MIN 20000000 2025-05-07T19:48:30.1357980Z #define _POSIX_C_SOURCE 200809L 2025-05-07T19:48:30.1358301Z #define _POSIX_DELAYTIMER_MAX 32 2025-05-07T19:48:30.1358648Z #define _POSIX_FD_SETSIZE _POSIX_OPEN_MAX 2025-05-07T19:48:30.1358991Z #define _POSIX_HIWAT _POSIX_PIPE_BUF 2025-05-07T19:48:30.1359332Z #define _POSIX_HOST_NAME_MAX 255 2025-05-07T19:48:30.1359630Z #define _POSIX_LINK_MAX 8 2025-05-07T19:48:30.1359998Z #define _POSIX_LOGIN_NAME_MAX 9 2025-05-07T19:48:30.1360290Z #define _POSIX_MAX_CANON 255 2025-05-07T19:48:30.1360606Z #define _POSIX_MAX_INPUT 255 2025-05-07T19:48:30.1360889Z #define _POSIX_MQ_OPEN_MAX 8 2025-05-07T19:48:30.1361203Z #define _POSIX_MQ_PRIO_MAX 32 2025-05-07T19:48:30.1361488Z #define _POSIX_NAME_MAX 14 2025-05-07T19:48:30.1361797Z #define _POSIX_NGROUPS_MAX 8 2025-05-07T19:48:30.1362111Z #define _POSIX_OPEN_MAX 20 2025-05-07T19:48:30.1362388Z #define _POSIX_PATH_MAX 256 2025-05-07T19:48:30.1362697Z #define _POSIX_PIPE_BUF 512 2025-05-07T19:48:30.1362974Z #define _POSIX_QLIMIT 1 2025-05-07T19:48:30.1363271Z 
#define _POSIX_RE_DUP_MAX 255 2025-05-07T19:48:30.1363558Z #define _POSIX_RTSIG_MAX 8 2025-05-07T19:48:30.1363867Z #define _POSIX_SEM_NSEMS_MAX 256 2025-05-07T19:48:30.1364172Z #define _POSIX_SEM_VALUE_MAX 32767 2025-05-07T19:48:30.1364505Z #define _POSIX_SIGQUEUE_MAX 32 2025-05-07T19:48:30.1364794Z #define _POSIX_SOURCE 1 2025-05-07T19:48:30.1365098Z #define _POSIX_SSIZE_MAX 32767 2025-05-07T19:48:30.1365487Z #define _POSIX_STREAM_MAX 8 2025-05-07T19:48:30.1365784Z #define _POSIX_SYMLINK_MAX 255 2025-05-07T19:48:30.1366085Z #define _POSIX_SYMLOOP_MAX 8 2025-05-07T19:48:30.1366384Z #define _POSIX_THREAD_DESTRUCTOR_ITERATIONS 4 2025-05-07T19:48:30.1366740Z #define _POSIX_THREAD_KEYS_MAX 128 2025-05-07T19:48:30.1367036Z #define _POSIX_THREAD_THREADS_MAX 64 2025-05-07T19:48:30.1367357Z #define _POSIX_TIMER_MAX 32 2025-05-07T19:48:30.1367631Z #define _POSIX_TTY_NAME_MAX 9 2025-05-07T19:48:30.1367931Z #define _POSIX_TZNAME_MAX 6 2025-05-07T19:48:30.1368199Z #define _POSIX_UIO_MAXIOV 16 2025-05-07T19:48:30.1368568Z #define _PSTL_ASSERT(_Condition) __glibcxx_assert(_Condition) 2025-05-07T19:48:30.1369089Z #define _PSTL_ASSERT_MSG(_Condition,_Message) __glibcxx_assert(_Condition) 2025-05-07T19:48:30.1369690Z #define _PSTL_CLANG_VERSION (__clang_major__ * 10000 + __clang_minor__ * 100 + __clang_patchlevel__) 2025-05-07T19:48:30.1370205Z #define _PSTL_CONFIG_H 2025-05-07T19:48:30.1370672Z #define _PSTL_CPP11_STD_ROTATE_BROKEN ((__GLIBCXX__ && __GLIBCXX__ < 20150716) || (_MSC_VER && _MSC_VER < 1800)) 2025-05-07T19:48:30.1371546Z #define _PSTL_CPP14_2RANGE_MISMATCH_EQUAL_PRESENT (_MSC_VER >= 1900 || __cplusplus >= 201300L || __cpp_lib_robust_nonmodifying_seq_ops == 201304) 2025-05-07T19:48:30.1372335Z #define _PSTL_CPP14_INTEGER_SEQUENCE_PRESENT (_MSC_VER >= 1900 || __cplusplus >= 201402L) 2025-05-07T19:48:30.1373145Z #define _PSTL_CPP14_MAKE_REVERSE_ITERATOR_PRESENT (_MSC_VER >= 1900 || __cplusplus >= 201402L || __cpp_lib_make_reverse_iterator == 201402) 
2025-05-07T19:48:30.1374192Z #define _PSTL_CPP14_VARIABLE_TEMPLATES_PRESENT (!__INTEL_COMPILER || __INTEL_COMPILER >= 1700) && (_MSC_FULL_VER >= 190023918 || __cplusplus >= 201402L) 2025-05-07T19:48:30.1374930Z #define _PSTL_CPP17_EXECUTION_POLICIES_PRESENT (_MSC_VER >= 1912) 2025-05-07T19:48:30.1375413Z #define _PSTL_EARLYEXIT_PRESENT (__INTEL_COMPILER >= 1800) 2025-05-07T19:48:30.1375926Z #define _PSTL_GCC_VERSION (__GNUC__ * 10000 + __GNUC_MINOR__ * 100 + __GNUC_PATCHLEVEL__) 2025-05-07T19:48:30.1376418Z #define _PSTL_HIDE_FROM_ABI_POP 2025-05-07T19:48:30.1376746Z #define _PSTL_HIDE_FROM_ABI_PUSH 2025-05-07T19:48:30.1377114Z #define _PSTL_ICC_18_OMP_SIMD_BROKEN (__INTEL_COMPILER == 1800) 2025-05-07T19:48:30.1377583Z #define _PSTL_MONOTONIC_PRESENT (__INTEL_COMPILER >= 1800) 2025-05-07T19:48:30.1377957Z #define _PSTL_PAR_BACKEND_SERIAL 2025-05-07T19:48:30.1378293Z #define _PSTL_PRAGMA(x) _Pragma(# x) 2025-05-07T19:48:30.1378948Z #define _PSTL_PRAGMA_DECLARE_REDUCTION(NAME,OP) _PSTL_PRAGMA(omp declare reduction(NAME:OP : omp_out(omp_in)) initializer(omp_priv = omp_orig)) 2025-05-07T19:48:30.1379724Z #define _PSTL_PRAGMA_DECLARE_SIMD _PSTL_PRAGMA(omp declare simd) 2025-05-07T19:48:30.1380159Z #define _PSTL_PRAGMA_FORCEINLINE 2025-05-07T19:48:30.1380517Z #define _PSTL_PRAGMA_LOCATION " [Parallel STL message]: " 2025-05-07T19:48:30.1380918Z #define _PSTL_PRAGMA_MESSAGE(x) 2025-05-07T19:48:30.1381484Z #define _PSTL_PRAGMA_MESSAGE_IMPL(x) _PSTL_PRAGMA(message(_PSTL_STRING_CONCAT(_PSTL_PRAGMA_LOCATION, x))) 2025-05-07T19:48:30.1382066Z #define _PSTL_PRAGMA_MESSAGE_POLICIES(x) 2025-05-07T19:48:30.1382419Z #define _PSTL_PRAGMA_SIMD _PSTL_PRAGMA(omp simd) 2025-05-07T19:48:30.1382814Z #define _PSTL_PRAGMA_SIMD_EARLYEXIT 2025-05-07T19:48:30.1383138Z #define _PSTL_PRAGMA_SIMD_EXCLUSIVE_SCAN(PRM) 2025-05-07T19:48:30.1383518Z #define _PSTL_PRAGMA_SIMD_INCLUSIVE_SCAN(PRM) 2025-05-07T19:48:30.1383923Z #define _PSTL_PRAGMA_SIMD_ORDERED_MONOTONIC(PRM) 2025-05-07T19:48:30.1384340Z 
#define _PSTL_PRAGMA_SIMD_ORDERED_MONOTONIC_2ARGS(PRM1,PRM2) 2025-05-07T19:48:30.1384869Z #define _PSTL_PRAGMA_SIMD_REDUCTION(PRM) _PSTL_PRAGMA(omp simd reduction(PRM)) 2025-05-07T19:48:30.1385315Z #define _PSTL_PRAGMA_SIMD_SCAN(PRM) 2025-05-07T19:48:30.1385661Z #define _PSTL_PRAGMA_VECTOR_UNALIGNED 2025-05-07T19:48:30.1385978Z #define _PSTL_STRING(x) _PSTL_STRING_AUX(x) 2025-05-07T19:48:30.1386326Z #define _PSTL_STRING_AUX(x) #x 2025-05-07T19:48:30.1386618Z #define _PSTL_STRING_CONCAT(x,y) x #y 2025-05-07T19:48:30.1386942Z #define _PSTL_UDR_PRESENT 0 2025-05-07T19:48:30.1387416Z #define _PSTL_UDS_PRESENT (__INTEL_COMPILER >= 1900 && __INTEL_COMPILER_BUILD_DATE >= 20180626) 2025-05-07T19:48:30.1387899Z #define _PSTL_USAGE_WARNINGS 0 2025-05-07T19:48:30.1388244Z #define _PSTL_USE_NONTEMPORAL_STORES_IF_ALLOWED 2025-05-07T19:48:30.1388578Z #define _PSTL_VERSION 12000 2025-05-07T19:48:30.1388917Z #define _PSTL_VERSION_MAJOR (_PSTL_VERSION / 1000) 2025-05-07T19:48:30.1389316Z #define _PSTL_VERSION_MINOR ((_PSTL_VERSION % 1000) / 10) 2025-05-07T19:48:30.1389735Z #define _PSTL_VERSION_PATCH (_PSTL_VERSION % 10) 2025-05-07T19:48:30.1390087Z #define _PTRDIFF_T 2025-05-07T19:48:30.1390323Z #define _PTR_TRAITS_H 1 2025-05-07T19:48:30.1390605Z #define _SIGSET_H_types 1 2025-05-07T19:48:30.1390940Z #define _SIGSET_NWORDS (1024 / (8 * sizeof (unsigned long int))) 2025-05-07T19:48:30.1391335Z #define _SIZE_T 2025-05-07T19:48:30.1391567Z #define _STDC_PREDEF_H 1 2025-05-07T19:48:30.1391846Z #define _STDIO_H 1 2025-05-07T19:48:30.1392084Z #define _STDIO_USES_IOSTREAM 2025-05-07T19:48:30.1392378Z #define _STDLIB_H 1 2025-05-07T19:48:30.1392687Z #define _STL_ALGOBASE_H 1 2025-05-07T19:48:30.1392999Z #define _STL_ITERATOR_BASE_FUNCS_H 1 2025-05-07T19:48:30.1393492Z #define _STL_ITERATOR_BASE_TYPES_H 1 2025-05-07T19:48:30.1393842Z #define _STL_ITERATOR_H 1 2025-05-07T19:48:30.1394173Z #define _STL_PAIR_H 1 2025-05-07T19:48:30.1394436Z #define _STL_RELOPS_H 1 
2025-05-07T19:48:30.1394796Z #define _STRING_H 1 2025-05-07T19:48:30.1395052Z #define _STRUCT_TIMEVAL 1 2025-05-07T19:48:30.1395354Z #define _SVID_SOURCE 1 2025-05-07T19:48:30.1395610Z #define _SYS_CDEFS_H 1 2025-05-07T19:48:30.1395905Z #define _SYS_SELECT_H 1 2025-05-07T19:48:30.1396173Z #define _SYS_SYSMACROS_H 1 2025-05-07T19:48:30.1396482Z #define _SYS_TYPES_H 1 2025-05-07T19:48:30.1396739Z #define _TIME_H 1 2025-05-07T19:48:30.1397010Z #define _VA_LIST_DEFINED 2025-05-07T19:48:30.1397277Z #define _XLOCALE_H 1 2025-05-07T19:48:30.1397584Z #define _XOPEN_IOV_MAX _POSIX_UIO_MAXIOV 2025-05-07T19:48:30.1397931Z #define _XOPEN_LIM_H 1 2025-05-07T19:48:30.1398194Z #define _XOPEN_SOURCE 700 2025-05-07T19:48:30.1398500Z #define _XOPEN_SOURCE_EXTENDED 1 2025-05-07T19:48:30.1398893Z #define __ASMNAME(cname) __ASMNAME2 (__USER_LABEL_PREFIX__, cname) 2025-05-07T19:48:30.1399399Z #define __ASMNAME2(prefix,cname) __STRING (prefix) cname 2025-05-07T19:48:30.1399813Z #define __ASSERT_FUNCTION __PRETTY_FUNCTION__ 2025-05-07T19:48:30.1400211Z #define __ASSERT_VOID_CAST static_cast 2025-05-07T19:48:30.1400543Z #define __ATOMIC_ACQUIRE 2 2025-05-07T19:48:30.1400843Z #define __ATOMIC_ACQ_REL 4 2025-05-07T19:48:30.1401151Z #define __ATOMIC_CONSUME 1 2025-05-07T19:48:30.1401423Z #define __ATOMIC_RELAXED 0 2025-05-07T19:48:30.1401723Z #define __ATOMIC_RELEASE 3 2025-05-07T19:48:30.1402157Z #define __ATOMIC_SEQ_CST 5 2025-05-07T19:48:30.1402473Z #define __BEGIN_DECLS extern "C" { 2025-05-07T19:48:30.1402789Z #define __BEGIN_NAMESPACE_C99 2025-05-07T19:48:30.1403236Z #define __BEGIN_NAMESPACE_STD 2025-05-07T19:48:30.1403541Z #define __BIGGEST_ALIGNMENT__ 16 2025-05-07T19:48:30.1403872Z #define __BIG_ENDIAN 4321 2025-05-07T19:48:30.1404159Z #define __BITINT_MAXWIDTH__ 8388608 2025-05-07T19:48:30.1404507Z #define __BIT_TYPES_DEFINED__ 1 2025-05-07T19:48:30.1404848Z #define __BLKCNT64_T_TYPE __SQUAD_TYPE 2025-05-07T19:48:30.1405199Z #define __BLKCNT_T_TYPE __SYSCALL_SLONG_TYPE 
2025-05-07T19:48:30.1405594Z #define __BLKSIZE_T_TYPE __SYSCALL_SLONG_TYPE 2025-05-07T19:48:30.1405937Z #define __BOOL_WIDTH__ 8 2025-05-07T19:48:30.1406255Z #define __BYTE_ORDER __LITTLE_ENDIAN 2025-05-07T19:48:30.1406594Z #define __BYTE_ORDER__ __ORDER_LITTLE_ENDIAN__ 2025-05-07T19:48:30.1406976Z #define __CHANNEL_DESCRIPTOR_H__ 2025-05-07T19:48:30.1407295Z #define __CHAR16_TYPE__ unsigned short 2025-05-07T19:48:30.1407659Z #define __CHAR32_TYPE__ unsigned int 2025-05-07T19:48:30.1407969Z #define __CHAR_BIT__ 8 2025-05-07T19:48:30.1408278Z #define __CLANG_ATOMIC_BOOL_LOCK_FREE 2 2025-05-07T19:48:30.1408661Z #define __CLANG_ATOMIC_CHAR16_T_LOCK_FREE 2 2025-05-07T19:48:30.1409020Z #define __CLANG_ATOMIC_CHAR32_T_LOCK_FREE 2 2025-05-07T19:48:30.1409385Z #define __CLANG_ATOMIC_CHAR_LOCK_FREE 2 2025-05-07T19:48:30.1409723Z #define __CLANG_ATOMIC_INT_LOCK_FREE 2 2025-05-07T19:48:30.1410090Z #define __CLANG_ATOMIC_LLONG_LOCK_FREE 2 2025-05-07T19:48:30.1410429Z #define __CLANG_ATOMIC_LONG_LOCK_FREE 2 2025-05-07T19:48:30.1410808Z #define __CLANG_ATOMIC_POINTER_LOCK_FREE 2 2025-05-07T19:48:30.1411172Z #define __CLANG_ATOMIC_SHORT_LOCK_FREE 2 2025-05-07T19:48:30.1411549Z #define __CLANG_ATOMIC_WCHAR_T_LOCK_FREE 2 2025-05-07T19:48:30.1411915Z #define __CLANG_LIMITS_H 2025-05-07T19:48:30.1412203Z #define __CLANG_MAX_ALIGN_T_DEFINED 2025-05-07T19:48:30.1412563Z #define __CLOCKID_T_TYPE __S32_TYPE 2025-05-07T19:48:30.1412901Z #define __CLOCK_T_TYPE __SYSCALL_SLONG_TYPE 2025-05-07T19:48:30.1413278Z #define __COMMON_FUNCTIONS_H__ 2025-05-07T19:48:30.1413579Z #define __COMPAR_FN_T 2025-05-07T19:48:30.1413890Z #define __CONCAT(x,y) x ## y 2025-05-07T19:48:30.1414295Z #define __CONSTANT_CFSTRINGS__ 1 2025-05-07T19:48:30.1414631Z #define __CUDACC_VER_BUILD__ 89 2025-05-07T19:48:30.1414927Z #define __CUDACC_VER_MAJOR__ 11 2025-05-07T19:48:30.1415355Z #define __CUDACC_VER_MINOR__ 8 2025-05-07T19:48:30.1415993Z #define __CUDACC_VER__ "__CUDACC_VER__ is no longer supported. 
Use __CUDACC_VER_MAJOR__, __CUDACC_VER_MINOR__, and __CUDACC_VER_BUILD__ instead." 2025-05-07T19:48:30.1416711Z #define __CUDACC__ 1 2025-05-07T19:48:30.1417009Z #define __CUDART_API_PTDS(api) api 2025-05-07T19:48:30.1417307Z #define __CUDART_API_PTSZ(api) api 2025-05-07T19:48:30.1417792Z #define __CUDART_API_VERSION ((__CUDA_API_VER_MAJOR__ * 1000) + (__CUDA_API_VER_MINOR__ * 10)) 2025-05-07T19:48:30.1418250Z #define __CUDA_API_VER_MAJOR__ 11 2025-05-07T19:48:30.1418571Z #define __CUDA_API_VER_MINOR__ 8 2025-05-07T19:48:30.1418849Z #define __CUDA_ARCH_LIST__ 520 2025-05-07T19:48:30.1419144Z #define __CUDA_ARCH__ 520 2025-05-07T19:48:30.1419418Z #define __CUDA_DEVICE_RUNTIME_API_H__ 2025-05-07T19:48:30.1419706Z #define __CUDA_MATH_CRTIMP 2025-05-07T19:48:30.1419989Z #define __CUDA_RUNTIME_API_H__ 2025-05-07T19:48:30.1420256Z #define __CUDA_RUNTIME_H__ 2025-05-07T19:48:30.1420549Z #define __CUDA_SURFACE_TYPES_H__ 2025-05-07T19:48:30.1420840Z #define __CUDA_TEXTURE_TYPES_H__ 2025-05-07T19:48:30.1421146Z #define __DADDR_T_TYPE __S32_TYPE 2025-05-07T19:48:30.1421433Z #define __DBL_DECIMAL_DIG__ 17 2025-05-07T19:48:30.1421747Z #define __DBL_DENORM_MIN__ 4.9406564584124654e-324 2025-05-07T19:48:30.1422071Z #define __DBL_DIG__ 15 2025-05-07T19:48:30.1422334Z #define __DBL_EPSILON__ 2.2204460492503131e-16 2025-05-07T19:48:30.1422658Z #define __DBL_HAS_DENORM__ 1 2025-05-07T19:48:30.1422912Z #define __DBL_HAS_INFINITY__ 1 2025-05-07T19:48:30.1423206Z #define __DBL_HAS_QUIET_NAN__ 1 2025-05-07T19:48:30.1423481Z #define __DBL_MANT_DIG__ 53 2025-05-07T19:48:30.1423772Z #define __DBL_MAX_10_EXP__ 308 2025-05-07T19:48:30.1424104Z #define __DBL_MAX_EXP__ 1024 2025-05-07T19:48:30.1424395Z #define __DBL_MAX__ 1.7976931348623157e+308 2025-05-07T19:48:30.1424697Z #define __DBL_MIN_10_EXP__ (-307) 2025-05-07T19:48:30.1425000Z #define __DBL_MIN_EXP__ (-1021) 2025-05-07T19:48:30.1425314Z #define __DBL_MIN__ 2.2250738585072014e-308 2025-05-07T19:48:30.1425642Z #define __DECIMAL_DIG__ 
__LDBL_DECIMAL_DIG__ 2025-05-07T19:48:30.1425989Z #define __DELETE_THROW throw() 2025-05-07T19:48:30.1426263Z #define __DEPRECATED 1 2025-05-07T19:48:30.1426550Z #define __DEVICE_ATOMIC_FUNCTIONS_HPP__ 2025-05-07T19:48:30.1426866Z #define __DEVICE_ATOMIC_FUNCTIONS_H__ 2025-05-07T19:48:30.1427192Z #define __DEVICE_DOUBLE_FUNCTIONS_HPP__ 2025-05-07T19:48:30.1427505Z #define __DEVICE_DOUBLE_FUNCTIONS_H__ 2025-05-07T19:48:30.1427823Z #define __DEVICE_FUNCTIONS_HPP__ 2025-05-07T19:48:30.1428111Z #define __DEVICE_FUNCTIONS_H__ 2025-05-07T19:48:30.1428412Z #define __DEVICE_LAUNCH_PARAMETERS_H__ 2025-05-07T19:48:30.1428721Z #define __DEVICE_TYPES_H__ 2025-05-07T19:48:30.1428992Z #define __DEV_T_TYPE __UQUAD_TYPE 2025-05-07T19:48:30.1429280Z #define __DRIVER_FUNCTIONS_H__ 2025-05-07T19:48:30.1429546Z #define __DRIVER_TYPES_H__ 2025-05-07T19:48:30.1429801Z #define __ELF__ 1 2025-05-07T19:48:30.1430020Z #define __END_DECLS } 2025-05-07T19:48:30.1430273Z #define __END_NAMESPACE_C99 2025-05-07T19:48:30.1430537Z #define __END_NAMESPACE_STD 2025-05-07T19:48:30.1430816Z #define __EXCEPTIONS 1 2025-05-07T19:48:30.1431051Z #define __EXCEPTION_H 1 2025-05-07T19:48:30.1431331Z #define __FDS_BITS(set) ((set)->fds_bits) 2025-05-07T19:48:30.1431759Z #define __FD_CLR(d,set) ((void) (__FDS_BITS (set)[__FD_ELT (d)] &= ~__FD_MASK (d))) 2025-05-07T19:48:30.1432169Z #define __FD_ELT(d) ((d) / __NFDBITS) 2025-05-07T19:48:30.1432573Z #define __FD_ISSET(d,set) ((__FDS_BITS (set)[__FD_ELT (d)] & __FD_MASK (d)) != 0) 2025-05-07T19:48:30.1433245Z #define __FD_MASK(d) ((__fd_mask) 1 << ((d) % __NFDBITS)) 2025-05-07T19:48:30.1433725Z #define __FD_SET(d,set) ((void) (__FDS_BITS (set)[__FD_ELT (d)] |= __FD_MASK (d))) 2025-05-07T19:48:30.1434152Z #define __FD_SETSIZE 1024 2025-05-07T19:48:30.1434890Z #define __FD_ZERO(fdsp) do { int __d0, __d1; __asm__ __volatile__ ("cld; rep; " __FD_ZERO_STOS : "=c" (__d0), "=D" (__d1) : "a" (0), "0" (sizeof (fd_set) / sizeof (__fd_mask)), "1" (&__FDS_BITS (fdsp)[0]) : 
"memory"); } while (0) 2025-05-07T19:48:30.1435664Z #define __FD_ZERO_STOS "stosq" 2025-05-07T19:48:30.1435949Z #define __FILE_defined 1 2025-05-07T19:48:30.1436240Z #define __FINITE_MATH_ONLY__ 0 2025-05-07T19:48:30.1436588Z #define __FLOAT128__ 1 2025-05-07T19:48:30.1436883Z #define __FLOAT_WORD_ORDER __BYTE_ORDER 2025-05-07T19:48:30.1437196Z #define __FLT16_DECIMAL_DIG__ 5 2025-05-07T19:48:30.1437529Z #define __FLT16_DENORM_MIN__ 5.9604644775390625e-8F16 2025-05-07T19:48:30.1437862Z #define __FLT16_DIG__ 3 2025-05-07T19:48:30.1438153Z #define __FLT16_EPSILON__ 9.765625e-4F16 2025-05-07T19:48:30.1438489Z #define __FLT16_HAS_DENORM__ 1 2025-05-07T19:48:30.1438772Z #define __FLT16_HAS_INFINITY__ 1 2025-05-07T19:48:30.1439089Z #define __FLT16_HAS_QUIET_NAN__ 1 2025-05-07T19:48:30.1439378Z #define __FLT16_MANT_DIG__ 11 2025-05-07T19:48:30.1439777Z #define __FLT16_MAX_10_EXP__ 4 2025-05-07T19:48:30.1440107Z #define __FLT16_MAX_EXP__ 16 2025-05-07T19:48:30.1440471Z #define __FLT16_MAX__ 6.5504e+4F16 2025-05-07T19:48:30.1441090Z #define __FLT16_MIN_10_EXP__ (-4) 2025-05-07T19:48:30.1441436Z #define __FLT16_MIN_EXP__ (-13) 2025-05-07T19:48:30.1441828Z #define __FLT16_MIN__ 6.103515625e-5F16 2025-05-07T19:48:30.1442304Z #define __FLT_DECIMAL_DIG__ 9 2025-05-07T19:48:30.1442649Z #define __FLT_DENORM_MIN__ 1.40129846e-45F 2025-05-07T19:48:30.1443042Z #define __FLT_DIG__ 6 2025-05-07T19:48:30.1443400Z #define __FLT_EPSILON__ 1.19209290e-7F 2025-05-07T19:48:30.1443807Z #define __FLT_HAS_DENORM__ 1 2025-05-07T19:48:30.1444182Z #define __FLT_HAS_INFINITY__ 1 2025-05-07T19:48:30.1444589Z #define __FLT_HAS_QUIET_NAN__ 1 2025-05-07T19:48:30.1444918Z #define __FLT_MANT_DIG__ 24 2025-05-07T19:48:30.1445405Z #define __FLT_MAX_10_EXP__ 38 2025-05-07T19:48:30.1445839Z #define __FLT_MAX_EXP__ 128 2025-05-07T19:48:30.1446148Z #define __FLT_MAX__ 3.40282347e+38F 2025-05-07T19:48:30.1458572Z #define __FLT_MIN_10_EXP__ (-37) 2025-05-07T19:48:30.1458969Z #define __FLT_MIN_EXP__ (-125) 
2025-05-07T19:48:30.1459238Z #define __FLT_MIN__ 1.17549435e-38F 2025-05-07T19:48:30.1459511Z #define __FLT_RADIX__ 2 2025-05-07T19:48:30.1459755Z #define __FSBLKCNT64_T_TYPE __UQUAD_TYPE 2025-05-07T19:48:30.1460104Z #define __FSBLKCNT_T_TYPE __SYSCALL_ULONG_TYPE 2025-05-07T19:48:30.1460460Z #define __FSFILCNT64_T_TYPE __UQUAD_TYPE 2025-05-07T19:48:30.1460807Z #define __FSFILCNT_T_TYPE __SYSCALL_ULONG_TYPE 2025-05-07T19:48:30.1461144Z #define __FSID_T_TYPE struct { int __val[2]; } 2025-05-07T19:48:30.1461469Z #define __FSWORD_T_TYPE __SYSCALL_SLONG_TYPE 2025-05-07T19:48:30.1461774Z #define __FXSR__ 1 2025-05-07T19:48:30.1461991Z #define __GCC_ASM_FLAG_OUTPUTS__ 1 2025-05-07T19:48:30.1462312Z #define __GCC_ATOMIC_BOOL_LOCK_FREE 2 2025-05-07T19:48:30.1462632Z #define __GCC_ATOMIC_CHAR16_T_LOCK_FREE 2 2025-05-07T19:48:30.1462967Z #define __GCC_ATOMIC_CHAR32_T_LOCK_FREE 2 2025-05-07T19:48:30.1463279Z #define __GCC_ATOMIC_CHAR_LOCK_FREE 2 2025-05-07T19:48:30.1463588Z #define __GCC_ATOMIC_INT_LOCK_FREE 2 2025-05-07T19:48:30.1463866Z #define __GCC_ATOMIC_LLONG_LOCK_FREE 2 2025-05-07T19:48:30.1464154Z #define __GCC_ATOMIC_LONG_LOCK_FREE 2 2025-05-07T19:48:30.1464445Z #define __GCC_ATOMIC_POINTER_LOCK_FREE 2 2025-05-07T19:48:30.1464737Z #define __GCC_ATOMIC_SHORT_LOCK_FREE 2 2025-05-07T19:48:30.1465082Z #define __GCC_ATOMIC_TEST_AND_SET_TRUEVAL 1 2025-05-07T19:48:30.1465398Z #define __GCC_ATOMIC_WCHAR_T_LOCK_FREE 2 2025-05-07T19:48:30.1465737Z #define __GCC_HAVE_DWARF2_CFI_ASM 1 2025-05-07T19:48:30.1466044Z #define __GCC_HAVE_SYNC_COMPARE_AND_SWAP_1 1 2025-05-07T19:48:30.1466392Z #define __GCC_HAVE_SYNC_COMPARE_AND_SWAP_2 1 2025-05-07T19:48:30.1466716Z #define __GCC_HAVE_SYNC_COMPARE_AND_SWAP_4 1 2025-05-07T19:48:30.1467038Z #define __GCC_HAVE_SYNC_COMPARE_AND_SWAP_8 1 2025-05-07T19:48:30.1467346Z #define __GID_T_TYPE __U32_TYPE 2025-05-07T19:48:30.1467601Z #define __GLIBCXX_BITSIZE_INT_N_0 128 2025-05-07T19:48:30.1467913Z #define __GLIBCXX_TYPE_INT_N_0 __int128 
2025-05-07T19:48:30.1468194Z #define __GLIBCXX__ 20230528 2025-05-07T19:48:30.1468447Z #define __GLIBC_HAVE_LONG_LONG 1 2025-05-07T19:48:30.1468697Z #define __GLIBC_MINOR__ 17 2025-05-07T19:48:30.1469106Z #define __GLIBC_PREREQ(maj,min) ((__GLIBC__ << 16) + __GLIBC_MINOR__ >= ((maj) << 16) + (min)) 2025-05-07T19:48:30.1469674Z #define __GLIBC__ 2 2025-05-07T19:48:30.1469923Z #define __GNUC_GNU_INLINE__ 1 2025-05-07T19:48:30.1470180Z #define __GNUC_MINOR__ 2 2025-05-07T19:48:30.1470463Z #define __GNUC_PATCHLEVEL__ 1 2025-05-07T19:48:30.1470888Z #define __GNUC_PREREQ(maj,min) ((__GNUC__ << 16) + __GNUC_MINOR__ >= ((maj) << 16) + (min)) 2025-05-07T19:48:30.1471316Z #define __GNUC_VA_LIST 2025-05-07T19:48:30.1471573Z #define __GNUC__ 4 2025-05-07T19:48:30.1471784Z #define __GNUG__ 4 2025-05-07T19:48:30.1472035Z #define __GNU_LIBRARY__ 6 2025-05-07T19:48:30.1472289Z #define __GXX_ABI_VERSION 1002 2025-05-07T19:48:30.1472556Z #define __GXX_EXPERIMENTAL_CXX0X__ 1 2025-05-07T19:48:30.1472925Z #define __GXX_RTTI 1 2025-05-07T19:48:30.1473329Z #define __GXX_WEAK__ 1 2025-05-07T19:48:30.1473561Z #define __HAVE_COLUMN 2025-05-07T19:48:30.1473810Z #define __HOST_CONFIG_H__ 2025-05-07T19:48:30.1474095Z #define __HOST_DEFINES_H__ 2025-05-07T19:48:30.1474349Z #define __ID_T_TYPE __U32_TYPE 2025-05-07T19:48:30.1474635Z #define __INO64_T_TYPE __UQUAD_TYPE 2025-05-07T19:48:30.1474930Z #define __INO_T_MATCHES_INO64_T 1 2025-05-07T19:48:30.1475235Z #define __INO_T_TYPE __SYSCALL_ULONG_TYPE 2025-05-07T19:48:30.1475553Z #define __INT16_C_SUFFIX__ 2025-05-07T19:48:30.1475852Z #define __INT16_FMTd__ "hd" 2025-05-07T19:48:30.1476113Z #define __INT16_FMTi__ "hi" 2025-05-07T19:48:30.1476369Z #define __INT16_MAX__ 32767 2025-05-07T19:48:30.1476628Z #define __INT16_TYPE__ short 2025-05-07T19:48:30.1476923Z #define __INT32_C_SUFFIX__ 2025-05-07T19:48:30.1477283Z #define __INT32_FMTd__ "d" 2025-05-07T19:48:30.1477533Z #define __INT32_FMTi__ "i" 2025-05-07T19:48:30.1477799Z #define __INT32_MAX__ 
2147483647 2025-05-07T19:48:30.1478065Z #define __INT32_TYPE__ int 2025-05-07T19:48:30.1478337Z #define __INT64_C_SUFFIX__ L 2025-05-07T19:48:30.1478599Z #define __INT64_FMTd__ "ld" 2025-05-07T19:48:30.1478900Z #define __INT64_FMTi__ "li" 2025-05-07T19:48:30.1479167Z #define __INT64_MAX__ 9223372036854775807L 2025-05-07T19:48:30.1479492Z #define __INT64_TYPE__ long int 2025-05-07T19:48:30.1479765Z #define __INT8_C_SUFFIX__ 2025-05-07T19:48:30.1480047Z #define __INT8_FMTd__ "hhd" 2025-05-07T19:48:30.1480332Z #define __INT8_FMTi__ "hhi" 2025-05-07T19:48:30.1480575Z #define __INT8_MAX__ 127 2025-05-07T19:48:30.1480849Z #define __INT8_TYPE__ signed char 2025-05-07T19:48:30.1481120Z #define __INTMAX_C_SUFFIX__ L 2025-05-07T19:48:30.1481421Z #define __INTMAX_FMTd__ "ld" 2025-05-07T19:48:30.1481680Z #define __INTMAX_FMTi__ "li" 2025-05-07T19:48:30.1481985Z #define __INTMAX_MAX__ 9223372036854775807L 2025-05-07T19:48:30.1482309Z #define __INTMAX_TYPE__ long int 2025-05-07T19:48:30.1482609Z #define __INTMAX_WIDTH__ 64 2025-05-07T19:48:30.1482886Z #define __INTPTR_FMTd__ "ld" 2025-05-07T19:48:30.1483160Z #define __INTPTR_FMTi__ "li" 2025-05-07T19:48:30.1483453Z #define __INTPTR_MAX__ 9223372036854775807L 2025-05-07T19:48:30.1483764Z #define __INTPTR_TYPE__ long int 2025-05-07T19:48:30.1484049Z #define __INTPTR_WIDTH__ 64 2025-05-07T19:48:30.1484335Z #define __INT_FAST16_FMTd__ "hd" 2025-05-07T19:48:30.1484646Z #define __INT_FAST16_FMTi__ "hi" 2025-05-07T19:48:30.1484931Z #define __INT_FAST16_MAX__ 32767 2025-05-07T19:48:30.1485228Z #define __INT_FAST16_TYPE__ short 2025-05-07T19:48:30.1485427Z #define __INT_FAST16_WIDTH__ 16 2025-05-07T19:48:30.1485531Z #define __INT_FAST32_FMTd__ "d" 2025-05-07T19:48:30.1485630Z #define __INT_FAST32_FMTi__ "i" 2025-05-07T19:48:30.1485724Z #define __INT_FAST32_MAX__ 2147483647 2025-05-07T19:48:30.1485811Z #define __INT_FAST32_TYPE__ int 2025-05-07T19:48:30.1485931Z #define __INT_FAST32_WIDTH__ 32 2025-05-07T19:48:30.1486015Z #define 
__INT_FAST64_FMTd__ "ld" 2025-05-07T19:48:30.1486115Z #define __INT_FAST64_FMTi__ "li" 2025-05-07T19:48:30.1486243Z #define __INT_FAST64_MAX__ 9223372036854775807L 2025-05-07T19:48:30.1486369Z #define __INT_FAST64_TYPE__ long int 2025-05-07T19:48:30.1486470Z #define __INT_FAST64_WIDTH__ 64 2025-05-07T19:48:30.1486569Z #define __INT_FAST8_FMTd__ "hhd" 2025-05-07T19:48:30.1486698Z #define __INT_FAST8_FMTi__ "hhi" 2025-05-07T19:48:30.1486852Z #define __INT_FAST8_MAX__ 127 2025-05-07T19:48:30.1486965Z #define __INT_FAST8_TYPE__ signed char 2025-05-07T19:48:30.1487070Z #define __INT_FAST8_WIDTH__ 8 2025-05-07T19:48:30.1487205Z #define __INT_LEAST16_FMTd__ "hd" 2025-05-07T19:48:30.1487308Z #define __INT_LEAST16_FMTi__ "hi" 2025-05-07T19:48:30.1487414Z #define __INT_LEAST16_MAX__ 32767 2025-05-07T19:48:30.1487553Z #define __INT_LEAST16_TYPE__ short 2025-05-07T19:48:30.1487654Z #define __INT_LEAST16_WIDTH__ 16 2025-05-07T19:48:30.1487756Z #define __INT_LEAST32_FMTd__ "d" 2025-05-07T19:48:30.1487860Z #define __INT_LEAST32_FMTi__ "i" 2025-05-07T19:48:30.1488002Z #define __INT_LEAST32_MAX__ 2147483647 2025-05-07T19:48:30.1488103Z #define __INT_LEAST32_TYPE__ int 2025-05-07T19:48:30.1488204Z #define __INT_LEAST32_WIDTH__ 32 2025-05-07T19:48:30.1488333Z #define __INT_LEAST64_FMTd__ "ld" 2025-05-07T19:48:30.1488428Z #define __INT_LEAST64_FMTi__ "li" 2025-05-07T19:48:30.1488553Z #define __INT_LEAST64_MAX__ 9223372036854775807L 2025-05-07T19:48:30.1488662Z #define __INT_LEAST64_TYPE__ long int 2025-05-07T19:48:30.1488796Z #define __INT_LEAST64_WIDTH__ 64 2025-05-07T19:48:30.1488887Z #define __INT_LEAST8_FMTd__ "hhd" 2025-05-07T19:48:30.1488986Z #define __INT_LEAST8_FMTi__ "hhi" 2025-05-07T19:48:30.1489120Z #define __INT_LEAST8_MAX__ 127 2025-05-07T19:48:30.1489232Z #define __INT_LEAST8_TYPE__ signed char 2025-05-07T19:48:30.1489332Z #define __INT_LEAST8_WIDTH__ 8 2025-05-07T19:48:30.1489424Z #define __INT_MAX__ 2147483647 2025-05-07T19:48:30.1489600Z #define __INT_WIDTH__ 32 
2025-05-07T19:48:30.1489696Z #define __KERNEL_STRICT_NAMES 2025-05-07T19:48:30.1489790Z #define __KEY_T_TYPE __S32_TYPE 2025-05-07T19:48:30.1489919Z #define __LDBL_DECIMAL_DIG__ 21 2025-05-07T19:48:30.1490069Z #define __LDBL_DENORM_MIN__ 3.64519953188247460253e-4951L 2025-05-07T19:48:30.1490157Z #define __LDBL_DIG__ 18 2025-05-07T19:48:30.1490290Z #define __LDBL_EPSILON__ 1.08420217248550443401e-19L 2025-05-07T19:48:30.1490414Z #define __LDBL_HAS_DENORM__ 1 2025-05-07T19:48:30.1490511Z #define __LDBL_HAS_INFINITY__ 1 2025-05-07T19:48:30.1490606Z #define __LDBL_HAS_QUIET_NAN__ 1 2025-05-07T19:48:30.1490737Z #define __LDBL_MANT_DIG__ 64 2025-05-07T19:48:30.1490844Z #define __LDBL_MAX_10_EXP__ 4932 2025-05-07T19:48:30.1490939Z #define __LDBL_MAX_EXP__ 16384 2025-05-07T19:48:30.1491058Z #define __LDBL_MAX__ 1.18973149535723176502e+4932L 2025-05-07T19:48:30.1491190Z #define __LDBL_MIN_10_EXP__ (-4931) 2025-05-07T19:48:30.1491299Z #define __LDBL_MIN_EXP__ (-16381) 2025-05-07T19:48:30.1491421Z #define __LDBL_MIN__ 3.36210314311209350626e-4932L 2025-05-07T19:48:30.1491569Z #define __LDBL_REDIR(name,proto) name proto 2025-05-07T19:48:30.1491713Z #define __LDBL_REDIR1(name,proto,alias) name proto 2025-05-07T19:48:30.1491895Z #define __LDBL_REDIR1_NTH(name,proto,alias) name proto __THROW 2025-05-07T19:48:30.1492026Z #define __LDBL_REDIR_DECL(name) 2025-05-07T19:48:30.1492178Z #define __LDBL_REDIR_NTH(name,proto) name proto __THROW 2025-05-07T19:48:30.1492269Z #define __LEAF 2025-05-07T19:48:30.1492361Z #define __LEAF_ATTR 2025-05-07T19:48:30.1492487Z #define __LIBRARY_TYPES_H__ 2025-05-07T19:48:30.1492585Z #define __LITTLE_ENDIAN 1234 2025-05-07T19:48:30.1492689Z #define __LITTLE_ENDIAN__ 1 2025-05-07T19:48:30.1492813Z #define __LLONG_WIDTH__ 64 2025-05-07T19:48:30.1492929Z #define __LONG_LONG_MAX__ 9223372036854775807LL 2025-05-07T19:48:30.1493044Z #define __LONG_LONG_PAIR(HI,LO) LO, HI 2025-05-07T19:48:30.1493153Z #define __LONG_MAX__ 9223372036854775807L 
2025-05-07T19:48:30.1493271Z #define __LONG_WIDTH__ 64 2025-05-07T19:48:30.1493364Z #define __LP64__ 1 2025-05-07T19:48:30.1493692Z #define __MATHCALLX(function,suffix,args,attrib) __MATHDECLX (_Mdouble_,function,suffix, args, attrib) 2025-05-07T19:48:30.1494339Z #define __MATHDECLX(type,function,suffix,args,attrib) __MATHDECL_1(type, function,suffix, args) __attribute__ (attrib); __MATHDECL_1(type, __CONCAT(__,function),suffix, args) __attribute__ (attrib) 2025-05-07T19:48:30.1494440Z #define __MATH_DECLARE_LDOUBLE 1 2025-05-07T19:48:30.1494595Z #define __MATH_FUNCTIONS_HPP__ 2025-05-07T19:48:30.1494722Z #define __MATH_FUNCTIONS_H__ 2025-05-07T19:48:30.1494799Z #define __MMX__ 1 2025-05-07T19:48:30.1494904Z #define __MODE_T_TYPE __U32_TYPE 2025-05-07T19:48:30.1495003Z #define __N(msgid) (msgid) 2025-05-07T19:48:30.1495151Z #define __NFDBITS (8 * (int) sizeof (__fd_mask)) 2025-05-07T19:48:30.1495264Z #define __NLINK_T_TYPE __SYSCALL_ULONG_TYPE 2025-05-07T19:48:30.1495355Z #define __NO_CTYPE 1 2025-05-07T19:48:30.1495470Z #define __NO_INLINE__ 1 2025-05-07T19:48:30.1495567Z #define __NO_MATH_INLINES 1 2025-05-07T19:48:30.1495682Z #define __NTH(fct) __LEAF_ATTR fct throw () 2025-05-07T19:48:30.1495793Z #define __NVCC_DIAG_PRAGMA_SUPPORT__ 1 2025-05-07T19:48:30.1495899Z #define __NVCC__ 1 2025-05-07T19:48:30.1496000Z #define __NV_GLIBCXX_VERSION 40800 2025-05-07T19:48:30.1496105Z #define __NV_NO_HOST_COMPILER_CHECK 1 2025-05-07T19:48:30.1496237Z #define __OBJC_BOOL_IS_BOOL 0 2025-05-07T19:48:30.1496340Z #define __OFF64_T_TYPE __SQUAD_TYPE 2025-05-07T19:48:30.1496436Z #define __OFF_T_MATCHES_OFF64_T 1 2025-05-07T19:48:30.1496552Z #define __OFF_T_TYPE __SYSCALL_SLONG_TYPE 2025-05-07T19:48:30.1496703Z #define __OPENCL_MEMORY_SCOPE_ALL_SVM_DEVICES 3 2025-05-07T19:48:30.1496806Z #define __OPENCL_MEMORY_SCOPE_DEVICE 2 2025-05-07T19:48:30.1496923Z #define __OPENCL_MEMORY_SCOPE_SUB_GROUP 4 2025-05-07T19:48:30.1497068Z #define __OPENCL_MEMORY_SCOPE_WORK_GROUP 1 
2025-05-07T19:48:30.1497180Z #define __OPENCL_MEMORY_SCOPE_WORK_ITEM 0 2025-05-07T19:48:30.1497351Z #define __ORDER_BIG_ENDIAN__ 4321 2025-05-07T19:48:30.1497460Z #define __ORDER_LITTLE_ENDIAN__ 1234 2025-05-07T19:48:30.1497596Z #define __ORDER_PDP_ENDIAN__ 3412 2025-05-07T19:48:30.1497679Z #define __P(args) args 2025-05-07T19:48:30.1497776Z #define __PDP_ENDIAN 3412 2025-05-07T19:48:30.1497890Z #define __PIC__ 2 2025-05-07T19:48:30.1497989Z #define __PID_T_TYPE __S32_TYPE 2025-05-07T19:48:30.1498070Z #define __PIE__ 2 2025-05-07T19:48:30.1498163Z #define __PMT(args) args 2025-05-07T19:48:30.1498291Z #define __POINTER_WIDTH__ 64 2025-05-07T19:48:30.1498397Z #define __PRAGMA_REDEFINE_EXTNAME 1 2025-05-07T19:48:30.1498505Z #define __PTHREAD_MUTEX_HAVE_PREV 1 2025-05-07T19:48:30.1498667Z #define __PTHREAD_RWLOCK_INT_FLAGS_SHARED 1 2025-05-07T19:48:30.1498769Z #define __PTHREAD_SPINS 0, 0 2025-05-07T19:48:30.1498869Z #define __PTRDIFF_FMTd__ "ld" 2025-05-07T19:48:30.1498960Z #define __PTRDIFF_FMTi__ "li" 2025-05-07T19:48:30.1499101Z #define __PTRDIFF_MAX__ 9223372036854775807L 2025-05-07T19:48:30.1499202Z #define __PTRDIFF_TYPE__ long int 2025-05-07T19:48:30.1499296Z #define __PTRDIFF_WIDTH__ 64 2025-05-07T19:48:30.1499544Z #define __REDIRECT(name,proto,alias) name proto __asm__ (__ASMNAME (#alias)) 2025-05-07T19:48:30.1499960Z #define __REDIRECT_LDBL(name,proto,alias) __REDIRECT (name, proto, alias) 2025-05-07T19:48:30.1500218Z #define __REDIRECT_NTH(name,proto,alias) name proto __THROW __asm__ (__ASMNAME (#alias)) 2025-05-07T19:48:30.1500526Z #define __REDIRECT_NTHNL(name,proto,alias) name proto __THROWNL __asm__ (__ASMNAME (#alias)) 2025-05-07T19:48:30.1500769Z #define __REDIRECT_NTH_LDBL(name,proto,alias) __REDIRECT_NTH (name, proto, alias) 2025-05-07T19:48:30.1500874Z #define __REGISTER_PREFIX__ 2025-05-07T19:48:30.1501011Z #define __RLIM64_T_TYPE __UQUAD_TYPE 2025-05-07T19:48:30.1501129Z #define __RLIM_T_TYPE __SYSCALL_ULONG_TYPE 2025-05-07T19:48:30.1501226Z 
#define __S16_TYPE short int 2025-05-07T19:48:30.1501318Z #define __S32_TYPE int 2025-05-07T19:48:30.1501443Z #define __S64_TYPE long int 2025-05-07T19:48:30.1501536Z #define __SCHAR_MAX__ 127 2025-05-07T19:48:30.1501621Z #define __SEG_FS 1 2025-05-07T19:48:30.1501738Z #define __SEG_GS 1 2025-05-07T19:48:30.1501836Z #define __SHRT_MAX__ 32767 2025-05-07T19:48:30.1501927Z #define __SHRT_WIDTH__ 16 2025-05-07T19:48:30.1502174Z #define __SIG_ATOMIC_MAX__ 2147483647 2025-05-07T19:48:30.1502313Z #define __SIG_ATOMIC_WIDTH__ 32 2025-05-07T19:48:30.1502574Z #define __SIZEOF_DOUBLE__ 8 2025-05-07T19:48:30.1502680Z #define __SIZEOF_FLOAT128__ 16 2025-05-07T19:48:30.1502872Z #define __SIZEOF_FLOAT__ 4 2025-05-07T19:48:30.1502985Z #define __SIZEOF_INT128__ 16 2025-05-07T19:48:30.1503148Z #define __SIZEOF_INT__ 4 2025-05-07T19:48:30.1503251Z #define __SIZEOF_LONG_DOUBLE__ 16 2025-05-07T19:48:30.1503369Z #define __SIZEOF_LONG_LONG__ 8 2025-05-07T19:48:30.1503468Z #define __SIZEOF_LONG__ 8 2025-05-07T19:48:30.1503573Z #define __SIZEOF_POINTER__ 8 2025-05-07T19:48:30.1503702Z #define __SIZEOF_PTHREAD_ATTR_T 56 2025-05-07T19:48:30.1503821Z #define __SIZEOF_PTHREAD_BARRIERATTR_T 4 2025-05-07T19:48:30.1503938Z #define __SIZEOF_PTHREAD_BARRIER_T 32 2025-05-07T19:48:30.1504076Z #define __SIZEOF_PTHREAD_CONDATTR_T 4 2025-05-07T19:48:30.1504181Z #define __SIZEOF_PTHREAD_COND_T 48 2025-05-07T19:48:30.1504295Z #define __SIZEOF_PTHREAD_MUTEXATTR_T 4 2025-05-07T19:48:30.1504408Z #define __SIZEOF_PTHREAD_MUTEX_T 40 2025-05-07T19:48:30.1504550Z #define __SIZEOF_PTHREAD_RWLOCKATTR_T 8 2025-05-07T19:48:30.1504660Z #define __SIZEOF_PTHREAD_RWLOCK_T 56 2025-05-07T19:48:30.1504766Z #define __SIZEOF_PTRDIFF_T__ 8 2025-05-07T19:48:30.1504885Z #define __SIZEOF_SHORT__ 2 2025-05-07T19:48:30.1504985Z #define __SIZEOF_SIZE_T__ 8 2025-05-07T19:48:30.1505093Z #define __SIZEOF_WCHAR_T__ 4 2025-05-07T19:48:30.1505191Z #define __SIZEOF_WINT_T__ 4 2025-05-07T19:48:30.1505320Z #define __SIZE_FMTX__ "lX" 
2025-05-07T19:48:30.1505417Z #define __SIZE_FMTo__ "lo" 2025-05-07T19:48:30.1505518Z #define __SIZE_FMTu__ "lu" 2025-05-07T19:48:30.1505635Z #define __SIZE_FMTx__ "lx" 2025-05-07T19:48:30.1506562Z #define __SIZE_MAX__ 18446744073709551615UL 2025-05-07T19:48:30.1506677Z #define __SIZE_TYPE__ long unsigned int 2025-05-07T19:48:30.1506781Z #define __SIZE_WIDTH__ 64 2025-05-07T19:48:30.1506905Z #define __SLONG32_TYPE int 2025-05-07T19:48:30.1507011Z #define __SLONGWORD_TYPE long int 2025-05-07T19:48:30.1507123Z #define __SM_20_ATOMIC_FUNCTIONS_HPP__ 2025-05-07T19:48:30.1507244Z #define __SM_20_ATOMIC_FUNCTIONS_H__ 2025-05-07T19:48:30.1507354Z #define __SM_20_INTRINSICS_HPP__ 2025-05-07T19:48:30.1507459Z #define __SM_20_INTRINSICS_H__ 2025-05-07T19:48:30.1507573Z #define __SM_30_INTRINSICS_HPP__ 2025-05-07T19:48:30.1507670Z #define __SM_30_INTRINSICS_H__ 2025-05-07T19:48:30.1507781Z #define __SM_32_ATOMIC_FUNCTIONS_HPP__ 2025-05-07T19:48:30.1507887Z #define __SM_32_ATOMIC_FUNCTIONS_H__ 2025-05-07T19:48:30.1508016Z #define __SM_32_INTRINSICS_HPP__ 2025-05-07T19:48:30.1508111Z #define __SM_32_INTRINSICS_H__ 2025-05-07T19:48:30.1508210Z #define __SM_35_ATOMIC_FUNCTIONS_H__ 2025-05-07T19:48:30.1508338Z #define __SM_35_INTRINSICS_H__ 2025-05-07T19:48:30.1508458Z #define __SM_60_ATOMIC_FUNCTIONS_HPP__ 2025-05-07T19:48:30.1508563Z #define __SM_60_ATOMIC_FUNCTIONS_H__ 2025-05-07T19:48:30.1508659Z #define __SM_61_INTRINSICS_HPP__ 2025-05-07T19:48:30.1508775Z #define __SM_61_INTRINSICS_H__ 2025-05-07T19:48:30.1508865Z #define __SM_70_RT_HPP__ 2025-05-07T19:48:30.1508953Z #define __SM_70_RT_H__ 2025-05-07T19:48:30.1509070Z #define __SM_80_RT_HPP__ 2025-05-07T19:48:30.1509171Z #define __SM_80_RT_H__ 2025-05-07T19:48:30.1509262Z #define __SM_90_RT_HPP__ 2025-05-07T19:48:30.1509344Z #define __SM_90_RT_H__ 2025-05-07T19:48:30.1509470Z #define __SQUAD_TYPE long int 2025-05-07T19:48:30.1509567Z #define __SSE2_MATH__ 1 2025-05-07T19:48:30.1509649Z #define __SSE2__ 1 
2025-05-07T19:48:30.1509743Z #define __SSE_MATH__ 1 2025-05-07T19:48:30.1509831Z #define __SSE__ 1 2025-05-07T19:48:30.1509936Z #define __SSIZE_T_TYPE __SWORD_TYPE 2025-05-07T19:48:30.1510075Z #define __STDCPP_DEFAULT_NEW_ALIGNMENT__ 16UL 2025-05-07T19:48:30.1510214Z #define __STDCPP_MATH_SPEC_FUNCS__ 201003L 2025-05-07T19:48:30.1510311Z #define __STDCPP_THREADS__ 1 2025-05-07T19:48:30.1510398Z #define __STDC_HOSTED__ 1 2025-05-07T19:48:30.1510513Z #define __STDC_IEC_559_COMPLEX__ 1 2025-05-07T19:48:30.1510607Z #define __STDC_IEC_559__ 1 2025-05-07T19:48:30.1510708Z #define __STDC_ISO_10646__ 201103L 2025-05-07T19:48:30.1510809Z #define __STDC_NO_THREADS__ 1 2025-05-07T19:48:30.1510916Z #define __STDC_UTF_16__ 1 2025-05-07T19:48:30.1511081Z #define __STDC_UTF_32__ 1 2025-05-07T19:48:30.1511162Z #define __STDC__ 1 2025-05-07T19:48:30.1511271Z #define __STDDEF_H 2025-05-07T19:48:30.1511372Z #define __STRING(x) #x 2025-05-07T19:48:30.1511488Z #define __SURFACE_FUNCTIONS_H__ 2025-05-07T19:48:30.1511608Z #define __SURFACE_INDIRECT_FUNCTIONS_H__ 2025-05-07T19:48:30.1511728Z #define __SURFACE_TYPES_H__ 2025-05-07T19:48:30.1511871Z #define __SUSECONDS_T_TYPE __SYSCALL_SLONG_TYPE 2025-05-07T19:48:30.1511977Z #define __SWORD_TYPE long int 2025-05-07T19:48:30.1512131Z #define __SYSCALL_SLONG_TYPE __SLONGWORD_TYPE 2025-05-07T19:48:30.1512253Z #define __SYSCALL_ULONG_TYPE __ULONGWORD_TYPE 2025-05-07T19:48:30.1512359Z #define __SYSCALL_WORDSIZE 64 2025-05-07T19:48:30.1512472Z #define __TEXTURE_FETCH_FUNCTIONS_H__ 2025-05-07T19:48:30.1512655Z #define __TEXTURE_INDIRECT_FUNCTIONS_H__ 2025-05-07T19:48:30.1512764Z #define __TEXTURE_TYPES_H__ 2025-05-07T19:48:30.1512853Z #define __THROW throw () 2025-05-07T19:48:30.1512965Z #define __THROWNL throw () 2025-05-07T19:48:30.1513060Z #define __TIMER_T_TYPE void * 2025-05-07T19:48:30.1513181Z #define __TIME_T_TYPE __SYSCALL_SLONG_TYPE 2025-05-07T19:48:30.1513283Z #define __U16_TYPE unsigned short int 2025-05-07T19:48:30.1513402Z #define 
__U32_TYPE unsigned int 2025-05-07T19:48:30.1513502Z #define __U64_TYPE unsigned long int 2025-05-07T19:48:30.1513597Z #define __UID_T_TYPE __U32_TYPE 2025-05-07T19:48:30.1513711Z #define __UINT16_C_SUFFIX__ 2025-05-07T19:48:30.1513803Z #define __UINT16_FMTX__ "hX" 2025-05-07T19:48:30.1513948Z #define __UINT16_FMTo__ "ho" 2025-05-07T19:48:30.1514041Z #define __UINT16_FMTu__ "hu" 2025-05-07T19:48:30.1514140Z #define __UINT16_FMTx__ "hx" 2025-05-07T19:48:30.1514231Z #define __UINT16_MAX__ 65535 2025-05-07T19:48:30.1514333Z #define __UINT16_TYPE__ unsigned short 2025-05-07T19:48:30.1514446Z #define __UINT32_C_SUFFIX__ U 2025-05-07T19:48:30.1514536Z #define __UINT32_FMTX__ "X" 2025-05-07T19:48:30.1514634Z #define __UINT32_FMTo__ "o" 2025-05-07T19:48:30.1514727Z #define __UINT32_FMTu__ "u" 2025-05-07T19:48:30.1514828Z #define __UINT32_FMTx__ "x" 2025-05-07T19:48:30.1514924Z #define __UINT32_MAX__ 4294967295U 2025-05-07T19:48:30.1515029Z #define __UINT32_TYPE__ unsigned int 2025-05-07T19:48:30.1515143Z #define __UINT64_C_SUFFIX__ UL 2025-05-07T19:48:30.1515233Z #define __UINT64_FMTX__ "lX" 2025-05-07T19:48:30.1515329Z #define __UINT64_FMTo__ "lo" 2025-05-07T19:48:30.1515419Z #define __UINT64_FMTu__ "lu" 2025-05-07T19:48:30.1515522Z #define __UINT64_FMTx__ "lx" 2025-05-07T19:48:30.1515639Z #define __UINT64_MAX__ 18446744073709551615UL 2025-05-07T19:48:30.1515748Z #define __UINT64_TYPE__ long unsigned int 2025-05-07T19:48:30.1515856Z #define __UINT8_C_SUFFIX__ 2025-05-07T19:48:30.1515945Z #define __UINT8_FMTX__ "hhX" 2025-05-07T19:48:30.1516033Z #define __UINT8_FMTo__ "hho" 2025-05-07T19:48:30.1516124Z #define __UINT8_FMTu__ "hhu" 2025-05-07T19:48:30.1516238Z #define __UINT8_FMTx__ "hhx" 2025-05-07T19:48:30.1516334Z #define __UINT8_MAX__ 255 2025-05-07T19:48:30.1516439Z #define __UINT8_TYPE__ unsigned char 2025-05-07T19:48:30.1516565Z #define __UINTMAX_C_SUFFIX__ UL 2025-05-07T19:48:30.1516664Z #define __UINTMAX_FMTX__ "lX" 2025-05-07T19:48:30.1516766Z #define 
__UINTMAX_FMTo__ "lo" 2025-05-07T19:48:30.1516858Z #define __UINTMAX_FMTu__ "lu" 2025-05-07T19:48:30.1516958Z #define __UINTMAX_FMTx__ "lx" 2025-05-07T19:48:30.1517070Z #define __UINTMAX_MAX__ 18446744073709551615UL 2025-05-07T19:48:30.1517182Z #define __UINTMAX_TYPE__ long unsigned int 2025-05-07T19:48:30.1517284Z #define __UINTMAX_WIDTH__ 64 2025-05-07T19:48:30.1517380Z #define __UINTPTR_FMTX__ "lX" 2025-05-07T19:48:30.1517471Z #define __UINTPTR_FMTo__ "lo" 2025-05-07T19:48:30.1517563Z #define __UINTPTR_FMTu__ "lu" 2025-05-07T19:48:30.1517667Z #define __UINTPTR_FMTx__ "lx" 2025-05-07T19:48:30.1517780Z #define __UINTPTR_MAX__ 18446744073709551615UL 2025-05-07T19:48:30.1517891Z #define __UINTPTR_TYPE__ long unsigned int 2025-05-07T19:48:30.1517997Z #define __UINTPTR_WIDTH__ 64 2025-05-07T19:48:30.1518096Z #define __UINT_FAST16_FMTX__ "hX" 2025-05-07T19:48:30.1518245Z #define __UINT_FAST16_FMTo__ "ho" 2025-05-07T19:48:30.1518353Z #define __UINT_FAST16_FMTu__ "hu" 2025-05-07T19:48:30.1518446Z #define __UINT_FAST16_FMTx__ "hx" 2025-05-07T19:48:30.1518536Z #define __UINT_FAST16_MAX__ 65535 2025-05-07T19:48:30.1518649Z #define __UINT_FAST16_TYPE__ unsigned short 2025-05-07T19:48:30.1518758Z #define __UINT_FAST32_FMTX__ "X" 2025-05-07T19:48:30.1518851Z #define __UINT_FAST32_FMTo__ "o" 2025-05-07T19:48:30.1518945Z #define __UINT_FAST32_FMTu__ "u" 2025-05-07T19:48:30.1519067Z #define __UINT_FAST32_FMTx__ "x" 2025-05-07T19:48:30.1519169Z #define __UINT_FAST32_MAX__ 4294967295U 2025-05-07T19:48:30.1519273Z #define __UINT_FAST32_TYPE__ unsigned int 2025-05-07T19:48:30.1519366Z #define __UINT_FAST64_FMTX__ "lX" 2025-05-07T19:48:30.1519476Z #define __UINT_FAST64_FMTo__ "lo" 2025-05-07T19:48:30.1519566Z #define __UINT_FAST64_FMTu__ "lu" 2025-05-07T19:48:30.1519658Z #define __UINT_FAST64_FMTx__ "lx" 2025-05-07T19:48:30.1519799Z #define __UINT_FAST64_MAX__ 18446744073709551615UL 2025-05-07T19:48:30.1519920Z #define __UINT_FAST64_TYPE__ long unsigned int 
2025-05-07T19:48:30.1520017Z #define __UINT_FAST8_FMTX__ "hhX" 2025-05-07T19:48:30.1520110Z #define __UINT_FAST8_FMTo__ "hho" 2025-05-07T19:48:30.1520226Z #define __UINT_FAST8_FMTu__ "hhu" 2025-05-07T19:48:30.1520326Z #define __UINT_FAST8_FMTx__ "hhx" 2025-05-07T19:48:30.1520418Z #define __UINT_FAST8_MAX__ 255 2025-05-07T19:48:30.1520537Z #define __UINT_FAST8_TYPE__ unsigned char 2025-05-07T19:48:30.1520688Z #define __UINT_LEAST16_FMTX__ "hX" 2025-05-07T19:48:30.1520783Z #define __UINT_LEAST16_FMTo__ "ho" 2025-05-07T19:48:30.1520879Z #define __UINT_LEAST16_FMTu__ "hu" 2025-05-07T19:48:30.1520986Z #define __UINT_LEAST16_FMTx__ "hx" 2025-05-07T19:48:30.1521080Z #define __UINT_LEAST16_MAX__ 65535 2025-05-07T19:48:30.1521194Z #define __UINT_LEAST16_TYPE__ unsigned short 2025-05-07T19:48:30.1521305Z #define __UINT_LEAST32_FMTX__ "X" 2025-05-07T19:48:30.1521409Z #define __UINT_LEAST32_FMTo__ "o" 2025-05-07T19:48:30.1521509Z #define __UINT_LEAST32_FMTu__ "u" 2025-05-07T19:48:30.1521600Z #define __UINT_LEAST32_FMTx__ "x" 2025-05-07T19:48:30.1521717Z #define __UINT_LEAST32_MAX__ 4294967295U 2025-05-07T19:48:30.1521828Z #define __UINT_LEAST32_TYPE__ unsigned int 2025-05-07T19:48:30.1521924Z #define __UINT_LEAST64_FMTX__ "lX" 2025-05-07T19:48:30.1522033Z #define __UINT_LEAST64_FMTo__ "lo" 2025-05-07T19:48:30.1522132Z #define __UINT_LEAST64_FMTu__ "lu" 2025-05-07T19:48:30.1522228Z #define __UINT_LEAST64_FMTx__ "lx" 2025-05-07T19:48:30.1522368Z #define __UINT_LEAST64_MAX__ 18446744073709551615UL 2025-05-07T19:48:30.1522493Z #define __UINT_LEAST64_TYPE__ long unsigned int 2025-05-07T19:48:30.1522591Z #define __UINT_LEAST8_FMTX__ "hhX" 2025-05-07T19:48:30.1522688Z #define __UINT_LEAST8_FMTo__ "hho" 2025-05-07T19:48:30.1522814Z #define __UINT_LEAST8_FMTu__ "hhu" 2025-05-07T19:48:30.1522910Z #define __UINT_LEAST8_FMTx__ "hhx" 2025-05-07T19:48:30.1523006Z #define __UINT_LEAST8_MAX__ 255 2025-05-07T19:48:30.1523143Z #define __UINT_LEAST8_TYPE__ unsigned char 
2025-05-07T19:48:30.1523250Z #define __ULONG32_TYPE unsigned int 2025-05-07T19:48:30.1523379Z #define __ULONGWORD_TYPE unsigned long int 2025-05-07T19:48:30.1523493Z #define __UQUAD_TYPE unsigned long int 2025-05-07T19:48:30.1523627Z #define __USECONDS_T_TYPE __U32_TYPE 2025-05-07T19:48:30.1523738Z #define __USER_LABEL_PREFIX__ 2025-05-07T19:48:30.1523837Z #define __USE_ANSI 1 2025-05-07T19:48:30.1523942Z #define __USE_ATFILE 1 2025-05-07T19:48:30.1524022Z #define __USE_BSD 1 2025-05-07T19:48:30.1524118Z #define __USE_FORTIFY_LEVEL 0 2025-05-07T19:48:30.1524197Z #define __USE_GNU 1 2025-05-07T19:48:30.1524299Z #define __USE_ISOC11 1 2025-05-07T19:48:30.1524382Z #define __USE_ISOC95 1 2025-05-07T19:48:30.1524463Z #define __USE_ISOC99 1 2025-05-07T19:48:30.1524572Z #define __USE_ISOCXX11 1 2025-05-07T19:48:30.1524663Z #define __USE_LARGEFILE 1 2025-05-07T19:48:30.1524757Z #define __USE_LARGEFILE64 1 2025-05-07T19:48:30.1524838Z #define __USE_MISC 1 2025-05-07T19:48:30.1524990Z #define __USE_POSIX 1 2025-05-07T19:48:30.1525079Z #define __USE_POSIX199309 1 2025-05-07T19:48:30.1525275Z #define __USE_POSIX199506 1 2025-05-07T19:48:30.1525363Z #define __USE_POSIX2 1 2025-05-07T19:48:30.1525441Z #define __USE_SVID 1 2025-05-07T19:48:30.1525519Z #define __USE_UNIX98 1 2025-05-07T19:48:30.1525596Z #define __USE_XOPEN 1 2025-05-07T19:48:30.1525688Z #define __USE_XOPEN2K 1 2025-05-07T19:48:30.1525771Z #define __USE_XOPEN2K8 1 2025-05-07T19:48:30.1525856Z #define __USE_XOPEN2K8XSI 1 2025-05-07T19:48:30.1525957Z #define __USE_XOPEN2KXSI 1 2025-05-07T19:48:30.1526065Z #define __USE_XOPEN_EXTENDED 1 2025-05-07T19:48:30.1526175Z #define __USING_NAMESPACE_C99(name) 2025-05-07T19:48:30.1526270Z #define __USING_NAMESPACE_STD(name) 2025-05-07T19:48:30.1526386Z #define __UWORD_TYPE unsigned long int 2025-05-07T19:48:30.1526479Z #define __VECTOR_FUNCTIONS_HPP__ 2025-05-07T19:48:30.1526571Z #define __VECTOR_FUNCTIONS_H__ 2025-05-07T19:48:30.1526666Z #define __VECTOR_TYPES_H__ 
2025-05-07T19:48:30.1527105Z #define __VERSION__ "Clang 16.0.6 (https://github.com/conda-forge/clangdev-feedstock db6970f6bb85e49860ed8bab43ebf165b5c55cc4)" 2025-05-07T19:48:30.1527221Z #define __WAIT_INT(status) (*(int *) &(status)) 2025-05-07T19:48:30.1527322Z #define __WAIT_STATUS void * 2025-05-07T19:48:30.1527441Z #define __WAIT_STATUS_DEFN void * 2025-05-07T19:48:30.1527523Z #define __WALL 0x40000000 2025-05-07T19:48:30.1527622Z #define __WCHAR_MAX__ 2147483647 2025-05-07T19:48:30.1527724Z #define __WCHAR_TYPE__ int 2025-05-07T19:48:30.1527871Z #define __WCHAR_WIDTH__ 32 2025-05-07T19:48:30.1527960Z #define __WCLONE 0x80000000 2025-05-07T19:48:30.1528091Z #define __WCOREDUMP(status) ((status) & __WCOREFLAG) 2025-05-07T19:48:30.1528183Z #define __WCOREFLAG 0x80 2025-05-07T19:48:30.1528320Z #define __WEXITSTATUS(status) (((status) & 0xff00) >> 8) 2025-05-07T19:48:30.1528474Z #define __WIFCONTINUED(status) ((status) == __W_CONTINUED) 2025-05-07T19:48:30.1528621Z #define __WIFEXITED(status) (__WTERMSIG(status) == 0) 2025-05-07T19:48:30.1528837Z #define __WIFSIGNALED(status) (((signed char) (((status) & 0x7f) + 1) >> 1) > 0) 2025-05-07T19:48:30.1528977Z #define __WIFSTOPPED(status) (((status) & 0xff) == 0x7f) 2025-05-07T19:48:30.1529076Z #define __WINT_MAX__ 4294967295U 2025-05-07T19:48:30.1529168Z #define __WINT_TYPE__ unsigned int 2025-05-07T19:48:30.1529254Z #define __WINT_UNSIGNED__ 1 2025-05-07T19:48:30.1529339Z #define __WINT_WIDTH__ 32 2025-05-07T19:48:30.1529444Z #define __WNOTHREAD 0x20000000 2025-05-07T19:48:30.1529535Z #define __WORDSIZE 64 2025-05-07T19:48:30.1529649Z #define __WORDSIZE_TIME64_COMPAT32 1 2025-05-07T19:48:30.1529799Z #define __WSTOPSIG(status) __WEXITSTATUS(status) 2025-05-07T19:48:30.1529904Z #define __WTERMSIG(status) ((status) & 0x7f) 2025-05-07T19:48:30.1529999Z #define __W_CONTINUED 0xffff 2025-05-07T19:48:30.1530130Z #define __W_EXITCODE(ret,sig) ((ret) << 8 | (sig)) 2025-05-07T19:48:30.1530263Z #define __W_STOPCODE(sig) ((sig) << 
8 | 0x7f) 2025-05-07T19:48:30.1530354Z #define ____FILE_defined 1 2025-05-07T19:48:30.1530450Z #define ____mbstate_t_defined 1 2025-05-07T19:48:30.1530579Z #define __align__(n) __attribute__((aligned(n))) 2025-05-07T19:48:30.1530755Z #define __always_inline __inline __attribute__ ((__always_inline__)) 2025-05-07T19:48:30.1530831Z #define __amd64 1 2025-05-07T19:48:30.1530909Z #define __amd64__ 1 2025-05-07T19:48:30.1531022Z #define __annotate__(a) __attribute__((a)) 2025-05-07T19:48:30.1531114Z #define __attribute_artificial__ 2025-05-07T19:48:30.1531253Z #define __attribute_const__ __attribute__ ((__const__)) 2025-05-07T19:48:30.1531448Z #define __attribute_deprecated__ __attribute__ ((__deprecated__)) 2025-05-07T19:48:30.1531638Z #define __attribute_format_arg__(x) __attribute__ ((__format_arg__ (x))) 2025-05-07T19:48:30.1531883Z #define __attribute_format_strfmon__(a,b) __attribute__ ((__format__ (__strfmon__, a, b))) 2025-05-07T19:48:30.1532039Z #define __attribute_malloc__ __attribute__ ((__malloc__)) 2025-05-07T19:48:30.1532194Z #define __attribute_noinline__ __attribute__ ((__noinline__)) 2025-05-07T19:48:30.1532373Z #define __attribute_pure__ __attribute__ ((__pure__)) 2025-05-07T19:48:30.1532503Z #define __attribute_used__ __attribute__ ((__used__)) 2025-05-07T19:48:30.1532763Z #define __attribute_warn_unused_result__ __attribute__ ((__warn_unused_result__)) 2025-05-07T19:48:30.1532864Z #define __blkcnt_t_defined 2025-05-07T19:48:30.1532958Z #define __blksize_t_defined 2025-05-07T19:48:30.1533166Z #define __bos(ptr) __builtin_object_size (ptr, __USE_FORTIFY_LEVEL > 1) 2025-05-07T19:48:30.1533293Z #define __bos0(ptr) __builtin_object_size (ptr, 0) 2025-05-07T19:48:30.1533369Z #define __bounded 2025-05-07T19:48:30.1533970Z #define __bswap_16(x) (__extension__ ({ unsigned short int __v, __x = (unsigned short int) (x); if (__builtin_constant_p (__x)) __v = __bswap_constant_16 (__x); else __asm__ ("rorw $8, %w0" : "=r" (__v) : "0" (__x) : "cc"); __v; })) 
2025-05-07T19:48:30.1534446Z #define __bswap_32(x) (__extension__ ({ unsigned int __v, __x = (x); if (__builtin_constant_p (__x)) __v = __bswap_constant_32 (__x); else __asm__ ("bswap %0" : "=r" (__v) : "0" (__x)); __v; })) 2025-05-07T19:48:30.1534914Z #define __bswap_64(x) (__extension__ ({ __uint64_t __v, __x = (x); if (__builtin_constant_p (__x)) __v = __bswap_constant_64 (__x); else __asm__ ("bswap %q0" : "=r" (__v) : "0" (__x)); __v; })) 2025-05-07T19:48:30.1535182Z #define __bswap_constant_16(x) ((unsigned short int) ((((x) >> 8) & 0xff) | (((x) & 0xff) << 8))) 2025-05-07T19:48:30.1535603Z #define __bswap_constant_32(x) ((((x) & 0xff000000) >> 24) | (((x) & 0x00ff0000) >> 8) | (((x) & 0x0000ff00) << 8) | (((x) & 0x000000ff) << 24)) 2025-05-07T19:48:30.1536535Z #define __bswap_constant_64(x) (__extension__ ((((x) & 0xff00000000000000ull) >> 56) | (((x) & 0x00ff000000000000ull) >> 40) | (((x) & 0x0000ff0000000000ull) >> 24) | (((x) & 0x000000ff00000000ull) >> 8) | (((x) & 0x00000000ff000000ull) << 8) | (((x) & 0x0000000000ff0000ull) << 24) | (((x) & 0x000000000000ff00ull) << 40) | (((x) & 0x00000000000000ffull) << 56))) 2025-05-07T19:48:30.1536651Z #define __builtin_align__(a) __align__(a) 2025-05-07T19:48:30.1536736Z #define __catch(X) catch(X) 2025-05-07T19:48:30.1536813Z #define __cdecl 2025-05-07T19:48:30.1536903Z #define __clang__ 1 2025-05-07T19:48:30.1537010Z #define __clang_literal_encoding__ "UTF-8" 2025-05-07T19:48:30.1537108Z #define __clang_major__ 16 2025-05-07T19:48:30.1537196Z #define __clang_minor__ 0 2025-05-07T19:48:30.1537302Z #define __clang_patchlevel__ 6 2025-05-07T19:48:30.1537711Z #define __clang_version__ "16.0.6 (https://github.com/conda-forge/clangdev-feedstock db6970f6bb85e49860ed8bab43ebf165b5c55cc4)" 2025-05-07T19:48:30.1537832Z #define __clang_wide_literal_encoding__ "UTF-32" 2025-05-07T19:48:30.1537935Z #define __clock_t_defined 1 2025-05-07T19:48:30.1538020Z #define __clockid_t_defined 1 2025-05-07T19:48:30.1538208Z #define 
__cluster_dims__(...) __attribute__((cluster_dims(__VA_ARGS__))) 2025-05-07T19:48:30.1538312Z #define __code_model_small__ 1 2025-05-07T19:48:30.1538421Z #define __constant__ __location__(constant) 2025-05-07T19:48:30.1538510Z #define __cplusplus 201703L 2025-05-07T19:48:30.1538620Z #define __cpp_aggregate_bases 201603L 2025-05-07T19:48:30.1538753Z #define __cpp_aggregate_nsdmi 201304L 2025-05-07T19:48:30.1538856Z #define __cpp_alias_templates 200704L 2025-05-07T19:48:30.1538947Z #define __cpp_aligned_new 201606L 2025-05-07T19:48:30.1539052Z #define __cpp_attributes 200809L 2025-05-07T19:48:30.1539143Z #define __cpp_binary_literals 201304L 2025-05-07T19:48:30.1539238Z #define __cpp_capture_star_this 201603L 2025-05-07T19:48:30.1539339Z #define __cpp_constexpr 201603L 2025-05-07T19:48:30.1539483Z #define __cpp_constexpr_in_decltype 201711L 2025-05-07T19:48:30.1539576Z #define __cpp_decltype 200707L 2025-05-07T19:48:30.1539674Z #define __cpp_decltype_auto 201304L 2025-05-07T19:48:30.1539785Z #define __cpp_deduction_guides 201703L 2025-05-07T19:48:30.1539903Z #define __cpp_delegating_constructors 200604L 2025-05-07T19:48:30.1539996Z #define __cpp_digit_separators 201309L 2025-05-07T19:48:30.1540164Z #define __cpp_enumerator_attributes 201411L 2025-05-07T19:48:30.1540286Z #define __cpp_exceptions 199711L 2025-05-07T19:48:30.1540395Z #define __cpp_fold_expressions 201603L 2025-05-07T19:48:30.1540494Z #define __cpp_generic_lambdas 201304L 2025-05-07T19:48:30.1540643Z #define __cpp_guaranteed_copy_elision 201606L 2025-05-07T19:48:30.1540745Z #define __cpp_hex_float 201603L 2025-05-07T19:48:30.1540850Z #define __cpp_if_constexpr 201606L 2025-05-07T19:48:30.1540970Z #define __cpp_impl_destroying_delete 201806L 2025-05-07T19:48:30.1541110Z #define __cpp_inheriting_constructors 201511L 2025-05-07T19:48:30.1541214Z #define __cpp_init_captures 201304L 2025-05-07T19:48:30.1541325Z #define __cpp_initializer_lists 200806L 2025-05-07T19:48:30.1541449Z #define __cpp_inline_variables 
201606L 2025-05-07T19:48:30.1541539Z #define __cpp_lambdas 200907L 2025-05-07T19:48:30.1541651Z #define __cpp_lib_addressof_constexpr 201603 2025-05-07T19:48:30.1541765Z #define __cpp_lib_array_constexpr 201803L 2025-05-07T19:48:30.1541859Z #define __cpp_lib_as_const 201510 2025-05-07T19:48:30.1541969Z #define __cpp_lib_bool_constant 201505 2025-05-07T19:48:30.1542072Z #define __cpp_lib_exchange_function 201304 2025-05-07T19:48:30.1542263Z #define __cpp_lib_has_unique_object_representations 201606 2025-05-07T19:48:30.1542365Z #define __cpp_lib_hypot 201603 2025-05-07T19:48:30.1542468Z #define __cpp_lib_integer_sequence 201304 2025-05-07T19:48:30.1542600Z #define __cpp_lib_integral_constant_callable 201304 2025-05-07T19:48:30.1542755Z #define __cpp_lib_is_aggregate 201703 2025-05-07T19:48:30.1542858Z #define __cpp_lib_is_final 201402L 2025-05-07T19:48:30.1542961Z #define __cpp_lib_is_invocable 201703 2025-05-07T19:48:30.1543089Z #define __cpp_lib_is_null_pointer 201309 2025-05-07T19:48:30.1543193Z #define __cpp_lib_is_swappable 201603 2025-05-07T19:48:30.1543289Z #define __cpp_lib_launder 201606 2025-05-07T19:48:30.1543402Z #define __cpp_lib_logical_traits 201510 2025-05-07T19:48:30.1543517Z #define __cpp_lib_make_reverse_iterator 201402 2025-05-07T19:48:30.1543642Z #define __cpp_lib_math_special_functions 201603L 2025-05-07T19:48:30.1543742Z #define __cpp_lib_result_of_sfinae 201210 2025-05-07T19:48:30.1543884Z #define __cpp_lib_robust_nonmodifying_seq_ops 201304 2025-05-07T19:48:30.1544025Z #define __cpp_lib_transformation_trait_aliases 201304 2025-05-07T19:48:30.1544140Z #define __cpp_lib_tuple_element_t 201402L 2025-05-07T19:48:30.1544265Z #define __cpp_lib_tuples_by_type 201304 2025-05-07T19:48:30.1544406Z #define __cpp_lib_type_trait_variable_templates 201510L 2025-05-07T19:48:30.1544496Z #define __cpp_lib_void_t 201411 2025-05-07T19:48:30.1544630Z #define __cpp_named_character_escapes 202207L 2025-05-07T19:48:30.1544737Z #define __cpp_namespace_attributes 
201411L 2025-05-07T19:48:30.1544864Z #define __cpp_nested_namespace_definitions 201411L 2025-05-07T19:48:30.1544971Z #define __cpp_noexcept_function_type 201510L 2025-05-07T19:48:30.1545095Z #define __cpp_nontype_template_args 201411L 2025-05-07T19:48:30.1545234Z #define __cpp_nontype_template_parameter_auto 201606L 2025-05-07T19:48:30.1545323Z #define __cpp_nsdmi 200809L 2025-05-07T19:48:30.1545453Z #define __cpp_range_based_for 201603L 2025-05-07T19:48:30.1545546Z #define __cpp_raw_strings 200710L 2025-05-07T19:48:30.1545642Z #define __cpp_ref_qualifiers 200710L 2025-05-07T19:48:30.1545751Z #define __cpp_return_type_deduction 201304L 2025-05-07T19:48:30.1545853Z #define __cpp_rtti 199711L 2025-05-07T19:48:30.1545960Z #define __cpp_rvalue_references 200610L 2025-05-07T19:48:30.1546060Z #define __cpp_static_assert 201411L 2025-05-07T19:48:30.1546177Z #define __cpp_static_call_operator 202207L 2025-05-07T19:48:30.1546278Z #define __cpp_structured_bindings 201606L 2025-05-07T19:48:30.1546369Z #define __cpp_template_auto 201606L 2025-05-07T19:48:30.1546478Z #define __cpp_threadsafe_static_init 200806L 2025-05-07T19:48:30.1546592Z #define __cpp_unicode_characters 200704L 2025-05-07T19:48:30.1546702Z #define __cpp_unicode_literals 200710L 2025-05-07T19:48:30.1546869Z #define __cpp_user_defined_literals 200809L 2025-05-07T19:48:30.1547001Z #define __cpp_variable_templates 201304L 2025-05-07T19:48:30.1547108Z #define __cpp_variadic_templates 200704L 2025-05-07T19:48:30.1547208Z #define __cpp_variadic_using 201611L 2025-05-07T19:48:30.1547340Z #define __cudaGet_blockDim() blockDim 2025-05-07T19:48:30.1547444Z #define __cudaGet_blockIdx() blockIdx 2025-05-07T19:48:30.1547541Z #define __cudaGet_gridDim() gridDim 2025-05-07T19:48:30.1547670Z #define __cudaGet_threadIdx() threadIdx 2025-05-07T19:48:30.1547769Z #define __cudaGet_warpSize() warpSize 2025-05-07T19:48:30.1547914Z #define __cudart_builtin__ __location__(cudart_builtin) 2025-05-07T19:48:30.1548010Z #define 
__daddr_t_defined 2025-05-07T19:48:30.1548133Z #define __dev_t_defined 2025-05-07T19:48:30.1548231Z #define __device__ __location__(device) 2025-05-07T19:48:30.1548372Z #define __device_builtin__ __location__(device_builtin) 2025-05-07T19:48:30.1548635Z #define __device_builtin_surface_type__ __location__(device_builtin_surface_type) 2025-05-07T19:48:30.1548875Z #define __device_builtin_texture_type__ __location__(device_builtin_texture_type) 2025-05-07T19:48:30.1549012Z #define __errordecl(name,msg) extern void name (void) 2025-05-07T19:48:30.1549101Z #define __export__ 2025-05-07T19:48:30.1549371Z #define __extern_always_inline extern __always_inline __attribute__ ((__gnu_inline__)) 2025-05-07T19:48:30.1549571Z #define __extern_inline extern __inline __attribute__ ((__gnu_inline__)) 2025-05-07T19:48:30.1549657Z #define __flexarr [] 2025-05-07T19:48:30.1549910Z #define __forceinline__ __inline__ __attribute__((always_inline)) 2025-05-07T19:48:30.1550131Z #define __fortify_function __extern_always_inline __attribute_artificial__ 2025-05-07T19:48:30.1550225Z #define __fsblkcnt_t_defined 2025-05-07T19:48:30.1550339Z #define __fsfilcnt_t_defined 2025-05-07T19:48:30.1550434Z #define __gid_t_defined 2025-05-07T19:48:30.1550586Z #define __glibc_likely(cond) __builtin_expect((cond), 1) 2025-05-07T19:48:30.1550742Z #define __glibc_unlikely(cond) __builtin_expect((cond), 0) 2025-05-07T19:48:30.1551002Z #define __glibcxx_assert(cond) do { __glibcxx_constexpr_assert(cond); } while (false) 2025-05-07T19:48:30.1551120Z #define __glibcxx_class_requires(_a,_b) 2025-05-07T19:48:30.1551234Z #define __glibcxx_class_requires2(_a,_b,_c) 2025-05-07T19:48:30.1551382Z #define __glibcxx_class_requires3(_a,_b,_c,_d) 2025-05-07T19:48:30.1551515Z #define __glibcxx_class_requires4(_a,_b,_c,_d,_e) 2025-05-07T19:48:30.1551876Z #define __glibcxx_constexpr_assert(cond) if (__builtin_is_constant_evaluated() && !bool(cond)) __builtin_unreachable() 2025-05-07T19:48:30.1552094Z #define 
__glibcxx_digits10_b(T,B) (__glibcxx_digits_b (T,B) * 643L / 2136) 2025-05-07T19:48:30.1552266Z #define __glibcxx_digits_b(T,B) (B - __glibcxx_signed_b (T,B)) 2025-05-07T19:48:30.1552379Z #define __glibcxx_function_requires(...) 2025-05-07T19:48:30.1552489Z #define __glibcxx_integral_traps true 2025-05-07T19:48:30.1552891Z #define __glibcxx_max_b(T,B) (__glibcxx_signed_b (T,B) ? (((((T)1 << (__glibcxx_digits_b (T,B) - 1)) - 1) << 1) + 1) : ~(T)0) 2025-05-07T19:48:30.1553334Z #define __glibcxx_min_b(T,B) (__glibcxx_signed_b (T,B) ? -__glibcxx_max_b (T,B) - 1 : (T)0) 2025-05-07T19:48:30.1553555Z #define __glibcxx_requires_can_decrement_range(_First1,_Last1,_First2) 2025-05-07T19:48:30.1553739Z #define __glibcxx_requires_can_increment(_First,_Size) 2025-05-07T19:48:30.1553958Z #define __glibcxx_requires_can_increment_range(_First1,_Last1,_First2) 2025-05-07T19:48:30.1554093Z #define __glibcxx_requires_cond(_Cond,_Msg) 2025-05-07T19:48:30.1554245Z #define __glibcxx_requires_heap(_First,_Last) 2025-05-07T19:48:30.1554411Z #define __glibcxx_requires_heap_pred(_First,_Last,_Pred) 2025-05-07T19:48:30.1554565Z #define __glibcxx_requires_irreflexive(_First,_Last) 2025-05-07T19:48:30.1554747Z #define __glibcxx_requires_irreflexive2(_First,_Last) 2025-05-07T19:48:30.1554937Z #define __glibcxx_requires_irreflexive_pred(_First,_Last,_Pred) 2025-05-07T19:48:30.1555133Z #define __glibcxx_requires_irreflexive_pred2(_First,_Last,_Pred) 2025-05-07T19:48:30.1555377Z #define __glibcxx_requires_non_empty_range(_First,_Last) 2025-05-07T19:48:30.1555510Z #define __glibcxx_requires_nonempty() 2025-05-07T19:48:30.1555718Z #define __glibcxx_requires_partitioned_lower(_First,_Last,_Value) 2025-05-07T19:48:30.1555962Z #define __glibcxx_requires_partitioned_lower_pred(_First,_Last,_Value,_Pred) 2025-05-07T19:48:30.1556195Z #define __glibcxx_requires_partitioned_upper(_First,_Last,_Value) 2025-05-07T19:48:30.1556440Z #define __glibcxx_requires_partitioned_upper_pred(_First,_Last,_Value,_Pred) 
2025-05-07T19:48:30.1556584Z #define __glibcxx_requires_sorted(_First,_Last) 2025-05-07T19:48:30.1556787Z #define __glibcxx_requires_sorted_pred(_First,_Last,_Pred) 2025-05-07T19:48:30.1556968Z #define __glibcxx_requires_sorted_set(_First1,_Last1,_First2) 2025-05-07T19:48:30.1557188Z #define __glibcxx_requires_sorted_set_pred(_First1,_Last1,_First2,_Pred) 2025-05-07T19:48:30.1557316Z #define __glibcxx_requires_string(_String) 2025-05-07T19:48:30.1557499Z #define __glibcxx_requires_string_len(_String,_Len) 2025-05-07T19:48:30.1557619Z #define __glibcxx_requires_subscript(_N) 2025-05-07T19:48:30.1557766Z #define __glibcxx_requires_valid_range(_First,_Last) 2025-05-07T19:48:30.1557919Z #define __glibcxx_signed_b(T,B) ((T)(-1) < 0) 2025-05-07T19:48:30.1558035Z #define __global__ __location__(global) 2025-05-07T19:48:30.1558138Z #define __gnu_linux__ 1 2025-05-07T19:48:30.1558283Z #define __grid_constant__ __location__(grid_constant) 2025-05-07T19:48:30.1558465Z #define __have_pthread_attr_t 1 2025-05-07T19:48:30.1558573Z #define __host__ __location__(host) 2025-05-07T19:48:30.1558664Z #define __id_t_defined 2025-05-07T19:48:30.1558788Z #define __import__ 2025-05-07T19:48:30.1558891Z #define __ino64_t_defined 2025-05-07T19:48:30.1558977Z #define __ino_t_defined 2025-05-07T19:48:30.1559069Z #define __int8_t_defined 2025-05-07T19:48:30.1559340Z #define __intN_t(N,MODE) typedef int int##N##_t __attribute__ ((__mode__ (MODE))) 2025-05-07T19:48:30.1559556Z #define __isleap(year) ((year) % 4 == 0 && ((year) % 100 != 0 || (year) % 400 == 0)) 2025-05-07T19:48:30.1559663Z #define __k8 1 2025-05-07T19:48:30.1559754Z #define __k8__ 1 2025-05-07T19:48:30.1559855Z #define __key_t_defined 2025-05-07T19:48:30.1560081Z #define __launch_bounds__(...) 
__annotate__(launch_bounds(__VA_ARGS__)) 2025-05-07T19:48:30.1560180Z #define __ldiv_t_defined 1 2025-05-07T19:48:30.1560273Z #define __linux 1 2025-05-07T19:48:30.1560365Z #define __linux__ 1 2025-05-07T19:48:30.1560483Z #define __lldiv_t_defined 1 2025-05-07T19:48:30.1560573Z #define __llvm__ 1 2025-05-07T19:48:30.1560687Z #define __location__(a) __annotate__(a) 2025-05-07T19:48:30.1560817Z #define __long_double_t long double 2025-05-07T19:48:30.1560922Z #define __malloc_and_calloc_defined 2025-05-07T19:48:30.1561044Z #define __managed__ __location__(managed) 2025-05-07T19:48:30.1561140Z #define __mode_t_defined 2025-05-07T19:48:30.1561254Z #define __need_IOV_MAX 2025-05-07T19:48:30.1561353Z #define __need_clock_t 2025-05-07T19:48:30.1561451Z #define __need_clockid_t 2025-05-07T19:48:30.1561558Z #define __need_time_t 2025-05-07T19:48:30.1561649Z #define __need_timer_t 2025-05-07T19:48:30.1561750Z #define __need_timespec 2025-05-07T19:48:30.1561846Z #define __nlink_t_defined 2025-05-07T19:48:30.1562002Z #define __no_return__ __attribute__((noreturn)) 2025-05-07T19:48:30.1562127Z #define __noinline__ __attribute__((noinline)) 2025-05-07T19:48:30.1562314Z #define __nonnull(params) __attribute__ ((__nonnull__ params)) 2025-05-07T19:48:30.1562443Z #define __off64_t_defined 2025-05-07T19:48:30.1562537Z #define __off_t_defined 2025-05-07T19:48:30.1562628Z #define __pic__ 2 2025-05-07T19:48:30.1562725Z #define __pid_t_defined 2025-05-07T19:48:30.1562848Z #define __pie__ 2 2025-05-07T19:48:30.1562951Z #define __private_extern__ extern 2025-05-07T19:48:30.1563036Z #define __ptr_t void * 2025-05-07T19:48:30.1563152Z #define __ptrvalue 2025-05-07T19:48:30.1563253Z #define __restrict_arr 2025-05-07T19:48:30.1563445Z #define __seg_fs __attribute__((address_space(257))) 2025-05-07T19:48:30.1563589Z #define __seg_gs __attribute__((address_space(256))) 2025-05-07T19:48:30.1563721Z #define __shared__ __location__(shared) 2025-05-07T19:48:30.1563815Z #define __sigset_t_defined 
2025-05-07T19:48:30.1563928Z #define __specialization_static 2025-05-07T19:48:30.1564054Z #define __ssize_t_defined 2025-05-07T19:48:30.1564139Z #define __stub_bdflush 2025-05-07T19:48:30.1564228Z #define __stub_chflags 2025-05-07T19:48:30.1564323Z #define __stub_fattach 2025-05-07T19:48:30.1564446Z #define __stub_fchflags 2025-05-07T19:48:30.1564532Z #define __stub_fdetach 2025-05-07T19:48:30.1564630Z #define __stub_getmsg 2025-05-07T19:48:30.1564754Z #define __stub_gtty 2025-05-07T19:48:30.1564840Z #define __stub_lchmod 2025-05-07T19:48:30.1564930Z #define __stub_putmsg 2025-05-07T19:48:30.1565022Z #define __stub_revoke 2025-05-07T19:48:30.1565139Z #define __stub_setlogin 2025-05-07T19:48:30.1565234Z #define __stub_sigreturn 2025-05-07T19:48:30.1565433Z #define __stub_sstk 2025-05-07T19:48:30.1565552Z #define __stub_stty 2025-05-07T19:48:30.1565640Z #define __suseconds_t_defined 2025-05-07T19:48:30.1565730Z #define __thread__ __thread 2025-05-07T19:48:30.1565841Z #define __throw_exception_again throw 2025-05-07T19:48:30.1565953Z #define __time_t_defined 1 2025-05-07T19:48:30.1566034Z #define __timer_t_defined 1 2025-05-07T19:48:30.1566129Z #define __timespec_defined 1 2025-05-07T19:48:30.1566240Z #define __try try 2025-05-07T19:48:30.1566373Z #define __tune_k8__ 1 2025-05-07T19:48:30.1566460Z #define __u_char_defined 2025-05-07T19:48:30.1566720Z #define __u_intN_t(N,MODE) typedef unsigned int u_int##N##_t __attribute__ ((__mode__ (MODE))) 2025-05-07T19:48:30.1566831Z #define __uid_t_defined 2025-05-07T19:48:30.1566913Z #define __unbounded 2025-05-07T19:48:30.1566988Z #define __unix 1 2025-05-07T19:48:30.1567104Z #define __unix__ 1 2025-05-07T19:48:30.1567190Z #define __useconds_t_defined 2025-05-07T19:48:30.1567281Z #define __warnattr(msg) 2025-05-07T19:48:30.1567415Z #define __warndecl(name,msg) extern void name (void) 2025-05-07T19:48:30.1567515Z #define __wur 2025-05-07T19:48:30.1567595Z #define __x86_64 1 2025-05-07T19:48:30.1567673Z #define __x86_64__ 1 
2025-05-07T19:48:30.1567814Z #define alloca(size) __builtin_alloca (size) 2025-05-07T19:48:30.1568158Z #define assert(expr) ((expr) ? __ASSERT_VOID_CAST (0) : __assert_fail (__STRING(expr), __FILE__, __LINE__, __ASSERT_FUNCTION)) 2025-05-07T19:48:30.1568556Z #define assert_perror(errnum) (!(errnum) ? __ASSERT_VOID_CAST (0) : __assert_perror_fail ((errnum), __FILE__, __LINE__, __ASSERT_FUNCTION)) 2025-05-07T19:48:30.1568658Z #define be16toh(x) __bswap_16 (x) 2025-05-07T19:48:30.1568785Z #define be32toh(x) __bswap_32 (x) 2025-05-07T19:48:30.1568884Z #define be64toh(x) __bswap_64 (x) 2025-05-07T19:48:30.1568993Z #define cudaArrayColorAttachment 0x20 2025-05-07T19:48:30.1569125Z #define cudaArrayCubemap 0x04 2025-05-07T19:48:30.1569234Z #define cudaArrayDefault 0x00 2025-05-07T19:48:30.1569343Z #define cudaArrayDeferredMapping 0x80 2025-05-07T19:48:30.1569452Z #define cudaArrayLayered 0x01 2025-05-07T19:48:30.1569557Z #define cudaArraySparse 0x40 2025-05-07T19:48:30.1569709Z #define cudaArraySparsePropertiesSingleMipTail 0x1 2025-05-07T19:48:30.1569822Z #define cudaArraySurfaceLoadStore 0x02 2025-05-07T19:48:30.1569953Z #define cudaArrayTextureGather 0x08 2025-05-07T19:48:30.1570132Z #define cudaCooperativeLaunchMultiDeviceNoPostSync 0x02 2025-05-07T19:48:30.1570296Z #define cudaCooperativeLaunchMultiDeviceNoPreSync 0x01 2025-05-07T19:48:30.1570415Z #define cudaCpuDeviceId ((int)-1) 2025-05-07T19:48:30.1570524Z #define cudaDeviceBlockingSync 0x04 2025-05-07T19:48:30.1570637Z #define cudaDeviceLmemResizeToMax 0x10 2025-05-07T19:48:30.1570731Z #define cudaDeviceMapHost 0x08 2025-05-07T19:48:30.1570846Z #define cudaDeviceMask 0x1f 2025-05-07T19:48:30.1571328Z #define cudaDevicePropDontCare { {'\0'}, {{0}}, {'\0'}, 0, 0, 0, 0, 0, 0, 0, {0, 0, 0}, {0, 0, 0}, 0, 0, -1, -1, 0, 0, -1, 0, 0, 0, 0, 0, 0, 0, 0, {0, 0}, {0, 0}, {0, 0, 0}, {0, 0}, {0, 0, 0}, {0, 0, 0}, 0, {0, 0}, {0, 0, 0}, {0, 0}, 0, {0, 0}, {0, 0, 0}, {0, 0}, {0, 0, 0}, 0, {0, 0}, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 
0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, } 2025-05-07T19:48:30.1571488Z #define cudaDeviceScheduleAuto 0x00 2025-05-07T19:48:30.1571632Z #define cudaDeviceScheduleBlockingSync 0x04 2025-05-07T19:48:30.1571743Z #define cudaDeviceScheduleMask 0x07 2025-05-07T19:48:30.1571847Z #define cudaDeviceScheduleSpin 0x01 2025-05-07T19:48:30.1571967Z #define cudaDeviceScheduleYield 0x02 2025-05-07T19:48:30.1572073Z #define cudaEventBlockingSync 0x01 2025-05-07T19:48:30.1572174Z #define cudaEventDefault 0x00 2025-05-07T19:48:30.1572271Z #define cudaEventDisableTiming 0x02 2025-05-07T19:48:30.1572392Z #define cudaEventInterprocess 0x04 2025-05-07T19:48:30.1572497Z #define cudaEventRecordDefault 0x00 2025-05-07T19:48:30.1572601Z #define cudaEventRecordExternal 0x01 2025-05-07T19:48:30.1572728Z #define cudaEventWaitDefault 0x00 2025-05-07T19:48:30.1572833Z #define cudaEventWaitExternal 0x01 2025-05-07T19:48:30.1572951Z #define cudaExternalMemoryDedicated 0x1 2025-05-07T19:48:30.1573160Z #define cudaExternalSemaphoreSignalSkipNvSciBufMemSync 0x01 2025-05-07T19:48:30.1573342Z #define cudaExternalSemaphoreWaitSkipNvSciBufMemSync 0x02 2025-05-07T19:48:30.1573447Z #define cudaHostAllocDefault 0x00 2025-05-07T19:48:30.1573553Z #define cudaHostAllocMapped 0x02 2025-05-07T19:48:30.1573677Z #define cudaHostAllocPortable 0x01 2025-05-07T19:48:30.1573833Z #define cudaHostAllocWriteCombined 0x04 2025-05-07T19:48:30.1573944Z #define cudaHostRegisterDefault 0x00 2025-05-07T19:48:30.1574080Z #define cudaHostRegisterIoMemory 0x04 2025-05-07T19:48:30.1574185Z #define cudaHostRegisterMapped 0x02 2025-05-07T19:48:30.1574294Z #define cudaHostRegisterPortable 0x01 2025-05-07T19:48:30.1574403Z #define cudaHostRegisterReadOnly 0x08 2025-05-07T19:48:30.1574525Z #define cudaInvalidDeviceId ((int)-2) 2025-05-07T19:48:30.1574651Z #define cudaIpcMemLazyEnablePeerAccess 0x01 2025-05-07T19:48:30.1574796Z #define cudaKernelNodeAttrID cudaLaunchAttributeID 
2025-05-07T19:48:30.1574994Z #define cudaKernelNodeAttrValue cudaLaunchAttributeValue 2025-05-07T19:48:30.1575308Z #define cudaKernelNodeAttributeAccessPolicyWindow cudaLaunchAttributeAccessPolicyWindow 2025-05-07T19:48:30.1575598Z #define cudaKernelNodeAttributeClusterDimension cudaLaunchAttributeClusterDimension 2025-05-07T19:48:30.1576103Z #define cudaKernelNodeAttributeClusterSchedulingPolicyPreference cudaLaunchAttributeClusterSchedulingPolicyPreference 2025-05-07T19:48:30.1576353Z #define cudaKernelNodeAttributeCooperative cudaLaunchAttributeCooperative 2025-05-07T19:48:30.1576566Z #define cudaKernelNodeAttributePriority cudaLaunchAttributePriority 2025-05-07T19:48:30.1576670Z #define cudaMemAttachGlobal 0x01 2025-05-07T19:48:30.1576796Z #define cudaMemAttachHost 0x02 2025-05-07T19:48:30.1576898Z #define cudaMemAttachSingle 0x04 2025-05-07T19:48:30.1577004Z #define cudaNvSciSyncAttrSignal 0x1 2025-05-07T19:48:30.1577131Z #define cudaNvSciSyncAttrWait 0x2 2025-05-07T19:48:30.1577239Z #define cudaOccupancyDefault 0x00 2025-05-07T19:48:30.1577377Z #define cudaOccupancyDisableCachingOverride 0x01 2025-05-07T19:48:30.1577484Z #define cudaPeerAccessDefault 0x00 2025-05-07T19:48:30.1577850Z #define cudaSignalExternalSemaphoresAsync __CUDART_API_PTSZ(cudaSignalExternalSemaphoresAsync_v2) 2025-05-07T19:48:30.1577992Z #define cudaStreamAttrID cudaLaunchAttributeID 2025-05-07T19:48:30.1578140Z #define cudaStreamAttrValue cudaLaunchAttributeValue 2025-05-07T19:48:30.1578455Z #define cudaStreamAttributeAccessPolicyWindow cudaLaunchAttributeAccessPolicyWindow 2025-05-07T19:48:30.1578782Z #define cudaStreamAttributeSynchronizationPolicy cudaLaunchAttributeSynchronizationPolicy 2025-05-07T19:48:30.1578889Z #define cudaStreamDefault 0x00 2025-05-07T19:48:30.1579023Z #define cudaStreamLegacy ((cudaStream_t)0x1) 2025-05-07T19:48:30.1579130Z #define cudaStreamNonBlocking 0x01 2025-05-07T19:48:30.1579319Z #define cudaStreamPerThread ((cudaStream_t)0x2) 
2025-05-07T19:48:30.1579424Z #define cudaSurfaceType1D 0x01 2025-05-07T19:48:30.1579552Z #define cudaSurfaceType1DLayered 0xF1 2025-05-07T19:48:30.1579655Z #define cudaSurfaceType2D 0x02 2025-05-07T19:48:30.1579762Z #define cudaSurfaceType2DLayered 0xF2 2025-05-07T19:48:30.1579881Z #define cudaSurfaceType3D 0x03 2025-05-07T19:48:30.1579992Z #define cudaSurfaceTypeCubemap 0x0C 2025-05-07T19:48:30.1580120Z #define cudaSurfaceTypeCubemapLayered 0xFC 2025-05-07T19:48:30.1580243Z #define cudaTextureType1D 0x01 2025-05-07T19:48:30.1580347Z #define cudaTextureType1DLayered 0xF1 2025-05-07T19:48:30.1580448Z #define cudaTextureType2D 0x02 2025-05-07T19:48:30.1580557Z #define cudaTextureType2DLayered 0xF2 2025-05-07T19:48:30.1580665Z #define cudaTextureType3D 0x03 2025-05-07T19:48:30.1580771Z #define cudaTextureTypeCubemap 0x0C 2025-05-07T19:48:30.1580901Z #define cudaTextureTypeCubemapLayered 0xFC 2025-05-07T19:48:30.1581240Z #define cudaWaitExternalSemaphoresAsync __CUDART_API_PTSZ(cudaWaitExternalSemaphoresAsync_v2) 2025-05-07T19:48:30.1581339Z #define getc(_fp) _IO_getc (_fp) 2025-05-07T19:48:30.1581448Z #define htobe16(x) __bswap_16 (x) 2025-05-07T19:48:30.1581548Z #define htobe32(x) __bswap_32 (x) 2025-05-07T19:48:30.1581661Z #define htobe64(x) __bswap_64 (x) 2025-05-07T19:48:30.1581750Z #define htole16(x) (x) 2025-05-07T19:48:30.1581841Z #define htole32(x) (x) 2025-05-07T19:48:30.1581945Z #define htole64(x) (x) 2025-05-07T19:48:30.1582034Z #define le16toh(x) (x) 2025-05-07T19:48:30.1582171Z #define le32toh(x) (x) 2025-05-07T19:48:30.1582266Z #define le64toh(x) (x) 2025-05-07T19:48:30.1582370Z #define linux 1 2025-05-07T19:48:30.1582472Z #define major(dev) gnu_dev_major (dev) 2025-05-07T19:48:30.1582606Z #define makedev(maj,min) gnu_dev_makedev (maj, min) 2025-05-07T19:48:30.1582772Z #define math_errhandling (MATH_ERRNO | MATH_ERREXCEPT) 2025-05-07T19:48:30.1582876Z #define minor(dev) gnu_dev_minor (dev) 2025-05-07T19:48:30.1582998Z #define offsetof(t,d) 
__builtin_offsetof(t, d) 2025-05-07T19:48:30.1583112Z #define putc(_ch,_fp) _IO_putc (_ch, _fp) 2025-05-07T19:48:30.1583218Z #define stderr stderr 2025-05-07T19:48:30.1583297Z #define stdin stdin 2025-05-07T19:48:30.1583386Z #define stdout stdout 2025-05-07T19:48:30.1583896Z #define strdupa(s) (__extension__ ({ const char *__old = (s); size_t __len = strlen (__old) + 1; char *__new = (char *) __builtin_alloca (__len); (char *) memcpy (__new, __old, __len); })) 2025-05-07T19:48:30.1584427Z #define strndupa(s,n) (__extension__ ({ const char *__old = (s); size_t __len = strnlen (__old, (n)); char *__new = (char *) __builtin_alloca (__len + 1); __new[__len] = '\0'; (char *) memcpy (__new, __old, __len); })) 2025-05-07T19:48:30.1584512Z #define unix 1 2025-05-07T19:48:30.1584667Z #define w_coredump __wait_terminated.__w_coredump 2025-05-07T19:48:30.1584793Z #define w_retcode __wait_terminated.__w_retcode 2025-05-07T19:48:30.1584905Z #define w_stopsig __wait_stopped.__w_stopsig 2025-05-07T19:48:30.1585028Z #define w_stopval __wait_stopped.__w_stopval 2025-05-07T19:48:30.1585174Z #define w_termsig __wait_terminated.__w_termsig 2025-05-07T19:48:30.1585182Z 2025-05-07T19:48:30.1706890Z 2025-05-07T19:48:30.1707342Z + conda run -n build_binary nvcc --version 2025-05-07T19:48:30.1707352Z 2025-05-07T19:48:32.0015273Z nvcc: NVIDIA (R) Cuda compiler driver 2025-05-07T19:48:32.0016619Z Copyright (c) 2005-2022 NVIDIA Corporation 2025-05-07T19:48:32.0017617Z Built on Wed_Sep_21_10:33:58_PDT_2022 2025-05-07T19:48:32.0018620Z Cuda compilation tools, release 11.8, V11.8.89 2025-05-07T19:48:32.0019643Z Build cuda_11.8.r11.8/compiler.31833905_0 2025-05-07T19:48:32.0020273Z 2025-05-07T19:48:32.0688180Z 2025-05-07T19:48:32.0697551Z which: no nvidia-smi in (CONDA=/github/home/miniconda:/github/home/miniconda/bin:/usr/local/sbin:/usr/local/bin:/usr/sbin:/usr/bin:/sbin:/bin) 2025-05-07T19:48:32.0698322Z [CHECK] nvidia-smi not found 2025-05-07T19:48:32.0698674Z [INSTALL] Successfully installed 
CUDA 11.8.0 2025-05-07T19:48:32.0794760Z ##[group]Run . $PRELUDE; install_pytorch_pip $BUILD_ENV nightly cuda/11.8.0 2025-05-07T19:48:32.0795414Z . $PRELUDE; install_pytorch_pip $BUILD_ENV nightly cuda/11.8.0 2025-05-07T19:48:32.0796099Z shell: bash --noprofile --norc -e -o pipefail {0} 2025-05-07T19:48:32.0796474Z env: 2025-05-07T19:48:32.0796746Z PRELUDE: .github/scripts/setup_env.bash 2025-05-07T19:48:32.0797076Z BUILD_ENV: build_binary 2025-05-07T19:48:32.0797372Z BUILD_TARGET: default 2025-05-07T19:48:32.0797630Z BUILD_VARIANT: cuda 2025-05-07T19:48:32.0797912Z BUILD_CUDA_VERSION: 11.8.0 2025-05-07T19:48:32.0798182Z ##[endgroup] 2025-05-07T19:48:32.4851829Z ################################################################################ 2025-05-07T19:48:32.4852893Z # Install PyTorch (PIP) 2025-05-07T19:48:32.4853148Z # 2025-05-07T19:48:32.4867449Z # [2025-05-07T19:48:32.485Z] + install_pytorch_pip build_binary nightly cuda/11.8.0 2025-05-07T19:48:32.4868552Z ################################################################################ 2025-05-07T19:48:32.4868815Z 2025-05-07T19:48:32.4894475Z [EXEC] [ATTEMPT 0/3] + conda install -n build_binary -c conda-forge --override-channels -y numpy 2025-05-07T19:48:33.3696125Z Channels: 2025-05-07T19:48:33.3696565Z - conda-forge 2025-05-07T19:48:33.3696906Z Platform: linux-64 2025-05-07T19:48:42.9603543Z Collecting package metadata (repodata.json): - \ | / - \ | / - \ | / - \ | / - done 2025-05-07T19:48:44.5023608Z Solving environment: | / - done 2025-05-07T19:48:44.6821961Z 2025-05-07T19:48:44.6822712Z ## Package Plan ## 2025-05-07T19:48:44.6823348Z 2025-05-07T19:48:44.6824093Z environment location: /github/home/miniconda/envs/build_binary 2025-05-07T19:48:44.6825023Z 2025-05-07T19:48:44.6825347Z added / updated specs: 2025-05-07T19:48:44.6825909Z - numpy 2025-05-07T19:48:44.6826044Z 2025-05-07T19:48:44.6826048Z 2025-05-07T19:48:44.6826213Z The following packages will be downloaded: 2025-05-07T19:48:44.6826500Z 
2025-05-07T19:48:44.6826747Z package | build 2025-05-07T19:48:44.6827123Z ---------------------------|----------------- 2025-05-07T19:48:44.6827545Z libblas-3.9.0 |31_h59b9bed_openblas 16 KB conda-forge 2025-05-07T19:48:44.6828072Z libcblas-3.9.0 |31_he106b2a_openblas 16 KB conda-forge 2025-05-07T19:48:44.6828599Z liblapack-3.9.0 |31_h7ac8fdf_openblas 16 KB conda-forge 2025-05-07T19:48:44.6829075Z numpy-2.2.5 | py313h17eae1a_0 8.1 MB conda-forge 2025-05-07T19:48:44.6829523Z ------------------------------------------------------------ 2025-05-07T19:48:44.6829994Z Total: 8.2 MB 2025-05-07T19:48:44.6830242Z 2025-05-07T19:48:44.6830378Z The following NEW packages will be INSTALLED: 2025-05-07T19:48:44.6830611Z 2025-05-07T19:48:44.6830887Z libblas conda-forge/linux-64::libblas-3.9.0-31_h59b9bed_openblas 2025-05-07T19:48:44.6831421Z libcblas conda-forge/linux-64::libcblas-3.9.0-31_he106b2a_openblas 2025-05-07T19:48:44.6831977Z liblapack conda-forge/linux-64::liblapack-3.9.0-31_h7ac8fdf_openblas 2025-05-07T19:48:44.6832480Z numpy conda-forge/linux-64::numpy-2.2.5-py313h17eae1a_0 2025-05-07T19:48:44.6832907Z 2025-05-07T19:48:44.6832911Z 2025-05-07T19:48:44.6832915Z 2025-05-07T19:48:44.6833251Z Downloading and Extracting Packages: ...working... 
2025-05-07T19:48:44.6840178Z numpy-2.2.5 | 8.1 MB | | 0% 2025-05-07T19:48:44.6846268Z libblas-3.9.0 | 16 KB | | 0% 2025-05-07T19:48:44.6855078Z libcblas-3.9.0 | 16 KB | | 0% 2025-05-07T19:48:44.7297799Z liblapack-3.9.0 | 16 KB | | 0% 2025-05-07T19:48:44.7355312Z libblas-3.9.0 | 16 KB | ########## | 100% 2025-05-07T19:48:44.7440471Z libcblas-3.9.0 | 16 KB | ########## | 100% 2025-05-07T19:48:44.7764983Z liblapack-3.9.0 | 16 KB | ########## | 100% 2025-05-07T19:48:44.8400160Z numpy-2.2.5 | 8.1 MB | ####### | 71% 2025-05-07T19:48:45.1510405Z numpy-2.2.5 | 8.1 MB | ########## | 100% 2025-05-07T19:48:45.1519248Z done 2025-05-07T19:48:45.2530071Z Preparing transaction: | done 2025-05-07T19:48:45.3535062Z Verifying transaction: - done 2025-05-07T19:48:45.4543730Z Executing transaction: | done 2025-05-07T19:48:45.5553633Z ################################################################################
2025-05-07T19:48:45.5554101Z # Install Package From PyTorch PIP: torch 2025-05-07T19:48:45.5554481Z # 2025-05-07T19:48:45.5573198Z # [2025-05-07T19:48:45.556Z] + install_from_pytorch_pip build_binary torch nightly cuda/11.8.0 2025-05-07T19:48:45.5575735Z ################################################################################ 2025-05-07T19:48:45.5576891Z 2025-05-07T19:48:45.5597921Z [EXEC] [ATTEMPT 0/3] + wget -q --timeout 1 pypi.org -O /dev/null 2025-05-07T19:48:45.6462529Z [CHECK] Network does not appear to be blocked. 2025-05-07T19:48:45.6462980Z ################################################################################ 2025-05-07T19:48:45.6463386Z # Prepare PIP Arguments (PyTorch PIP) 2025-05-07T19:48:45.6463693Z # 2025-05-07T19:48:45.6490136Z # [2025-05-07T19:48:45.648Z] + __prepare_pip_arguments torch nightly cuda/11.8.0 2025-05-07T19:48:45.6491501Z ################################################################################ 2025-05-07T19:48:45.6492213Z 2025-05-07T19:48:45.6513563Z [INSTALL] Extracted package (channel, version): (nightly, LATEST) 2025-05-07T19:48:45.6538195Z [INSTALL] Extracted package variant: cu118 2025-05-07T19:48:45.6553918Z [INSTALL] Using a non-RELEASE channel: nightly ... 2025-05-07T19:48:45.6555612Z [INSTALL] Extracted the full PIP channel: https://download.pytorch.org/whl/nightly/cu118/ 2025-05-07T19:48:45.6558197Z [INSTALL] Extracted the full PIP package: --pre torch 2025-05-07T19:48:45.6566754Z [INSTALL] Attempting to install [torch, LATEST] from PyTorch PIP using channel https://download.pytorch.org/whl/nightly/cu118/ ... 2025-05-07T19:48:45.6591764Z [EXEC] [ATTEMPT 0/3] + conda run -n build_binary pip install --pre torch --index-url https://download.pytorch.org/whl/nightly/cu118/ 2025-05-07T19:50:05.6521529Z WARNING: Running pip as the 'root' user can result in broken permissions and conflicting behaviour with the system package manager, possibly rendering your system unusable. 
It is recommended to use a virtual environment instead: https://pip.pypa.io/warnings/venv. Use the --root-user-action option if you know what you are doing and want to suppress this warning. 2025-05-07T19:50:05.6523258Z 2025-05-07T19:50:05.6523631Z Looking in indexes: https://download.pytorch.org/whl/nightly/cu118/ 2025-05-07T19:50:05.6524116Z Collecting torch 2025-05-07T19:50:05.6524845Z Downloading https://download.pytorch.org/whl/nightly/cu118/torch-2.8.0.dev20250507%2Bcu118-cp313-cp313-manylinux_2_28_x86_64.whl.metadata (29 kB) 2025-05-07T19:50:05.6525682Z Collecting filelock (from torch) 2025-05-07T19:50:05.6526246Z Downloading https://download.pytorch.org/whl/nightly/filelock-3.16.1-py3-none-any.whl (16 kB) 2025-05-07T19:50:05.6527319Z Requirement already satisfied: typing-extensions>=4.10.0 in /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages (from torch) (4.13.2) 2025-05-07T19:50:05.6528550Z Requirement already satisfied: setuptools in /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages (from torch) (78.1.1) 2025-05-07T19:50:05.6529301Z Collecting sympy>=1.13.3 (from torch) 2025-05-07T19:50:05.6529888Z Downloading https://download.pytorch.org/whl/nightly/sympy-1.13.3-py3-none-any.whl (6.2 MB) 2025-05-07T19:50:05.6530877Z ━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━ 6.2/6.2 MB 225.8 MB/s eta 0:00:00 2025-05-07T19:50:05.6531298Z Collecting networkx (from torch) 2025-05-07T19:50:05.6531997Z Downloading https://download.pytorch.org/whl/nightly/networkx-3.4.2-py3-none-any.whl (1.7 MB) 2025-05-07T19:50:05.6532784Z ━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━ 1.7/1.7 MB 162.1 MB/s eta 0:00:00 2025-05-07T19:50:05.6533530Z Requirement already satisfied: jinja2 in /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages (from torch) (3.1.6) 2025-05-07T19:50:05.6534218Z Collecting fsspec (from torch) 2025-05-07T19:50:05.6534765Z Downloading https://download.pytorch.org/whl/nightly/fsspec-2024.10.0-py3-none-any.whl (179 kB) 
2025-05-07T19:50:05.6535370Z Collecting nvidia-cuda-nvrtc-cu11==11.8.89 (from torch) 2025-05-07T19:50:05.6536140Z Downloading https://download.pytorch.org/whl/nightly/cu118/nvidia_cuda_nvrtc_cu11-11.8.89-py3-none-manylinux1_x86_64.whl (23.2 MB) 2025-05-07T19:50:05.6536978Z ━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━ 23.2/23.2 MB 252.5 MB/s eta 0:00:00 2025-05-07T19:50:05.6537400Z Collecting nvidia-cuda-runtime-cu11==11.8.89 (from torch) 2025-05-07T19:50:05.6538177Z Downloading https://download.pytorch.org/whl/nightly/cu118/nvidia_cuda_runtime_cu11-11.8.89-py3-none-manylinux1_x86_64.whl (875 kB) 2025-05-07T19:50:05.6538983Z ━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━ 875.6/875.6 kB 102.3 MB/s eta 0:00:00 2025-05-07T19:50:05.6539434Z Collecting nvidia-cuda-cupti-cu11==11.8.87 (from torch) 2025-05-07T19:50:05.6540191Z Downloading https://download.pytorch.org/whl/nightly/cu118/nvidia_cuda_cupti_cu11-11.8.87-py3-none-manylinux1_x86_64.whl (13.1 MB) 2025-05-07T19:50:05.6540993Z ━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━ 13.1/13.1 MB 189.6 MB/s eta 0:00:00 2025-05-07T19:50:05.6541420Z Collecting nvidia-cudnn-cu11==9.1.0.70 (from torch) 2025-05-07T19:50:05.6542140Z Downloading https://download.pytorch.org/whl/nightly/cu118/nvidia_cudnn_cu11-9.1.0.70-py3-none-manylinux2014_x86_64.whl (663.9 MB) 2025-05-07T19:50:05.6542963Z ━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━ 663.9/663.9 MB 47.2 MB/s eta 0:00:00 2025-05-07T19:50:05.6543378Z Collecting nvidia-cublas-cu11==11.11.3.6 (from torch) 2025-05-07T19:50:05.6544079Z Downloading https://download.pytorch.org/whl/nightly/cu118/nvidia_cublas_cu11-11.11.3.6-py3-none-manylinux1_x86_64.whl (417.9 MB) 2025-05-07T19:50:05.6544886Z ━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━ 417.9/417.9 MB 81.2 MB/s eta 0:00:00 2025-05-07T19:50:05.6545274Z Collecting nvidia-cufft-cu11==10.9.0.58 (from torch) 2025-05-07T19:50:05.6545992Z Downloading https://download.pytorch.org/whl/nightly/cu118/nvidia_cufft_cu11-10.9.0.58-py3-none-manylinux1_x86_64.whl (168.4 MB) 
2025-05-07T19:50:05.6546952Z ━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━ 168.4/168.4 MB 176.4 MB/s eta 0:00:00 2025-05-07T19:50:05.6547388Z Collecting nvidia-curand-cu11==10.3.0.86 (from torch) 2025-05-07T19:50:05.6549933Z Downloading https://download.pytorch.org/whl/nightly/cu118/nvidia_curand_cu11-10.3.0.86-py3-none-manylinux1_x86_64.whl (58.1 MB) 2025-05-07T19:50:05.6550804Z ━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━ 58.1/58.1 MB 239.6 MB/s eta 0:00:00 2025-05-07T19:50:05.6551254Z Collecting nvidia-cusolver-cu11==11.4.1.48 (from torch) 2025-05-07T19:50:05.6552016Z Downloading https://download.pytorch.org/whl/nightly/cu118/nvidia_cusolver_cu11-11.4.1.48-py3-none-manylinux1_x86_64.whl (128.2 MB) 2025-05-07T19:50:05.6553001Z ━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━ 128.2/128.2 MB 218.8 MB/s eta 0:00:00 2025-05-07T19:50:05.6553651Z Collecting nvidia-cusparse-cu11==11.7.5.86 (from torch) 2025-05-07T19:50:05.6554439Z Downloading https://download.pytorch.org/whl/nightly/cu118/nvidia_cusparse_cu11-11.7.5.86-py3-none-manylinux1_x86_64.whl (204.1 MB) 2025-05-07T19:50:05.6555333Z ━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━ 204.1/204.1 MB 148.9 MB/s eta 0:00:00 2025-05-07T19:50:05.6555759Z Collecting nvidia-nccl-cu11==2.21.5 (from torch) 2025-05-07T19:50:05.6556535Z Downloading https://download.pytorch.org/whl/nightly/cu118/nvidia_nccl_cu11-2.21.5-py3-none-manylinux2014_x86_64.whl (147.8 MB) 2025-05-07T19:50:05.6557392Z ━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━ 147.8/147.8 MB 83.7 MB/s eta 0:00:00 2025-05-07T19:50:05.6557798Z Collecting nvidia-nvtx-cu11==11.8.86 (from torch) 2025-05-07T19:50:05.6558536Z Downloading https://download.pytorch.org/whl/nightly/cu118/nvidia_nvtx_cu11-11.8.86-py3-none-manylinux1_x86_64.whl (99 kB) 2025-05-07T19:50:05.6559292Z Collecting pytorch-triton==3.3.0+git96316ce5 (from torch) 2025-05-07T19:50:05.6560319Z Downloading https://download.pytorch.org/whl/nightly/pytorch_triton-3.3.0%2Bgit96316ce5-cp313-cp313-manylinux_2_27_x86_64.manylinux_2_28_x86_64.whl.metadata 
(1.6 kB) 2025-05-07T19:50:05.6561182Z Collecting mpmath<1.4,>=1.1.0 (from sympy>=1.13.3->torch) 2025-05-07T19:50:05.6561785Z Downloading https://download.pytorch.org/whl/nightly/mpmath-1.3.0-py3-none-any.whl (536 kB) 2025-05-07T19:50:05.6562473Z ━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━ 536.2/536.2 kB 3.2 MB/s eta 0:00:00 2025-05-07T19:50:05.6563242Z Requirement already satisfied: MarkupSafe>=2.0 in /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages (from jinja2->torch) (3.0.2) 2025-05-07T19:50:05.6564402Z Downloading https://download.pytorch.org/whl/nightly/cu118/torch-2.8.0.dev20250507%2Bcu118-cp313-cp313-manylinux_2_28_x86_64.whl (916.2 MB) 2025-05-07T19:50:05.6565261Z ━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━ 916.2/916.2 MB 33.1 MB/s eta 0:00:00 2025-05-07T19:50:05.6566075Z Downloading https://download.pytorch.org/whl/nightly/pytorch_triton-3.3.0%2Bgit96316ce5-cp313-cp313-manylinux_2_27_x86_64.manylinux_2_28_x86_64.whl (153.5 MB) 2025-05-07T19:50:05.6567002Z ━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━ 153.5/153.5 MB 71.1 MB/s eta 0:00:00 2025-05-07T19:50:05.6568572Z Installing collected packages: mpmath, sympy, pytorch-triton, nvidia-nvtx-cu11, nvidia-nccl-cu11, nvidia-cusparse-cu11, nvidia-curand-cu11, nvidia-cufft-cu11, nvidia-cuda-runtime-cu11, nvidia-cuda-nvrtc-cu11, nvidia-cuda-cupti-cu11, nvidia-cublas-cu11, networkx, fsspec, filelock, nvidia-cusolver-cu11, nvidia-cudnn-cu11, torch 2025-05-07T19:50:05.6569999Z 2025-05-07T19:50:05.6571627Z Successfully installed filelock-3.16.1 fsspec-2024.10.0 mpmath-1.3.0 networkx-3.4.2 nvidia-cublas-cu11-11.11.3.6 nvidia-cuda-cupti-cu11-11.8.87 nvidia-cuda-nvrtc-cu11-11.8.89 nvidia-cuda-runtime-cu11-11.8.89 nvidia-cudnn-cu11-9.1.0.70 nvidia-cufft-cu11-10.9.0.58 nvidia-curand-cu11-10.3.0.86 nvidia-cusolver-cu11-11.4.1.48 nvidia-cusparse-cu11-11.7.5.86 nvidia-nccl-cu11-2.21.5 nvidia-nvtx-cu11-11.8.86 pytorch-triton-3.3.0+git96316ce5 sympy-1.13.3 torch-2.8.0.dev20250507+cu118 2025-05-07T19:50:05.6573377Z 
2025-05-07T19:50:07.8592772Z torch 2.8.0.dev20250507+cu118 2025-05-07T19:50:07.8594555Z [CHECK] The installed package [torch, nightly/LATEST] is the correct variant (cu118) 2025-05-07T19:50:11.1856018Z [CHECK] Python (sub-)package 'torch.distributed' found ... 2025-05-07T19:50:14.5145946Z [CHECK] NOTE: The installed version is: 2.8.0.dev20250507+cu118 2025-05-07T19:50:14.5147292Z [CHECK] NOTE: Checking _GLIBCXX_USE_CXX11_ABI ... 2025-05-07T19:50:17.7599797Z True 2025-05-07T19:50:17.7600339Z True 2025-05-07T19:50:17.7600467Z 2025-05-07T19:50:17.8186370Z [INSTALL] Successfully installed PyTorch through PyTorch PIP 2025-05-07T19:50:17.8293394Z ##[group]Run if . $PRELUDE && which conda; then collect_pytorch_env_info $BUILD_ENV; fi 2025-05-07T19:50:17.8294008Z if . $PRELUDE && which conda; then collect_pytorch_env_info $BUILD_ENV; fi 2025-05-07T19:50:17.8294628Z shell: bash --noprofile --norc -e -o pipefail {0} 2025-05-07T19:50:17.8294975Z env: 2025-05-07T19:50:17.8295185Z PRELUDE: .github/scripts/setup_env.bash 2025-05-07T19:50:17.8295496Z BUILD_ENV: build_binary 2025-05-07T19:50:17.8295730Z BUILD_TARGET: default 2025-05-07T19:50:17.8295981Z BUILD_VARIANT: cuda 2025-05-07T19:50:17.8296201Z BUILD_CUDA_VERSION: 11.8.0 2025-05-07T19:50:17.8296484Z ##[endgroup] 2025-05-07T19:50:18.2766269Z /github/home/miniconda/bin/conda 2025-05-07T19:50:18.2767249Z ################################################################################ 2025-05-07T19:50:18.2768527Z # Collect PyTorch Environment Information (for Reporting Issues) 2025-05-07T19:50:18.2769645Z # 2025-05-07T19:50:18.2786609Z # [2025-05-07T19:50:18.278Z] + collect_pytorch_env_info build_binary 2025-05-07T19:50:18.2787052Z ################################################################################ 2025-05-07T19:50:18.2787306Z 2025-05-07T19:50:18.2804015Z [EXEC] [ATTEMPT 0/3] + wget -q --timeout 1 pypi.org -O /dev/null 2025-05-07T19:50:18.3657372Z [CHECK] Network does not appear to be blocked. 
2025-05-07T19:50:18.3661209Z [INFO] Downloading the PyTorch environment info collection script ... 2025-05-07T19:50:18.3662938Z + wget -q https://raw.githubusercontent.com/pytorch/pytorch/main/torch/utils/collect_env.py 2025-05-07T19:50:18.3663371Z 2025-05-07T19:50:18.4498957Z 2025-05-07T19:50:18.4499514Z [INFO] Collecting PyTorch environment info (will be needed for reporting issues to PyTorch) ... 2025-05-07T19:50:18.4517469Z [EXEC] [ATTEMPT 0/3] + conda run -n build_binary python collect_env.py 2025-05-07T19:50:23.8262611Z Collecting environment information... 2025-05-07T19:50:23.8263602Z PyTorch version: 2.8.0.dev20250507+cu118 2025-05-07T19:50:23.8264356Z Is debug build: False 2025-05-07T19:50:23.8264644Z CUDA used to build PyTorch: 11.8 2025-05-07T19:50:23.8264936Z ROCM used to build PyTorch: N/A 2025-05-07T19:50:23.8265123Z 2025-05-07T19:50:23.8265246Z OS: Amazon Linux 2023.7.20250428 (x86_64) 2025-05-07T19:50:23.8265556Z GCC version: Could not collect 2025-05-07T19:50:23.8266166Z Clang version: 16.0.6 (https://github.com/conda-forge/clangdev-feedstock db6970f6bb85e49860ed8bab43ebf165b5c55cc4) 2025-05-07T19:50:23.8266770Z CMake version: version 4.0.2 2025-05-07T19:50:23.8267169Z Libc version: glibc-2.34 2025-05-07T19:50:23.8267335Z 2025-05-07T19:50:23.8267656Z Python version: 3.13.2 | packaged by conda-forge | (main, Feb 17 2025, 14:10:22) [GCC 13.3.0] (64-bit runtime) 2025-05-07T19:50:23.8268632Z Python platform: Linux-6.1.130-139.222.amzn2023.x86_64-x86_64-with-glibc2.34 2025-05-07T19:50:23.8269094Z Is CUDA available: False 2025-05-07T19:50:23.8269364Z CUDA runtime version: 11.8.89 2025-05-07T19:50:23.8269677Z CUDA_MODULE_LOADING set to: N/A 2025-05-07T19:50:23.8269996Z GPU models and configuration: Could not collect 2025-05-07T19:50:23.8270349Z Nvidia driver version: Could not collect 2025-05-07T19:50:23.8270654Z cuDNN version: Could not collect 2025-05-07T19:50:23.8270942Z HIP runtime version: N/A 2025-05-07T19:50:23.8271207Z MIOpen runtime version: N/A 
2025-05-07T19:50:23.8271467Z Is XNNPACK available: True 2025-05-07T19:50:23.8271628Z 2025-05-07T19:50:23.8271722Z CPU: 2025-05-07T19:50:23.8271928Z Architecture: x86_64 2025-05-07T19:50:23.8272271Z CPU op-mode(s): 32-bit, 64-bit 2025-05-07T19:50:23.8272659Z Address sizes: 46 bits physical, 48 bits virtual 2025-05-07T19:50:23.8273209Z Byte Order: Little Endian 2025-05-07T19:50:23.8273740Z CPU(s): 96 2025-05-07T19:50:23.8274105Z On-line CPU(s) list: 0-95 2025-05-07T19:50:23.8274511Z Vendor ID: GenuineIntel 2025-05-07T19:50:23.8275115Z Model name: Intel(R) Xeon(R) Platinum 8275CL CPU @ 3.00GHz 2025-05-07T19:50:23.8275552Z CPU family: 6 2025-05-07T19:50:23.8275862Z Model: 85 2025-05-07T19:50:23.8276193Z Thread(s) per core: 2 2025-05-07T19:50:23.8276497Z Core(s) per socket: 24 2025-05-07T19:50:23.8276826Z Socket(s): 2 2025-05-07T19:50:23.8277114Z Stepping: 7 2025-05-07T19:50:23.8277452Z BogoMIPS: 5999.97 2025-05-07T19:50:23.8280011Z Flags: fpu vme de pse tsc msr pae mce cx8 apic sep mtrr pge mca cmov pat pse36 clflush mmx fxsr sse sse2 ss ht syscall nx pdpe1gb rdtscp lm constant_tsc arch_perfmon rep_good nopl xtopology nonstop_tsc cpuid aperfmperf tsc_known_freq pni pclmulqdq monitor ssse3 fma cx16 pcid sse4_1 sse4_2 x2apic movbe popcnt tsc_deadline_timer aes xsave avx f16c rdrand hypervisor lahf_lm abm 3dnowprefetch invpcid_single pti fsgsbase tsc_adjust bmi1 avx2 smep bmi2 erms invpcid mpx avx512f avx512dq rdseed adx smap clflushopt clwb avx512cd avx512bw avx512vl xsaveopt xsavec xgetbv1 xsaves ida arat pku ospke avx512_vnni 2025-05-07T19:50:23.8282294Z Hypervisor vendor: KVM 2025-05-07T19:50:23.8282602Z Virtualization type: full 2025-05-07T19:50:23.8282979Z L1d cache: 1.5 MiB (48 instances) 2025-05-07T19:50:23.8283342Z L1i cache: 1.5 MiB (48 instances) 2025-05-07T19:50:23.8283730Z L2 cache: 48 MiB (48 instances) 2025-05-07T19:50:23.8284112Z L3 cache: 71.5 MiB (2 instances) 2025-05-07T19:50:23.8284425Z NUMA node(s): 2 2025-05-07T19:50:23.8284739Z NUMA node0 CPU(s): 
0-23,48-71 2025-05-07T19:50:23.8285062Z NUMA node1 CPU(s): 24-47,72-95 2025-05-07T19:50:23.8285524Z Vulnerability Gather data sampling: Unknown: Dependent on hypervisor status 2025-05-07T19:50:23.8286074Z Vulnerability Itlb multihit: KVM: Mitigation: VMX unsupported 2025-05-07T19:50:23.8286565Z Vulnerability L1tf: Mitigation; PTE Inversion 2025-05-07T19:50:23.8287167Z Vulnerability Mds: Vulnerable: Clear CPU buffers attempted, no microcode; SMT Host state unknown 2025-05-07T19:50:23.8287750Z Vulnerability Meltdown: Mitigation; PTI 2025-05-07T19:50:23.8288379Z Vulnerability Mmio stale data: Vulnerable: Clear CPU buffers attempted, no microcode; SMT Host state unknown 2025-05-07T19:50:23.8288980Z Vulnerability Reg file data sampling: Not affected 2025-05-07T19:50:23.8289461Z Vulnerability Retbleed: Vulnerable 2025-05-07T19:50:23.8289822Z Vulnerability Spec rstack overflow: Not affected 2025-05-07T19:50:23.8290206Z Vulnerability Spec store bypass: Vulnerable 2025-05-07T19:50:23.8290778Z Vulnerability Spectre v1: Mitigation; usercopy/swapgs barriers and __user pointer sanitization 2025-05-07T19:50:23.8291584Z Vulnerability Spectre v2: Mitigation; Retpolines; STIBP disabled; RSB filling; PBRSB-eIBRS Not affected; BHI Retpoline 2025-05-07T19:50:23.8292219Z Vulnerability Srbds: Not affected 2025-05-07T19:50:23.8292581Z Vulnerability Tsx async abort: Not affected 2025-05-07T19:50:23.8292831Z 2025-05-07T19:50:23.8292930Z Versions of relevant libraries: 2025-05-07T19:50:23.8293188Z [pip3] numpy==2.2.5 2025-05-07T19:50:23.8293439Z [pip3] nvidia-cublas-cu11==11.11.3.6 2025-05-07T19:50:23.8293755Z [pip3] nvidia-cuda-cupti-cu11==11.8.87 2025-05-07T19:50:23.8294066Z [pip3] nvidia-cuda-nvrtc-cu11==11.8.89 2025-05-07T19:50:23.8294414Z [pip3] nvidia-cuda-runtime-cu11==11.8.89 2025-05-07T19:50:23.8294726Z [pip3] nvidia-cudnn-cu11==9.1.0.70 2025-05-07T19:50:23.8295020Z [pip3] nvidia-cufft-cu11==10.9.0.58 2025-05-07T19:50:23.8295310Z [pip3] nvidia-curand-cu11==10.3.0.86 
2025-05-07T19:50:23.8295635Z [pip3] nvidia-cusolver-cu11==11.4.1.48 2025-05-07T19:50:23.8296003Z [pip3] nvidia-cusparse-cu11==11.7.5.86 2025-05-07T19:50:23.8296313Z [pip3] nvidia-nccl-cu11==2.21.5 2025-05-07T19:50:23.8296596Z [pip3] nvidia-nvtx-cu11==11.8.86 2025-05-07T19:50:23.8296916Z [pip3] pytorch-triton==3.3.0+git96316ce5 2025-05-07T19:50:23.8297255Z [pip3] torch==2.8.0.dev20250507+cu118 2025-05-07T19:50:23.8297648Z [conda] cuda-cudart 11.8.89 0 nvidia/label/cuda-11.8.0 2025-05-07T19:50:23.8298202Z [conda] cuda-cudart-dev 11.8.89 0 nvidia/label/cuda-11.8.0 2025-05-07T19:50:23.8298717Z [conda] cuda-cupti 11.8.87 0 nvidia/label/cuda-11.8.0 2025-05-07T19:50:23.8299233Z [conda] cuda-libraries 11.8.0 0 nvidia/label/cuda-11.8.0 2025-05-07T19:50:23.8299774Z [conda] cuda-libraries-dev 11.8.0 0 nvidia/label/cuda-11.8.0 2025-05-07T19:50:23.8300292Z [conda] cuda-nvrtc 11.8.89 0 nvidia/label/cuda-11.8.0 2025-05-07T19:50:23.8300817Z [conda] cuda-nvrtc-dev 11.8.89 0 nvidia/label/cuda-11.8.0 2025-05-07T19:50:23.8301302Z [conda] cuda-nvtx 11.8.86 0 nvidia/label/cuda-11.8.0 2025-05-07T19:50:23.8301820Z [conda] cuda-runtime 11.8.0 0 nvidia/label/cuda-11.8.0 2025-05-07T19:50:23.8302754Z [conda] libcublas 11.11.3.6 0 nvidia/label/cuda-11.8.0 2025-05-07T19:50:23.8303403Z [conda] libcublas-dev 11.11.3.6 0 nvidia/label/cuda-11.8.0 2025-05-07T19:50:23.8303964Z [conda] libcufft 10.9.0.58 0 nvidia/label/cuda-11.8.0 2025-05-07T19:50:23.8304482Z [conda] libcufft-dev 10.9.0.58 0 nvidia/label/cuda-11.8.0 2025-05-07T19:50:23.8305030Z [conda] libcurand 10.3.0.86 0 nvidia/label/cuda-11.8.0 2025-05-07T19:50:23.8305570Z [conda] libcurand-dev 10.3.0.86 0 nvidia/label/cuda-11.8.0 2025-05-07T19:50:23.8306139Z [conda] libcusolver 11.4.1.48 0 nvidia/label/cuda-11.8.0 2025-05-07T19:50:23.8306703Z [conda] libcusolver-dev 11.4.1.48 0 nvidia/label/cuda-11.8.0 2025-05-07T19:50:23.8307255Z [conda] libcusparse 11.7.5.86 0 nvidia/label/cuda-11.8.0 2025-05-07T19:50:23.8307833Z [conda] libcusparse-dev 
11.7.5.86 0 nvidia/label/cuda-11.8.0 2025-05-07T19:50:23.8308354Z [conda] numpy 2.2.5 py313h17eae1a_0 conda-forge 2025-05-07T19:50:23.8309007Z [conda] nvidia-cublas-cu11 11.11.3.6 pypi_0 pypi 2025-05-07T19:50:23.8309568Z [conda] nvidia-cuda-cupti-cu11 11.8.87 pypi_0 pypi 2025-05-07T19:50:23.8310116Z [conda] nvidia-cuda-nvrtc-cu11 11.8.89 pypi_0 pypi 2025-05-07T19:50:23.8310672Z [conda] nvidia-cuda-runtime-cu11 11.8.89 pypi_0 pypi 2025-05-07T19:50:23.8311195Z [conda] nvidia-cudnn-cu11 9.1.0.70 pypi_0 pypi 2025-05-07T19:50:23.8311749Z [conda] nvidia-cufft-cu11 10.9.0.58 pypi_0 pypi 2025-05-07T19:50:23.8312306Z [conda] nvidia-curand-cu11 10.3.0.86 pypi_0 pypi 2025-05-07T19:50:23.8312942Z [conda] nvidia-cusolver-cu11 11.4.1.48 pypi_0 pypi 2025-05-07T19:50:23.8313534Z [conda] nvidia-cusparse-cu11 11.7.5.86 pypi_0 pypi 2025-05-07T19:50:23.8314109Z [conda] nvidia-nccl-cu11 2.21.5 pypi_0 pypi 2025-05-07T19:50:23.8314631Z [conda] nvidia-nvtx-cu11 11.8.86 pypi_0 pypi 2025-05-07T19:50:23.8315197Z [conda] pytorch-triton 3.3.0+git96316ce5 pypi_0 pypi 2025-05-07T19:50:23.8315822Z [conda] torch 2.8.0.dev20250507+cu118 pypi_0 pypi 2025-05-07T19:50:23.8316160Z 2025-05-07T19:50:23.9026112Z ##[group]Run . $PRELUDE; install_cudnn $BUILD_ENV "$(pwd)/build_only/cudnn" 11.8.0 2025-05-07T19:50:23.9026740Z . 
$PRELUDE; install_cudnn $BUILD_ENV "$(pwd)/build_only/cudnn" 11.8.0 2025-05-07T19:50:23.9027297Z shell: bash --noprofile --norc -e -o pipefail {0} 2025-05-07T19:50:23.9027620Z env: 2025-05-07T19:50:23.9027828Z PRELUDE: .github/scripts/setup_env.bash 2025-05-07T19:50:23.9028131Z BUILD_ENV: build_binary 2025-05-07T19:50:23.9028373Z BUILD_TARGET: default 2025-05-07T19:50:23.9028590Z BUILD_VARIANT: cuda 2025-05-07T19:50:23.9028843Z BUILD_CUDA_VERSION: 11.8.0 2025-05-07T19:50:23.9029078Z ##[endgroup] 2025-05-07T19:50:24.3016839Z ################################################################################ 2025-05-07T19:50:24.3017435Z # Install cuDNN 2025-05-07T19:50:24.3017723Z # 2025-05-07T19:50:24.3031922Z # [2025-05-07T19:50:24.302Z] + install_cudnn build_binary /__w/FBGEMM/FBGEMM/build_only/cudnn 11.8.0 2025-05-07T19:50:24.3032651Z ################################################################################ 2025-05-07T19:50:24.3033000Z 2025-05-07T19:50:24.3056507Z [EXEC] [ATTEMPT 0/3] + wget -q --timeout 1 pypi.org -O /dev/null 2025-05-07T19:50:24.3973332Z [CHECK] Network does not appear to be blocked. 2025-05-07T19:50:24.3973833Z [INSTALL] cuda_concat_version is determined to be: 118 2025-05-07T19:50:24.3974263Z + rm -rf /__w/FBGEMM/FBGEMM/build_only/cudnn 2025-05-07T19:50:24.3974504Z 2025-05-07T19:50:24.3989841Z 2025-05-07T19:50:24.3990797Z + mkdir -p /__w/FBGEMM/FBGEMM/build_only/cudnn 2025-05-07T19:50:24.3991600Z 2025-05-07T19:50:24.4010402Z 2025-05-07T19:50:24.4035113Z [INSTALL] Downloading cuDNN to /tmp/tmp.ZiP1L1Wc3w ... 2025-05-07T19:50:24.4058689Z [EXEC] [ATTEMPT 0/3] + wget -q https://developer.download.nvidia.com/compute/redist/cudnn/v8.7.0/local_installers/11.8/cudnn-linux-x86_64-8.7.0.84_cuda11-archive.tar.xz -O cudnn.tar.xz 2025-05-07T19:50:28.3943011Z [INSTALL] Unpacking cuDNN ... 
2025-05-07T19:50:28.3943540Z + tar -xvf cudnn.tar.xz 2025-05-07T19:50:28.3943754Z 2025-05-07T19:50:28.3974053Z cudnn-linux-x86_64-8.7.0.84_cuda11-archive/ 2025-05-07T19:50:28.3974533Z cudnn-linux-x86_64-8.7.0.84_cuda11-archive/lib/ 2025-05-07T19:50:28.3975067Z cudnn-linux-x86_64-8.7.0.84_cuda11-archive/lib/libcudnn_adv_infer_static.a 2025-05-07T19:50:30.7848206Z cudnn-linux-x86_64-8.7.0.84_cuda11-archive/lib/libcudnn_adv_infer_static_v8.a 2025-05-07T19:50:30.7848948Z cudnn-linux-x86_64-8.7.0.84_cuda11-archive/lib/libcudnn_adv_train_static.a 2025-05-07T19:50:33.0410834Z cudnn-linux-x86_64-8.7.0.84_cuda11-archive/lib/libcudnn_adv_train_static_v8.a 2025-05-07T19:50:33.0411963Z cudnn-linux-x86_64-8.7.0.84_cuda11-archive/lib/libcudnn_cnn_infer_static.a 2025-05-07T19:50:41.2492321Z cudnn-linux-x86_64-8.7.0.84_cuda11-archive/lib/libcudnn_cnn_infer_static_v8.a 2025-05-07T19:50:41.2493017Z cudnn-linux-x86_64-8.7.0.84_cuda11-archive/lib/libcudnn_cnn_train_static.a 2025-05-07T19:50:42.8419781Z cudnn-linux-x86_64-8.7.0.84_cuda11-archive/lib/libcudnn_cnn_train_static_v8.a 2025-05-07T19:50:42.8420514Z cudnn-linux-x86_64-8.7.0.84_cuda11-archive/lib/libcudnn_ops_infer_static.a 2025-05-07T19:50:44.5294929Z cudnn-linux-x86_64-8.7.0.84_cuda11-archive/lib/libcudnn_ops_infer_static_v8.a 2025-05-07T19:50:44.5295645Z cudnn-linux-x86_64-8.7.0.84_cuda11-archive/lib/libcudnn_ops_train_static.a 2025-05-07T19:50:46.0396196Z cudnn-linux-x86_64-8.7.0.84_cuda11-archive/lib/libcudnn_ops_train_static_v8.a 2025-05-07T19:50:46.0396859Z cudnn-linux-x86_64-8.7.0.84_cuda11-archive/lib/libcudnn.so.8 2025-05-07T19:50:46.0397341Z cudnn-linux-x86_64-8.7.0.84_cuda11-archive/lib/libcudnn.so 2025-05-07T19:50:46.0397926Z cudnn-linux-x86_64-8.7.0.84_cuda11-archive/lib/libcudnn.so.8.7.0 2025-05-07T19:50:46.0407449Z cudnn-linux-x86_64-8.7.0.84_cuda11-archive/lib/libcudnn_adv_infer.so.8 2025-05-07T19:50:46.0408045Z cudnn-linux-x86_64-8.7.0.84_cuda11-archive/lib/libcudnn_adv_infer.so 2025-05-07T19:50:46.0408661Z 
cudnn-linux-x86_64-8.7.0.84_cuda11-archive/lib/libcudnn_adv_infer.so.8.7.0 2025-05-07T19:50:48.4156968Z cudnn-linux-x86_64-8.7.0.84_cuda11-archive/lib/libcudnn_adv_train.so.8 2025-05-07T19:50:48.4157639Z cudnn-linux-x86_64-8.7.0.84_cuda11-archive/lib/libcudnn_adv_train.so 2025-05-07T19:50:48.4158230Z cudnn-linux-x86_64-8.7.0.84_cuda11-archive/lib/libcudnn_adv_train.so.8.7.0 2025-05-07T19:50:50.6692816Z cudnn-linux-x86_64-8.7.0.84_cuda11-archive/lib/libcudnn_cnn_infer.so 2025-05-07T19:50:50.6693512Z cudnn-linux-x86_64-8.7.0.84_cuda11-archive/lib/libcudnn_cnn_infer.so.8 2025-05-07T19:50:50.6694102Z cudnn-linux-x86_64-8.7.0.84_cuda11-archive/lib/libcudnn_cnn_infer.so.8.7.0 2025-05-07T19:50:59.1988734Z cudnn-linux-x86_64-8.7.0.84_cuda11-archive/lib/libcudnn_cnn_train.so 2025-05-07T19:50:59.1989413Z cudnn-linux-x86_64-8.7.0.84_cuda11-archive/lib/libcudnn_cnn_train.so.8.7.0 2025-05-07T19:51:00.8173385Z cudnn-linux-x86_64-8.7.0.84_cuda11-archive/lib/libcudnn_cnn_train.so.8 2025-05-07T19:51:00.8174028Z cudnn-linux-x86_64-8.7.0.84_cuda11-archive/lib/libcudnn_ops_infer.so.8.7.0 2025-05-07T19:51:02.5031481Z cudnn-linux-x86_64-8.7.0.84_cuda11-archive/lib/libcudnn_ops_infer.so 2025-05-07T19:51:02.5032149Z cudnn-linux-x86_64-8.7.0.84_cuda11-archive/lib/libcudnn_ops_infer.so.8 2025-05-07T19:51:02.5032740Z cudnn-linux-x86_64-8.7.0.84_cuda11-archive/lib/libcudnn_ops_train.so.8.7.0 2025-05-07T19:51:04.0149000Z cudnn-linux-x86_64-8.7.0.84_cuda11-archive/lib/libcudnn_ops_train.so 2025-05-07T19:51:04.0149714Z cudnn-linux-x86_64-8.7.0.84_cuda11-archive/lib/libcudnn_ops_train.so.8 2025-05-07T19:51:04.0150241Z cudnn-linux-x86_64-8.7.0.84_cuda11-archive/include/ 2025-05-07T19:51:04.0150792Z cudnn-linux-x86_64-8.7.0.84_cuda11-archive/include/cudnn_v8.h 2025-05-07T19:51:04.0151360Z cudnn-linux-x86_64-8.7.0.84_cuda11-archive/include/cudnn_adv_infer_v8.h 2025-05-07T19:51:04.0151979Z cudnn-linux-x86_64-8.7.0.84_cuda11-archive/include/cudnn_adv_train_v8.h 2025-05-07T19:51:04.0152536Z 
cudnn-linux-x86_64-8.7.0.84_cuda11-archive/include/cudnn_backend_v8.h 2025-05-07T19:51:04.0153316Z cudnn-linux-x86_64-8.7.0.84_cuda11-archive/include/cudnn_cnn_infer_v8.h 2025-05-07T19:51:04.0153962Z cudnn-linux-x86_64-8.7.0.84_cuda11-archive/include/cudnn_cnn_train_v8.h 2025-05-07T19:51:04.0154568Z cudnn-linux-x86_64-8.7.0.84_cuda11-archive/include/cudnn_ops_infer_v8.h 2025-05-07T19:51:04.0155182Z cudnn-linux-x86_64-8.7.0.84_cuda11-archive/include/cudnn_ops_train_v8.h 2025-05-07T19:51:04.0155771Z cudnn-linux-x86_64-8.7.0.84_cuda11-archive/include/cudnn_version_v8.h 2025-05-07T19:51:04.0156311Z cudnn-linux-x86_64-8.7.0.84_cuda11-archive/include/cudnn.h 2025-05-07T19:51:04.0156899Z cudnn-linux-x86_64-8.7.0.84_cuda11-archive/include/cudnn_adv_infer.h 2025-05-07T19:51:04.0157836Z cudnn-linux-x86_64-8.7.0.84_cuda11-archive/include/cudnn_adv_train.h 2025-05-07T19:51:04.0158451Z cudnn-linux-x86_64-8.7.0.84_cuda11-archive/include/cudnn_backend.h 2025-05-07T19:51:04.0158977Z cudnn-linux-x86_64-8.7.0.84_cuda11-archive/include/cudnn_cnn_infer.h 2025-05-07T19:51:04.0159585Z cudnn-linux-x86_64-8.7.0.84_cuda11-archive/include/cudnn_cnn_train.h 2025-05-07T19:51:04.0160180Z cudnn-linux-x86_64-8.7.0.84_cuda11-archive/include/cudnn_ops_infer.h 2025-05-07T19:51:04.0160734Z cudnn-linux-x86_64-8.7.0.84_cuda11-archive/include/cudnn_ops_train.h 2025-05-07T19:51:04.0161279Z cudnn-linux-x86_64-8.7.0.84_cuda11-archive/include/cudnn_version.h 2025-05-07T19:51:04.0161779Z cudnn-linux-x86_64-8.7.0.84_cuda11-archive/LICENSE 2025-05-07T19:51:04.0176984Z 2025-05-07T19:51:04.0179862Z [INSTALL] Moving cuDNN files to /__w/FBGEMM/FBGEMM/build_only/cudnn ... 
2025-05-07T19:51:04.0180455Z + rm -rf /__w/FBGEMM/FBGEMM/build_only/cudnn/include 2025-05-07T19:51:04.0180741Z 2025-05-07T19:51:04.0198335Z 2025-05-07T19:51:04.0199230Z + rm -rf /__w/FBGEMM/FBGEMM/build_only/cudnn/lib 2025-05-07T19:51:04.0200004Z 2025-05-07T19:51:04.0217792Z 2025-05-07T19:51:04.0218984Z + mv cudnn-linux-x86_64-8.7.0.84_cuda11-archive/include /__w/FBGEMM/FBGEMM/build_only/cudnn 2025-05-07T19:51:04.0220189Z 2025-05-07T19:51:04.0254880Z 2025-05-07T19:51:04.0256453Z + mv cudnn-linux-x86_64-8.7.0.84_cuda11-archive/lib /__w/FBGEMM/FBGEMM/build_only/cudnn 2025-05-07T19:51:04.0257673Z 2025-05-07T19:51:05.6625977Z 2025-05-07T19:51:05.6626680Z /__w/FBGEMM/FBGEMM 2025-05-07T19:51:05.6627480Z + rm -rf /tmp/tmp.ZiP1L1Wc3w 2025-05-07T19:51:05.6628049Z 2025-05-07T19:51:06.0781124Z 2025-05-07T19:51:06.0787718Z [INSTALL] Set environment variables CUDNN_INCLUDE_DIR and CUDNN_LIBRARY ... 2025-05-07T19:51:06.0790527Z + conda env config vars set -n build_binary CUDNN_INCLUDE_DIR=/__w/FBGEMM/FBGEMM/build_only/cudnn/include CUDNN_LIBRARY=/__w/FBGEMM/FBGEMM/build_only/cudnn/lib 2025-05-07T19:51:06.0792516Z 2025-05-07T19:51:06.4864352Z 2025-05-07T19:51:06.4865518Z [INSTALL] Successfully installed cuDNN (for CUDA 11.8.0) 2025-05-07T19:51:06.4931575Z ##[group]Run . $PRELUDE; cd fbgemm_gpu; prepare_fbgemm_gpu_build $BUILD_ENV 2025-05-07T19:51:06.4932173Z . 
$PRELUDE; cd fbgemm_gpu; prepare_fbgemm_gpu_build $BUILD_ENV 2025-05-07T19:51:06.4932776Z shell: bash --noprofile --norc -e -o pipefail {0} 2025-05-07T19:51:06.4933099Z env: 2025-05-07T19:51:06.4933307Z PRELUDE: .github/scripts/setup_env.bash 2025-05-07T19:51:06.4933610Z BUILD_ENV: build_binary 2025-05-07T19:51:06.4933842Z BUILD_TARGET: default 2025-05-07T19:51:06.4934077Z BUILD_VARIANT: cuda 2025-05-07T19:51:06.4934316Z BUILD_CUDA_VERSION: 11.8.0 2025-05-07T19:51:06.4934551Z ##[endgroup] 2025-05-07T19:51:06.9375910Z ################################################################################ 2025-05-07T19:51:06.9376340Z # Prepare FBGEMM-GPU Build 2025-05-07T19:51:06.9377089Z # 2025-05-07T19:51:06.9390897Z # [2025-05-07T19:51:06.938Z] + prepare_fbgemm_gpu_build build_binary 2025-05-07T19:51:06.9391617Z ################################################################################ 2025-05-07T19:51:06.9391893Z 2025-05-07T19:51:06.9404497Z [EXEC] [ATTEMPT 0/3] + wget -q --timeout 1 pypi.org -O /dev/null 2025-05-07T19:51:07.0256420Z [CHECK] Network does not appear to be blocked. 2025-05-07T19:51:07.0269761Z [BUILD] Running git submodules update ... 
2025-05-07T19:51:07.0288494Z [EXEC] [ATTEMPT 0/3] + git submodule sync 2025-05-07T19:51:07.0578155Z Synchronizing submodule url for '../external/asmjit' 2025-05-07T19:51:07.0579624Z Synchronizing submodule url for '../external/composable_kernel' 2025-05-07T19:51:07.0581036Z Synchronizing submodule url for '../external/cpuinfo' 2025-05-07T19:51:07.0582255Z Synchronizing submodule url for '../external/cutlass' 2025-05-07T19:51:07.0583537Z Synchronizing submodule url for '../external/googletest' 2025-05-07T19:51:07.0584338Z Synchronizing submodule url for '../external/hipify_torch' 2025-05-07T19:51:07.0585190Z Synchronizing submodule url for '../external/json' 2025-05-07T19:51:07.0604444Z [EXEC] [ATTEMPT 0/3] + git submodule update --init --recursive 2025-05-07T19:51:07.1025974Z [BUILD] Installing other build dependencies ... 2025-05-07T19:51:07.1043358Z [EXEC] [ATTEMPT 0/3] + conda run --no-capture-output -n build_binary python -m pip install -r requirements.txt 2025-05-07T19:51:09.2050769Z Collecting backports.tarfile (from -r requirements.txt (line 13)) 2025-05-07T19:51:09.2274654Z Downloading backports.tarfile-1.2.0-py3-none-any.whl.metadata (2.0 kB) 2025-05-07T19:51:09.2357969Z Requirement already satisfied: build in /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages (from -r requirements.txt (line 14)) (1.2.2.post1) 2025-05-07T19:51:09.3410480Z Collecting cmake (from -r requirements.txt (line 15)) 2025-05-07T19:51:09.3447632Z Downloading cmake-4.0.0-py3-none-manylinux_2_17_x86_64.manylinux2014_x86_64.whl.metadata (6.3 kB) 2025-05-07T19:51:09.3523607Z Requirement already satisfied: click in /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages (from -r requirements.txt (line 16)) (8.1.8) 2025-05-07T19:51:09.3525616Z Requirement already satisfied: hypothesis in /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages (from -r requirements.txt (line 17)) (6.131.14) 2025-05-07T19:51:09.3528325Z Requirement already 
satisfied: jinja2 in /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages (from -r requirements.txt (line 18)) (3.1.6) 2025-05-07T19:51:09.3531748Z Requirement already satisfied: mpmath==1.3.0 in /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages (from -r requirements.txt (line 19)) (1.3.0) 2025-05-07T19:51:09.3802537Z Collecting ninja (from -r requirements.txt (line 20)) 2025-05-07T19:51:09.3840513Z Downloading ninja-1.11.1.4-py3-none-manylinux_2_12_x86_64.manylinux2010_x86_64.whl.metadata (5.0 kB) 2025-05-07T19:51:09.3918432Z Requirement already satisfied: numpy>=2.0.2 in /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages (from -r requirements.txt (line 21)) (2.2.5) 2025-05-07T19:51:09.4065241Z Collecting pyre-extensions (from -r requirements.txt (line 22)) 2025-05-07T19:51:09.4092824Z Downloading pyre_extensions-0.0.32-py3-none-any.whl.metadata (4.0 kB) 2025-05-07T19:51:09.4160898Z Requirement already satisfied: pyyaml in /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages (from -r requirements.txt (line 23)) (6.0.2) 2025-05-07T19:51:09.4162677Z Requirement already satisfied: scikit-build in /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages (from -r requirements.txt (line 24)) (0.18.1) 2025-05-07T19:51:09.4171904Z Requirement already satisfied: setuptools in /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages (from -r requirements.txt (line 25)) (78.1.1) 2025-05-07T19:51:09.4366842Z Collecting setuptools_git_versioning (from -r requirements.txt (line 26)) 2025-05-07T19:51:09.4414265Z Downloading setuptools_git_versioning-2.1.0-py3-none-any.whl.metadata (6.1 kB) 2025-05-07T19:51:09.4601764Z Collecting tabulate (from -r requirements.txt (line 27)) 2025-05-07T19:51:09.4634514Z Downloading tabulate-0.9.0-py3-none-any.whl.metadata (34 kB) 2025-05-07T19:51:09.4883013Z Collecting patchelf (from -r requirements.txt (line 28)) 2025-05-07T19:51:09.4910680Z 
Downloading patchelf-0.17.2.2-py3-none-manylinux1_x86_64.manylinux_2_5_x86_64.musllinux_1_1_x86_64.whl.metadata (3.5 kB) 2025-05-07T19:51:09.5000086Z Requirement already satisfied: packaging>=19.1 in /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages (from build->-r requirements.txt (line 14)) (25.0) 2025-05-07T19:51:09.5005009Z Requirement already satisfied: pyproject_hooks in /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages (from build->-r requirements.txt (line 14)) (1.2.0) 2025-05-07T19:51:09.5051634Z Requirement already satisfied: attrs>=22.2.0 in /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages (from hypothesis->-r requirements.txt (line 17)) (25.3.0) 2025-05-07T19:51:09.5054526Z Requirement already satisfied: sortedcontainers<3.0.0,>=2.1.0 in /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages (from hypothesis->-r requirements.txt (line 17)) (2.4.0) 2025-05-07T19:51:09.5102751Z Requirement already satisfied: MarkupSafe>=2.0 in /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages (from jinja2->-r requirements.txt (line 18)) (3.0.2) 2025-05-07T19:51:09.5228415Z Collecting typing-inspect (from pyre-extensions->-r requirements.txt (line 22)) 2025-05-07T19:51:09.5259893Z Downloading typing_inspect-0.9.0-py3-none-any.whl.metadata (1.5 kB) 2025-05-07T19:51:09.5332695Z Requirement already satisfied: typing-extensions in /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages (from pyre-extensions->-r requirements.txt (line 22)) (4.13.2) 2025-05-07T19:51:09.5345666Z Requirement already satisfied: distro in /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages (from scikit-build->-r requirements.txt (line 24)) (1.9.0) 2025-05-07T19:51:09.5355213Z Requirement already satisfied: wheel>=0.32.0 in /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages (from scikit-build->-r requirements.txt (line 24)) (0.45.1) 
2025-05-07T19:51:09.5624318Z Collecting mypy-extensions>=0.3.0 (from typing-inspect->pyre-extensions->-r requirements.txt (line 22)) 2025-05-07T19:51:09.5669025Z Downloading mypy_extensions-1.1.0-py3-none-any.whl.metadata (1.1 kB) 2025-05-07T19:51:09.5804089Z Downloading backports.tarfile-1.2.0-py3-none-any.whl (30 kB) 2025-05-07T19:51:09.5911214Z Downloading cmake-4.0.0-py3-none-manylinux_2_17_x86_64.manylinux2014_x86_64.whl (27.9 MB) 2025-05-07T19:51:09.7323336Z ━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━ 27.9/27.9 MB 201.4 MB/s eta 0:00:00 2025-05-07T19:51:09.7358064Z Downloading ninja-1.11.1.4-py3-none-manylinux_2_12_x86_64.manylinux2010_x86_64.whl (422 kB) 2025-05-07T19:51:09.7437732Z Downloading pyre_extensions-0.0.32-py3-none-any.whl (12 kB) 2025-05-07T19:51:09.7510282Z Downloading setuptools_git_versioning-2.1.0-py3-none-any.whl (10 kB) 2025-05-07T19:51:09.7784815Z Downloading tabulate-0.9.0-py3-none-any.whl (35 kB) 2025-05-07T19:51:09.7982629Z Downloading patchelf-0.17.2.2-py3-none-manylinux1_x86_64.manylinux_2_5_x86_64.musllinux_1_1_x86_64.whl (466 kB) 2025-05-07T19:51:09.8070502Z Downloading typing_inspect-0.9.0-py3-none-any.whl (8.8 kB) 2025-05-07T19:51:09.8123365Z Downloading mypy_extensions-1.1.0-py3-none-any.whl (5.0 kB) 2025-05-07T19:51:09.9613186Z Installing collected packages: tabulate, setuptools_git_versioning, patchelf, ninja, mypy-extensions, cmake, backports.tarfile, typing-inspect, pyre-extensions 2025-05-07T19:51:10.7887672Z 2025-05-07T19:51:10.7937865Z WARNING: Running pip as the 'root' user can result in broken permissions and conflicting behaviour with the system package manager, possibly rendering your system unusable. It is recommended to use a virtual environment instead: https://pip.pypa.io/warnings/venv. Use the --root-user-action option if you know what you are doing and want to suppress this warning. 
2025-05-07T19:51:10.7940353Z Successfully installed backports.tarfile-1.2.0 cmake-4.0.0 mypy-extensions-1.1.0 ninja-1.11.1.4 patchelf-0.17.2.2 pyre-extensions-0.0.32 setuptools_git_versioning-2.1.0 tabulate-0.9.0 typing-inspect-0.9.0 2025-05-07T19:51:10.9344841Z ################################################################################ 2025-05-07T19:51:10.9345520Z # Install PyTorch (PyTorch PIP) 2025-05-07T19:51:10.9345839Z # 2025-05-07T19:51:10.9360208Z # [2025-05-07T19:51:10.935Z] + install_triton_pip build_binary 2025-05-07T19:51:10.9361492Z ################################################################################ 2025-05-07T19:51:10.9362184Z 2025-05-07T19:51:10.9362884Z [BUILD] Installing pytorch-triton nightly/3.2.0+git4b3bb1f8 from PIP ... 2025-05-07T19:51:10.9364242Z ################################################################################ 2025-05-07T19:51:10.9365337Z # Install Package From PyTorch PIP: pytorch-triton 2025-05-07T19:51:10.9366881Z # 2025-05-07T19:51:10.9378121Z # [2025-05-07T19:51:10.937Z] + install_from_pytorch_pip build_binary pytorch-triton nightly/3.2.0+git4b3bb1f8 2025-05-07T19:51:10.9378750Z ################################################################################ 2025-05-07T19:51:10.9379027Z 2025-05-07T19:51:10.9395880Z [EXEC] [ATTEMPT 0/3] + wget -q --timeout 1 pypi.org -O /dev/null 2025-05-07T19:51:11.0225265Z [CHECK] Network does not appear to be blocked. 
2025-05-07T19:51:11.0226357Z ################################################################################ 2025-05-07T19:51:11.0227158Z # Prepare PIP Arguments (PyTorch PIP) 2025-05-07T19:51:11.0227450Z # 2025-05-07T19:51:11.0251201Z # [2025-05-07T19:51:11.024Z] + __prepare_pip_arguments pytorch-triton nightly/3.2.0+git4b3bb1f8 2025-05-07T19:51:11.0251820Z ################################################################################ 2025-05-07T19:51:11.0252071Z 2025-05-07T19:51:11.0304511Z [INSTALL] Extracted package (channel, version): (nightly, 3.2.0+git4b3bb1f8) 2025-05-07T19:51:11.0315330Z [INSTALL] Using a non-RELEASE channel: nightly ... 2025-05-07T19:51:11.0316955Z [INSTALL] Extracted the full PIP channel: https://download.pytorch.org/whl/nightly/ 2025-05-07T19:51:11.0321223Z [INSTALL] Extracted the full PIP package: --pre pytorch-triton==3.2.0+git4b3bb1f8 2025-05-07T19:51:11.0328634Z [INSTALL] Attempting to install [pytorch-triton, 3.2.0+git4b3bb1f8] from PyTorch PIP using channel https://download.pytorch.org/whl/nightly/ ... 2025-05-07T19:51:11.0352241Z [EXEC] [ATTEMPT 0/3] + conda run -n build_binary pip install --pre pytorch-triton==3.2.0+git4b3bb1f8 --index-url https://download.pytorch.org/whl/nightly/ 2025-05-07T19:51:16.4880410Z ERROR: pip's dependency resolver does not currently take into account all the packages that are installed. This behaviour is the source of the following dependency conflicts. 2025-05-07T19:51:16.4883401Z torch 2.8.0.dev20250507+cu118 requires pytorch-triton==3.3.0+git96316ce5; platform_system == "Linux" and platform_machine == "x86_64", but you have pytorch-triton 3.2.0+git4b3bb1f8 which is incompatible. 2025-05-07T19:51:16.4885743Z WARNING: Running pip as the 'root' user can result in broken permissions and conflicting behaviour with the system package manager, possibly rendering your system unusable. It is recommended to use a virtual environment instead: https://pip.pypa.io/warnings/venv. 
Use the --root-user-action option if you know what you are doing and want to suppress this warning. 2025-05-07T19:51:16.4887266Z 2025-05-07T19:51:16.4887519Z Looking in indexes: https://download.pytorch.org/whl/nightly/ 2025-05-07T19:51:16.4888012Z Collecting pytorch-triton==3.2.0+git4b3bb1f8 2025-05-07T19:51:16.4888918Z Downloading https://download.pytorch.org/whl/nightly/pytorch_triton-3.2.0%2Bgit4b3bb1f8-cp313-cp313-manylinux_2_27_x86_64.manylinux_2_28_x86_64.whl.metadata (1.3 kB) 2025-05-07T19:51:16.4890334Z Downloading https://download.pytorch.org/whl/nightly/pytorch_triton-3.2.0%2Bgit4b3bb1f8-cp313-cp313-manylinux_2_27_x86_64.manylinux_2_28_x86_64.whl (166.5 MB) 2025-05-07T19:51:16.4891618Z ━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━ 166.5/166.5 MB 195.0 MB/s eta 0:00:00 2025-05-07T19:51:16.4892044Z Installing collected packages: pytorch-triton 2025-05-07T19:51:16.4892420Z Attempting uninstall: pytorch-triton 2025-05-07T19:51:16.4892829Z Found existing installation: pytorch-triton 3.3.0+git96316ce5 2025-05-07T19:51:16.4893285Z Uninstalling pytorch-triton-3.3.0+git96316ce5: 2025-05-07T19:51:16.4893864Z Successfully uninstalled pytorch-triton-3.3.0+git96316ce5 2025-05-07T19:51:16.4894346Z Successfully installed pytorch-triton-3.2.0+git4b3bb1f8 2025-05-07T19:51:16.4894615Z 2025-05-07T19:51:18.6036827Z [CHECK] Python (sub-)package 'triton' found ... 2025-05-07T19:51:18.6037353Z [CHECK] Printing out the pytorch-triton version ... 2025-05-07T19:51:20.6336152Z ################################################################################ 2025-05-07T19:51:20.6336921Z [CHECK] The installed VERSION of pytorch-triton is: 3.2.0 2025-05-07T19:51:20.6337927Z ################################################################################ 2025-05-07T19:51:20.6338168Z 2025-05-07T19:51:22.5955939Z [CHECK] Python (sub-)package 'numpy' found ... 2025-05-07T19:51:24.6549930Z [CHECK] Python (sub-)package 'skbuild' found ... 
2025-05-07T19:51:24.6550962Z [BUILD] Successfully ran git submodules update 2025-05-07T19:51:24.6631763Z ##[group]Run . $PRELUDE; cd fbgemm_gpu; build_fbgemm_gpu_package $BUILD_ENV nightly default/cuda 2025-05-07T19:51:24.6632497Z . $PRELUDE; cd fbgemm_gpu; build_fbgemm_gpu_package $BUILD_ENV nightly default/cuda 2025-05-07T19:51:24.6633188Z shell: bash --noprofile --norc -e -o pipefail {0} 2025-05-07T19:51:24.6633706Z env: 2025-05-07T19:51:24.6633937Z PRELUDE: .github/scripts/setup_env.bash 2025-05-07T19:51:24.6634290Z BUILD_ENV: build_binary 2025-05-07T19:51:24.6634538Z BUILD_TARGET: default 2025-05-07T19:51:24.6634787Z BUILD_VARIANT: cuda 2025-05-07T19:51:24.6635062Z BUILD_CUDA_VERSION: 11.8.0 2025-05-07T19:51:24.6635309Z ##[endgroup] 2025-05-07T19:51:25.0891150Z [BUILD] BUILD_TARGET_VARIANT: default/cuda 2025-05-07T19:51:25.0892207Z [BUILD] Extracted build target: default 2025-05-07T19:51:25.0893123Z [BUILD] Extracted build variant: cuda 2025-05-07T19:51:26.9147791Z /github/home/miniconda/envs/build_binary/bin/cc 2025-05-07T19:51:26.9148121Z 2025-05-07T19:51:26.9888847Z [CHECK] Binary cc found in PATH 2025-05-07T19:51:28.8277202Z /github/home/miniconda/envs/build_binary/bin/gcc 2025-05-07T19:51:28.8277596Z 2025-05-07T19:51:28.8916267Z [CHECK] Binary gcc found in PATH 2025-05-07T19:51:30.7183647Z /github/home/miniconda/envs/build_binary/bin/c++ 2025-05-07T19:51:30.7184041Z 2025-05-07T19:51:30.7828161Z [CHECK] Binary c++ found in PATH 2025-05-07T19:51:32.5916138Z /github/home/miniconda/envs/build_binary/bin/g++ 2025-05-07T19:51:32.5916445Z 2025-05-07T19:51:32.6664267Z [CHECK] Binary g++ found in PATH 2025-05-07T19:51:34.5656918Z [BUILD] Extracted and set Python tag: py313 2025-05-07T19:51:34.5659049Z [BUILD] Extracted and set Python platform name: manylinux_2_28_x86_64 2025-05-07T19:51:34.5882396Z core = 24 2025-05-07T19:51:34.6099971Z sockets = 2 2025-05-07T19:51:34.6100878Z [BUILD] Set multicore run option for setup.py: -j 48 2025-05-07T19:51:34.6101957Z 
[CHECK] LD_LIBRARY_PATH = 2025-05-07T19:51:34.6103152Z [BUILD] Running pre-build cleanups ... 2025-05-07T19:51:34.6104053Z + rm -rf dist 2025-05-07T19:51:34.6104452Z 2025-05-07T19:51:34.6119420Z 2025-05-07T19:51:34.6119834Z + conda run --no-capture-output -n build_binary python setup.py clean 2025-05-07T19:51:34.6120249Z 2025-05-07T19:51:37.7234389Z INFO:root:running clean 2025-05-07T19:51:37.7234799Z [SETUP.PY] ARGV: ['setup.py', 'clean'] 2025-05-07T19:51:37.7235932Z [SETUP.PY] Parsed setup.py arguments: Namespace(verbose=False, debug=False, dryrun=False, build_target='default', build_variant='cuda', package_channel='nightly', nvml_lib_path=None, nccl_lib_path=None, use_fb_only=False, cxxprefix=None) 2025-05-07T19:51:37.7237127Z [SETUP.PY] Other arguments: ['clean'] 2025-05-07T19:51:37.7237631Z [SETUP.PY] CUDA CUB directory environment variable not set. Using default CUB location. 2025-05-07T19:51:37.7238253Z [SETUP.PY] Using CUDA = /github/home/miniconda/envs/build_binary 2025-05-07T19:51:37.7238871Z [SETUP.PY] Generating version file at: /__w/FBGEMM/FBGEMM/fbgemm_gpu/fbgemm_gpu/docs/version.py 2025-05-07T19:51:37.7239623Z [SETUP.PY] Setting the FBGEMM build target: default ... 2025-05-07T19:51:37.7240182Z [SETUP.PY] Setting the FBGEMM build variant: cuda ... 2025-05-07T19:51:37.7241374Z [SETUP.PY] Passing CMake arguments: ['-DCMAKE_PREFIX_PATH=/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch', '-D_GLIBCXX_USE_CXX11_ABI=1', '-DFBGEMM_BUILD_TARGET=default', '-DFBGEMM_BUILD_VARIANT=cuda', "-DCMAKE_C_FLAGS=''", "-DCMAKE_CXX_FLAGS=''"] 2025-05-07T19:51:38.0771547Z 2025-05-07T19:51:38.0772373Z [BUILD] Printing git status ... 2025-05-07T19:51:38.0773225Z + git status 2025-05-07T19:51:38.0774103Z 2025-05-07T19:51:38.7970517Z HEAD detached at pull/4066/merge 2025-05-07T19:51:38.7971468Z Untracked files: 2025-05-07T19:51:38.7972368Z (use "git add <file>..."
to include in what will be committed) 2025-05-07T19:51:38.7973459Z ../build_only/ 2025-05-07T19:51:38.7974067Z ../collect_env.py 2025-05-07T19:51:38.7974781Z fbgemm_gpu/docs/version.py 2025-05-07T19:51:38.7975285Z 2025-05-07T19:51:38.7976585Z nothing added to commit but untracked files present (use "git add" to track) 2025-05-07T19:51:38.7976942Z 2025-05-07T19:51:38.7977039Z + git diff 2025-05-07T19:51:38.7977165Z 2025-05-07T19:51:38.8254308Z 2025-05-07T19:51:38.8255124Z ################################################################################ 2025-05-07T19:51:38.8256201Z # Configure FBGEMM-GPU Build 2025-05-07T19:51:38.8256988Z # 2025-05-07T19:51:38.8272424Z # [2025-05-07T19:51:38.826Z] + __configure_fbgemm_gpu_build 2025-05-07T19:51:38.8273019Z ################################################################################ 2025-05-07T19:51:38.8273501Z 2025-05-07T19:51:38.8278809Z [BUILD] Setting the build target: default ... 2025-05-07T19:51:38.8279934Z [BUILD] Configuring build as CUDA variant (this is the default behavior) ... 
2025-05-07T19:51:40.6796620Z /github/home/miniconda/envs/build_binary/bin/nvcc 2025-05-07T19:51:40.6797011Z 2025-05-07T19:51:40.7538989Z [CHECK] Binary nvcc found in PATH 2025-05-07T19:51:42.6202326Z /__w/FBGEMM/FBGEMM/build_only/cudnn/include 2025-05-07T19:51:42.6202653Z 2025-05-07T19:51:42.6936250Z [CHECK] Environment variable CUDNN_INCLUDE_DIR is defined in the Conda environment 2025-05-07T19:51:44.5569896Z /__w/FBGEMM/FBGEMM/build_only/cudnn/lib 2025-05-07T19:51:44.5570690Z 2025-05-07T19:51:44.6295226Z [CHECK] Environment variable CUDNN_LIBRARY is defined in the Conda environment 2025-05-07T19:51:46.4898635Z /github/home/miniconda/envs/build_binary/lib/stubs/libnvidia-ml.so 2025-05-07T19:51:46.4899103Z 2025-05-07T19:51:46.5470488Z [CHECK] Environment variable NVML_LIB_PATH is defined in the Conda environment 2025-05-07T19:51:48.4658083Z [BUILD] Using the default architectures for CUDA nvcc: NVIDIA (R) Cuda compiler driver 2025-05-07T19:51:48.4658732Z Copyright (c) 2005-2022 NVIDIA Corporation 2025-05-07T19:51:48.4659133Z Built on Wed_Sep_21_10:33:58_PDT_2022 2025-05-07T19:51:48.4659496Z Cuda compilation tools, release 11.8, V11.8.89 2025-05-07T19:51:48.4659971Z Build cuda_11.8.r11.8/compiler.31833905_0 ... 2025-05-07T19:51:48.4660368Z [BUILD] Setting the following CUDA targets: 7.0;8.0 2025-05-07T19:51:48.4660789Z [BUILD] Looking up NVML filepath ... 2025-05-07T19:51:50.3485328Z [BUILD] Looking up NCCL filepath ... 2025-05-07T19:51:54.1560853Z [BUILD] Setting NVCC verbose mode ... 2025-05-07T19:51:54.1561386Z + conda env config vars set -n build_binary NVCC_VERBOSE=1 2025-05-07T19:51:54.1561791Z 2025-05-07T19:51:54.5838286Z 2025-05-07T19:51:54.5838959Z [BUILD] Setting CUDA build args ... 2025-05-07T19:51:54.5849929Z [BUILD] Looking up CUDA version ... 
2025-05-07T19:51:58.3344636Z + conda run -n build_binary c++ --version | grep -i clang 2025-05-07T19:51:58.3345638Z 2025-05-07T19:52:00.2142976Z clang version 16.0.6 (https://github.com/conda-forge/clangdev-feedstock db6970f6bb85e49860ed8bab43ebf165b5c55cc4) 2025-05-07T19:52:00.2143922Z Configuration file: /github/home/miniconda/envs/build_binary/bin/x86_64-conda-linux-gnu-clang++.cfg 2025-05-07T19:52:00.2144557Z 2025-05-07T19:52:00.2145099Z [BUILD] Setting NVCC flags ... 2025-05-07T19:52:00.2146126Z + conda env config vars set -n build_binary NVCC_PREPEND_FLAGS="-std=c++17 -Xcompiler -std=c++17 -Xcompiler -stdlib=libstdc++ -ccbin /github/home/miniconda/envs/build_binary/bin/c++ -allow-unsupported-compiler" 2025-05-07T19:52:00.2146975Z 2025-05-07T19:52:00.6233450Z 2025-05-07T19:52:00.6233915Z + conda run -n build_binary printenv NVCC_PREPEND_FLAGS 2025-05-07T19:52:00.6234258Z 2025-05-07T19:52:02.4401047Z -std=c++17 -Xcompiler -std=c++17 -Xcompiler -stdlib=libstdc++ -ccbin /github/home/miniconda/envs/build_binary/bin/c++ -allow-unsupported-compiler 2025-05-07T19:52:02.4403829Z 2025-05-07T19:52:02.4964081Z 2025-05-07T19:52:02.4964589Z [BUILD] Setting CUDA build args ... 
2025-05-07T19:52:02.4965613Z + conda run -n build_binary c++ --version 2025-05-07T19:52:02.4966252Z 2025-05-07T19:52:04.3081742Z clang version 16.0.6 (https://github.com/conda-forge/clangdev-feedstock db6970f6bb85e49860ed8bab43ebf165b5c55cc4) 2025-05-07T19:52:04.3082927Z Target: x86_64-conda-linux-gnu 2025-05-07T19:52:04.3083251Z Thread model: posix 2025-05-07T19:52:04.3083595Z InstalledDir: /github/home/miniconda/envs/build_binary/bin 2025-05-07T19:52:04.3084326Z Configuration file: /github/home/miniconda/envs/build_binary/bin/x86_64-conda-linux-gnu-clang++.cfg 2025-05-07T19:52:04.3084768Z 2025-05-07T19:52:04.3656183Z 2025-05-07T19:52:04.3657165Z + conda run -n build_binary c++ --version | grep -i clang 2025-05-07T19:52:04.3658015Z 2025-05-07T19:52:06.2753473Z clang version 16.0.6 (https://github.com/conda-forge/clangdev-feedstock db6970f6bb85e49860ed8bab43ebf165b5c55cc4) 2025-05-07T19:52:06.2754438Z Configuration file: /github/home/miniconda/envs/build_binary/bin/x86_64-conda-linux-gnu-clang++.cfg 2025-05-07T19:52:06.2754910Z 2025-05-07T19:52:06.2755107Z [BUILD] Clang is available; configuring for Clang-based build ... 2025-05-07T19:52:08.1737393Z .github/scripts/fbgemm_gpu_build.bash: line 370: [: : integer expression expected 2025-05-07T19:52:08.1738010Z [BUILD] Enabling debug features in the build ... 
2025-05-07T19:52:08.1739935Z [BUILD] FBGEMM_GPU build arguments have been set: --verbose --build-target=default --build-variant=cuda --nvml_lib_path=/github/home/miniconda/envs/build_binary/lib/stubs/libnvidia-ml.so --nccl_lib_path=/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/lib/libnccl.so.2 -DTORCH_CUDA_ARCH_LIST='7.0;8.0' -DCMAKE_CXX_STANDARD=17 --cxxprefix=/github/home/miniconda/envs/build_binary --debug 2025-05-07T19:52:08.1741849Z ################################################################################ 2025-05-07T19:52:08.1742231Z # Build FBGEMM-GPU Package (Wheel) 2025-05-07T19:52:08.1742510Z # 2025-05-07T19:52:08.1760931Z # [2025-05-07T19:52:08.175Z] + build_fbgemm_gpu_package build_binary nightly default/cuda 2025-05-07T19:52:08.1762526Z ################################################################################ 2025-05-07T19:52:08.1762914Z 2025-05-07T19:52:08.1763214Z [BUILD] Building FBGEMM wheel (TARGET=default, VARIANT=cuda) ... 
2025-05-07T19:52:08.1767350Z + conda run --no-capture-output -n build_binary python -m build --wheel --no-isolation --config-setting=--build-option=--verbose --config-setting=--build-option=--build-target=default --config-setting=--build-option=--build-variant=cuda --config-setting=--build-option=--nvml_lib_path=/github/home/miniconda/envs/build_binary/lib/stubs/libnvidia-ml.so --config-setting=--build-option=--nccl_lib_path=/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/lib/libnccl.so.2 --config-setting=--build-option=-DTORCH_CUDA_ARCH_LIST='7.0;8.0' --config-setting=--build-option=-DCMAKE_CXX_STANDARD=17 --config-setting=--build-option=--cxxprefix=/github/home/miniconda/envs/build_binary --config-setting=--build-option=--debug --config-setting=--build-option=--package_channel=nightly --config-setting=--build-option=--python-tag=py313 --config-setting=--build-option=--plat-name=manylinux_2_28_x86_64 2025-05-07T19:52:08.1771542Z 2025-05-07T19:52:10.0555389Z * Getting build dependencies for wheel... 
2025-05-07T19:52:11.3352124Z INFO:root:running egg_info 2025-05-07T19:52:11.3388646Z INFO:root:creating fbgemm_gpu_nightly.egg-info 2025-05-07T19:52:11.3389724Z INFO:root:writing fbgemm_gpu_nightly.egg-info/PKG-INFO 2025-05-07T19:52:11.3390415Z INFO:root:writing dependency_links to fbgemm_gpu_nightly.egg-info/dependency_links.txt 2025-05-07T19:52:11.3394395Z INFO:root:writing requirements to fbgemm_gpu_nightly.egg-info/requires.txt 2025-05-07T19:52:11.3395041Z INFO:root:writing top-level names to fbgemm_gpu_nightly.egg-info/top_level.txt 2025-05-07T19:52:11.3396023Z INFO:root:writing manifest file 'fbgemm_gpu_nightly.egg-info/SOURCES.txt' 2025-05-07T19:52:11.3460004Z INFO:root:reading manifest file 'fbgemm_gpu_nightly.egg-info/SOURCES.txt' 2025-05-07T19:52:11.3469752Z INFO:root:writing manifest file 'fbgemm_gpu_nightly.egg-info/SOURCES.txt' 2025-05-07T19:52:11.3471488Z [SETUP.PY] ARGV: ['setup.py', 'egg_info'] 2025-05-07T19:52:11.3473046Z [SETUP.PY] Parsed setup.py arguments: Namespace(verbose=False, debug=False, dryrun=False, build_target='default', build_variant='cuda', package_channel='nightly', nvml_lib_path=None, nccl_lib_path=None, use_fb_only=False, cxxprefix=None) 2025-05-07T19:52:11.3474198Z [SETUP.PY] Other arguments: ['egg_info'] 2025-05-07T19:52:11.3474747Z [SETUP.PY] CUDA CUB directory environment variable not set. Using default CUB location. 2025-05-07T19:52:11.3475372Z [SETUP.PY] Using CUDA = /github/home/miniconda/envs/build_binary 2025-05-07T19:52:11.3475993Z [SETUP.PY] Generating version file at: /__w/FBGEMM/FBGEMM/fbgemm_gpu/fbgemm_gpu/docs/version.py 2025-05-07T19:52:11.3476627Z [SETUP.PY] Setting the FBGEMM build target: default ... 2025-05-07T19:52:11.3477060Z [SETUP.PY] Setting the FBGEMM build variant: cuda ... 
2025-05-07T19:52:11.3478384Z [SETUP.PY] Passing CMake arguments: ['-DCMAKE_PREFIX_PATH=/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch', '-D_GLIBCXX_USE_CXX11_ABI=1', '-DFBGEMM_BUILD_TARGET=default', '-DFBGEMM_BUILD_VARIANT=cuda', "-DCMAKE_C_FLAGS=''", "-DCMAKE_CXX_FLAGS=''"] 2025-05-07T19:52:11.6683688Z * Building wheel... 2025-05-07T19:52:12.9535151Z [SETUP.PY] ARGV: ['setup.py', 'bdist_wheel', '--dist-dir', '/__w/FBGEMM/FBGEMM/fbgemm_gpu/dist/.tmp-q69au0_e', '--verbose', '--build-target=default', '--build-variant=cuda', '--nvml_lib_path=/github/home/miniconda/envs/build_binary/lib/stubs/libnvidia-ml.so', '--nccl_lib_path=/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/lib/libnccl.so.2', '-DTORCH_CUDA_ARCH_LIST=7.0;8.0', '-DCMAKE_CXX_STANDARD=17', '--cxxprefix=/github/home/miniconda/envs/build_binary', '--debug', '--package_channel=nightly', '--python-tag=py313', '--plat-name=manylinux_2_28_x86_64'] 2025-05-07T19:52:12.9539214Z [SETUP.PY] Parsed setup.py arguments: Namespace(verbose=True, debug=True, dryrun=False, build_target='default', build_variant='cuda', package_channel='nightly', nvml_lib_path='/github/home/miniconda/envs/build_binary/lib/stubs/libnvidia-ml.so', nccl_lib_path='/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/lib/libnccl.so.2', use_fb_only=False, cxxprefix='/github/home/miniconda/envs/build_binary') 2025-05-07T19:52:12.9541639Z [SETUP.PY] Other arguments: ['bdist_wheel', '--dist-dir', '/__w/FBGEMM/FBGEMM/fbgemm_gpu/dist/.tmp-q69au0_e', '-DTORCH_CUDA_ARCH_LIST=7.0;8.0', '-DCMAKE_CXX_STANDARD=17', '--python-tag=py313', '--plat-name=manylinux_2_28_x86_64'] 2025-05-07T19:52:12.9542704Z [SETUP.PY] CUDA CUB directory environment variable not set. Using default CUB location. 
2025-05-07T19:52:12.9543268Z [SETUP.PY] Using CUDA = /github/home/miniconda/envs/build_binary 2025-05-07T19:52:12.9543837Z [SETUP.PY] Generating version file at: /__w/FBGEMM/FBGEMM/fbgemm_gpu/fbgemm_gpu/docs/version.py 2025-05-07T19:52:12.9544386Z [SETUP.PY] Setting the FBGEMM build target: default ... 2025-05-07T19:52:12.9544789Z [SETUP.PY] Setting the FBGEMM build variant: cuda ... 2025-05-07T19:52:12.9550350Z [SETUP.PY] Passing CMake arguments: ['-DCMAKE_PREFIX_PATH=/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch', '-D_GLIBCXX_USE_CXX11_ABI=1', '-DCMAKE_VERBOSE_MAKEFILE=ON', '-DCMAKE_EXPORT_COMPILE_COMMANDS=TRUE', '-DFBGEMM_BUILD_TARGET=default', '-DFBGEMM_BUILD_VARIANT=cuda', '-DNVML_LIB_PATH=/github/home/miniconda/envs/build_binary/lib/stubs/libnvidia-ml.so', '-DNCCL_INCLUDE_DIRS=/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include', '-DNCCL_LIBRARIES=/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/lib/libnccl.so.2', '-DCMAKE_C_COMPILER=/github/home/miniconda/envs/build_binary/bin/cc', '-DCMAKE_CXX_COMPILER=/github/home/miniconda/envs/build_binary/bin/c++', "-DCMAKE_C_FLAGS='-DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/lib -fopenmp=libgomp -stdlib=libstdc++ -I/github/home/miniconda/envs/build_binary/include'", "-DCMAKE_CXX_FLAGS='-DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/lib -fopenmp=libgomp -stdlib=libstdc++ -I/github/home/miniconda/envs/build_binary/include'", '-DTORCH_CUDA_ARCH_LIST=7.0;8.0', '-DCMAKE_CXX_STANDARD=17'] 2025-05-07T19:52:12.9570907Z 2025-05-07T19:52:12.9570914Z 2025-05-07T19:52:12.9571132Z -------------------------------------------------------------------------------- 2025-05-07T19:52:12.9571518Z -- Trying 'Ninja' generator 2025-05-07T19:52:12.9571799Z -------------------------------- 
2025-05-07T19:52:12.9572059Z --------------------------- 2025-05-07T19:52:12.9572332Z ---------------------- 2025-05-07T19:52:12.9572587Z ----------------- 2025-05-07T19:52:12.9572839Z ------------ 2025-05-07T19:52:12.9573042Z ------- 2025-05-07T19:52:12.9573249Z -- 2025-05-07T19:52:12.9946461Z CMake Deprecation Warning at CMakeLists.txt:1 (cmake_minimum_required): 2025-05-07T19:52:12.9948083Z Not searching for unused variables given on the command line. 2025-05-07T19:52:12.9949703Z Compatibility with CMake < 3.10 will be removed from a future version of 2025-05-07T19:52:12.9950776Z CMake. 2025-05-07T19:52:12.9950924Z 2025-05-07T19:52:12.9951150Z Update the VERSION argument <min> value. Or, use the <min>...<max> syntax 2025-05-07T19:52:12.9951687Z to tell CMake that the project requires at least <min> but has been updated 2025-05-07T19:52:12.9952159Z to work with policies introduced by <max> or earlier. 2025-05-07T19:52:12.9952400Z 2025-05-07T19:52:12.9952405Z 2025-05-07T19:52:13.0794386Z -- The C compiler identification is Clang 16.0.6 2025-05-07T19:52:13.0882398Z -- Detecting C compiler ABI info 2025-05-07T19:52:13.2163802Z -- Detecting C compiler ABI info - done 2025-05-07T19:52:13.2293755Z -- Check for working C compiler: /github/home/miniconda/envs/build_binary/bin/cc - skipped 2025-05-07T19:52:13.2295360Z -- Detecting C compile features 2025-05-07T19:52:13.2296651Z -- Detecting C compile features - done 2025-05-07T19:52:13.3757321Z -- The CXX compiler identification is Clang 16.0.6 2025-05-07T19:52:13.3832035Z -- Detecting CXX compiler ABI info 2025-05-07T19:52:13.5175540Z -- Detecting CXX compiler ABI info - done 2025-05-07T19:52:13.5306113Z -- Check for working CXX compiler: /github/home/miniconda/envs/build_binary/bin/c++ - skipped 2025-05-07T19:52:13.5308531Z -- Detecting CXX compile features 2025-05-07T19:52:13.5316794Z -- Detecting CXX compile features - done 2025-05-07T19:52:13.5333870Z -- Configuring done (0.6s) 2025-05-07T19:52:13.5385396Z -- Generating done (0.0s) 
2025-05-07T19:52:13.5397457Z -- Build files have been written to: /__w/FBGEMM/FBGEMM/fbgemm_gpu/_cmake_test_compile/build 2025-05-07T19:52:13.5435103Z -- 2025-05-07T19:52:13.5435732Z ------- 2025-05-07T19:52:13.5436467Z ------------ 2025-05-07T19:52:13.5436703Z ----------------- 2025-05-07T19:52:13.5436966Z ---------------------- 2025-05-07T19:52:13.5437214Z --------------------------- 2025-05-07T19:52:13.5437513Z -------------------------------- 2025-05-07T19:52:13.5437820Z -- Trying 'Ninja' generator - success 2025-05-07T19:52:13.5438243Z -------------------------------------------------------------------------------- 2025-05-07T19:52:13.5438539Z 2025-05-07T19:52:13.5451435Z Configuring Project 2025-05-07T19:52:13.5452770Z Working directory: 2025-05-07T19:52:13.5453328Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build 2025-05-07T19:52:13.5453825Z Command: 2025-05-07T19:52:13.5473091Z /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/cmake/data/bin/cmake /__w/FBGEMM/FBGEMM/fbgemm_gpu -G Ninja -DCMAKE_MAKE_PROGRAM:FILEPATH=/github/home/miniconda/envs/build_binary/bin/ninja --no-warn-unused-cli -DCMAKE_INSTALL_PREFIX:PATH=/__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-install -DPYTHON_VERSION_STRING:STRING=3.13.2 -DSKBUILD:INTERNAL=TRUE -DCMAKE_MODULE_PATH:PATH=/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/skbuild/resources/cmake -DPYTHON_EXECUTABLE:PATH=/github/home/miniconda/envs/build_binary/bin/python -DPYTHON_INCLUDE_DIR:PATH=/github/home/miniconda/envs/build_binary/include/python3.13 -DPYTHON_LIBRARY:PATH=/github/home/miniconda/envs/build_binary/lib/libpython3.13.so -DPython_EXECUTABLE:PATH=/github/home/miniconda/envs/build_binary/bin/python -DPython_ROOT_DIR:PATH=/github/home/miniconda/envs/build_binary -DPython_FIND_REGISTRY:STRING=NEVER -DPython_INCLUDE_DIR:PATH=/github/home/miniconda/envs/build_binary/include/python3.13 
-DPython_NumPy_INCLUDE_DIRS:PATH=/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/numpy/_core/include -DPython3_EXECUTABLE:PATH=/github/home/miniconda/envs/build_binary/bin/python -DPython3_ROOT_DIR:PATH=/github/home/miniconda/envs/build_binary -DPython3_FIND_REGISTRY:STRING=NEVER -DPython3_INCLUDE_DIR:PATH=/github/home/miniconda/envs/build_binary/include/python3.13 -DPython3_NumPy_INCLUDE_DIRS:PATH=/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/numpy/_core/include -DCMAKE_MAKE_PROGRAM:FILEPATH=/github/home/miniconda/envs/build_binary/bin/ninja -DCMAKE_AR=/github/home/miniconda/envs/build_binary/bin/x86_64-conda-linux-gnu-ar -DCMAKE_CXX_COMPILER_AR=/github/home/miniconda/envs/build_binary/bin/x86_64-conda-linux-gnu-gcc-ar -DCMAKE_C_COMPILER_AR=/github/home/miniconda/envs/build_binary/bin/x86_64-conda-linux-gnu-gcc-ar -DCMAKE_RANLIB=/github/home/miniconda/envs/build_binary/bin/x86_64-conda-linux-gnu-ranlib -DCMAKE_CXX_COMPILER_RANLIB=/github/home/miniconda/envs/build_binary/bin/x86_64-conda-linux-gnu-gcc-ranlib -DCMAKE_C_COMPILER_RANLIB=/github/home/miniconda/envs/build_binary/bin/x86_64-conda-linux-gnu-gcc-ranlib -DCMAKE_LINKER=/github/home/miniconda/envs/build_binary/bin/x86_64-conda-linux-gnu-ld -DCMAKE_STRIP=/github/home/miniconda/envs/build_binary/bin/x86_64-conda-linux-gnu-strip -DCMAKE_BUILD_TYPE=Release -DCMAKE_PREFIX_PATH=/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch -D_GLIBCXX_USE_CXX11_ABI=1 -DCMAKE_VERBOSE_MAKEFILE=ON -DCMAKE_EXPORT_COMPILE_COMMANDS=TRUE -DFBGEMM_BUILD_TARGET=default -DFBGEMM_BUILD_VARIANT=cuda -DNVML_LIB_PATH=/github/home/miniconda/envs/build_binary/lib/stubs/libnvidia-ml.so -DNCCL_INCLUDE_DIRS=/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -DNCCL_LIBRARIES=/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/lib/libnccl.so.2 -DCMAKE_C_COMPILER=/github/home/miniconda/envs/build_binary/bin/cc 
-DCMAKE_CXX_COMPILER=/github/home/miniconda/envs/build_binary/bin/c++ '-DCMAKE_C_FLAGS='"'"'-DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/lib -fopenmp=libgomp -stdlib=libstdc++ -I/github/home/miniconda/envs/build_binary/include'"'"'' '-DCMAKE_CXX_FLAGS='"'"'-DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/lib -fopenmp=libgomp -stdlib=libstdc++ -I/github/home/miniconda/envs/build_binary/include'"'"'' '-DTORCH_CUDA_ARCH_LIST=7.0;8.0' -DCMAKE_CXX_STANDARD=17 '-DTORCH_CUDA_ARCH_LIST=7.0;8.0' -DCMAKE_CXX_STANDARD=17 -DCMAKE_AR=/github/home/miniconda/envs/build_binary/bin/x86_64-conda-linux-gnu-ar -DCMAKE_CXX_COMPILER_AR=/github/home/miniconda/envs/build_binary/bin/x86_64-conda-linux-gnu-gcc-ar -DCMAKE_C_COMPILER_AR=/github/home/miniconda/envs/build_binary/bin/x86_64-conda-linux-gnu-gcc-ar -DCMAKE_RANLIB=/github/home/miniconda/envs/build_binary/bin/x86_64-conda-linux-gnu-ranlib -DCMAKE_CXX_COMPILER_RANLIB=/github/home/miniconda/envs/build_binary/bin/x86_64-conda-linux-gnu-gcc-ranlib -DCMAKE_C_COMPILER_RANLIB=/github/home/miniconda/envs/build_binary/bin/x86_64-conda-linux-gnu-gcc-ranlib -DCMAKE_LINKER=/github/home/miniconda/envs/build_binary/bin/x86_64-conda-linux-gnu-ld -DCMAKE_STRIP=/github/home/miniconda/envs/build_binary/bin/x86_64-conda-linux-gnu-strip -DCMAKE_BUILD_TYPE=Release 2025-05-07T19:52:13.5492814Z 2025-05-07T19:52:13.5862022Z 2025-05-07T19:52:13.5862034Z 2025-05-07T19:52:13.5862474Z ================================================================================ 2025-05-07T19:52:13.5862902Z Default C compiler flags 2025-05-07T19:52:13.5864761Z (values may be overridden by CMAKE_CXX_STANDARD and CXX_STANDARD): 2025-05-07T19:52:13.5865193Z 2025-05-07T19:52:13.5866122Z -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/lib 
-fopenmp=libgomp -stdlib=libstdc++ -I/github/home/miniconda/envs/build_binary/include 2025-05-07T19:52:13.5867200Z ================================================================================ 2025-05-07T19:52:13.5867465Z 2025-05-07T19:52:13.5867469Z 2025-05-07T19:52:13.5867472Z 2025-05-07T19:52:13.5867583Z ================================================================================ 2025-05-07T19:52:13.5867964Z Default C++ compiler flags 2025-05-07T19:52:13.5868339Z (values may be overridden by CMAKE_CXX_STANDARD and CXX_STANDARD): 2025-05-07T19:52:13.5868695Z 2025-05-07T19:52:13.5869577Z -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/lib -fopenmp=libgomp -stdlib=libstdc++ -I/github/home/miniconda/envs/build_binary/include 2025-05-07T19:52:13.5870700Z ================================================================================ 2025-05-07T19:52:13.5870946Z 2025-05-07T19:52:13.5870950Z 2025-05-07T19:52:13.5870954Z 2025-05-07T19:52:13.5871071Z ================================================================================ 2025-05-07T19:52:13.5871419Z AVX2_FLAGS: 2025-05-07T19:52:13.5871540Z 2025-05-07T19:52:13.5871620Z -mavx2 2025-05-07T19:52:13.5871843Z -mf16c 2025-05-07T19:52:13.5872051Z -mfma 2025-05-07T19:52:13.5872265Z -fopenmp 2025-05-07T19:52:13.5872508Z ================================================================================ 2025-05-07T19:52:13.5872733Z 2025-05-07T19:52:13.5872737Z 2025-05-07T19:52:13.5872741Z 2025-05-07T19:52:13.5872957Z ================================================================================ 2025-05-07T19:52:13.5873335Z AVX512_FLAGS: 2025-05-07T19:52:13.5873473Z 2025-05-07T19:52:13.5873560Z -mavx2 2025-05-07T19:52:13.5873802Z -mf16c 2025-05-07T19:52:13.5874014Z -mfma 2025-05-07T19:52:13.5874259Z -mavx512f 2025-05-07T19:52:13.5874460Z -mavx512bw 2025-05-07T19:52:13.5874669Z -mavx512dq 2025-05-07T19:52:13.5874872Z -mavx512vl 
2025-05-07T19:52:13.5875059Z -fopenmp 2025-05-07T19:52:13.5875413Z Not searching for unused variables given on the command line. 2025-05-07T19:52:13.5875846Z ================================================================================ 2025-05-07T19:52:13.5876100Z 2025-05-07T19:52:13.5876103Z 2025-05-07T19:52:13.5876108Z 2025-05-07T19:52:13.5876230Z ================================================================================ 2025-05-07T19:52:13.5876639Z The project is built using scikit-build 2025-05-07T19:52:13.5876965Z ================================================================================ 2025-05-07T19:52:13.5877189Z 2025-05-07T19:52:13.5877193Z 2025-05-07T19:52:13.5877213Z 2025-05-07T19:52:13.5877334Z ================================================================================ 2025-05-07T19:52:13.5877643Z Build Settings 2025-05-07T19:52:13.5877809Z 2025-05-07T19:52:13.5877923Z FBGEMM_BUILD_TARGET : default 2025-05-07T19:52:13.5878251Z FBGEMM_BUILD_VARIANT : cuda 2025-05-07T19:52:13.5878444Z 2025-05-07T19:52:13.5878550Z NVCC_VERBOSE : 2025-05-07T19:52:13.5878845Z CUDNN_INCLUDE_DIR : 2025-05-07T19:52:13.5879102Z CUDNN_LIBRARY : 2025-05-07T19:52:13.5879557Z NVML_LIB_PATH : /github/home/miniconda/envs/build_binary/lib/stubs/libnvidia-ml.so 2025-05-07T19:52:13.5880043Z TORCH_CUDA_ARCH_LIST : 7.0 2025-05-07T19:52:13.5880465Z 8.0 2025-05-07T19:52:13.5880569Z 2025-05-07T19:52:13.5880660Z HIP_ROOT_DIR : 2025-05-07T19:52:13.5880922Z HIPCC_VERBOSE : 2025-05-07T19:52:13.5881167Z AMDGPU_TARGETS : 2025-05-07T19:52:13.5881431Z PYTORCH_ROCM_ARCH : 2025-05-07T19:52:13.5881717Z ================================================================================ 2025-05-07T19:52:13.5881941Z 2025-05-07T19:52:13.7421471Z -- The CXX compiler identification is Clang 16.0.6 2025-05-07T19:52:13.8154099Z -- The C compiler identification is Clang 16.0.6 2025-05-07T19:52:14.7893929Z -- The CUDA compiler identification is NVIDIA 11.8.89 with host compiler Clang 16.0.6 
2025-05-07T19:52:14.8002698Z -- Detecting CXX compiler ABI info 2025-05-07T19:52:14.9355172Z -- Detecting CXX compiler ABI info - done 2025-05-07T19:52:14.9486864Z -- Check for working CXX compiler: /github/home/miniconda/envs/build_binary/bin/c++ - skipped 2025-05-07T19:52:14.9488527Z -- Detecting CXX compile features 2025-05-07T19:52:14.9496382Z -- Detecting CXX compile features - done 2025-05-07T19:52:14.9574084Z -- Detecting C compiler ABI info 2025-05-07T19:52:15.0842204Z -- Detecting C compiler ABI info - done 2025-05-07T19:52:15.0974237Z -- Check for working C compiler: /github/home/miniconda/envs/build_binary/bin/cc - skipped 2025-05-07T19:52:15.0975858Z -- Detecting C compile features 2025-05-07T19:52:15.0977838Z -- Detecting C compile features - done 2025-05-07T19:52:15.1029135Z -- Detecting CUDA compiler ABI info 2025-05-07T19:52:16.0245578Z -- Detecting CUDA compiler ABI info - done 2025-05-07T19:52:16.0717652Z -- Check for working CUDA compiler: /github/home/miniconda/envs/build_binary/bin/nvcc - skipped 2025-05-07T19:52:16.0742811Z -- Detecting CUDA compile features 2025-05-07T19:52:16.0743785Z -- Detecting CUDA compile features - done 2025-05-07T19:52:16.0764792Z -- Performing Test C_HAS_AVX_1 2025-05-07T19:52:16.3717062Z -- Performing Test C_HAS_AVX_1 - Failed 2025-05-07T19:52:16.3717441Z -- Performing Test C_HAS_AVX_2 2025-05-07T19:52:16.7122616Z -- Performing Test C_HAS_AVX_2 - Success 2025-05-07T19:52:16.7123649Z -- Performing Test C_HAS_AVX2_1 2025-05-07T19:52:17.0053100Z -- Performing Test C_HAS_AVX2_1 - Failed 2025-05-07T19:52:17.0053527Z -- Performing Test C_HAS_AVX2_2 2025-05-07T19:52:17.3439443Z -- Performing Test C_HAS_AVX2_2 - Success 2025-05-07T19:52:17.3440608Z -- Performing Test C_HAS_AVX512_1 2025-05-07T19:52:17.6386949Z -- Performing Test C_HAS_AVX512_1 - Failed 2025-05-07T19:52:17.6388017Z -- Performing Test C_HAS_AVX512_2 2025-05-07T19:52:17.9790287Z -- Performing Test C_HAS_AVX512_2 - Success 2025-05-07T19:52:17.9791001Z -- 
Performing Test CXX_HAS_AVX_1 2025-05-07T19:52:18.2719203Z -- Performing Test CXX_HAS_AVX_1 - Failed 2025-05-07T19:52:18.2720250Z -- Performing Test CXX_HAS_AVX_2 2025-05-07T19:52:18.6116467Z -- Performing Test CXX_HAS_AVX_2 - Success 2025-05-07T19:52:18.6116872Z -- Performing Test CXX_HAS_AVX2_1 2025-05-07T19:52:18.9082125Z -- Performing Test CXX_HAS_AVX2_1 - Failed 2025-05-07T19:52:18.9083199Z -- Performing Test CXX_HAS_AVX2_2 2025-05-07T19:52:19.2466150Z -- Performing Test CXX_HAS_AVX2_2 - Success 2025-05-07T19:52:19.2467191Z -- Performing Test CXX_HAS_AVX512_1 2025-05-07T19:52:19.5391481Z -- Performing Test CXX_HAS_AVX512_1 - Failed 2025-05-07T19:52:19.5392510Z -- Performing Test CXX_HAS_AVX512_2 2025-05-07T19:52:19.8780421Z -- Performing Test CXX_HAS_AVX512_2 - Success 2025-05-07T19:52:19.8966953Z -- Found CUDA: /github/home/miniconda/envs/build_binary (found version "11.8") 2025-05-07T19:52:19.9003050Z -- Found CUDAToolkit: /github/home/miniconda/envs/build_binary/include (found version "11.8.89") 2025-05-07T19:52:19.9079893Z -- Performing Test CMAKE_HAVE_LIBC_PTHREAD 2025-05-07T19:52:20.0372570Z -- Performing Test CMAKE_HAVE_LIBC_PTHREAD - Success 2025-05-07T19:52:20.0382123Z -- Found Threads: TRUE 2025-05-07T19:52:20.1160854Z -- PyTorch: CUDA detected: 11.8 2025-05-07T19:52:20.1162119Z -- PyTorch: CUDA nvcc is: /github/home/miniconda/envs/build_binary/bin/nvcc 2025-05-07T19:52:20.2712167Z -- PyTorch: CUDA toolkit directory: /github/home/miniconda/envs/build_binary 2025-05-07T19:52:20.2713806Z -- PyTorch: Header version is: 11.8 2025-05-07T19:52:20.3713622Z -- Found Python: /github/home/miniconda/envs/build_binary/bin/python (found version "3.13.2") found components: Interpreter 2025-05-07T19:52:20.3724361Z CMake Warning at /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/share/cmake/Caffe2/public/cuda.cmake:140 (message): 2025-05-07T19:52:20.3726808Z Failed to compute shorthash for libnvrtc.so 2025-05-07T19:52:20.3727393Z Call 
Stack (most recent call first): 2025-05-07T19:52:20.3728133Z /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/share/cmake/Caffe2/Caffe2Config.cmake:86 (include) 2025-05-07T19:52:20.3729268Z /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/share/cmake/Torch/TorchConfig.cmake:68 (find_package) 2025-05-07T19:52:20.3730156Z /__w/FBGEMM/FBGEMM/cmake/modules/PyTorchSetup.cmake:14 (find_package) 2025-05-07T19:52:20.3730617Z CMakeLists.txt:112 (include) 2025-05-07T19:52:20.3730821Z 2025-05-07T19:52:20.3730825Z 2025-05-07T19:52:20.3730989Z -- USE_CUDNN is set to 0. Compiling without cuDNN support 2025-05-07T19:52:20.3731587Z -- USE_CUSPARSELT is set to 0. Compiling without cuSPARSELt support 2025-05-07T19:52:20.3732151Z -- USE_CUDSS is set to 0. Compiling without cuDSS support 2025-05-07T19:52:20.3732555Z -- USE_CUFILE is set to 0. Compiling without cuFile support 2025-05-07T19:52:20.3733103Z -- Added CUDA NVCC flags for: -gencode;arch=compute_70,code=sm_70;-gencode;arch=compute_80,code=sm_80 2025-05-07T19:52:20.4060362Z CMake Warning at /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/share/cmake/Torch/TorchConfig.cmake:22 (message): 2025-05-07T19:52:20.4061326Z static library kineto_LIBRARY-NOTFOUND not found. 
2025-05-07T19:52:20.4061676Z Call Stack (most recent call first): 2025-05-07T19:52:20.4062439Z /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/share/cmake/Torch/TorchConfig.cmake:125 (append_torchlib_if_found) 2025-05-07T19:52:20.4063489Z /__w/FBGEMM/FBGEMM/cmake/modules/PyTorchSetup.cmake:14 (find_package) 2025-05-07T19:52:20.4063947Z CMakeLists.txt:112 (include) 2025-05-07T19:52:20.4064247Z 2025-05-07T19:52:20.4064251Z 2025-05-07T19:52:20.4064807Z 2025-05-07T19:52:20.4064811Z 2025-05-07T19:52:20.4065081Z ================================================================================ 2025-05-07T19:52:20.4065431Z PyTorch Flags: 2025-05-07T19:52:20.4065852Z 2025-05-07T19:52:20.4066053Z TORCH_INCLUDE_DIRS: 2025-05-07T19:52:20.4066505Z /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include 2025-05-07T19:52:20.4067328Z /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include 2025-05-07T19:52:20.4067917Z 2025-05-07T19:52:20.4068138Z TORCH_LIBRARIES: 2025-05-07T19:52:20.4068360Z torch 2025-05-07T19:52:20.4068571Z torch_library 2025-05-07T19:52:20.4069021Z /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/lib/libc10.so 2025-05-07T19:52:20.4069857Z -- Found Torch: /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/lib/libtorch.so 2025-05-07T19:52:20.4070559Z /github/home/miniconda/envs/build_binary/lib/libnvrtc.so 2025-05-07T19:52:20.4071175Z /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/lib/libc10_cuda.so 2025-05-07T19:52:20.4071745Z 2025-05-07T19:52:20.4071947Z TORCH_CUDA_OPTIONS: 2025-05-07T19:52:20.4072234Z --expt-relaxed-constexpr 2025-05-07T19:52:20.4072529Z -D__CUDA_NO_HALF_OPERATORS__ 2025-05-07T19:52:20.4072979Z -D__CUDA_NO_BFLOAT16_CONVERSIONS__ 2025-05-07T19:52:20.4073306Z -D__CUDA_NO_HALF2_OPERATORS__ 2025-05-07T19:52:20.4073622Z 
================================================================================ 2025-05-07T19:52:20.4073858Z 2025-05-07T19:52:20.4073863Z 2025-05-07T19:52:20.4073867Z 2025-05-07T19:52:20.4074003Z ================================================================================ 2025-05-07T19:52:20.4074543Z NCCL Flags 2025-05-07T19:52:20.4074691Z 2025-05-07T19:52:20.4075080Z NCCL_INCLUDE_DIRS=/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include 2025-05-07T19:52:20.4075993Z NCCL_LIBRARIES=/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/lib/libnccl.so.2 2025-05-07T19:52:20.4076760Z ================================================================================ 2025-05-07T19:52:20.4076994Z 2025-05-07T19:52:20.4076998Z 2025-05-07T19:52:20.4077002Z 2025-05-07T19:52:20.4077132Z ================================================================================ 2025-05-07T19:52:20.4077446Z CUDA Driver Path 2025-05-07T19:52:20.4077602Z 2025-05-07T19:52:20.4077883Z CUDA_DRIVER_LIBRARIES=/github/home/miniconda/envs/build_binary/lib/stubs/libcuda.so 2025-05-07T19:52:20.4078394Z ================================================================================ 2025-05-07T19:52:20.4078634Z 2025-05-07T19:52:20.4078938Z -- Found NVML_LIB_PATH: /github/home/miniconda/envs/build_binary/lib/stubs/libnvidia-ml.so 2025-05-07T19:52:20.4097369Z 2025-05-07T19:52:20.4097389Z 2025-05-07T19:52:20.4097876Z ================================================================================ 2025-05-07T19:52:20.4098987Z GPU CPP Library Target: asmjit (SHARED) 2025-05-07T19:52:20.4099916Z 2025-05-07T19:52:20.4100157Z CPU_SRCS: 2025-05-07T19:52:20.4100284Z 2025-05-07T19:52:20.4100363Z 2025-05-07T19:52:20.4100577Z GPU_SRCS: 2025-05-07T19:52:20.4100693Z 2025-05-07T19:52:20.4100774Z 2025-05-07T19:52:20.4100993Z CUDA_SPECIFIC_SRCS: 2025-05-07T19:52:20.4101138Z 2025-05-07T19:52:20.4101232Z 2025-05-07T19:52:20.4101425Z HIP_SPECIFIC_SRCS: 
2025-05-07T19:52:20.4101566Z 2025-05-07T19:52:20.4101660Z 2025-05-07T19:52:20.4101849Z OTHER_SRCS: 2025-05-07T19:52:20.4102587Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src/asmjit/arm/a64assembler.cpp 2025-05-07T19:52:20.4103229Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src/asmjit/arm/a64builder.cpp 2025-05-07T19:52:20.4103892Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src/asmjit/arm/a64compiler.cpp 2025-05-07T19:52:20.4104517Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src/asmjit/arm/a64emithelper.cpp 2025-05-07T19:52:20.4105162Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src/asmjit/arm/a64formatter.cpp 2025-05-07T19:52:20.4105778Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src/asmjit/arm/a64func.cpp 2025-05-07T19:52:20.4106472Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src/asmjit/arm/a64instapi.cpp 2025-05-07T19:52:20.4107079Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src/asmjit/arm/a64instdb.cpp 2025-05-07T19:52:20.4107653Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src/asmjit/arm/a64operand.cpp 2025-05-07T19:52:20.4108249Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src/asmjit/arm/a64rapass.cpp 2025-05-07T19:52:20.4108873Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src/asmjit/arm/armformatter.cpp 2025-05-07T19:52:20.4109490Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src/asmjit/core/archtraits.cpp 2025-05-07T19:52:20.4110106Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src/asmjit/core/assembler.cpp 2025-05-07T19:52:20.4110690Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src/asmjit/core/builder.cpp 2025-05-07T19:52:20.4111480Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src/asmjit/core/codeholder.cpp 2025-05-07T19:52:20.4112089Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src/asmjit/core/codewriter.cpp 2025-05-07T19:52:20.4112718Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src/asmjit/core/compiler.cpp 2025-05-07T19:52:20.4113436Z 
/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src/asmjit/core/constpool.cpp 2025-05-07T19:52:20.4114119Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src/asmjit/core/cpuinfo.cpp 2025-05-07T19:52:20.4114737Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src/asmjit/core/emithelper.cpp 2025-05-07T19:52:20.4115337Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src/asmjit/core/emitter.cpp 2025-05-07T19:52:20.4116185Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src/asmjit/core/emitterutils.cpp 2025-05-07T19:52:20.4116805Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src/asmjit/core/environment.cpp 2025-05-07T19:52:20.4117443Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src/asmjit/core/errorhandler.cpp 2025-05-07T19:52:20.4118213Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src/asmjit/core/formatter.cpp 2025-05-07T19:52:20.4118815Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src/asmjit/core/func.cpp 2025-05-07T19:52:20.4119561Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src/asmjit/core/funcargscontext.cpp 2025-05-07T19:52:20.4120162Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src/asmjit/core/globals.cpp 2025-05-07T19:52:20.4120760Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src/asmjit/core/inst.cpp 2025-05-07T19:52:20.4121327Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src/asmjit/core/instdb.cpp 2025-05-07T19:52:20.4121917Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src/asmjit/core/jitallocator.cpp 2025-05-07T19:52:20.4122559Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src/asmjit/core/jitruntime.cpp 2025-05-07T19:52:20.4123144Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src/asmjit/core/logger.cpp 2025-05-07T19:52:20.4123750Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src/asmjit/core/operand.cpp 2025-05-07T19:52:20.4124327Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src/asmjit/core/osutils.cpp 2025-05-07T19:52:20.4124925Z 
/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src/asmjit/core/ralocal.cpp 2025-05-07T19:52:20.4125519Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src/asmjit/core/rapass.cpp 2025-05-07T19:52:20.4126077Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src/asmjit/core/rastack.cpp 2025-05-07T19:52:20.4126656Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src/asmjit/core/string.cpp 2025-05-07T19:52:20.4127228Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src/asmjit/core/support.cpp 2025-05-07T19:52:20.4127830Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src/asmjit/core/target.cpp 2025-05-07T19:52:20.4128388Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src/asmjit/core/type.cpp 2025-05-07T19:52:20.4128978Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src/asmjit/core/virtmem.cpp 2025-05-07T19:52:20.4129581Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src/asmjit/core/zone.cpp 2025-05-07T19:52:20.4130151Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src/asmjit/core/zonehash.cpp 2025-05-07T19:52:20.4130750Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src/asmjit/core/zonelist.cpp 2025-05-07T19:52:20.4131333Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src/asmjit/core/zonestack.cpp 2025-05-07T19:52:20.4131931Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src/asmjit/core/zonetree.cpp 2025-05-07T19:52:20.4132546Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src/asmjit/core/zonevector.cpp 2025-05-07T19:52:20.4133151Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src/asmjit/x86/x86assembler.cpp 2025-05-07T19:52:20.4133764Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src/asmjit/x86/x86builder.cpp 2025-05-07T19:52:20.4134354Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src/asmjit/x86/x86compiler.cpp 2025-05-07T19:52:20.4134976Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src/asmjit/x86/x86emithelper.cpp 2025-05-07T19:52:20.4135577Z 
/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src/asmjit/x86/x86formatter.cpp 2025-05-07T19:52:20.4136191Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src/asmjit/x86/x86func.cpp 2025-05-07T19:52:20.4136781Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src/asmjit/x86/x86instapi.cpp 2025-05-07T19:52:20.4137370Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src/asmjit/x86/x86instdb.cpp 2025-05-07T19:52:20.4137955Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src/asmjit/x86/x86operand.cpp 2025-05-07T19:52:20.4138531Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src/asmjit/x86/x86rapass.cpp 2025-05-07T19:52:20.4139059Z 2025-05-07T19:52:20.4139268Z CC_FLAGS: 2025-05-07T19:52:20.4139386Z 2025-05-07T19:52:20.4139500Z 2025-05-07T19:52:20.4139703Z NVCC_FLAGS: 2025-05-07T19:52:20.4139835Z 2025-05-07T19:52:20.4139953Z 2025-05-07T19:52:20.4140156Z HIPCC_FLAGS: 2025-05-07T19:52:20.4140322Z 2025-05-07T19:52:20.4140414Z 2025-05-07T19:52:20.4140675Z INCLUDE_DIRS: 2025-05-07T19:52:20.4141025Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/../include 2025-05-07T19:52:20.4141390Z /__w/FBGEMM/FBGEMM/fbgemm_gpu 2025-05-07T19:52:20.4141700Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include 2025-05-07T19:52:20.4142056Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/../include 2025-05-07T19:52:20.4142564Z /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include 2025-05-07T19:52:20.4143397Z /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include 2025-05-07T19:52:20.4144089Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src 2025-05-07T19:52:20.4144521Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include 2025-05-07T19:52:20.4144992Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include 2025-05-07T19:52:20.4145483Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include 2025-05-07T19:52:20.4146042Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include 2025-05-07T19:52:20.4146508Z 
/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include 2025-05-07T19:52:20.4147106Z /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include 2025-05-07T19:52:20.4147614Z 2025-05-07T19:52:20.4147803Z Selected Source Files: 2025-05-07T19:52:20.4147959Z 2025-05-07T19:52:20.4148059Z 2025-05-07T19:52:20.4148263Z HIPified Source Files: 2025-05-07T19:52:20.4148416Z 2025-05-07T19:52:20.4148522Z 2025-05-07T19:52:20.4148729Z Library Dependencies: 2025-05-07T19:52:20.4148999Z torch 2025-05-07T19:52:20.4149202Z torch_library 2025-05-07T19:52:20.4149648Z /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/lib/libc10.so 2025-05-07T19:52:20.4150258Z /github/home/miniconda/envs/build_binary/lib/libnvrtc.so 2025-05-07T19:52:20.4150903Z /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/lib/libc10_cuda.so 2025-05-07T19:52:20.4151733Z /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/lib/libnccl.so.2 2025-05-07T19:52:20.4152400Z /github/home/miniconda/envs/build_binary/lib/stubs/libcuda.so 2025-05-07T19:52:20.4153044Z /github/home/miniconda/envs/build_binary/lib/stubs/libnvidia-ml.so 2025-05-07T19:52:20.4153645Z 2025-05-07T19:52:20.4153875Z Output Library: 2025-05-07T19:52:20.4154121Z asmjit 2025-05-07T19:52:20.4154352Z 2025-05-07T19:52:20.4154572Z Destination Directory: 2025-05-07T19:52:20.4154856Z fbgemm_gpu 2025-05-07T19:52:20.4155129Z ================================================================================ 2025-05-07T19:52:20.4155376Z 2025-05-07T19:52:20.4155380Z 2025-05-07T19:52:20.4155384Z 2025-05-07T19:52:20.4155514Z ================================================================================ 2025-05-07T19:52:20.4155897Z GPU CPP Library Target: fbgemm (SHARED) 2025-05-07T19:52:20.4156208Z 2025-05-07T19:52:20.4156434Z CPU_SRCS: 2025-05-07T19:52:20.4156558Z 2025-05-07T19:52:20.4156645Z 2025-05-07T19:52:20.4156872Z GPU_SRCS: 2025-05-07T19:52:20.4156995Z 
2025-05-07T19:52:20.4157108Z 2025-05-07T19:52:20.4157314Z CUDA_SPECIFIC_SRCS: 2025-05-07T19:52:20.4157466Z 2025-05-07T19:52:20.4157577Z 2025-05-07T19:52:20.4157779Z HIP_SPECIFIC_SRCS: 2025-05-07T19:52:20.4157929Z 2025-05-07T19:52:20.4158043Z 2025-05-07T19:52:20.4158244Z OTHER_SRCS: 2025-05-07T19:52:20.4158559Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/../src/EmbeddingSpMDM.cc 2025-05-07T19:52:20.4159021Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/../src/EmbeddingSpMDMAutovec.cc 2025-05-07T19:52:20.4159528Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/../src/EmbeddingSpMDMNBit.cc 2025-05-07T19:52:20.4159989Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/../src/QuantUtils.cc 2025-05-07T19:52:20.4160520Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/../src/RefImplementations.cc 2025-05-07T19:52:20.4161052Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/../src/RowWiseSparseAdagradFused.cc 2025-05-07T19:52:20.4161531Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/../src/SparseAdagrad.cc 2025-05-07T19:52:20.4161950Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/../src/Utils.cc 2025-05-07T19:52:20.4162366Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/../src/EmbeddingSpMDMAvx2.cc 2025-05-07T19:52:20.4162913Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/../src/QuantUtilsAvx2.cc 2025-05-07T19:52:20.4163350Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/../src/EmbeddingSpMDMAvx2.cc 2025-05-07T19:52:20.4163826Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/../src/QuantUtilsAvx2.cc 2025-05-07T19:52:20.4164301Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/../src/EmbeddingSpMDMAvx512.cc 2025-05-07T19:52:20.4164682Z 2025-05-07T19:52:20.4164912Z CC_FLAGS: 2025-05-07T19:52:20.4165035Z 2025-05-07T19:52:20.4165119Z 2025-05-07T19:52:20.4165339Z NVCC_FLAGS: 2025-05-07T19:52:20.4165467Z 2025-05-07T19:52:20.4165668Z 2025-05-07T19:52:20.4165885Z HIPCC_FLAGS: 2025-05-07T19:52:20.4166020Z 2025-05-07T19:52:20.4166105Z 2025-05-07T19:52:20.4166325Z INCLUDE_DIRS: 2025-05-07T19:52:20.4166573Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/../include 2025-05-07T19:52:20.4166918Z /__w/FBGEMM/FBGEMM/fbgemm_gpu 2025-05-07T19:52:20.4167240Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include 2025-05-07T19:52:20.4167558Z 
/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include 2025-05-07T19:52:20.4168093Z /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include 2025-05-07T19:52:20.4168885Z /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include 2025-05-07T19:52:20.4169570Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src 2025-05-07T19:52:20.4170014Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include 2025-05-07T19:52:20.4170455Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include 2025-05-07T19:52:20.4170964Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include 2025-05-07T19:52:20.4171522Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include 2025-05-07T19:52:20.4171993Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include 2025-05-07T19:52:20.4172587Z /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include 2025-05-07T19:52:20.4173111Z 2025-05-07T19:52:20.4173355Z Selected Source Files: 2025-05-07T19:52:20.4173521Z 2025-05-07T19:52:20.4173612Z 2025-05-07T19:52:20.4173861Z HIPified Source Files: 2025-05-07T19:52:20.4174023Z 2025-05-07T19:52:20.4174112Z 2025-05-07T19:52:20.4174364Z Library Dependencies: 2025-05-07T19:52:20.4174648Z torch 2025-05-07T19:52:20.4174866Z torch_library 2025-05-07T19:52:20.4175348Z /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/lib/libc10.so 2025-05-07T19:52:20.4175955Z /github/home/miniconda/envs/build_binary/lib/libnvrtc.so 2025-05-07T19:52:20.4176595Z /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/lib/libc10_cuda.so 2025-05-07T19:52:20.4177406Z /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/lib/libnccl.so.2 2025-05-07T19:52:20.4178121Z /github/home/miniconda/envs/build_binary/lib/stubs/libcuda.so 2025-05-07T19:52:20.4178516Z asmjit 2025-05-07T19:52:20.4178884Z 
/github/home/miniconda/envs/build_binary/lib/stubs/libnvidia-ml.so 2025-05-07T19:52:20.4179339Z 2025-05-07T19:52:20.4179554Z Output Library: 2025-05-07T19:52:20.4179813Z fbgemm 2025-05-07T19:52:20.4180009Z 2025-05-07T19:52:20.4180248Z Destination Directory: 2025-05-07T19:52:20.4180491Z fbgemm_gpu 2025-05-07T19:52:20.4180764Z ================================================================================ 2025-05-07T19:52:20.4180999Z 2025-05-07T19:52:20.4181003Z 2025-05-07T19:52:20.4181007Z 2025-05-07T19:52:20.4181126Z ================================================================================ 2025-05-07T19:52:20.4181491Z Running code generation script ... 2025-05-07T19:52:20.4182284Z /github/home/miniconda/envs/build_binary/bin/python /__w/FBGEMM/FBGEMM/fbgemm_gpu/codegen/genscript/generate_backward_split.py --opensource 2025-05-07T19:52:20.4183143Z ================================================================================ 2025-05-07T19:52:20.4183400Z 2025-05-07T19:52:20.9508134Z [ARGS PARSE] Parsed arguments: Namespace(install_dir='.', is_fbcode=False, is_rocm=False) 2025-05-07T19:52:20.9509410Z [GENERAATE BACKWARD SPLIT]: ['/__w/FBGEMM/FBGEMM/fbgemm_gpu/codegen/genscript/generate_backward_split.py', '--opensource'] 2025-05-07T19:52:20.9510223Z Written: gen_embedding_backward_dense_split_weighted_vbe_cuda.cu 2025-05-07T19:52:20.9510692Z Written: gen_embedding_backward_dense_split_weighted_cuda.cu 2025-05-07T19:52:20.9511195Z Written: gen_embedding_backward_dense_split_unweighted_nobag_cuda.cu 2025-05-07T19:52:20.9511703Z Written: gen_embedding_backward_dense_split_unweighted_vbe_cuda.cu 2025-05-07T19:52:20.9512211Z Written: gen_embedding_backward_dense_split_unweighted_cuda.cu 2025-05-07T19:52:20.9512691Z Written: gen_embedding_backward_dense_split_weighted_vbe_meta.cpp 2025-05-07T19:52:20.9513518Z Written: gen_embedding_backward_dense_split_weighted_meta.cpp 2025-05-07T19:52:20.9514077Z Written: 
gen_embedding_backward_dense_split_unweighted_nobag_meta.cpp 2025-05-07T19:52:20.9514631Z Written: gen_embedding_backward_dense_split_unweighted_vbe_meta.cpp 2025-05-07T19:52:20.9515180Z Written: gen_embedding_backward_dense_split_unweighted_meta.cpp 2025-05-07T19:52:20.9515719Z Written: gen_embedding_backward_dense_split_weighted_vbe_kernel_cta.cu 2025-05-07T19:52:20.9516291Z Written: gen_embedding_backward_dense_split_weighted_kernel_cta.cu 2025-05-07T19:52:20.9516846Z Written: gen_embedding_backward_dense_split_unweighted_nobag_kernel_cta.cu 2025-05-07T19:52:20.9517472Z Written: gen_embedding_backward_dense_split_unweighted_vbe_kernel_cta.cu 2025-05-07T19:52:20.9518060Z Written: gen_embedding_backward_dense_split_unweighted_kernel_cta.cu 2025-05-07T19:52:20.9518627Z Written: gen_embedding_backward_dense_split_weighted_vbe_kernel_warp.cu 2025-05-07T19:52:20.9519205Z Written: gen_embedding_backward_dense_split_weighted_kernel_warp.cu 2025-05-07T19:52:20.9519879Z Written: gen_embedding_backward_dense_split_unweighted_nobag_kernel_warp.cu 2025-05-07T19:52:20.9520461Z Written: gen_embedding_backward_dense_split_unweighted_vbe_kernel_warp.cu 2025-05-07T19:52:20.9520999Z Written: gen_embedding_backward_dense_split_unweighted_kernel_warp.cu 2025-05-07T19:52:20.9521525Z Written: gen_embedding_optimizer_dense_split_device_kernel.cuh 2025-05-07T19:52:20.9521974Z Written: gen_embedding_backward_split_dense.cpp 2025-05-07T19:52:20.9522354Z Written: gen_embedding_backward_dense_split_cpu.cpp 2025-05-07T19:52:20.9522799Z Written: gen_embedding_backward_adagrad_split_weighted_cuda.cu 2025-05-07T19:52:20.9523290Z Written: gen_embedding_backward_adagrad_split_unweighted_nobag_cuda.cu 2025-05-07T19:52:20.9523817Z Written: gen_embedding_backward_adagrad_split_unweighted_cuda.cu 2025-05-07T19:52:20.9524296Z Written: gen_embedding_backward_adagrad_split_weighted_meta.cpp 2025-05-07T19:52:20.9524820Z Written: gen_embedding_backward_adagrad_split_unweighted_nobag_meta.cpp 
2025-05-07T19:52:20.9525360Z Written: gen_embedding_backward_adagrad_split_unweighted_meta.cpp 2025-05-07T19:52:20.9525855Z Written: gen_embedding_backward_adagrad_split_weighted_kernel_cta.cu 2025-05-07T19:52:20.9526419Z Written: gen_embedding_backward_adagrad_split_unweighted_nobag_kernel_cta.cu 2025-05-07T19:52:20.9526972Z Written: gen_embedding_backward_adagrad_split_unweighted_kernel_cta.cu 2025-05-07T19:52:20.9527517Z Written: gen_embedding_backward_adagrad_split_weighted_kernel_warp.cu 2025-05-07T19:52:20.9528060Z Written: gen_embedding_backward_adagrad_split_unweighted_nobag_kernel_warp.cu 2025-05-07T19:52:20.9528641Z Written: gen_embedding_backward_adagrad_split_unweighted_kernel_warp.cu 2025-05-07T19:52:20.9529174Z Written: gen_embedding_optimizer_adagrad_split_device_kernel.cuh 2025-05-07T19:52:20.9529605Z Written: gen_embedding_backward_split_adagrad.cpp 2025-05-07T19:52:20.9530023Z Written: gen_embedding_split_adagrad_pt2_autograd.cpp 2025-05-07T19:52:20.9530632Z Written: gen_embedding_backward_split_adagrad_pt2_cuda_wrapper.cpp 2025-05-07T19:52:20.9531066Z Written: lookup_adagrad.py 2025-05-07T19:52:20.9531386Z Written: gen_embedding_backward_adagrad_split_cpu.cpp 2025-05-07T19:52:20.9531813Z Written: gen_embedding_backward_split_adagrad_cpu.cpp 2025-05-07T19:52:20.9532281Z Written: gen_embedding_backward_split_adagrad_pt2_cpu_wrapper.cpp 2025-05-07T19:52:20.9532868Z Written: gen_embedding_backward_adam_split_weighted_vbe_cuda.cu 2025-05-07T19:52:20.9533355Z Written: gen_embedding_backward_adam_split_weighted_cuda.cu 2025-05-07T19:52:20.9533820Z Written: gen_embedding_backward_adam_split_unweighted_nobag_cuda.cu 2025-05-07T19:52:20.9534337Z Written: gen_embedding_backward_adam_split_unweighted_vbe_cuda.cu 2025-05-07T19:52:20.9534802Z Written: gen_embedding_backward_adam_split_unweighted_cuda.cu 2025-05-07T19:52:20.9535291Z Written: gen_embedding_backward_adam_split_weighted_vbe_meta.cpp 2025-05-07T19:52:20.9535781Z Written: 
gen_embedding_backward_adam_split_weighted_meta.cpp 2025-05-07T19:52:20.9536264Z Written: gen_embedding_backward_adam_split_unweighted_nobag_meta.cpp 2025-05-07T19:52:20.9536790Z Written: gen_embedding_backward_adam_split_unweighted_vbe_meta.cpp 2025-05-07T19:52:20.9537264Z Written: gen_embedding_backward_adam_split_unweighted_meta.cpp 2025-05-07T19:52:20.9537782Z Written: gen_embedding_backward_adam_split_weighted_vbe_kernel_cta.cu 2025-05-07T19:52:20.9538285Z Written: gen_embedding_backward_adam_split_weighted_kernel_cta.cu 2025-05-07T19:52:20.9538827Z Written: gen_embedding_backward_adam_split_unweighted_nobag_kernel_cta.cu 2025-05-07T19:52:20.9539401Z Written: gen_embedding_backward_adam_split_unweighted_vbe_kernel_cta.cu 2025-05-07T19:52:20.9539914Z Written: gen_embedding_backward_adam_split_unweighted_kernel_cta.cu 2025-05-07T19:52:20.9540424Z Written: gen_embedding_backward_adam_split_weighted_vbe_kernel_warp.cu 2025-05-07T19:52:20.9540923Z Written: gen_embedding_backward_adam_split_weighted_kernel_warp.cu 2025-05-07T19:52:20.9541466Z Written: gen_embedding_backward_adam_split_unweighted_nobag_kernel_warp.cu 2025-05-07T19:52:20.9542010Z Written: gen_embedding_backward_adam_split_unweighted_vbe_kernel_warp.cu 2025-05-07T19:52:20.9542560Z Written: gen_embedding_backward_adam_split_unweighted_kernel_warp.cu 2025-05-07T19:52:20.9543073Z Written: gen_embedding_optimizer_adam_split_device_kernel.cuh 2025-05-07T19:52:20.9543494Z Written: gen_embedding_backward_split_adam.cpp 2025-05-07T19:52:20.9543900Z Written: gen_embedding_split_adam_pt2_autograd.cpp 2025-05-07T19:52:20.9544325Z Written: gen_embedding_backward_split_adam_pt2_cuda_wrapper.cpp 2025-05-07T19:52:20.9544739Z Written: lookup_adam.py 2025-05-07T19:52:20.9545040Z Written: gen_embedding_backward_split_adam_cpu.cpp 2025-05-07T19:52:20.9545497Z Written: gen_embedding_backward_split_adam_pt2_cpu_wrapper.cpp 2025-05-07T19:52:20.9545954Z Written: gen_embedding_backward_lamb_split_weighted_cuda.cu 
2025-05-07T19:52:20.9546457Z Written: gen_embedding_backward_lamb_split_unweighted_nobag_cuda.cu 2025-05-07T19:52:20.9546963Z Written: gen_embedding_backward_lamb_split_unweighted_cuda.cu 2025-05-07T19:52:20.9547417Z Written: gen_embedding_backward_lamb_split_weighted_meta.cpp 2025-05-07T19:52:20.9547919Z Written: gen_embedding_backward_lamb_split_unweighted_nobag_meta.cpp 2025-05-07T19:52:20.9548414Z Written: gen_embedding_backward_lamb_split_unweighted_meta.cpp 2025-05-07T19:52:20.9548914Z Written: gen_embedding_backward_lamb_split_weighted_kernel_cta.cu 2025-05-07T19:52:20.9549425Z Written: gen_embedding_backward_lamb_split_unweighted_nobag_kernel_cta.cu 2025-05-07T19:52:20.9549990Z Written: gen_embedding_backward_lamb_split_unweighted_kernel_cta.cu 2025-05-07T19:52:20.9550505Z Written: gen_embedding_backward_lamb_split_weighted_kernel_warp.cu 2025-05-07T19:52:20.9551027Z Written: gen_embedding_backward_lamb_split_unweighted_nobag_kernel_warp.cu 2025-05-07T19:52:20.9551588Z Written: gen_embedding_backward_lamb_split_unweighted_kernel_warp.cu 2025-05-07T19:52:20.9552070Z Written: gen_embedding_optimizer_lamb_split_device_kernel.cuh 2025-05-07T19:52:20.9552586Z Written: gen_embedding_backward_split_lamb.cpp 2025-05-07T19:52:20.9553069Z Written: gen_embedding_split_lamb_pt2_autograd.cpp 2025-05-07T19:52:20.9553726Z Written: gen_embedding_backward_split_lamb_pt2_cuda_wrapper.cpp 2025-05-07T19:52:20.9554178Z Written: lookup_lamb.py 2025-05-07T19:52:20.9554590Z Written: gen_embedding_backward_split_lamb_cpu.cpp 2025-05-07T19:52:20.9555068Z Written: gen_embedding_backward_split_lamb_pt2_cpu_wrapper.cpp 2025-05-07T19:52:20.9555570Z Written: gen_embedding_backward_lars_sgd_split_weighted_cuda.cu 2025-05-07T19:52:20.9556134Z Written: gen_embedding_backward_lars_sgd_split_unweighted_nobag_cuda.cu 2025-05-07T19:52:20.9556682Z Written: gen_embedding_backward_lars_sgd_split_unweighted_cuda.cu 2025-05-07T19:52:20.9557228Z Written: 
gen_embedding_backward_lars_sgd_split_weighted_meta.cpp 2025-05-07T19:52:20.9557794Z Written: gen_embedding_backward_lars_sgd_split_unweighted_nobag_meta.cpp 2025-05-07T19:52:20.9558352Z Written: gen_embedding_backward_lars_sgd_split_unweighted_meta.cpp 2025-05-07T19:52:20.9558912Z Written: gen_embedding_backward_lars_sgd_split_weighted_kernel_cta.cu 2025-05-07T19:52:20.9559492Z Written: gen_embedding_backward_lars_sgd_split_unweighted_nobag_kernel_cta.cu 2025-05-07T19:52:20.9560196Z Written: gen_embedding_backward_lars_sgd_split_unweighted_kernel_cta.cu 2025-05-07T19:52:20.9560721Z Written: gen_embedding_backward_lars_sgd_split_weighted_kernel_warp.cu 2025-05-07T19:52:20.9561294Z Written: gen_embedding_backward_lars_sgd_split_unweighted_nobag_kernel_warp.cu 2025-05-07T19:52:20.9561886Z Written: gen_embedding_backward_lars_sgd_split_unweighted_kernel_warp.cu 2025-05-07T19:52:20.9562396Z Written: gen_embedding_optimizer_lars_sgd_split_device_kernel.cuh 2025-05-07T19:52:20.9562851Z Written: gen_embedding_backward_split_lars_sgd.cpp 2025-05-07T19:52:20.9563246Z Written: gen_embedding_split_lars_sgd_pt2_autograd.cpp 2025-05-07T19:52:20.9563725Z Written: gen_embedding_backward_split_lars_sgd_pt2_cuda_wrapper.cpp 2025-05-07T19:52:20.9564138Z Written: lookup_lars_sgd.py 2025-05-07T19:52:20.9564495Z Written: gen_embedding_backward_split_lars_sgd_cpu.cpp 2025-05-07T19:52:20.9564980Z Written: gen_embedding_backward_split_lars_sgd_pt2_cpu_wrapper.cpp 2025-05-07T19:52:20.9565509Z Written: gen_embedding_backward_partial_rowwise_adam_split_weighted_cuda.cu 2025-05-07T19:52:20.9566126Z Written: gen_embedding_backward_partial_rowwise_adam_split_unweighted_nobag_cuda.cu 2025-05-07T19:52:20.9566722Z Written: gen_embedding_backward_partial_rowwise_adam_split_unweighted_cuda.cu 2025-05-07T19:52:20.9567320Z Written: gen_embedding_backward_partial_rowwise_adam_split_weighted_meta.cpp 2025-05-07T19:52:20.9567908Z Written: 
gen_embedding_backward_partial_rowwise_adam_split_unweighted_nobag_meta.cpp 2025-05-07T19:52:20.9568543Z Written: gen_embedding_backward_partial_rowwise_adam_split_unweighted_meta.cpp 2025-05-07T19:52:20.9569158Z Written: gen_embedding_backward_partial_rowwise_adam_split_weighted_kernel_cta.cu 2025-05-07T19:52:20.9569792Z Written: gen_embedding_backward_partial_rowwise_adam_split_unweighted_nobag_kernel_cta.cu 2025-05-07T19:52:20.9570456Z Written: gen_embedding_backward_partial_rowwise_adam_split_unweighted_kernel_cta.cu 2025-05-07T19:52:20.9571078Z Written: gen_embedding_backward_partial_rowwise_adam_split_weighted_kernel_warp.cu 2025-05-07T19:52:20.9571748Z Written: gen_embedding_backward_partial_rowwise_adam_split_unweighted_nobag_kernel_warp.cu 2025-05-07T19:52:20.9572399Z Written: gen_embedding_backward_partial_rowwise_adam_split_unweighted_kernel_warp.cu 2025-05-07T19:52:21.0374180Z Written: gen_embedding_optimizer_partial_rowwise_adam_split_device_kernel.cuh 2025-05-07T19:52:21.0375968Z Written: gen_embedding_backward_split_partial_rowwise_adam.cpp 2025-05-07T19:52:21.0377482Z Written: gen_embedding_split_partial_rowwise_adam_pt2_autograd.cpp 2025-05-07T19:52:21.0378808Z Written: gen_embedding_backward_split_partial_rowwise_adam_pt2_cuda_wrapper.cpp 2025-05-07T19:52:21.0379530Z Written: lookup_partial_rowwise_adam.py 2025-05-07T19:52:21.0379960Z Written: gen_embedding_backward_split_partial_rowwise_adam_cpu.cpp 2025-05-07T19:52:21.0380498Z Written: gen_embedding_backward_split_partial_rowwise_adam_pt2_cpu_wrapper.cpp 2025-05-07T19:52:21.0381091Z Written: gen_embedding_backward_partial_rowwise_lamb_split_weighted_cuda.cu 2025-05-07T19:52:21.0381787Z Written: gen_embedding_backward_partial_rowwise_lamb_split_unweighted_nobag_cuda.cu 2025-05-07T19:52:21.0382428Z Written: gen_embedding_backward_partial_rowwise_lamb_split_unweighted_cuda.cu 2025-05-07T19:52:21.0383028Z Written: gen_embedding_backward_partial_rowwise_lamb_split_weighted_meta.cpp 
2025-05-07T19:52:21.0383628Z Written: gen_embedding_backward_partial_rowwise_lamb_split_unweighted_nobag_meta.cpp 2025-05-07T19:52:21.0384274Z Written: gen_embedding_backward_partial_rowwise_lamb_split_unweighted_meta.cpp 2025-05-07T19:52:21.0384868Z Written: gen_embedding_backward_partial_rowwise_lamb_split_weighted_kernel_cta.cu 2025-05-07T19:52:21.0385536Z Written: gen_embedding_backward_partial_rowwise_lamb_split_unweighted_nobag_kernel_cta.cu 2025-05-07T19:52:21.0386210Z Written: gen_embedding_backward_partial_rowwise_lamb_split_unweighted_kernel_cta.cu 2025-05-07T19:52:21.0386826Z Written: gen_embedding_backward_partial_rowwise_lamb_split_weighted_kernel_warp.cu 2025-05-07T19:52:21.0387495Z Written: gen_embedding_backward_partial_rowwise_lamb_split_unweighted_nobag_kernel_warp.cu 2025-05-07T19:52:21.0388144Z Written: gen_embedding_backward_partial_rowwise_lamb_split_unweighted_kernel_warp.cu 2025-05-07T19:52:21.0388777Z Written: gen_embedding_optimizer_partial_rowwise_lamb_split_device_kernel.cuh 2025-05-07T19:52:21.0389306Z Written: gen_embedding_backward_split_partial_rowwise_lamb.cpp 2025-05-07T19:52:21.0389811Z Written: gen_embedding_split_partial_rowwise_lamb_pt2_autograd.cpp 2025-05-07T19:52:21.0390373Z Written: gen_embedding_backward_split_partial_rowwise_lamb_pt2_cuda_wrapper.cpp 2025-05-07T19:52:21.0390836Z Written: lookup_partial_rowwise_lamb.py 2025-05-07T19:52:21.0391266Z Written: gen_embedding_backward_split_partial_rowwise_lamb_cpu.cpp 2025-05-07T19:52:21.0391799Z Written: gen_embedding_backward_split_partial_rowwise_lamb_pt2_cpu_wrapper.cpp 2025-05-07T19:52:21.0392387Z Written: gen_embedding_backward_rowwise_adagrad_ssd_weighted_vbe_cuda.cu 2025-05-07T19:52:21.0393072Z Written: gen_embedding_backward_rowwise_adagrad_split_weighted_vbe_cuda.cu 2025-05-07T19:52:21.0393846Z Written: gen_embedding_backward_rowwise_adagrad_ssd_weighted_cuda.cu 2025-05-07T19:52:21.0394495Z Written: gen_embedding_backward_rowwise_adagrad_split_weighted_cuda.cu 
2025-05-07T19:52:21.0395070Z Written: gen_embedding_backward_rowwise_adagrad_ssd_unweighted_nobag_cuda.cu 2025-05-07T19:52:21.0395732Z Written: gen_embedding_backward_rowwise_adagrad_split_unweighted_nobag_cuda.cu 2025-05-07T19:52:21.0396370Z Written: gen_embedding_backward_rowwise_adagrad_ssd_unweighted_vbe_cuda.cu 2025-05-07T19:52:21.0396963Z Written: gen_embedding_backward_rowwise_adagrad_split_unweighted_vbe_cuda.cu 2025-05-07T19:52:21.0397589Z Written: gen_embedding_backward_rowwise_adagrad_ssd_unweighted_cuda.cu 2025-05-07T19:52:21.0398174Z Written: gen_embedding_backward_rowwise_adagrad_split_unweighted_cuda.cu 2025-05-07T19:52:21.0398794Z Written: gen_embedding_backward_rowwise_adagrad_ssd_weighted_vbe_meta.cpp 2025-05-07T19:52:21.0399392Z Written: gen_embedding_backward_rowwise_adagrad_split_weighted_vbe_meta.cpp 2025-05-07T19:52:21.0400088Z Written: gen_embedding_backward_rowwise_adagrad_ssd_weighted_meta.cpp 2025-05-07T19:52:21.0400648Z Written: gen_embedding_backward_rowwise_adagrad_split_weighted_meta.cpp 2025-05-07T19:52:21.0401212Z Written: gen_embedding_backward_rowwise_adagrad_ssd_unweighted_nobag_meta.cpp 2025-05-07T19:52:21.0401834Z Written: gen_embedding_backward_rowwise_adagrad_split_unweighted_nobag_meta.cpp 2025-05-07T19:52:21.0402840Z Written: gen_embedding_backward_rowwise_adagrad_ssd_unweighted_vbe_meta.cpp 2025-05-07T19:52:21.0403482Z Written: gen_embedding_backward_rowwise_adagrad_split_unweighted_vbe_meta.cpp 2025-05-07T19:52:21.0404230Z Written: gen_embedding_backward_rowwise_adagrad_ssd_unweighted_meta.cpp 2025-05-07T19:52:21.0404847Z Written: gen_embedding_backward_rowwise_adagrad_split_unweighted_meta.cpp 2025-05-07T19:52:21.0405492Z Written: gen_embedding_backward_rowwise_adagrad_ssd_weighted_vbe_kernel_cta.cu 2025-05-07T19:52:21.0406263Z Written: gen_embedding_backward_rowwise_adagrad_split_weighted_vbe_kernel_cta.cu 2025-05-07T19:52:21.0406919Z Written: gen_embedding_backward_rowwise_adagrad_ssd_weighted_kernel_cta.cu 
2025-05-07T19:52:21.0407528Z Written: gen_embedding_backward_rowwise_adagrad_split_weighted_kernel_cta.cu 2025-05-07T19:52:21.0408195Z Written: gen_embedding_backward_rowwise_adagrad_ssd_unweighted_nobag_kernel_cta.cu 2025-05-07T19:52:21.0408866Z Written: gen_embedding_backward_rowwise_adagrad_split_unweighted_nobag_kernel_cta.cu 2025-05-07T19:52:21.0409564Z Written: gen_embedding_backward_rowwise_adagrad_ssd_unweighted_vbe_kernel_cta.cu 2025-05-07T19:52:21.0410254Z Written: gen_embedding_backward_rowwise_adagrad_split_unweighted_vbe_kernel_cta.cu 2025-05-07T19:52:21.0410894Z Written: gen_embedding_backward_rowwise_adagrad_ssd_unweighted_kernel_cta.cu 2025-05-07T19:52:21.0411543Z Written: gen_embedding_backward_rowwise_adagrad_split_unweighted_kernel_cta.cu 2025-05-07T19:52:21.0412168Z Written: gen_embedding_backward_rowwise_adagrad_ssd_weighted_vbe_kernel_warp.cu 2025-05-07T19:52:21.0412838Z Written: gen_embedding_backward_rowwise_adagrad_split_weighted_vbe_kernel_warp.cu 2025-05-07T19:52:21.0413487Z Written: gen_embedding_backward_rowwise_adagrad_ssd_weighted_kernel_warp.cu 2025-05-07T19:52:21.0414091Z Written: gen_embedding_backward_rowwise_adagrad_split_weighted_kernel_warp.cu 2025-05-07T19:52:21.0414974Z Written: gen_embedding_backward_rowwise_adagrad_ssd_unweighted_nobag_kernel_warp.cu 2025-05-07T19:52:21.0415597Z Written: gen_embedding_backward_rowwise_adagrad_split_unweighted_nobag_kernel_warp.cu 2025-05-07T19:52:21.0416236Z Written: gen_embedding_backward_rowwise_adagrad_ssd_unweighted_vbe_kernel_warp.cu 2025-05-07T19:52:21.0416843Z Written: gen_embedding_backward_rowwise_adagrad_split_unweighted_vbe_kernel_warp.cu 2025-05-07T19:52:21.0417460Z Written: gen_embedding_backward_rowwise_adagrad_ssd_unweighted_kernel_warp.cu 2025-05-07T19:52:21.0418056Z Written: gen_embedding_backward_rowwise_adagrad_split_unweighted_kernel_warp.cu 2025-05-07T19:52:21.0418664Z Written: gen_embedding_backward_rowwise_adagrad_split_weighted_vbe_gwd_kernel_cta.cu 
2025-05-07T19:52:21.0419302Z Written: gen_embedding_backward_rowwise_adagrad_split_weighted_gwd_kernel_cta.cu 2025-05-07T19:52:21.0419916Z Written: gen_embedding_backward_rowwise_adagrad_split_unweighted_vbe_gwd_kernel_cta.cu 2025-05-07T19:52:21.0420571Z Written: gen_embedding_backward_rowwise_adagrad_split_unweighted_gwd_kernel_cta.cu 2025-05-07T19:52:21.0421222Z Written: gen_embedding_backward_rowwise_adagrad_split_weighted_vbe_gwd_kernel_warp.cu 2025-05-07T19:52:21.0421838Z Written: gen_embedding_backward_rowwise_adagrad_split_weighted_gwd_kernel_warp.cu 2025-05-07T19:52:21.0422494Z Written: gen_embedding_backward_rowwise_adagrad_split_unweighted_vbe_gwd_kernel_warp.cu 2025-05-07T19:52:21.0423125Z Written: gen_embedding_backward_rowwise_adagrad_split_unweighted_gwd_kernel_warp.cu 2025-05-07T19:52:21.0423751Z Written: gen_embedding_backward_rowwise_adagrad_split_weighted_vbe_gwd_cuda.cu 2025-05-07T19:52:21.0424315Z Written: gen_embedding_backward_rowwise_adagrad_split_weighted_gwd_cuda.cu 2025-05-07T19:52:21.0424911Z Written: gen_embedding_backward_rowwise_adagrad_split_unweighted_vbe_gwd_cuda.cu 2025-05-07T19:52:21.0425515Z Written: gen_embedding_backward_rowwise_adagrad_split_unweighted_gwd_cuda.cu 2025-05-07T19:52:21.0426059Z Written: gen_embedding_optimizer_rowwise_adagrad_ssd_device_kernel.cuh 2025-05-07T19:52:21.0426610Z Written: gen_embedding_optimizer_rowwise_adagrad_split_device_kernel.cuh 2025-05-07T19:52:21.0427087Z Written: gen_embedding_backward_ssd_rowwise_adagrad.cpp 2025-05-07T19:52:21.0427538Z Written: gen_embedding_ssd_rowwise_adagrad_pt2_autograd.cpp 2025-05-07T19:52:21.0428152Z Written: gen_embedding_backward_ssd_rowwise_adagrad_pt2_cuda_wrapper.cpp 2025-05-07T19:52:21.0428630Z Written: lookup_rowwise_adagrad_ssd.py 2025-05-07T19:52:21.0429034Z Written: gen_embedding_backward_split_rowwise_adagrad.cpp 2025-05-07T19:52:21.0429479Z Written: gen_embedding_split_rowwise_adagrad_pt2_autograd.cpp 2025-05-07T19:52:21.0430074Z Written: 
gen_embedding_backward_split_rowwise_adagrad_pt2_cuda_wrapper.cpp 2025-05-07T19:52:21.0430518Z Written: lookup_rowwise_adagrad.py 2025-05-07T19:52:21.0430913Z Written: gen_embedding_backward_rowwise_adagrad_split_cpu.cpp 2025-05-07T19:52:21.0431371Z Written: gen_embedding_backward_split_rowwise_adagrad_cpu.cpp 2025-05-07T19:52:21.0431896Z Written: gen_embedding_backward_split_rowwise_adagrad_pt2_cpu_wrapper.cpp 2025-05-07T19:52:21.0432489Z Written: gen_embedding_optimizer_approx_rowwise_adagrad_split_device_kernel.cuh 2025-05-07T19:52:21.0433111Z Written: gen_embedding_backward_split_approx_rowwise_adagrad.cpp 2025-05-07T19:52:21.0433850Z Written: gen_embedding_split_approx_rowwise_adagrad_pt2_autograd.cpp 2025-05-07T19:52:21.0434448Z Written: gen_embedding_backward_split_approx_rowwise_adagrad_pt2_cuda_wrapper.cpp 2025-05-07T19:52:21.0435082Z Written: gen_embedding_backward_split_approx_rowwise_adagrad_cpu.cpp 2025-05-07T19:52:21.0435675Z Written: gen_embedding_backward_split_approx_rowwise_adagrad_pt2_cpu_wrapper.cpp 2025-05-07T19:52:21.0436392Z Written: gen_embedding_optimizer_rowwise_adagrad_with_weight_decay_split_device_kernel.cuh 2025-05-07T19:52:21.0437085Z Written: gen_embedding_backward_split_rowwise_adagrad_with_weight_decay.cpp 2025-05-07T19:52:21.0437695Z Written: gen_embedding_split_rowwise_adagrad_with_weight_decay_pt2_autograd.cpp 2025-05-07T19:52:21.0438402Z Written: gen_embedding_backward_split_rowwise_adagrad_with_weight_decay_pt2_cuda_wrapper.cpp 2025-05-07T19:52:21.0439085Z Written: gen_embedding_backward_split_rowwise_adagrad_with_weight_decay_cpu.cpp 2025-05-07T19:52:21.0439793Z Written: gen_embedding_backward_split_rowwise_adagrad_with_weight_decay_pt2_cpu_wrapper.cpp 2025-05-07T19:52:21.0440576Z Written: gen_embedding_optimizer_approx_rowwise_adagrad_with_weight_decay_split_device_kernel.cuh 2025-05-07T19:52:21.0441290Z Written: gen_embedding_backward_split_approx_rowwise_adagrad_with_weight_decay.cpp 2025-05-07T19:52:21.0441984Z Written: 
gen_embedding_split_approx_rowwise_adagrad_with_weight_decay_pt2_autograd.cpp 2025-05-07T19:52:21.0442719Z Written: gen_embedding_backward_split_approx_rowwise_adagrad_with_weight_decay_pt2_cuda_wrapper.cpp 2025-05-07T19:52:21.0443486Z Written: gen_embedding_backward_split_approx_rowwise_adagrad_with_weight_decay_cpu.cpp 2025-05-07T19:52:21.1419222Z Written: gen_embedding_backward_split_approx_rowwise_adagrad_with_weight_decay_pt2_cpu_wrapper.cpp 2025-05-07T19:52:21.1420137Z Written: gen_embedding_backward_rowwise_adagrad_with_counter_split_weighted_vbe_cuda.cu 2025-05-07T19:52:21.1420802Z Written: gen_embedding_backward_rowwise_adagrad_with_counter_split_weighted_cuda.cu 2025-05-07T19:52:21.1421489Z Written: gen_embedding_backward_rowwise_adagrad_with_counter_split_unweighted_nobag_cuda.cu 2025-05-07T19:52:21.1422159Z Written: gen_embedding_backward_rowwise_adagrad_with_counter_split_unweighted_vbe_cuda.cu 2025-05-07T19:52:21.1422824Z Written: gen_embedding_backward_rowwise_adagrad_with_counter_split_unweighted_cuda.cu 2025-05-07T19:52:21.1423471Z Written: gen_embedding_backward_rowwise_adagrad_with_counter_split_weighted_vbe_meta.cpp 2025-05-07T19:52:21.1424134Z Written: gen_embedding_backward_rowwise_adagrad_with_counter_split_weighted_meta.cpp 2025-05-07T19:52:21.1424809Z Written: gen_embedding_backward_rowwise_adagrad_with_counter_split_unweighted_nobag_meta.cpp 2025-05-07T19:52:21.1425488Z Written: gen_embedding_backward_rowwise_adagrad_with_counter_split_unweighted_vbe_meta.cpp 2025-05-07T19:52:21.1426176Z Written: gen_embedding_backward_rowwise_adagrad_with_counter_split_unweighted_meta.cpp 2025-05-07T19:52:21.1426852Z Written: gen_embedding_backward_rowwise_adagrad_with_counter_split_weighted_vbe_kernel_cta.cu 2025-05-07T19:52:21.1427800Z Written: gen_embedding_backward_rowwise_adagrad_with_counter_split_weighted_kernel_cta.cu 2025-05-07T19:52:21.1428464Z Written: gen_embedding_backward_rowwise_adagrad_with_counter_split_unweighted_nobag_kernel_cta.cu 
2025-05-07T19:52:21.1429278Z Written: gen_embedding_backward_rowwise_adagrad_with_counter_split_unweighted_vbe_kernel_cta.cu 2025-05-07T19:52:21.1430001Z Written: gen_embedding_backward_rowwise_adagrad_with_counter_split_unweighted_kernel_cta.cu 2025-05-07T19:52:21.1430687Z Written: gen_embedding_backward_rowwise_adagrad_with_counter_split_weighted_vbe_kernel_warp.cu 2025-05-07T19:52:21.1431399Z Written: gen_embedding_backward_rowwise_adagrad_with_counter_split_weighted_kernel_warp.cu 2025-05-07T19:52:21.1432095Z Written: gen_embedding_backward_rowwise_adagrad_with_counter_split_unweighted_nobag_kernel_warp.cu 2025-05-07T19:52:21.1432965Z Written: gen_embedding_backward_rowwise_adagrad_with_counter_split_unweighted_vbe_kernel_warp.cu 2025-05-07T19:52:21.1434012Z Written: gen_embedding_backward_rowwise_adagrad_with_counter_split_unweighted_kernel_warp.cu 2025-05-07T19:52:21.1434724Z Written: gen_embedding_optimizer_rowwise_adagrad_with_counter_split_device_kernel.cuh 2025-05-07T19:52:21.1435384Z Written: gen_embedding_backward_split_rowwise_adagrad_with_counter.cpp 2025-05-07T19:52:21.1435975Z Written: gen_embedding_split_rowwise_adagrad_with_counter_pt2_autograd.cpp 2025-05-07T19:52:21.1436648Z Written: gen_embedding_backward_split_rowwise_adagrad_with_counter_pt2_cuda_wrapper.cpp 2025-05-07T19:52:21.1437227Z Written: lookup_rowwise_adagrad_with_counter.py 2025-05-07T19:52:21.1437720Z Written: gen_embedding_backward_split_rowwise_adagrad_with_counter_cpu.cpp 2025-05-07T19:52:21.1438381Z Written: gen_embedding_backward_split_rowwise_adagrad_with_counter_pt2_cpu_wrapper.cpp 2025-05-07T19:52:21.1439108Z Written: gen_embedding_optimizer_approx_rowwise_adagrad_with_counter_split_device_kernel.cuh 2025-05-07T19:52:21.1439933Z Written: gen_embedding_backward_split_approx_rowwise_adagrad_with_counter.cpp 2025-05-07T19:52:21.1440523Z Written: gen_embedding_split_approx_rowwise_adagrad_with_counter_pt2_autograd.cpp 2025-05-07T19:52:21.1441210Z Written: 
gen_embedding_backward_split_approx_rowwise_adagrad_with_counter_pt2_cuda_wrapper.cpp 2025-05-07T19:52:21.1441882Z Written: gen_embedding_backward_split_approx_rowwise_adagrad_with_counter_cpu.cpp 2025-05-07T19:52:21.1442519Z Written: gen_embedding_backward_split_approx_rowwise_adagrad_with_counter_pt2_cpu_wrapper.cpp 2025-05-07T19:52:21.1443204Z Written: gen_embedding_optimizer_rowwise_weighted_adagrad_split_device_kernel.cuh 2025-05-07T19:52:21.1443774Z Written: gen_embedding_backward_split_rowwise_weighted_adagrad.cpp 2025-05-07T19:52:21.1444327Z Written: gen_embedding_split_rowwise_weighted_adagrad_pt2_autograd.cpp 2025-05-07T19:52:21.1444907Z Written: gen_embedding_backward_split_rowwise_weighted_adagrad_pt2_cuda_wrapper.cpp 2025-05-07T19:52:21.1445524Z Written: gen_embedding_backward_split_rowwise_weighted_adagrad_cpu.cpp 2025-05-07T19:52:21.1446116Z Written: gen_embedding_backward_split_rowwise_weighted_adagrad_pt2_cpu_wrapper.cpp 2025-05-07T19:52:21.1446666Z Written: gen_embedding_backward_sgd_split_weighted_vbe_cuda.cu 2025-05-07T19:52:21.1447148Z Written: gen_embedding_backward_sgd_split_weighted_cuda.cu 2025-05-07T19:52:21.1447623Z Written: gen_embedding_backward_sgd_split_unweighted_nobag_cuda.cu 2025-05-07T19:52:21.1448142Z Written: gen_embedding_backward_sgd_split_unweighted_vbe_cuda.cu 2025-05-07T19:52:21.1448610Z Written: gen_embedding_backward_sgd_split_unweighted_cuda.cu 2025-05-07T19:52:21.1449105Z Written: gen_embedding_backward_sgd_split_weighted_vbe_meta.cpp 2025-05-07T19:52:21.1449593Z Written: gen_embedding_backward_sgd_split_weighted_meta.cpp 2025-05-07T19:52:21.1450061Z Written: gen_embedding_backward_sgd_split_unweighted_nobag_meta.cpp 2025-05-07T19:52:21.1450581Z Written: gen_embedding_backward_sgd_split_unweighted_vbe_meta.cpp 2025-05-07T19:52:21.1451128Z Written: gen_embedding_backward_sgd_split_unweighted_meta.cpp 2025-05-07T19:52:21.1451642Z Written: gen_embedding_backward_sgd_split_weighted_vbe_kernel_cta.cu 
2025-05-07T19:52:21.1452130Z Written: gen_embedding_backward_sgd_split_weighted_kernel_cta.cu 2025-05-07T19:52:21.1452661Z Written: gen_embedding_backward_sgd_split_unweighted_nobag_kernel_cta.cu 2025-05-07T19:52:21.1453287Z Written: gen_embedding_backward_sgd_split_unweighted_vbe_kernel_cta.cu 2025-05-07T19:52:21.1453794Z Written: gen_embedding_backward_sgd_split_unweighted_kernel_cta.cu 2025-05-07T19:52:21.1454325Z Written: gen_embedding_backward_sgd_split_weighted_vbe_kernel_warp.cu 2025-05-07T19:52:21.1454822Z Written: gen_embedding_backward_sgd_split_weighted_kernel_warp.cu 2025-05-07T19:52:21.1455358Z Written: gen_embedding_backward_sgd_split_unweighted_nobag_kernel_warp.cu 2025-05-07T19:52:21.1455895Z Written: gen_embedding_backward_sgd_split_unweighted_vbe_kernel_warp.cu 2025-05-07T19:52:21.1456437Z Written: gen_embedding_backward_sgd_split_unweighted_kernel_warp.cu 2025-05-07T19:52:21.1456951Z Written: gen_embedding_optimizer_sgd_split_device_kernel.cuh 2025-05-07T19:52:21.1457362Z Written: gen_embedding_backward_split_sgd.cpp 2025-05-07T19:52:21.1457768Z Written: gen_embedding_split_sgd_pt2_autograd.cpp 2025-05-07T19:52:21.1458187Z Written: gen_embedding_backward_split_sgd_pt2_cuda_wrapper.cpp 2025-05-07T19:52:21.1458599Z Written: lookup_sgd.py 2025-05-07T19:52:21.1458889Z Written: gen_embedding_backward_sgd_split_cpu.cpp 2025-05-07T19:52:21.1459283Z Written: gen_embedding_backward_split_sgd_cpu.cpp 2025-05-07T19:52:21.1459726Z Written: gen_embedding_backward_split_sgd_pt2_cpu_wrapper.cpp 2025-05-07T19:52:21.1460206Z Written: gen_embedding_optimizer_approx_sgd_split_device_kernel.cuh 2025-05-07T19:52:21.1460679Z Written: gen_embedding_backward_split_approx_sgd.cpp 2025-05-07T19:52:21.1461089Z Written: gen_embedding_split_approx_sgd_pt2_autograd.cpp 2025-05-07T19:52:21.1461585Z Written: gen_embedding_backward_split_approx_sgd_pt2_cuda_wrapper.cpp 2025-05-07T19:52:21.1462058Z Written: gen_embedding_backward_split_approx_sgd_cpu.cpp 
2025-05-07T19:52:21.1462548Z Written: gen_embedding_backward_split_approx_sgd_pt2_cpu_wrapper.cpp 2025-05-07T19:52:21.1463052Z Written: gen_embedding_backward_none_split_weighted_cuda.cu 2025-05-07T19:52:21.1463520Z Written: gen_embedding_backward_none_split_unweighted_nobag_cuda.cu 2025-05-07T19:52:21.1464036Z Written: gen_embedding_backward_none_split_unweighted_cuda.cu 2025-05-07T19:52:21.1464491Z Written: gen_embedding_backward_none_split_weighted_meta.cpp 2025-05-07T19:52:21.1464996Z Written: gen_embedding_backward_none_split_unweighted_nobag_meta.cpp 2025-05-07T19:52:21.1465481Z Written: gen_embedding_backward_none_split_unweighted_meta.cpp 2025-05-07T19:52:21.1465979Z Written: gen_embedding_backward_none_split_weighted_kernel_cta.cu 2025-05-07T19:52:21.1466493Z Written: gen_embedding_backward_none_split_unweighted_nobag_kernel_cta.cu 2025-05-07T19:52:21.1467042Z Written: gen_embedding_backward_none_split_unweighted_kernel_cta.cu 2025-05-07T19:52:21.1467564Z Written: gen_embedding_backward_none_split_weighted_kernel_warp.cu 2025-05-07T19:52:21.1468086Z Written: gen_embedding_backward_none_split_unweighted_nobag_kernel_warp.cu 2025-05-07T19:52:21.1468643Z Written: gen_embedding_backward_none_split_unweighted_kernel_warp.cu 2025-05-07T19:52:21.1469136Z Written: gen_embedding_optimizer_none_split_device_kernel.cuh 2025-05-07T19:52:21.1469575Z Written: gen_embedding_backward_split_none.cpp 2025-05-07T19:52:21.1469946Z Written: gen_embedding_split_none_pt2_autograd.cpp 2025-05-07T19:52:21.1470421Z Written: gen_embedding_backward_split_none_pt2_cuda_wrapper.cpp 2025-05-07T19:52:21.1470837Z Written: lookup_none.py 2025-05-07T19:52:21.1471133Z Written: gen_embedding_backward_split_none_cpu.cpp 2025-05-07T19:52:21.1471583Z Written: gen_embedding_backward_split_none_pt2_cpu_wrapper.cpp 2025-05-07T19:52:21.1472067Z Written: gen_embedding_backward_split_weighted_device_kernel_hip.hip 2025-05-07T19:52:21.1472710Z Written: 
gen_embedding_backward_split_unweighted_nobag_device_kernel_hip.hip 2025-05-07T19:52:21.1473576Z Written: gen_embedding_backward_split_unweighted_device_kernel_hip.hip 2025-05-07T19:52:21.1474192Z Written: gen_embedding_backward_ssd_weighted_vbe_device_kernel.cuh 2025-05-07T19:52:21.1474753Z Written: gen_embedding_backward_split_weighted_vbe_device_kernel.cuh 2025-05-07T19:52:21.1475345Z Written: gen_embedding_backward_ssd_weighted_device_kernel.cuh 2025-05-07T19:52:21.1475868Z Written: gen_embedding_backward_split_weighted_device_kernel.cuh 2025-05-07T19:52:21.1476402Z Written: gen_embedding_backward_ssd_unweighted_nobag_device_kernel.cuh 2025-05-07T19:52:21.1476995Z Written: gen_embedding_backward_split_unweighted_nobag_device_kernel.cuh 2025-05-07T19:52:21.1477586Z Written: gen_embedding_backward_ssd_unweighted_vbe_device_kernel.cuh 2025-05-07T19:52:21.1478130Z Written: gen_embedding_backward_split_unweighted_vbe_device_kernel.cuh 2025-05-07T19:52:21.1478690Z Written: gen_embedding_backward_ssd_unweighted_device_kernel.cuh 2025-05-07T19:52:21.1479212Z Written: gen_embedding_backward_split_unweighted_device_kernel.cuh 2025-05-07T19:52:21.1479842Z Written: gen_embedding_backward_split_common_device_kernel.cuh 2025-05-07T19:52:21.1480299Z Written: gen_embedding_backward_split_grad_embedding_ops.cu 2025-05-07T19:52:21.1480804Z Written: gen_embedding_backward_dense_indice_weights_codegen_cuda.cu 2025-05-07T19:52:21.1481340Z Written: gen_embedding_backward_ssd_indice_weights_codegen_cuda.cu 2025-05-07T19:52:21.1481841Z Written: gen_embedding_backward_split_indice_weights_codegen_cuda.cu 2025-05-07T19:52:21.1482279Z Written: pt2_arg_utils.h 2025-05-07T19:52:21.1482536Z Written: __init__.py 2025-05-07T19:52:21.1482812Z Written: lookup_args_ssd.py 2025-05-07T19:52:21.1483080Z Written: lookup_args.py 2025-05-07T19:52:21.1554591Z 2025-05-07T19:52:21.1554868Z 2025-05-07T19:52:21.1555497Z ================================================================================ 
2025-05-07T19:52:21.1555964Z Running code generation script ... 2025-05-07T19:52:21.1556866Z /github/home/miniconda/envs/build_binary/bin/python /__w/FBGEMM/FBGEMM/fbgemm_gpu/codegen/genscript/generate_embedding_optimizer.py --opensource 2025-05-07T19:52:21.1557687Z ================================================================================ 2025-05-07T19:52:21.1557952Z 2025-05-07T19:52:21.2630563Z [ARGS PARSE] Parsed arguments: Namespace(install_dir='.', is_fbcode=False, is_rocm=False) 2025-05-07T19:52:21.2631471Z [GENERATE OPTIMIZERS]: ['/__w/FBGEMM/FBGEMM/fbgemm_gpu/codegen/genscript/generate_embedding_optimizer.py', '--opensource'] 2025-05-07T19:52:21.2632231Z Written: gen_embedding_optimizer_rowwise_adagrad_split_cuda.cu 2025-05-07T19:52:21.2632740Z Written: gen_embedding_optimizer_rowwise_adagrad_split_kernel.cu 2025-05-07T19:52:21.2633313Z Written: gen_embedding_optimizer_rowwise_adagrad_split.cpp 2025-05-07T19:52:21.2633832Z Written: gen_embedding_optimizer_rowwise_adagrad_split_device_kernel.cuh 2025-05-07T19:52:21.2634328Z Written: split_embedding_optimizer_rowwise_adagrad.py 2025-05-07T19:52:21.2634729Z Written: optimizer_args.py 2025-05-07T19:52:21.2738293Z 2025-05-07T19:52:21.2738489Z 2025-05-07T19:52:21.2738744Z ================================================================================ 2025-05-07T19:52:21.2739975Z Running code generation script ... 
2025-05-07T19:52:21.2740956Z /github/home/miniconda/envs/build_binary/bin/python /__w/FBGEMM/FBGEMM/fbgemm_gpu/codegen/genscript/generate_forward_quantized.py --opensource 2025-05-07T19:52:21.2741771Z ================================================================================ 2025-05-07T19:52:21.2742023Z 2025-05-07T19:52:21.3994954Z [ARGS PARSE] Parsed arguments: Namespace(install_dir='.', is_fbcode=False, is_rocm=False) 2025-05-07T19:52:21.3995949Z [GENERATE FORWARD QUANTIZED]: ['/__w/FBGEMM/FBGEMM/fbgemm_gpu/codegen/genscript/generate_forward_quantized.py', '--opensource'] 2025-05-07T19:52:21.3996856Z Written: gen_embedding_forward_quantized_split_nbit_kernel_weighted_fp32_codegen_cuda.cu 2025-05-07T19:52:21.3997897Z Written: gen_embedding_forward_quantized_split_nbit_kernel_weighted_fp16_codegen_cuda.cu 2025-05-07T19:52:21.3998605Z Written: gen_embedding_forward_quantized_split_nbit_kernel_weighted_fp8_codegen_cuda.cu 2025-05-07T19:52:21.3999339Z Written: gen_embedding_forward_quantized_split_nbit_kernel_weighted_int8_codegen_cuda.cu 2025-05-07T19:52:21.4000179Z Written: gen_embedding_forward_quantized_split_nbit_kernel_weighted_int4_codegen_cuda.cu 2025-05-07T19:52:21.4000927Z Written: gen_embedding_forward_quantized_split_nbit_kernel_weighted_int2_codegen_cuda.cu 2025-05-07T19:52:21.4001706Z Written: gen_embedding_forward_quantized_split_nbit_kernel_unweighted_nobag_fp32_codegen_cuda.cu 2025-05-07T19:52:21.4002650Z Written: gen_embedding_forward_quantized_split_nbit_kernel_unweighted_nobag_fp16_codegen_cuda.cu 2025-05-07T19:52:21.4003451Z Written: gen_embedding_forward_quantized_split_nbit_kernel_unweighted_nobag_fp8_codegen_cuda.cu 2025-05-07T19:52:21.4004220Z Written: gen_embedding_forward_quantized_split_nbit_kernel_unweighted_nobag_int8_codegen_cuda.cu 2025-05-07T19:52:21.4005017Z Written: gen_embedding_forward_quantized_split_nbit_kernel_unweighted_nobag_int4_codegen_cuda.cu 2025-05-07T19:52:21.4005800Z Written: 
gen_embedding_forward_quantized_split_nbit_kernel_unweighted_nobag_int2_codegen_cuda.cu 2025-05-07T19:52:21.4006536Z Written: gen_embedding_forward_quantized_split_nbit_kernel_unweighted_fp32_codegen_cuda.cu 2025-05-07T19:52:21.4007278Z Written: gen_embedding_forward_quantized_split_nbit_kernel_unweighted_fp16_codegen_cuda.cu 2025-05-07T19:52:21.4007990Z Written: gen_embedding_forward_quantized_split_nbit_kernel_unweighted_fp8_codegen_cuda.cu 2025-05-07T19:52:21.4008730Z Written: gen_embedding_forward_quantized_split_nbit_kernel_unweighted_int8_codegen_cuda.cu 2025-05-07T19:52:21.4009472Z Written: gen_embedding_forward_quantized_split_nbit_kernel_unweighted_int4_codegen_cuda.cu 2025-05-07T19:52:21.4010178Z Written: gen_embedding_forward_quantized_split_nbit_kernel_unweighted_int2_codegen_cuda.cu 2025-05-07T19:52:21.4010901Z Written: gen_embedding_forward_quantized_split_nbit_host_weighted_codegen_cuda.cu 2025-05-07T19:52:21.4011578Z Written: gen_embedding_forward_quantized_split_nbit_host_unweighted_nobag_codegen_cuda.cu 2025-05-07T19:52:21.4012290Z Written: gen_embedding_forward_quantized_split_nbit_host_unweighted_codegen_cuda.cu 2025-05-07T19:52:21.4012894Z Written: gen_embedding_forward_quantized_weighted_codegen_cpu.cpp 2025-05-07T19:52:21.4013462Z Written: gen_embedding_forward_quantized_unweighted_codegen_cpu.cpp 2025-05-07T19:52:21.4102453Z 2025-05-07T19:52:21.4102525Z 2025-05-07T19:52:21.4103032Z ================================================================================ 2025-05-07T19:52:21.4103615Z Running code generation script ... 
2025-05-07T19:52:21.4104416Z /github/home/miniconda/envs/build_binary/bin/python /__w/FBGEMM/FBGEMM/fbgemm_gpu/codegen/genscript/generate_forward_split.py --opensource 2025-05-07T19:52:21.4105254Z ================================================================================ 2025-05-07T19:52:21.4105520Z 2025-05-07T19:52:21.7522584Z [ARGS PARSE] Parsed arguments: Namespace(install_dir='.', is_fbcode=False, is_rocm=False) 2025-05-07T19:52:21.7525181Z [GENERATE FORWARD SPLIT]: ['/__w/FBGEMM/FBGEMM/fbgemm_gpu/codegen/genscript/generate_forward_split.py', '--opensource'] 2025-05-07T19:52:21.7527396Z Written: gen_embedding_forward_dense_weighted_vbe_codegen_cuda.cu 2025-05-07T19:52:21.7528863Z Written: gen_embedding_forward_dense_weighted_codegen_cuda.cu 2025-05-07T19:52:21.7530147Z Written: gen_embedding_forward_dense_unweighted_vbe_codegen_cuda.cu 2025-05-07T19:52:21.7530672Z Written: gen_embedding_forward_dense_unweighted_codegen_cuda.cu 2025-05-07T19:52:21.7531170Z Written: gen_embedding_forward_ssd_weighted_vbe_codegen_cuda.cu 2025-05-07T19:52:21.7531644Z Written: gen_embedding_forward_split_weighted_vbe_codegen_cuda.cu 2025-05-07T19:52:21.7532138Z Written: gen_embedding_forward_ssd_weighted_codegen_cuda.cu 2025-05-07T19:52:21.7532838Z Written: gen_embedding_forward_split_weighted_codegen_cuda.cu 2025-05-07T19:52:21.7533329Z Written: gen_embedding_forward_ssd_unweighted_vbe_codegen_cuda.cu 2025-05-07T19:52:21.7533824Z Written: gen_embedding_forward_split_unweighted_vbe_codegen_cuda.cu 2025-05-07T19:52:21.7534329Z Written: gen_embedding_forward_ssd_unweighted_codegen_cuda.cu 2025-05-07T19:52:21.7534931Z Written: gen_embedding_forward_split_unweighted_codegen_cuda.cu 2025-05-07T19:52:21.7535429Z Written: gen_embedding_forward_split_weighted_vbe_gwd_codegen_cuda.cu 2025-05-07T19:52:21.7535961Z Written: gen_embedding_forward_split_weighted_gwd_codegen_cuda.cu 2025-05-07T19:52:21.7536476Z Written: gen_embedding_forward_split_unweighted_vbe_gwd_codegen_cuda.cu 
2025-05-07T19:52:21.7537027Z Written: gen_embedding_forward_split_unweighted_gwd_codegen_cuda.cu 2025-05-07T19:52:21.7537527Z Written: gen_embedding_forward_dense_weighted_vbe_codegen_meta.cpp 2025-05-07T19:52:21.7538032Z Written: gen_embedding_forward_dense_weighted_codegen_meta.cpp 2025-05-07T19:52:21.7538549Z Written: gen_embedding_forward_dense_unweighted_vbe_codegen_meta.cpp 2025-05-07T19:52:21.7539043Z Written: gen_embedding_forward_dense_unweighted_codegen_meta.cpp 2025-05-07T19:52:21.7539535Z Written: gen_embedding_forward_ssd_weighted_vbe_codegen_meta.cpp 2025-05-07T19:52:21.7540011Z Written: gen_embedding_forward_split_weighted_vbe_codegen_meta.cpp 2025-05-07T19:52:21.7540508Z Written: gen_embedding_forward_ssd_weighted_codegen_meta.cpp 2025-05-07T19:52:21.7540955Z Written: gen_embedding_forward_split_weighted_codegen_meta.cpp 2025-05-07T19:52:21.7541450Z Written: gen_embedding_forward_ssd_unweighted_vbe_codegen_meta.cpp 2025-05-07T19:52:21.7541974Z Written: gen_embedding_forward_split_unweighted_vbe_codegen_meta.cpp 2025-05-07T19:52:21.7542465Z Written: gen_embedding_forward_ssd_unweighted_codegen_meta.cpp 2025-05-07T19:52:21.7542956Z Written: gen_embedding_forward_split_unweighted_codegen_meta.cpp 2025-05-07T19:52:21.7543412Z Written: gen_embedding_forward_dense_weighted_vbe_kernel.cu 2025-05-07T19:52:21.7543870Z Written: gen_embedding_forward_dense_weighted_kernel.cu 2025-05-07T19:52:21.7544307Z Written: gen_embedding_forward_dense_unweighted_nobag_kernel.cu 2025-05-07T19:52:21.7544801Z Written: gen_embedding_forward_dense_unweighted_vbe_kernel.cu 2025-05-07T19:52:21.7545243Z Written: gen_embedding_forward_dense_unweighted_kernel.cu 2025-05-07T19:52:21.7545693Z Written: gen_embedding_forward_ssd_weighted_vbe_kernel.cu 2025-05-07T19:52:21.7546155Z Written: gen_embedding_forward_split_weighted_vbe_kernel.cu 2025-05-07T19:52:21.7546575Z Written: gen_embedding_forward_ssd_weighted_kernel.cu 2025-05-07T19:52:21.7547006Z Written: 
gen_embedding_forward_split_weighted_kernel.cu 2025-05-07T19:52:21.7547435Z Written: gen_embedding_forward_ssd_unweighted_nobag_kernel.cu 2025-05-07T19:52:21.7547925Z Written: gen_embedding_forward_split_unweighted_nobag_kernel.cu 2025-05-07T19:52:21.7548370Z Written: gen_embedding_forward_ssd_unweighted_vbe_kernel.cu 2025-05-07T19:52:21.7548811Z Written: gen_embedding_forward_split_unweighted_vbe_kernel.cu 2025-05-07T19:52:21.7549254Z Written: gen_embedding_forward_ssd_unweighted_kernel.cu 2025-05-07T19:52:21.7549652Z Written: gen_embedding_forward_split_unweighted_kernel.cu 2025-05-07T19:52:21.7550091Z Written: gen_embedding_forward_split_weighted_vbe_gwd_kernel.cu 2025-05-07T19:52:21.7550521Z Written: gen_embedding_forward_split_weighted_gwd_kernel.cu 2025-05-07T19:52:21.7550978Z Written: gen_embedding_forward_split_unweighted_vbe_gwd_kernel.cu 2025-05-07T19:52:21.7551426Z Written: gen_embedding_forward_split_unweighted_gwd_kernel.cu 2025-05-07T19:52:21.7551862Z Written: gen_embedding_forward_split_weighted_v2_kernel.cu 2025-05-07T19:52:21.7552293Z Written: gen_embedding_forward_split_unweighted_v2_kernel.cu 2025-05-07T19:52:21.7552756Z Written: gen_embedding_forward_dense_unweighted_nobag_kernel_small.cu 2025-05-07T19:52:21.7553586Z Written: gen_embedding_forward_dense_unweighted_nobag_kernel_small.cu 2025-05-07T19:52:21.7554124Z Written: gen_embedding_forward_ssd_unweighted_nobag_kernel_small.cu 2025-05-07T19:52:21.7554765Z Written: gen_embedding_forward_split_unweighted_nobag_kernel_small.cu 2025-05-07T19:52:21.7555261Z Written: gen_embedding_forward_split_pt2_cuda_wrapper.cpp 2025-05-07T19:52:21.7555724Z Written: gen_embedding_forward_split_pt2_cpu_wrapper.cpp 2025-05-07T19:52:21.7556178Z Written: gen_embedding_forward_ssd_pt2_cuda_wrapper.cpp 2025-05-07T19:52:21.7653171Z 2025-05-07T19:52:21.7653648Z 2025-05-07T19:52:21.7654222Z ================================================================================ 2025-05-07T19:52:21.7655329Z Running code 
generation script ... 2025-05-07T19:52:21.7657592Z /github/home/miniconda/envs/build_binary/bin/python /__w/FBGEMM/FBGEMM/fbgemm_gpu/codegen/genscript/generate_index_select.py --opensource 2025-05-07T19:52:21.7659711Z ================================================================================ 2025-05-07T19:52:21.7659968Z 2025-05-07T19:52:22.0284357Z [ARGS PARSE] Parsed arguments: Namespace(install_dir='.', is_fbcode=False, is_rocm=False) 2025-05-07T19:52:22.0287071Z [INDEX SELECT GENERATOR]: ['/__w/FBGEMM/FBGEMM/fbgemm_gpu/codegen/genscript/generate_index_select.py', '--opensource'] 2025-05-07T19:52:22.0301087Z Written: gen_batch_index_select_dim0_forward_codegen_cuda.cu 2025-05-07T19:52:22.0301621Z Written: gen_batch_index_select_dim0_forward_kernel.cu 2025-05-07T19:52:22.0302320Z Written: gen_batch_index_select_dim0_forward_kernel_small.cu 2025-05-07T19:52:22.0302954Z Written: gen_batch_index_select_dim0_backward_codegen_cuda.cu 2025-05-07T19:52:22.0303442Z Written: gen_batch_index_select_dim0_backward_kernel_cta.cu 2025-05-07T19:52:22.0303937Z Written: gen_batch_index_select_dim0_backward_kernel_warp.cu 2025-05-07T19:52:22.0304482Z Written: gen_embedding_backward_split_batch_index_select_device_kernel.cuh 2025-05-07T19:52:22.0305051Z Written: gen_embedding_backward_split_grad_index_select.cu 2025-05-07T19:52:22.0305519Z Written: gen_embedding_backward_split_common_device_kernel.cuh 2025-05-07T19:52:22.0403771Z -- Adding merge_pooled_embeddings sources 2025-05-07T19:52:22.0419419Z 2025-05-07T19:52:22.0419522Z 2025-05-07T19:52:22.0420069Z ================================================================================ 2025-05-07T19:52:22.0421157Z GPU CPP Library Target: fbgemm_gpu_tbe_cache (SHARED) 2025-05-07T19:52:22.0421513Z 2025-05-07T19:52:22.0421749Z CPU_SRCS: 2025-05-07T19:52:22.0422185Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/src/split_embeddings_cache/lfu_cache_populate_byte.cpp 2025-05-07T19:52:22.0422874Z 
/__w/FBGEMM/FBGEMM/fbgemm_gpu/src/split_embeddings_cache/linearize_cache_indices.cpp 2025-05-07T19:52:22.0423520Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/src/split_embeddings_cache/lru_cache_populate_byte.cpp 2025-05-07T19:52:22.0424139Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/src/split_embeddings_cache/lxu_cache.cpp 2025-05-07T19:52:22.0424753Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/src/split_embeddings_cache/split_embeddings_cache_ops.cpp 2025-05-07T19:52:22.0425267Z 2025-05-07T19:52:22.0425499Z GPU_SRCS: 2025-05-07T19:52:22.0425864Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/src/split_embeddings_cache/lfu_cache_find.cu 2025-05-07T19:52:22.0426498Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/src/split_embeddings_cache/lfu_cache_populate.cu 2025-05-07T19:52:22.0427131Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/src/split_embeddings_cache/lfu_cache_populate_byte.cu 2025-05-07T19:52:22.0427800Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/src/split_embeddings_cache/linearize_cache_indices.cu 2025-05-07T19:52:22.0428411Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/src/split_embeddings_cache/lru_cache_find.cu 2025-05-07T19:52:22.0429008Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/src/split_embeddings_cache/lru_cache_populate.cu 2025-05-07T19:52:22.0429640Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/src/split_embeddings_cache/lru_cache_populate_byte.cu 2025-05-07T19:52:22.0430232Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/src/split_embeddings_cache/lxu_cache.cu 2025-05-07T19:52:22.0430830Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/src/split_embeddings_cache/reset_weight_momentum.cu 2025-05-07T19:52:22.0431468Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/src/split_embeddings_cache/split_embeddings_cache_ops.cu 2025-05-07T19:52:22.0432188Z 2025-05-07T19:52:22.0432382Z CUDA_SPECIFIC_SRCS: 2025-05-07T19:52:22.0432559Z 2025-05-07T19:52:22.0432645Z 2025-05-07T19:52:22.0433023Z HIP_SPECIFIC_SRCS: 2025-05-07T19:52:22.0433177Z 2025-05-07T19:52:22.0433436Z 2025-05-07T19:52:22.0433677Z OTHER_SRCS: 2025-05-07T19:52:22.0433807Z 2025-05-07T19:52:22.0433956Z 2025-05-07T19:52:22.0434293Z CC_FLAGS: 2025-05-07T19:52:22.0434421Z 
2025-05-07T19:52:22.0434510Z 2025-05-07T19:52:22.0434730Z NVCC_FLAGS: 2025-05-07T19:52:22.0434970Z --expt-relaxed-constexpr 2025-05-07T19:52:22.0435289Z -D__CUDA_NO_HALF_OPERATORS__ 2025-05-07T19:52:22.0435580Z -D__CUDA_NO_BFLOAT16_CONVERSIONS__ 2025-05-07T19:52:22.0435893Z -D__CUDA_NO_HALF2_OPERATORS__ 2025-05-07T19:52:22.0436178Z 2025-05-07T19:52:22.0436364Z HIPCC_FLAGS: 2025-05-07T19:52:22.0436495Z 2025-05-07T19:52:22.0436592Z 2025-05-07T19:52:22.0436778Z INCLUDE_DIRS: 2025-05-07T19:52:22.0437036Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/../include 2025-05-07T19:52:22.0437351Z /__w/FBGEMM/FBGEMM/fbgemm_gpu 2025-05-07T19:52:22.0437660Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include 2025-05-07T19:52:22.0437980Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/../include 2025-05-07T19:52:22.0438514Z /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include 2025-05-07T19:52:22.0439381Z /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include 2025-05-07T19:52:22.0440159Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src 2025-05-07T19:52:22.0440583Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include 2025-05-07T19:52:22.0441018Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include 2025-05-07T19:52:22.0441532Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include 2025-05-07T19:52:22.0442054Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include 2025-05-07T19:52:22.0442542Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include 2025-05-07T19:52:22.0443098Z /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include 2025-05-07T19:52:22.0443622Z 2025-05-07T19:52:22.0443846Z Selected Source Files: 2025-05-07T19:52:22.0444275Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/src/split_embeddings_cache/lfu_cache_populate_byte.cpp 2025-05-07T19:52:22.0444970Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/src/split_embeddings_cache/linearize_cache_indices.cpp 2025-05-07T19:52:22.0445626Z 
/__w/FBGEMM/FBGEMM/fbgemm_gpu/src/split_embeddings_cache/lru_cache_populate_byte.cpp 2025-05-07T19:52:22.0446256Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/src/split_embeddings_cache/lxu_cache.cpp 2025-05-07T19:52:22.0446868Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/src/split_embeddings_cache/split_embeddings_cache_ops.cpp 2025-05-07T19:52:22.0447532Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/src/split_embeddings_cache/lfu_cache_find.cu 2025-05-07T19:52:22.0448151Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/src/split_embeddings_cache/lfu_cache_populate.cu 2025-05-07T19:52:22.0448768Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/src/split_embeddings_cache/lfu_cache_populate_byte.cu 2025-05-07T19:52:22.0449443Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/src/split_embeddings_cache/linearize_cache_indices.cu 2025-05-07T19:52:22.0450042Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/src/split_embeddings_cache/lru_cache_find.cu 2025-05-07T19:52:22.0450637Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/src/split_embeddings_cache/lru_cache_populate.cu 2025-05-07T19:52:22.0451275Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/src/split_embeddings_cache/lru_cache_populate_byte.cu 2025-05-07T19:52:22.0451881Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/src/split_embeddings_cache/lxu_cache.cu 2025-05-07T19:52:22.0452492Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/src/split_embeddings_cache/reset_weight_momentum.cu 2025-05-07T19:52:22.0453148Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/src/split_embeddings_cache/split_embeddings_cache_ops.cu 2025-05-07T19:52:22.0453646Z 2025-05-07T19:52:22.0453841Z HIPified Source Files: 2025-05-07T19:52:22.0454024Z 2025-05-07T19:52:22.0454101Z 2025-05-07T19:52:22.0454308Z Library Dependencies: 2025-05-07T19:52:22.0454555Z torch 2025-05-07T19:52:22.0454890Z torch_library 2025-05-07T19:52:22.0455343Z /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/lib/libc10.so 2025-05-07T19:52:22.0455976Z /github/home/miniconda/envs/build_binary/lib/libnvrtc.so 2025-05-07T19:52:22.0456577Z /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/lib/libc10_cuda.so 
2025-05-07T19:52:22.0457452Z /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/lib/libnccl.so.2 2025-05-07T19:52:22.0458101Z /github/home/miniconda/envs/build_binary/lib/stubs/libcuda.so 2025-05-07T19:52:22.0458640Z /github/home/miniconda/envs/build_binary/lib/stubs/libnvidia-ml.so 2025-05-07T19:52:22.0459073Z 2025-05-07T19:52:22.0459278Z Output Library: 2025-05-07T19:52:22.0459548Z fbgemm_gpu_tbe_cache 2025-05-07T19:52:22.0459790Z 2025-05-07T19:52:22.0460037Z Destination Directory: 2025-05-07T19:52:22.0460296Z fbgemm_gpu 2025-05-07T19:52:22.0460571Z ================================================================================ 2025-05-07T19:52:22.0460814Z 2025-05-07T19:52:22.0987863Z 2025-05-07T19:52:22.0988082Z 2025-05-07T19:52:22.0988631Z ================================================================================ 2025-05-07T19:52:22.0990010Z GPU CPP Library Target: fbgemm_gpu_tbe_inference (SHARED) 2025-05-07T19:52:22.0991081Z 2025-05-07T19:52:22.0991605Z CPU_SRCS: 2025-05-07T19:52:22.0992544Z codegen/inference/embedding_forward_quantized_host_cpu.cpp 2025-05-07T19:52:22.0994219Z gen_embedding_forward_quantized_unweighted_codegen_cpu.cpp 2025-05-07T19:52:22.0995542Z gen_embedding_forward_quantized_weighted_codegen_cpu.cpp 2025-05-07T19:52:22.0996582Z 2025-05-07T19:52:22.0997095Z GPU_SRCS: 2025-05-07T19:52:22.0997928Z codegen/inference/embedding_forward_quantized_host.cpp 2025-05-07T19:52:22.0999283Z codegen/inference/embedding_forward_quantized_split_lookup.cu 2025-05-07T19:52:22.1000579Z gen_embedding_forward_quantized_split_nbit_kernel_weighted_fp32_codegen_cuda.cu 2025-05-07T19:52:22.1001174Z gen_embedding_forward_quantized_split_nbit_kernel_weighted_fp16_codegen_cuda.cu 2025-05-07T19:52:22.1001810Z gen_embedding_forward_quantized_split_nbit_kernel_weighted_fp8_codegen_cuda.cu 2025-05-07T19:52:22.1003055Z gen_embedding_forward_quantized_split_nbit_kernel_weighted_int8_codegen_cuda.cu 2025-05-07T19:52:22.1003696Z 
gen_embedding_forward_quantized_split_nbit_kernel_weighted_int4_codegen_cuda.cu 2025-05-07T19:52:22.1004361Z gen_embedding_forward_quantized_split_nbit_kernel_weighted_int2_codegen_cuda.cu 2025-05-07T19:52:22.1005038Z gen_embedding_forward_quantized_split_nbit_kernel_unweighted_nobag_fp32_codegen_cuda.cu 2025-05-07T19:52:22.1005756Z gen_embedding_forward_quantized_split_nbit_kernel_unweighted_nobag_fp16_codegen_cuda.cu 2025-05-07T19:52:22.1006436Z gen_embedding_forward_quantized_split_nbit_kernel_unweighted_nobag_fp8_codegen_cuda.cu 2025-05-07T19:52:22.1007157Z gen_embedding_forward_quantized_split_nbit_kernel_unweighted_nobag_int8_codegen_cuda.cu 2025-05-07T19:52:22.1007841Z gen_embedding_forward_quantized_split_nbit_kernel_unweighted_nobag_int4_codegen_cuda.cu 2025-05-07T19:52:22.1008560Z gen_embedding_forward_quantized_split_nbit_kernel_unweighted_nobag_int2_codegen_cuda.cu 2025-05-07T19:52:22.1009361Z gen_embedding_forward_quantized_split_nbit_kernel_unweighted_fp32_codegen_cuda.cu 2025-05-07T19:52:22.1010087Z gen_embedding_forward_quantized_split_nbit_kernel_unweighted_fp16_codegen_cuda.cu 2025-05-07T19:52:22.1010709Z gen_embedding_forward_quantized_split_nbit_kernel_unweighted_fp8_codegen_cuda.cu 2025-05-07T19:52:22.1011289Z gen_embedding_forward_quantized_split_nbit_kernel_unweighted_int8_codegen_cuda.cu 2025-05-07T19:52:22.1011899Z gen_embedding_forward_quantized_split_nbit_kernel_unweighted_int4_codegen_cuda.cu 2025-05-07T19:52:22.1012492Z gen_embedding_forward_quantized_split_nbit_kernel_unweighted_int2_codegen_cuda.cu 2025-05-07T19:52:22.1013074Z gen_embedding_forward_quantized_split_nbit_host_weighted_codegen_cuda.cu 2025-05-07T19:52:22.1013658Z gen_embedding_forward_quantized_split_nbit_host_unweighted_nobag_codegen_cuda.cu 2025-05-07T19:52:22.1014656Z gen_embedding_forward_quantized_split_nbit_host_unweighted_codegen_cuda.cu 2025-05-07T19:52:22.1015104Z 2025-05-07T19:52:22.1015312Z CUDA_SPECIFIC_SRCS: 2025-05-07T19:52:22.1015486Z 
2025-05-07T19:52:22.1015571Z 2025-05-07T19:52:22.1015771Z HIP_SPECIFIC_SRCS: 2025-05-07T19:52:22.1015929Z 2025-05-07T19:52:22.1016005Z 2025-05-07T19:52:22.1016191Z OTHER_SRCS: 2025-05-07T19:52:22.1016431Z 2025-05-07T19:52:22.1016518Z 2025-05-07T19:52:22.1016713Z CC_FLAGS: 2025-05-07T19:52:22.1016825Z 2025-05-07T19:52:22.1016903Z 2025-05-07T19:52:22.1017121Z NVCC_FLAGS: 2025-05-07T19:52:22.1017349Z --expt-relaxed-constexpr 2025-05-07T19:52:22.1017646Z -D__CUDA_NO_HALF_OPERATORS__ 2025-05-07T19:52:22.1017940Z -D__CUDA_NO_BFLOAT16_CONVERSIONS__ 2025-05-07T19:52:22.1018268Z -D__CUDA_NO_HALF2_OPERATORS__ 2025-05-07T19:52:22.1018523Z 2025-05-07T19:52:22.1018742Z HIPCC_FLAGS: 2025-05-07T19:52:22.1018872Z 2025-05-07T19:52:22.1018952Z 2025-05-07T19:52:22.1019160Z INCLUDE_DIRS: 2025-05-07T19:52:22.1019409Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/../include 2025-05-07T19:52:22.1019726Z /__w/FBGEMM/FBGEMM/fbgemm_gpu 2025-05-07T19:52:22.1020034Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include 2025-05-07T19:52:22.1020344Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/../include 2025-05-07T19:52:22.1020853Z /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include 2025-05-07T19:52:22.1021641Z /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include 2025-05-07T19:52:22.1022297Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src 2025-05-07T19:52:22.1022704Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include 2025-05-07T19:52:22.1023151Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include 2025-05-07T19:52:22.1023634Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include 2025-05-07T19:52:22.1024139Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include 2025-05-07T19:52:22.1024613Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include 2025-05-07T19:52:22.1025177Z /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include 2025-05-07T19:52:22.1025692Z 
2025-05-07T19:52:22.1025890Z Selected Source Files: 2025-05-07T19:52:22.1026353Z codegen/inference/embedding_forward_quantized_host_cpu.cpp 2025-05-07T19:52:22.1026804Z gen_embedding_forward_quantized_unweighted_codegen_cpu.cpp 2025-05-07T19:52:22.1027236Z gen_embedding_forward_quantized_weighted_codegen_cpu.cpp 2025-05-07T19:52:22.1027670Z codegen/inference/embedding_forward_quantized_host.cpp 2025-05-07T19:52:22.1028117Z codegen/inference/embedding_forward_quantized_split_lookup.cu 2025-05-07T19:52:22.1028674Z gen_embedding_forward_quantized_split_nbit_kernel_weighted_fp32_codegen_cuda.cu 2025-05-07T19:52:22.1029254Z gen_embedding_forward_quantized_split_nbit_kernel_weighted_fp16_codegen_cuda.cu 2025-05-07T19:52:22.1029847Z gen_embedding_forward_quantized_split_nbit_kernel_weighted_fp8_codegen_cuda.cu 2025-05-07T19:52:22.1030437Z gen_embedding_forward_quantized_split_nbit_kernel_weighted_int8_codegen_cuda.cu 2025-05-07T19:52:22.1031008Z gen_embedding_forward_quantized_split_nbit_kernel_weighted_int4_codegen_cuda.cu 2025-05-07T19:52:22.1031593Z gen_embedding_forward_quantized_split_nbit_kernel_weighted_int2_codegen_cuda.cu 2025-05-07T19:52:22.1032195Z gen_embedding_forward_quantized_split_nbit_kernel_unweighted_nobag_fp32_codegen_cuda.cu 2025-05-07T19:52:22.1032928Z gen_embedding_forward_quantized_split_nbit_kernel_unweighted_nobag_fp16_codegen_cuda.cu 2025-05-07T19:52:22.1033761Z gen_embedding_forward_quantized_split_nbit_kernel_unweighted_nobag_fp8_codegen_cuda.cu 2025-05-07T19:52:22.1034450Z gen_embedding_forward_quantized_split_nbit_kernel_unweighted_nobag_int8_codegen_cuda.cu 2025-05-07T19:52:22.1035150Z gen_embedding_forward_quantized_split_nbit_kernel_unweighted_nobag_int4_codegen_cuda.cu 2025-05-07T19:52:22.1035816Z gen_embedding_forward_quantized_split_nbit_kernel_unweighted_nobag_int2_codegen_cuda.cu 2025-05-07T19:52:22.1036607Z gen_embedding_forward_quantized_split_nbit_kernel_unweighted_fp32_codegen_cuda.cu 2025-05-07T19:52:22.1037241Z 
gen_embedding_forward_quantized_split_nbit_kernel_unweighted_fp16_codegen_cuda.cu 2025-05-07T19:52:22.1037885Z gen_embedding_forward_quantized_split_nbit_kernel_unweighted_fp8_codegen_cuda.cu 2025-05-07T19:52:22.1038607Z gen_embedding_forward_quantized_split_nbit_kernel_unweighted_int8_codegen_cuda.cu 2025-05-07T19:52:22.1039248Z gen_embedding_forward_quantized_split_nbit_kernel_unweighted_int4_codegen_cuda.cu 2025-05-07T19:52:22.1039892Z gen_embedding_forward_quantized_split_nbit_kernel_unweighted_int2_codegen_cuda.cu 2025-05-07T19:52:22.1040490Z gen_embedding_forward_quantized_split_nbit_host_weighted_codegen_cuda.cu 2025-05-07T19:52:22.1041102Z gen_embedding_forward_quantized_split_nbit_host_unweighted_nobag_codegen_cuda.cu 2025-05-07T19:52:22.1041754Z gen_embedding_forward_quantized_split_nbit_host_unweighted_codegen_cuda.cu 2025-05-07T19:52:22.1042187Z 2025-05-07T19:52:22.1042403Z HIPified Source Files: 2025-05-07T19:52:22.1042566Z 2025-05-07T19:52:22.1042647Z 2025-05-07T19:52:22.1042861Z Library Dependencies: 2025-05-07T19:52:22.1043090Z torch 2025-05-07T19:52:22.1043303Z torch_library 2025-05-07T19:52:22.1043747Z /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/lib/libc10.so 2025-05-07T19:52:22.1044382Z /github/home/miniconda/envs/build_binary/lib/libnvrtc.so 2025-05-07T19:52:22.1045030Z /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/lib/libc10_cuda.so 2025-05-07T19:52:22.1045943Z /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/lib/libnccl.so.2 2025-05-07T19:52:22.1046601Z /github/home/miniconda/envs/build_binary/lib/stubs/libcuda.so 2025-05-07T19:52:22.1046973Z asmjit 2025-05-07T19:52:22.1047191Z fbgemm 2025-05-07T19:52:22.1047377Z fbgemm_gpu_tbe_cache 2025-05-07T19:52:22.1047616Z fbgemm_gpu_config 2025-05-07T19:52:22.1047943Z /github/home/miniconda/envs/build_binary/lib/stubs/libnvidia-ml.so 2025-05-07T19:52:22.1048345Z 2025-05-07T19:52:22.1048521Z Output Library: 
2025-05-07T19:52:22.1048754Z fbgemm_gpu_tbe_inference 2025-05-07T19:52:22.1048992Z 2025-05-07T19:52:22.1049173Z Destination Directory: 2025-05-07T19:52:22.1049410Z fbgemm_gpu 2025-05-07T19:52:22.1049624Z ================================================================================ 2025-05-07T19:52:22.1049840Z 2025-05-07T19:52:22.3699446Z 2025-05-07T19:52:22.3699667Z 2025-05-07T19:52:22.3700187Z ================================================================================ 2025-05-07T19:52:22.3701440Z GPU CPP Library Target: fbgemm_gpu_config (SHARED) 2025-05-07T19:52:22.3702720Z 2025-05-07T19:52:22.3703238Z CPU_SRCS: 2025-05-07T19:52:22.3703838Z src/config/feature_gates.cpp 2025-05-07T19:52:22.3704573Z 2025-05-07T19:52:22.3705067Z GPU_SRCS: 2025-05-07T19:52:22.3705405Z 2025-05-07T19:52:22.3705604Z 2025-05-07T19:52:22.3706104Z CUDA_SPECIFIC_SRCS: 2025-05-07T19:52:22.3706492Z 2025-05-07T19:52:22.3706580Z 2025-05-07T19:52:22.3706792Z HIP_SPECIFIC_SRCS: 2025-05-07T19:52:22.3706942Z 2025-05-07T19:52:22.3707021Z 2025-05-07T19:52:22.3707216Z OTHER_SRCS: 2025-05-07T19:52:22.3707339Z 2025-05-07T19:52:22.3707426Z 2025-05-07T19:52:22.3707636Z CC_FLAGS: 2025-05-07T19:52:22.3707748Z 2025-05-07T19:52:22.3707829Z 2025-05-07T19:52:22.3708039Z NVCC_FLAGS: 2025-05-07T19:52:22.3708157Z 2025-05-07T19:52:22.3708244Z 2025-05-07T19:52:22.3708441Z HIPCC_FLAGS: 2025-05-07T19:52:22.3708568Z 2025-05-07T19:52:22.3708664Z 2025-05-07T19:52:22.3708842Z INCLUDE_DIRS: 2025-05-07T19:52:22.3709095Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/../include 2025-05-07T19:52:22.3709412Z /__w/FBGEMM/FBGEMM/fbgemm_gpu 2025-05-07T19:52:22.3709707Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include 2025-05-07T19:52:22.3710021Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/../include 2025-05-07T19:52:22.3710555Z /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include 2025-05-07T19:52:22.3711376Z /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include 
2025-05-07T19:52:22.3713983Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src 2025-05-07T19:52:22.3714415Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include 2025-05-07T19:52:22.3714855Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include 2025-05-07T19:52:22.3715358Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include 2025-05-07T19:52:22.3715990Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include 2025-05-07T19:52:22.3716478Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include 2025-05-07T19:52:22.3717043Z /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include 2025-05-07T19:52:22.3717582Z 2025-05-07T19:52:22.3717775Z Selected Source Files: 2025-05-07T19:52:22.3718042Z src/config/feature_gates.cpp 2025-05-07T19:52:22.3718314Z 2025-05-07T19:52:22.3718505Z HIPified Source Files: 2025-05-07T19:52:22.3718671Z 2025-05-07T19:52:22.3718772Z 2025-05-07T19:52:22.3718969Z Library Dependencies: 2025-05-07T19:52:22.3719208Z torch 2025-05-07T19:52:22.3719400Z torch_library 2025-05-07T19:52:22.3719861Z /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/lib/libc10.so 2025-05-07T19:52:22.3720451Z /github/home/miniconda/envs/build_binary/lib/libnvrtc.so 2025-05-07T19:52:22.3721080Z /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/lib/libc10_cuda.so 2025-05-07T19:52:22.3721921Z /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/lib/libnccl.so.2 2025-05-07T19:52:22.3722598Z /github/home/miniconda/envs/build_binary/lib/stubs/libcuda.so 2025-05-07T19:52:22.3723150Z /github/home/miniconda/envs/build_binary/lib/stubs/libnvidia-ml.so 2025-05-07T19:52:22.3723557Z 2025-05-07T19:52:22.3723775Z Output Library: 2025-05-07T19:52:22.3724250Z fbgemm_gpu_config 2025-05-07T19:52:22.3724463Z 2025-05-07T19:52:22.3724641Z Destination Directory: 2025-05-07T19:52:22.3724884Z fbgemm_gpu 2025-05-07T19:52:22.3725295Z 
================================================================================ 2025-05-07T19:52:22.3725554Z 2025-05-07T19:52:22.3725629Z 2025-05-07T19:52:22.3725632Z 2025-05-07T19:52:22.3725768Z ================================================================================ 2025-05-07T19:52:22.3726160Z GPU CPP Library Target: fbgemm_gpu_tbe_utils (SHARED) 2025-05-07T19:52:22.3726691Z 2025-05-07T19:52:22.3726882Z CPU_SRCS: 2025-05-07T19:52:22.3727190Z src/split_embeddings_utils/split_embeddings_utils_cpu.cpp 2025-05-07T19:52:22.3727647Z src/split_embeddings_utils/split_embeddings_utils_meta.cpp 2025-05-07T19:52:22.3728042Z 2025-05-07T19:52:22.3728256Z GPU_SRCS: 2025-05-07T19:52:22.3728524Z src/split_embeddings_utils/split_embeddings_utils.cpp 2025-05-07T19:52:22.3728955Z src/split_embeddings_utils/generate_vbe_metadata.cu 2025-05-07T19:52:22.3729345Z src/split_embeddings_utils/get_infos_metadata.cu 2025-05-07T19:52:22.3729751Z src/split_embeddings_utils/radix_sort_pairs.cu 2025-05-07T19:52:22.3730152Z src/split_embeddings_utils/transpose_embedding_input.cu 2025-05-07T19:52:22.3730526Z 2025-05-07T19:52:22.3730725Z CUDA_SPECIFIC_SRCS: 2025-05-07T19:52:22.3730880Z 2025-05-07T19:52:22.3730960Z 2025-05-07T19:52:22.3731168Z HIP_SPECIFIC_SRCS: 2025-05-07T19:52:22.3731303Z 2025-05-07T19:52:22.3731394Z 2025-05-07T19:52:22.3731593Z OTHER_SRCS: 2025-05-07T19:52:22.3731757Z 2025-05-07T19:52:22.3731845Z 2025-05-07T19:52:22.3732056Z CC_FLAGS: 2025-05-07T19:52:22.3732210Z 2025-05-07T19:52:22.3732296Z 2025-05-07T19:52:22.3732501Z NVCC_FLAGS: 2025-05-07T19:52:22.3732653Z 2025-05-07T19:52:22.3732738Z 2025-05-07T19:52:22.3732964Z HIPCC_FLAGS: 2025-05-07T19:52:22.3733090Z 2025-05-07T19:52:22.3733178Z 2025-05-07T19:52:22.3733403Z INCLUDE_DIRS: 2025-05-07T19:52:22.3733657Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/../include 2025-05-07T19:52:22.3734008Z /__w/FBGEMM/FBGEMM/fbgemm_gpu 2025-05-07T19:52:22.3734309Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include 2025-05-07T19:52:22.3734652Z 
/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include 2025-05-07T19:52:22.3735375Z /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include 2025-05-07T19:52:22.3736197Z /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include 2025-05-07T19:52:22.3736877Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src 2025-05-07T19:52:22.3737487Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include 2025-05-07T19:52:22.3738010Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include 2025-05-07T19:52:22.3738497Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include 2025-05-07T19:52:22.3739049Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include 2025-05-07T19:52:22.3739519Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include 2025-05-07T19:52:22.3740112Z /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include 2025-05-07T19:52:22.3740651Z 2025-05-07T19:52:22.3740850Z Selected Source Files: 2025-05-07T19:52:22.3741214Z src/split_embeddings_utils/split_embeddings_utils_cpu.cpp 2025-05-07T19:52:22.3741693Z src/split_embeddings_utils/split_embeddings_utils_meta.cpp 2025-05-07T19:52:22.3742163Z src/split_embeddings_utils/split_embeddings_utils.cpp 2025-05-07T19:52:22.3742584Z src/split_embeddings_utils/generate_vbe_metadata.cu 2025-05-07T19:52:22.3743000Z src/split_embeddings_utils/get_infos_metadata.cu 2025-05-07T19:52:22.3743510Z src/split_embeddings_utils/radix_sort_pairs.cu 2025-05-07T19:52:22.3743933Z src/split_embeddings_utils/transpose_embedding_input.cu 2025-05-07T19:52:22.3744307Z 2025-05-07T19:52:22.3744513Z HIPified Source Files: 2025-05-07T19:52:22.3744671Z 2025-05-07T19:52:22.3744770Z 2025-05-07T19:52:22.3744971Z Library Dependencies: 2025-05-07T19:52:22.3745212Z torch 2025-05-07T19:52:22.3745407Z torch_library 2025-05-07T19:52:22.3745872Z /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/lib/libc10.so 
2025-05-07T19:52:22.3746461Z /github/home/miniconda/envs/build_binary/lib/libnvrtc.so 2025-05-07T19:52:22.3747100Z /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/lib/libc10_cuda.so 2025-05-07T19:52:22.3747915Z /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/lib/libnccl.so.2 2025-05-07T19:52:22.3748581Z /github/home/miniconda/envs/build_binary/lib/stubs/libcuda.so 2025-05-07T19:52:22.3749113Z /github/home/miniconda/envs/build_binary/lib/stubs/libnvidia-ml.so 2025-05-07T19:52:22.3749529Z 2025-05-07T19:52:22.3749747Z Output Library: 2025-05-07T19:52:22.3749987Z fbgemm_gpu_tbe_utils 2025-05-07T19:52:22.3750235Z 2025-05-07T19:52:22.3750437Z Destination Directory: 2025-05-07T19:52:22.3750704Z fbgemm_gpu 2025-05-07T19:52:22.3750938Z ================================================================================ 2025-05-07T19:52:22.3751198Z 2025-05-07T19:52:22.3751202Z 2025-05-07T19:52:22.3751206Z 2025-05-07T19:52:22.3751324Z ================================================================================ 2025-05-07T19:52:22.3751767Z GPU CPP Library Target: fbgemm_gpu_sparse_async_cumsum (SHARED) 2025-05-07T19:52:22.3752151Z 2025-05-07T19:52:22.3752361Z CPU_SRCS: 2025-05-07T19:52:22.3752593Z src/sparse_ops/sparse_async_cumsum.cpp 2025-05-07T19:52:22.3753014Z 2025-05-07T19:52:22.3753205Z GPU_SRCS: 2025-05-07T19:52:22.3753649Z src/sparse_ops/sparse_async_cumsum.cu 2025-05-07T19:52:22.3754032Z 2025-05-07T19:52:22.3754265Z CUDA_SPECIFIC_SRCS: 2025-05-07T19:52:22.3754414Z 2025-05-07T19:52:22.3754525Z 2025-05-07T19:52:22.3754731Z HIP_SPECIFIC_SRCS: 2025-05-07T19:52:22.3754876Z 2025-05-07T19:52:22.3754978Z 2025-05-07T19:52:22.3755169Z OTHER_SRCS: 2025-05-07T19:52:22.3755313Z 2025-05-07T19:52:22.3755394Z 2025-05-07T19:52:22.3755580Z CC_FLAGS: 2025-05-07T19:52:22.3755720Z 2025-05-07T19:52:22.3755804Z 2025-05-07T19:52:22.3755996Z NVCC_FLAGS: 2025-05-07T19:52:22.3756142Z 2025-05-07T19:52:22.3756225Z 
2025-05-07T19:52:22.3756414Z HIPCC_FLAGS: 2025-05-07T19:52:22.3756576Z 2025-05-07T19:52:22.3756654Z 2025-05-07T19:52:22.3756881Z INCLUDE_DIRS: 2025-05-07T19:52:22.3757224Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/../include 2025-05-07T19:52:22.3757584Z /__w/FBGEMM/FBGEMM/fbgemm_gpu 2025-05-07T19:52:22.3757877Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include 2025-05-07T19:52:22.3758229Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/../include 2025-05-07T19:52:22.3758734Z /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include 2025-05-07T19:52:22.3759636Z /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include 2025-05-07T19:52:22.3760336Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src 2025-05-07T19:52:22.3760773Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include 2025-05-07T19:52:22.3761250Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include 2025-05-07T19:52:22.3761762Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include 2025-05-07T19:52:22.3762302Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include 2025-05-07T19:52:22.3762804Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include 2025-05-07T19:52:22.3763398Z /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include 2025-05-07T19:52:22.3763942Z 2025-05-07T19:52:22.3764151Z Selected Source Files: 2025-05-07T19:52:22.3764453Z src/sparse_ops/sparse_async_cumsum.cpp 2025-05-07T19:52:22.3764806Z src/sparse_ops/sparse_async_cumsum.cu 2025-05-07T19:52:22.3765108Z 2025-05-07T19:52:22.3765338Z HIPified Source Files: 2025-05-07T19:52:22.3765499Z 2025-05-07T19:52:22.3765582Z 2025-05-07T19:52:22.3765821Z Library Dependencies: 2025-05-07T19:52:22.3766073Z torch 2025-05-07T19:52:22.3766307Z torch_library 2025-05-07T19:52:22.3766756Z /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/lib/libc10.so 2025-05-07T19:52:22.3767393Z 
/github/home/miniconda/envs/build_binary/lib/libnvrtc.so 2025-05-07T19:52:22.3768016Z /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/lib/libc10_cuda.so 2025-05-07T19:52:22.3768968Z /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/lib/libnccl.so.2 2025-05-07T19:52:22.3769655Z /github/home/miniconda/envs/build_binary/lib/stubs/libcuda.so 2025-05-07T19:52:22.3770054Z fbgemm_gpu_tbe_utils 2025-05-07T19:52:22.3770454Z /github/home/miniconda/envs/build_binary/lib/stubs/libnvidia-ml.so 2025-05-07T19:52:22.3770866Z 2025-05-07T19:52:22.3771099Z Output Library: 2025-05-07T19:52:22.3771344Z fbgemm_gpu_sparse_async_cumsum 2025-05-07T19:52:22.3771642Z 2025-05-07T19:52:22.3771842Z Destination Directory: 2025-05-07T19:52:22.3772110Z fbgemm_gpu 2025-05-07T19:52:22.3772369Z ================================================================================ 2025-05-07T19:52:22.3772597Z 2025-05-07T19:52:22.3772601Z 2025-05-07T19:52:22.3772605Z 2025-05-07T19:52:22.3772720Z ================================================================================ 2025-05-07T19:52:22.3773117Z GPU CPP Library Target: fbgemm_gpu_tbe_common (SHARED) 2025-05-07T19:52:22.3773450Z 2025-05-07T19:52:22.3773671Z CPU_SRCS: 2025-05-07T19:52:22.3773937Z codegen/utils/embedding_bounds_check_host_cpu.cpp 2025-05-07T19:52:22.3774385Z codegen/training/forward/embedding_forward_split_cpu.cpp 2025-05-07T19:52:22.3774833Z codegen/training/pt2/pt2_autograd_utils.cpp 2025-05-07T19:52:22.3775258Z 2025-05-07T19:52:22.3775447Z GPU_SRCS: 2025-05-07T19:52:22.3775714Z codegen/utils/embedding_bounds_check_v1.cu 2025-05-07T19:52:22.3776177Z codegen/utils/embedding_bounds_check_v2.cu 2025-05-07T19:52:22.3776544Z codegen/utils/embedding_bounds_check_host.cpp 2025-05-07T19:52:22.3776858Z 2025-05-07T19:52:22.3777078Z CUDA_SPECIFIC_SRCS: 2025-05-07T19:52:22.3777404Z 2025-05-07T19:52:22.3777483Z 2025-05-07T19:52:22.3777718Z HIP_SPECIFIC_SRCS: 2025-05-07T19:52:22.3777866Z 
2025-05-07T19:52:22.3777973Z 2025-05-07T19:52:22.3778166Z OTHER_SRCS: 2025-05-07T19:52:22.3778291Z 2025-05-07T19:52:22.3778398Z 2025-05-07T19:52:22.3778594Z CC_FLAGS: 2025-05-07T19:52:22.3778713Z 2025-05-07T19:52:22.3778818Z 2025-05-07T19:52:22.3779007Z NVCC_FLAGS: 2025-05-07T19:52:22.3779380Z --expt-relaxed-constexpr 2025-05-07T19:52:22.3779667Z -D__CUDA_NO_HALF_OPERATORS__ 2025-05-07T19:52:22.3780180Z -D__CUDA_NO_BFLOAT16_CONVERSIONS__ 2025-05-07T19:52:22.3780501Z -D__CUDA_NO_HALF2_OPERATORS__ 2025-05-07T19:52:22.3780802Z 2025-05-07T19:52:22.3781024Z HIPCC_FLAGS: 2025-05-07T19:52:22.3781155Z 2025-05-07T19:52:22.3781241Z 2025-05-07T19:52:22.3781466Z INCLUDE_DIRS: 2025-05-07T19:52:22.3781774Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/../include 2025-05-07T19:52:22.3782127Z /__w/FBGEMM/FBGEMM/fbgemm_gpu 2025-05-07T19:52:22.3782429Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include 2025-05-07T19:52:22.3782775Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/../include 2025-05-07T19:52:22.3783286Z /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include 2025-05-07T19:52:22.3784121Z /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include 2025-05-07T19:52:22.3784819Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src 2025-05-07T19:52:22.3785259Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include 2025-05-07T19:52:22.3785744Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include 2025-05-07T19:52:22.3786236Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include 2025-05-07T19:52:22.3786803Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include 2025-05-07T19:52:22.3787289Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include 2025-05-07T19:52:22.3787893Z /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include 2025-05-07T19:52:22.3788438Z 2025-05-07T19:52:22.3788648Z Selected Source Files: 2025-05-07T19:52:22.3788961Z 
codegen/utils/embedding_bounds_check_host_cpu.cpp 2025-05-07T19:52:22.3789386Z codegen/training/forward/embedding_forward_split_cpu.cpp 2025-05-07T19:52:22.3789811Z codegen/training/pt2/pt2_autograd_utils.cpp 2025-05-07T19:52:22.3790166Z codegen/utils/embedding_bounds_check_host.cpp 2025-05-07T19:52:22.3790553Z codegen/utils/embedding_bounds_check_v1.cu 2025-05-07T19:52:22.3790918Z codegen/utils/embedding_bounds_check_v2.cu 2025-05-07T19:52:22.3791242Z 2025-05-07T19:52:22.3791449Z HIPified Source Files: 2025-05-07T19:52:22.3791631Z 2025-05-07T19:52:22.3791715Z 2025-05-07T19:52:22.3791947Z Library Dependencies: 2025-05-07T19:52:22.3792191Z torch 2025-05-07T19:52:22.3792421Z torch_library 2025-05-07T19:52:22.3792988Z /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/lib/libc10.so 2025-05-07T19:52:22.3793627Z /github/home/miniconda/envs/build_binary/lib/libnvrtc.so 2025-05-07T19:52:22.3794251Z /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/lib/libc10_cuda.so 2025-05-07T19:52:22.3795075Z /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/lib/libnccl.so.2 2025-05-07T19:52:22.3795776Z /github/home/miniconda/envs/build_binary/lib/stubs/libcuda.so 2025-05-07T19:52:22.3796161Z fbgemm 2025-05-07T19:52:22.3796381Z fbgemm_gpu_config 2025-05-07T19:52:22.3796749Z /github/home/miniconda/envs/build_binary/lib/stubs/libnvidia-ml.so 2025-05-07T19:52:22.3797176Z 2025-05-07T19:52:22.3797373Z Output Library: 2025-05-07T19:52:22.3797626Z fbgemm_gpu_tbe_common 2025-05-07T19:52:22.3797867Z 2025-05-07T19:52:22.3798099Z Destination Directory: 2025-05-07T19:52:22.3798350Z fbgemm_gpu 2025-05-07T19:52:22.3798618Z ================================================================================ 2025-05-07T19:52:22.3798855Z 2025-05-07T19:52:22.3798859Z 2025-05-07T19:52:22.3798863Z 2025-05-07T19:52:22.3799005Z ================================================================================ 
2025-05-07T19:52:22.3799419Z GPU CPP Library Target: fbgemm_gpu_tbe_optimizers (SHARED) 2025-05-07T19:52:22.3799807Z 2025-05-07T19:52:22.3800000Z CPU_SRCS: 2025-05-07T19:52:22.3800137Z 2025-05-07T19:52:22.3800223Z 2025-05-07T19:52:22.3800403Z GPU_SRCS: 2025-05-07T19:52:22.3800705Z gen_embedding_optimizer_rowwise_adagrad_split.cpp 2025-05-07T19:52:22.3801107Z gen_embedding_optimizer_rowwise_adagrad_split_cuda.cu 2025-05-07T19:52:22.3801654Z gen_embedding_optimizer_rowwise_adagrad_split_kernel.cu 2025-05-07T19:52:22.3802209Z 2025-05-07T19:52:22.3802410Z CUDA_SPECIFIC_SRCS: 2025-05-07T19:52:22.3802561Z 2025-05-07T19:52:22.3802685Z 2025-05-07T19:52:22.3802877Z HIP_SPECIFIC_SRCS: 2025-05-07T19:52:22.3803060Z 2025-05-07T19:52:22.3803150Z 2025-05-07T19:52:22.3803347Z OTHER_SRCS: 2025-05-07T19:52:22.3803497Z 2025-05-07T19:52:22.3803706Z 2025-05-07T19:52:22.3803897Z CC_FLAGS: 2025-05-07T19:52:22.3804048Z 2025-05-07T19:52:22.3804121Z 2025-05-07T19:52:22.3804310Z NVCC_FLAGS: 2025-05-07T19:52:22.3804561Z --expt-relaxed-constexpr 2025-05-07T19:52:22.3804867Z -D__CUDA_NO_HALF_OPERATORS__ 2025-05-07T19:52:22.3805150Z -D__CUDA_NO_BFLOAT16_CONVERSIONS__ 2025-05-07T19:52:22.3805483Z -D__CUDA_NO_HALF2_OPERATORS__ 2025-05-07T19:52:22.3805750Z 2025-05-07T19:52:22.3805963Z HIPCC_FLAGS: 2025-05-07T19:52:22.3806085Z 2025-05-07T19:52:22.3806164Z 2025-05-07T19:52:22.3806373Z INCLUDE_DIRS: 2025-05-07T19:52:22.3806617Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/../include 2025-05-07T19:52:22.3806968Z /__w/FBGEMM/FBGEMM/fbgemm_gpu 2025-05-07T19:52:22.3807262Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include 2025-05-07T19:52:22.3807601Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/../include 2025-05-07T19:52:22.3808123Z /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include 2025-05-07T19:52:22.3808937Z /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include 2025-05-07T19:52:22.3809622Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src 
2025-05-07T19:52:22.3810049Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include 2025-05-07T19:52:22.3810511Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include 2025-05-07T19:52:22.3811008Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include 2025-05-07T19:52:22.3811562Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include 2025-05-07T19:52:22.3812059Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include 2025-05-07T19:52:22.3812642Z /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include 2025-05-07T19:52:22.3813178Z 2025-05-07T19:52:22.3813386Z Selected Source Files: 2025-05-07T19:52:22.3813713Z gen_embedding_optimizer_rowwise_adagrad_split.cpp 2025-05-07T19:52:22.3814126Z gen_embedding_optimizer_rowwise_adagrad_split_cuda.cu 2025-05-07T19:52:22.3814579Z gen_embedding_optimizer_rowwise_adagrad_split_kernel.cu 2025-05-07T19:52:22.3815054Z 2025-05-07T19:52:22.3815275Z HIPified Source Files: 2025-05-07T19:52:22.3815434Z 2025-05-07T19:52:22.3815545Z 2025-05-07T19:52:22.3815744Z Library Dependencies: 2025-05-07T19:52:22.3816006Z torch 2025-05-07T19:52:22.3816205Z torch_library 2025-05-07T19:52:22.3816663Z /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/lib/libc10.so 2025-05-07T19:52:22.3817257Z /github/home/miniconda/envs/build_binary/lib/libnvrtc.so 2025-05-07T19:52:22.3817882Z /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/lib/libc10_cuda.so 2025-05-07T19:52:22.3818678Z /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/lib/libnccl.so.2 2025-05-07T19:52:22.3819356Z /github/home/miniconda/envs/build_binary/lib/stubs/libcuda.so 2025-05-07T19:52:22.3819894Z /github/home/miniconda/envs/build_binary/lib/stubs/libnvidia-ml.so 2025-05-07T19:52:22.3820300Z 2025-05-07T19:52:22.3820522Z Output Library: 2025-05-07T19:52:22.3820773Z fbgemm_gpu_tbe_optimizers 2025-05-07T19:52:22.3821039Z 2025-05-07T19:52:22.3821242Z 
Destination Directory: 2025-05-07T19:52:22.3821507Z fbgemm_gpu 2025-05-07T19:52:22.3821750Z ================================================================================ 2025-05-07T19:52:22.3822010Z 2025-05-07T19:52:22.3822014Z 2025-05-07T19:52:22.3822018Z 2025-05-07T19:52:22.3822137Z ================================================================================ 2025-05-07T19:52:22.3822582Z GPU CPP Library Target: fbgemm_gpu_tbe_training_forward (SHARED) 2025-05-07T19:52:22.3822971Z 2025-05-07T19:52:22.3823486Z CPU_SRCS: 2025-05-07T19:52:22.3823762Z gen_embedding_forward_split_pt2_cpu_wrapper.cpp 2025-05-07T19:52:22.3824127Z 2025-05-07T19:52:22.3824316Z GPU_SRCS: 2025-05-07T19:52:22.3824594Z gen_embedding_forward_split_weighted_kernel.cu 2025-05-07T19:52:22.3824982Z gen_embedding_forward_dense_weighted_kernel.cu 2025-05-07T19:52:22.3825378Z gen_embedding_forward_ssd_weighted_kernel.cu 2025-05-07T19:52:22.3825874Z gen_embedding_forward_split_unweighted_nobag_kernel.cu 2025-05-07T19:52:22.3826307Z gen_embedding_forward_dense_unweighted_nobag_kernel.cu 2025-05-07T19:52:22.3826763Z gen_embedding_forward_ssd_unweighted_nobag_kernel.cu 2025-05-07T19:52:22.3827176Z gen_embedding_forward_split_unweighted_kernel.cu 2025-05-07T19:52:22.3827583Z gen_embedding_forward_dense_unweighted_kernel.cu 2025-05-07T19:52:22.3827956Z gen_embedding_forward_ssd_unweighted_kernel.cu 2025-05-07T19:52:22.3828378Z gen_embedding_forward_split_weighted_codegen_cuda.cu 2025-05-07T19:52:22.3828802Z gen_embedding_forward_split_weighted_codegen_meta.cpp 2025-05-07T19:52:22.3829269Z gen_embedding_forward_split_unweighted_codegen_cuda.cu 2025-05-07T19:52:22.3829758Z gen_embedding_forward_split_unweighted_codegen_meta.cpp 2025-05-07T19:52:22.3830350Z gen_embedding_forward_dense_weighted_codegen_cuda.cu 2025-05-07T19:52:22.3830799Z gen_embedding_forward_dense_weighted_codegen_meta.cpp 2025-05-07T19:52:22.3831228Z gen_embedding_forward_dense_unweighted_codegen_cuda.cu 2025-05-07T19:52:22.3831690Z 
gen_embedding_forward_dense_unweighted_codegen_meta.cpp 2025-05-07T19:52:22.3832112Z gen_embedding_forward_ssd_weighted_codegen_cuda.cu 2025-05-07T19:52:22.3832545Z gen_embedding_forward_ssd_weighted_codegen_meta.cpp 2025-05-07T19:52:22.3833089Z gen_embedding_forward_ssd_unweighted_codegen_cuda.cu 2025-05-07T19:52:22.3833538Z gen_embedding_forward_ssd_unweighted_codegen_meta.cpp 2025-05-07T19:52:22.3834013Z gen_embedding_forward_split_weighted_vbe_codegen_cuda.cu 2025-05-07T19:52:22.3834477Z gen_embedding_forward_split_weighted_vbe_codegen_meta.cpp 2025-05-07T19:52:22.3834945Z gen_embedding_forward_split_weighted_vbe_kernel.cu 2025-05-07T19:52:22.3835349Z gen_embedding_forward_split_weighted_v2_kernel.cu 2025-05-07T19:52:22.3835802Z gen_embedding_forward_split_weighted_gwd_codegen_cuda.cu 2025-05-07T19:52:22.3836488Z gen_embedding_forward_split_weighted_vbe_gwd_codegen_cuda.cu 2025-05-07T19:52:22.3836947Z gen_embedding_forward_split_weighted_gwd_kernel.cu 2025-05-07T19:52:22.3837400Z gen_embedding_forward_dense_weighted_vbe_codegen_cuda.cu 2025-05-07T19:52:22.3837831Z gen_embedding_forward_dense_weighted_vbe_kernel.cu 2025-05-07T19:52:22.3838285Z gen_embedding_forward_split_weighted_vbe_gwd_kernel.cu 2025-05-07T19:52:22.3838720Z gen_embedding_forward_ssd_weighted_vbe_codegen_cuda.cu 2025-05-07T19:52:22.3839182Z gen_embedding_forward_ssd_weighted_vbe_codegen_meta.cpp 2025-05-07T19:52:22.3839599Z gen_embedding_forward_ssd_weighted_vbe_kernel.cu 2025-05-07T19:52:22.3840060Z gen_embedding_forward_split_unweighted_vbe_codegen_cuda.cu 2025-05-07T19:52:22.3840563Z gen_embedding_forward_split_unweighted_vbe_codegen_meta.cpp 2025-05-07T19:52:22.3841020Z gen_embedding_forward_split_unweighted_vbe_kernel.cu 2025-05-07T19:52:22.3841474Z gen_embedding_forward_split_unweighted_v2_kernel.cu 2025-05-07T19:52:22.3841920Z gen_embedding_forward_split_unweighted_gwd_codegen_cuda.cu 2025-05-07T19:52:22.3842438Z gen_embedding_forward_split_unweighted_vbe_gwd_codegen_cuda.cu 
2025-05-07T19:52:22.3842905Z gen_embedding_forward_split_unweighted_gwd_kernel.cu 2025-05-07T19:52:22.3843383Z gen_embedding_forward_dense_unweighted_vbe_codegen_cuda.cu 2025-05-07T19:52:22.3843864Z gen_embedding_forward_dense_unweighted_vbe_kernel.cu 2025-05-07T19:52:22.3844308Z gen_embedding_forward_split_unweighted_vbe_gwd_kernel.cu 2025-05-07T19:52:22.3844787Z gen_embedding_forward_ssd_unweighted_vbe_codegen_cuda.cu 2025-05-07T19:52:22.3845233Z gen_embedding_forward_ssd_unweighted_vbe_codegen_meta.cpp 2025-05-07T19:52:22.3845684Z gen_embedding_forward_ssd_unweighted_vbe_kernel.cu 2025-05-07T19:52:22.3846203Z gen_embedding_forward_split_unweighted_nobag_kernel_small.cu 2025-05-07T19:52:22.3846714Z gen_embedding_forward_dense_unweighted_nobag_kernel_small.cu 2025-05-07T19:52:22.3847187Z gen_embedding_forward_ssd_unweighted_nobag_kernel_small.cu 2025-05-07T19:52:22.3847641Z gen_embedding_forward_split_pt2_cuda_wrapper.cpp 2025-05-07T19:52:22.3848172Z gen_embedding_forward_ssd_pt2_cuda_wrapper.cpp 2025-05-07T19:52:22.3848576Z 2025-05-07T19:52:22.3848796Z CUDA_SPECIFIC_SRCS: 2025-05-07T19:52:22.3848942Z 2025-05-07T19:52:22.3849021Z 2025-05-07T19:52:22.3849250Z HIP_SPECIFIC_SRCS: 2025-05-07T19:52:22.3849389Z 2025-05-07T19:52:22.3849476Z 2025-05-07T19:52:22.3849695Z OTHER_SRCS: 2025-05-07T19:52:22.3849816Z 2025-05-07T19:52:22.3849898Z 2025-05-07T19:52:22.3850114Z CC_FLAGS: 2025-05-07T19:52:22.3850233Z 2025-05-07T19:52:22.3850323Z 2025-05-07T19:52:22.3850677Z NVCC_FLAGS: 2025-05-07T19:52:22.3850922Z --expt-relaxed-constexpr 2025-05-07T19:52:22.3851199Z -D__CUDA_NO_HALF_OPERATORS__ 2025-05-07T19:52:22.3851526Z -D__CUDA_NO_BFLOAT16_CONVERSIONS__ 2025-05-07T19:52:22.3851834Z -D__CUDA_NO_HALF2_OPERATORS__ 2025-05-07T19:52:22.3852115Z 2025-05-07T19:52:22.3852309Z HIPCC_FLAGS: 2025-05-07T19:52:22.3852463Z 2025-05-07T19:52:22.3852549Z 2025-05-07T19:52:22.3852741Z INCLUDE_DIRS: 2025-05-07T19:52:22.3853009Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/../include 2025-05-07T19:52:22.3853336Z 
/__w/FBGEMM/FBGEMM/fbgemm_gpu 2025-05-07T19:52:22.3853650Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include 2025-05-07T19:52:22.3853992Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/../include 2025-05-07T19:52:22.3854503Z /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include 2025-05-07T19:52:22.3855336Z /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include 2025-05-07T19:52:22.3856005Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src 2025-05-07T19:52:22.3856455Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include 2025-05-07T19:52:22.3856893Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include 2025-05-07T19:52:22.3857407Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include 2025-05-07T19:52:22.3857966Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include 2025-05-07T19:52:22.3858439Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include 2025-05-07T19:52:22.3859032Z /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include 2025-05-07T19:52:22.3859554Z 2025-05-07T19:52:22.3859786Z Selected Source Files: 2025-05-07T19:52:22.3860083Z gen_embedding_forward_split_pt2_cpu_wrapper.cpp 2025-05-07T19:52:22.3860513Z gen_embedding_forward_split_weighted_codegen_meta.cpp 2025-05-07T19:52:22.3860962Z gen_embedding_forward_split_unweighted_codegen_meta.cpp 2025-05-07T19:52:22.3861389Z gen_embedding_forward_dense_weighted_codegen_meta.cpp 2025-05-07T19:52:22.3861953Z gen_embedding_forward_dense_unweighted_codegen_meta.cpp 2025-05-07T19:52:22.3862367Z gen_embedding_forward_ssd_weighted_codegen_meta.cpp 2025-05-07T19:52:22.3862905Z gen_embedding_forward_ssd_unweighted_codegen_meta.cpp 2025-05-07T19:52:22.3863310Z gen_embedding_forward_split_weighted_vbe_codegen_meta.cpp 2025-05-07T19:52:22.3863746Z gen_embedding_forward_ssd_weighted_vbe_codegen_meta.cpp 2025-05-07T19:52:22.3864163Z gen_embedding_forward_split_unweighted_vbe_codegen_meta.cpp 
2025-05-07T19:52:22.3864612Z gen_embedding_forward_ssd_unweighted_vbe_codegen_meta.cpp 2025-05-07T19:52:22.3865029Z gen_embedding_forward_split_pt2_cuda_wrapper.cpp 2025-05-07T19:52:22.3865387Z gen_embedding_forward_ssd_pt2_cuda_wrapper.cpp 2025-05-07T19:52:22.3865755Z gen_embedding_forward_split_weighted_kernel.cu 2025-05-07T19:52:22.3866097Z gen_embedding_forward_dense_weighted_kernel.cu 2025-05-07T19:52:22.3866458Z gen_embedding_forward_ssd_weighted_kernel.cu 2025-05-07T19:52:22.3867014Z gen_embedding_forward_split_unweighted_nobag_kernel.cu 2025-05-07T19:52:22.3867441Z gen_embedding_forward_dense_unweighted_nobag_kernel.cu 2025-05-07T19:52:22.3867851Z gen_embedding_forward_ssd_unweighted_nobag_kernel.cu 2025-05-07T19:52:22.3868351Z gen_embedding_forward_split_unweighted_kernel.cu 2025-05-07T19:52:22.3868756Z gen_embedding_forward_dense_unweighted_kernel.cu 2025-05-07T19:52:22.3869126Z gen_embedding_forward_ssd_unweighted_kernel.cu 2025-05-07T19:52:22.3869535Z gen_embedding_forward_split_weighted_codegen_cuda.cu 2025-05-07T19:52:22.3869960Z gen_embedding_forward_split_unweighted_codegen_cuda.cu 2025-05-07T19:52:22.3870633Z gen_embedding_forward_dense_weighted_codegen_cuda.cu 2025-05-07T19:52:22.3871058Z gen_embedding_forward_dense_unweighted_codegen_cuda.cu 2025-05-07T19:52:22.3871490Z gen_embedding_forward_ssd_weighted_codegen_cuda.cu 2025-05-07T19:52:22.3871915Z gen_embedding_forward_ssd_unweighted_codegen_cuda.cu 2025-05-07T19:52:22.3872341Z gen_embedding_forward_split_weighted_vbe_codegen_cuda.cu 2025-05-07T19:52:22.3872793Z gen_embedding_forward_split_weighted_vbe_kernel.cu 2025-05-07T19:52:22.3873314Z gen_embedding_forward_split_weighted_v2_kernel.cu 2025-05-07T19:52:22.3873767Z gen_embedding_forward_split_weighted_gwd_codegen_cuda.cu 2025-05-07T19:52:22.3874317Z gen_embedding_forward_split_weighted_vbe_gwd_codegen_cuda.cu 2025-05-07T19:52:22.3874790Z gen_embedding_forward_split_weighted_gwd_kernel.cu 2025-05-07T19:52:22.3875210Z 
gen_embedding_forward_dense_weighted_vbe_codegen_cuda.cu 2025-05-07T19:52:22.3875656Z gen_embedding_forward_dense_weighted_vbe_kernel.cu 2025-05-07T19:52:22.3876100Z gen_embedding_forward_split_weighted_vbe_gwd_kernel.cu 2025-05-07T19:52:22.3876521Z gen_embedding_forward_ssd_weighted_vbe_codegen_cuda.cu 2025-05-07T19:52:22.3876953Z gen_embedding_forward_ssd_weighted_vbe_kernel.cu 2025-05-07T19:52:22.3877376Z gen_embedding_forward_split_unweighted_vbe_codegen_cuda.cu 2025-05-07T19:52:22.3877847Z gen_embedding_forward_split_unweighted_vbe_kernel.cu 2025-05-07T19:52:22.3878270Z gen_embedding_forward_split_unweighted_v2_kernel.cu 2025-05-07T19:52:22.3878743Z gen_embedding_forward_split_unweighted_gwd_codegen_cuda.cu 2025-05-07T19:52:22.3879245Z gen_embedding_forward_split_unweighted_vbe_gwd_codegen_cuda.cu 2025-05-07T19:52:22.3879713Z gen_embedding_forward_split_unweighted_gwd_kernel.cu 2025-05-07T19:52:22.3880185Z gen_embedding_forward_dense_unweighted_vbe_codegen_cuda.cu 2025-05-07T19:52:22.3880627Z gen_embedding_forward_dense_unweighted_vbe_kernel.cu 2025-05-07T19:52:22.3881087Z gen_embedding_forward_split_unweighted_vbe_gwd_kernel.cu 2025-05-07T19:52:22.3881533Z gen_embedding_forward_ssd_unweighted_vbe_codegen_cuda.cu 2025-05-07T19:52:22.3881991Z gen_embedding_forward_ssd_unweighted_vbe_kernel.cu 2025-05-07T19:52:22.3882436Z gen_embedding_forward_split_unweighted_nobag_kernel_small.cu 2025-05-07T19:52:22.3882950Z gen_embedding_forward_dense_unweighted_nobag_kernel_small.cu 2025-05-07T19:52:22.3883453Z gen_embedding_forward_ssd_unweighted_nobag_kernel_small.cu 2025-05-07T19:52:22.3883829Z 2025-05-07T19:52:22.3884071Z HIPified Source Files: 2025-05-07T19:52:22.3884236Z 2025-05-07T19:52:22.3884323Z 2025-05-07T19:52:22.3884565Z Library Dependencies: 2025-05-07T19:52:22.3884806Z torch 2025-05-07T19:52:22.3885033Z torch_library 2025-05-07T19:52:22.3885477Z /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/lib/libc10.so 
2025-05-07T19:52:22.3886220Z /github/home/miniconda/envs/build_binary/lib/libnvrtc.so 2025-05-07T19:52:22.3886853Z /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/lib/libc10_cuda.so 2025-05-07T19:52:22.3887653Z /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/lib/libnccl.so.2 2025-05-07T19:52:22.3888334Z /github/home/miniconda/envs/build_binary/lib/stubs/libcuda.so 2025-05-07T19:52:22.3888730Z fbgemm_gpu_tbe_common 2025-05-07T19:52:22.3889122Z /github/home/miniconda/envs/build_binary/lib/stubs/libnvidia-ml.so 2025-05-07T19:52:22.3889528Z 2025-05-07T19:52:22.3889747Z Output Library: 2025-05-07T19:52:22.3889994Z fbgemm_gpu_tbe_training_forward 2025-05-07T19:52:22.3890297Z 2025-05-07T19:52:22.3890532Z Destination Directory: 2025-05-07T19:52:22.3890779Z fbgemm_gpu 2025-05-07T19:52:22.3891149Z ================================================================================ 2025-05-07T19:52:22.3891385Z 2025-05-07T19:52:22.3891389Z 2025-05-07T19:52:22.3891393Z 2025-05-07T19:52:22.3891515Z ================================================================================ 2025-05-07T19:52:22.3891989Z GPU CPP Library Target: fbgemm_gpu_tbe_training_backward_pt2 (SHARED) 2025-05-07T19:52:22.3892457Z 2025-05-07T19:52:22.3892672Z CPU_SRCS: 2025-05-07T19:52:22.3892942Z gen_embedding_split_adagrad_pt2_autograd.cpp 2025-05-07T19:52:22.3893332Z gen_embedding_split_rowwise_adagrad_pt2_autograd.cpp 2025-05-07T19:52:22.3893736Z gen_embedding_split_sgd_pt2_autograd.cpp 2025-05-07T19:52:22.3894076Z gen_embedding_split_adam_pt2_autograd.cpp 2025-05-07T19:52:22.3894451Z gen_embedding_split_lamb_pt2_autograd.cpp 2025-05-07T19:52:22.3894812Z gen_embedding_split_lars_sgd_pt2_autograd.cpp 2025-05-07T19:52:22.3895243Z gen_embedding_split_partial_rowwise_adam_pt2_autograd.cpp 2025-05-07T19:52:22.3895900Z gen_embedding_split_partial_rowwise_lamb_pt2_autograd.cpp 2025-05-07T19:52:22.3896336Z gen_embedding_split_none_pt2_autograd.cpp 
2025-05-07T19:52:22.3896784Z gen_embedding_split_rowwise_adagrad_with_counter_pt2_autograd.cpp 2025-05-07T19:52:22.3897240Z gen_embedding_split_approx_sgd_pt2_autograd.cpp 2025-05-07T19:52:22.3897700Z gen_embedding_split_approx_rowwise_adagrad_pt2_autograd.cpp 2025-05-07T19:52:22.3898225Z gen_embedding_split_approx_rowwise_adagrad_with_counter_pt2_autograd.cpp 2025-05-07T19:52:22.3898840Z gen_embedding_split_approx_rowwise_adagrad_with_weight_decay_pt2_autograd.cpp 2025-05-07T19:52:22.3899428Z gen_embedding_split_rowwise_adagrad_with_weight_decay_pt2_autograd.cpp 2025-05-07T19:52:22.3899982Z gen_embedding_split_rowwise_weighted_adagrad_pt2_autograd.cpp 2025-05-07T19:52:22.3900459Z gen_embedding_ssd_rowwise_adagrad_pt2_autograd.cpp 2025-05-07T19:52:22.3900890Z gen_embedding_backward_split_adagrad_pt2_cpu_wrapper.cpp 2025-05-07T19:52:22.3901387Z gen_embedding_backward_split_rowwise_adagrad_pt2_cpu_wrapper.cpp 2025-05-07T19:52:22.3901862Z gen_embedding_backward_split_sgd_pt2_cpu_wrapper.cpp 2025-05-07T19:52:22.3902484Z gen_embedding_backward_split_adam_pt2_cpu_wrapper.cpp 2025-05-07T19:52:22.3902901Z gen_embedding_backward_split_lamb_pt2_cpu_wrapper.cpp 2025-05-07T19:52:22.3903350Z gen_embedding_backward_split_lars_sgd_pt2_cpu_wrapper.cpp 2025-05-07T19:52:22.3903850Z gen_embedding_backward_split_partial_rowwise_adam_pt2_cpu_wrapper.cpp 2025-05-07T19:52:22.3904438Z gen_embedding_backward_split_partial_rowwise_lamb_pt2_cpu_wrapper.cpp 2025-05-07T19:52:22.3904954Z gen_embedding_backward_split_none_pt2_cpu_wrapper.cpp 2025-05-07T19:52:22.3905473Z gen_embedding_backward_split_rowwise_adagrad_with_counter_pt2_cpu_wrapper.cpp 2025-05-07T19:52:22.3906052Z gen_embedding_backward_split_approx_sgd_pt2_cpu_wrapper.cpp 2025-05-07T19:52:22.3906570Z gen_embedding_backward_split_approx_rowwise_adagrad_pt2_cpu_wrapper.cpp 2025-05-07T19:52:22.3907217Z gen_embedding_backward_split_approx_rowwise_adagrad_with_counter_pt2_cpu_wrapper.cpp 2025-05-07T19:52:22.3907921Z 
gen_embedding_backward_split_approx_rowwise_adagrad_with_weight_decay_pt2_cpu_wrapper.cpp 2025-05-07T19:52:22.3908624Z gen_embedding_backward_split_rowwise_adagrad_with_weight_decay_pt2_cpu_wrapper.cpp 2025-05-07T19:52:22.3909266Z gen_embedding_backward_split_rowwise_weighted_adagrad_pt2_cpu_wrapper.cpp 2025-05-07T19:52:22.3909698Z 2025-05-07T19:52:22.3909918Z GPU_SRCS: 2025-05-07T19:52:22.3910212Z gen_embedding_backward_split_adagrad_pt2_cuda_wrapper.cpp 2025-05-07T19:52:22.3910716Z gen_embedding_backward_split_rowwise_adagrad_pt2_cuda_wrapper.cpp 2025-05-07T19:52:22.3911187Z gen_embedding_backward_split_sgd_pt2_cuda_wrapper.cpp 2025-05-07T19:52:22.3911631Z gen_embedding_backward_split_adam_pt2_cuda_wrapper.cpp 2025-05-07T19:52:22.3912084Z gen_embedding_backward_split_lamb_pt2_cuda_wrapper.cpp 2025-05-07T19:52:22.3912530Z gen_embedding_backward_split_lars_sgd_pt2_cuda_wrapper.cpp 2025-05-07T19:52:22.3913166Z gen_embedding_backward_split_partial_rowwise_adam_pt2_cuda_wrapper.cpp 2025-05-07T19:52:22.3913890Z gen_embedding_backward_split_partial_rowwise_lamb_pt2_cuda_wrapper.cpp 2025-05-07T19:52:22.3914413Z gen_embedding_backward_split_none_pt2_cuda_wrapper.cpp 2025-05-07T19:52:22.3914940Z gen_embedding_backward_split_rowwise_adagrad_with_counter_pt2_cuda_wrapper.cpp 2025-05-07T19:52:22.3915514Z gen_embedding_backward_split_approx_sgd_pt2_cuda_wrapper.cpp 2025-05-07T19:52:22.3916146Z gen_embedding_backward_split_approx_rowwise_adagrad_pt2_cuda_wrapper.cpp 2025-05-07T19:52:22.3916773Z gen_embedding_backward_split_approx_rowwise_adagrad_with_counter_pt2_cuda_wrapper.cpp 2025-05-07T19:52:22.3917515Z gen_embedding_backward_split_approx_rowwise_adagrad_with_weight_decay_pt2_cuda_wrapper.cpp 2025-05-07T19:52:22.3918211Z gen_embedding_backward_split_rowwise_adagrad_with_weight_decay_pt2_cuda_wrapper.cpp 2025-05-07T19:52:22.3918861Z gen_embedding_backward_split_rowwise_weighted_adagrad_pt2_cuda_wrapper.cpp 2025-05-07T19:52:22.3919440Z 
gen_embedding_backward_ssd_rowwise_adagrad_pt2_cuda_wrapper.cpp 2025-05-07T19:52:22.3919873Z 2025-05-07T19:52:22.3920117Z CUDA_SPECIFIC_SRCS: 2025-05-07T19:52:22.3920270Z 2025-05-07T19:52:22.3920357Z 2025-05-07T19:52:22.3920603Z HIP_SPECIFIC_SRCS: 2025-05-07T19:52:22.3920756Z 2025-05-07T19:52:22.3920846Z 2025-05-07T19:52:22.3921080Z OTHER_SRCS: 2025-05-07T19:52:22.3921206Z 2025-05-07T19:52:22.3921296Z 2025-05-07T19:52:22.3921530Z CC_FLAGS: 2025-05-07T19:52:22.3921657Z 2025-05-07T19:52:22.3921745Z 2025-05-07T19:52:22.3921976Z NVCC_FLAGS: 2025-05-07T19:52:22.3922220Z --expt-relaxed-constexpr 2025-05-07T19:52:22.3922517Z -D__CUDA_NO_HALF_OPERATORS__ 2025-05-07T19:52:22.3922821Z -D__CUDA_NO_BFLOAT16_CONVERSIONS__ 2025-05-07T19:52:22.3923111Z -D__CUDA_NO_HALF2_OPERATORS__ 2025-05-07T19:52:22.3923391Z 2025-05-07T19:52:22.3923574Z HIPCC_FLAGS: 2025-05-07T19:52:22.3923707Z 2025-05-07T19:52:22.3923803Z 2025-05-07T19:52:22.3923991Z INCLUDE_DIRS: 2025-05-07T19:52:22.3924239Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/../include 2025-05-07T19:52:22.3924559Z /__w/FBGEMM/FBGEMM/fbgemm_gpu 2025-05-07T19:52:22.3924874Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include 2025-05-07T19:52:22.3925196Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/../include 2025-05-07T19:52:22.3925825Z /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include 2025-05-07T19:52:22.3926632Z /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include 2025-05-07T19:52:22.3927278Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src 2025-05-07T19:52:22.3927710Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include 2025-05-07T19:52:22.3928128Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include 2025-05-07T19:52:22.3928614Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include 2025-05-07T19:52:22.3929124Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include 2025-05-07T19:52:22.3929584Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include 
2025-05-07T19:52:22.3930148Z /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include 2025-05-07T19:52:22.3930648Z 2025-05-07T19:52:22.3930858Z Selected Source Files: 2025-05-07T19:52:22.3931137Z gen_embedding_split_adagrad_pt2_autograd.cpp 2025-05-07T19:52:22.3931530Z gen_embedding_split_rowwise_adagrad_pt2_autograd.cpp 2025-05-07T19:52:22.3931890Z gen_embedding_split_sgd_pt2_autograd.cpp 2025-05-07T19:52:22.3932234Z gen_embedding_split_adam_pt2_autograd.cpp 2025-05-07T19:52:22.3932572Z gen_embedding_split_lamb_pt2_autograd.cpp 2025-05-07T19:52:22.3932934Z gen_embedding_split_lars_sgd_pt2_autograd.cpp 2025-05-07T19:52:22.3933335Z gen_embedding_split_partial_rowwise_adam_pt2_autograd.cpp 2025-05-07T19:52:22.3933761Z gen_embedding_split_partial_rowwise_lamb_pt2_autograd.cpp 2025-05-07T19:52:22.3934151Z gen_embedding_split_none_pt2_autograd.cpp 2025-05-07T19:52:22.3934708Z gen_embedding_split_rowwise_adagrad_with_counter_pt2_autograd.cpp 2025-05-07T19:52:22.3935153Z gen_embedding_split_approx_sgd_pt2_autograd.cpp 2025-05-07T19:52:22.3935634Z gen_embedding_split_approx_rowwise_adagrad_pt2_autograd.cpp 2025-05-07T19:52:22.3936144Z gen_embedding_split_approx_rowwise_adagrad_with_counter_pt2_autograd.cpp 2025-05-07T19:52:22.3936718Z gen_embedding_split_approx_rowwise_adagrad_with_weight_decay_pt2_autograd.cpp 2025-05-07T19:52:22.3937285Z gen_embedding_split_rowwise_adagrad_with_weight_decay_pt2_autograd.cpp 2025-05-07T19:52:22.3937884Z gen_embedding_split_rowwise_weighted_adagrad_pt2_autograd.cpp 2025-05-07T19:52:22.3938314Z gen_embedding_ssd_rowwise_adagrad_pt2_autograd.cpp 2025-05-07T19:52:22.3938730Z gen_embedding_backward_split_adagrad_pt2_cpu_wrapper.cpp 2025-05-07T19:52:22.3939189Z gen_embedding_backward_split_rowwise_adagrad_pt2_cpu_wrapper.cpp 2025-05-07T19:52:22.3939644Z gen_embedding_backward_split_sgd_pt2_cpu_wrapper.cpp 2025-05-07T19:52:22.3940041Z gen_embedding_backward_split_adam_pt2_cpu_wrapper.cpp 
2025-05-07T19:52:22.3940456Z gen_embedding_backward_split_lamb_pt2_cpu_wrapper.cpp 2025-05-07T19:52:22.3940876Z gen_embedding_backward_split_lars_sgd_pt2_cpu_wrapper.cpp 2025-05-07T19:52:22.3941356Z gen_embedding_backward_split_partial_rowwise_adam_pt2_cpu_wrapper.cpp 2025-05-07T19:52:22.3941894Z gen_embedding_backward_split_partial_rowwise_lamb_pt2_cpu_wrapper.cpp 2025-05-07T19:52:22.3942356Z gen_embedding_backward_split_none_pt2_cpu_wrapper.cpp 2025-05-07T19:52:22.3942862Z gen_embedding_backward_split_rowwise_adagrad_with_counter_pt2_cpu_wrapper.cpp 2025-05-07T19:52:22.3943382Z gen_embedding_backward_split_approx_sgd_pt2_cpu_wrapper.cpp 2025-05-07T19:52:22.3944007Z gen_embedding_backward_split_approx_rowwise_adagrad_pt2_cpu_wrapper.cpp 2025-05-07T19:52:22.3944581Z gen_embedding_backward_split_approx_rowwise_adagrad_with_counter_pt2_cpu_wrapper.cpp 2025-05-07T19:52:22.3945216Z gen_embedding_backward_split_approx_rowwise_adagrad_with_weight_decay_pt2_cpu_wrapper.cpp 2025-05-07T19:52:22.3945841Z gen_embedding_backward_split_rowwise_adagrad_with_weight_decay_pt2_cpu_wrapper.cpp 2025-05-07T19:52:22.3946410Z gen_embedding_backward_split_rowwise_weighted_adagrad_pt2_cpu_wrapper.cpp 2025-05-07T19:52:22.3946893Z gen_embedding_backward_split_adagrad_pt2_cuda_wrapper.cpp 2025-05-07T19:52:22.3947330Z gen_embedding_backward_split_rowwise_adagrad_pt2_cuda_wrapper.cpp 2025-05-07T19:52:22.3947770Z gen_embedding_backward_split_sgd_pt2_cuda_wrapper.cpp 2025-05-07T19:52:22.3948154Z gen_embedding_backward_split_adam_pt2_cuda_wrapper.cpp 2025-05-07T19:52:22.3948546Z gen_embedding_backward_split_lamb_pt2_cuda_wrapper.cpp 2025-05-07T19:52:22.3948945Z gen_embedding_backward_split_lars_sgd_pt2_cuda_wrapper.cpp 2025-05-07T19:52:22.3949406Z gen_embedding_backward_split_partial_rowwise_adam_pt2_cuda_wrapper.cpp 2025-05-07T19:52:22.3949937Z gen_embedding_backward_split_partial_rowwise_lamb_pt2_cuda_wrapper.cpp 2025-05-07T19:52:22.3950384Z 
gen_embedding_backward_split_none_pt2_cuda_wrapper.cpp 2025-05-07T19:52:22.3950866Z gen_embedding_backward_split_rowwise_adagrad_with_counter_pt2_cuda_wrapper.cpp 2025-05-07T19:52:22.3951385Z gen_embedding_backward_split_approx_sgd_pt2_cuda_wrapper.cpp 2025-05-07T19:52:22.3951868Z gen_embedding_backward_split_approx_rowwise_adagrad_pt2_cuda_wrapper.cpp 2025-05-07T19:52:22.3952454Z gen_embedding_backward_split_approx_rowwise_adagrad_with_counter_pt2_cuda_wrapper.cpp 2025-05-07T19:52:22.3953178Z gen_embedding_backward_split_approx_rowwise_adagrad_with_weight_decay_pt2_cuda_wrapper.cpp 2025-05-07T19:52:22.3954041Z gen_embedding_backward_split_rowwise_adagrad_with_weight_decay_pt2_cuda_wrapper.cpp 2025-05-07T19:52:22.3954676Z gen_embedding_backward_split_rowwise_weighted_adagrad_pt2_cuda_wrapper.cpp 2025-05-07T19:52:22.3955219Z gen_embedding_backward_ssd_rowwise_adagrad_pt2_cuda_wrapper.cpp 2025-05-07T19:52:22.3955618Z 2025-05-07T19:52:22.3955816Z HIPified Source Files: 2025-05-07T19:52:22.3955977Z 2025-05-07T19:52:22.3956083Z 2025-05-07T19:52:22.3956287Z Library Dependencies: 2025-05-07T19:52:22.3956520Z torch 2025-05-07T19:52:22.3956703Z torch_library 2025-05-07T19:52:22.3957168Z /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/lib/libc10.so 2025-05-07T19:52:22.3957871Z /github/home/miniconda/envs/build_binary/lib/libnvrtc.so 2025-05-07T19:52:22.3958497Z /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/lib/libc10_cuda.so 2025-05-07T19:52:22.3959301Z /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/lib/libnccl.so.2 2025-05-07T19:52:22.3959997Z /github/home/miniconda/envs/build_binary/lib/stubs/libcuda.so 2025-05-07T19:52:22.3960383Z fbgemm 2025-05-07T19:52:22.3960570Z fbgemm_gpu_config 2025-05-07T19:52:22.3960795Z fbgemm_gpu_tbe_cache 2025-05-07T19:52:22.3961014Z fbgemm_gpu_tbe_common 2025-05-07T19:52:22.3961253Z fbgemm_gpu_tbe_utils 2025-05-07T19:52:22.3961494Z 
fbgemm_gpu_sparse_async_cumsum 2025-05-07T19:52:22.3961893Z /github/home/miniconda/envs/build_binary/lib/stubs/libnvidia-ml.so 2025-05-07T19:52:22.3962301Z 2025-05-07T19:52:22.3962482Z Output Library: 2025-05-07T19:52:22.3962739Z fbgemm_gpu_tbe_training_backward_pt2 2025-05-07T19:52:22.3963017Z 2025-05-07T19:52:22.3963219Z Destination Directory: 2025-05-07T19:52:22.3963451Z fbgemm_gpu 2025-05-07T19:52:22.3963684Z ================================================================================ 2025-05-07T19:52:22.3963916Z 2025-05-07T19:52:22.3963920Z 2025-05-07T19:52:22.3963924Z 2025-05-07T19:52:22.3964058Z ================================================================================ 2025-05-07T19:52:22.3964492Z GPU CPP Library Target: fbgemm_gpu_tbe_training_backward (SHARED) 2025-05-07T19:52:22.3964882Z 2025-05-07T19:52:22.3965051Z CPU_SRCS: 2025-05-07T19:52:22.3965374Z codegen/training/backward/embedding_backward_dense_host_cpu.cpp 2025-05-07T19:52:22.3965907Z gen_embedding_backward_dense_split_cpu.cpp 2025-05-07T19:52:22.3966240Z gen_embedding_backward_split_adagrad_cpu.cpp 2025-05-07T19:52:22.3966678Z gen_embedding_backward_split_rowwise_adagrad_cpu.cpp 2025-05-07T19:52:22.3985777Z gen_embedding_backward_split_sgd_cpu.cpp 2025-05-07T19:52:22.3986833Z gen_embedding_backward_split_adam_cpu.cpp 2025-05-07T19:52:22.3987193Z gen_embedding_backward_split_lamb_cpu.cpp 2025-05-07T19:52:22.3987537Z gen_embedding_backward_split_lars_sgd_cpu.cpp 2025-05-07T19:52:22.3987919Z gen_embedding_backward_split_partial_rowwise_adam_cpu.cpp 2025-05-07T19:52:22.3988375Z gen_embedding_backward_split_partial_rowwise_lamb_cpu.cpp 2025-05-07T19:52:22.3988766Z gen_embedding_backward_split_none_cpu.cpp 2025-05-07T19:52:22.3989174Z gen_embedding_backward_split_rowwise_adagrad_with_counter_cpu.cpp 2025-05-07T19:52:22.3989601Z gen_embedding_backward_split_approx_sgd_cpu.cpp 2025-05-07T19:52:22.3989986Z gen_embedding_backward_split_approx_rowwise_adagrad_cpu.cpp 
2025-05-07T19:52:22.3990469Z gen_embedding_backward_split_approx_rowwise_adagrad_with_counter_cpu.cpp 2025-05-07T19:52:22.3991024Z gen_embedding_backward_split_approx_rowwise_adagrad_with_weight_decay_cpu.cpp 2025-05-07T19:52:22.3991578Z gen_embedding_backward_split_rowwise_adagrad_with_weight_decay_cpu.cpp 2025-05-07T19:52:22.3992071Z gen_embedding_backward_split_rowwise_weighted_adagrad_cpu.cpp 2025-05-07T19:52:22.3992465Z gen_embedding_backward_adagrad_split_cpu.cpp 2025-05-07T19:52:22.3992957Z gen_embedding_backward_rowwise_adagrad_split_cpu.cpp 2025-05-07T19:52:22.3993489Z gen_embedding_backward_sgd_split_cpu.cpp 2025-05-07T19:52:22.3993784Z 2025-05-07T19:52:22.3993961Z GPU_SRCS: 2025-05-07T19:52:22.3994295Z gen_embedding_backward_split_grad_embedding_ops.cu 2025-05-07T19:52:22.3994720Z gen_embedding_backward_split_indice_weights_codegen_cuda.cu 2025-05-07T19:52:22.3995195Z gen_embedding_backward_dense_indice_weights_codegen_cuda.cu 2025-05-07T19:52:22.3995655Z gen_embedding_backward_ssd_indice_weights_codegen_cuda.cu 2025-05-07T19:52:22.3996091Z gen_embedding_backward_rowwise_adagrad_ssd_weighted_cuda.cu 2025-05-07T19:52:22.3996588Z gen_embedding_backward_rowwise_adagrad_ssd_weighted_kernel_cta.cu 2025-05-07T19:52:22.3997094Z gen_embedding_backward_rowwise_adagrad_ssd_weighted_kernel_warp.cu 2025-05-07T19:52:22.3997617Z gen_embedding_backward_rowwise_adagrad_ssd_unweighted_nobag_cuda.cu 2025-05-07T19:52:22.3998326Z gen_embedding_backward_rowwise_adagrad_ssd_unweighted_nobag_kernel_cta.cu 2025-05-07T19:52:22.3998903Z gen_embedding_backward_rowwise_adagrad_ssd_unweighted_nobag_kernel_warp.cu 2025-05-07T19:52:22.3999461Z gen_embedding_backward_rowwise_adagrad_ssd_unweighted_cuda.cu 2025-05-07T19:52:22.4000042Z gen_embedding_backward_rowwise_adagrad_ssd_unweighted_kernel_cta.cu 2025-05-07T19:52:22.4000578Z gen_embedding_backward_rowwise_adagrad_ssd_unweighted_kernel_warp.cu 2025-05-07T19:52:22.4001039Z gen_embedding_backward_adagrad_split_weighted_cuda.cu 
2025-05-07T19:52:22.4001478Z gen_embedding_backward_adagrad_split_weighted_kernel_cta.cu 2025-05-07T19:52:22.4001941Z gen_embedding_backward_adagrad_split_weighted_kernel_warp.cu 2025-05-07T19:52:22.4002634Z gen_embedding_backward_adagrad_split_unweighted_nobag_cuda.cu 2025-05-07T19:52:22.4003153Z gen_embedding_backward_adagrad_split_unweighted_nobag_kernel_cta.cu 2025-05-07T19:52:22.4003683Z gen_embedding_backward_adagrad_split_unweighted_nobag_kernel_warp.cu 2025-05-07T19:52:22.4004180Z gen_embedding_backward_adagrad_split_unweighted_cuda.cu 2025-05-07T19:52:22.4004624Z gen_embedding_backward_adagrad_split_unweighted_kernel_cta.cu 2025-05-07T19:52:22.4005120Z gen_embedding_backward_adagrad_split_unweighted_kernel_warp.cu 2025-05-07T19:52:22.4005598Z gen_embedding_backward_rowwise_adagrad_split_weighted_cuda.cu 2025-05-07T19:52:22.4006118Z gen_embedding_backward_rowwise_adagrad_split_weighted_kernel_cta.cu 2025-05-07T19:52:22.4006659Z gen_embedding_backward_rowwise_adagrad_split_weighted_kernel_warp.cu 2025-05-07T19:52:22.4007192Z gen_embedding_backward_rowwise_adagrad_split_unweighted_nobag_cuda.cu 2025-05-07T19:52:22.4007758Z gen_embedding_backward_rowwise_adagrad_split_unweighted_nobag_kernel_cta.cu 2025-05-07T19:52:22.4008339Z gen_embedding_backward_rowwise_adagrad_split_unweighted_nobag_kernel_warp.cu 2025-05-07T19:52:22.4008889Z gen_embedding_backward_rowwise_adagrad_split_unweighted_cuda.cu 2025-05-07T19:52:22.4009406Z gen_embedding_backward_rowwise_adagrad_split_unweighted_kernel_cta.cu 2025-05-07T19:52:22.4009966Z gen_embedding_backward_rowwise_adagrad_split_unweighted_kernel_warp.cu 2025-05-07T19:52:22.4010438Z gen_embedding_backward_sgd_split_weighted_cuda.cu 2025-05-07T19:52:22.4010834Z gen_embedding_backward_sgd_split_weighted_kernel_cta.cu 2025-05-07T19:52:22.4011266Z gen_embedding_backward_sgd_split_weighted_kernel_warp.cu 2025-05-07T19:52:22.4011691Z gen_embedding_backward_sgd_split_unweighted_nobag_cuda.cu 2025-05-07T19:52:22.4012172Z 
gen_embedding_backward_sgd_split_unweighted_nobag_kernel_cta.cu 2025-05-07T19:52:22.4012672Z gen_embedding_backward_sgd_split_unweighted_nobag_kernel_warp.cu 2025-05-07T19:52:22.4013123Z gen_embedding_backward_sgd_split_unweighted_cuda.cu 2025-05-07T19:52:22.4013532Z gen_embedding_backward_sgd_split_unweighted_kernel_cta.cu 2025-05-07T19:52:22.4014102Z gen_embedding_backward_sgd_split_unweighted_kernel_warp.cu 2025-05-07T19:52:22.4014636Z gen_embedding_backward_adam_split_weighted_cuda.cu 2025-05-07T19:52:22.4015017Z gen_embedding_backward_adam_split_weighted_kernel_cta.cu 2025-05-07T19:52:22.4015427Z gen_embedding_backward_adam_split_weighted_kernel_warp.cu 2025-05-07T19:52:22.4015827Z gen_embedding_backward_adam_split_unweighted_nobag_cuda.cu 2025-05-07T19:52:22.4016266Z gen_embedding_backward_adam_split_unweighted_nobag_kernel_cta.cu 2025-05-07T19:52:22.4016740Z gen_embedding_backward_adam_split_unweighted_nobag_kernel_warp.cu 2025-05-07T19:52:22.4017177Z gen_embedding_backward_adam_split_unweighted_cuda.cu 2025-05-07T19:52:22.4017581Z gen_embedding_backward_adam_split_unweighted_kernel_cta.cu 2025-05-07T19:52:22.4017997Z gen_embedding_backward_adam_split_unweighted_kernel_warp.cu 2025-05-07T19:52:22.4018404Z gen_embedding_backward_lamb_split_weighted_cuda.cu 2025-05-07T19:52:22.4018778Z gen_embedding_backward_lamb_split_weighted_kernel_cta.cu 2025-05-07T19:52:22.4019206Z gen_embedding_backward_lamb_split_weighted_kernel_warp.cu 2025-05-07T19:52:22.4019607Z gen_embedding_backward_lamb_split_unweighted_nobag_cuda.cu 2025-05-07T19:52:22.4020181Z gen_embedding_backward_lamb_split_unweighted_nobag_kernel_cta.cu 2025-05-07T19:52:22.4020654Z gen_embedding_backward_lamb_split_unweighted_nobag_kernel_warp.cu 2025-05-07T19:52:22.4021062Z gen_embedding_backward_lamb_split_unweighted_cuda.cu 2025-05-07T19:52:22.4021458Z gen_embedding_backward_lamb_split_unweighted_kernel_cta.cu 2025-05-07T19:52:22.4021931Z gen_embedding_backward_lamb_split_unweighted_kernel_warp.cu 
2025-05-07T19:52:22.4022359Z gen_embedding_backward_lars_sgd_split_weighted_cuda.cu 2025-05-07T19:52:22.4022767Z gen_embedding_backward_lars_sgd_split_weighted_kernel_cta.cu 2025-05-07T19:52:22.4023215Z gen_embedding_backward_lars_sgd_split_weighted_kernel_warp.cu 2025-05-07T19:52:22.4023657Z gen_embedding_backward_lars_sgd_split_unweighted_nobag_cuda.cu 2025-05-07T19:52:22.4024134Z gen_embedding_backward_lars_sgd_split_unweighted_nobag_kernel_cta.cu 2025-05-07T19:52:22.4024639Z gen_embedding_backward_lars_sgd_split_unweighted_nobag_kernel_warp.cu 2025-05-07T19:52:22.4025089Z gen_embedding_backward_lars_sgd_split_unweighted_cuda.cu 2025-05-07T19:52:22.4025512Z gen_embedding_backward_lars_sgd_split_unweighted_kernel_cta.cu 2025-05-07T19:52:22.4025964Z gen_embedding_backward_lars_sgd_split_unweighted_kernel_warp.cu 2025-05-07T19:52:22.4026451Z gen_embedding_backward_partial_rowwise_adam_split_weighted_cuda.cu 2025-05-07T19:52:22.4026960Z gen_embedding_backward_partial_rowwise_adam_split_weighted_kernel_cta.cu 2025-05-07T19:52:22.4027490Z gen_embedding_backward_partial_rowwise_adam_split_weighted_kernel_warp.cu 2025-05-07T19:52:22.4028033Z gen_embedding_backward_partial_rowwise_adam_split_unweighted_nobag_cuda.cu 2025-05-07T19:52:22.4028581Z gen_embedding_backward_partial_rowwise_adam_split_unweighted_nobag_kernel_cta.cu 2025-05-07T19:52:22.4029170Z gen_embedding_backward_partial_rowwise_adam_split_unweighted_nobag_kernel_warp.cu 2025-05-07T19:52:22.4029725Z gen_embedding_backward_partial_rowwise_adam_split_unweighted_cuda.cu 2025-05-07T19:52:22.4030258Z gen_embedding_backward_partial_rowwise_adam_split_unweighted_kernel_cta.cu 2025-05-07T19:52:22.4030795Z gen_embedding_backward_partial_rowwise_adam_split_unweighted_kernel_warp.cu 2025-05-07T19:52:22.4031325Z gen_embedding_backward_partial_rowwise_lamb_split_weighted_cuda.cu 2025-05-07T19:52:22.4031840Z gen_embedding_backward_partial_rowwise_lamb_split_weighted_kernel_cta.cu 2025-05-07T19:52:22.4032369Z 
gen_embedding_backward_partial_rowwise_lamb_split_weighted_kernel_warp.cu 2025-05-07T19:52:22.4032991Z gen_embedding_backward_partial_rowwise_lamb_split_unweighted_nobag_cuda.cu 2025-05-07T19:52:22.4033749Z gen_embedding_backward_partial_rowwise_lamb_split_unweighted_nobag_kernel_cta.cu 2025-05-07T19:52:22.4034409Z gen_embedding_backward_partial_rowwise_lamb_split_unweighted_nobag_kernel_warp.cu 2025-05-07T19:52:22.4035009Z gen_embedding_backward_partial_rowwise_lamb_split_unweighted_cuda.cu 2025-05-07T19:52:22.4035567Z gen_embedding_backward_partial_rowwise_lamb_split_unweighted_kernel_cta.cu 2025-05-07T19:52:22.4036167Z gen_embedding_backward_partial_rowwise_lamb_split_unweighted_kernel_warp.cu 2025-05-07T19:52:22.4036666Z gen_embedding_backward_none_split_weighted_cuda.cu 2025-05-07T19:52:22.4037289Z gen_embedding_backward_none_split_weighted_kernel_cta.cu 2025-05-07T19:52:22.4037719Z gen_embedding_backward_none_split_weighted_kernel_warp.cu 2025-05-07T19:52:22.4038174Z gen_embedding_backward_none_split_unweighted_nobag_cuda.cu 2025-05-07T19:52:22.4038659Z gen_embedding_backward_none_split_unweighted_nobag_kernel_cta.cu 2025-05-07T19:52:22.4039164Z gen_embedding_backward_none_split_unweighted_nobag_kernel_warp.cu 2025-05-07T19:52:22.4039627Z gen_embedding_backward_none_split_unweighted_cuda.cu 2025-05-07T19:52:22.4040047Z gen_embedding_backward_none_split_unweighted_kernel_cta.cu 2025-05-07T19:52:22.4040509Z gen_embedding_backward_none_split_unweighted_kernel_warp.cu 2025-05-07T19:52:22.4041012Z gen_embedding_backward_rowwise_adagrad_with_counter_split_weighted_cuda.cu 2025-05-07T19:52:22.4041699Z gen_embedding_backward_rowwise_adagrad_with_counter_split_weighted_kernel_cta.cu 2025-05-07T19:52:22.4042337Z gen_embedding_backward_rowwise_adagrad_with_counter_split_weighted_kernel_warp.cu 2025-05-07T19:52:22.4042963Z gen_embedding_backward_rowwise_adagrad_with_counter_split_unweighted_nobag_cuda.cu 2025-05-07T19:52:22.4043687Z 
gen_embedding_backward_rowwise_adagrad_with_counter_split_unweighted_nobag_kernel_cta.cu 2025-05-07T19:52:22.4044393Z gen_embedding_backward_rowwise_adagrad_with_counter_split_unweighted_nobag_kernel_warp.cu 2025-05-07T19:52:22.4045026Z gen_embedding_backward_rowwise_adagrad_with_counter_split_unweighted_cuda.cu 2025-05-07T19:52:22.4045756Z gen_embedding_backward_rowwise_adagrad_with_counter_split_unweighted_kernel_cta.cu 2025-05-07T19:52:22.4046354Z gen_embedding_backward_rowwise_adagrad_with_counter_split_unweighted_kernel_warp.cu 2025-05-07T19:52:22.4046776Z 2025-05-07T19:52:22.4046960Z CUDA_SPECIFIC_SRCS: 2025-05-07T19:52:22.4047092Z 2025-05-07T19:52:22.4047164Z 2025-05-07T19:52:22.4047341Z HIP_SPECIFIC_SRCS: 2025-05-07T19:52:22.4047659Z gen_embedding_backward_split_unweighted_nobag_device_kernel_hip.hip 2025-05-07T19:52:22.4048127Z gen_embedding_backward_split_weighted_device_kernel_hip.hip 2025-05-07T19:52:22.4048564Z gen_embedding_backward_split_unweighted_device_kernel_hip.hip 2025-05-07T19:52:22.4048906Z 2025-05-07T19:52:22.4049078Z OTHER_SRCS: 2025-05-07T19:52:22.4049191Z 2025-05-07T19:52:22.4049263Z 2025-05-07T19:52:22.4049437Z CC_FLAGS: 2025-05-07T19:52:22.4049538Z 2025-05-07T19:52:22.4049607Z 2025-05-07T19:52:22.4049771Z NVCC_FLAGS: 2025-05-07T19:52:22.4049958Z --expt-relaxed-constexpr 2025-05-07T19:52:22.4050213Z -D__CUDA_NO_HALF_OPERATORS__ 2025-05-07T19:52:22.4050467Z -D__CUDA_NO_BFLOAT16_CONVERSIONS__ 2025-05-07T19:52:22.4050746Z -D__CUDA_NO_HALF2_OPERATORS__ 2025-05-07T19:52:22.4050982Z 2025-05-07T19:52:22.4051139Z HIPCC_FLAGS: 2025-05-07T19:52:22.4051249Z 2025-05-07T19:52:22.4051324Z 2025-05-07T19:52:22.4051482Z INCLUDE_DIRS: 2025-05-07T19:52:22.4051706Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/../include 2025-05-07T19:52:22.4051985Z /__w/FBGEMM/FBGEMM/fbgemm_gpu 2025-05-07T19:52:22.4052244Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include 2025-05-07T19:52:22.4052521Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/../include 2025-05-07T19:52:22.4052993Z 
/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include 2025-05-07T19:52:22.4053725Z /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include 2025-05-07T19:52:22.4054333Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src 2025-05-07T19:52:22.4054716Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include 2025-05-07T19:52:22.4055105Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include 2025-05-07T19:52:22.4055548Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include 2025-05-07T19:52:22.4056019Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include 2025-05-07T19:52:22.4056445Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include 2025-05-07T19:52:22.4056737Z /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include 2025-05-07T19:52:22.4056806Z 2025-05-07T19:52:22.4056898Z Selected Source Files: 2025-05-07T19:52:22.4057087Z codegen/training/backward/embedding_backward_dense_host_cpu.cpp 2025-05-07T19:52:22.4057196Z gen_embedding_backward_dense_split_cpu.cpp 2025-05-07T19:52:22.4057319Z gen_embedding_backward_split_adagrad_cpu.cpp 2025-05-07T19:52:22.4057454Z gen_embedding_backward_split_rowwise_adagrad_cpu.cpp 2025-05-07T19:52:22.4057556Z gen_embedding_backward_split_sgd_cpu.cpp 2025-05-07T19:52:22.4057670Z gen_embedding_backward_split_adam_cpu.cpp 2025-05-07T19:52:22.4057772Z gen_embedding_backward_split_lamb_cpu.cpp 2025-05-07T19:52:22.4057887Z gen_embedding_backward_split_lars_sgd_cpu.cpp 2025-05-07T19:52:22.4058037Z gen_embedding_backward_split_partial_rowwise_adam_cpu.cpp 2025-05-07T19:52:22.4058196Z gen_embedding_backward_split_partial_rowwise_lamb_cpu.cpp 2025-05-07T19:52:22.4058356Z gen_embedding_backward_split_none_cpu.cpp 2025-05-07T19:52:22.4058539Z gen_embedding_backward_split_rowwise_adagrad_with_counter_cpu.cpp 2025-05-07T19:52:22.4058668Z gen_embedding_backward_split_approx_sgd_cpu.cpp 
2025-05-07T19:52:22.4058828Z gen_embedding_backward_split_approx_rowwise_adagrad_cpu.cpp 2025-05-07T19:52:22.4059036Z gen_embedding_backward_split_approx_rowwise_adagrad_with_counter_cpu.cpp 2025-05-07T19:52:22.4059577Z gen_embedding_backward_split_approx_rowwise_adagrad_with_weight_decay_cpu.cpp 2025-05-07T19:52:22.4059777Z gen_embedding_backward_split_rowwise_adagrad_with_weight_decay_cpu.cpp 2025-05-07T19:52:22.4059944Z gen_embedding_backward_split_rowwise_weighted_adagrad_cpu.cpp 2025-05-07T19:52:22.4060052Z gen_embedding_backward_adagrad_split_cpu.cpp 2025-05-07T19:52:22.4060192Z gen_embedding_backward_rowwise_adagrad_split_cpu.cpp 2025-05-07T19:52:22.4060294Z gen_embedding_backward_sgd_split_cpu.cpp 2025-05-07T19:52:22.4060420Z gen_embedding_backward_split_grad_embedding_ops.cu 2025-05-07T19:52:22.4060592Z gen_embedding_backward_split_indice_weights_codegen_cuda.cu 2025-05-07T19:52:22.4060746Z gen_embedding_backward_dense_indice_weights_codegen_cuda.cu 2025-05-07T19:52:22.4060893Z gen_embedding_backward_ssd_indice_weights_codegen_cuda.cu 2025-05-07T19:52:22.4061062Z gen_embedding_backward_rowwise_adagrad_ssd_weighted_cuda.cu 2025-05-07T19:52:22.4061245Z gen_embedding_backward_rowwise_adagrad_ssd_weighted_kernel_cta.cu 2025-05-07T19:52:22.4061429Z gen_embedding_backward_rowwise_adagrad_ssd_weighted_kernel_warp.cu 2025-05-07T19:52:22.4061613Z gen_embedding_backward_rowwise_adagrad_ssd_unweighted_nobag_cuda.cu 2025-05-07T19:52:22.4061830Z gen_embedding_backward_rowwise_adagrad_ssd_unweighted_nobag_kernel_cta.cu 2025-05-07T19:52:22.4062040Z gen_embedding_backward_rowwise_adagrad_ssd_unweighted_nobag_kernel_warp.cu 2025-05-07T19:52:22.4062205Z gen_embedding_backward_rowwise_adagrad_ssd_unweighted_cuda.cu 2025-05-07T19:52:22.4062397Z gen_embedding_backward_rowwise_adagrad_ssd_unweighted_kernel_cta.cu 2025-05-07T19:52:22.4062591Z gen_embedding_backward_rowwise_adagrad_ssd_unweighted_kernel_warp.cu 2025-05-07T19:52:22.4062730Z 
gen_embedding_backward_adagrad_split_weighted_cuda.cu 2025-05-07T19:52:22.4062899Z gen_embedding_backward_adagrad_split_weighted_kernel_cta.cu 2025-05-07T19:52:22.4063061Z gen_embedding_backward_adagrad_split_weighted_kernel_warp.cu 2025-05-07T19:52:22.4063234Z gen_embedding_backward_adagrad_split_unweighted_nobag_cuda.cu 2025-05-07T19:52:22.4063431Z gen_embedding_backward_adagrad_split_unweighted_nobag_kernel_cta.cu 2025-05-07T19:52:22.4063618Z gen_embedding_backward_adagrad_split_unweighted_nobag_kernel_warp.cu 2025-05-07T19:52:22.4063762Z gen_embedding_backward_adagrad_split_unweighted_cuda.cu 2025-05-07T19:52:22.4063927Z gen_embedding_backward_adagrad_split_unweighted_kernel_cta.cu 2025-05-07T19:52:22.4064103Z gen_embedding_backward_adagrad_split_unweighted_kernel_warp.cu 2025-05-07T19:52:22.4064267Z gen_embedding_backward_rowwise_adagrad_split_weighted_cuda.cu 2025-05-07T19:52:22.4064455Z gen_embedding_backward_rowwise_adagrad_split_weighted_kernel_cta.cu 2025-05-07T19:52:22.4064656Z gen_embedding_backward_rowwise_adagrad_split_weighted_kernel_warp.cu 2025-05-07T19:52:22.4064851Z gen_embedding_backward_rowwise_adagrad_split_unweighted_nobag_cuda.cu 2025-05-07T19:52:22.4065066Z gen_embedding_backward_rowwise_adagrad_split_unweighted_nobag_kernel_cta.cu 2025-05-07T19:52:22.4065298Z gen_embedding_backward_rowwise_adagrad_split_unweighted_nobag_kernel_warp.cu 2025-05-07T19:52:22.4065477Z gen_embedding_backward_rowwise_adagrad_split_unweighted_cuda.cu 2025-05-07T19:52:22.4065670Z gen_embedding_backward_rowwise_adagrad_split_unweighted_kernel_cta.cu 2025-05-07T19:52:22.4065868Z gen_embedding_backward_rowwise_adagrad_split_unweighted_kernel_warp.cu 2025-05-07T19:52:22.4066008Z gen_embedding_backward_sgd_split_weighted_cuda.cu 2025-05-07T19:52:22.4066157Z gen_embedding_backward_sgd_split_weighted_kernel_cta.cu 2025-05-07T19:52:22.4066303Z gen_embedding_backward_sgd_split_weighted_kernel_warp.cu 2025-05-07T19:52:22.4066511Z 
gen_embedding_backward_sgd_split_unweighted_nobag_cuda.cu 2025-05-07T19:52:22.4066688Z gen_embedding_backward_sgd_split_unweighted_nobag_kernel_cta.cu 2025-05-07T19:52:22.4066867Z gen_embedding_backward_sgd_split_unweighted_nobag_kernel_warp.cu 2025-05-07T19:52:22.4067016Z gen_embedding_backward_sgd_split_unweighted_cuda.cu 2025-05-07T19:52:22.4067224Z gen_embedding_backward_sgd_split_unweighted_kernel_cta.cu 2025-05-07T19:52:22.4067377Z gen_embedding_backward_sgd_split_unweighted_kernel_warp.cu 2025-05-07T19:52:22.4067503Z gen_embedding_backward_adam_split_weighted_cuda.cu 2025-05-07T19:52:22.4067659Z gen_embedding_backward_adam_split_weighted_kernel_cta.cu 2025-05-07T19:52:22.4067808Z gen_embedding_backward_adam_split_weighted_kernel_warp.cu 2025-05-07T19:52:22.4067960Z gen_embedding_backward_adam_split_unweighted_nobag_cuda.cu 2025-05-07T19:52:22.4068144Z gen_embedding_backward_adam_split_unweighted_nobag_kernel_cta.cu 2025-05-07T19:52:22.4068324Z gen_embedding_backward_adam_split_unweighted_nobag_kernel_warp.cu 2025-05-07T19:52:22.4068464Z gen_embedding_backward_adam_split_unweighted_cuda.cu 2025-05-07T19:52:22.4068622Z gen_embedding_backward_adam_split_unweighted_kernel_cta.cu 2025-05-07T19:52:22.4068780Z gen_embedding_backward_adam_split_unweighted_kernel_warp.cu 2025-05-07T19:52:22.4068909Z gen_embedding_backward_lamb_split_weighted_cuda.cu 2025-05-07T19:52:22.4069062Z gen_embedding_backward_lamb_split_weighted_kernel_cta.cu 2025-05-07T19:52:22.4069215Z gen_embedding_backward_lamb_split_weighted_kernel_warp.cu 2025-05-07T19:52:22.4069367Z gen_embedding_backward_lamb_split_unweighted_nobag_cuda.cu 2025-05-07T19:52:22.4069542Z gen_embedding_backward_lamb_split_unweighted_nobag_kernel_cta.cu 2025-05-07T19:52:22.4069731Z gen_embedding_backward_lamb_split_unweighted_nobag_kernel_warp.cu 2025-05-07T19:52:22.4069870Z gen_embedding_backward_lamb_split_unweighted_cuda.cu 2025-05-07T19:52:22.4070019Z gen_embedding_backward_lamb_split_unweighted_kernel_cta.cu 
2025-05-07T19:52:22.4070189Z gen_embedding_backward_lamb_split_unweighted_kernel_warp.cu 2025-05-07T19:52:22.4070326Z gen_embedding_backward_lars_sgd_split_weighted_cuda.cu 2025-05-07T19:52:22.4070485Z gen_embedding_backward_lars_sgd_split_weighted_kernel_cta.cu 2025-05-07T19:52:22.4070646Z gen_embedding_backward_lars_sgd_split_weighted_kernel_warp.cu 2025-05-07T19:52:22.4070834Z gen_embedding_backward_lars_sgd_split_unweighted_nobag_cuda.cu 2025-05-07T19:52:22.4071028Z gen_embedding_backward_lars_sgd_split_unweighted_nobag_kernel_cta.cu 2025-05-07T19:52:22.4071219Z gen_embedding_backward_lars_sgd_split_unweighted_nobag_kernel_warp.cu 2025-05-07T19:52:22.4071368Z gen_embedding_backward_lars_sgd_split_unweighted_cuda.cu 2025-05-07T19:52:22.4071535Z gen_embedding_backward_lars_sgd_split_unweighted_kernel_cta.cu 2025-05-07T19:52:22.4071706Z gen_embedding_backward_lars_sgd_split_unweighted_kernel_warp.cu 2025-05-07T19:52:22.4071893Z gen_embedding_backward_partial_rowwise_adam_split_weighted_cuda.cu 2025-05-07T19:52:22.4072098Z gen_embedding_backward_partial_rowwise_adam_split_weighted_kernel_cta.cu 2025-05-07T19:52:22.4072313Z gen_embedding_backward_partial_rowwise_adam_split_weighted_kernel_warp.cu 2025-05-07T19:52:22.4072536Z gen_embedding_backward_partial_rowwise_adam_split_unweighted_nobag_cuda.cu 2025-05-07T19:52:22.4072768Z gen_embedding_backward_partial_rowwise_adam_split_unweighted_nobag_kernel_cta.cu 2025-05-07T19:52:22.4073106Z gen_embedding_backward_partial_rowwise_adam_split_unweighted_nobag_kernel_warp.cu 2025-05-07T19:52:22.4073484Z gen_embedding_backward_partial_rowwise_adam_split_unweighted_cuda.cu 2025-05-07T19:52:22.4073723Z gen_embedding_backward_partial_rowwise_adam_split_unweighted_kernel_cta.cu 2025-05-07T19:52:22.4073951Z gen_embedding_backward_partial_rowwise_adam_split_unweighted_kernel_warp.cu 2025-05-07T19:52:22.4074148Z gen_embedding_backward_partial_rowwise_lamb_split_weighted_cuda.cu 2025-05-07T19:52:22.4074383Z 
gen_embedding_backward_partial_rowwise_lamb_split_weighted_kernel_cta.cu 2025-05-07T19:52:22.4074656Z gen_embedding_backward_partial_rowwise_lamb_split_weighted_kernel_warp.cu 2025-05-07T19:52:22.4074878Z gen_embedding_backward_partial_rowwise_lamb_split_unweighted_nobag_cuda.cu 2025-05-07T19:52:22.4075142Z gen_embedding_backward_partial_rowwise_lamb_split_unweighted_nobag_kernel_cta.cu 2025-05-07T19:52:22.4075395Z gen_embedding_backward_partial_rowwise_lamb_split_unweighted_nobag_kernel_warp.cu 2025-05-07T19:52:22.4075645Z gen_embedding_backward_partial_rowwise_lamb_split_unweighted_cuda.cu 2025-05-07T19:52:22.4075886Z gen_embedding_backward_partial_rowwise_lamb_split_unweighted_kernel_cta.cu 2025-05-07T19:52:22.4076114Z gen_embedding_backward_partial_rowwise_lamb_split_unweighted_kernel_warp.cu 2025-05-07T19:52:22.4076250Z gen_embedding_backward_none_split_weighted_cuda.cu 2025-05-07T19:52:22.4076409Z gen_embedding_backward_none_split_weighted_kernel_cta.cu 2025-05-07T19:52:22.4076581Z gen_embedding_backward_none_split_weighted_kernel_warp.cu 2025-05-07T19:52:22.4076747Z gen_embedding_backward_none_split_unweighted_nobag_cuda.cu 2025-05-07T19:52:22.4076951Z gen_embedding_backward_none_split_unweighted_nobag_kernel_cta.cu 2025-05-07T19:52:22.4077155Z gen_embedding_backward_none_split_unweighted_nobag_kernel_warp.cu 2025-05-07T19:52:22.4077298Z gen_embedding_backward_none_split_unweighted_cuda.cu 2025-05-07T19:52:22.4077464Z gen_embedding_backward_none_split_unweighted_kernel_cta.cu 2025-05-07T19:52:22.4077659Z gen_embedding_backward_none_split_unweighted_kernel_warp.cu 2025-05-07T19:52:22.4077885Z gen_embedding_backward_rowwise_adagrad_with_counter_split_weighted_cuda.cu 2025-05-07T19:52:22.4078140Z gen_embedding_backward_rowwise_adagrad_with_counter_split_weighted_kernel_cta.cu 2025-05-07T19:52:22.4078396Z gen_embedding_backward_rowwise_adagrad_with_counter_split_weighted_kernel_warp.cu 2025-05-07T19:52:22.4078655Z 
gen_embedding_backward_rowwise_adagrad_with_counter_split_unweighted_nobag_cuda.cu 2025-05-07T19:52:22.4078939Z gen_embedding_backward_rowwise_adagrad_with_counter_split_unweighted_nobag_kernel_cta.cu 2025-05-07T19:52:22.4079226Z gen_embedding_backward_rowwise_adagrad_with_counter_split_unweighted_nobag_kernel_warp.cu 2025-05-07T19:52:22.4079471Z gen_embedding_backward_rowwise_adagrad_with_counter_split_unweighted_cuda.cu 2025-05-07T19:52:22.4079724Z gen_embedding_backward_rowwise_adagrad_with_counter_split_unweighted_kernel_cta.cu 2025-05-07T19:52:22.4079990Z gen_embedding_backward_rowwise_adagrad_with_counter_split_unweighted_kernel_warp.cu 2025-05-07T19:52:22.4080076Z 2025-05-07T19:52:22.4080165Z HIPified Source Files: 2025-05-07T19:52:22.4080170Z 2025-05-07T19:52:22.4080242Z 2025-05-07T19:52:22.4080334Z Library Dependencies: 2025-05-07T19:52:22.4080416Z torch 2025-05-07T19:52:22.4080494Z torch_library 2025-05-07T19:52:22.4080806Z /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/lib/libc10.so 2025-05-07T19:52:22.4080994Z /github/home/miniconda/envs/build_binary/lib/libnvrtc.so 2025-05-07T19:52:22.4081325Z /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/lib/libc10_cuda.so 2025-05-07T19:52:22.4081687Z /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/lib/libnccl.so.2 2025-05-07T19:52:22.4081895Z /github/home/miniconda/envs/build_binary/lib/stubs/libcuda.so 2025-05-07T19:52:22.4081965Z fbgemm 2025-05-07T19:52:22.4082047Z fbgemm_gpu_config 2025-05-07T19:52:22.4082134Z fbgemm_gpu_tbe_cache 2025-05-07T19:52:22.4082235Z fbgemm_gpu_tbe_common 2025-05-07T19:52:22.4082317Z fbgemm_gpu_tbe_utils 2025-05-07T19:52:22.4082421Z fbgemm_gpu_sparse_async_cumsum 2025-05-07T19:52:22.4082656Z /github/home/miniconda/envs/build_binary/lib/stubs/libnvidia-ml.so 2025-05-07T19:52:22.4082727Z 2025-05-07T19:52:22.4082811Z Output Library: 2025-05-07T19:52:22.4082908Z fbgemm_gpu_tbe_training_backward 
2025-05-07T19:52:22.4082989Z 2025-05-07T19:52:22.4083074Z Destination Directory: 2025-05-07T19:52:22.4083149Z fbgemm_gpu 2025-05-07T19:52:22.4083271Z ================================================================================ 2025-05-07T19:52:22.4083317Z 2025-05-07T19:52:22.4083322Z 2025-05-07T19:52:22.4083325Z 2025-05-07T19:52:22.4083434Z ================================================================================ 2025-05-07T19:52:22.4083640Z GPU CPP Library Target: fbgemm_gpu_tbe_training_backward_gwd (SHARED) 2025-05-07T19:52:22.4083728Z 2025-05-07T19:52:22.4083805Z CPU_SRCS: 2025-05-07T19:52:22.4083809Z 2025-05-07T19:52:22.4083920Z 2025-05-07T19:52:22.4083995Z GPU_SRCS: 2025-05-07T19:52:22.4084202Z gen_embedding_backward_rowwise_adagrad_split_weighted_gwd_cuda.cu 2025-05-07T19:52:22.4084424Z gen_embedding_backward_rowwise_adagrad_split_weighted_gwd_kernel_cta.cu 2025-05-07T19:52:22.4084648Z gen_embedding_backward_rowwise_adagrad_split_weighted_gwd_kernel_warp.cu 2025-05-07T19:52:22.4084859Z gen_embedding_backward_rowwise_adagrad_split_unweighted_gwd_cuda.cu 2025-05-07T19:52:22.4085080Z gen_embedding_backward_rowwise_adagrad_split_unweighted_gwd_kernel_cta.cu 2025-05-07T19:52:22.4085307Z gen_embedding_backward_rowwise_adagrad_split_unweighted_gwd_kernel_warp.cu 2025-05-07T19:52:22.4085646Z gen_embedding_backward_rowwise_adagrad_split_weighted_vbe_gwd_cuda.cu 2025-05-07T19:52:22.4085862Z gen_embedding_backward_rowwise_adagrad_split_weighted_vbe_gwd_kernel_cta.cu 2025-05-07T19:52:22.4086078Z gen_embedding_backward_rowwise_adagrad_split_weighted_vbe_gwd_kernel_warp.cu 2025-05-07T19:52:22.4086288Z gen_embedding_backward_rowwise_adagrad_split_unweighted_vbe_gwd_cuda.cu 2025-05-07T19:52:22.4086510Z gen_embedding_backward_rowwise_adagrad_split_unweighted_vbe_gwd_kernel_cta.cu 2025-05-07T19:52:22.4086732Z gen_embedding_backward_rowwise_adagrad_split_unweighted_vbe_gwd_kernel_warp.cu 2025-05-07T19:52:22.4086804Z 2025-05-07T19:52:22.4086895Z CUDA_SPECIFIC_SRCS: 
2025-05-07T19:52:22.4086899Z 2025-05-07T19:52:22.4086969Z 2025-05-07T19:52:22.4087046Z HIP_SPECIFIC_SRCS: 2025-05-07T19:52:22.4087050Z 2025-05-07T19:52:22.4087126Z 2025-05-07T19:52:22.4087197Z OTHER_SRCS: 2025-05-07T19:52:22.4087201Z 2025-05-07T19:52:22.4087265Z 2025-05-07T19:52:22.4087343Z CC_FLAGS: 2025-05-07T19:52:22.4087364Z 2025-05-07T19:52:22.4087425Z 2025-05-07T19:52:22.4087497Z NVCC_FLAGS: 2025-05-07T19:52:22.4087591Z --expt-relaxed-constexpr 2025-05-07T19:52:22.4087700Z -D__CUDA_NO_HALF_OPERATORS__ 2025-05-07T19:52:22.4087801Z -D__CUDA_NO_BFLOAT16_CONVERSIONS__ 2025-05-07T19:52:22.4087887Z -D__CUDA_NO_HALF2_OPERATORS__ 2025-05-07T19:52:22.4087961Z 2025-05-07T19:52:22.4088034Z HIPCC_FLAGS: 2025-05-07T19:52:22.4088038Z 2025-05-07T19:52:22.4088107Z 2025-05-07T19:52:22.4088183Z INCLUDE_DIRS: 2025-05-07T19:52:22.4088294Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/../include 2025-05-07T19:52:22.4088379Z /__w/FBGEMM/FBGEMM/fbgemm_gpu 2025-05-07T19:52:22.4088472Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include 2025-05-07T19:52:22.4088575Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/../include 2025-05-07T19:52:22.4088837Z /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include 2025-05-07T19:52:22.4089206Z /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include 2025-05-07T19:52:22.4089338Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src 2025-05-07T19:52:22.4089501Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include 2025-05-07T19:52:22.4089647Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include 2025-05-07T19:52:22.4089839Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include 2025-05-07T19:52:22.4090039Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include 2025-05-07T19:52:22.4090170Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include 2025-05-07T19:52:22.4090461Z /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include 
2025-05-07T19:52:22.4090542Z 2025-05-07T19:52:22.4090628Z Selected Source Files: 2025-05-07T19:52:22.4090809Z gen_embedding_backward_rowwise_adagrad_split_weighted_gwd_cuda.cu 2025-05-07T19:52:22.4091013Z gen_embedding_backward_rowwise_adagrad_split_weighted_gwd_kernel_cta.cu 2025-05-07T19:52:22.4091229Z gen_embedding_backward_rowwise_adagrad_split_weighted_gwd_kernel_warp.cu 2025-05-07T19:52:22.4091465Z gen_embedding_backward_rowwise_adagrad_split_unweighted_gwd_cuda.cu 2025-05-07T19:52:22.4091672Z gen_embedding_backward_rowwise_adagrad_split_unweighted_gwd_kernel_cta.cu 2025-05-07T19:52:22.4091890Z gen_embedding_backward_rowwise_adagrad_split_unweighted_gwd_kernel_warp.cu 2025-05-07T19:52:22.4092119Z gen_embedding_backward_rowwise_adagrad_split_weighted_vbe_gwd_cuda.cu 2025-05-07T19:52:22.4092333Z gen_embedding_backward_rowwise_adagrad_split_weighted_vbe_gwd_kernel_cta.cu 2025-05-07T19:52:22.4092563Z gen_embedding_backward_rowwise_adagrad_split_weighted_vbe_gwd_kernel_warp.cu 2025-05-07T19:52:22.4092772Z gen_embedding_backward_rowwise_adagrad_split_unweighted_vbe_gwd_cuda.cu 2025-05-07T19:52:22.4092997Z gen_embedding_backward_rowwise_adagrad_split_unweighted_vbe_gwd_kernel_cta.cu 2025-05-07T19:52:22.4093232Z gen_embedding_backward_rowwise_adagrad_split_unweighted_vbe_gwd_kernel_warp.cu 2025-05-07T19:52:22.4093298Z 2025-05-07T19:52:22.4093381Z HIPified Source Files: 2025-05-07T19:52:22.4093386Z 2025-05-07T19:52:22.4093452Z 2025-05-07T19:52:22.4093546Z Library Dependencies: 2025-05-07T19:52:22.4093613Z torch 2025-05-07T19:52:22.4093686Z torch_library 2025-05-07T19:52:22.4093986Z /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/lib/libc10.so 2025-05-07T19:52:22.4094144Z /github/home/miniconda/envs/build_binary/lib/libnvrtc.so 2025-05-07T19:52:22.4094451Z /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/lib/libc10_cuda.so 2025-05-07T19:52:22.4094781Z 
/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/lib/libnccl.so.2 2025-05-07T19:52:22.4094966Z /github/home/miniconda/envs/build_binary/lib/stubs/libcuda.so 2025-05-07T19:52:22.4095061Z fbgemm_gpu_tbe_training_backward 2025-05-07T19:52:22.4095257Z /github/home/miniconda/envs/build_binary/lib/stubs/libnvidia-ml.so 2025-05-07T19:52:22.4095334Z 2025-05-07T19:52:22.4095409Z Output Library: 2025-05-07T19:52:22.4095510Z fbgemm_gpu_tbe_training_backward_gwd 2025-05-07T19:52:22.4095590Z 2025-05-07T19:52:22.4095672Z Destination Directory: 2025-05-07T19:52:22.4095742Z fbgemm_gpu 2025-05-07T19:52:22.4095863Z ================================================================================ 2025-05-07T19:52:22.4095867Z 2025-05-07T19:52:22.4095871Z 2025-05-07T19:52:22.4095875Z 2025-05-07T19:52:22.4095976Z ================================================================================ 2025-05-07T19:52:22.4096169Z GPU CPP Library Target: fbgemm_gpu_tbe_training_backward_vbe (SHARED) 2025-05-07T19:52:22.4096256Z 2025-05-07T19:52:22.4096325Z CPU_SRCS: 2025-05-07T19:52:22.4096329Z 2025-05-07T19:52:22.4096396Z 2025-05-07T19:52:22.4096471Z GPU_SRCS: 2025-05-07T19:52:22.4096670Z gen_embedding_backward_rowwise_adagrad_split_weighted_vbe_meta.cpp 2025-05-07T19:52:22.4096851Z gen_embedding_backward_rowwise_adagrad_ssd_weighted_vbe_meta.cpp 2025-05-07T19:52:22.4097045Z gen_embedding_backward_rowwise_adagrad_split_unweighted_vbe_meta.cpp 2025-05-07T19:52:22.4097244Z gen_embedding_backward_rowwise_adagrad_ssd_unweighted_vbe_meta.cpp 2025-05-07T19:52:22.4097475Z gen_embedding_backward_rowwise_adagrad_with_counter_split_weighted_vbe_meta.cpp 2025-05-07T19:52:22.4097710Z gen_embedding_backward_rowwise_adagrad_with_counter_split_unweighted_vbe_meta.cpp 2025-05-07T19:52:22.4097863Z gen_embedding_backward_sgd_split_weighted_vbe_meta.cpp 2025-05-07T19:52:22.4098019Z gen_embedding_backward_sgd_split_unweighted_vbe_meta.cpp 2025-05-07T19:52:22.4098168Z 
gen_embedding_backward_dense_split_weighted_vbe_meta.cpp 2025-05-07T19:52:22.4098322Z gen_embedding_backward_dense_split_unweighted_vbe_meta.cpp 2025-05-07T19:52:22.4098481Z gen_embedding_backward_adam_split_weighted_vbe_meta.cpp 2025-05-07T19:52:22.4098631Z gen_embedding_backward_adam_split_unweighted_vbe_meta.cpp 2025-05-07T19:52:22.4098808Z gen_embedding_backward_rowwise_adagrad_split_weighted_vbe_cuda.cu 2025-05-07T19:52:22.4099029Z gen_embedding_backward_rowwise_adagrad_split_weighted_vbe_kernel_cta.cu 2025-05-07T19:52:22.4099293Z gen_embedding_backward_rowwise_adagrad_split_weighted_vbe_kernel_warp.cu 2025-05-07T19:52:22.4099467Z gen_embedding_backward_rowwise_adagrad_ssd_weighted_vbe_cuda.cu 2025-05-07T19:52:22.4099676Z gen_embedding_backward_rowwise_adagrad_ssd_weighted_vbe_kernel_cta.cu 2025-05-07T19:52:22.4099876Z gen_embedding_backward_rowwise_adagrad_ssd_weighted_vbe_kernel_warp.cu 2025-05-07T19:52:22.4100177Z gen_embedding_backward_rowwise_adagrad_split_unweighted_vbe_cuda.cu 2025-05-07T19:52:22.4100391Z gen_embedding_backward_rowwise_adagrad_split_unweighted_vbe_kernel_cta.cu 2025-05-07T19:52:22.4100620Z gen_embedding_backward_rowwise_adagrad_split_unweighted_vbe_kernel_warp.cu 2025-05-07T19:52:22.4100798Z gen_embedding_backward_rowwise_adagrad_ssd_unweighted_vbe_cuda.cu 2025-05-07T19:52:22.4100996Z gen_embedding_backward_rowwise_adagrad_ssd_unweighted_vbe_kernel_cta.cu 2025-05-07T19:52:22.4101217Z gen_embedding_backward_rowwise_adagrad_ssd_unweighted_vbe_kernel_warp.cu 2025-05-07T19:52:22.4101446Z gen_embedding_backward_rowwise_adagrad_with_counter_split_weighted_vbe_cuda.cu 2025-05-07T19:52:22.4101689Z gen_embedding_backward_rowwise_adagrad_with_counter_split_weighted_vbe_kernel_cta.cu 2025-05-07T19:52:22.4101955Z gen_embedding_backward_rowwise_adagrad_with_counter_split_weighted_vbe_kernel_warp.cu 2025-05-07T19:52:22.4102516Z gen_embedding_backward_rowwise_adagrad_with_counter_split_unweighted_vbe_cuda.cu 2025-05-07T19:52:22.4102825Z 
gen_embedding_backward_rowwise_adagrad_with_counter_split_unweighted_vbe_kernel_cta.cu 2025-05-07T19:52:22.4103115Z gen_embedding_backward_rowwise_adagrad_with_counter_split_unweighted_vbe_kernel_warp.cu 2025-05-07T19:52:22.4103263Z gen_embedding_backward_sgd_split_weighted_vbe_cuda.cu 2025-05-07T19:52:22.4103435Z gen_embedding_backward_sgd_split_weighted_vbe_kernel_cta.cu 2025-05-07T19:52:22.4103611Z gen_embedding_backward_sgd_split_weighted_vbe_kernel_warp.cu 2025-05-07T19:52:22.4103779Z gen_embedding_backward_sgd_split_unweighted_vbe_cuda.cu 2025-05-07T19:52:22.4103961Z gen_embedding_backward_sgd_split_unweighted_vbe_kernel_cta.cu 2025-05-07T19:52:22.4104142Z gen_embedding_backward_sgd_split_unweighted_vbe_kernel_warp.cu 2025-05-07T19:52:22.4104306Z gen_embedding_backward_dense_split_weighted_vbe_cuda.cu 2025-05-07T19:52:22.4104486Z gen_embedding_backward_dense_split_weighted_vbe_kernel_cta.cu 2025-05-07T19:52:22.4104674Z gen_embedding_backward_dense_split_weighted_vbe_kernel_warp.cu 2025-05-07T19:52:22.4104854Z gen_embedding_backward_dense_split_unweighted_vbe_cuda.cu 2025-05-07T19:52:22.4105042Z gen_embedding_backward_dense_split_unweighted_vbe_kernel_cta.cu 2025-05-07T19:52:22.4105232Z gen_embedding_backward_dense_split_unweighted_vbe_kernel_warp.cu 2025-05-07T19:52:22.4105396Z gen_embedding_backward_adam_split_weighted_vbe_cuda.cu 2025-05-07T19:52:22.4105567Z gen_embedding_backward_adam_split_weighted_vbe_kernel_cta.cu 2025-05-07T19:52:22.4105742Z gen_embedding_backward_adam_split_weighted_vbe_kernel_warp.cu 2025-05-07T19:52:22.4105900Z gen_embedding_backward_adam_split_unweighted_vbe_cuda.cu 2025-05-07T19:52:22.4106103Z gen_embedding_backward_adam_split_unweighted_vbe_kernel_cta.cu 2025-05-07T19:52:22.4106293Z gen_embedding_backward_adam_split_unweighted_vbe_kernel_warp.cu 2025-05-07T19:52:22.4106368Z 2025-05-07T19:52:22.4106469Z CUDA_SPECIFIC_SRCS: 2025-05-07T19:52:22.4106473Z 2025-05-07T19:52:22.4106544Z 2025-05-07T19:52:22.4106631Z HIP_SPECIFIC_SRCS: 
2025-05-07T19:52:22.4106636Z 2025-05-07T19:52:22.4106706Z 2025-05-07T19:52:22.4106805Z OTHER_SRCS: 2025-05-07T19:52:22.4106809Z 2025-05-07T19:52:22.4106880Z 2025-05-07T19:52:22.4106955Z CC_FLAGS: 2025-05-07T19:52:22.4106959Z 2025-05-07T19:52:22.4107051Z 2025-05-07T19:52:22.4107130Z NVCC_FLAGS: 2025-05-07T19:52:22.4107222Z --expt-relaxed-constexpr 2025-05-07T19:52:22.4107330Z -D__CUDA_NO_HALF_OPERATORS__ 2025-05-07T19:52:22.4107435Z -D__CUDA_NO_BFLOAT16_CONVERSIONS__ 2025-05-07T19:52:22.4107530Z -D__CUDA_NO_HALF2_OPERATORS__ 2025-05-07T19:52:22.4107601Z 2025-05-07T19:52:22.4107695Z HIPCC_FLAGS: 2025-05-07T19:52:22.4107791Z 2025-05-07T19:52:22.4107869Z 2025-05-07T19:52:22.4107947Z INCLUDE_DIRS: 2025-05-07T19:52:22.4108069Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/../include 2025-05-07T19:52:22.4108165Z /__w/FBGEMM/FBGEMM/fbgemm_gpu 2025-05-07T19:52:22.4108268Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include 2025-05-07T19:52:22.4108374Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/../include 2025-05-07T19:52:22.4108745Z /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include 2025-05-07T19:52:22.4109148Z /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include 2025-05-07T19:52:22.4109290Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src 2025-05-07T19:52:22.4109461Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include 2025-05-07T19:52:22.4109619Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include 2025-05-07T19:52:22.4109821Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include 2025-05-07T19:52:22.4110032Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include 2025-05-07T19:52:22.4110185Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include 2025-05-07T19:52:22.4110499Z /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include 2025-05-07T19:52:22.4110579Z 2025-05-07T19:52:22.4110691Z Selected Source Files: 2025-05-07T19:52:22.4110906Z 
gen_embedding_backward_rowwise_adagrad_split_weighted_vbe_meta.cpp 2025-05-07T19:52:22.4111106Z gen_embedding_backward_rowwise_adagrad_ssd_weighted_vbe_meta.cpp 2025-05-07T19:52:22.4111326Z gen_embedding_backward_rowwise_adagrad_split_unweighted_vbe_meta.cpp 2025-05-07T19:52:22.4111528Z gen_embedding_backward_rowwise_adagrad_ssd_unweighted_vbe_meta.cpp 2025-05-07T19:52:22.4111783Z gen_embedding_backward_rowwise_adagrad_with_counter_split_weighted_vbe_meta.cpp 2025-05-07T19:52:22.4112052Z gen_embedding_backward_rowwise_adagrad_with_counter_split_unweighted_vbe_meta.cpp 2025-05-07T19:52:22.4112207Z gen_embedding_backward_sgd_split_weighted_vbe_meta.cpp 2025-05-07T19:52:22.4112378Z gen_embedding_backward_sgd_split_unweighted_vbe_meta.cpp 2025-05-07T19:52:22.4112543Z gen_embedding_backward_dense_split_weighted_vbe_meta.cpp 2025-05-07T19:52:22.4112731Z gen_embedding_backward_dense_split_unweighted_vbe_meta.cpp 2025-05-07T19:52:22.4112973Z gen_embedding_backward_adam_split_weighted_vbe_meta.cpp 2025-05-07T19:52:22.4113146Z gen_embedding_backward_adam_split_unweighted_vbe_meta.cpp 2025-05-07T19:52:22.4113355Z gen_embedding_backward_rowwise_adagrad_split_weighted_vbe_cuda.cu 2025-05-07T19:52:22.4113580Z gen_embedding_backward_rowwise_adagrad_split_weighted_vbe_kernel_cta.cu 2025-05-07T19:52:22.4113825Z gen_embedding_backward_rowwise_adagrad_split_weighted_vbe_kernel_warp.cu 2025-05-07T19:52:22.4114027Z gen_embedding_backward_rowwise_adagrad_ssd_weighted_vbe_cuda.cu 2025-05-07T19:52:22.4114238Z gen_embedding_backward_rowwise_adagrad_ssd_weighted_vbe_kernel_cta.cu 2025-05-07T19:52:22.4114460Z gen_embedding_backward_rowwise_adagrad_ssd_weighted_vbe_kernel_warp.cu 2025-05-07T19:52:22.4114669Z gen_embedding_backward_rowwise_adagrad_split_unweighted_vbe_cuda.cu 2025-05-07T19:52:22.4114912Z gen_embedding_backward_rowwise_adagrad_split_unweighted_vbe_kernel_cta.cu 2025-05-07T19:52:22.4115146Z gen_embedding_backward_rowwise_adagrad_split_unweighted_vbe_kernel_warp.cu 
2025-05-07T19:52:22.4115343Z gen_embedding_backward_rowwise_adagrad_ssd_unweighted_vbe_cuda.cu 2025-05-07T19:52:22.4115581Z gen_embedding_backward_rowwise_adagrad_ssd_unweighted_vbe_kernel_cta.cu 2025-05-07T19:52:22.4115799Z gen_embedding_backward_rowwise_adagrad_ssd_unweighted_vbe_kernel_warp.cu 2025-05-07T19:52:22.4116045Z gen_embedding_backward_rowwise_adagrad_with_counter_split_weighted_vbe_cuda.cu 2025-05-07T19:52:22.4116329Z gen_embedding_backward_rowwise_adagrad_with_counter_split_weighted_vbe_kernel_cta.cu 2025-05-07T19:52:22.4116598Z gen_embedding_backward_rowwise_adagrad_with_counter_split_weighted_vbe_kernel_warp.cu 2025-05-07T19:52:22.4116848Z gen_embedding_backward_rowwise_adagrad_with_counter_split_unweighted_vbe_cuda.cu 2025-05-07T19:52:22.4117195Z gen_embedding_backward_rowwise_adagrad_with_counter_split_unweighted_vbe_kernel_cta.cu 2025-05-07T19:52:22.4117475Z gen_embedding_backward_rowwise_adagrad_with_counter_split_unweighted_vbe_kernel_warp.cu 2025-05-07T19:52:22.4117626Z gen_embedding_backward_sgd_split_weighted_vbe_cuda.cu 2025-05-07T19:52:22.4117851Z gen_embedding_backward_sgd_split_weighted_vbe_kernel_cta.cu 2025-05-07T19:52:22.4118046Z gen_embedding_backward_sgd_split_weighted_vbe_kernel_warp.cu 2025-05-07T19:52:22.4118202Z gen_embedding_backward_sgd_split_unweighted_vbe_cuda.cu 2025-05-07T19:52:22.4118385Z gen_embedding_backward_sgd_split_unweighted_vbe_kernel_cta.cu 2025-05-07T19:52:22.4118586Z gen_embedding_backward_sgd_split_unweighted_vbe_kernel_warp.cu 2025-05-07T19:52:22.4118744Z gen_embedding_backward_dense_split_weighted_vbe_cuda.cu 2025-05-07T19:52:22.4118922Z gen_embedding_backward_dense_split_weighted_vbe_kernel_cta.cu 2025-05-07T19:52:22.4119126Z gen_embedding_backward_dense_split_weighted_vbe_kernel_warp.cu 2025-05-07T19:52:22.4119294Z gen_embedding_backward_dense_split_unweighted_vbe_cuda.cu 2025-05-07T19:52:22.4119482Z gen_embedding_backward_dense_split_unweighted_vbe_kernel_cta.cu 2025-05-07T19:52:22.4119678Z 
gen_embedding_backward_dense_split_unweighted_vbe_kernel_warp.cu 2025-05-07T19:52:22.4119842Z gen_embedding_backward_adam_split_weighted_vbe_cuda.cu 2025-05-07T19:52:22.4120027Z gen_embedding_backward_adam_split_weighted_vbe_kernel_cta.cu 2025-05-07T19:52:22.4120207Z gen_embedding_backward_adam_split_weighted_vbe_kernel_warp.cu 2025-05-07T19:52:22.4120382Z gen_embedding_backward_adam_split_unweighted_vbe_cuda.cu 2025-05-07T19:52:22.4120565Z gen_embedding_backward_adam_split_unweighted_vbe_kernel_cta.cu 2025-05-07T19:52:22.4120758Z gen_embedding_backward_adam_split_unweighted_vbe_kernel_warp.cu 2025-05-07T19:52:22.4120852Z 2025-05-07T19:52:22.4120947Z HIPified Source Files: 2025-05-07T19:52:22.4120952Z 2025-05-07T19:52:22.4121026Z 2025-05-07T19:52:22.4121123Z Library Dependencies: 2025-05-07T19:52:22.4121218Z torch 2025-05-07T19:52:22.4121300Z torch_library 2025-05-07T19:52:22.4121612Z /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/lib/libc10.so 2025-05-07T19:52:22.4121793Z /github/home/miniconda/envs/build_binary/lib/libnvrtc.so 2025-05-07T19:52:22.4122127Z /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/lib/libc10_cuda.so 2025-05-07T19:52:22.4122481Z /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/lib/libnccl.so.2 2025-05-07T19:52:22.4122683Z /github/home/miniconda/envs/build_binary/lib/stubs/libcuda.so 2025-05-07T19:52:22.4122785Z fbgemm_gpu_tbe_training_backward 2025-05-07T19:52:22.4122997Z /github/home/miniconda/envs/build_binary/lib/stubs/libnvidia-ml.so 2025-05-07T19:52:22.4123069Z 2025-05-07T19:52:22.4123170Z Output Library: 2025-05-07T19:52:22.4123274Z fbgemm_gpu_tbe_training_backward_vbe 2025-05-07T19:52:22.4123349Z 2025-05-07T19:52:22.4123454Z Destination Directory: 2025-05-07T19:52:22.4123539Z fbgemm_gpu 2025-05-07T19:52:22.4123647Z ================================================================================ 2025-05-07T19:52:22.4123652Z 
2025-05-07T19:52:22.4123657Z 2025-05-07T19:52:22.4123660Z 2025-05-07T19:52:22.4123787Z ================================================================================ 2025-05-07T19:52:22.4124006Z GPU CPP Library Target: fbgemm_gpu_tbe_training_backward_dense (SHARED) 2025-05-07T19:52:22.4124080Z 2025-05-07T19:52:22.4124174Z CPU_SRCS: 2025-05-07T19:52:22.4124178Z 2025-05-07T19:52:22.4124259Z 2025-05-07T19:52:22.4124334Z GPU_SRCS: 2025-05-07T19:52:22.4124474Z gen_embedding_backward_dense_split_weighted_cuda.cu 2025-05-07T19:52:22.4124622Z gen_embedding_backward_dense_split_weighted_meta.cpp 2025-05-07T19:52:22.4124797Z gen_embedding_backward_dense_split_weighted_kernel_cta.cu 2025-05-07T19:52:22.4124964Z gen_embedding_backward_dense_split_weighted_kernel_warp.cu 2025-05-07T19:52:22.4125136Z gen_embedding_backward_dense_split_unweighted_nobag_cuda.cu 2025-05-07T19:52:22.4125505Z gen_embedding_backward_dense_split_unweighted_nobag_meta.cpp 2025-05-07T19:52:22.4125686Z gen_embedding_backward_dense_split_unweighted_nobag_kernel_cta.cu 2025-05-07T19:52:22.4125876Z gen_embedding_backward_dense_split_unweighted_nobag_kernel_warp.cu 2025-05-07T19:52:22.4126026Z gen_embedding_backward_dense_split_unweighted_cuda.cu 2025-05-07T19:52:22.4126226Z gen_embedding_backward_dense_split_unweighted_meta.cpp 2025-05-07T19:52:22.4126396Z gen_embedding_backward_dense_split_unweighted_kernel_cta.cu 2025-05-07T19:52:22.4126561Z gen_embedding_backward_dense_split_unweighted_kernel_warp.cu 2025-05-07T19:52:22.4126675Z gen_embedding_backward_split_dense.cpp 2025-05-07T19:52:22.4126750Z 2025-05-07T19:52:22.4126833Z CUDA_SPECIFIC_SRCS: 2025-05-07T19:52:22.4126837Z 2025-05-07T19:52:22.4126919Z 2025-05-07T19:52:22.4127000Z HIP_SPECIFIC_SRCS: 2025-05-07T19:52:22.4127004Z 2025-05-07T19:52:22.4127078Z 2025-05-07T19:52:22.4127163Z OTHER_SRCS: 2025-05-07T19:52:22.4127170Z 2025-05-07T19:52:22.4127239Z 2025-05-07T19:52:22.4127316Z CC_FLAGS: 2025-05-07T19:52:22.4127320Z 2025-05-07T19:52:22.4127392Z 
2025-05-07T19:52:22.4127486Z NVCC_FLAGS: 2025-05-07T19:52:22.4127581Z --expt-relaxed-constexpr 2025-05-07T19:52:22.4127676Z -D__CUDA_NO_HALF_OPERATORS__ 2025-05-07T19:52:22.4127782Z -D__CUDA_NO_BFLOAT16_CONVERSIONS__ 2025-05-07T19:52:22.4127868Z -D__CUDA_NO_HALF2_OPERATORS__ 2025-05-07T19:52:22.4127937Z 2025-05-07T19:52:22.4128016Z HIPCC_FLAGS: 2025-05-07T19:52:22.4128020Z 2025-05-07T19:52:22.4128102Z 2025-05-07T19:52:22.4128180Z INCLUDE_DIRS: 2025-05-07T19:52:22.4128287Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/../include 2025-05-07T19:52:22.4128391Z /__w/FBGEMM/FBGEMM/fbgemm_gpu 2025-05-07T19:52:22.4128491Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include 2025-05-07T19:52:22.4128593Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/../include 2025-05-07T19:52:22.4128859Z /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include 2025-05-07T19:52:22.4129234Z /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include 2025-05-07T19:52:22.4129371Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src 2025-05-07T19:52:22.4129521Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include 2025-05-07T19:52:22.4129672Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include 2025-05-07T19:52:22.4129865Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include 2025-05-07T19:52:22.4130059Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include 2025-05-07T19:52:22.4130207Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include 2025-05-07T19:52:22.4130495Z /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include 2025-05-07T19:52:22.4130562Z 2025-05-07T19:52:22.4130649Z Selected Source Files: 2025-05-07T19:52:22.4130801Z gen_embedding_backward_dense_split_weighted_meta.cpp 2025-05-07T19:52:22.4130963Z gen_embedding_backward_dense_split_unweighted_nobag_meta.cpp 2025-05-07T19:52:22.4131107Z gen_embedding_backward_dense_split_unweighted_meta.cpp 2025-05-07T19:52:22.4131223Z 
gen_embedding_backward_split_dense.cpp 2025-05-07T19:52:22.4131348Z gen_embedding_backward_dense_split_weighted_cuda.cu 2025-05-07T19:52:22.4131495Z gen_embedding_backward_dense_split_weighted_kernel_cta.cu 2025-05-07T19:52:22.4131664Z gen_embedding_backward_dense_split_weighted_kernel_warp.cu 2025-05-07T19:52:22.4131826Z gen_embedding_backward_dense_split_unweighted_nobag_cuda.cu 2025-05-07T19:52:22.4132007Z gen_embedding_backward_dense_split_unweighted_nobag_kernel_cta.cu 2025-05-07T19:52:22.4132187Z gen_embedding_backward_dense_split_unweighted_nobag_kernel_warp.cu 2025-05-07T19:52:22.4132337Z gen_embedding_backward_dense_split_unweighted_cuda.cu 2025-05-07T19:52:22.4132494Z gen_embedding_backward_dense_split_unweighted_kernel_cta.cu 2025-05-07T19:52:22.4132652Z gen_embedding_backward_dense_split_unweighted_kernel_warp.cu 2025-05-07T19:52:22.4132729Z 2025-05-07T19:52:22.4132808Z HIPified Source Files: 2025-05-07T19:52:22.4132877Z 2025-05-07T19:52:22.4132942Z 2025-05-07T19:52:22.4133042Z Library Dependencies: 2025-05-07T19:52:22.4133105Z torch 2025-05-07T19:52:22.4133173Z torch_library 2025-05-07T19:52:22.4133460Z /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/lib/libc10.so 2025-05-07T19:52:22.4133619Z /github/home/miniconda/envs/build_binary/lib/libnvrtc.so 2025-05-07T19:52:22.4133974Z /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/lib/libc10_cuda.so 2025-05-07T19:52:22.4134300Z /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/lib/libnccl.so.2 2025-05-07T19:52:22.4134482Z /github/home/miniconda/envs/build_binary/lib/stubs/libcuda.so 2025-05-07T19:52:22.4134575Z fbgemm_gpu_tbe_training_backward 2025-05-07T19:52:22.4134768Z /github/home/miniconda/envs/build_binary/lib/stubs/libnvidia-ml.so 2025-05-07T19:52:22.4134833Z 2025-05-07T19:52:22.4134919Z Output Library: 2025-05-07T19:52:22.4135018Z fbgemm_gpu_tbe_training_backward_dense 2025-05-07T19:52:22.4135088Z 
2025-05-07T19:52:22.4135185Z Destination Directory: 2025-05-07T19:52:22.4135255Z fbgemm_gpu 2025-05-07T19:52:22.4135366Z ================================================================================ 2025-05-07T19:52:22.4135371Z 2025-05-07T19:52:22.4135374Z 2025-05-07T19:52:22.4135378Z 2025-05-07T19:52:22.4135487Z ================================================================================ 2025-05-07T19:52:22.4135698Z GPU CPP Library Target: fbgemm_gpu_tbe_training_backward_split_host (SHARED) 2025-05-07T19:52:22.4135763Z 2025-05-07T19:52:22.4135856Z CPU_SRCS: 2025-05-07T19:52:22.4135860Z 2025-05-07T19:52:22.4135923Z 2025-05-07T19:52:22.4135994Z GPU_SRCS: 2025-05-07T19:52:22.4136102Z gen_embedding_backward_split_adagrad.cpp 2025-05-07T19:52:22.4136245Z gen_embedding_backward_split_rowwise_adagrad.cpp 2025-05-07T19:52:22.4136339Z gen_embedding_backward_split_sgd.cpp 2025-05-07T19:52:22.4136439Z gen_embedding_backward_split_adam.cpp 2025-05-07T19:52:22.4136548Z gen_embedding_backward_split_lamb.cpp 2025-05-07T19:52:22.4136655Z gen_embedding_backward_split_lars_sgd.cpp 2025-05-07T19:52:22.4136947Z gen_embedding_backward_split_partial_rowwise_adam.cpp 2025-05-07T19:52:22.4137092Z gen_embedding_backward_split_partial_rowwise_lamb.cpp 2025-05-07T19:52:22.4137199Z gen_embedding_backward_split_none.cpp 2025-05-07T19:52:22.4137369Z gen_embedding_backward_split_rowwise_adagrad_with_counter.cpp 2025-05-07T19:52:22.4137485Z gen_embedding_backward_split_approx_sgd.cpp 2025-05-07T19:52:22.4137644Z gen_embedding_backward_split_approx_rowwise_adagrad.cpp 2025-05-07T19:52:22.4137835Z gen_embedding_backward_split_approx_rowwise_adagrad_with_counter.cpp 2025-05-07T19:52:22.4138043Z gen_embedding_backward_split_approx_rowwise_adagrad_with_weight_decay.cpp 2025-05-07T19:52:22.4138239Z gen_embedding_backward_split_rowwise_adagrad_with_weight_decay.cpp 2025-05-07T19:52:22.4138398Z gen_embedding_backward_split_rowwise_weighted_adagrad.cpp 2025-05-07T19:52:22.4138512Z 
gen_embedding_backward_ssd_rowwise_adagrad.cpp 2025-05-07T19:52:22.4138654Z gen_embedding_backward_adagrad_split_weighted_meta.cpp 2025-05-07T19:52:22.4138814Z gen_embedding_backward_adagrad_split_unweighted_meta.cpp 2025-05-07T19:52:22.4138983Z gen_embedding_backward_rowwise_adagrad_split_weighted_meta.cpp 2025-05-07T19:52:22.4139161Z gen_embedding_backward_rowwise_adagrad_split_unweighted_meta.cpp 2025-05-07T19:52:22.4139306Z gen_embedding_backward_sgd_split_weighted_meta.cpp 2025-05-07T19:52:22.4139443Z gen_embedding_backward_sgd_split_unweighted_meta.cpp 2025-05-07T19:52:22.4139571Z gen_embedding_backward_adam_split_weighted_meta.cpp 2025-05-07T19:52:22.4139723Z gen_embedding_backward_adam_split_unweighted_meta.cpp 2025-05-07T19:52:22.4139856Z gen_embedding_backward_lamb_split_weighted_meta.cpp 2025-05-07T19:52:22.4139993Z gen_embedding_backward_lamb_split_unweighted_meta.cpp 2025-05-07T19:52:22.4140130Z gen_embedding_backward_lars_sgd_split_weighted_meta.cpp 2025-05-07T19:52:22.4140297Z gen_embedding_backward_lars_sgd_split_unweighted_meta.cpp 2025-05-07T19:52:22.4141157Z gen_embedding_backward_partial_rowwise_adam_split_weighted_meta.cpp 2025-05-07T19:52:22.4141356Z gen_embedding_backward_partial_rowwise_adam_split_unweighted_meta.cpp 2025-05-07T19:52:22.4141559Z gen_embedding_backward_partial_rowwise_lamb_split_weighted_meta.cpp 2025-05-07T19:52:22.4141759Z gen_embedding_backward_partial_rowwise_lamb_split_unweighted_meta.cpp 2025-05-07T19:52:22.4141934Z gen_embedding_backward_none_split_weighted_meta.cpp 2025-05-07T19:52:22.4142090Z gen_embedding_backward_none_split_unweighted_meta.cpp 2025-05-07T19:52:22.4142307Z gen_embedding_backward_rowwise_adagrad_with_counter_split_weighted_meta.cpp 2025-05-07T19:52:22.4142530Z gen_embedding_backward_rowwise_adagrad_with_counter_split_unweighted_meta.cpp 2025-05-07T19:52:22.4142596Z 2025-05-07T19:52:22.4142691Z CUDA_SPECIFIC_SRCS: 2025-05-07T19:52:22.4142696Z 2025-05-07T19:52:22.4142763Z 2025-05-07T19:52:22.4142844Z 
HIP_SPECIFIC_SRCS: 2025-05-07T19:52:22.4142847Z 2025-05-07T19:52:22.4142930Z 2025-05-07T19:52:22.4143001Z OTHER_SRCS: 2025-05-07T19:52:22.4143005Z 2025-05-07T19:52:22.4143071Z 2025-05-07T19:52:22.4143143Z CC_FLAGS: 2025-05-07T19:52:22.4143157Z 2025-05-07T19:52:22.4143232Z 2025-05-07T19:52:22.4143304Z NVCC_FLAGS: 2025-05-07T19:52:22.4143391Z --expt-relaxed-constexpr 2025-05-07T19:52:22.4143490Z -D__CUDA_NO_HALF_OPERATORS__ 2025-05-07T19:52:22.4143587Z -D__CUDA_NO_BFLOAT16_CONVERSIONS__ 2025-05-07T19:52:22.4143677Z -D__CUDA_NO_HALF2_OPERATORS__ 2025-05-07T19:52:22.4143759Z 2025-05-07T19:52:22.4143838Z HIPCC_FLAGS: 2025-05-07T19:52:22.4143842Z 2025-05-07T19:52:22.4143911Z 2025-05-07T19:52:22.4143987Z INCLUDE_DIRS: 2025-05-07T19:52:22.4144094Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/../include 2025-05-07T19:52:22.4144181Z /__w/FBGEMM/FBGEMM/fbgemm_gpu 2025-05-07T19:52:22.4144278Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include 2025-05-07T19:52:22.4144384Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/../include 2025-05-07T19:52:22.4144647Z /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include 2025-05-07T19:52:22.4145017Z /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include 2025-05-07T19:52:22.4145148Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src 2025-05-07T19:52:22.4145313Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include 2025-05-07T19:52:22.4145457Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include 2025-05-07T19:52:22.4145649Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include 2025-05-07T19:52:22.4145848Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include 2025-05-07T19:52:22.4145985Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include 2025-05-07T19:52:22.4146272Z /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include 2025-05-07T19:52:22.4146359Z 2025-05-07T19:52:22.4146440Z Selected Source Files: 2025-05-07T19:52:22.4146547Z 
gen_embedding_backward_split_adagrad.cpp 2025-05-07T19:52:22.4146671Z gen_embedding_backward_split_rowwise_adagrad.cpp 2025-05-07T19:52:22.4146788Z gen_embedding_backward_split_sgd.cpp 2025-05-07T19:52:22.4146885Z gen_embedding_backward_split_adam.cpp 2025-05-07T19:52:22.4146983Z gen_embedding_backward_split_lamb.cpp 2025-05-07T19:52:22.4147105Z gen_embedding_backward_split_lars_sgd.cpp 2025-05-07T19:52:22.4147246Z gen_embedding_backward_split_partial_rowwise_adam.cpp 2025-05-07T19:52:22.4147385Z gen_embedding_backward_split_partial_rowwise_lamb.cpp 2025-05-07T19:52:22.4147480Z gen_embedding_backward_split_none.cpp 2025-05-07T19:52:22.4147673Z gen_embedding_backward_split_rowwise_adagrad_with_counter.cpp 2025-05-07T19:52:22.4147777Z gen_embedding_backward_split_approx_sgd.cpp 2025-05-07T19:52:22.4147919Z gen_embedding_backward_split_approx_rowwise_adagrad.cpp 2025-05-07T19:52:22.4148124Z gen_embedding_backward_split_approx_rowwise_adagrad_with_counter.cpp 2025-05-07T19:52:22.4148329Z gen_embedding_backward_split_approx_rowwise_adagrad_with_weight_decay.cpp 2025-05-07T19:52:22.4148514Z gen_embedding_backward_split_rowwise_adagrad_with_weight_decay.cpp 2025-05-07T19:52:22.4148720Z gen_embedding_backward_split_rowwise_weighted_adagrad.cpp 2025-05-07T19:52:22.4148834Z gen_embedding_backward_ssd_rowwise_adagrad.cpp 2025-05-07T19:52:22.4148970Z gen_embedding_backward_adagrad_split_weighted_meta.cpp 2025-05-07T19:52:22.4149117Z gen_embedding_backward_adagrad_split_unweighted_meta.cpp 2025-05-07T19:52:22.4149345Z gen_embedding_backward_rowwise_adagrad_split_weighted_meta.cpp 2025-05-07T19:52:22.4149524Z gen_embedding_backward_rowwise_adagrad_split_unweighted_meta.cpp 2025-05-07T19:52:22.4149651Z gen_embedding_backward_sgd_split_weighted_meta.cpp 2025-05-07T19:52:22.4149793Z gen_embedding_backward_sgd_split_unweighted_meta.cpp 2025-05-07T19:52:22.4149924Z gen_embedding_backward_adam_split_weighted_meta.cpp 2025-05-07T19:52:22.4150065Z 
gen_embedding_backward_adam_split_unweighted_meta.cpp 2025-05-07T19:52:22.4150200Z gen_embedding_backward_lamb_split_weighted_meta.cpp 2025-05-07T19:52:22.4150333Z gen_embedding_backward_lamb_split_unweighted_meta.cpp 2025-05-07T19:52:22.4150476Z gen_embedding_backward_lars_sgd_split_weighted_meta.cpp 2025-05-07T19:52:22.4150626Z gen_embedding_backward_lars_sgd_split_unweighted_meta.cpp 2025-05-07T19:52:22.4150827Z gen_embedding_backward_partial_rowwise_adam_split_weighted_meta.cpp 2025-05-07T19:52:22.4151020Z gen_embedding_backward_partial_rowwise_adam_split_unweighted_meta.cpp 2025-05-07T19:52:22.4151208Z gen_embedding_backward_partial_rowwise_lamb_split_weighted_meta.cpp 2025-05-07T19:52:22.4151416Z gen_embedding_backward_partial_rowwise_lamb_split_unweighted_meta.cpp 2025-05-07T19:52:22.4151543Z gen_embedding_backward_none_split_weighted_meta.cpp 2025-05-07T19:52:22.4151673Z gen_embedding_backward_none_split_unweighted_meta.cpp 2025-05-07T19:52:22.4151890Z gen_embedding_backward_rowwise_adagrad_with_counter_split_weighted_meta.cpp 2025-05-07T19:52:22.4152111Z gen_embedding_backward_rowwise_adagrad_with_counter_split_unweighted_meta.cpp 2025-05-07T19:52:22.4152175Z 2025-05-07T19:52:22.4152258Z HIPified Source Files: 2025-05-07T19:52:22.4152272Z 2025-05-07T19:52:22.4152337Z 2025-05-07T19:52:22.4152422Z Library Dependencies: 2025-05-07T19:52:22.4152486Z torch 2025-05-07T19:52:22.4152564Z torch_library 2025-05-07T19:52:22.4152931Z /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/lib/libc10.so 2025-05-07T19:52:22.4153098Z /github/home/miniconda/envs/build_binary/lib/libnvrtc.so 2025-05-07T19:52:22.4153612Z /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/lib/libc10_cuda.so 2025-05-07T19:52:22.4153969Z /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/lib/libnccl.so.2 2025-05-07T19:52:22.4154164Z /github/home/miniconda/envs/build_binary/lib/stubs/libcuda.so 
2025-05-07T19:52:22.4154248Z fbgemm_gpu_config 2025-05-07T19:52:22.4154348Z fbgemm_gpu_tbe_utils 2025-05-07T19:52:22.4154568Z /github/home/miniconda/envs/build_binary/lib/stubs/libnvidia-ml.so 2025-05-07T19:52:22.4154644Z 2025-05-07T19:52:22.4154739Z Output Library: 2025-05-07T19:52:22.4154863Z fbgemm_gpu_tbe_training_backward_split_host 2025-05-07T19:52:22.4154941Z 2025-05-07T19:52:22.4155029Z Destination Directory: 2025-05-07T19:52:22.4155120Z fbgemm_gpu 2025-05-07T19:52:22.4155232Z ================================================================================ 2025-05-07T19:52:22.4155237Z 2025-05-07T19:52:22.4155241Z 2025-05-07T19:52:22.4155245Z 2025-05-07T19:52:22.4155358Z ================================================================================ 2025-05-07T19:52:22.4155543Z GPU CPP Library Target: fbgemm_gpu_tbe_index_select (SHARED) 2025-05-07T19:52:22.4155619Z 2025-05-07T19:52:22.4155695Z CPU_SRCS: 2025-05-07T19:52:22.4155917Z codegen/training/index_select/batch_index_select_dim0_cpu_host.cpp 2025-05-07T19:52:22.4156106Z codegen/training/index_select/batch_index_select_dim0_ops.cpp 2025-05-07T19:52:22.4156182Z 2025-05-07T19:52:22.4156262Z GPU_SRCS: 2025-05-07T19:52:22.4156464Z codegen/training/index_select/batch_index_select_dim0_host.cpp 2025-05-07T19:52:22.4156658Z gen_batch_index_select_dim0_forward_codegen_cuda.cu 2025-05-07T19:52:22.4156787Z gen_batch_index_select_dim0_forward_kernel.cu 2025-05-07T19:52:22.4156934Z gen_batch_index_select_dim0_forward_kernel_small.cu 2025-05-07T19:52:22.4157074Z gen_batch_index_select_dim0_backward_codegen_cuda.cu 2025-05-07T19:52:22.4157214Z gen_batch_index_select_dim0_backward_kernel_cta.cu 2025-05-07T19:52:22.4157424Z gen_batch_index_select_dim0_backward_kernel_warp.cu 2025-05-07T19:52:22.4157558Z gen_embedding_backward_split_grad_index_select.cu 2025-05-07T19:52:22.4157631Z 2025-05-07T19:52:22.4157719Z CUDA_SPECIFIC_SRCS: 2025-05-07T19:52:22.4157723Z 2025-05-07T19:52:22.4157809Z 2025-05-07T19:52:22.4157896Z 
HIP_SPECIFIC_SRCS: 2025-05-07T19:52:22.4157900Z 2025-05-07T19:52:22.4157975Z 2025-05-07T19:52:22.4158077Z OTHER_SRCS: 2025-05-07T19:52:22.4158081Z 2025-05-07T19:52:22.4158153Z 2025-05-07T19:52:22.4158228Z CC_FLAGS: 2025-05-07T19:52:22.4158232Z 2025-05-07T19:52:22.4158304Z 2025-05-07T19:52:22.4158404Z NVCC_FLAGS: 2025-05-07T19:52:22.4158498Z --expt-relaxed-constexpr 2025-05-07T19:52:22.4158593Z -D__CUDA_NO_HALF_OPERATORS__ 2025-05-07T19:52:22.4158717Z -D__CUDA_NO_BFLOAT16_CONVERSIONS__ 2025-05-07T19:52:22.4158811Z -D__CUDA_NO_HALF2_OPERATORS__ 2025-05-07T19:52:22.4158880Z 2025-05-07T19:52:22.4158961Z HIPCC_FLAGS: 2025-05-07T19:52:22.4158983Z 2025-05-07T19:52:22.4159054Z 2025-05-07T19:52:22.4159133Z INCLUDE_DIRS: 2025-05-07T19:52:22.4159239Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/../include 2025-05-07T19:52:22.4159349Z /__w/FBGEMM/FBGEMM/fbgemm_gpu 2025-05-07T19:52:22.4159452Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include 2025-05-07T19:52:22.4159557Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/../include 2025-05-07T19:52:22.4159852Z /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include 2025-05-07T19:52:22.4160247Z /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include 2025-05-07T19:52:22.4160388Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src 2025-05-07T19:52:22.4160547Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include 2025-05-07T19:52:22.4160719Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include 2025-05-07T19:52:22.4160926Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include 2025-05-07T19:52:22.4161126Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include 2025-05-07T19:52:22.4161280Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include 2025-05-07T19:52:22.4161591Z /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include 2025-05-07T19:52:22.4161663Z 2025-05-07T19:52:22.4161771Z Selected Source Files: 2025-05-07T19:52:22.4161983Z 
codegen/training/index_select/batch_index_select_dim0_cpu_host.cpp 2025-05-07T19:52:22.4162169Z codegen/training/index_select/batch_index_select_dim0_ops.cpp 2025-05-07T19:52:22.4162365Z codegen/training/index_select/batch_index_select_dim0_host.cpp 2025-05-07T19:52:22.4162509Z gen_batch_index_select_dim0_forward_codegen_cuda.cu 2025-05-07T19:52:22.4162638Z gen_batch_index_select_dim0_forward_kernel.cu 2025-05-07T19:52:22.4162775Z gen_batch_index_select_dim0_forward_kernel_small.cu 2025-05-07T19:52:22.4162930Z gen_batch_index_select_dim0_backward_codegen_cuda.cu 2025-05-07T19:52:22.4163065Z gen_batch_index_select_dim0_backward_kernel_cta.cu 2025-05-07T19:52:22.4163198Z gen_batch_index_select_dim0_backward_kernel_warp.cu 2025-05-07T19:52:22.4163352Z gen_embedding_backward_split_grad_index_select.cu 2025-05-07T19:52:22.4163428Z 2025-05-07T19:52:22.4163514Z HIPified Source Files: 2025-05-07T19:52:22.4163518Z 2025-05-07T19:52:22.4163589Z 2025-05-07T19:52:22.4163687Z Library Dependencies: 2025-05-07T19:52:22.4163756Z torch 2025-05-07T19:52:22.4163833Z torch_library 2025-05-07T19:52:22.4164150Z /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/lib/libc10.so 2025-05-07T19:52:22.4164312Z /github/home/miniconda/envs/build_binary/lib/libnvrtc.so 2025-05-07T19:52:22.4164640Z /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/lib/libc10_cuda.so 2025-05-07T19:52:22.4165091Z /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/lib/libnccl.so.2 2025-05-07T19:52:22.4165296Z /github/home/miniconda/envs/build_binary/lib/stubs/libcuda.so 2025-05-07T19:52:22.4165397Z fbgemm_gpu_sparse_async_cumsum 2025-05-07T19:52:22.4165481Z fbgemm_gpu_tbe_utils 2025-05-07T19:52:22.4165858Z /github/home/miniconda/envs/build_binary/lib/stubs/libnvidia-ml.so 2025-05-07T19:52:22.4165927Z 2025-05-07T19:52:22.4166003Z Output Library: 2025-05-07T19:52:22.4166094Z fbgemm_gpu_tbe_index_select 2025-05-07T19:52:22.4166176Z 
2025-05-07T19:52:22.4166261Z Destination Directory: 2025-05-07T19:52:22.4166332Z fbgemm_gpu 2025-05-07T19:52:22.4166446Z ================================================================================ 2025-05-07T19:52:22.4166450Z 2025-05-07T19:52:22.4166454Z 2025-05-07T19:52:22.4166457Z 2025-05-07T19:52:22.4166552Z ================================================================================ 2025-05-07T19:52:22.4166739Z GPU CPP Library Target: fbgemm_gpu_embedding_inplace_ops (SHARED) 2025-05-07T19:52:22.4166819Z 2025-05-07T19:52:22.4166889Z CPU_SRCS: 2025-05-07T19:52:22.4167051Z src/embedding_inplace_ops/embedding_inplace_update_cpu.cpp 2025-05-07T19:52:22.4167124Z 2025-05-07T19:52:22.4167210Z GPU_SRCS: 2025-05-07T19:52:22.4167377Z src/embedding_inplace_ops/embedding_inplace_update_gpu.cpp 2025-05-07T19:52:22.4167521Z src/embedding_inplace_ops/embedding_inplace_update.cu 2025-05-07T19:52:22.4167604Z 2025-05-07T19:52:22.4167682Z CUDA_SPECIFIC_SRCS: 2025-05-07T19:52:22.4167686Z 2025-05-07T19:52:22.4167755Z 2025-05-07T19:52:22.4167847Z HIP_SPECIFIC_SRCS: 2025-05-07T19:52:22.4167851Z 2025-05-07T19:52:22.4167918Z 2025-05-07T19:52:22.4167991Z OTHER_SRCS: 2025-05-07T19:52:22.4167995Z 2025-05-07T19:52:22.4168063Z 2025-05-07T19:52:22.4168157Z CC_FLAGS: 2025-05-07T19:52:22.4168161Z 2025-05-07T19:52:22.4168224Z 2025-05-07T19:52:22.4168298Z NVCC_FLAGS: 2025-05-07T19:52:22.4168406Z --expt-relaxed-constexpr 2025-05-07T19:52:22.4168497Z -D__CUDA_NO_HALF_OPERATORS__ 2025-05-07T19:52:22.4168589Z -D__CUDA_NO_BFLOAT16_CONVERSIONS__ 2025-05-07T19:52:22.4168680Z -D__CUDA_NO_HALF2_OPERATORS__ 2025-05-07T19:52:22.4168761Z 2025-05-07T19:52:22.4168832Z HIPCC_FLAGS: 2025-05-07T19:52:22.4168835Z 2025-05-07T19:52:22.4168900Z 2025-05-07T19:52:22.4168991Z INCLUDE_DIRS: 2025-05-07T19:52:22.4169096Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/../include 2025-05-07T19:52:22.4169180Z /__w/FBGEMM/FBGEMM/fbgemm_gpu 2025-05-07T19:52:22.4169286Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include 2025-05-07T19:52:22.4169392Z 
/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include 2025-05-07T19:52:22.4169663Z /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include 2025-05-07T19:52:22.4170030Z /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include 2025-05-07T19:52:22.4170174Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src 2025-05-07T19:52:22.4170321Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include 2025-05-07T19:52:22.4170464Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include 2025-05-07T19:52:22.4170667Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include 2025-05-07T19:52:22.4170852Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include 2025-05-07T19:52:22.4170983Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include 2025-05-07T19:52:22.4171268Z /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include 2025-05-07T19:52:22.4171346Z 2025-05-07T19:52:22.4171428Z Selected Source Files: 2025-05-07T19:52:22.4171588Z src/embedding_inplace_ops/embedding_inplace_update_cpu.cpp 2025-05-07T19:52:22.4171766Z src/embedding_inplace_ops/embedding_inplace_update_gpu.cpp 2025-05-07T19:52:22.4171910Z src/embedding_inplace_ops/embedding_inplace_update.cu 2025-05-07T19:52:22.4171975Z 2025-05-07T19:52:22.4172074Z HIPified Source Files: 2025-05-07T19:52:22.4172078Z 2025-05-07T19:52:22.4172194Z 2025-05-07T19:52:22.4172275Z Library Dependencies: 2025-05-07T19:52:22.4172345Z torch 2025-05-07T19:52:22.4172432Z torch_library 2025-05-07T19:52:22.4172716Z /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/lib/libc10.so 2025-05-07T19:52:22.4172872Z /github/home/miniconda/envs/build_binary/lib/libnvrtc.so 2025-05-07T19:52:22.4173228Z /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/lib/libc10_cuda.so 2025-05-07T19:52:22.4173551Z 
/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/lib/libnccl.so.2 2025-05-07T19:52:22.4173728Z /github/home/miniconda/envs/build_binary/lib/stubs/libcuda.so 2025-05-07T19:52:22.4173941Z /github/home/miniconda/envs/build_binary/lib/stubs/libnvidia-ml.so 2025-05-07T19:52:22.4174008Z 2025-05-07T19:52:22.4174085Z Output Library: 2025-05-07T19:52:22.4174183Z fbgemm_gpu_embedding_inplace_ops 2025-05-07T19:52:22.4174267Z 2025-05-07T19:52:22.4174357Z Destination Directory: 2025-05-07T19:52:22.4174439Z fbgemm_gpu 2025-05-07T19:52:22.4174548Z ================================================================================ 2025-05-07T19:52:22.4174565Z 2025-05-07T19:52:22.4174569Z 2025-05-07T19:52:22.4174573Z 2025-05-07T19:52:22.4174667Z ================================================================================ 2025-05-07T19:52:22.4174785Z GPU CPP Library Target: fbgemm_gpu_py (SHARED) 2025-05-07T19:52:22.4174870Z 2025-05-07T19:52:22.4174938Z CPU_SRCS: 2025-05-07T19:52:22.4175032Z src/memory_utils/memory_utils.cpp 2025-05-07T19:52:22.4175133Z src/memory_utils/memory_utils_ops.cpp 2025-05-07T19:52:22.4175335Z src/merge_pooled_embedding_ops/merge_pooled_embedding_ops_cpu.cpp 2025-05-07T19:52:22.4175535Z src/permute_multi_embedding_ops/permute_multi_embedding_function.cpp 2025-05-07T19:52:22.4175731Z src/permute_multi_embedding_ops/permute_multi_embedding_ops_cpu.cpp 2025-05-07T19:52:22.4175947Z src/permute_pooled_embedding_ops/permute_pooled_embedding_function.cpp 2025-05-07T19:52:22.4176147Z src/permute_pooled_embedding_ops/permute_pooled_embedding_ops_cpu.cpp 2025-05-07T19:52:22.4176364Z src/permute_pooled_embedding_ops/permute_pooled_embedding_ops_split_cpu.cpp 2025-05-07T19:52:22.4176507Z src/jagged_tensor_ops/jagged_tensor_ops_autograd.cpp 2025-05-07T19:52:22.4176642Z src/jagged_tensor_ops/jagged_tensor_ops_meta.cpp 2025-05-07T19:52:22.4176756Z src/jagged_tensor_ops/jagged_tensor_ops_cpu.cpp 2025-05-07T19:52:22.4176867Z 
src/input_combine_ops/input_combine_cpu.cpp 2025-05-07T19:52:22.4177015Z src/layout_transform_ops/layout_transform_ops_cpu.cpp 2025-05-07T19:52:22.4177111Z src/quantize_ops/quantize_ops_cpu.cpp 2025-05-07T19:52:22.4177208Z src/quantize_ops/quantize_ops_meta.cpp 2025-05-07T19:52:22.4177333Z src/sparse_ops/sparse_async_batched_cumsum.cpp 2025-05-07T19:52:22.4177424Z src/sparse_ops/sparse_ops_cpu.cpp 2025-05-07T19:52:22.4177513Z src/sparse_ops/sparse_ops_meta.cpp 2025-05-07T19:52:22.4177595Z src/tbe/eeg/eeg_models.cpp 2025-05-07T19:52:22.4177684Z src/tbe/eeg/eeg_utils.cpp 2025-05-07T19:52:22.4177776Z src/tbe/eeg/indices_estimator_ops.cpp 2025-05-07T19:52:22.4177865Z src/tbe/eeg/indices_estimator.cpp 2025-05-07T19:52:22.4177971Z src/tbe/eeg/indices_generator_ops.cpp 2025-05-07T19:52:22.4178059Z src/tbe/eeg/indices_generator.cpp 2025-05-07T19:52:22.4178280Z src/intraining_embedding_pruning_ops/intraining_embedding_pruning_gpu.cpp 2025-05-07T19:52:22.4178429Z src/layout_transform_ops/layout_transform_ops_gpu.cpp 2025-05-07T19:52:22.4178633Z src/permute_pooled_embedding_ops/permute_pooled_embedding_ops_gpu.cpp 2025-05-07T19:52:22.4178851Z src/permute_pooled_embedding_ops/permute_pooled_embedding_ops_split_gpu.cpp 2025-05-07T19:52:22.4178944Z src/quantize_ops/quantize_ops_gpu.cpp 2025-05-07T19:52:22.4179055Z src/sparse_ops/sparse_ops_gpu.cpp 2025-05-07T19:52:22.4179146Z src/metric_ops/metric_ops_host.cpp 2025-05-07T19:52:22.4179253Z src/input_combine_ops/input_combine_gpu.cpp 2025-05-07T19:52:22.4179455Z src/merge_pooled_embedding_ops/merge_pooled_embedding_ops_gpu.cpp 2025-05-07T19:52:22.4179582Z src/topology_utils.cpp 2025-05-07T19:52:22.4179648Z 2025-05-07T19:52:22.4179719Z GPU_SRCS: 2025-05-07T19:52:22.4179847Z src/histogram_binning_calibration_ops.cu 2025-05-07T19:52:22.4179944Z src/input_combine_ops/input_combine.cu 2025-05-07T19:52:22.4180145Z src/intraining_embedding_pruning_ops/intraining_embedding_pruning.cu 2025-05-07T19:52:22.4180260Z 
src/memory_utils/memory_utils.cu 2025-05-07T19:52:22.4180399Z src/memory_utils/memory_utils_ops.cu 2025-05-07T19:52:22.4180579Z src/jagged_tensor_ops/batched_dense_vec_jagged_2d_mul_backward.cu 2025-05-07T19:52:22.4180756Z src/jagged_tensor_ops/batched_dense_vec_jagged_2d_mul_forward.cu 2025-05-07T19:52:22.4180906Z src/jagged_tensor_ops/dense_to_jagged_forward.cu 2025-05-07T19:52:22.4181025Z src/jagged_tensor_ops/jagged_dense_bmm_forward.cu 2025-05-07T19:52:22.4181265Z src/jagged_tensor_ops/jagged_dense_dense_elementwise_add_jagged_output_forward.cu 2025-05-07T19:52:22.4181455Z src/jagged_tensor_ops/jagged_dense_elementwise_mul_backward.cu 2025-05-07T19:52:22.4181627Z src/jagged_tensor_ops/jagged_dense_elementwise_mul_forward.cu 2025-05-07T19:52:22.4181760Z src/jagged_tensor_ops/jagged_index_add_2d_forward.cu 2025-05-07T19:52:22.4181921Z src/jagged_tensor_ops/jagged_index_select_2d_forward.cu 2025-05-07T19:52:22.4182053Z src/jagged_tensor_ops/jagged_jagged_bmm_forward.cu 2025-05-07T19:52:22.4182178Z src/jagged_tensor_ops/jagged_softmax_backward.cu 2025-05-07T19:52:22.4182299Z src/jagged_tensor_ops/jagged_softmax_forward.cu 2025-05-07T19:52:22.4182425Z src/jagged_tensor_ops/jagged_tensor_ops.cu 2025-05-07T19:52:22.4182572Z src/jagged_tensor_ops/jagged_to_padded_dense_backward.cu 2025-05-07T19:52:22.4182720Z src/jagged_tensor_ops/jagged_to_padded_dense_forward.cu 2025-05-07T19:52:22.4182855Z src/jagged_tensor_ops/jagged_unique_indices.cu 2025-05-07T19:52:22.4182998Z src/jagged_tensor_ops/keyed_jagged_index_select_dim1.cu 2025-05-07T19:52:22.4183129Z src/layout_transform_ops/layout_transform_ops.cu 2025-05-07T19:52:22.4183234Z src/metric_ops/metric_ops.cu 2025-05-07T19:52:22.4183448Z src/permute_pooled_embedding_ops/permute_pooled_embedding_ops_split.cu 2025-05-07T19:52:22.4183628Z src/permute_pooled_embedding_ops/permute_pooled_embedding_ops.cu 2025-05-07T19:52:22.4183801Z src/permute_multi_embedding_ops/permute_multi_embedding_ops.cu 2025-05-07T19:52:22.4183909Z 
src/quantize_ops/quantize_bfloat16.cu 2025-05-07T19:52:22.4184016Z src/quantize_ops/quantize_fp8_rowwise.cu 2025-05-07T19:52:22.4184135Z src/quantize_ops/quantize_fused_8bit_rowwise.cu 2025-05-07T19:52:22.4184263Z src/quantize_ops/quantize_fused_nbit_rowwise.cu 2025-05-07T19:52:22.4184360Z src/quantize_ops/quantize_hfp8.cu 2025-05-07T19:52:22.4184451Z src/quantize_ops/quantize_msfp.cu 2025-05-07T19:52:22.4184571Z src/quantize_ops/quantize_padded_fp8_rowwise.cu 2025-05-07T19:52:22.4184673Z src/quantize_ops/quantize_mx.cu 2025-05-07T19:52:22.4184789Z src/sparse_ops/sparse_async_batched_cumsum.cu 2025-05-07T19:52:22.4184920Z src/sparse_ops/sparse_block_bucketize_features.cu 2025-05-07T19:52:22.4185040Z src/sparse_ops/sparse_bucketize_features.cu 2025-05-07T19:52:22.4185168Z src/sparse_ops/sparse_batched_unary_embeddings.cu 2025-05-07T19:52:22.4185303Z src/sparse_ops/sparse_compute_frequency_sequence.cu 2025-05-07T19:52:22.4185447Z src/sparse_ops/sparse_expand_into_jagged_permute.cu 2025-05-07T19:52:22.4185545Z src/sparse_ops/sparse_group_index.cu 2025-05-07T19:52:22.4185642Z src/sparse_ops/sparse_index_add.cu 2025-05-07T19:52:22.4185742Z src/sparse_ops/sparse_index_select.cu 2025-05-07T19:52:22.4185855Z src/sparse_ops/sparse_invert_permute.cu 2025-05-07T19:52:22.4185974Z src/sparse_ops/sparse_pack_segments_backward.cu 2025-05-07T19:52:22.4186093Z src/sparse_ops/sparse_pack_segments_forward.cu 2025-05-07T19:52:22.4186201Z src/sparse_ops/sparse_permute_1d.cu 2025-05-07T19:52:22.4186296Z src/sparse_ops/sparse_permute_2d.cu 2025-05-07T19:52:22.4186389Z src/sparse_ops/sparse_permute102.cu 2025-05-07T19:52:22.4186498Z src/sparse_ops/sparse_permute_embeddings.cu 2025-05-07T19:52:22.4186600Z src/sparse_ops/sparse_range.cu 2025-05-07T19:52:22.4186751Z src/sparse_ops/sparse_reorder_batched_ad.cu 2025-05-07T19:52:22.4186856Z src/sparse_ops/sparse_segment_sum_csr.cu 2025-05-07T19:52:22.4186965Z src/sparse_ops/sparse_zipf.cu 2025-05-07T19:52:22.4187030Z 
2025-05-07T19:52:22.4187109Z CUDA_SPECIFIC_SRCS:
2025-05-07T19:52:22.4187113Z
2025-05-07T19:52:22.4187180Z
2025-05-07T19:52:22.4187280Z HIP_SPECIFIC_SRCS:
2025-05-07T19:52:22.4187340Z
2025-05-07T19:52:22.4187408Z
2025-05-07T19:52:22.4187482Z OTHER_SRCS:
2025-05-07T19:52:22.4187486Z
2025-05-07T19:52:22.4187573Z
2025-05-07T19:52:22.4187642Z CC_FLAGS:
2025-05-07T19:52:22.4187646Z
2025-05-07T19:52:22.4187708Z
2025-05-07T19:52:22.4187799Z NVCC_FLAGS:
2025-05-07T19:52:22.4187883Z --expt-relaxed-constexpr
2025-05-07T19:52:22.4187968Z -D__CUDA_NO_HALF_OPERATORS__
2025-05-07T19:52:22.4188057Z -D__CUDA_NO_BFLOAT16_CONVERSIONS__
2025-05-07T19:52:22.4188156Z -D__CUDA_NO_HALF2_OPERATORS__
2025-05-07T19:52:22.4188221Z
2025-05-07T19:52:22.4188296Z HIPCC_FLAGS:
2025-05-07T19:52:22.4188303Z
2025-05-07T19:52:22.4188376Z
2025-05-07T19:52:22.4188453Z INCLUDE_DIRS:
2025-05-07T19:52:22.4188553Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/../include
2025-05-07T19:52:22.4188645Z /__w/FBGEMM/FBGEMM/fbgemm_gpu
2025-05-07T19:52:22.4188746Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include
2025-05-07T19:52:22.4188846Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/../include
2025-05-07T19:52:22.4189116Z /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include
2025-05-07T19:52:22.4189487Z /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include
2025-05-07T19:52:22.4189622Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src
2025-05-07T19:52:22.4189768Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include
2025-05-07T19:52:22.4189911Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include
2025-05-07T19:52:22.4190123Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include
2025-05-07T19:52:22.4190310Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include
2025-05-07T19:52:22.4190446Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include
2025-05-07T19:52:22.4190744Z
/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include 2025-05-07T19:52:22.4190816Z 2025-05-07T19:52:22.4190908Z Selected Source Files: 2025-05-07T19:52:22.4191018Z src/memory_utils/memory_utils.cpp 2025-05-07T19:52:22.4191123Z src/memory_utils/memory_utils_ops.cpp 2025-05-07T19:52:22.4191313Z src/merge_pooled_embedding_ops/merge_pooled_embedding_ops_cpu.cpp 2025-05-07T19:52:22.4191521Z src/permute_multi_embedding_ops/permute_multi_embedding_function.cpp 2025-05-07T19:52:22.4191727Z src/permute_multi_embedding_ops/permute_multi_embedding_ops_cpu.cpp 2025-05-07T19:52:22.4191939Z src/permute_pooled_embedding_ops/permute_pooled_embedding_function.cpp 2025-05-07T19:52:22.4192138Z src/permute_pooled_embedding_ops/permute_pooled_embedding_ops_cpu.cpp 2025-05-07T19:52:22.4192368Z src/permute_pooled_embedding_ops/permute_pooled_embedding_ops_split_cpu.cpp 2025-05-07T19:52:22.4192509Z src/jagged_tensor_ops/jagged_tensor_ops_autograd.cpp 2025-05-07T19:52:22.4192632Z src/jagged_tensor_ops/jagged_tensor_ops_meta.cpp 2025-05-07T19:52:22.4192760Z src/jagged_tensor_ops/jagged_tensor_ops_cpu.cpp 2025-05-07T19:52:22.4192959Z src/input_combine_ops/input_combine_cpu.cpp 2025-05-07T19:52:22.4193107Z src/layout_transform_ops/layout_transform_ops_cpu.cpp 2025-05-07T19:52:22.4193385Z src/quantize_ops/quantize_ops_cpu.cpp 2025-05-07T19:52:22.4193511Z src/quantize_ops/quantize_ops_meta.cpp 2025-05-07T19:52:22.4193645Z src/sparse_ops/sparse_async_batched_cumsum.cpp 2025-05-07T19:52:22.4193751Z src/sparse_ops/sparse_ops_cpu.cpp 2025-05-07T19:52:22.4193869Z src/sparse_ops/sparse_ops_meta.cpp 2025-05-07T19:52:22.4193968Z src/tbe/eeg/eeg_models.cpp 2025-05-07T19:52:22.4194058Z src/tbe/eeg/eeg_utils.cpp 2025-05-07T19:52:22.4194200Z src/tbe/eeg/indices_estimator_ops.cpp 2025-05-07T19:52:22.4194317Z src/tbe/eeg/indices_estimator.cpp 2025-05-07T19:52:22.4194475Z src/tbe/eeg/indices_generator_ops.cpp 2025-05-07T19:52:22.4194574Z src/tbe/eeg/indices_generator.cpp 
2025-05-07T19:52:22.4194837Z src/intraining_embedding_pruning_ops/intraining_embedding_pruning_gpu.cpp 2025-05-07T19:52:22.4194990Z src/layout_transform_ops/layout_transform_ops_gpu.cpp 2025-05-07T19:52:22.4195204Z src/permute_pooled_embedding_ops/permute_pooled_embedding_ops_gpu.cpp 2025-05-07T19:52:22.4195494Z src/permute_pooled_embedding_ops/permute_pooled_embedding_ops_split_gpu.cpp 2025-05-07T19:52:22.4195605Z src/quantize_ops/quantize_ops_gpu.cpp 2025-05-07T19:52:22.4195708Z src/sparse_ops/sparse_ops_gpu.cpp 2025-05-07T19:52:22.4195807Z src/metric_ops/metric_ops_host.cpp 2025-05-07T19:52:22.4195947Z src/input_combine_ops/input_combine_gpu.cpp 2025-05-07T19:52:22.4196152Z src/merge_pooled_embedding_ops/merge_pooled_embedding_ops_gpu.cpp 2025-05-07T19:52:22.4196241Z src/topology_utils.cpp 2025-05-07T19:52:22.4196372Z src/histogram_binning_calibration_ops.cu 2025-05-07T19:52:22.4196488Z src/input_combine_ops/input_combine.cu 2025-05-07T19:52:22.4196708Z src/intraining_embedding_pruning_ops/intraining_embedding_pruning.cu 2025-05-07T19:52:22.4196805Z src/memory_utils/memory_utils.cu 2025-05-07T19:52:22.4196921Z src/memory_utils/memory_utils_ops.cu 2025-05-07T19:52:22.4197116Z src/jagged_tensor_ops/batched_dense_vec_jagged_2d_mul_backward.cu 2025-05-07T19:52:22.4197310Z src/jagged_tensor_ops/batched_dense_vec_jagged_2d_mul_forward.cu 2025-05-07T19:52:22.4197458Z src/jagged_tensor_ops/dense_to_jagged_forward.cu 2025-05-07T19:52:22.4197599Z src/jagged_tensor_ops/jagged_dense_bmm_forward.cu 2025-05-07T19:52:22.4197857Z src/jagged_tensor_ops/jagged_dense_dense_elementwise_add_jagged_output_forward.cu 2025-05-07T19:52:22.4198060Z src/jagged_tensor_ops/jagged_dense_elementwise_mul_backward.cu 2025-05-07T19:52:22.4198242Z src/jagged_tensor_ops/jagged_dense_elementwise_mul_forward.cu 2025-05-07T19:52:22.4198386Z src/jagged_tensor_ops/jagged_index_add_2d_forward.cu 2025-05-07T19:52:22.4198534Z src/jagged_tensor_ops/jagged_index_select_2d_forward.cu 
2025-05-07T19:52:22.4198684Z src/jagged_tensor_ops/jagged_jagged_bmm_forward.cu 2025-05-07T19:52:22.4198820Z src/jagged_tensor_ops/jagged_softmax_backward.cu 2025-05-07T19:52:22.4198944Z src/jagged_tensor_ops/jagged_softmax_forward.cu 2025-05-07T19:52:22.4199075Z src/jagged_tensor_ops/jagged_tensor_ops.cu 2025-05-07T19:52:22.4199243Z src/jagged_tensor_ops/jagged_to_padded_dense_backward.cu 2025-05-07T19:52:22.4199400Z src/jagged_tensor_ops/jagged_to_padded_dense_forward.cu 2025-05-07T19:52:22.4199535Z src/jagged_tensor_ops/jagged_unique_indices.cu 2025-05-07T19:52:22.4199691Z src/jagged_tensor_ops/keyed_jagged_index_select_dim1.cu 2025-05-07T19:52:22.4199827Z src/layout_transform_ops/layout_transform_ops.cu 2025-05-07T19:52:22.4199932Z src/metric_ops/metric_ops.cu 2025-05-07T19:52:22.4200164Z src/permute_pooled_embedding_ops/permute_pooled_embedding_ops_split.cu 2025-05-07T19:52:22.4200359Z src/permute_pooled_embedding_ops/permute_pooled_embedding_ops.cu 2025-05-07T19:52:22.4200554Z src/permute_multi_embedding_ops/permute_multi_embedding_ops.cu 2025-05-07T19:52:22.4200670Z src/quantize_ops/quantize_bfloat16.cu 2025-05-07T19:52:22.4200780Z src/quantize_ops/quantize_fp8_rowwise.cu 2025-05-07T19:52:22.4200912Z src/quantize_ops/quantize_fused_8bit_rowwise.cu 2025-05-07T19:52:22.4201052Z src/quantize_ops/quantize_fused_nbit_rowwise.cu 2025-05-07T19:52:22.4201153Z src/quantize_ops/quantize_hfp8.cu 2025-05-07T19:52:22.4201259Z src/quantize_ops/quantize_msfp.cu 2025-05-07T19:52:22.4201386Z src/quantize_ops/quantize_padded_fp8_rowwise.cu 2025-05-07T19:52:22.4201500Z src/quantize_ops/quantize_mx.cu 2025-05-07T19:52:22.4201622Z src/sparse_ops/sparse_async_batched_cumsum.cu 2025-05-07T19:52:22.4201760Z src/sparse_ops/sparse_block_bucketize_features.cu 2025-05-07T19:52:22.4201895Z src/sparse_ops/sparse_bucketize_features.cu 2025-05-07T19:52:22.4202187Z src/sparse_ops/sparse_batched_unary_embeddings.cu 2025-05-07T19:52:22.4202329Z 
src/sparse_ops/sparse_compute_frequency_sequence.cu 2025-05-07T19:52:22.4202560Z src/sparse_ops/sparse_expand_into_jagged_permute.cu 2025-05-07T19:52:22.4202677Z src/sparse_ops/sparse_group_index.cu 2025-05-07T19:52:22.4202779Z src/sparse_ops/sparse_index_add.cu 2025-05-07T19:52:22.4202886Z src/sparse_ops/sparse_index_select.cu 2025-05-07T19:52:22.4203014Z src/sparse_ops/sparse_invert_permute.cu 2025-05-07T19:52:22.4203145Z src/sparse_ops/sparse_pack_segments_backward.cu 2025-05-07T19:52:22.4203338Z src/sparse_ops/sparse_pack_segments_forward.cu 2025-05-07T19:52:22.4203451Z src/sparse_ops/sparse_permute_1d.cu 2025-05-07T19:52:22.4203573Z src/sparse_ops/sparse_permute_2d.cu 2025-05-07T19:52:22.4203672Z src/sparse_ops/sparse_permute102.cu 2025-05-07T19:52:22.4203793Z src/sparse_ops/sparse_permute_embeddings.cu 2025-05-07T19:52:22.4203910Z src/sparse_ops/sparse_range.cu 2025-05-07T19:52:22.4204023Z src/sparse_ops/sparse_reorder_batched_ad.cu 2025-05-07T19:52:22.4204127Z src/sparse_ops/sparse_segment_sum_csr.cu 2025-05-07T19:52:22.4204235Z src/sparse_ops/sparse_zipf.cu 2025-05-07T19:52:22.4204309Z 2025-05-07T19:52:22.4204394Z HIPified Source Files: 2025-05-07T19:52:22.4204399Z 2025-05-07T19:52:22.4204467Z 2025-05-07T19:52:22.4204567Z Library Dependencies: 2025-05-07T19:52:22.4204637Z torch 2025-05-07T19:52:22.4204713Z torch_library 2025-05-07T19:52:22.4205042Z /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/lib/libc10.so 2025-05-07T19:52:22.4205215Z /github/home/miniconda/envs/build_binary/lib/libnvrtc.so 2025-05-07T19:52:22.4205665Z /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/lib/libc10_cuda.so 2025-05-07T19:52:22.4206008Z /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/lib/libnccl.so.2 2025-05-07T19:52:22.4206211Z /github/home/miniconda/envs/build_binary/lib/stubs/libcuda.so 2025-05-07T19:52:22.4206286Z fbgemm 2025-05-07T19:52:22.4206383Z fbgemm_gpu_sparse_async_cumsum 
2025-05-07T19:52:22.4206500Z fbgemm_gpu_embedding_inplace_ops
2025-05-07T19:52:22.4206598Z fbgemm_gpu_tbe_index_select
2025-05-07T19:52:22.4206682Z fbgemm_gpu_tbe_cache
2025-05-07T19:52:22.4206772Z fbgemm_gpu_tbe_optimizers
2025-05-07T19:52:22.4206867Z fbgemm_gpu_tbe_utils
2025-05-07T19:52:22.4207079Z /github/home/miniconda/envs/build_binary/lib/stubs/libnvidia-ml.so
2025-05-07T19:52:22.4207151Z
2025-05-07T19:52:22.4207248Z Output Library:
2025-05-07T19:52:22.4207331Z fbgemm_gpu_py
2025-05-07T19:52:22.4207407Z
2025-05-07T19:52:22.4207497Z Destination Directory:
2025-05-07T19:52:22.4207591Z fbgemm_gpu
2025-05-07T19:52:22.4207701Z ================================================================================
2025-05-07T19:52:22.4207705Z
2025-05-07T19:52:22.4207800Z -- Configuring done (8.8s)
2025-05-07T19:52:22.5446558Z -- Generating done (0.1s)
2025-05-07T19:52:22.5467098Z -- Build files have been written to: /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build
2025-05-07T19:52:22.5597301Z Change Dir: '/__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build'
2025-05-07T19:52:22.5598472Z
2025-05-07T19:52:22.5599375Z Run Build Command(s): /github/home/miniconda/envs/build_binary/bin/ninja -v -j 48 install
2025-05-07T19:52:22.6661954Z [1/608] /github/home/miniconda/envs/build_binary/bin/c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dasmjit_EXPORTS -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/lib -fopenmp=libgomp -stdlib=libstdc++ -I/github/home/miniconda/envs/build_binary/include -O3 -DNDEBUG -std=c++17 -fPIC
-Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -MD -MT CMakeFiles/asmjit.dir/__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/core/errorhandler.cpp.o -MF CMakeFiles/asmjit.dir/__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/core/errorhandler.cpp.o.d -o CMakeFiles/asmjit.dir/__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/core/errorhandler.cpp.o -c /__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/core/errorhandler.cpp 2025-05-07T19:52:22.6676041Z clang-16: warning: argument unused during compilation: '-L/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/lib' [-Wunused-command-line-argument] 2025-05-07T19:52:22.6844889Z [2/608] /github/home/miniconda/envs/build_binary/bin/c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dasmjit_EXPORTS -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/lib -fopenmp=libgomp -stdlib=libstdc++ -I/github/home/miniconda/envs/build_binary/include -O3 -DNDEBUG -std=c++17 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -MD -MT CMakeFiles/asmjit.dir/__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/core/support.cpp.o -MF CMakeFiles/asmjit.dir/__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/core/support.cpp.o.d -o CMakeFiles/asmjit.dir/__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/core/support.cpp.o -c /__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/core/support.cpp 2025-05-07T19:52:22.6855734Z clang-16: warning: argument unused during compilation: '-L/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/lib' [-Wunused-command-line-argument] 
2025-05-07T19:52:22.6870456Z [3/608] /github/home/miniconda/envs/build_binary/bin/c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dasmjit_EXPORTS -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/lib -fopenmp=libgomp -stdlib=libstdc++ -I/github/home/miniconda/envs/build_binary/include -O3 -DNDEBUG -std=c++17 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -MD -MT CMakeFiles/asmjit.dir/__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/core/globals.cpp.o -MF CMakeFiles/asmjit.dir/__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/core/globals.cpp.o.d -o CMakeFiles/asmjit.dir/__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/core/globals.cpp.o -c /__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/core/globals.cpp 2025-05-07T19:52:22.6880989Z clang-16: warning: argument unused during compilation: '-L/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/lib' [-Wunused-command-line-argument] 2025-05-07T19:52:22.6945990Z [4/608] /github/home/miniconda/envs/build_binary/bin/c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dasmjit_EXPORTS -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/lib -fopenmp=libgomp -stdlib=libstdc++ 
-I/github/home/miniconda/envs/build_binary/include -O3 -DNDEBUG -std=c++17 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -MD -MT CMakeFiles/asmjit.dir/__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/core/zonelist.cpp.o -MF CMakeFiles/asmjit.dir/__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/core/zonelist.cpp.o.d -o CMakeFiles/asmjit.dir/__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/core/zonelist.cpp.o -c /__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/core/zonelist.cpp 2025-05-07T19:52:22.6956562Z clang-16: warning: argument unused during compilation: '-L/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/lib' [-Wunused-command-line-argument] 2025-05-07T19:52:22.6966895Z [5/608] /github/home/miniconda/envs/build_binary/bin/c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dasmjit_EXPORTS -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/lib -fopenmp=libgomp -stdlib=libstdc++ -I/github/home/miniconda/envs/build_binary/include -O3 -DNDEBUG -std=c++17 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -MD -MT CMakeFiles/asmjit.dir/__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/core/zonetree.cpp.o -MF CMakeFiles/asmjit.dir/__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/core/zonetree.cpp.o.d -o CMakeFiles/asmjit.dir/__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/core/zonetree.cpp.o -c /__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/core/zonetree.cpp 2025-05-07T19:52:22.6977101Z clang-16: warning: argument unused during compilation: 
'-L/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/lib' [-Wunused-command-line-argument] 2025-05-07T19:52:22.7067809Z [6/608] /github/home/miniconda/envs/build_binary/bin/c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dasmjit_EXPORTS -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/lib -fopenmp=libgomp -stdlib=libstdc++ -I/github/home/miniconda/envs/build_binary/include -O3 -DNDEBUG -std=c++17 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -MD -MT CMakeFiles/asmjit.dir/__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/core/osutils.cpp.o -MF CMakeFiles/asmjit.dir/__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/core/osutils.cpp.o.d -o CMakeFiles/asmjit.dir/__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/core/osutils.cpp.o -c /__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/core/osutils.cpp 2025-05-07T19:52:22.7078565Z clang-16: warning: argument unused during compilation: '-L/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/lib' [-Wunused-command-line-argument] 2025-05-07T19:52:22.7112981Z [7/608] /github/home/miniconda/envs/build_binary/bin/c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dasmjit_EXPORTS -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA 
-L/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/lib -fopenmp=libgomp -stdlib=libstdc++ -I/github/home/miniconda/envs/build_binary/include -O3 -DNDEBUG -std=c++17 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -MD -MT CMakeFiles/asmjit.dir/__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/core/zonestack.cpp.o -MF CMakeFiles/asmjit.dir/__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/core/zonestack.cpp.o.d -o CMakeFiles/asmjit.dir/__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/core/zonestack.cpp.o -c /__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/core/zonestack.cpp 2025-05-07T19:52:22.7119361Z clang-16: warning: argument unused during compilation: '-L/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/lib' [-Wunused-command-line-argument] 2025-05-07T19:52:22.7167232Z [8/608] /github/home/miniconda/envs/build_binary/bin/c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dasmjit_EXPORTS -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/lib -fopenmp=libgomp -stdlib=libstdc++ -I/github/home/miniconda/envs/build_binary/include -O3 -DNDEBUG -std=c++17 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -MD -MT CMakeFiles/asmjit.dir/__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/core/zonehash.cpp.o -MF CMakeFiles/asmjit.dir/__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/core/zonehash.cpp.o.d -o CMakeFiles/asmjit.dir/__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/core/zonehash.cpp.o -c /__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/core/zonehash.cpp 
2025-05-07T19:52:22.7173877Z clang-16: warning: argument unused during compilation: '-L/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/lib' [-Wunused-command-line-argument] 2025-05-07T19:52:22.7179727Z [9/608] /github/home/miniconda/envs/build_binary/bin/c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dasmjit_EXPORTS -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/lib -fopenmp=libgomp -stdlib=libstdc++ -I/github/home/miniconda/envs/build_binary/include -O3 -DNDEBUG -std=c++17 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -MD -MT CMakeFiles/asmjit.dir/__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/core/operand.cpp.o -MF CMakeFiles/asmjit.dir/__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/core/operand.cpp.o.d -o CMakeFiles/asmjit.dir/__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/core/operand.cpp.o -c /__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/core/operand.cpp 2025-05-07T19:52:22.7185701Z clang-16: warning: argument unused during compilation: '-L/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/lib' [-Wunused-command-line-argument] 2025-05-07T19:52:22.7301549Z [10/608] /github/home/miniconda/envs/build_binary/bin/c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dasmjit_EXPORTS -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem 
/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/lib -fopenmp=libgomp -stdlib=libstdc++ -I/github/home/miniconda/envs/build_binary/include -O3 -DNDEBUG -std=c++17 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -MD -MT CMakeFiles/asmjit.dir/__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/core/environment.cpp.o -MF CMakeFiles/asmjit.dir/__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/core/environment.cpp.o.d -o CMakeFiles/asmjit.dir/__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/core/environment.cpp.o -c /__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/core/environment.cpp 2025-05-07T19:52:22.7308281Z clang-16: warning: argument unused during compilation: '-L/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/lib' [-Wunused-command-line-argument] 2025-05-07T19:52:22.7635331Z [11/608] /github/home/miniconda/envs/build_binary/bin/c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dasmjit_EXPORTS -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/lib -fopenmp=libgomp -stdlib=libstdc++ -I/github/home/miniconda/envs/build_binary/include -O3 -DNDEBUG -std=c++17 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -MD -MT CMakeFiles/asmjit.dir/__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/core/string.cpp.o -MF CMakeFiles/asmjit.dir/__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/core/string.cpp.o.d -o 
CMakeFiles/asmjit.dir/__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/core/string.cpp.o -c /__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/core/string.cpp 2025-05-07T19:52:22.7646422Z clang-16: warning: argument unused during compilation: '-L/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/lib' [-Wunused-command-line-argument] 2025-05-07T19:52:22.7773310Z [12/608] /github/home/miniconda/envs/build_binary/bin/c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dasmjit_EXPORTS -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/lib -fopenmp=libgomp -stdlib=libstdc++ -I/github/home/miniconda/envs/build_binary/include -O3 -DNDEBUG -std=c++17 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -MD -MT CMakeFiles/asmjit.dir/__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/core/constpool.cpp.o -MF CMakeFiles/asmjit.dir/__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/core/constpool.cpp.o.d -o CMakeFiles/asmjit.dir/__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/core/constpool.cpp.o -c /__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/core/constpool.cpp 2025-05-07T19:52:22.7785080Z clang-16: warning: argument unused during compilation: '-L/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/lib' [-Wunused-command-line-argument] 2025-05-07T19:52:22.7842838Z [13/608] /github/home/miniconda/envs/build_binary/bin/c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dasmjit_EXPORTS -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem 
/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/lib -fopenmp=libgomp -stdlib=libstdc++ -I/github/home/miniconda/envs/build_binary/include -O3 -DNDEBUG -std=c++17 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -MD -MT CMakeFiles/asmjit.dir/__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/core/target.cpp.o -MF CMakeFiles/asmjit.dir/__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/core/target.cpp.o.d -o CMakeFiles/asmjit.dir/__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/core/target.cpp.o -c /__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/core/target.cpp 2025-05-07T19:52:22.7858607Z clang-16: warning: argument unused during compilation: '-L/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/lib' [-Wunused-command-line-argument] 2025-05-07T19:52:22.7900859Z [14/608] /github/home/miniconda/envs/build_binary/bin/c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dasmjit_EXPORTS -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/lib -fopenmp=libgomp -stdlib=libstdc++ -I/github/home/miniconda/envs/build_binary/include -O3 -DNDEBUG -std=c++17 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -MD -MT CMakeFiles/asmjit.dir/__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/core/inst.cpp.o -MF 
CMakeFiles/asmjit.dir/__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/core/inst.cpp.o.d -o CMakeFiles/asmjit.dir/__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/core/inst.cpp.o -c /__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/core/inst.cpp 2025-05-07T19:52:22.7911525Z clang-16: warning: argument unused during compilation: '-L/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/lib' [-Wunused-command-line-argument] 2025-05-07T19:52:22.7921536Z [15/608] /github/home/miniconda/envs/build_binary/bin/c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dasmjit_EXPORTS -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/lib -fopenmp=libgomp -stdlib=libstdc++ -I/github/home/miniconda/envs/build_binary/include -O3 -DNDEBUG -std=c++17 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -MD -MT CMakeFiles/asmjit.dir/__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/core/type.cpp.o -MF CMakeFiles/asmjit.dir/__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/core/type.cpp.o.d -o CMakeFiles/asmjit.dir/__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/core/type.cpp.o -c /__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/core/type.cpp 2025-05-07T19:52:22.7932065Z clang-16: warning: argument unused during compilation: '-L/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/lib' [-Wunused-command-line-argument] 2025-05-07T19:52:22.7942258Z [16/608] /github/home/miniconda/envs/build_binary/bin/c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dasmjit_EXPORTS 
-I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/lib -fopenmp=libgomp -stdlib=libstdc++ -I/github/home/miniconda/envs/build_binary/include -O3 -DNDEBUG -std=c++17 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -MD -MT CMakeFiles/asmjit.dir/__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/core/zone.cpp.o -MF CMakeFiles/asmjit.dir/__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/core/zone.cpp.o.d -o CMakeFiles/asmjit.dir/__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/core/zone.cpp.o -c /__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/core/zone.cpp 2025-05-07T19:52:22.7952331Z clang-16: warning: argument unused during compilation: '-L/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/lib' [-Wunused-command-line-argument] 2025-05-07T19:52:22.7998637Z [17/608] /github/home/miniconda/envs/build_binary/bin/c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dasmjit_EXPORTS -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/lib -fopenmp=libgomp -stdlib=libstdc++ -I/github/home/miniconda/envs/build_binary/include -O3 -DNDEBUG -std=c++17 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -MD -MT 
CMakeFiles/asmjit.dir/__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/core/logger.cpp.o -MF CMakeFiles/asmjit.dir/__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/core/logger.cpp.o.d -o CMakeFiles/asmjit.dir/__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/core/logger.cpp.o -c /__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/core/logger.cpp 2025-05-07T19:52:22.8009155Z clang-16: warning: argument unused during compilation: '-L/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/lib' [-Wunused-command-line-argument] 2025-05-07T19:52:22.8019422Z [18/608] /github/home/miniconda/envs/build_binary/bin/c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dasmjit_EXPORTS -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/lib -fopenmp=libgomp -stdlib=libstdc++ -I/github/home/miniconda/envs/build_binary/include -O3 -DNDEBUG -std=c++17 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -MD -MT CMakeFiles/asmjit.dir/__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/core/instdb.cpp.o -MF CMakeFiles/asmjit.dir/__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/core/instdb.cpp.o.d -o CMakeFiles/asmjit.dir/__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/core/instdb.cpp.o -c /__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/core/instdb.cpp 2025-05-07T19:52:22.8030227Z clang-16: warning: argument unused during compilation: '-L/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/lib' [-Wunused-command-line-argument] 2025-05-07T19:52:22.8145240Z [19/608] /github/home/miniconda/envs/build_binary/bin/c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL 
-DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dasmjit_EXPORTS -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/lib -fopenmp=libgomp -stdlib=libstdc++ -I/github/home/miniconda/envs/build_binary/include -O3 -DNDEBUG -std=c++17 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -MD -MT CMakeFiles/asmjit.dir/__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/arm/a64operand.cpp.o -MF CMakeFiles/asmjit.dir/__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/arm/a64operand.cpp.o.d -o CMakeFiles/asmjit.dir/__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/arm/a64operand.cpp.o -c /__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/arm/a64operand.cpp 2025-05-07T19:52:22.8156189Z clang-16: warning: argument unused during compilation: '-L/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/lib' [-Wunused-command-line-argument] 2025-05-07T19:52:22.8213438Z [20/608] /github/home/miniconda/envs/build_binary/bin/c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dasmjit_EXPORTS -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/lib -fopenmp=libgomp -stdlib=libstdc++ -I/github/home/miniconda/envs/build_binary/include -O3 -DNDEBUG -std=c++17 -fPIC 
-Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -MD -MT CMakeFiles/asmjit.dir/__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/core/virtmem.cpp.o -MF CMakeFiles/asmjit.dir/__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/core/virtmem.cpp.o.d -o CMakeFiles/asmjit.dir/__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/core/virtmem.cpp.o -c /__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/core/virtmem.cpp 2025-05-07T19:52:22.8224190Z clang-16: warning: argument unused during compilation: '-L/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/lib' [-Wunused-command-line-argument] 2025-05-07T19:52:22.8404766Z [21/608] /github/home/miniconda/envs/build_binary/bin/c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dasmjit_EXPORTS -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/lib -fopenmp=libgomp -stdlib=libstdc++ -I/github/home/miniconda/envs/build_binary/include -O3 -DNDEBUG -std=c++17 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -MD -MT CMakeFiles/asmjit.dir/__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/core/jitruntime.cpp.o -MF CMakeFiles/asmjit.dir/__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/core/jitruntime.cpp.o.d -o CMakeFiles/asmjit.dir/__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/core/jitruntime.cpp.o -c /__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/core/jitruntime.cpp 2025-05-07T19:52:22.8416362Z clang-16: warning: argument unused during compilation: '-L/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/lib' [-Wunused-command-line-argument] 
2025-05-07T19:52:22.8795811Z [22/608] /github/home/miniconda/envs/build_binary/bin/c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dasmjit_EXPORTS -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/lib -fopenmp=libgomp -stdlib=libstdc++ -I/github/home/miniconda/envs/build_binary/include -O3 -DNDEBUG -std=c++17 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -MD -MT CMakeFiles/asmjit.dir/__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/core/emitterutils.cpp.o -MF CMakeFiles/asmjit.dir/__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/core/emitterutils.cpp.o.d -o CMakeFiles/asmjit.dir/__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/core/emitterutils.cpp.o -c /__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/core/emitterutils.cpp 2025-05-07T19:52:22.8807481Z clang-16: warning: argument unused during compilation: '-L/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/lib' [-Wunused-command-line-argument] 2025-05-07T19:52:22.8818185Z [23/608] /github/home/miniconda/envs/build_binary/bin/c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dasmjit_EXPORTS -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/lib -fopenmp=libgomp 
-stdlib=libstdc++ -I/github/home/miniconda/envs/build_binary/include -O3 -DNDEBUG -std=c++17 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -MD -MT CMakeFiles/asmjit.dir/__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/core/rastack.cpp.o -MF CMakeFiles/asmjit.dir/__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/core/rastack.cpp.o.d -o CMakeFiles/asmjit.dir/__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/core/rastack.cpp.o -c /__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/core/rastack.cpp 2025-05-07T19:52:22.8828921Z clang-16: warning: argument unused during compilation: '-L/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/lib' [-Wunused-command-line-argument] 2025-05-07T19:52:22.8842696Z [24/608] /github/home/miniconda/envs/build_binary/bin/c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dasmjit_EXPORTS -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/lib -fopenmp=libgomp -stdlib=libstdc++ -I/github/home/miniconda/envs/build_binary/include -O3 -DNDEBUG -std=c++17 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -MD -MT CMakeFiles/asmjit.dir/__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/core/zonevector.cpp.o -MF CMakeFiles/asmjit.dir/__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/core/zonevector.cpp.o.d -o CMakeFiles/asmjit.dir/__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/core/zonevector.cpp.o -c /__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/core/zonevector.cpp 2025-05-07T19:52:22.8853858Z clang-16: warning: argument unused during compilation: 
'-L/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/lib' [-Wunused-command-line-argument] 2025-05-07T19:52:22.8878102Z [25/608] /github/home/miniconda/envs/build_binary/bin/c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dasmjit_EXPORTS -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/lib -fopenmp=libgomp -stdlib=libstdc++ -I/github/home/miniconda/envs/build_binary/include -O3 -DNDEBUG -std=c++17 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -MD -MT CMakeFiles/asmjit.dir/__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/core/codewriter.cpp.o -MF CMakeFiles/asmjit.dir/__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/core/codewriter.cpp.o.d -o CMakeFiles/asmjit.dir/__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/core/codewriter.cpp.o -c /__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/core/codewriter.cpp 2025-05-07T19:52:22.8897784Z clang-16: warning: argument unused during compilation: '-L/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/lib' [-Wunused-command-line-argument] 2025-05-07T19:52:22.8909160Z [26/608] /github/home/miniconda/envs/build_binary/bin/c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dasmjit_EXPORTS -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -DTORCH_USE_CUDA_DSA 
-DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/lib -fopenmp=libgomp -stdlib=libstdc++ -I/github/home/miniconda/envs/build_binary/include -O3 -DNDEBUG -std=c++17 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -MD -MT CMakeFiles/asmjit.dir/__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/arm/a64func.cpp.o -MF CMakeFiles/asmjit.dir/__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/arm/a64func.cpp.o.d -o CMakeFiles/asmjit.dir/__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/arm/a64func.cpp.o -c /__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/arm/a64func.cpp 2025-05-07T19:52:22.8920448Z clang-16: warning: argument unused during compilation: '-L/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/lib' [-Wunused-command-line-argument] 2025-05-07T19:52:22.9098509Z [27/608] /github/home/miniconda/envs/build_binary/bin/c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dasmjit_EXPORTS -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/lib -fopenmp=libgomp -stdlib=libstdc++ -I/github/home/miniconda/envs/build_binary/include -O3 -DNDEBUG -std=c++17 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -MD -MT CMakeFiles/asmjit.dir/__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/core/cpuinfo.cpp.o -MF CMakeFiles/asmjit.dir/__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/core/cpuinfo.cpp.o.d -o CMakeFiles/asmjit.dir/__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/core/cpuinfo.cpp.o -c /__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/core/cpuinfo.cpp 
2025-05-07T19:52:22.9109881Z clang-16: warning: argument unused during compilation: '-L/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/lib' [-Wunused-command-line-argument] 2025-05-07T19:52:22.9120202Z [28/608] /github/home/miniconda/envs/build_binary/bin/c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dasmjit_EXPORTS -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/lib -fopenmp=libgomp -stdlib=libstdc++ -I/github/home/miniconda/envs/build_binary/include -O3 -DNDEBUG -std=c++17 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -MD -MT CMakeFiles/asmjit.dir/__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/core/emitter.cpp.o -MF CMakeFiles/asmjit.dir/__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/core/emitter.cpp.o.d -o CMakeFiles/asmjit.dir/__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/core/emitter.cpp.o -c /__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/core/emitter.cpp 2025-05-07T19:52:22.9131308Z clang-16: warning: argument unused during compilation: '-L/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/lib' [-Wunused-command-line-argument] 2025-05-07T19:52:22.9200558Z [29/608] /github/home/miniconda/envs/build_binary/bin/c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dasmjit_EXPORTS -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem 
/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/lib -fopenmp=libgomp -stdlib=libstdc++ -I/github/home/miniconda/envs/build_binary/include -O3 -DNDEBUG -std=c++17 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -MD -MT CMakeFiles/asmjit.dir/__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/core/funcargscontext.cpp.o -MF CMakeFiles/asmjit.dir/__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/core/funcargscontext.cpp.o.d -o CMakeFiles/asmjit.dir/__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/core/funcargscontext.cpp.o -c /__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/core/funcargscontext.cpp 2025-05-07T19:52:22.9212366Z clang-16: warning: argument unused during compilation: '-L/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/lib' [-Wunused-command-line-argument] 2025-05-07T19:52:22.9281936Z [30/608] /github/home/miniconda/envs/build_binary/bin/c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dasmjit_EXPORTS -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/lib -fopenmp=libgomp -stdlib=libstdc++ -I/github/home/miniconda/envs/build_binary/include -O3 -DNDEBUG -std=c++17 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -MD -MT CMakeFiles/asmjit.dir/__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/arm/a64instapi.cpp.o -MF CMakeFiles/asmjit.dir/__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/arm/a64instapi.cpp.o.d -o 
CMakeFiles/asmjit.dir/__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/arm/a64instapi.cpp.o -c /__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/arm/a64instapi.cpp 2025-05-07T19:52:22.9293352Z clang-16: warning: argument unused during compilation: '-L/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/lib' [-Wunused-command-line-argument] 2025-05-07T19:52:22.9359608Z [31/608] /github/home/miniconda/envs/build_binary/bin/c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dasmjit_EXPORTS -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/lib -fopenmp=libgomp -stdlib=libstdc++ -I/github/home/miniconda/envs/build_binary/include -O3 -DNDEBUG -std=c++17 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -MD -MT CMakeFiles/asmjit.dir/__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/arm/a64instdb.cpp.o -MF CMakeFiles/asmjit.dir/__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/arm/a64instdb.cpp.o.d -o CMakeFiles/asmjit.dir/__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/arm/a64instdb.cpp.o -c /__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/arm/a64instdb.cpp 2025-05-07T19:52:22.9370635Z clang-16: warning: argument unused during compilation: '-L/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/lib' [-Wunused-command-line-argument] 2025-05-07T19:52:22.9446956Z [32/608] /github/home/miniconda/envs/build_binary/bin/c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dasmjit_EXPORTS -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem 
/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/lib -fopenmp=libgomp -stdlib=libstdc++ -I/github/home/miniconda/envs/build_binary/include -O3 -DNDEBUG -std=c++17 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -MD -MT CMakeFiles/asmjit.dir/__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/arm/a64formatter.cpp.o -MF CMakeFiles/asmjit.dir/__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/arm/a64formatter.cpp.o.d -o CMakeFiles/asmjit.dir/__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/arm/a64formatter.cpp.o -c /__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/arm/a64formatter.cpp 2025-05-07T19:52:22.9458544Z clang-16: warning: argument unused during compilation: '-L/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/lib' [-Wunused-command-line-argument] 2025-05-07T19:52:22.9499977Z [33/608] /github/home/miniconda/envs/build_binary/bin/c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dasmjit_EXPORTS -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/lib -fopenmp=libgomp -stdlib=libstdc++ -I/github/home/miniconda/envs/build_binary/include -O3 -DNDEBUG -std=c++17 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -MD -MT CMakeFiles/asmjit.dir/__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/core/assembler.cpp.o -MF 
CMakeFiles/asmjit.dir/__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/core/assembler.cpp.o.d -o CMakeFiles/asmjit.dir/__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/core/assembler.cpp.o -c /__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/core/assembler.cpp 2025-05-07T19:52:22.9511182Z clang-16: warning: argument unused during compilation: '-L/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/lib' [-Wunused-command-line-argument] 2025-05-07T19:52:22.9545425Z [34/608] /github/home/miniconda/envs/build_binary/bin/c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dasmjit_EXPORTS -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/lib -fopenmp=libgomp -stdlib=libstdc++ -I/github/home/miniconda/envs/build_binary/include -O3 -DNDEBUG -std=c++17 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -MD -MT CMakeFiles/asmjit.dir/__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/core/emithelper.cpp.o -MF CMakeFiles/asmjit.dir/__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/core/emithelper.cpp.o.d -o CMakeFiles/asmjit.dir/__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/core/emithelper.cpp.o -c /__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/core/emithelper.cpp 2025-05-07T19:52:22.9556572Z clang-16: warning: argument unused during compilation: '-L/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/lib' [-Wunused-command-line-argument] 2025-05-07T19:52:22.9644701Z [35/608] /github/home/miniconda/envs/build_binary/bin/c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dasmjit_EXPORTS 
-I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/lib -fopenmp=libgomp -stdlib=libstdc++ -I/github/home/miniconda/envs/build_binary/include -O3 -DNDEBUG -std=c++17 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -MD -MT CMakeFiles/asmjit.dir/__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/arm/a64builder.cpp.o -MF CMakeFiles/asmjit.dir/__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/arm/a64builder.cpp.o.d -o CMakeFiles/asmjit.dir/__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/arm/a64builder.cpp.o -c /__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/arm/a64builder.cpp 2025-05-07T19:52:22.9655883Z clang-16: warning: argument unused during compilation: '-L/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/lib' [-Wunused-command-line-argument] 2025-05-07T19:52:22.9755038Z [36/608] /github/home/miniconda/envs/build_binary/bin/c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dasmjit_EXPORTS -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/lib -fopenmp=libgomp -stdlib=libstdc++ -I/github/home/miniconda/envs/build_binary/include -O3 -DNDEBUG -std=c++17 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -MD -MT 
CMakeFiles/asmjit.dir/__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/core/archtraits.cpp.o -MF CMakeFiles/asmjit.dir/__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/core/archtraits.cpp.o.d -o CMakeFiles/asmjit.dir/__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/core/archtraits.cpp.o -c /__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/core/archtraits.cpp 2025-05-07T19:52:22.9766308Z clang-16: warning: argument unused during compilation: '-L/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/lib' [-Wunused-command-line-argument] 2025-05-07T19:52:22.9991850Z [37/608] /github/home/miniconda/envs/build_binary/bin/c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dasmjit_EXPORTS -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/lib -fopenmp=libgomp -stdlib=libstdc++ -I/github/home/miniconda/envs/build_binary/include -O3 -DNDEBUG -std=c++17 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -MD -MT CMakeFiles/asmjit.dir/__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/arm/armformatter.cpp.o -MF CMakeFiles/asmjit.dir/__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/arm/armformatter.cpp.o.d -o CMakeFiles/asmjit.dir/__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/arm/armformatter.cpp.o -c /__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/arm/armformatter.cpp 2025-05-07T19:52:23.0003436Z clang-16: warning: argument unused during compilation: '-L/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/lib' [-Wunused-command-line-argument] 2025-05-07T19:52:23.0200409Z [38/608] 
/github/home/miniconda/envs/build_binary/bin/c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dasmjit_EXPORTS -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/lib -fopenmp=libgomp -stdlib=libstdc++ -I/github/home/miniconda/envs/build_binary/include -O3 -DNDEBUG -std=c++17 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -MD -MT CMakeFiles/asmjit.dir/__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/core/jitallocator.cpp.o -MF CMakeFiles/asmjit.dir/__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/core/jitallocator.cpp.o.d -o CMakeFiles/asmjit.dir/__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/core/jitallocator.cpp.o -c /__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/core/jitallocator.cpp 2025-05-07T19:52:23.0211560Z clang-16: warning: argument unused during compilation: '-L/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/lib' [-Wunused-command-line-argument] 2025-05-07T19:52:23.0249543Z [39/608] /github/home/miniconda/envs/build_binary/bin/c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dasmjit_EXPORTS -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/lib -fopenmp=libgomp -stdlib=libstdc++ 
-I/github/home/miniconda/envs/build_binary/include -O3 -DNDEBUG -std=c++17 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -MD -MT CMakeFiles/asmjit.dir/__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/core/func.cpp.o -MF CMakeFiles/asmjit.dir/__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/core/func.cpp.o.d -o CMakeFiles/asmjit.dir/__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/core/func.cpp.o -c /__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/core/func.cpp 2025-05-07T19:52:23.0316152Z clang-16: warning: argument unused during compilation: '-L/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/lib' [-Wunused-command-line-argument] 2025-05-07T19:52:23.0326948Z [40/608] /github/home/miniconda/envs/build_binary/bin/c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dasmjit_EXPORTS -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/lib -fopenmp=libgomp -stdlib=libstdc++ -I/github/home/miniconda/envs/build_binary/include -O3 -DNDEBUG -std=c++17 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -MD -MT CMakeFiles/asmjit.dir/__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/core/formatter.cpp.o -MF CMakeFiles/asmjit.dir/__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/core/formatter.cpp.o.d -o CMakeFiles/asmjit.dir/__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/core/formatter.cpp.o -c /__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/core/formatter.cpp 2025-05-07T19:52:23.0336523Z clang-16: warning: argument unused during compilation: 
'-L/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/lib' [-Wunused-command-line-argument] 2025-05-07T19:52:23.0401066Z [41/608] /github/home/miniconda/envs/build_binary/bin/c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dasmjit_EXPORTS -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/lib -fopenmp=libgomp -stdlib=libstdc++ -I/github/home/miniconda/envs/build_binary/include -O3 -DNDEBUG -std=c++17 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -MD -MT CMakeFiles/asmjit.dir/__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/x86/x86operand.cpp.o -MF CMakeFiles/asmjit.dir/__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/x86/x86operand.cpp.o.d -o CMakeFiles/asmjit.dir/__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/x86/x86operand.cpp.o -c /__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/x86/x86operand.cpp 2025-05-07T19:52:23.0407562Z clang-16: warning: argument unused during compilation: '-L/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/lib' [-Wunused-command-line-argument] 2025-05-07T19:52:23.0441091Z [42/608] /github/home/miniconda/envs/build_binary/bin/c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dasmjit_EXPORTS -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA 
-L/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/lib -fopenmp=libgomp -stdlib=libstdc++ -I/github/home/miniconda/envs/build_binary/include -O3 -DNDEBUG -std=c++17 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -MD -MT CMakeFiles/asmjit.dir/__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/arm/a64compiler.cpp.o -MF CMakeFiles/asmjit.dir/__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/arm/a64compiler.cpp.o.d -o CMakeFiles/asmjit.dir/__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/arm/a64compiler.cpp.o -c /__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/arm/a64compiler.cpp 2025-05-07T19:52:23.0447164Z clang-16: warning: argument unused during compilation: '-L/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/lib' [-Wunused-command-line-argument] 2025-05-07T19:52:23.0608304Z [43/608] /github/home/miniconda/envs/build_binary/bin/c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dasmjit_EXPORTS -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/lib -fopenmp=libgomp -stdlib=libstdc++ -I/github/home/miniconda/envs/build_binary/include -O3 -DNDEBUG -std=c++17 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -MD -MT CMakeFiles/asmjit.dir/__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/arm/a64emithelper.cpp.o -MF CMakeFiles/asmjit.dir/__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/arm/a64emithelper.cpp.o.d -o CMakeFiles/asmjit.dir/__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/arm/a64emithelper.cpp.o -c /__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/arm/a64emithelper.cpp 
2025-05-07T19:52:23.0619031Z clang-16: warning: argument unused during compilation: '-L/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/lib' [-Wunused-command-line-argument] 2025-05-07T19:52:23.0831457Z [44/608] /github/home/miniconda/envs/build_binary/bin/c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dasmjit_EXPORTS -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/lib -fopenmp=libgomp -stdlib=libstdc++ -I/github/home/miniconda/envs/build_binary/include -O3 -DNDEBUG -std=c++17 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -MD -MT CMakeFiles/asmjit.dir/__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/core/compiler.cpp.o -MF CMakeFiles/asmjit.dir/__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/core/compiler.cpp.o.d -o CMakeFiles/asmjit.dir/__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/core/compiler.cpp.o -c /__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/core/compiler.cpp 2025-05-07T19:52:23.0842704Z clang-16: warning: argument unused during compilation: '-L/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/lib' [-Wunused-command-line-argument] 2025-05-07T19:52:23.1112665Z [45/608] /github/home/miniconda/envs/build_binary/bin/c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dasmjit_EXPORTS -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem 
/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/lib -fopenmp=libgomp -stdlib=libstdc++ -I/github/home/miniconda/envs/build_binary/include -O3 -DNDEBUG -std=c++17 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -MD -MT CMakeFiles/asmjit.dir/__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/core/builder.cpp.o -MF CMakeFiles/asmjit.dir/__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/core/builder.cpp.o.d -o CMakeFiles/asmjit.dir/__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/core/builder.cpp.o -c /__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/core/builder.cpp 2025-05-07T19:52:23.1124431Z clang-16: warning: argument unused during compilation: '-L/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/lib' [-Wunused-command-line-argument] 2025-05-07T19:52:23.2026307Z [46/608] /github/home/miniconda/envs/build_binary/bin/c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dasmjit_EXPORTS -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/lib -fopenmp=libgomp -stdlib=libstdc++ -I/github/home/miniconda/envs/build_binary/include -O3 -DNDEBUG -std=c++17 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -MD -MT CMakeFiles/asmjit.dir/__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/x86/x86builder.cpp.o -MF CMakeFiles/asmjit.dir/__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/x86/x86builder.cpp.o.d -o 
CMakeFiles/asmjit.dir/__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/x86/x86builder.cpp.o -c /__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/x86/x86builder.cpp 2025-05-07T19:52:23.2038353Z clang-16: warning: argument unused during compilation: '-L/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/lib' [-Wunused-command-line-argument] 2025-05-07T19:52:23.2511618Z [47/608] /github/home/miniconda/envs/build_binary/bin/c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dasmjit_EXPORTS -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/lib -fopenmp=libgomp -stdlib=libstdc++ -I/github/home/miniconda/envs/build_binary/include -O3 -DNDEBUG -std=c++17 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -MD -MT CMakeFiles/asmjit.dir/__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/x86/x86func.cpp.o -MF CMakeFiles/asmjit.dir/__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/x86/x86func.cpp.o.d -o CMakeFiles/asmjit.dir/__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/x86/x86func.cpp.o -c /__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/x86/x86func.cpp 2025-05-07T19:52:23.2523551Z clang-16: warning: argument unused during compilation: '-L/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/lib' [-Wunused-command-line-argument] 2025-05-07T19:52:23.2811450Z [48/608] /github/home/miniconda/envs/build_binary/bin/c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dasmjit_EXPORTS -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem 
/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/lib -fopenmp=libgomp -stdlib=libstdc++ -I/github/home/miniconda/envs/build_binary/include -O3 -DNDEBUG -std=c++17 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -MD -MT CMakeFiles/asmjit.dir/__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/x86/x86compiler.cpp.o -MF CMakeFiles/asmjit.dir/__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/x86/x86compiler.cpp.o.d -o CMakeFiles/asmjit.dir/__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/x86/x86compiler.cpp.o -c /__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/x86/x86compiler.cpp 2025-05-07T19:52:23.2823277Z clang-16: warning: argument unused during compilation: '-L/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/lib' [-Wunused-command-line-argument] 2025-05-07T19:52:23.2852632Z [49/608] /github/home/miniconda/envs/build_binary/bin/c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dasmjit_EXPORTS -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/lib -fopenmp=libgomp -stdlib=libstdc++ -I/github/home/miniconda/envs/build_binary/include -O3 -DNDEBUG -std=c++17 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -MD -MT CMakeFiles/asmjit.dir/__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/core/codeholder.cpp.o -MF 
CMakeFiles/asmjit.dir/__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/core/codeholder.cpp.o.d -o CMakeFiles/asmjit.dir/__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/core/codeholder.cpp.o -c /__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/core/codeholder.cpp 2025-05-07T19:52:23.2864540Z clang-16: warning: argument unused during compilation: '-L/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/lib' [-Wunused-command-line-argument] 2025-05-07T19:52:23.3003927Z [50/608] /github/home/miniconda/envs/build_binary/bin/c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dasmjit_EXPORTS -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/lib -fopenmp=libgomp -stdlib=libstdc++ -I/github/home/miniconda/envs/build_binary/include -O3 -DNDEBUG -std=c++17 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -MD -MT CMakeFiles/asmjit.dir/__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/core/ralocal.cpp.o -MF CMakeFiles/asmjit.dir/__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/core/ralocal.cpp.o.d -o CMakeFiles/asmjit.dir/__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/core/ralocal.cpp.o -c /__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/core/ralocal.cpp 2025-05-07T19:52:23.3015820Z clang-16: warning: argument unused during compilation: '-L/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/lib' [-Wunused-command-line-argument] 2025-05-07T19:52:23.3077139Z [51/608] /github/home/miniconda/envs/build_binary/bin/c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dasmjit_EXPORTS 
-I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/lib -fopenmp=libgomp -stdlib=libstdc++ -I/github/home/miniconda/envs/build_binary/include -O3 -DNDEBUG -std=c++17 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -MD -MT CMakeFiles/asmjit.dir/__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/x86/x86emithelper.cpp.o -MF CMakeFiles/asmjit.dir/__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/x86/x86emithelper.cpp.o.d -o CMakeFiles/asmjit.dir/__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/x86/x86emithelper.cpp.o -c /__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/x86/x86emithelper.cpp 2025-05-07T19:52:23.3088913Z clang-16: warning: argument unused during compilation: '-L/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/lib' [-Wunused-command-line-argument] 2025-05-07T19:52:23.3539534Z [52/608] /github/home/miniconda/envs/build_binary/bin/c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dasmjit_EXPORTS -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/lib -fopenmp=libgomp -stdlib=libstdc++ -I/github/home/miniconda/envs/build_binary/include -O3 -DNDEBUG -std=c++17 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -MD -MT 
CMakeFiles/asmjit.dir/__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/x86/x86formatter.cpp.o -MF CMakeFiles/asmjit.dir/__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/x86/x86formatter.cpp.o.d -o CMakeFiles/asmjit.dir/__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/x86/x86formatter.cpp.o -c /__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/x86/x86formatter.cpp 2025-05-07T19:52:23.3551297Z clang-16: warning: argument unused during compilation: '-L/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/lib' [-Wunused-command-line-argument] 2025-05-07T19:52:23.3876118Z [53/608] /github/home/miniconda/envs/build_binary/bin/c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dasmjit_EXPORTS -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/lib -fopenmp=libgomp -stdlib=libstdc++ -I/github/home/miniconda/envs/build_binary/include -O3 -DNDEBUG -std=c++17 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -MD -MT CMakeFiles/asmjit.dir/__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/x86/x86instapi.cpp.o -MF CMakeFiles/asmjit.dir/__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/x86/x86instapi.cpp.o.d -o CMakeFiles/asmjit.dir/__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/x86/x86instapi.cpp.o -c /__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/x86/x86instapi.cpp 2025-05-07T19:52:23.3888075Z clang-16: warning: argument unused during compilation: '-L/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/lib' [-Wunused-command-line-argument] 2025-05-07T19:52:23.4523556Z [54/608] /github/home/miniconda/envs/build_binary/bin/c++ 
-DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dasmjit_EXPORTS -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/lib -fopenmp=libgomp -stdlib=libstdc++ -I/github/home/miniconda/envs/build_binary/include -O3 -DNDEBUG -std=c++17 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -MD -MT CMakeFiles/asmjit.dir/__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/arm/a64rapass.cpp.o -MF CMakeFiles/asmjit.dir/__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/arm/a64rapass.cpp.o.d -o CMakeFiles/asmjit.dir/__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/arm/a64rapass.cpp.o -c /__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/arm/a64rapass.cpp 2025-05-07T19:52:23.4535154Z clang-16: warning: argument unused during compilation: '-L/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/lib' [-Wunused-command-line-argument] 2025-05-07T19:52:23.5109809Z [55/608] /github/home/miniconda/envs/build_binary/bin/c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include 
-I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/lib -fopenmp=libgomp -stdlib=libstdc++ -I/github/home/miniconda/envs/build_binary/include -O3 -DNDEBUG -std=c++17 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -mavx2 -mf16c -mfma -fopenmp -MD -MT CMakeFiles/fbgemm.dir/__w/FBGEMM/FBGEMM/src/EmbeddingSpMDMAvx2.cc.o -MF CMakeFiles/fbgemm.dir/__w/FBGEMM/FBGEMM/src/EmbeddingSpMDMAvx2.cc.o.d -o CMakeFiles/fbgemm.dir/__w/FBGEMM/FBGEMM/src/EmbeddingSpMDMAvx2.cc.o -c /__w/FBGEMM/FBGEMM/src/EmbeddingSpMDMAvx2.cc 2025-05-07T19:52:23.5128615Z clang-16: warning: argument unused during compilation: '-L/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/lib' [-Wunused-command-line-argument] 2025-05-07T19:52:23.6772000Z [56/608] /github/home/miniconda/envs/build_binary/bin/c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dasmjit_EXPORTS -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA 
-L/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/lib -fopenmp=libgomp -stdlib=libstdc++ -I/github/home/miniconda/envs/build_binary/include -O3 -DNDEBUG -std=c++17 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -MD -MT CMakeFiles/asmjit.dir/__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/core/rapass.cpp.o -MF CMakeFiles/asmjit.dir/__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/core/rapass.cpp.o.d -o CMakeFiles/asmjit.dir/__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/core/rapass.cpp.o -c /__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/core/rapass.cpp 2025-05-07T19:52:23.6784133Z clang-16: warning: argument unused during compilation: '-L/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/lib' [-Wunused-command-line-argument] 2025-05-07T19:52:23.6874206Z [57/608] /github/home/miniconda/envs/build_binary/bin/c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dasmjit_EXPORTS -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/lib -fopenmp=libgomp -stdlib=libstdc++ -I/github/home/miniconda/envs/build_binary/include -O3 -DNDEBUG -std=c++17 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -MD -MT CMakeFiles/asmjit.dir/__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/x86/x86instdb.cpp.o -MF CMakeFiles/asmjit.dir/__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/x86/x86instdb.cpp.o.d -o CMakeFiles/asmjit.dir/__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/x86/x86instdb.cpp.o -c /__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/x86/x86instdb.cpp 2025-05-07T19:52:23.6886543Z 
clang-16: warning: argument unused during compilation: '-L/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/lib' [-Wunused-command-line-argument] 2025-05-07T19:52:23.7008752Z [58/608] /github/home/miniconda/envs/build_binary/bin/c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/lib -fopenmp=libgomp -stdlib=libstdc++ -I/github/home/miniconda/envs/build_binary/include -O3 -DNDEBUG -std=c++17 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -mavx2 -mf16c -mfma -mavx512f -mavx512bw -mavx512dq -mavx512vl -fopenmp -MD -MT CMakeFiles/fbgemm.dir/__w/FBGEMM/FBGEMM/src/EmbeddingSpMDMAvx512.cc.o -MF CMakeFiles/fbgemm.dir/__w/FBGEMM/FBGEMM/src/EmbeddingSpMDMAvx512.cc.o.d -o 
CMakeFiles/fbgemm.dir/__w/FBGEMM/FBGEMM/src/EmbeddingSpMDMAvx512.cc.o -c /__w/FBGEMM/FBGEMM/src/EmbeddingSpMDMAvx512.cc 2025-05-07T19:52:23.7028188Z clang-16: warning: argument unused during compilation: '-L/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/lib' [-Wunused-command-line-argument] 2025-05-07T19:52:23.8348255Z [59/608] /github/home/miniconda/envs/build_binary/bin/c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/lib -fopenmp=libgomp -stdlib=libstdc++ -I/github/home/miniconda/envs/build_binary/include -O3 -DNDEBUG -std=c++17 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -MD -MT CMakeFiles/fbgemm.dir/__w/FBGEMM/FBGEMM/src/QuantUtils.cc.o -MF 
CMakeFiles/fbgemm.dir/__w/FBGEMM/FBGEMM/src/QuantUtils.cc.o.d -o CMakeFiles/fbgemm.dir/__w/FBGEMM/FBGEMM/src/QuantUtils.cc.o -c /__w/FBGEMM/FBGEMM/src/QuantUtils.cc 2025-05-07T19:52:23.8382512Z clang-16: warning: argument unused during compilation: '-L/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/lib' [-Wunused-command-line-argument] 2025-05-07T19:52:23.9225127Z [60/608] /github/home/miniconda/envs/build_binary/bin/c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dasmjit_EXPORTS -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/lib -fopenmp=libgomp -stdlib=libstdc++ -I/github/home/miniconda/envs/build_binary/include -O3 -DNDEBUG -std=c++17 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -MD -MT CMakeFiles/asmjit.dir/__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/x86/x86rapass.cpp.o -MF CMakeFiles/asmjit.dir/__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/x86/x86rapass.cpp.o.d -o CMakeFiles/asmjit.dir/__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/x86/x86rapass.cpp.o -c /__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/x86/x86rapass.cpp 2025-05-07T19:52:23.9237384Z clang-16: warning: argument unused during compilation: '-L/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/lib' [-Wunused-command-line-argument] 2025-05-07T19:52:23.9878859Z [61/608] /github/home/miniconda/envs/build_binary/bin/c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dasmjit_EXPORTS -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include 
-isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/lib -fopenmp=libgomp -stdlib=libstdc++ -I/github/home/miniconda/envs/build_binary/include -O3 -DNDEBUG -std=c++17 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -MD -MT CMakeFiles/asmjit.dir/__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/arm/a64assembler.cpp.o -MF CMakeFiles/asmjit.dir/__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/arm/a64assembler.cpp.o.d -o CMakeFiles/asmjit.dir/__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/arm/a64assembler.cpp.o -c /__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/arm/a64assembler.cpp 2025-05-07T19:52:23.9890805Z clang-16: warning: argument unused during compilation: '-L/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/lib' [-Wunused-command-line-argument] 2025-05-07T19:52:24.2989835Z [62/608] /github/home/miniconda/envs/build_binary/bin/c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dasmjit_EXPORTS -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/lib -fopenmp=libgomp -stdlib=libstdc++ -I/github/home/miniconda/envs/build_binary/include -O3 -DNDEBUG -std=c++17 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -MD -MT CMakeFiles/asmjit.dir/__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/x86/x86assembler.cpp.o -MF 
CMakeFiles/asmjit.dir/__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/x86/x86assembler.cpp.o.d -o CMakeFiles/asmjit.dir/__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/x86/x86assembler.cpp.o -c /__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/x86/x86assembler.cpp 2025-05-07T19:52:24.3001240Z clang-16: warning: argument unused during compilation: '-L/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/lib' [-Wunused-command-line-argument] 2025-05-07T19:52:24.9352408Z [63/608] /github/home/miniconda/envs/build_binary/bin/c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/lib -fopenmp=libgomp -stdlib=libstdc++ -I/github/home/miniconda/envs/build_binary/include -O3 -DNDEBUG -std=c++17 -fPIC 
-Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -MD -MT CMakeFiles/fbgemm.dir/__w/FBGEMM/FBGEMM/src/Utils.cc.o -MF CMakeFiles/fbgemm.dir/__w/FBGEMM/FBGEMM/src/Utils.cc.o.d -o CMakeFiles/fbgemm.dir/__w/FBGEMM/FBGEMM/src/Utils.cc.o -c /__w/FBGEMM/FBGEMM/src/Utils.cc 2025-05-07T19:52:24.9361502Z clang-16: warning: argument unused during compilation: '-L/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/lib' [-Wunused-command-line-argument] 2025-05-07T19:52:24.9979237Z [64/608] : && /github/home/miniconda/envs/build_binary/bin/c++ -fPIC -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/lib -fopenmp=libgomp -stdlib=libstdc++ -I/github/home/miniconda/envs/build_binary/include -O3 -DNDEBUG -Wl,-O2 -Wl,--sort-common -Wl,--as-needed -Wl,-z,relro -Wl,-z,now -Wl,--disable-new-dtags -Wl,--gc-sections -Wl,--allow-shlib-undefined -Wl,-rpath,/github/home/miniconda/envs/build_binary/lib -Wl,-rpath-link,/github/home/miniconda/envs/build_binary/lib -L/github/home/miniconda/envs/build_binary/lib -s -shared -Wl,-soname,asmjit.so -o asmjit.so CMakeFiles/asmjit.dir/__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/arm/a64assembler.cpp.o CMakeFiles/asmjit.dir/__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/arm/a64builder.cpp.o CMakeFiles/asmjit.dir/__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/arm/a64compiler.cpp.o CMakeFiles/asmjit.dir/__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/arm/a64emithelper.cpp.o CMakeFiles/asmjit.dir/__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/arm/a64formatter.cpp.o CMakeFiles/asmjit.dir/__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/arm/a64func.cpp.o CMakeFiles/asmjit.dir/__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/arm/a64instapi.cpp.o CMakeFiles/asmjit.dir/__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/arm/a64instdb.cpp.o CMakeFiles/asmjit.dir/__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/arm/a64operand.cpp.o 
CMakeFiles/asmjit.dir/__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/arm/a64rapass.cpp.o CMakeFiles/asmjit.dir/__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/arm/armformatter.cpp.o CMakeFiles/asmjit.dir/__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/core/archtraits.cpp.o CMakeFiles/asmjit.dir/__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/core/assembler.cpp.o CMakeFiles/asmjit.dir/__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/core/builder.cpp.o CMakeFiles/asmjit.dir/__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/core/codeholder.cpp.o CMakeFiles/asmjit.dir/__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/core/codewriter.cpp.o CMakeFiles/asmjit.dir/__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/core/compiler.cpp.o CMakeFiles/asmjit.dir/__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/core/constpool.cpp.o CMakeFiles/asmjit.dir/__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/core/cpuinfo.cpp.o CMakeFiles/asmjit.dir/__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/core/emithelper.cpp.o CMakeFiles/asmjit.dir/__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/core/emitter.cpp.o CMakeFiles/asmjit.dir/__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/core/emitterutils.cpp.o CMakeFiles/asmjit.dir/__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/core/environment.cpp.o CMakeFiles/asmjit.dir/__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/core/errorhandler.cpp.o CMakeFiles/asmjit.dir/__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/core/formatter.cpp.o CMakeFiles/asmjit.dir/__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/core/func.cpp.o CMakeFiles/asmjit.dir/__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/core/funcargscontext.cpp.o CMakeFiles/asmjit.dir/__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/core/globals.cpp.o CMakeFiles/asmjit.dir/__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/core/inst.cpp.o CMakeFiles/asmjit.dir/__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/core/instdb.cpp.o CMakeFiles/asmjit.dir/__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/core/jitallocator.cpp.o 
CMakeFiles/asmjit.dir/__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/core/jitruntime.cpp.o CMakeFiles/asmjit.dir/__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/core/logger.cpp.o CMakeFiles/asmjit.dir/__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/core/operand.cpp.o CMakeFiles/asmjit.dir/__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/core/osutils.cpp.o CMakeFiles/asmjit.dir/__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/core/ralocal.cpp.o CMakeFiles/asmjit.dir/__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/core/rapass.cpp.o CMakeFiles/asmjit.dir/__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/core/rastack.cpp.o CMakeFiles/asmjit.dir/__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/core/string.cpp.o CMakeFiles/asmjit.dir/__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/core/support.cpp.o CMakeFiles/asmjit.dir/__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/core/target.cpp.o CMakeFiles/asmjit.dir/__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/core/type.cpp.o CMakeFiles/asmjit.dir/__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/core/virtmem.cpp.o CMakeFiles/asmjit.dir/__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/core/zone.cpp.o CMakeFiles/asmjit.dir/__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/core/zonehash.cpp.o CMakeFiles/asmjit.dir/__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/core/zonelist.cpp.o CMakeFiles/asmjit.dir/__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/core/zonestack.cpp.o CMakeFiles/asmjit.dir/__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/core/zonetree.cpp.o CMakeFiles/asmjit.dir/__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/core/zonevector.cpp.o CMakeFiles/asmjit.dir/__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/x86/x86assembler.cpp.o CMakeFiles/asmjit.dir/__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/x86/x86builder.cpp.o CMakeFiles/asmjit.dir/__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/x86/x86compiler.cpp.o CMakeFiles/asmjit.dir/__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/x86/x86emithelper.cpp.o CMakeFiles/asmjit.dir/__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/x86/x86formatter.cpp.o 
CMakeFiles/asmjit.dir/__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/x86/x86func.cpp.o CMakeFiles/asmjit.dir/__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/x86/x86instapi.cpp.o CMakeFiles/asmjit.dir/__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/x86/x86instdb.cpp.o CMakeFiles/asmjit.dir/__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/x86/x86operand.cpp.o CMakeFiles/asmjit.dir/__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/x86/x86rapass.cpp.o -L/lib/intel64 -L/lib/intel64_win -L/lib/win-x64 -Wl,-rpath,/lib/intel64:/lib/intel64_win:/lib/win-x64:/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/lib:/github/home/miniconda/envs/build_binary/lib/stubs: /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/lib/libtorch.so /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/lib/libc10.so /github/home/miniconda/envs/build_binary/lib/libnvrtc.so /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/lib/libc10_cuda.so /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/lib/libnccl.so.2 /github/home/miniconda/envs/build_binary/lib/stubs/libcuda.so /github/home/miniconda/envs/build_binary/lib/stubs/libnvidia-ml.so -Wl,--no-as-needed,"/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/lib/libtorch_cpu.so" -Wl,--as-needed -Wl,--no-as-needed,"/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/lib/libtorch_cuda.so" -Wl,--as-needed /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/lib/libc10_cuda.so /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/lib/libc10.so /github/home/miniconda/envs/build_binary/lib/libcudart.so -Wl,--no-as-needed,"/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/lib/libtorch.so" -Wl,--as-needed && : 2025-05-07T19:52:25.3077936Z [65/608] /github/home/miniconda/envs/build_binary/bin/c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL 
-DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/lib -fopenmp=libgomp -stdlib=libstdc++ -I/github/home/miniconda/envs/build_binary/include -O3 -DNDEBUG -std=c++17 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -MD -MT CMakeFiles/fbgemm.dir/__w/FBGEMM/FBGEMM/src/SparseAdagrad.cc.o -MF CMakeFiles/fbgemm.dir/__w/FBGEMM/FBGEMM/src/SparseAdagrad.cc.o.d -o CMakeFiles/fbgemm.dir/__w/FBGEMM/FBGEMM/src/SparseAdagrad.cc.o -c /__w/FBGEMM/FBGEMM/src/SparseAdagrad.cc 2025-05-07T19:52:25.3094069Z clang-16: warning: argument unused during compilation: '-L/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/lib' [-Wunused-command-line-argument] 2025-05-07T19:52:28.7056771Z [66/608] 
/github/home/miniconda/envs/build_binary/bin/c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/lib -fopenmp=libgomp -stdlib=libstdc++ -I/github/home/miniconda/envs/build_binary/include -O3 -DNDEBUG -std=c++17 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -MD -MT CMakeFiles/fbgemm.dir/__w/FBGEMM/FBGEMM/src/RefImplementations.cc.o -MF CMakeFiles/fbgemm.dir/__w/FBGEMM/FBGEMM/src/RefImplementations.cc.o.d -o CMakeFiles/fbgemm.dir/__w/FBGEMM/FBGEMM/src/RefImplementations.cc.o -c /__w/FBGEMM/FBGEMM/src/RefImplementations.cc 2025-05-07T19:52:28.7073800Z clang-16: warning: argument unused during compilation: '-L/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/lib' 
[-Wunused-command-line-argument] 2025-05-07T19:52:29.6386711Z [67/608] /github/home/miniconda/envs/build_binary/bin/c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/lib -fopenmp=libgomp -stdlib=libstdc++ -I/github/home/miniconda/envs/build_binary/include -O3 -DNDEBUG -std=c++17 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -MD -MT CMakeFiles/fbgemm.dir/__w/FBGEMM/FBGEMM/src/RowWiseSparseAdagradFused.cc.o -MF CMakeFiles/fbgemm.dir/__w/FBGEMM/FBGEMM/src/RowWiseSparseAdagradFused.cc.o.d -o CMakeFiles/fbgemm.dir/__w/FBGEMM/FBGEMM/src/RowWiseSparseAdagradFused.cc.o -c /__w/FBGEMM/FBGEMM/src/RowWiseSparseAdagradFused.cc 2025-05-07T19:52:29.6402686Z clang-16: warning: argument unused during 
compilation: '-L/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/lib' [-Wunused-command-line-argument] 2025-05-07T19:52:30.3755773Z [68/608] /github/home/miniconda/envs/build_binary/bin/c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_cache_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/lib -fopenmp=libgomp -stdlib=libstdc++ -I/github/home/miniconda/envs/build_binary/include -O3 -DNDEBUG -std=c++17 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -mavx2 -mf16c -mfma -fopenmp -MD -MT CMakeFiles/fbgemm_gpu_tbe_cache.dir/src/split_embeddings_cache/lru_cache_populate_byte.cpp.o -MF CMakeFiles/fbgemm_gpu_tbe_cache.dir/src/split_embeddings_cache/lru_cache_populate_byte.cpp.o.d -o 
CMakeFiles/fbgemm_gpu_tbe_cache.dir/src/split_embeddings_cache/lru_cache_populate_byte.cpp.o -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/src/split_embeddings_cache/lru_cache_populate_byte.cpp 2025-05-07T19:52:30.3775413Z clang-16: warning: argument unused during compilation: '-L/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/lib' [-Wunused-command-line-argument] 2025-05-07T19:52:30.4042537Z [69/608] /github/home/miniconda/envs/build_binary/bin/c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_cache_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/lib -fopenmp=libgomp -stdlib=libstdc++ -I/github/home/miniconda/envs/build_binary/include -O3 -DNDEBUG -std=c++17 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -mavx2 
-mf16c -mfma -fopenmp -MD -MT CMakeFiles/fbgemm_gpu_tbe_cache.dir/src/split_embeddings_cache/lfu_cache_populate_byte.cpp.o -MF CMakeFiles/fbgemm_gpu_tbe_cache.dir/src/split_embeddings_cache/lfu_cache_populate_byte.cpp.o.d -o CMakeFiles/fbgemm_gpu_tbe_cache.dir/src/split_embeddings_cache/lfu_cache_populate_byte.cpp.o -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/src/split_embeddings_cache/lfu_cache_populate_byte.cpp 2025-05-07T19:52:30.4058921Z clang-16: warning: argument unused during compilation: '-L/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/lib' [-Wunused-command-line-argument] 2025-05-07T19:52:30.4211152Z [70/608] /github/home/miniconda/envs/build_binary/bin/c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_cache_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA 
-L/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/lib -fopenmp=libgomp -stdlib=libstdc++ -I/github/home/miniconda/envs/build_binary/include -O3 -DNDEBUG -std=c++17 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -mavx2 -mf16c -mfma -fopenmp -MD -MT CMakeFiles/fbgemm_gpu_tbe_cache.dir/src/split_embeddings_cache/lxu_cache.cpp.o -MF CMakeFiles/fbgemm_gpu_tbe_cache.dir/src/split_embeddings_cache/lxu_cache.cpp.o.d -o CMakeFiles/fbgemm_gpu_tbe_cache.dir/src/split_embeddings_cache/lxu_cache.cpp.o -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/src/split_embeddings_cache/lxu_cache.cpp 2025-05-07T19:52:30.4227329Z clang-16: warning: argument unused during compilation: '-L/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/lib' [-Wunused-command-line-argument] 2025-05-07T19:52:30.5117542Z [71/608] /github/home/miniconda/envs/build_binary/bin/c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_cache_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem 
/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/lib -fopenmp=libgomp -stdlib=libstdc++ -I/github/home/miniconda/envs/build_binary/include -O3 -DNDEBUG -std=c++17 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -mavx2 -mf16c -mfma -fopenmp -MD -MT CMakeFiles/fbgemm_gpu_tbe_cache.dir/src/split_embeddings_cache/linearize_cache_indices.cpp.o -MF CMakeFiles/fbgemm_gpu_tbe_cache.dir/src/split_embeddings_cache/linearize_cache_indices.cpp.o.d -o CMakeFiles/fbgemm_gpu_tbe_cache.dir/src/split_embeddings_cache/linearize_cache_indices.cpp.o -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/src/split_embeddings_cache/linearize_cache_indices.cpp 2025-05-07T19:52:30.5132933Z clang-16: warning: argument unused during compilation: '-L/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/lib' [-Wunused-command-line-argument] 2025-05-07T19:52:30.6891120Z [72/608] /github/home/miniconda/envs/build_binary/bin/c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem 
/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/lib -fopenmp=libgomp -stdlib=libstdc++ -I/github/home/miniconda/envs/build_binary/include -O3 -DNDEBUG -std=c++17 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -MD -MT CMakeFiles/fbgemm.dir/__w/FBGEMM/FBGEMM/src/EmbeddingSpMDMAutovec.cc.o -MF CMakeFiles/fbgemm.dir/__w/FBGEMM/FBGEMM/src/EmbeddingSpMDMAutovec.cc.o.d -o CMakeFiles/fbgemm.dir/__w/FBGEMM/FBGEMM/src/EmbeddingSpMDMAutovec.cc.o -c /__w/FBGEMM/FBGEMM/src/EmbeddingSpMDMAutovec.cc 2025-05-07T19:52:30.6907404Z clang-16: warning: argument unused during compilation: '-L/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/lib' [-Wunused-command-line-argument] 2025-05-07T19:52:31.3251135Z [73/608] /github/home/miniconda/envs/build_binary/bin/c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_cache_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem 
/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/lib -fopenmp=libgomp -stdlib=libstdc++ -I/github/home/miniconda/envs/build_binary/include -O3 -DNDEBUG -std=c++17 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -mavx2 -mf16c -mfma -fopenmp -MD -MT CMakeFiles/fbgemm_gpu_tbe_cache.dir/src/split_embeddings_cache/split_embeddings_cache_ops.cpp.o -MF CMakeFiles/fbgemm_gpu_tbe_cache.dir/src/split_embeddings_cache/split_embeddings_cache_ops.cpp.o.d -o CMakeFiles/fbgemm_gpu_tbe_cache.dir/src/split_embeddings_cache/split_embeddings_cache_ops.cpp.o -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/src/split_embeddings_cache/split_embeddings_cache_ops.cpp 2025-05-07T19:52:31.3270360Z clang-16: warning: argument unused during compilation: '-L/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/lib' [-Wunused-command-line-argument] 2025-05-07T19:52:31.7323772Z [74/608] /github/home/miniconda/envs/build_binary/bin/c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_inference_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include 
-I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/lib -fopenmp=libgomp -stdlib=libstdc++ -I/github/home/miniconda/envs/build_binary/include -O3 -DNDEBUG -std=c++17 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -mavx2 -mf16c -mfma -fopenmp -MD -MT CMakeFiles/fbgemm_gpu_tbe_inference.dir/codegen/inference/embedding_forward_quantized_host.cpp.o -MF CMakeFiles/fbgemm_gpu_tbe_inference.dir/codegen/inference/embedding_forward_quantized_host.cpp.o.d -o CMakeFiles/fbgemm_gpu_tbe_inference.dir/codegen/inference/embedding_forward_quantized_host.cpp.o -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/codegen/inference/embedding_forward_quantized_host.cpp 2025-05-07T19:52:31.7341169Z clang-16: warning: argument unused during compilation: '-L/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/lib' [-Wunused-command-line-argument] 2025-05-07T19:52:32.7485376Z [75/608] /github/home/miniconda/envs/build_binary/bin/c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_inference_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include 
-I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/lib -fopenmp=libgomp -stdlib=libstdc++ -I/github/home/miniconda/envs/build_binary/include -O3 -DNDEBUG -std=c++17 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -mavx2 -mf16c -mfma -fopenmp -MD -MT CMakeFiles/fbgemm_gpu_tbe_inference.dir/gen_embedding_forward_quantized_weighted_codegen_cpu.cpp.o -MF CMakeFiles/fbgemm_gpu_tbe_inference.dir/gen_embedding_forward_quantized_weighted_codegen_cpu.cpp.o.d -o CMakeFiles/fbgemm_gpu_tbe_inference.dir/gen_embedding_forward_quantized_weighted_codegen_cpu.cpp.o -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_forward_quantized_weighted_codegen_cpu.cpp 2025-05-07T19:52:32.7504435Z clang-16: warning: argument unused during compilation: 
'-L/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/lib' [-Wunused-command-line-argument] 2025-05-07T19:52:33.2490473Z [76/608] /github/home/miniconda/envs/build_binary/bin/c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_config_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/lib -fopenmp=libgomp -stdlib=libstdc++ -I/github/home/miniconda/envs/build_binary/include -O3 -DNDEBUG -std=c++17 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -mavx2 -mf16c -mfma -fopenmp -MD -MT CMakeFiles/fbgemm_gpu_config.dir/src/config/feature_gates.cpp.o -MF CMakeFiles/fbgemm_gpu_config.dir/src/config/feature_gates.cpp.o.d -o CMakeFiles/fbgemm_gpu_config.dir/src/config/feature_gates.cpp.o -c 
/__w/FBGEMM/FBGEMM/fbgemm_gpu/src/config/feature_gates.cpp 2025-05-07T19:52:33.2507050Z clang-16: warning: argument unused during compilation: '-L/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/lib' [-Wunused-command-line-argument] 2025-05-07T19:52:33.8617020Z [77/608] : && /github/home/miniconda/envs/build_binary/bin/c++ -fPIC -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/lib -fopenmp=libgomp -stdlib=libstdc++ -I/github/home/miniconda/envs/build_binary/include -O3 -DNDEBUG -Wl,-O2 -Wl,--sort-common -Wl,--as-needed -Wl,-z,relro -Wl,-z,now -Wl,--disable-new-dtags -Wl,--gc-sections -Wl,--allow-shlib-undefined -Wl,-rpath,/github/home/miniconda/envs/build_binary/lib -Wl,-rpath-link,/github/home/miniconda/envs/build_binary/lib -L/github/home/miniconda/envs/build_binary/lib -s -shared -Wl,-soname,fbgemm_gpu_config.so -o fbgemm_gpu_config.so CMakeFiles/fbgemm_gpu_config.dir/src/config/feature_gates.cpp.o -L/lib/intel64 -L/lib/intel64_win -L/lib/win-x64 -Wl,-rpath,/lib/intel64:/lib/intel64_win:/lib/win-x64:/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/lib:/github/home/miniconda/envs/build_binary/lib/stubs: /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/lib/libtorch.so /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/lib/libc10.so /github/home/miniconda/envs/build_binary/lib/libnvrtc.so /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/lib/libc10_cuda.so /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/lib/libnccl.so.2 /github/home/miniconda/envs/build_binary/lib/stubs/libcuda.so /github/home/miniconda/envs/build_binary/lib/stubs/libnvidia-ml.so -Wl,--no-as-needed,"/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/lib/libtorch_cpu.so" -Wl,--as-needed 
-Wl,--no-as-needed,"/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/lib/libtorch_cuda.so" -Wl,--as-needed /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/lib/libc10_cuda.so /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/lib/libc10.so /github/home/miniconda/envs/build_binary/lib/libcudart.so -Wl,--no-as-needed,"/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/lib/libtorch.so" -Wl,--as-needed && : 2025-05-07T19:52:34.3289990Z [78/608] /github/home/miniconda/envs/build_binary/bin/c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/lib -fopenmp=libgomp -stdlib=libstdc++ -I/github/home/miniconda/envs/build_binary/include 
-O3 -DNDEBUG -std=c++17 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -mavx2 -mf16c -mfma -fopenmp -MD -MT CMakeFiles/fbgemm.dir/__w/FBGEMM/FBGEMM/src/QuantUtilsAvx2.cc.o -MF CMakeFiles/fbgemm.dir/__w/FBGEMM/FBGEMM/src/QuantUtilsAvx2.cc.o.d -o CMakeFiles/fbgemm.dir/__w/FBGEMM/FBGEMM/src/QuantUtilsAvx2.cc.o -c /__w/FBGEMM/FBGEMM/src/QuantUtilsAvx2.cc 2025-05-07T19:52:34.3303330Z clang-16: warning: argument unused during compilation: '-L/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/lib' [-Wunused-command-line-argument] 2025-05-07T19:52:34.6252696Z [79/608] /github/home/miniconda/envs/build_binary/bin/c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_inference_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA 
-L/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/lib -fopenmp=libgomp -stdlib=libstdc++ -I/github/home/miniconda/envs/build_binary/include -O3 -DNDEBUG -std=c++17 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -mavx2 -mf16c -mfma -fopenmp -MD -MT CMakeFiles/fbgemm_gpu_tbe_inference.dir/gen_embedding_forward_quantized_unweighted_codegen_cpu.cpp.o -MF CMakeFiles/fbgemm_gpu_tbe_inference.dir/gen_embedding_forward_quantized_unweighted_codegen_cpu.cpp.o.d -o CMakeFiles/fbgemm_gpu_tbe_inference.dir/gen_embedding_forward_quantized_unweighted_codegen_cpu.cpp.o -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_forward_quantized_unweighted_codegen_cpu.cpp 2025-05-07T19:52:34.6271119Z clang-16: warning: argument unused during compilation: '-L/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/lib' [-Wunused-command-line-argument] 2025-05-07T19:52:37.5507412Z [80/608] /github/home/miniconda/envs/build_binary/bin/c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_utils_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include 
-I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/lib -fopenmp=libgomp -stdlib=libstdc++ -I/github/home/miniconda/envs/build_binary/include -O3 -DNDEBUG -std=c++17 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -mavx2 -mf16c -mfma -fopenmp -MD -MT CMakeFiles/fbgemm_gpu_tbe_utils.dir/src/split_embeddings_utils/split_embeddings_utils_cpu.cpp.o -MF CMakeFiles/fbgemm_gpu_tbe_utils.dir/src/split_embeddings_utils/split_embeddings_utils_cpu.cpp.o.d -o CMakeFiles/fbgemm_gpu_tbe_utils.dir/src/split_embeddings_utils/split_embeddings_utils_cpu.cpp.o -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/src/split_embeddings_utils/split_embeddings_utils_cpu.cpp 2025-05-07T19:52:37.5526606Z clang-16: warning: argument unused during compilation: '-L/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/lib' [-Wunused-command-line-argument] 2025-05-07T19:52:38.5832375Z [81/608] /github/home/miniconda/envs/build_binary/bin/c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_utils_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include 
-I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/lib -fopenmp=libgomp -stdlib=libstdc++ -I/github/home/miniconda/envs/build_binary/include -O3 -DNDEBUG -std=c++17 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -mavx2 -mf16c -mfma -fopenmp -MD -MT CMakeFiles/fbgemm_gpu_tbe_utils.dir/src/split_embeddings_utils/split_embeddings_utils.cpp.o -MF CMakeFiles/fbgemm_gpu_tbe_utils.dir/src/split_embeddings_utils/split_embeddings_utils.cpp.o.d -o CMakeFiles/fbgemm_gpu_tbe_utils.dir/src/split_embeddings_utils/split_embeddings_utils.cpp.o -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/src/split_embeddings_utils/split_embeddings_utils.cpp 2025-05-07T19:52:38.5851464Z clang-16: warning: argument unused during compilation: '-L/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/lib' [-Wunused-command-line-argument] 2025-05-07T19:52:39.2647210Z [82/608] /github/home/miniconda/envs/build_binary/bin/c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_utils_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include 
-I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/lib -fopenmp=libgomp -stdlib=libstdc++ -I/github/home/miniconda/envs/build_binary/include -O3 -DNDEBUG -std=c++17 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -mavx2 -mf16c -mfma -fopenmp -MD -MT CMakeFiles/fbgemm_gpu_tbe_utils.dir/src/split_embeddings_utils/split_embeddings_utils_meta.cpp.o -MF CMakeFiles/fbgemm_gpu_tbe_utils.dir/src/split_embeddings_utils/split_embeddings_utils_meta.cpp.o.d -o CMakeFiles/fbgemm_gpu_tbe_utils.dir/src/split_embeddings_utils/split_embeddings_utils_meta.cpp.o -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/src/split_embeddings_utils/split_embeddings_utils_meta.cpp 2025-05-07T19:52:39.2664781Z clang-16: warning: argument unused during compilation: '-L/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/lib' [-Wunused-command-line-argument] 2025-05-07T19:52:41.1354852Z [83/608] /github/home/miniconda/envs/build_binary/bin/c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE 
-Dfbgemm_gpu_tbe_inference_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/lib -fopenmp=libgomp -stdlib=libstdc++ -I/github/home/miniconda/envs/build_binary/include -O3 -DNDEBUG -std=c++17 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -mavx2 -mf16c -mfma -fopenmp -MD -MT CMakeFiles/fbgemm_gpu_tbe_inference.dir/codegen/inference/embedding_forward_quantized_host_cpu.cpp.o -MF CMakeFiles/fbgemm_gpu_tbe_inference.dir/codegen/inference/embedding_forward_quantized_host_cpu.cpp.o.d -o CMakeFiles/fbgemm_gpu_tbe_inference.dir/codegen/inference/embedding_forward_quantized_host_cpu.cpp.o -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/codegen/inference/embedding_forward_quantized_host_cpu.cpp 2025-05-07T19:52:41.1374075Z clang-16: warning: argument unused during compilation: 
'-L/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/lib' [-Wunused-command-line-argument] 2025-05-07T19:52:41.3426460Z [84/608] /github/home/miniconda/envs/build_binary/bin/c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_sparse_async_cumsum_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/lib -fopenmp=libgomp -stdlib=libstdc++ -I/github/home/miniconda/envs/build_binary/include -O3 -DNDEBUG -std=c++17 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -mavx2 -mf16c -mfma -fopenmp -MD -MT CMakeFiles/fbgemm_gpu_sparse_async_cumsum.dir/src/sparse_ops/sparse_async_cumsum.cpp.o -MF CMakeFiles/fbgemm_gpu_sparse_async_cumsum.dir/src/sparse_ops/sparse_async_cumsum.cpp.o.d -o 
CMakeFiles/fbgemm_gpu_sparse_async_cumsum.dir/src/sparse_ops/sparse_async_cumsum.cpp.o -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/src/sparse_ops/sparse_async_cumsum.cpp 2025-05-07T19:52:41.3443731Z clang-16: warning: argument unused during compilation: '-L/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/lib' [-Wunused-command-line-argument] 2025-05-07T19:52:41.6596778Z [85/608] /github/home/miniconda/envs/build_binary/bin/c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_common_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/lib -fopenmp=libgomp -stdlib=libstdc++ -I/github/home/miniconda/envs/build_binary/include -O3 -DNDEBUG -std=c++17 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -mavx2 -mf16c -mfma -fopenmp 
-MD -MT CMakeFiles/fbgemm_gpu_tbe_common.dir/codegen/training/pt2/pt2_autograd_utils.cpp.o -MF CMakeFiles/fbgemm_gpu_tbe_common.dir/codegen/training/pt2/pt2_autograd_utils.cpp.o.d -o CMakeFiles/fbgemm_gpu_tbe_common.dir/codegen/training/pt2/pt2_autograd_utils.cpp.o -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/codegen/training/pt2/pt2_autograd_utils.cpp 2025-05-07T19:52:41.6613554Z clang-16: warning: argument unused during compilation: '-L/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/lib' [-Wunused-command-line-argument] 2025-05-07T19:52:43.3487500Z [86/608] /github/home/miniconda/envs/build_binary/bin/c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_common_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/lib -fopenmp=libgomp -stdlib=libstdc++ 
-I/github/home/miniconda/envs/build_binary/include -O3 -DNDEBUG -std=c++17 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -mavx2 -mf16c -mfma -fopenmp -MD -MT CMakeFiles/fbgemm_gpu_tbe_common.dir/codegen/utils/embedding_bounds_check_host.cpp.o -MF CMakeFiles/fbgemm_gpu_tbe_common.dir/codegen/utils/embedding_bounds_check_host.cpp.o.d -o CMakeFiles/fbgemm_gpu_tbe_common.dir/codegen/utils/embedding_bounds_check_host.cpp.o -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/codegen/utils/embedding_bounds_check_host.cpp 2025-05-07T19:52:43.3505670Z clang-16: warning: argument unused during compilation: '-L/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/lib' [-Wunused-command-line-argument] 2025-05-07T19:52:44.8581238Z [87/608] /github/home/miniconda/envs/build_binary/bin/c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_common_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem 
/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/lib -fopenmp=libgomp -stdlib=libstdc++ -I/github/home/miniconda/envs/build_binary/include -O3 -DNDEBUG -std=c++17 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -mavx2 -mf16c -mfma -fopenmp -MD -MT CMakeFiles/fbgemm_gpu_tbe_common.dir/codegen/utils/embedding_bounds_check_host_cpu.cpp.o -MF CMakeFiles/fbgemm_gpu_tbe_common.dir/codegen/utils/embedding_bounds_check_host_cpu.cpp.o.d -o CMakeFiles/fbgemm_gpu_tbe_common.dir/codegen/utils/embedding_bounds_check_host_cpu.cpp.o -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/codegen/utils/embedding_bounds_check_host_cpu.cpp 2025-05-07T19:52:44.8598903Z clang-16: warning: argument unused during compilation: '-L/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/lib' [-Wunused-command-line-argument] 2025-05-07T19:52:48.3610257Z [88/608] /github/home/miniconda/envs/build_binary/bin/c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include 
-I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/lib -fopenmp=libgomp -stdlib=libstdc++ -I/github/home/miniconda/envs/build_binary/include -O3 -DNDEBUG -std=c++17 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -MD -MT CMakeFiles/fbgemm.dir/__w/FBGEMM/FBGEMM/src/EmbeddingSpMDMNBit.cc.o -MF CMakeFiles/fbgemm.dir/__w/FBGEMM/FBGEMM/src/EmbeddingSpMDMNBit.cc.o.d -o CMakeFiles/fbgemm.dir/__w/FBGEMM/FBGEMM/src/EmbeddingSpMDMNBit.cc.o -c /__w/FBGEMM/FBGEMM/src/EmbeddingSpMDMNBit.cc 2025-05-07T19:52:48.3627585Z clang-16: warning: argument unused during compilation: '-L/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/lib' [-Wunused-command-line-argument] 2025-05-07T19:52:50.3443518Z [89/608] /github/home/miniconda/envs/build_binary/bin/c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_common_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem 
/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/lib -fopenmp=libgomp -stdlib=libstdc++ -I/github/home/miniconda/envs/build_binary/include -O3 -DNDEBUG -std=c++17 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -mavx2 -mf16c -mfma -fopenmp -MD -MT CMakeFiles/fbgemm_gpu_tbe_common.dir/codegen/training/forward/embedding_forward_split_cpu.cpp.o -MF CMakeFiles/fbgemm_gpu_tbe_common.dir/codegen/training/forward/embedding_forward_split_cpu.cpp.o.d -o CMakeFiles/fbgemm_gpu_tbe_common.dir/codegen/training/forward/embedding_forward_split_cpu.cpp.o -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/codegen/training/forward/embedding_forward_split_cpu.cpp 2025-05-07T19:52:50.3461266Z clang-16: warning: argument unused during compilation: '-L/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/lib' [-Wunused-command-line-argument] 2025-05-07T19:52:51.4525723Z [90/608] /github/home/miniconda/envs/build_binary/bin/c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_optimizers_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include 
-I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/lib -fopenmp=libgomp -stdlib=libstdc++ -I/github/home/miniconda/envs/build_binary/include -O3 -DNDEBUG -std=c++17 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -mavx2 -mf16c -mfma -fopenmp -MD -MT CMakeFiles/fbgemm_gpu_tbe_optimizers.dir/gen_embedding_optimizer_rowwise_adagrad_split.cpp.o -MF CMakeFiles/fbgemm_gpu_tbe_optimizers.dir/gen_embedding_optimizer_rowwise_adagrad_split.cpp.o.d -o CMakeFiles/fbgemm_gpu_tbe_optimizers.dir/gen_embedding_optimizer_rowwise_adagrad_split.cpp.o -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_optimizer_rowwise_adagrad_split.cpp 2025-05-07T19:52:51.4539835Z clang-16: warning: argument unused during compilation: '-L/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/lib' [-Wunused-command-line-argument] 2025-05-07T19:52:51.7196412Z [91/608] /github/home/miniconda/envs/build_binary/bin/c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_forward_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include 
-I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/lib -fopenmp=libgomp -stdlib=libstdc++ -I/github/home/miniconda/envs/build_binary/include -O3 -DNDEBUG -std=c++17 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -mavx2 -mf16c -mfma -fopenmp -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_forward.dir/gen_embedding_forward_split_weighted_codegen_meta.cpp.o -MF CMakeFiles/fbgemm_gpu_tbe_training_forward.dir/gen_embedding_forward_split_weighted_codegen_meta.cpp.o.d -o CMakeFiles/fbgemm_gpu_tbe_training_forward.dir/gen_embedding_forward_split_weighted_codegen_meta.cpp.o -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_forward_split_weighted_codegen_meta.cpp 2025-05-07T19:52:51.7221011Z clang-16: warning: argument unused during compilation: '-L/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/lib' [-Wunused-command-line-argument] 
2025-05-07T19:52:53.2890554Z [92/608] /github/home/miniconda/envs/build_binary/bin/c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_forward_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/lib -fopenmp=libgomp -stdlib=libstdc++ -I/github/home/miniconda/envs/build_binary/include -O3 -DNDEBUG -std=c++17 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -mavx2 -mf16c -mfma -fopenmp -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_forward.dir/gen_embedding_forward_split_unweighted_codegen_meta.cpp.o -MF CMakeFiles/fbgemm_gpu_tbe_training_forward.dir/gen_embedding_forward_split_unweighted_codegen_meta.cpp.o.d -o CMakeFiles/fbgemm_gpu_tbe_training_forward.dir/gen_embedding_forward_split_unweighted_codegen_meta.cpp.o -c 
/__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_forward_split_unweighted_codegen_meta.cpp 2025-05-07T19:52:53.2910488Z clang-16: warning: argument unused during compilation: '-L/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/lib' [-Wunused-command-line-argument] 2025-05-07T19:52:55.4002810Z [93/608] /github/home/miniconda/envs/build_binary/bin/c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_forward_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/lib -fopenmp=libgomp -stdlib=libstdc++ -I/github/home/miniconda/envs/build_binary/include -O3 -DNDEBUG -std=c++17 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -mavx2 -mf16c -mfma -fopenmp -MD -MT 
CMakeFiles/fbgemm_gpu_tbe_training_forward.dir/gen_embedding_forward_split_pt2_cpu_wrapper.cpp.o -MF CMakeFiles/fbgemm_gpu_tbe_training_forward.dir/gen_embedding_forward_split_pt2_cpu_wrapper.cpp.o.d -o CMakeFiles/fbgemm_gpu_tbe_training_forward.dir/gen_embedding_forward_split_pt2_cpu_wrapper.cpp.o -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_forward_split_pt2_cpu_wrapper.cpp 2025-05-07T19:52:55.4020111Z clang-16: warning: argument unused during compilation: '-L/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/lib' [-Wunused-command-line-argument] 2025-05-07T19:52:56.4905256Z [94/608] /github/home/miniconda/envs/build_binary/bin/c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_forward_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA 
-L/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/lib -fopenmp=libgomp -stdlib=libstdc++ -I/github/home/miniconda/envs/build_binary/include -O3 -DNDEBUG -std=c++17 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -mavx2 -mf16c -mfma -fopenmp -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_forward.dir/gen_embedding_forward_dense_weighted_codegen_meta.cpp.o -MF CMakeFiles/fbgemm_gpu_tbe_training_forward.dir/gen_embedding_forward_dense_weighted_codegen_meta.cpp.o.d -o CMakeFiles/fbgemm_gpu_tbe_training_forward.dir/gen_embedding_forward_dense_weighted_codegen_meta.cpp.o -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_forward_dense_weighted_codegen_meta.cpp 2025-05-07T19:52:56.4922544Z clang-16: warning: argument unused during compilation: '-L/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/lib' [-Wunused-command-line-argument] 2025-05-07T19:52:58.5079444Z [95/608] /github/home/miniconda/envs/build_binary/bin/c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_forward_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include 
-I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/lib -fopenmp=libgomp -stdlib=libstdc++ -I/github/home/miniconda/envs/build_binary/include -O3 -DNDEBUG -std=c++17 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -mavx2 -mf16c -mfma -fopenmp -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_forward.dir/gen_embedding_forward_dense_unweighted_codegen_meta.cpp.o -MF CMakeFiles/fbgemm_gpu_tbe_training_forward.dir/gen_embedding_forward_dense_unweighted_codegen_meta.cpp.o.d -o CMakeFiles/fbgemm_gpu_tbe_training_forward.dir/gen_embedding_forward_dense_unweighted_codegen_meta.cpp.o -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_forward_dense_unweighted_codegen_meta.cpp 2025-05-07T19:52:58.5098826Z clang-16: warning: argument unused during compilation: '-L/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/lib' [-Wunused-command-line-argument] 2025-05-07T19:52:59.6900027Z [96/608] /github/home/miniconda/envs/build_binary/bin/c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_forward_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include 
-I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/lib -fopenmp=libgomp -stdlib=libstdc++ -I/github/home/miniconda/envs/build_binary/include -O3 -DNDEBUG -std=c++17 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -mavx2 -mf16c -mfma -fopenmp -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_forward.dir/gen_embedding_forward_ssd_weighted_codegen_meta.cpp.o -MF CMakeFiles/fbgemm_gpu_tbe_training_forward.dir/gen_embedding_forward_ssd_weighted_codegen_meta.cpp.o.d -o CMakeFiles/fbgemm_gpu_tbe_training_forward.dir/gen_embedding_forward_ssd_weighted_codegen_meta.cpp.o -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_forward_ssd_weighted_codegen_meta.cpp 2025-05-07T19:52:59.6918482Z clang-16: warning: argument unused during compilation: '-L/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/lib' [-Wunused-command-line-argument] 2025-05-07T19:53:00.0621314Z [97/608] /github/home/miniconda/envs/build_binary/bin/c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_forward_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include 
-I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/lib -fopenmp=libgomp -stdlib=libstdc++ -I/github/home/miniconda/envs/build_binary/include -O3 -DNDEBUG -std=c++17 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -mavx2 -mf16c -mfma -fopenmp -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_forward.dir/gen_embedding_forward_ssd_unweighted_codegen_meta.cpp.o -MF CMakeFiles/fbgemm_gpu_tbe_training_forward.dir/gen_embedding_forward_ssd_unweighted_codegen_meta.cpp.o.d -o CMakeFiles/fbgemm_gpu_tbe_training_forward.dir/gen_embedding_forward_ssd_unweighted_codegen_meta.cpp.o -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_forward_ssd_unweighted_codegen_meta.cpp 2025-05-07T19:53:00.0639624Z clang-16: warning: argument unused during compilation: '-L/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/lib' [-Wunused-command-line-argument] 
2025-05-07T19:53:01.3350788Z [98/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_inference_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++17 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ 
-D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_tbe_inference.dir/codegen/inference/embedding_forward_quantized_split_lookup.cu.o -MF CMakeFiles/fbgemm_gpu_tbe_inference.dir/codegen/inference/embedding_forward_quantized_split_lookup.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/codegen/inference/embedding_forward_quantized_split_lookup.cu -o CMakeFiles/fbgemm_gpu_tbe_inference.dir/codegen/inference/embedding_forward_quantized_split_lookup.cu.o 2025-05-07T19:53:01.3372311Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:53:01.3374333Z 2025-05-07T19:53:01.3376092Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:53:01.3378053Z 2025-05-07T19:53:01.3379778Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:53:01.3381581Z 2025-05-07T19:53:01.3382958Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:53:01.3384502Z 2025-05-07T19:53:01.3386020Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:53:01.3387763Z 2025-05-07T19:53:01.3389306Z 
/__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:53:01.3391072Z 2025-05-07T19:53:01.5267221Z [99/608] /github/home/miniconda/envs/build_binary/bin/c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_forward_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/lib -fopenmp=libgomp -stdlib=libstdc++ -I/github/home/miniconda/envs/build_binary/include -O3 -DNDEBUG -std=c++17 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -mavx2 -mf16c -mfma -fopenmp -MD -MT 
CMakeFiles/fbgemm_gpu_tbe_training_forward.dir/gen_embedding_forward_split_weighted_vbe_codegen_meta.cpp.o -MF CMakeFiles/fbgemm_gpu_tbe_training_forward.dir/gen_embedding_forward_split_weighted_vbe_codegen_meta.cpp.o.d -o CMakeFiles/fbgemm_gpu_tbe_training_forward.dir/gen_embedding_forward_split_weighted_vbe_codegen_meta.cpp.o -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_forward_split_weighted_vbe_codegen_meta.cpp 2025-05-07T19:53:01.5284854Z clang-16: warning: argument unused during compilation: '-L/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/lib' [-Wunused-command-line-argument] 2025-05-07T19:53:01.7777874Z [100/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_cache_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -isystem 
/github/home/miniconda/envs/build_binary/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++17 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_tbe_cache.dir/src/split_embeddings_cache/lfu_cache_populate_byte.cu.o -MF CMakeFiles/fbgemm_gpu_tbe_cache.dir/src/split_embeddings_cache/lfu_cache_populate_byte.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/src/split_embeddings_cache/lfu_cache_populate_byte.cu -o CMakeFiles/fbgemm_gpu_tbe_cache.dir/src/split_embeddings_cache/lfu_cache_populate_byte.cu.o 2025-05-07T19:53:01.7800547Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:53:01.7803049Z 2025-05-07T19:53:01.7804738Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:53:01.7806710Z 2025-05-07T19:53:01.7808420Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:53:01.7810341Z 2025-05-07T19:53:01.7812047Z 
/__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:53:01.7813959Z 2025-05-07T19:53:01.7815657Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:53:01.7817509Z 2025-05-07T19:53:01.7819174Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:53:01.7821118Z 2025-05-07T19:53:01.8693653Z [101/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_cache_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem 
/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++17 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_tbe_cache.dir/src/split_embeddings_cache/split_embeddings_cache_ops.cu.o -MF CMakeFiles/fbgemm_gpu_tbe_cache.dir/src/split_embeddings_cache/split_embeddings_cache_ops.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/src/split_embeddings_cache/split_embeddings_cache_ops.cu -o CMakeFiles/fbgemm_gpu_tbe_cache.dir/src/split_embeddings_cache/split_embeddings_cache_ops.cu.o 2025-05-07T19:53:01.8713676Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:53:01.8715312Z 2025-05-07T19:53:01.8716800Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:53:01.8718600Z 2025-05-07T19:53:01.8719819Z 
/__w/FBGEMM/FBGEMM/fbgemm_gpu/src/split_embeddings_cache/split_embeddings_cache_ops.cu(13): warning #177-D: variable "::TORCH_LIBRARY_FRAGMENT_static_init_fbgemm_2" was declared but never referenced 2025-05-07T19:53:01.8721095Z 2025-05-07T19:53:01.8722341Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:53:01.8723711Z 2025-05-07T19:53:01.8725330Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:53:01.8726996Z 2025-05-07T19:53:01.8728312Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/src/split_embeddings_cache/split_embeddings_cache_ops.cu(13): warning #177-D: variable "::TORCH_LIBRARY_FRAGMENT_static_init_fbgemm_2" was declared but never referenced 2025-05-07T19:53:01.8729870Z 2025-05-07T19:53:01.8731406Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:53:01.8733170Z 2025-05-07T19:53:01.8734735Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:53:01.8736629Z 2025-05-07T19:53:02.2173924Z [102/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_cache_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include 
-I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++17 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_tbe_cache.dir/src/split_embeddings_cache/lxu_cache.cu.o -MF CMakeFiles/fbgemm_gpu_tbe_cache.dir/src/split_embeddings_cache/lxu_cache.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/src/split_embeddings_cache/lxu_cache.cu -o 
CMakeFiles/fbgemm_gpu_tbe_cache.dir/src/split_embeddings_cache/lxu_cache.cu.o 2025-05-07T19:53:02.2196155Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:53:02.2198025Z 2025-05-07T19:53:02.2199568Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:53:02.2201330Z 2025-05-07T19:53:02.2203146Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:53:02.2204870Z 2025-05-07T19:53:02.2206445Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:53:02.2208170Z 2025-05-07T19:53:02.2209699Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:53:02.2211064Z 2025-05-07T19:53:02.2212544Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:53:02.2214335Z 2025-05-07T19:53:02.5397626Z [103/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED 
-DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_cache_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++17 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_tbe_cache.dir/src/split_embeddings_cache/lru_cache_populate.cu.o -MF 
CMakeFiles/fbgemm_gpu_tbe_cache.dir/src/split_embeddings_cache/lru_cache_populate.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/src/split_embeddings_cache/lru_cache_populate.cu -o CMakeFiles/fbgemm_gpu_tbe_cache.dir/src/split_embeddings_cache/lru_cache_populate.cu.o 2025-05-07T19:53:02.5418887Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:53:02.5420724Z 2025-05-07T19:53:02.5422445Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:53:02.5424409Z 2025-05-07T19:53:02.5426018Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:53:02.5427771Z 2025-05-07T19:53:02.5429389Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:53:02.5431197Z 2025-05-07T19:53:02.5432788Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:53:02.5434722Z 2025-05-07T19:53:02.5436328Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 
2025-05-07T19:53:02.5438272Z 2025-05-07T19:53:02.6066187Z [104/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_cache_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++17 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr 
-D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_tbe_cache.dir/src/split_embeddings_cache/reset_weight_momentum.cu.o -MF CMakeFiles/fbgemm_gpu_tbe_cache.dir/src/split_embeddings_cache/reset_weight_momentum.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/src/split_embeddings_cache/reset_weight_momentum.cu -o CMakeFiles/fbgemm_gpu_tbe_cache.dir/src/split_embeddings_cache/reset_weight_momentum.cu.o 2025-05-07T19:53:02.6088104Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:53:02.6089783Z 2025-05-07T19:53:02.6091372Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:53:02.6093334Z 2025-05-07T19:53:02.6094880Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:53:02.6096652Z 2025-05-07T19:53:02.6098205Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:53:02.6100044Z 2025-05-07T19:53:02.6101720Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:53:02.6103890Z 2025-05-07T19:53:02.6105625Z 
/__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:53:02.6107411Z 2025-05-07T19:53:03.0571482Z [105/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_cache_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -Xcudafe 
--diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++17 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_tbe_cache.dir/src/split_embeddings_cache/lru_cache_populate_byte.cu.o -MF CMakeFiles/fbgemm_gpu_tbe_cache.dir/src/split_embeddings_cache/lru_cache_populate_byte.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/src/split_embeddings_cache/lru_cache_populate_byte.cu -o CMakeFiles/fbgemm_gpu_tbe_cache.dir/src/split_embeddings_cache/lru_cache_populate_byte.cu.o 2025-05-07T19:53:03.0591289Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:53:03.0593238Z 2025-05-07T19:53:03.0594841Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:53:03.0596590Z 2025-05-07T19:53:03.0598218Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:53:03.0600106Z 2025-05-07T19:53:03.0601627Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a 
function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:53:03.0603684Z 2025-05-07T19:53:03.0605235Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:53:03.0606941Z 2025-05-07T19:53:03.0608559Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:53:03.0610298Z 2025-05-07T19:53:03.1294254Z [106/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_cache_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem 
/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++17 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_tbe_cache.dir/src/split_embeddings_cache/lfu_cache_populate.cu.o -MF CMakeFiles/fbgemm_gpu_tbe_cache.dir/src/split_embeddings_cache/lfu_cache_populate.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/src/split_embeddings_cache/lfu_cache_populate.cu -o CMakeFiles/fbgemm_gpu_tbe_cache.dir/src/split_embeddings_cache/lfu_cache_populate.cu.o 2025-05-07T19:53:03.1311322Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:53:03.1313036Z 2025-05-07T19:53:03.1314556Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:53:03.1316075Z 2025-05-07T19:53:03.1317356Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly 
defaulted on its first declaration 2025-05-07T19:53:03.1318792Z 2025-05-07T19:53:03.1320067Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:53:03.1321468Z 2025-05-07T19:53:03.1322662Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:53:03.1324021Z 2025-05-07T19:53:03.1325225Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:53:03.1326575Z 2025-05-07T19:53:03.6602679Z [107/608] /github/home/miniconda/envs/build_binary/bin/c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_forward_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include 
-I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/lib -fopenmp=libgomp -stdlib=libstdc++ -I/github/home/miniconda/envs/build_binary/include -O3 -DNDEBUG -std=c++17 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -mavx2 -mf16c -mfma -fopenmp -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_forward.dir/gen_embedding_forward_ssd_weighted_vbe_codegen_meta.cpp.o -MF CMakeFiles/fbgemm_gpu_tbe_training_forward.dir/gen_embedding_forward_ssd_weighted_vbe_codegen_meta.cpp.o.d -o CMakeFiles/fbgemm_gpu_tbe_training_forward.dir/gen_embedding_forward_ssd_weighted_vbe_codegen_meta.cpp.o -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_forward_ssd_weighted_vbe_codegen_meta.cpp 2025-05-07T19:53:03.6617517Z clang-16: warning: argument unused during compilation: '-L/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/lib' [-Wunused-command-line-argument] 2025-05-07T19:53:04.7750991Z [108/608] /github/home/miniconda/envs/build_binary/bin/c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_forward_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include 
-I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/lib -fopenmp=libgomp -stdlib=libstdc++ -I/github/home/miniconda/envs/build_binary/include -O3 -DNDEBUG -std=c++17 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -mavx2 -mf16c -mfma -fopenmp -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_forward.dir/gen_embedding_forward_split_unweighted_vbe_codegen_meta.cpp.o -MF CMakeFiles/fbgemm_gpu_tbe_training_forward.dir/gen_embedding_forward_split_unweighted_vbe_codegen_meta.cpp.o.d -o CMakeFiles/fbgemm_gpu_tbe_training_forward.dir/gen_embedding_forward_split_unweighted_vbe_codegen_meta.cpp.o -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_forward_split_unweighted_vbe_codegen_meta.cpp 2025-05-07T19:53:04.7768420Z clang-16: warning: argument unused during compilation: '-L/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/lib' [-Wunused-command-line-argument] 2025-05-07T19:53:05.5748752Z [109/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_utils_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu 
-I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++17 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -MD -MT CMakeFiles/fbgemm_gpu_tbe_utils.dir/src/split_embeddings_utils/generate_vbe_metadata.cu.o -MF CMakeFiles/fbgemm_gpu_tbe_utils.dir/src/split_embeddings_utils/generate_vbe_metadata.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/src/split_embeddings_utils/generate_vbe_metadata.cu -o CMakeFiles/fbgemm_gpu_tbe_utils.dir/src/split_embeddings_utils/generate_vbe_metadata.cu.o 
2025-05-07T19:53:06.3998320Z [110/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_utils_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++17 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -MD -MT 
CMakeFiles/fbgemm_gpu_tbe_utils.dir/src/split_embeddings_utils/get_infos_metadata.cu.o -MF CMakeFiles/fbgemm_gpu_tbe_utils.dir/src/split_embeddings_utils/get_infos_metadata.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/src/split_embeddings_utils/get_infos_metadata.cu -o CMakeFiles/fbgemm_gpu_tbe_utils.dir/src/split_embeddings_utils/get_infos_metadata.cu.o 2025-05-07T19:53:06.4018297Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:53:06.4020501Z 2025-05-07T19:53:06.4022140Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:53:06.4024035Z 2025-05-07T19:53:06.4025725Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:53:06.4027526Z 2025-05-07T19:53:06.4029130Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:53:06.4030987Z 2025-05-07T19:53:06.4032574Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:53:06.4034449Z 2025-05-07T19:53:06.4036051Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a 
function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:53:06.4037904Z 2025-05-07T19:53:06.7615749Z [111/608] /github/home/miniconda/envs/build_binary/bin/c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_forward_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/lib -fopenmp=libgomp -stdlib=libstdc++ -I/github/home/miniconda/envs/build_binary/include -O3 -DNDEBUG -std=c++17 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -mavx2 -mf16c -mfma -fopenmp -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_forward.dir/gen_embedding_forward_ssd_unweighted_vbe_codegen_meta.cpp.o -MF CMakeFiles/fbgemm_gpu_tbe_training_forward.dir/gen_embedding_forward_ssd_unweighted_vbe_codegen_meta.cpp.o.d -o 
CMakeFiles/fbgemm_gpu_tbe_training_forward.dir/gen_embedding_forward_ssd_unweighted_vbe_codegen_meta.cpp.o -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_forward_ssd_unweighted_vbe_codegen_meta.cpp 2025-05-07T19:53:06.7635287Z clang-16: warning: argument unused during compilation: '-L/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/lib' [-Wunused-command-line-argument] 2025-05-07T19:53:12.4848598Z [112/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_cache_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -Xcudafe 
--diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++17 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_tbe_cache.dir/src/split_embeddings_cache/lfu_cache_find.cu.o -MF CMakeFiles/fbgemm_gpu_tbe_cache.dir/src/split_embeddings_cache/lfu_cache_find.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/src/split_embeddings_cache/lfu_cache_find.cu -o CMakeFiles/fbgemm_gpu_tbe_cache.dir/src/split_embeddings_cache/lfu_cache_find.cu.o 2025-05-07T19:53:12.4868437Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:53:12.4870258Z 2025-05-07T19:53:12.4871857Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:53:12.4873774Z 2025-05-07T19:53:12.4875329Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:53:12.4877116Z 2025-05-07T19:53:12.4878723Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly 
defaulted on its first declaration 2025-05-07T19:53:12.4880531Z 2025-05-07T19:53:12.4882002Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:53:12.4883733Z 2025-05-07T19:53:12.4885318Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:53:12.4887139Z 2025-05-07T19:53:14.1422206Z [113/608] /github/home/miniconda/envs/build_binary/bin/c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_forward_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA 
-L/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/lib -fopenmp=libgomp -stdlib=libstdc++ -I/github/home/miniconda/envs/build_binary/include -O3 -DNDEBUG -std=c++17 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -mavx2 -mf16c -mfma -fopenmp -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_forward.dir/gen_embedding_forward_ssd_pt2_cuda_wrapper.cpp.o -MF CMakeFiles/fbgemm_gpu_tbe_training_forward.dir/gen_embedding_forward_ssd_pt2_cuda_wrapper.cpp.o.d -o CMakeFiles/fbgemm_gpu_tbe_training_forward.dir/gen_embedding_forward_ssd_pt2_cuda_wrapper.cpp.o -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_forward_ssd_pt2_cuda_wrapper.cpp 2025-05-07T19:53:14.1440528Z clang-16: warning: argument unused during compilation: '-L/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/lib' [-Wunused-command-line-argument] 2025-05-07T19:53:14.2845416Z [114/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_common_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include 
-I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++17 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_tbe_common.dir/codegen/utils/embedding_bounds_check_v1.cu.o -MF CMakeFiles/fbgemm_gpu_tbe_common.dir/codegen/utils/embedding_bounds_check_v1.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/codegen/utils/embedding_bounds_check_v1.cu -o CMakeFiles/fbgemm_gpu_tbe_common.dir/codegen/utils/embedding_bounds_check_v1.cu.o 2025-05-07T19:53:14.2864944Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:53:14.2866594Z 2025-05-07T19:53:14.2867955Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:53:14.2869489Z 2025-05-07T19:53:14.2870770Z 
/__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:53:14.2872295Z 2025-05-07T19:53:14.2873777Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:53:14.2875375Z 2025-05-07T19:53:14.2876742Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:53:14.2878332Z 2025-05-07T19:53:14.2879783Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:53:14.2881376Z 2025-05-07T19:53:15.6856694Z [115/608] /github/home/miniconda/envs/build_binary/bin/c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_forward_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem 
/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/lib -fopenmp=libgomp -stdlib=libstdc++ -I/github/home/miniconda/envs/build_binary/include -O3 -DNDEBUG -std=c++17 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -mavx2 -mf16c -mfma -fopenmp -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_forward.dir/gen_embedding_forward_split_pt2_cuda_wrapper.cpp.o -MF CMakeFiles/fbgemm_gpu_tbe_training_forward.dir/gen_embedding_forward_split_pt2_cuda_wrapper.cpp.o.d -o CMakeFiles/fbgemm_gpu_tbe_training_forward.dir/gen_embedding_forward_split_pt2_cuda_wrapper.cpp.o -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_forward_split_pt2_cuda_wrapper.cpp 2025-05-07T19:53:15.6876477Z clang-16: warning: argument unused during compilation: '-L/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/lib' [-Wunused-command-line-argument] 2025-05-07T19:53:16.8983893Z [116/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_cache_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include 
-I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++17 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_tbe_cache.dir/src/split_embeddings_cache/lru_cache_find.cu.o -MF CMakeFiles/fbgemm_gpu_tbe_cache.dir/src/split_embeddings_cache/lru_cache_find.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/src/split_embeddings_cache/lru_cache_find.cu -o CMakeFiles/fbgemm_gpu_tbe_cache.dir/src/split_embeddings_cache/lru_cache_find.cu.o 2025-05-07T19:53:16.9006077Z 
/__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:53:16.9008012Z 2025-05-07T19:53:16.9009721Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:53:16.9011622Z 2025-05-07T19:53:16.9013282Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:53:16.9015159Z 2025-05-07T19:53:16.9016828Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:53:16.9018687Z 2025-05-07T19:53:16.9020320Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:53:16.9022115Z 2025-05-07T19:53:16.9023942Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:53:16.9025546Z 2025-05-07T19:53:18.7985580Z [117/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_common_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include 
-I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++17 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_tbe_common.dir/codegen/utils/embedding_bounds_check_v2.cu.o -MF CMakeFiles/fbgemm_gpu_tbe_common.dir/codegen/utils/embedding_bounds_check_v2.cu.o.d -x cu -c 
/__w/FBGEMM/FBGEMM/fbgemm_gpu/codegen/utils/embedding_bounds_check_v2.cu -o CMakeFiles/fbgemm_gpu_tbe_common.dir/codegen/utils/embedding_bounds_check_v2.cu.o 2025-05-07T19:53:18.8007270Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:53:18.8009212Z 2025-05-07T19:53:18.8010870Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:53:18.8012762Z 2025-05-07T19:53:18.8014399Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:53:18.8016275Z 2025-05-07T19:53:18.8017896Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:53:18.8019735Z 2025-05-07T19:53:18.8021392Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:53:18.8023569Z 2025-05-07T19:53:18.8025230Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:53:18.8027056Z 2025-05-07T19:53:19.7241787Z [118/608] /github/home/miniconda/envs/build_binary/bin/nvcc 
-forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_optimizers_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++17 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT 
CMakeFiles/fbgemm_gpu_tbe_optimizers.dir/gen_embedding_optimizer_rowwise_adagrad_split_cuda.cu.o -MF CMakeFiles/fbgemm_gpu_tbe_optimizers.dir/gen_embedding_optimizer_rowwise_adagrad_split_cuda.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_optimizer_rowwise_adagrad_split_cuda.cu -o CMakeFiles/fbgemm_gpu_tbe_optimizers.dir/gen_embedding_optimizer_rowwise_adagrad_split_cuda.cu.o 2025-05-07T19:53:19.7263379Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:53:19.7265060Z 2025-05-07T19:53:19.7266215Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:53:19.7267489Z 2025-05-07T19:53:19.7268713Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:53:19.7270296Z 2025-05-07T19:53:19.7271820Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:53:19.7273683Z 2025-05-07T19:53:19.7275215Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:53:19.7277214Z 2025-05-07T19:53:19.7278876Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): 
warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:53:19.7280842Z 2025-05-07T19:53:20.4307790Z [119/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_inference_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 
-DNDEBUG -std=c++17 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_tbe_inference.dir/gen_embedding_forward_quantized_split_nbit_host_unweighted_nobag_codegen_cuda.cu.o -MF CMakeFiles/fbgemm_gpu_tbe_inference.dir/gen_embedding_forward_quantized_split_nbit_host_unweighted_nobag_codegen_cuda.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_forward_quantized_split_nbit_host_unweighted_nobag_codegen_cuda.cu -o CMakeFiles/fbgemm_gpu_tbe_inference.dir/gen_embedding_forward_quantized_split_nbit_host_unweighted_nobag_codegen_cuda.cu.o 2025-05-07T19:53:20.4331039Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:53:20.4332986Z 2025-05-07T19:53:20.4334699Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:53:20.4336603Z 2025-05-07T19:53:20.4338140Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:53:20.4339545Z 2025-05-07T19:53:20.4341115Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:53:20.4343470Z 2025-05-07T19:53:20.4345152Z 
/__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:53:20.4347073Z 2025-05-07T19:53:20.4348973Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:53:20.4350887Z 2025-05-07T19:53:21.7355876Z [120/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_inference_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 
-gencode arch=compute_80,code=sm_80 -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++17 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_tbe_inference.dir/gen_embedding_forward_quantized_split_nbit_host_unweighted_codegen_cuda.cu.o -MF CMakeFiles/fbgemm_gpu_tbe_inference.dir/gen_embedding_forward_quantized_split_nbit_host_unweighted_codegen_cuda.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_forward_quantized_split_nbit_host_unweighted_codegen_cuda.cu -o CMakeFiles/fbgemm_gpu_tbe_inference.dir/gen_embedding_forward_quantized_split_nbit_host_unweighted_codegen_cuda.cu.o 2025-05-07T19:53:21.7379317Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:53:21.7381233Z 2025-05-07T19:53:21.7382923Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:53:21.7384903Z 2025-05-07T19:53:21.7386596Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:53:21.7388984Z 
2025-05-07T19:53:21.7390628Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:53:21.7392531Z 2025-05-07T19:53:21.7394560Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:53:21.7396459Z 2025-05-07T19:53:21.7398143Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:53:21.7400136Z 2025-05-07T19:53:22.0023839Z [121/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_inference_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem 
/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++17 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_tbe_inference.dir/gen_embedding_forward_quantized_split_nbit_host_weighted_codegen_cuda.cu.o -MF CMakeFiles/fbgemm_gpu_tbe_inference.dir/gen_embedding_forward_quantized_split_nbit_host_weighted_codegen_cuda.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_forward_quantized_split_nbit_host_weighted_codegen_cuda.cu -o CMakeFiles/fbgemm_gpu_tbe_inference.dir/gen_embedding_forward_quantized_split_nbit_host_weighted_codegen_cuda.cu.o 2025-05-07T19:53:22.0047101Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:53:22.0049055Z 2025-05-07T19:53:22.0050784Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 
2025-05-07T19:53:22.0052735Z 2025-05-07T19:53:22.0054425Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:53:22.0056610Z 2025-05-07T19:53:22.0058492Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:53:22.0060360Z 2025-05-07T19:53:22.0061957Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:53:22.0063792Z 2025-05-07T19:53:22.0065491Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:53:22.0067405Z 2025-05-07T19:53:23.5701884Z [122/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_sparse_async_cumsum_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include 
-I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++17 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -MD -MT CMakeFiles/fbgemm_gpu_sparse_async_cumsum.dir/src/sparse_ops/sparse_async_cumsum.cu.o -MF CMakeFiles/fbgemm_gpu_sparse_async_cumsum.dir/src/sparse_ops/sparse_async_cumsum.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/src/sparse_ops/sparse_async_cumsum.cu -o CMakeFiles/fbgemm_gpu_sparse_async_cumsum.dir/src/sparse_ops/sparse_async_cumsum.cu.o 2025-05-07T19:53:23.5723216Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:53:23.5725142Z 2025-05-07T19:53:23.5726859Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is 
ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:53:23.5728728Z 2025-05-07T19:53:23.5730362Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:53:23.5732723Z 2025-05-07T19:53:23.5734456Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:53:23.5736486Z 2025-05-07T19:53:23.5738199Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:53:23.5740047Z 2025-05-07T19:53:23.5741700Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:53:23.5743596Z 2025-05-07T19:53:32.4513931Z [123/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_cache_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include 
-I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++17 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_tbe_cache.dir/src/split_embeddings_cache/linearize_cache_indices.cu.o -MF CMakeFiles/fbgemm_gpu_tbe_cache.dir/src/split_embeddings_cache/linearize_cache_indices.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/src/split_embeddings_cache/linearize_cache_indices.cu -o CMakeFiles/fbgemm_gpu_tbe_cache.dir/src/split_embeddings_cache/linearize_cache_indices.cu.o 2025-05-07T19:53:32.4525771Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted 
on its first declaration 2025-05-07T19:53:32.4526754Z 2025-05-07T19:53:32.4527648Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:53:32.4528802Z 2025-05-07T19:53:32.4529759Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:53:32.4531068Z 2025-05-07T19:53:32.4532041Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:53:32.4533013Z 2025-05-07T19:53:32.4534235Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:53:32.4535732Z 2025-05-07T19:53:32.4537061Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:53:32.4538739Z 2025-05-07T19:53:33.1457433Z [124/608] : && /github/home/miniconda/envs/build_binary/bin/c++ -fPIC -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/lib -fopenmp=libgomp -stdlib=libstdc++ -I/github/home/miniconda/envs/build_binary/include -O3 -DNDEBUG -Wl,-O2 -Wl,--sort-common -Wl,--as-needed -Wl,-z,relro -Wl,-z,now -Wl,--disable-new-dtags -Wl,--gc-sections -Wl,--allow-shlib-undefined 
-Wl,-rpath,/github/home/miniconda/envs/build_binary/lib -Wl,-rpath-link,/github/home/miniconda/envs/build_binary/lib -L/github/home/miniconda/envs/build_binary/lib -s -shared -Wl,-soname,fbgemm_gpu_tbe_cache.so -o fbgemm_gpu_tbe_cache.so CMakeFiles/fbgemm_gpu_tbe_cache.dir/src/split_embeddings_cache/lfu_cache_populate_byte.cpp.o CMakeFiles/fbgemm_gpu_tbe_cache.dir/src/split_embeddings_cache/linearize_cache_indices.cpp.o CMakeFiles/fbgemm_gpu_tbe_cache.dir/src/split_embeddings_cache/lru_cache_populate_byte.cpp.o CMakeFiles/fbgemm_gpu_tbe_cache.dir/src/split_embeddings_cache/lxu_cache.cpp.o CMakeFiles/fbgemm_gpu_tbe_cache.dir/src/split_embeddings_cache/split_embeddings_cache_ops.cpp.o CMakeFiles/fbgemm_gpu_tbe_cache.dir/src/split_embeddings_cache/lfu_cache_find.cu.o CMakeFiles/fbgemm_gpu_tbe_cache.dir/src/split_embeddings_cache/lfu_cache_populate.cu.o CMakeFiles/fbgemm_gpu_tbe_cache.dir/src/split_embeddings_cache/lfu_cache_populate_byte.cu.o CMakeFiles/fbgemm_gpu_tbe_cache.dir/src/split_embeddings_cache/linearize_cache_indices.cu.o CMakeFiles/fbgemm_gpu_tbe_cache.dir/src/split_embeddings_cache/lru_cache_find.cu.o CMakeFiles/fbgemm_gpu_tbe_cache.dir/src/split_embeddings_cache/lru_cache_populate.cu.o CMakeFiles/fbgemm_gpu_tbe_cache.dir/src/split_embeddings_cache/lru_cache_populate_byte.cu.o CMakeFiles/fbgemm_gpu_tbe_cache.dir/src/split_embeddings_cache/lxu_cache.cu.o CMakeFiles/fbgemm_gpu_tbe_cache.dir/src/split_embeddings_cache/reset_weight_momentum.cu.o CMakeFiles/fbgemm_gpu_tbe_cache.dir/src/split_embeddings_cache/split_embeddings_cache_ops.cu.o -L/lib/intel64 -L/lib/intel64_win -L/lib/win-x64 -L/github/home/miniconda/envs/build_binary/lib/stubs -Wl,-rpath,/lib/intel64:/lib/intel64_win:/lib/win-x64:/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/lib:/github/home/miniconda/envs/build_binary/lib/stubs: /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/lib/libtorch.so 
/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/lib/libc10.so /github/home/miniconda/envs/build_binary/lib/libnvrtc.so /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/lib/libc10_cuda.so /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/lib/libnccl.so.2 /github/home/miniconda/envs/build_binary/lib/stubs/libcuda.so /github/home/miniconda/envs/build_binary/lib/stubs/libnvidia-ml.so -Wl,--no-as-needed,"/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/lib/libtorch_cpu.so" -Wl,--as-needed -Wl,--no-as-needed,"/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/lib/libtorch_cuda.so" -Wl,--as-needed /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/lib/libc10_cuda.so /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/lib/libc10.so /github/home/miniconda/envs/build_binary/lib/libcudart.so -Wl,--no-as-needed,"/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/lib/libtorch.so" -Wl,--as-needed -lcudadevrt -lcudart_static -ldl && : 2025-05-07T19:53:39.7725806Z [125/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_optimizers_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem 
/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++17 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_tbe_optimizers.dir/gen_embedding_optimizer_rowwise_adagrad_split_kernel.cu.o -MF CMakeFiles/fbgemm_gpu_tbe_optimizers.dir/gen_embedding_optimizer_rowwise_adagrad_split_kernel.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_optimizer_rowwise_adagrad_split_kernel.cu -o CMakeFiles/fbgemm_gpu_tbe_optimizers.dir/gen_embedding_optimizer_rowwise_adagrad_split_kernel.cu.o 2025-05-07T19:53:39.7746971Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:53:39.7748886Z 2025-05-07T19:53:39.7750631Z 
/__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:53:39.7752553Z 2025-05-07T19:53:39.7754266Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:53:39.7755738Z 2025-05-07T19:53:39.7757213Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:53:39.7759433Z 2025-05-07T19:53:39.7761115Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:53:39.7762927Z 2025-05-07T19:53:39.7764212Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:53:39.7765834Z 2025-05-07T19:53:40.2322329Z [126/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_utils_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include 
-I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++17 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -MD -MT CMakeFiles/fbgemm_gpu_tbe_utils.dir/src/split_embeddings_utils/transpose_embedding_input.cu.o -MF CMakeFiles/fbgemm_gpu_tbe_utils.dir/src/split_embeddings_utils/transpose_embedding_input.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/src/split_embeddings_utils/transpose_embedding_input.cu -o CMakeFiles/fbgemm_gpu_tbe_utils.dir/src/split_embeddings_utils/transpose_embedding_input.cu.o 2025-05-07T19:53:40.2343699Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") 
that is explicitly defaulted on its first declaration 2025-05-07T19:53:40.2345639Z 2025-05-07T19:53:40.2347293Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:53:40.2349212Z 2025-05-07T19:53:40.2350875Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:53:40.2352759Z 2025-05-07T19:53:40.2354577Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:53:40.2356986Z 2025-05-07T19:53:40.2358648Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:53:40.2360481Z 2025-05-07T19:53:40.2362357Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:53:40.2364274Z 2025-05-07T19:53:40.3834431Z [127/608] : && /github/home/miniconda/envs/build_binary/bin/c++ -fPIC -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/lib -fopenmp=libgomp -stdlib=libstdc++ -I/github/home/miniconda/envs/build_binary/include -O3 -DNDEBUG -Wl,-O2 -Wl,--sort-common -Wl,--as-needed -Wl,-z,relro -Wl,-z,now -Wl,--disable-new-dtags -Wl,--gc-sections -Wl,--allow-shlib-undefined 
-Wl,-rpath,/github/home/miniconda/envs/build_binary/lib -Wl,-rpath-link,/github/home/miniconda/envs/build_binary/lib -L/github/home/miniconda/envs/build_binary/lib -s -shared -Wl,-soname,fbgemm_gpu_tbe_optimizers.so -o fbgemm_gpu_tbe_optimizers.so CMakeFiles/fbgemm_gpu_tbe_optimizers.dir/gen_embedding_optimizer_rowwise_adagrad_split.cpp.o CMakeFiles/fbgemm_gpu_tbe_optimizers.dir/gen_embedding_optimizer_rowwise_adagrad_split_cuda.cu.o CMakeFiles/fbgemm_gpu_tbe_optimizers.dir/gen_embedding_optimizer_rowwise_adagrad_split_kernel.cu.o -L/lib/intel64 -L/lib/intel64_win -L/lib/win-x64 -L/github/home/miniconda/envs/build_binary/lib/stubs -Wl,-rpath,/lib/intel64:/lib/intel64_win:/lib/win-x64:/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/lib:/github/home/miniconda/envs/build_binary/lib/stubs: /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/lib/libtorch.so /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/lib/libc10.so /github/home/miniconda/envs/build_binary/lib/libnvrtc.so /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/lib/libc10_cuda.so /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/lib/libnccl.so.2 /github/home/miniconda/envs/build_binary/lib/stubs/libcuda.so /github/home/miniconda/envs/build_binary/lib/stubs/libnvidia-ml.so -Wl,--no-as-needed,"/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/lib/libtorch_cpu.so" -Wl,--as-needed -Wl,--no-as-needed,"/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/lib/libtorch_cuda.so" -Wl,--as-needed /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/lib/libc10_cuda.so /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/lib/libc10.so /github/home/miniconda/envs/build_binary/lib/libcudart.so 
-Wl,--no-as-needed,"/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/lib/libtorch.so" -Wl,--as-needed -lcudadevrt -lcudart_static -ldl && : 2025-05-07T19:53:42.8031144Z [128/608] /github/home/miniconda/envs/build_binary/bin/c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/lib -fopenmp=libgomp -stdlib=libstdc++ -I/github/home/miniconda/envs/build_binary/include -O3 -DNDEBUG -std=c++17 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -MD -MT CMakeFiles/fbgemm.dir/__w/FBGEMM/FBGEMM/src/EmbeddingSpMDM.cc.o -MF CMakeFiles/fbgemm.dir/__w/FBGEMM/FBGEMM/src/EmbeddingSpMDM.cc.o.d -o CMakeFiles/fbgemm.dir/__w/FBGEMM/FBGEMM/src/EmbeddingSpMDM.cc.o -c 
/__w/FBGEMM/FBGEMM/src/EmbeddingSpMDM.cc 2025-05-07T19:53:42.8053513Z clang-16: warning: argument unused during compilation: '-L/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/lib' [-Wunused-command-line-argument] 2025-05-07T19:53:42.9270750Z [129/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_forward_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -Xcudafe 
--diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++17 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_forward.dir/gen_embedding_forward_dense_unweighted_nobag_kernel.cu.o -MF CMakeFiles/fbgemm_gpu_tbe_training_forward.dir/gen_embedding_forward_dense_unweighted_nobag_kernel.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_forward_dense_unweighted_nobag_kernel.cu -o CMakeFiles/fbgemm_gpu_tbe_training_forward.dir/gen_embedding_forward_dense_unweighted_nobag_kernel.cu.o 2025-05-07T19:53:42.9299579Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:53:42.9301937Z 2025-05-07T19:53:42.9304304Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:53:42.9307039Z 2025-05-07T19:53:42.9309378Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:53:42.9311811Z 2025-05-07T19:53:42.9314053Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): 
warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:53:42.9316584Z 2025-05-07T19:53:42.9318738Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:53:42.9321017Z 2025-05-07T19:53:42.9323106Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:53:42.9325461Z 2025-05-07T19:53:43.4674384Z [130/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_inference_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem 
/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++17 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_tbe_inference.dir/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_nobag_fp32_codegen_cuda.cu.o -MF CMakeFiles/fbgemm_gpu_tbe_inference.dir/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_nobag_fp32_codegen_cuda.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_nobag_fp32_codegen_cuda.cu -o CMakeFiles/fbgemm_gpu_tbe_inference.dir/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_nobag_fp32_codegen_cuda.cu.o 2025-05-07T19:53:43.4705287Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:53:43.4707967Z 2025-05-07T19:53:43.4710253Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:53:43.4712492Z 
2025-05-07T19:53:43.4714628Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_nobag_fp32_codegen_cuda.cu(216): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:53:43.4716897Z 2025-05-07T19:53:43.4718910Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_nobag_fp32_codegen_cuda.cu(233): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:53:43.4721212Z 2025-05-07T19:53:43.4723175Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_nobag_fp32_codegen_cuda.cu(243): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:53:43.4725485Z 2025-05-07T19:53:43.4727535Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:53:43.4729881Z 2025-05-07T19:53:43.4732079Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:53:43.4734690Z 2025-05-07T19:53:43.4736721Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_nobag_fp32_codegen_cuda.cu(216): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:53:43.4739147Z 2025-05-07T19:53:43.4741396Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_nobag_fp32_codegen_cuda.cu(233): warning #186-D: pointless comparison of unsigned integer with zero 
2025-05-07T19:53:43.4743672Z 2025-05-07T19:53:43.4745647Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_nobag_fp32_codegen_cuda.cu(243): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:53:43.4747983Z 2025-05-07T19:53:43.4750120Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:53:43.4752588Z 2025-05-07T19:53:43.4754856Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:53:43.4757261Z 2025-05-07T19:53:43.4759210Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_nobag_fp32_codegen_cuda.cu(216): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:53:43.4761486Z 2025-05-07T19:53:43.4763523Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_nobag_fp32_codegen_cuda.cu(233): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:53:43.4765728Z 2025-05-07T19:53:43.4767704Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_nobag_fp32_codegen_cuda.cu(243): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:53:43.4770241Z 2025-05-07T19:53:43.5524917Z [131/608] : && /github/home/miniconda/envs/build_binary/bin/c++ -fPIC -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/lib 
-fopenmp=libgomp -stdlib=libstdc++ -I/github/home/miniconda/envs/build_binary/include -O3 -DNDEBUG -Wl,-O2 -Wl,--sort-common -Wl,--as-needed -Wl,-z,relro -Wl,-z,now -Wl,--disable-new-dtags -Wl,--gc-sections -Wl,--allow-shlib-undefined -Wl,-rpath,/github/home/miniconda/envs/build_binary/lib -Wl,-rpath-link,/github/home/miniconda/envs/build_binary/lib -L/github/home/miniconda/envs/build_binary/lib -s -shared -Wl,-soname,fbgemm.so -o fbgemm.so CMakeFiles/fbgemm.dir/__w/FBGEMM/FBGEMM/src/EmbeddingSpMDM.cc.o CMakeFiles/fbgemm.dir/__w/FBGEMM/FBGEMM/src/EmbeddingSpMDMAutovec.cc.o CMakeFiles/fbgemm.dir/__w/FBGEMM/FBGEMM/src/EmbeddingSpMDMNBit.cc.o CMakeFiles/fbgemm.dir/__w/FBGEMM/FBGEMM/src/QuantUtils.cc.o CMakeFiles/fbgemm.dir/__w/FBGEMM/FBGEMM/src/RefImplementations.cc.o CMakeFiles/fbgemm.dir/__w/FBGEMM/FBGEMM/src/RowWiseSparseAdagradFused.cc.o CMakeFiles/fbgemm.dir/__w/FBGEMM/FBGEMM/src/SparseAdagrad.cc.o CMakeFiles/fbgemm.dir/__w/FBGEMM/FBGEMM/src/Utils.cc.o CMakeFiles/fbgemm.dir/__w/FBGEMM/FBGEMM/src/EmbeddingSpMDMAvx2.cc.o CMakeFiles/fbgemm.dir/__w/FBGEMM/FBGEMM/src/QuantUtilsAvx2.cc.o CMakeFiles/fbgemm.dir/__w/FBGEMM/FBGEMM/src/EmbeddingSpMDMAvx512.cc.o -L/lib/intel64 -L/lib/intel64_win -L/lib/win-x64 -Wl,-rpath,"\$ORIGIN" /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/lib/libc10.so /github/home/miniconda/envs/build_binary/lib/libnvrtc.so /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/lib/libc10_cuda.so /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/lib/libnccl.so.2 /github/home/miniconda/envs/build_binary/lib/stubs/libcuda.so asmjit.so /github/home/miniconda/envs/build_binary/lib/stubs/libnvidia-ml.so -Wl,--no-as-needed,"/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/lib/libtorch.so" -Wl,--as-needed /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/lib/libtorch.so 
-Wl,--no-as-needed,"/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/lib/libtorch_cpu.so" -Wl,--as-needed -Wl,--no-as-needed,"/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/lib/libtorch_cuda.so" -Wl,--as-needed /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/lib/libc10_cuda.so /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/lib/libc10.so /github/home/miniconda/envs/build_binary/lib/libcudart.so && : 2025-05-07T19:53:44.2531092Z [132/608] : && /github/home/miniconda/envs/build_binary/bin/c++ -fPIC -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/lib -fopenmp=libgomp -stdlib=libstdc++ -I/github/home/miniconda/envs/build_binary/include -O3 -DNDEBUG -Wl,-O2 -Wl,--sort-common -Wl,--as-needed -Wl,-z,relro -Wl,-z,now -Wl,--disable-new-dtags -Wl,--gc-sections -Wl,--allow-shlib-undefined -Wl,-rpath,/github/home/miniconda/envs/build_binary/lib -Wl,-rpath-link,/github/home/miniconda/envs/build_binary/lib -L/github/home/miniconda/envs/build_binary/lib -s -shared -Wl,-soname,fbgemm_gpu_tbe_common.so -o fbgemm_gpu_tbe_common.so CMakeFiles/fbgemm_gpu_tbe_common.dir/codegen/utils/embedding_bounds_check_host_cpu.cpp.o CMakeFiles/fbgemm_gpu_tbe_common.dir/codegen/training/forward/embedding_forward_split_cpu.cpp.o CMakeFiles/fbgemm_gpu_tbe_common.dir/codegen/training/pt2/pt2_autograd_utils.cpp.o CMakeFiles/fbgemm_gpu_tbe_common.dir/codegen/utils/embedding_bounds_check_host.cpp.o CMakeFiles/fbgemm_gpu_tbe_common.dir/codegen/utils/embedding_bounds_check_v1.cu.o CMakeFiles/fbgemm_gpu_tbe_common.dir/codegen/utils/embedding_bounds_check_v2.cu.o -L/lib/intel64 -L/lib/intel64_win -L/lib/win-x64 -L/github/home/miniconda/envs/build_binary/lib/stubs -Wl,-rpath,"\$ORIGIN" /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/lib/libc10.so 
/github/home/miniconda/envs/build_binary/lib/libnvrtc.so /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/lib/libc10_cuda.so /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/lib/libnccl.so.2 /github/home/miniconda/envs/build_binary/lib/stubs/libcuda.so fbgemm.so fbgemm_gpu_config.so /github/home/miniconda/envs/build_binary/lib/stubs/libnvidia-ml.so -Wl,--no-as-needed,"/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/lib/libtorch.so" -Wl,--as-needed /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/lib/libtorch.so -Wl,--no-as-needed,"/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/lib/libtorch_cpu.so" -Wl,--as-needed -Wl,--no-as-needed,"/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/lib/libtorch_cuda.so" -Wl,--as-needed /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/lib/libc10_cuda.so /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/lib/libc10.so /github/home/miniconda/envs/build_binary/lib/libcudart.so -lcudadevrt -lcudart_static -ldl -Wl,-rpath-link,/__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build && : 2025-05-07T19:53:47.3222656Z [133/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_inference_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include 
-I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++17 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_tbe_inference.dir/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_fp32_codegen_cuda.cu.o -MF CMakeFiles/fbgemm_gpu_tbe_inference.dir/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_fp32_codegen_cuda.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_fp32_codegen_cuda.cu -o CMakeFiles/fbgemm_gpu_tbe_inference.dir/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_fp32_codegen_cuda.cu.o 2025-05-07T19:53:47.3243400Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: 
__host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:53:47.3245381Z 2025-05-07T19:53:47.3246942Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:53:47.3248447Z 2025-05-07T19:53:47.3249835Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_fp32_codegen_cuda.cu(250): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:53:47.3251420Z 2025-05-07T19:53:47.3252887Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_fp32_codegen_cuda.cu(255): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:53:47.3254499Z 2025-05-07T19:53:47.3255902Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_fp32_codegen_cuda.cu(272): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:53:47.3257504Z 2025-05-07T19:53:47.3259042Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_fp32_codegen_cuda.cu(284): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:53:47.3260713Z 2025-05-07T19:53:47.3262318Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:53:47.3264165Z 2025-05-07T19:53:47.3265771Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: 
__device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:53:47.3267683Z 2025-05-07T19:53:47.3269243Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_fp32_codegen_cuda.cu(250): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:53:47.3271032Z 2025-05-07T19:53:47.3272590Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_fp32_codegen_cuda.cu(255): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:53:47.3274509Z 2025-05-07T19:53:47.3276064Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_fp32_codegen_cuda.cu(272): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:53:47.3277874Z 2025-05-07T19:53:47.3279436Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_fp32_codegen_cuda.cu(284): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:53:47.3281121Z 2025-05-07T19:53:47.3282710Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:53:47.3284455Z 2025-05-07T19:53:47.3285724Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:53:47.3287394Z 2025-05-07T19:53:47.3288762Z 
/__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_fp32_codegen_cuda.cu(250): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:53:47.3290698Z 2025-05-07T19:53:47.3292162Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_fp32_codegen_cuda.cu(255): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:53:47.3293868Z 2025-05-07T19:53:47.3295342Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_fp32_codegen_cuda.cu(272): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:53:47.3297114Z 2025-05-07T19:53:47.3298540Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_fp32_codegen_cuda.cu(284): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:53:47.3300146Z 2025-05-07T19:53:49.5516511Z [134/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_inference_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem 
/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++17 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_tbe_inference.dir/gen_embedding_forward_quantized_split_nbit_kernel_weighted_fp32_codegen_cuda.cu.o -MF CMakeFiles/fbgemm_gpu_tbe_inference.dir/gen_embedding_forward_quantized_split_nbit_kernel_weighted_fp32_codegen_cuda.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_weighted_fp32_codegen_cuda.cu -o CMakeFiles/fbgemm_gpu_tbe_inference.dir/gen_embedding_forward_quantized_split_nbit_kernel_weighted_fp32_codegen_cuda.cu.o 2025-05-07T19:53:49.5539057Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on 
its first declaration 2025-05-07T19:53:49.5540957Z 2025-05-07T19:53:49.5542495Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:53:49.5544869Z 2025-05-07T19:53:49.5546380Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_weighted_fp32_codegen_cuda.cu(266): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:53:49.5548075Z 2025-05-07T19:53:49.5549918Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_weighted_fp32_codegen_cuda.cu(271): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:53:49.5551631Z 2025-05-07T19:53:49.5553269Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_weighted_fp32_codegen_cuda.cu(288): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:53:49.5554969Z 2025-05-07T19:53:49.5556449Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_weighted_fp32_codegen_cuda.cu(300): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:53:49.5558156Z 2025-05-07T19:53:49.5559852Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:53:49.5561712Z 2025-05-07T19:53:49.5563343Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its 
first declaration 2025-05-07T19:53:49.5565268Z 2025-05-07T19:53:49.5566811Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_weighted_fp32_codegen_cuda.cu(266): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:53:49.5568520Z 2025-05-07T19:53:49.5570054Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_weighted_fp32_codegen_cuda.cu(271): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:53:49.5571696Z 2025-05-07T19:53:49.5573253Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_weighted_fp32_codegen_cuda.cu(288): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:53:49.5574993Z 2025-05-07T19:53:49.5576369Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_weighted_fp32_codegen_cuda.cu(300): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:53:49.5577900Z 2025-05-07T19:53:49.5579373Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:53:49.5581046Z 2025-05-07T19:53:49.5582574Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:53:49.5584284Z 2025-05-07T19:53:49.5585742Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_weighted_fp32_codegen_cuda.cu(266): warning #186-D: pointless comparison of unsigned integer with zero 
2025-05-07T19:53:49.5587450Z 2025-05-07T19:53:49.5588978Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_weighted_fp32_codegen_cuda.cu(271): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:53:49.5590789Z 2025-05-07T19:53:49.5592097Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_weighted_fp32_codegen_cuda.cu(288): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:53:49.5594068Z 2025-05-07T19:53:49.5595634Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_weighted_fp32_codegen_cuda.cu(300): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:53:49.5597194Z 2025-05-07T19:53:54.0522616Z [135/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_forward_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include 
-isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++17 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_forward.dir/gen_embedding_forward_ssd_unweighted_nobag_kernel.cu.o -MF CMakeFiles/fbgemm_gpu_tbe_training_forward.dir/gen_embedding_forward_ssd_unweighted_nobag_kernel.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_forward_ssd_unweighted_nobag_kernel.cu -o CMakeFiles/fbgemm_gpu_tbe_training_forward.dir/gen_embedding_forward_ssd_unweighted_nobag_kernel.cu.o 2025-05-07T19:53:54.0544986Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:53:54.0546854Z 2025-05-07T19:53:54.0548342Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:53:54.0549814Z 2025-05-07T19:53:54.0551256Z 
/__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:53:54.0553117Z 2025-05-07T19:53:54.0554594Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:53:54.0556831Z 2025-05-07T19:53:54.0558382Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:53:54.0560117Z 2025-05-07T19:53:54.0561991Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:53:54.0563884Z 2025-05-07T19:53:54.5780958Z [136/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_forward_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem 
/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++17 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_forward.dir/gen_embedding_forward_dense_weighted_codegen_cuda.cu.o -MF CMakeFiles/fbgemm_gpu_tbe_training_forward.dir/gen_embedding_forward_dense_weighted_codegen_cuda.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_forward_dense_weighted_codegen_cuda.cu -o CMakeFiles/fbgemm_gpu_tbe_training_forward.dir/gen_embedding_forward_dense_weighted_codegen_cuda.cu.o 2025-05-07T19:53:54.5801700Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:53:54.5803625Z 2025-05-07T19:53:54.5805150Z 
/__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:53:54.5806717Z 2025-05-07T19:53:54.5808359Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:53:54.5809996Z 2025-05-07T19:53:54.5811614Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:53:54.5813719Z 2025-05-07T19:53:54.5815282Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:53:54.5816944Z 2025-05-07T19:53:54.5818488Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:53:54.5820234Z 2025-05-07T19:53:54.6978305Z [137/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_forward_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include 
-I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++17 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_forward.dir/gen_embedding_forward_split_unweighted_nobag_kernel.cu.o -MF CMakeFiles/fbgemm_gpu_tbe_training_forward.dir/gen_embedding_forward_split_unweighted_nobag_kernel.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_forward_split_unweighted_nobag_kernel.cu -o CMakeFiles/fbgemm_gpu_tbe_training_forward.dir/gen_embedding_forward_split_unweighted_nobag_kernel.cu.o 
2025-05-07T19:53:54.6999912Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:53:54.7001761Z 2025-05-07T19:53:54.7003646Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:53:54.7005322Z 2025-05-07T19:53:54.7006918Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:53:54.7009016Z 2025-05-07T19:53:54.7010586Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:53:54.7012293Z 2025-05-07T19:53:54.7014085Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:53:54.7015886Z 2025-05-07T19:53:54.7017399Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:53:54.7019260Z 2025-05-07T19:53:57.9929563Z [138/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_forward_EXPORTS 
-I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++17 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_forward.dir/gen_embedding_forward_dense_unweighted_codegen_cuda.cu.o -MF 
CMakeFiles/fbgemm_gpu_tbe_training_forward.dir/gen_embedding_forward_dense_unweighted_codegen_cuda.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_forward_dense_unweighted_codegen_cuda.cu -o CMakeFiles/fbgemm_gpu_tbe_training_forward.dir/gen_embedding_forward_dense_unweighted_codegen_cuda.cu.o 2025-05-07T19:53:57.9949790Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:53:57.9951538Z 2025-05-07T19:53:57.9953153Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:53:57.9954804Z 2025-05-07T19:53:57.9956872Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:53:57.9958542Z 2025-05-07T19:53:57.9960360Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:53:57.9962123Z 2025-05-07T19:53:57.9963607Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:53:57.9965357Z 2025-05-07T19:53:57.9966897Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a 
function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:53:57.9968631Z 2025-05-07T19:54:01.6606573Z [139/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_utils_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++17 -Xcompiler=-fPIC 
-Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -MD -MT CMakeFiles/fbgemm_gpu_tbe_utils.dir/src/split_embeddings_utils/radix_sort_pairs.cu.o -MF CMakeFiles/fbgemm_gpu_tbe_utils.dir/src/split_embeddings_utils/radix_sort_pairs.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/src/split_embeddings_utils/radix_sort_pairs.cu -o CMakeFiles/fbgemm_gpu_tbe_utils.dir/src/split_embeddings_utils/radix_sort_pairs.cu.o 2025-05-07T19:54:01.6624951Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:54:01.6626775Z 2025-05-07T19:54:01.6628421Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:54:01.6630261Z 2025-05-07T19:54:01.6631743Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:54:01.6634151Z 2025-05-07T19:54:01.6636009Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:54:01.6637798Z 2025-05-07T19:54:01.6639417Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:54:01.6641228Z 2025-05-07T19:54:01.6642847Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning 
#20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:54:01.6644713Z 2025-05-07T19:54:02.3584141Z [140/608] : && /github/home/miniconda/envs/build_binary/bin/c++ -fPIC -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/lib -fopenmp=libgomp -stdlib=libstdc++ -I/github/home/miniconda/envs/build_binary/include -O3 -DNDEBUG -Wl,-O2 -Wl,--sort-common -Wl,--as-needed -Wl,-z,relro -Wl,-z,now -Wl,--disable-new-dtags -Wl,--gc-sections -Wl,--allow-shlib-undefined -Wl,-rpath,/github/home/miniconda/envs/build_binary/lib -Wl,-rpath-link,/github/home/miniconda/envs/build_binary/lib -L/github/home/miniconda/envs/build_binary/lib -s -shared -Wl,-soname,fbgemm_gpu_tbe_utils.so -o fbgemm_gpu_tbe_utils.so CMakeFiles/fbgemm_gpu_tbe_utils.dir/src/split_embeddings_utils/split_embeddings_utils_cpu.cpp.o CMakeFiles/fbgemm_gpu_tbe_utils.dir/src/split_embeddings_utils/split_embeddings_utils_meta.cpp.o CMakeFiles/fbgemm_gpu_tbe_utils.dir/src/split_embeddings_utils/split_embeddings_utils.cpp.o CMakeFiles/fbgemm_gpu_tbe_utils.dir/src/split_embeddings_utils/generate_vbe_metadata.cu.o CMakeFiles/fbgemm_gpu_tbe_utils.dir/src/split_embeddings_utils/get_infos_metadata.cu.o CMakeFiles/fbgemm_gpu_tbe_utils.dir/src/split_embeddings_utils/radix_sort_pairs.cu.o CMakeFiles/fbgemm_gpu_tbe_utils.dir/src/split_embeddings_utils/transpose_embedding_input.cu.o -L/lib/intel64 -L/lib/intel64_win -L/lib/win-x64 -L/github/home/miniconda/envs/build_binary/lib/stubs -Wl,-rpath,/lib/intel64:/lib/intel64_win:/lib/win-x64:/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/lib:/github/home/miniconda/envs/build_binary/lib/stubs: /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/lib/libtorch.so /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/lib/libc10.so 
/github/home/miniconda/envs/build_binary/lib/libnvrtc.so /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/lib/libc10_cuda.so /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/lib/libnccl.so.2 /github/home/miniconda/envs/build_binary/lib/stubs/libcuda.so /github/home/miniconda/envs/build_binary/lib/stubs/libnvidia-ml.so -Wl,--no-as-needed,"/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/lib/libtorch_cpu.so" -Wl,--as-needed -Wl,--no-as-needed,"/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/lib/libtorch_cuda.so" -Wl,--as-needed /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/lib/libc10_cuda.so /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/lib/libc10.so /github/home/miniconda/envs/build_binary/lib/libcudart.so -Wl,--no-as-needed,"/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/lib/libtorch.so" -Wl,--as-needed -lcudadevrt -lcudart_static -ldl && : 2025-05-07T19:54:03.0083800Z [141/608] : && /github/home/miniconda/envs/build_binary/bin/c++ -fPIC -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/lib -fopenmp=libgomp -stdlib=libstdc++ -I/github/home/miniconda/envs/build_binary/include -O3 -DNDEBUG -Wl,-O2 -Wl,--sort-common -Wl,--as-needed -Wl,-z,relro -Wl,-z,now -Wl,--disable-new-dtags -Wl,--gc-sections -Wl,--allow-shlib-undefined -Wl,-rpath,/github/home/miniconda/envs/build_binary/lib -Wl,-rpath-link,/github/home/miniconda/envs/build_binary/lib -L/github/home/miniconda/envs/build_binary/lib -s -shared -Wl,-soname,fbgemm_gpu_sparse_async_cumsum.so -o fbgemm_gpu_sparse_async_cumsum.so CMakeFiles/fbgemm_gpu_sparse_async_cumsum.dir/src/sparse_ops/sparse_async_cumsum.cpp.o CMakeFiles/fbgemm_gpu_sparse_async_cumsum.dir/src/sparse_ops/sparse_async_cumsum.cu.o -L/lib/intel64 -L/lib/intel64_win -L/lib/win-x64 
-L/github/home/miniconda/envs/build_binary/lib/stubs -Wl,-rpath,"\$ORIGIN" /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/lib/libc10.so /github/home/miniconda/envs/build_binary/lib/libnvrtc.so /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/lib/libc10_cuda.so /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/lib/libnccl.so.2 /github/home/miniconda/envs/build_binary/lib/stubs/libcuda.so fbgemm_gpu_tbe_utils.so /github/home/miniconda/envs/build_binary/lib/stubs/libnvidia-ml.so -Wl,--no-as-needed,"/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/lib/libtorch.so" -Wl,--as-needed /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/lib/libtorch.so -Wl,--no-as-needed,"/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/lib/libtorch_cpu.so" -Wl,--as-needed -Wl,--no-as-needed,"/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/lib/libtorch_cuda.so" -Wl,--as-needed /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/lib/libc10_cuda.so /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/lib/libc10.so /github/home/miniconda/envs/build_binary/lib/libcudart.so -lcudadevrt -lcudart_static -ldl && : 2025-05-07T19:54:03.1928909Z [142/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_forward_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include 
-I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++17 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_forward.dir/gen_embedding_forward_split_weighted_codegen_cuda.cu.o -MF CMakeFiles/fbgemm_gpu_tbe_training_forward.dir/gen_embedding_forward_split_weighted_codegen_cuda.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_forward_split_weighted_codegen_cuda.cu -o CMakeFiles/fbgemm_gpu_tbe_training_forward.dir/gen_embedding_forward_split_weighted_codegen_cuda.cu.o 2025-05-07T19:54:03.1946705Z 
/__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:54:03.1948722Z 2025-05-07T19:54:03.1950199Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:54:03.1951554Z 2025-05-07T19:54:03.1952931Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:54:03.1954564Z 2025-05-07T19:54:03.1955921Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:54:03.1957433Z 2025-05-07T19:54:03.1958769Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:54:03.1960285Z 2025-05-07T19:54:03.1961739Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:54:03.1963499Z 2025-05-07T19:54:10.3015031Z [143/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_forward_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include 
-I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++17 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_forward.dir/gen_embedding_forward_split_unweighted_codegen_cuda.cu.o -MF CMakeFiles/fbgemm_gpu_tbe_training_forward.dir/gen_embedding_forward_split_unweighted_codegen_cuda.cu.o.d -x cu -c 
/__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_forward_split_unweighted_codegen_cuda.cu -o CMakeFiles/fbgemm_gpu_tbe_training_forward.dir/gen_embedding_forward_split_unweighted_codegen_cuda.cu.o 2025-05-07T19:54:10.3034517Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:54:10.3036198Z 2025-05-07T19:54:10.3037794Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:54:10.3039291Z 2025-05-07T19:54:10.3040681Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:54:10.3042142Z 2025-05-07T19:54:10.3043387Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:54:10.3044824Z 2025-05-07T19:54:10.3046156Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:54:10.3047744Z 2025-05-07T19:54:10.3049013Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:54:10.3050746Z 2025-05-07T19:54:10.8155700Z 
[144/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_forward_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++17 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ 
-D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_forward.dir/gen_embedding_forward_dense_weighted_vbe_codegen_cuda.cu.o -MF CMakeFiles/fbgemm_gpu_tbe_training_forward.dir/gen_embedding_forward_dense_weighted_vbe_codegen_cuda.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_forward_dense_weighted_vbe_codegen_cuda.cu -o CMakeFiles/fbgemm_gpu_tbe_training_forward.dir/gen_embedding_forward_dense_weighted_vbe_codegen_cuda.cu.o 2025-05-07T19:54:10.8176847Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:54:10.8178350Z 2025-05-07T19:54:10.8179771Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:54:10.8181352Z 2025-05-07T19:54:10.8182749Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:54:10.8184306Z 2025-05-07T19:54:10.8185845Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:54:10.8187640Z 2025-05-07T19:54:10.8189229Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:54:10.8191089Z 2025-05-07T19:54:10.8192744Z 
/__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:54:10.8194786Z 2025-05-07T19:54:19.7763312Z [145/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_forward_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -Xcudafe 
--diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++17 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_forward.dir/gen_embedding_forward_split_weighted_gwd_codegen_cuda.cu.o -MF CMakeFiles/fbgemm_gpu_tbe_training_forward.dir/gen_embedding_forward_split_weighted_gwd_codegen_cuda.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_forward_split_weighted_gwd_codegen_cuda.cu -o CMakeFiles/fbgemm_gpu_tbe_training_forward.dir/gen_embedding_forward_split_weighted_gwd_codegen_cuda.cu.o 2025-05-07T19:54:19.7784733Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:54:19.7786579Z 2025-05-07T19:54:19.7788168Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:54:19.7789968Z 2025-05-07T19:54:19.7791538Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:54:19.7793430Z 2025-05-07T19:54:19.7795015Z 
/__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:54:19.7796784Z 2025-05-07T19:54:19.7798345Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:54:19.7799983Z 2025-05-07T19:54:19.7801567Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:54:19.7803642Z 2025-05-07T19:54:21.7867653Z [146/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_forward_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem 
/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++17 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_forward.dir/gen_embedding_forward_ssd_weighted_codegen_cuda.cu.o -MF CMakeFiles/fbgemm_gpu_tbe_training_forward.dir/gen_embedding_forward_ssd_weighted_codegen_cuda.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_forward_ssd_weighted_codegen_cuda.cu -o CMakeFiles/fbgemm_gpu_tbe_training_forward.dir/gen_embedding_forward_ssd_weighted_codegen_cuda.cu.o 2025-05-07T19:54:21.7887726Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:54:21.7889358Z 2025-05-07T19:54:21.7890799Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:54:21.7892531Z 2025-05-07T19:54:21.7894028Z 
/__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:54:21.7895683Z 2025-05-07T19:54:21.7897197Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:54:21.7898852Z 2025-05-07T19:54:21.7900384Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:54:21.7902368Z 2025-05-07T19:54:21.7903864Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:54:21.7905411Z 2025-05-07T19:54:30.8707051Z [147/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_inference_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem 
/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++17 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_tbe_inference.dir/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_nobag_int2_codegen_cuda.cu.o -MF CMakeFiles/fbgemm_gpu_tbe_inference.dir/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_nobag_int2_codegen_cuda.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_nobag_int2_codegen_cuda.cu -o CMakeFiles/fbgemm_gpu_tbe_inference.dir/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_nobag_int2_codegen_cuda.cu.o 2025-05-07T19:54:30.8737334Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") 
that is explicitly defaulted on its first declaration 2025-05-07T19:54:30.8739682Z 2025-05-07T19:54:30.8741744Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:54:30.8744191Z 2025-05-07T19:54:30.8746281Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_nobag_int2_codegen_cuda.cu(220): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:54:30.8748665Z 2025-05-07T19:54:30.8750787Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_nobag_int2_codegen_cuda.cu(237): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:54:30.8753343Z 2025-05-07T19:54:30.8755424Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_nobag_int2_codegen_cuda.cu(247): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:54:30.8757620Z 2025-05-07T19:54:30.8759652Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:54:30.8761959Z 2025-05-07T19:54:30.8763989Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:54:30.8766320Z 2025-05-07T19:54:30.8768289Z 
/__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_nobag_int2_codegen_cuda.cu(220): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:54:30.8770531Z 2025-05-07T19:54:30.8772544Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_nobag_int2_codegen_cuda.cu(237): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:54:30.8774933Z 2025-05-07T19:54:30.8776870Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_nobag_int2_codegen_cuda.cu(247): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:54:30.8779420Z 2025-05-07T19:54:30.8785464Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:54:30.8787924Z 2025-05-07T19:54:30.8790094Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:54:30.8792531Z 2025-05-07T19:54:30.8794616Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_nobag_int2_codegen_cuda.cu(220): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:54:30.8796863Z 2025-05-07T19:54:30.8798858Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_nobag_int2_codegen_cuda.cu(237): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:54:30.8801091Z 
2025-05-07T19:54:30.8803242Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_nobag_int2_codegen_cuda.cu(247): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:54:30.8805504Z 2025-05-07T19:54:32.0880066Z [148/608] /github/home/miniconda/envs/build_binary/bin/c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_pt2_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/lib -fopenmp=libgomp -stdlib=libstdc++ -I/github/home/miniconda/envs/build_binary/include -O3 -DNDEBUG -std=c++17 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -mavx2 -mf16c -mfma -fopenmp -MD -MT 
CMakeFiles/fbgemm_gpu_tbe_training_backward_pt2.dir/gen_embedding_split_adagrad_pt2_autograd.cpp.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward_pt2.dir/gen_embedding_split_adagrad_pt2_autograd.cpp.o.d -o CMakeFiles/fbgemm_gpu_tbe_training_backward_pt2.dir/gen_embedding_split_adagrad_pt2_autograd.cpp.o -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_split_adagrad_pt2_autograd.cpp 2025-05-07T19:54:32.0899441Z clang-16: warning: argument unused during compilation: '-L/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/lib' [-Wunused-command-line-argument] 2025-05-07T19:54:33.4687776Z [149/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_forward_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -isystem 
/github/home/miniconda/envs/build_binary/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++17 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_forward.dir/gen_embedding_forward_ssd_unweighted_codegen_cuda.cu.o -MF CMakeFiles/fbgemm_gpu_tbe_training_forward.dir/gen_embedding_forward_ssd_unweighted_codegen_cuda.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_forward_ssd_unweighted_codegen_cuda.cu -o CMakeFiles/fbgemm_gpu_tbe_training_forward.dir/gen_embedding_forward_ssd_unweighted_codegen_cuda.cu.o 2025-05-07T19:54:33.4710666Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:54:33.4712584Z 2025-05-07T19:54:33.4714356Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:54:33.4716278Z 2025-05-07T19:54:33.4717937Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first 
declaration 2025-05-07T19:54:33.4719821Z 2025-05-07T19:54:33.4721499Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:54:33.4723376Z 2025-05-07T19:54:33.4724991Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:54:33.4726847Z 2025-05-07T19:54:33.4728501Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:54:33.4730769Z 2025-05-07T19:54:35.3133380Z [150/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_forward_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include 
-I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++17 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_forward.dir/gen_embedding_forward_split_weighted_vbe_gwd_codegen_cuda.cu.o -MF CMakeFiles/fbgemm_gpu_tbe_training_forward.dir/gen_embedding_forward_split_weighted_vbe_gwd_codegen_cuda.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_forward_split_weighted_vbe_gwd_codegen_cuda.cu -o CMakeFiles/fbgemm_gpu_tbe_training_forward.dir/gen_embedding_forward_split_weighted_vbe_gwd_codegen_cuda.cu.o 2025-05-07T19:54:35.3156073Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:54:35.3158022Z 2025-05-07T19:54:35.3159518Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a 
function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:54:35.3161378Z 2025-05-07T19:54:35.3162969Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:54:35.3164646Z 2025-05-07T19:54:35.3166316Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:54:35.3168101Z 2025-05-07T19:54:35.3169813Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:54:35.3171739Z 2025-05-07T19:54:35.3173769Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:54:35.3175463Z 2025-05-07T19:54:35.9151677Z [151/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_forward_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include 
-I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++17 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_forward.dir/gen_embedding_forward_split_weighted_vbe_codegen_cuda.cu.o -MF CMakeFiles/fbgemm_gpu_tbe_training_forward.dir/gen_embedding_forward_split_weighted_vbe_codegen_cuda.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_forward_split_weighted_vbe_codegen_cuda.cu -o CMakeFiles/fbgemm_gpu_tbe_training_forward.dir/gen_embedding_forward_split_weighted_vbe_codegen_cuda.cu.o 2025-05-07T19:54:35.9172621Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation 
is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:54:35.9174313Z 2025-05-07T19:54:35.9175790Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:54:35.9177569Z 2025-05-07T19:54:35.9179165Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:54:35.9180545Z 2025-05-07T19:54:35.9181782Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:54:35.9183414Z 2025-05-07T19:54:35.9184864Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:54:35.9187122Z 2025-05-07T19:54:35.9188668Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:54:35.9194007Z 2025-05-07T19:54:36.4540899Z [152/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_forward_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include 
-I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++17 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_forward.dir/gen_embedding_forward_split_weighted_v2_kernel.cu.o -MF CMakeFiles/fbgemm_gpu_tbe_training_forward.dir/gen_embedding_forward_split_weighted_v2_kernel.cu.o.d -x cu -c 
/__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_forward_split_weighted_v2_kernel.cu -o CMakeFiles/fbgemm_gpu_tbe_training_forward.dir/gen_embedding_forward_split_weighted_v2_kernel.cu.o 2025-05-07T19:54:36.4562018Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:54:36.4563900Z 2025-05-07T19:54:36.4565465Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:54:36.4567315Z 2025-05-07T19:54:36.4568562Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_forward_split_weighted_v2_kernel.cu(895): warning #68-D: integer conversion resulted in a change of sign 2025-05-07T19:54:36.4572781Z detected during instantiation of "void split_embedding_codegen_forward_weighted_v2_kernel(const emb_t *, const emb_t *, const cache_t *, const int32_t *, uint32_t, uint32_t, __nv_bool, uint32_t, fbgemm_gpu::FixedDivisor, const index_t *, const float *, const index_t *, const uint32_t *, const int64_t *, const int32_t *, output_t *) [with emb_t=float, cache_t=float, output_t=float, index_t=int32_t, USE_LXU_CACHE=false]" 2025-05-07T19:54:36.4576311Z (955): here 2025-05-07T19:54:36.4576523Z 2025-05-07T19:54:36.4578017Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_forward_split_weighted_v2_kernel.cu(895): warning #68-D: integer conversion resulted in a change of sign 2025-05-07T19:54:36.4582405Z detected during instantiation of "void split_embedding_codegen_forward_weighted_v2_kernel(const emb_t *, const emb_t *, const cache_t *, const int32_t *, uint32_t, uint32_t, __nv_bool, uint32_t, 
fbgemm_gpu::FixedDivisor, const index_t *, const float *, const index_t *, const uint32_t *, const int64_t *, const int32_t *, output_t *) [with emb_t=float, cache_t=c10::Half, output_t=float, index_t=int32_t, USE_LXU_CACHE=false]" 2025-05-07T19:54:36.4585634Z (1007): here 2025-05-07T19:54:36.4585856Z 2025-05-07T19:54:36.4587087Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_forward_split_weighted_v2_kernel.cu(895): warning #68-D: integer conversion resulted in a change of sign 2025-05-07T19:54:36.4591352Z detected during instantiation of "void split_embedding_codegen_forward_weighted_v2_kernel(const emb_t *, const emb_t *, const cache_t *, const int32_t *, uint32_t, uint32_t, __nv_bool, uint32_t, fbgemm_gpu::FixedDivisor, const index_t *, const float *, const index_t *, const uint32_t *, const int64_t *, const int32_t *, output_t *) [with emb_t=c10::Half, cache_t=float, output_t=float, index_t=int32_t, USE_LXU_CACHE=false]" 2025-05-07T19:54:36.4594663Z (1059): here 2025-05-07T19:54:36.4594890Z 2025-05-07T19:54:36.4596105Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_forward_split_weighted_v2_kernel.cu(895): warning #68-D: integer conversion resulted in a change of sign 2025-05-07T19:54:36.4600404Z detected during instantiation of "void split_embedding_codegen_forward_weighted_v2_kernel(const emb_t *, const emb_t *, const cache_t *, const int32_t *, uint32_t, uint32_t, __nv_bool, uint32_t, fbgemm_gpu::FixedDivisor, const index_t *, const float *, const index_t *, const uint32_t *, const int64_t *, const int32_t *, output_t *) [with emb_t=c10::Half, cache_t=c10::Half, output_t=float, index_t=int32_t, USE_LXU_CACHE=false]" 2025-05-07T19:54:36.4603812Z (1111): here 2025-05-07T19:54:36.4604041Z 2025-05-07T19:54:36.4605238Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_forward_split_weighted_v2_kernel.cu(895): warning #68-D: integer conversion resulted in a 
change of sign 2025-05-07T19:54:36.4609473Z detected during instantiation of "void split_embedding_codegen_forward_weighted_v2_kernel(const emb_t *, const emb_t *, const cache_t *, const int32_t *, uint32_t, uint32_t, __nv_bool, uint32_t, fbgemm_gpu::FixedDivisor, const index_t *, const float *, const index_t *, const uint32_t *, const int64_t *, const int32_t *, output_t *) [with emb_t=float, cache_t=float, output_t=float, index_t=int64_t, USE_LXU_CACHE=false]" 2025-05-07T19:54:36.4612628Z (1163): here 2025-05-07T19:54:36.4612834Z 2025-05-07T19:54:36.4614072Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_forward_split_weighted_v2_kernel.cu(895): warning #68-D: integer conversion resulted in a change of sign 2025-05-07T19:54:36.4618316Z detected during instantiation of "void split_embedding_codegen_forward_weighted_v2_kernel(const emb_t *, const emb_t *, const cache_t *, const int32_t *, uint32_t, uint32_t, __nv_bool, uint32_t, fbgemm_gpu::FixedDivisor, const index_t *, const float *, const index_t *, const uint32_t *, const int64_t *, const int32_t *, output_t *) [with emb_t=float, cache_t=c10::Half, output_t=float, index_t=int64_t, USE_LXU_CACHE=false]" 2025-05-07T19:54:36.4621755Z (1215): here 2025-05-07T19:54:36.4621978Z 2025-05-07T19:54:36.4623227Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_forward_split_weighted_v2_kernel.cu(895): warning #68-D: integer conversion resulted in a change of sign 2025-05-07T19:54:36.4627911Z detected during instantiation of "void split_embedding_codegen_forward_weighted_v2_kernel(const emb_t *, const emb_t *, const cache_t *, const int32_t *, uint32_t, uint32_t, __nv_bool, uint32_t, fbgemm_gpu::FixedDivisor, const index_t *, const float *, const index_t *, const uint32_t *, const int64_t *, const int32_t *, output_t *) [with emb_t=c10::Half, cache_t=float, output_t=float, index_t=int64_t, USE_LXU_CACHE=false]" 2025-05-07T19:54:36.4631035Z (1267): here 
2025-05-07T19:54:36.4631238Z 2025-05-07T19:54:36.4632473Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_forward_split_weighted_v2_kernel.cu(895): warning #68-D: integer conversion resulted in a change of sign 2025-05-07T19:54:36.4636949Z detected during instantiation of "void split_embedding_codegen_forward_weighted_v2_kernel(const emb_t *, const emb_t *, const cache_t *, const int32_t *, uint32_t, uint32_t, __nv_bool, uint32_t, fbgemm_gpu::FixedDivisor, const index_t *, const float *, const index_t *, const uint32_t *, const int64_t *, const int32_t *, output_t *) [with emb_t=c10::Half, cache_t=c10::Half, output_t=float, index_t=int64_t, USE_LXU_CACHE=false]" 2025-05-07T19:54:36.4640216Z (1319): here 2025-05-07T19:54:36.4640447Z 2025-05-07T19:54:36.4641747Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_forward_split_weighted_v2_kernel.cu(895): warning #68-D: integer conversion resulted in a change of sign 2025-05-07T19:54:36.4646219Z detected during instantiation of "void split_embedding_codegen_forward_weighted_v2_kernel(const emb_t *, const emb_t *, const cache_t *, const int32_t *, uint32_t, uint32_t, __nv_bool, uint32_t, fbgemm_gpu::FixedDivisor, const index_t *, const float *, const index_t *, const uint32_t *, const int64_t *, const int32_t *, output_t *) [with emb_t=float, cache_t=float, output_t=c10::Half, index_t=int32_t, USE_LXU_CACHE=false]" 2025-05-07T19:54:36.4649524Z (1371): here 2025-05-07T19:54:36.4649740Z 2025-05-07T19:54:36.4650982Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_forward_split_weighted_v2_kernel.cu(895): warning #68-D: integer conversion resulted in a change of sign 2025-05-07T19:54:36.4655318Z detected during instantiation of "void split_embedding_codegen_forward_weighted_v2_kernel(const emb_t *, const emb_t *, const cache_t *, const int32_t *, uint32_t, uint32_t, __nv_bool, uint32_t, fbgemm_gpu::FixedDivisor, const 
index_t *, const float *, const index_t *, const uint32_t *, const int64_t *, const int32_t *, output_t *) [with emb_t=float, cache_t=c10::Half, output_t=c10::Half, index_t=int32_t, USE_LXU_CACHE=false]" 2025-05-07T19:54:36.4658544Z (1423): here 2025-05-07T19:54:36.4658759Z 2025-05-07T19:54:36.4660013Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_forward_split_weighted_v2_kernel.cu(895): warning #68-D: integer conversion resulted in a change of sign 2025-05-07T19:54:36.4664323Z detected during instantiation of "void split_embedding_codegen_forward_weighted_v2_kernel(const emb_t *, const emb_t *, const cache_t *, const int32_t *, uint32_t, uint32_t, __nv_bool, uint32_t, fbgemm_gpu::FixedDivisor, const index_t *, const float *, const index_t *, const uint32_t *, const int64_t *, const int32_t *, output_t *) [with emb_t=c10::Half, cache_t=float, output_t=c10::Half, index_t=int32_t, USE_LXU_CACHE=false]" 2025-05-07T19:54:36.4667743Z (1475): here 2025-05-07T19:54:36.4667956Z 2025-05-07T19:54:36.4669200Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_forward_split_weighted_v2_kernel.cu(895): warning #68-D: integer conversion resulted in a change of sign 2025-05-07T19:54:36.4673839Z detected during instantiation of "void split_embedding_codegen_forward_weighted_v2_kernel(const emb_t *, const emb_t *, const cache_t *, const int32_t *, uint32_t, uint32_t, __nv_bool, uint32_t, fbgemm_gpu::FixedDivisor, const index_t *, const float *, const index_t *, const uint32_t *, const int64_t *, const int32_t *, output_t *) [with emb_t=c10::Half, cache_t=c10::Half, output_t=c10::Half, index_t=int32_t, USE_LXU_CACHE=false]" 2025-05-07T19:54:36.4677160Z (1527): here 2025-05-07T19:54:36.4677387Z 2025-05-07T19:54:36.4678695Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_forward_split_weighted_v2_kernel.cu(895): warning #68-D: integer conversion resulted in a change of sign 
2025-05-07T19:54:36.4683192Z detected during instantiation of "void split_embedding_codegen_forward_weighted_v2_kernel(const emb_t *, const emb_t *, const cache_t *, const int32_t *, uint32_t, uint32_t, __nv_bool, uint32_t, fbgemm_gpu::FixedDivisor, const index_t *, const float *, const index_t *, const uint32_t *, const int64_t *, const int32_t *, output_t *) [with emb_t=float, cache_t=float, output_t=c10::Half, index_t=int64_t, USE_LXU_CACHE=false]" 2025-05-07T19:54:36.4686523Z (1579): here 2025-05-07T19:54:36.4686739Z 2025-05-07T19:54:36.4688014Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_forward_split_weighted_v2_kernel.cu(895): warning #68-D: integer conversion resulted in a change of sign 2025-05-07T19:54:36.4692489Z detected during instantiation of "void split_embedding_codegen_forward_weighted_v2_kernel(const emb_t *, const emb_t *, const cache_t *, const int32_t *, uint32_t, uint32_t, __nv_bool, uint32_t, fbgemm_gpu::FixedDivisor, const index_t *, const float *, const index_t *, const uint32_t *, const int64_t *, const int32_t *, output_t *) [with emb_t=float, cache_t=c10::Half, output_t=c10::Half, index_t=int64_t, USE_LXU_CACHE=false]" 2025-05-07T19:54:36.4695816Z (1631): here 2025-05-07T19:54:36.4696029Z 2025-05-07T19:54:36.4697277Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_forward_split_weighted_v2_kernel.cu(895): warning #68-D: integer conversion resulted in a change of sign 2025-05-07T19:54:36.4701691Z detected during instantiation of "void split_embedding_codegen_forward_weighted_v2_kernel(const emb_t *, const emb_t *, const cache_t *, const int32_t *, uint32_t, uint32_t, __nv_bool, uint32_t, fbgemm_gpu::FixedDivisor, const index_t *, const float *, const index_t *, const uint32_t *, const int64_t *, const int32_t *, output_t *) [with emb_t=c10::Half, cache_t=float, output_t=c10::Half, index_t=int64_t, USE_LXU_CACHE=false]" 2025-05-07T19:54:36.4705145Z (1683): here 
2025-05-07T19:54:36.4705363Z 2025-05-07T19:54:36.4706642Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_forward_split_weighted_v2_kernel.cu(895): warning #68-D: integer conversion resulted in a change of sign 2025-05-07T19:54:36.4711053Z detected during instantiation of "void split_embedding_codegen_forward_weighted_v2_kernel(const emb_t *, const emb_t *, const cache_t *, const int32_t *, uint32_t, uint32_t, __nv_bool, uint32_t, fbgemm_gpu::FixedDivisor, const index_t *, const float *, const index_t *, const uint32_t *, const int64_t *, const int32_t *, output_t *) [with emb_t=c10::Half, cache_t=c10::Half, output_t=c10::Half, index_t=int64_t, USE_LXU_CACHE=false]" 2025-05-07T19:54:36.4714428Z (1735): here 2025-05-07T19:54:36.4714649Z 2025-05-07T19:54:36.4716244Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_forward_split_weighted_v2_kernel.cu(895): warning #68-D: integer conversion resulted in a change of sign 2025-05-07T19:54:36.4720836Z detected during instantiation of "void split_embedding_codegen_forward_weighted_v2_kernel(const emb_t *, const emb_t *, const cache_t *, const int32_t *, uint32_t, uint32_t, __nv_bool, uint32_t, fbgemm_gpu::FixedDivisor, const index_t *, const float *, const index_t *, const uint32_t *, const int64_t *, const int32_t *, output_t *) [with emb_t=float, cache_t=float, output_t=c10::BFloat16, index_t=int32_t, USE_LXU_CACHE=false]" 2025-05-07T19:54:36.4724083Z (1787): here 2025-05-07T19:54:36.4724315Z 2025-05-07T19:54:36.4725559Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_forward_split_weighted_v2_kernel.cu(895): warning #68-D: integer conversion resulted in a change of sign 2025-05-07T19:54:36.4729977Z detected during instantiation of "void split_embedding_codegen_forward_weighted_v2_kernel(const emb_t *, const emb_t *, const cache_t *, const int32_t *, uint32_t, uint32_t, __nv_bool, uint32_t, fbgemm_gpu::FixedDivisor, 
const index_t *, const float *, const index_t *, const uint32_t *, const int64_t *, const int32_t *, output_t *) [with emb_t=float, cache_t=c10::Half, output_t=c10::BFloat16, index_t=int32_t, USE_LXU_CACHE=false]" 2025-05-07T19:54:36.4733253Z (1839): here 2025-05-07T19:54:36.4733467Z 2025-05-07T19:54:36.4734714Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_forward_split_weighted_v2_kernel.cu(895): warning #68-D: integer conversion resulted in a change of sign 2025-05-07T19:54:36.4739151Z detected during instantiation of "void split_embedding_codegen_forward_weighted_v2_kernel(const emb_t *, const emb_t *, const cache_t *, const int32_t *, uint32_t, uint32_t, __nv_bool, uint32_t, fbgemm_gpu::FixedDivisor, const index_t *, const float *, const index_t *, const uint32_t *, const int64_t *, const int32_t *, output_t *) [with emb_t=c10::Half, cache_t=float, output_t=c10::BFloat16, index_t=int32_t, USE_LXU_CACHE=false]" 2025-05-07T19:54:36.4742432Z (1891): here 2025-05-07T19:54:36.4742644Z 2025-05-07T19:54:36.4743922Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_forward_split_weighted_v2_kernel.cu(895): warning #68-D: integer conversion resulted in a change of sign 2025-05-07T19:54:36.4748300Z detected during instantiation of "void split_embedding_codegen_forward_weighted_v2_kernel(const emb_t *, const emb_t *, const cache_t *, const int32_t *, uint32_t, uint32_t, __nv_bool, uint32_t, fbgemm_gpu::FixedDivisor, const index_t *, const float *, const index_t *, const uint32_t *, const int64_t *, const int32_t *, output_t *) [with emb_t=c10::Half, cache_t=c10::Half, output_t=c10::BFloat16, index_t=int32_t, USE_LXU_CACHE=false]" 2025-05-07T19:54:36.4751585Z (1943): here 2025-05-07T19:54:36.4751794Z 2025-05-07T19:54:36.4753168Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_forward_split_weighted_v2_kernel.cu(895): warning #68-D: integer conversion resulted in a 
change of sign 2025-05-07T19:54:36.4757766Z detected during instantiation of "void split_embedding_codegen_forward_weighted_v2_kernel(const emb_t *, const emb_t *, const cache_t *, const int32_t *, uint32_t, uint32_t, __nv_bool, uint32_t, fbgemm_gpu::FixedDivisor, const index_t *, const float *, const index_t *, const uint32_t *, const int64_t *, const int32_t *, output_t *) [with emb_t=float, cache_t=float, output_t=c10::BFloat16, index_t=int64_t, USE_LXU_CACHE=false]" 2025-05-07T19:54:36.4760978Z (1995): here 2025-05-07T19:54:36.4761226Z 2025-05-07T19:54:36.4762428Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_forward_split_weighted_v2_kernel.cu(895): warning #68-D: integer conversion resulted in a change of sign 2025-05-07T19:54:36.4766861Z detected during instantiation of "void split_embedding_codegen_forward_weighted_v2_kernel(const emb_t *, const emb_t *, const cache_t *, const int32_t *, uint32_t, uint32_t, __nv_bool, uint32_t, fbgemm_gpu::FixedDivisor, const index_t *, const float *, const index_t *, const uint32_t *, const int64_t *, const int32_t *, output_t *) [with emb_t=float, cache_t=c10::Half, output_t=c10::BFloat16, index_t=int64_t, USE_LXU_CACHE=false]" 2025-05-07T19:54:36.4769852Z (2047): here 2025-05-07T19:54:36.4770048Z 2025-05-07T19:54:36.4771177Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_forward_split_weighted_v2_kernel.cu(895): warning #68-D: integer conversion resulted in a change of sign 2025-05-07T19:54:36.4775132Z detected during instantiation of "void split_embedding_codegen_forward_weighted_v2_kernel(const emb_t *, const emb_t *, const cache_t *, const int32_t *, uint32_t, uint32_t, __nv_bool, uint32_t, fbgemm_gpu::FixedDivisor, const index_t *, const float *, const index_t *, const uint32_t *, const int64_t *, const int32_t *, output_t *) [with emb_t=c10::Half, cache_t=float, output_t=c10::BFloat16, index_t=int64_t, USE_LXU_CACHE=false]" 
2025-05-07T19:54:36.4778075Z (2099): here 2025-05-07T19:54:36.4778255Z 2025-05-07T19:54:36.4779342Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_forward_split_weighted_v2_kernel.cu(895): warning #68-D: integer conversion resulted in a change of sign 2025-05-07T19:54:36.4783263Z detected during instantiation of "void split_embedding_codegen_forward_weighted_v2_kernel(const emb_t *, const emb_t *, const cache_t *, const int32_t *, uint32_t, uint32_t, __nv_bool, uint32_t, fbgemm_gpu::FixedDivisor, const index_t *, const float *, const index_t *, const uint32_t *, const int64_t *, const int32_t *, output_t *) [with emb_t=c10::Half, cache_t=c10::Half, output_t=c10::BFloat16, index_t=int64_t, USE_LXU_CACHE=false]" 2025-05-07T19:54:36.4786463Z (2151): here 2025-05-07T19:54:36.4786660Z 2025-05-07T19:54:36.4788204Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:54:36.4789861Z 2025-05-07T19:54:36.4791382Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:54:36.4793364Z 2025-05-07T19:54:36.4794437Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_forward_split_weighted_v2_kernel.cu(895): warning #68-D: integer conversion resulted in a change of sign 2025-05-07T19:54:36.4798028Z detected during instantiation of "void split_embedding_codegen_forward_weighted_v2_kernel(const emb_t *, const emb_t *, const cache_t *, const int32_t *, uint32_t, uint32_t, __nv_bool, uint32_t, fbgemm_gpu::FixedDivisor, const index_t *, const float *, const index_t *, const uint32_t *, const int64_t *, const int32_t *, output_t *) [with 
emb_t=float, cache_t=float, output_t=float, index_t=int32_t, USE_LXU_CACHE=false]" 2025-05-07T19:54:36.4800746Z (955): here 2025-05-07T19:54:36.4800916Z 2025-05-07T19:54:36.4801917Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_forward_split_weighted_v2_kernel.cu(895): warning #68-D: integer conversion resulted in a change of sign 2025-05-07T19:54:36.4805707Z detected during instantiation of "void split_embedding_codegen_forward_weighted_v2_kernel(const emb_t *, const emb_t *, const cache_t *, const int32_t *, uint32_t, uint32_t, __nv_bool, uint32_t, fbgemm_gpu::FixedDivisor, const index_t *, const float *, const index_t *, const uint32_t *, const int64_t *, const int32_t *, output_t *) [with emb_t=float, cache_t=c10::Half, output_t=float, index_t=int32_t, USE_LXU_CACHE=false]" 2025-05-07T19:54:36.4812477Z (1007): here 2025-05-07T19:54:36.4812670Z 2025-05-07T19:54:36.4813694Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_forward_split_weighted_v2_kernel.cu(895): warning #68-D: integer conversion resulted in a change of sign 2025-05-07T19:54:36.4817217Z detected during instantiation of "void split_embedding_codegen_forward_weighted_v2_kernel(const emb_t *, const emb_t *, const cache_t *, const int32_t *, uint32_t, uint32_t, __nv_bool, uint32_t, fbgemm_gpu::FixedDivisor, const index_t *, const float *, const index_t *, const uint32_t *, const int64_t *, const int32_t *, output_t *) [with emb_t=c10::Half, cache_t=float, output_t=float, index_t=int32_t, USE_LXU_CACHE=false]" 2025-05-07T19:54:36.4819826Z (1059): here 2025-05-07T19:54:36.4820014Z 2025-05-07T19:54:36.4821133Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_forward_split_weighted_v2_kernel.cu(895): warning #68-D: integer conversion resulted in a change of sign 2025-05-07T19:54:36.4824607Z detected during instantiation of "void split_embedding_codegen_forward_weighted_v2_kernel(const emb_t *, const 
emb_t *, const cache_t *, const int32_t *, uint32_t, uint32_t, __nv_bool, uint32_t, fbgemm_gpu::FixedDivisor, const index_t *, const float *, const index_t *, const uint32_t *, const int64_t *, const int32_t *, output_t *) [with emb_t=c10::Half, cache_t=c10::Half, output_t=float, index_t=int32_t, USE_LXU_CACHE=false]" 2025-05-07T19:54:36.4827289Z (1111): here 2025-05-07T19:54:36.4827510Z 2025-05-07T19:54:36.4828505Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_forward_split_weighted_v2_kernel.cu(895): warning #68-D: integer conversion resulted in a change of sign 2025-05-07T19:54:36.4831977Z detected during instantiation of "void split_embedding_codegen_forward_weighted_v2_kernel(const emb_t *, const emb_t *, const cache_t *, const int32_t *, uint32_t, uint32_t, __nv_bool, uint32_t, fbgemm_gpu::FixedDivisor, const index_t *, const float *, const index_t *, const uint32_t *, const int64_t *, const int32_t *, output_t *) [with emb_t=float, cache_t=float, output_t=float, index_t=int64_t, USE_LXU_CACHE=false]" 2025-05-07T19:54:36.4834802Z (1163): here 2025-05-07T19:54:36.4834979Z 2025-05-07T19:54:36.4835954Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_forward_split_weighted_v2_kernel.cu(895): warning #68-D: integer conversion resulted in a change of sign 2025-05-07T19:54:36.4839418Z detected during instantiation of "void split_embedding_codegen_forward_weighted_v2_kernel(const emb_t *, const emb_t *, const cache_t *, const int32_t *, uint32_t, uint32_t, __nv_bool, uint32_t, fbgemm_gpu::FixedDivisor, const index_t *, const float *, const index_t *, const uint32_t *, const int64_t *, const int32_t *, output_t *) [with emb_t=float, cache_t=c10::Half, output_t=float, index_t=int64_t, USE_LXU_CACHE=false]" 2025-05-07T19:54:36.4841951Z (1215): here 2025-05-07T19:54:36.4842134Z 2025-05-07T19:54:36.4843177Z 
/__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_forward_split_weighted_v2_kernel.cu(895): warning #68-D: integer conversion resulted in a change of sign 2025-05-07T19:54:36.4846756Z detected during instantiation of "void split_embedding_codegen_forward_weighted_v2_kernel(const emb_t *, const emb_t *, const cache_t *, const int32_t *, uint32_t, uint32_t, __nv_bool, uint32_t, fbgemm_gpu::FixedDivisor, const index_t *, const float *, const index_t *, const uint32_t *, const int64_t *, const int32_t *, output_t *) [with emb_t=c10::Half, cache_t=float, output_t=float, index_t=int64_t, USE_LXU_CACHE=false]" 2025-05-07T19:54:36.4849645Z (1267): here 2025-05-07T19:54:36.4849827Z 2025-05-07T19:54:36.4850968Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_forward_split_weighted_v2_kernel.cu(895): warning #68-D: integer conversion resulted in a change of sign 2025-05-07T19:54:36.4854960Z detected during instantiation of "void split_embedding_codegen_forward_weighted_v2_kernel(const emb_t *, const emb_t *, const cache_t *, const int32_t *, uint32_t, uint32_t, __nv_bool, uint32_t, fbgemm_gpu::FixedDivisor, const index_t *, const float *, const index_t *, const uint32_t *, const int64_t *, const int32_t *, output_t *) [with emb_t=c10::Half, cache_t=c10::Half, output_t=float, index_t=int64_t, USE_LXU_CACHE=false]" 2025-05-07T19:54:36.4857789Z (1319): here 2025-05-07T19:54:36.4858011Z 2025-05-07T19:54:36.4859093Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_forward_split_weighted_v2_kernel.cu(895): warning #68-D: integer conversion resulted in a change of sign 2025-05-07T19:54:36.4862986Z detected during instantiation of "void split_embedding_codegen_forward_weighted_v2_kernel(const emb_t *, const emb_t *, const cache_t *, const int32_t *, uint32_t, uint32_t, __nv_bool, uint32_t, fbgemm_gpu::FixedDivisor, const index_t *, const float *, const index_t *, const uint32_t *, 
const int64_t *, const int32_t *, output_t *) [with emb_t=float, cache_t=float, output_t=c10::Half, index_t=int32_t, USE_LXU_CACHE=false]" 2025-05-07T19:54:36.4865910Z (1371): here 2025-05-07T19:54:36.4866104Z 2025-05-07T19:54:36.4867174Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_forward_split_weighted_v2_kernel.cu(895): warning #68-D: integer conversion resulted in a change of sign 2025-05-07T19:54:36.4871151Z detected during instantiation of "void split_embedding_codegen_forward_weighted_v2_kernel(const emb_t *, const emb_t *, const cache_t *, const int32_t *, uint32_t, uint32_t, __nv_bool, uint32_t, fbgemm_gpu::FixedDivisor, const index_t *, const float *, const index_t *, const uint32_t *, const int64_t *, const int32_t *, output_t *) [with emb_t=float, cache_t=c10::Half, output_t=c10::Half, index_t=int32_t, USE_LXU_CACHE=false]" 2025-05-07T19:54:36.4874274Z (1423): here 2025-05-07T19:54:36.4874472Z 2025-05-07T19:54:36.4875590Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_forward_split_weighted_v2_kernel.cu(895): warning #68-D: integer conversion resulted in a change of sign 2025-05-07T19:54:36.4879595Z detected during instantiation of "void split_embedding_codegen_forward_weighted_v2_kernel(const emb_t *, const emb_t *, const cache_t *, const int32_t *, uint32_t, uint32_t, __nv_bool, uint32_t, fbgemm_gpu::FixedDivisor, const index_t *, const float *, const index_t *, const uint32_t *, const int64_t *, const int32_t *, output_t *) [with emb_t=c10::Half, cache_t=float, output_t=c10::Half, index_t=int32_t, USE_LXU_CACHE=false]" 2025-05-07T19:54:36.4882614Z (1475): here 2025-05-07T19:54:36.4882812Z 2025-05-07T19:54:36.4883923Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_forward_split_weighted_v2_kernel.cu(895): warning #68-D: integer conversion resulted in a change of sign 2025-05-07T19:54:36.4887608Z detected during instantiation of "void 
split_embedding_codegen_forward_weighted_v2_kernel(const emb_t *, const emb_t *, const cache_t *, const int32_t *, uint32_t, uint32_t, __nv_bool, uint32_t, fbgemm_gpu::FixedDivisor, const index_t *, const float *, const index_t *, const uint32_t *, const int64_t *, const int32_t *, output_t *) [with emb_t=c10::Half, cache_t=c10::Half, output_t=c10::Half, index_t=int32_t, USE_LXU_CACHE=false]" 2025-05-07T19:54:36.4891098Z (1527): here 2025-05-07T19:54:36.4891348Z 2025-05-07T19:54:36.4892717Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_forward_split_weighted_v2_kernel.cu(895): warning #68-D: integer conversion resulted in a change of sign 2025-05-07T19:54:36.4897148Z detected during instantiation of "void split_embedding_codegen_forward_weighted_v2_kernel(const emb_t *, const emb_t *, const cache_t *, const int32_t *, uint32_t, uint32_t, __nv_bool, uint32_t, fbgemm_gpu::FixedDivisor, const index_t *, const float *, const index_t *, const uint32_t *, const int64_t *, const int32_t *, output_t *) [with emb_t=float, cache_t=float, output_t=c10::Half, index_t=int64_t, USE_LXU_CACHE=false]" 2025-05-07T19:54:36.4900486Z (1579): here 2025-05-07T19:54:36.4900714Z 2025-05-07T19:54:36.4902260Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_forward_split_weighted_v2_kernel.cu(895): warning #68-D: integer conversion resulted in a change of sign 2025-05-07T19:54:36.4906749Z detected during instantiation of "void split_embedding_codegen_forward_weighted_v2_kernel(const emb_t *, const emb_t *, const cache_t *, const int32_t *, uint32_t, uint32_t, __nv_bool, uint32_t, fbgemm_gpu::FixedDivisor, const index_t *, const float *, const index_t *, const uint32_t *, const int64_t *, const int32_t *, output_t *) [with emb_t=float, cache_t=c10::Half, output_t=c10::Half, index_t=int64_t, USE_LXU_CACHE=false]" 2025-05-07T19:54:36.4909967Z (1631): here 2025-05-07T19:54:36.4910170Z 2025-05-07T19:54:36.4911395Z 
/__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_forward_split_weighted_v2_kernel.cu(895): warning #68-D: integer conversion resulted in a change of sign 2025-05-07T19:54:36.4915853Z detected during instantiation of "void split_embedding_codegen_forward_weighted_v2_kernel(const emb_t *, const emb_t *, const cache_t *, const int32_t *, uint32_t, uint32_t, __nv_bool, uint32_t, fbgemm_gpu::FixedDivisor, const index_t *, const float *, const index_t *, const uint32_t *, const int64_t *, const int32_t *, output_t *) [with emb_t=c10::Half, cache_t=float, output_t=c10::Half, index_t=int64_t, USE_LXU_CACHE=false]" 2025-05-07T19:54:36.4938727Z (1683): here 2025-05-07T19:54:36.4939052Z 2025-05-07T19:54:36.4940409Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_forward_split_weighted_v2_kernel.cu(895): warning #68-D: integer conversion resulted in a change of sign 2025-05-07T19:54:36.4945158Z detected during instantiation of "void split_embedding_codegen_forward_weighted_v2_kernel(const emb_t *, const emb_t *, const cache_t *, const int32_t *, uint32_t, uint32_t, __nv_bool, uint32_t, fbgemm_gpu::FixedDivisor, const index_t *, const float *, const index_t *, const uint32_t *, const int64_t *, const int32_t *, output_t *) [with emb_t=c10::Half, cache_t=c10::Half, output_t=c10::Half, index_t=int64_t, USE_LXU_CACHE=false]" 2025-05-07T19:54:36.4948695Z (1735): here 2025-05-07T19:54:36.4948921Z 2025-05-07T19:54:36.4950272Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_forward_split_weighted_v2_kernel.cu(895): warning #68-D: integer conversion resulted in a change of sign 2025-05-07T19:54:36.4954953Z detected during instantiation of "void split_embedding_codegen_forward_weighted_v2_kernel(const emb_t *, const emb_t *, const cache_t *, const int32_t *, uint32_t, uint32_t, __nv_bool, uint32_t, fbgemm_gpu::FixedDivisor, const index_t *, const float *, const index_t *, const 
uint32_t *, const int64_t *, const int32_t *, output_t *) [with emb_t=float, cache_t=float, output_t=c10::BFloat16, index_t=int32_t, USE_LXU_CACHE=false]" 2025-05-07T19:54:36.4958752Z (1787): here 2025-05-07T19:54:36.4958963Z 2025-05-07T19:54:36.4960453Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_forward_split_weighted_v2_kernel.cu(895): warning #68-D: integer conversion resulted in a change of sign 2025-05-07T19:54:36.4965197Z detected during instantiation of "void split_embedding_codegen_forward_weighted_v2_kernel(const emb_t *, const emb_t *, const cache_t *, const int32_t *, uint32_t, uint32_t, __nv_bool, uint32_t, fbgemm_gpu::FixedDivisor, const index_t *, const float *, const index_t *, const uint32_t *, const int64_t *, const int32_t *, output_t *) [with emb_t=float, cache_t=c10::Half, output_t=c10::BFloat16, index_t=int32_t, USE_LXU_CACHE=false]" 2025-05-07T19:54:36.4968686Z (1839): here 2025-05-07T19:54:36.4968923Z 2025-05-07T19:54:36.4970261Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_forward_split_weighted_v2_kernel.cu(895): warning #68-D: integer conversion resulted in a change of sign 2025-05-07T19:54:36.4974858Z detected during instantiation of "void split_embedding_codegen_forward_weighted_v2_kernel(const emb_t *, const emb_t *, const cache_t *, const int32_t *, uint32_t, uint32_t, __nv_bool, uint32_t, fbgemm_gpu::FixedDivisor, const index_t *, const float *, const index_t *, const uint32_t *, const int64_t *, const int32_t *, output_t *) [with emb_t=c10::Half, cache_t=float, output_t=c10::BFloat16, index_t=int32_t, USE_LXU_CACHE=false]" 2025-05-07T19:54:36.4978362Z (1891): here 2025-05-07T19:54:36.4978597Z 2025-05-07T19:54:36.4979925Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_forward_split_weighted_v2_kernel.cu(895): warning #68-D: integer conversion resulted in a change of sign 2025-05-07T19:54:36.4984699Z detected during 
instantiation of "void split_embedding_codegen_forward_weighted_v2_kernel(const emb_t *, const emb_t *, const cache_t *, const int32_t *, uint32_t, uint32_t, __nv_bool, uint32_t, fbgemm_gpu::FixedDivisor, const index_t *, const float *, const index_t *, const uint32_t *, const int64_t *, const int32_t *, output_t *) [with emb_t=c10::Half, cache_t=c10::Half, output_t=c10::BFloat16, index_t=int32_t, USE_LXU_CACHE=false]" 2025-05-07T19:54:36.4988162Z (1943): here 2025-05-07T19:54:36.4988376Z 2025-05-07T19:54:36.4989706Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_forward_split_weighted_v2_kernel.cu(895): warning #68-D: integer conversion resulted in a change of sign 2025-05-07T19:54:36.4994538Z detected during instantiation of "void split_embedding_codegen_forward_weighted_v2_kernel(const emb_t *, const emb_t *, const cache_t *, const int32_t *, uint32_t, uint32_t, __nv_bool, uint32_t, fbgemm_gpu::FixedDivisor, const index_t *, const float *, const index_t *, const uint32_t *, const int64_t *, const int32_t *, output_t *) [with emb_t=float, cache_t=float, output_t=c10::BFloat16, index_t=int64_t, USE_LXU_CACHE=false]" 2025-05-07T19:54:36.4998035Z (1995): here 2025-05-07T19:54:36.4998252Z 2025-05-07T19:54:36.4999585Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_forward_split_weighted_v2_kernel.cu(895): warning #68-D: integer conversion resulted in a change of sign 2025-05-07T19:54:36.5004554Z detected during instantiation of "void split_embedding_codegen_forward_weighted_v2_kernel(const emb_t *, const emb_t *, const cache_t *, const int32_t *, uint32_t, uint32_t, __nv_bool, uint32_t, fbgemm_gpu::FixedDivisor, const index_t *, const float *, const index_t *, const uint32_t *, const int64_t *, const int32_t *, output_t *) [with emb_t=float, cache_t=c10::Half, output_t=c10::BFloat16, index_t=int64_t, USE_LXU_CACHE=false]" 2025-05-07T19:54:36.5008434Z (2047): here 2025-05-07T19:54:36.5008668Z 
2025-05-07T19:54:36.5009919Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_forward_split_weighted_v2_kernel.cu(895): warning #68-D: integer conversion resulted in a change of sign 2025-05-07T19:54:36.5014719Z detected during instantiation of "void split_embedding_codegen_forward_weighted_v2_kernel(const emb_t *, const emb_t *, const cache_t *, const int32_t *, uint32_t, uint32_t, __nv_bool, uint32_t, fbgemm_gpu::FixedDivisor, const index_t *, const float *, const index_t *, const uint32_t *, const int64_t *, const int32_t *, output_t *) [with emb_t=c10::Half, cache_t=float, output_t=c10::BFloat16, index_t=int64_t, USE_LXU_CACHE=false]" 2025-05-07T19:54:36.5018145Z (2099): here 2025-05-07T19:54:36.5018363Z 2025-05-07T19:54:36.5019696Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_forward_split_weighted_v2_kernel.cu(895): warning #68-D: integer conversion resulted in a change of sign 2025-05-07T19:54:36.5024304Z detected during instantiation of "void split_embedding_codegen_forward_weighted_v2_kernel(const emb_t *, const emb_t *, const cache_t *, const int32_t *, uint32_t, uint32_t, __nv_bool, uint32_t, fbgemm_gpu::FixedDivisor, const index_t *, const float *, const index_t *, const uint32_t *, const int64_t *, const int32_t *, output_t *) [with emb_t=c10::Half, cache_t=c10::Half, output_t=c10::BFloat16, index_t=int64_t, USE_LXU_CACHE=false]" 2025-05-07T19:54:36.5027714Z (2151): here 2025-05-07T19:54:36.5027946Z 2025-05-07T19:54:36.5029622Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:54:36.5031550Z 2025-05-07T19:54:36.5033387Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a 
function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:54:36.5035236Z 2025-05-07T19:54:36.5036539Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_forward_split_weighted_v2_kernel.cu(895): warning #68-D: integer conversion resulted in a change of sign 2025-05-07T19:54:36.5041076Z detected during instantiation of "void split_embedding_codegen_forward_weighted_v2_kernel(const emb_t *, const emb_t *, const cache_t *, const int32_t *, uint32_t, uint32_t, bool, uint32_t, fbgemm_gpu::FixedDivisor, const index_t *, const float *, const index_t *, const uint32_t *, const int64_t *, const int32_t *, output_t *) [with emb_t=float, cache_t=float, output_t=float, index_t=int32_t, USE_LXU_CACHE=false]" 2025-05-07T19:54:36.5044449Z (955): here 2025-05-07T19:54:36.5044655Z 2025-05-07T19:54:36.5045979Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_forward_split_weighted_v2_kernel.cu(895): warning #68-D: integer conversion resulted in a change of sign 2025-05-07T19:54:36.5050492Z detected during instantiation of "void split_embedding_codegen_forward_weighted_v2_kernel(const emb_t *, const emb_t *, const cache_t *, const int32_t *, uint32_t, uint32_t, bool, uint32_t, fbgemm_gpu::FixedDivisor, const index_t *, const float *, const index_t *, const uint32_t *, const int64_t *, const int32_t *, output_t *) [with emb_t=float, cache_t=c10::Half, output_t=float, index_t=int32_t, USE_LXU_CACHE=false]" 2025-05-07T19:54:36.5053831Z (1007): here 2025-05-07T19:54:36.5054070Z 2025-05-07T19:54:36.5055367Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_forward_split_weighted_v2_kernel.cu(895): warning #68-D: integer conversion resulted in a change of sign 2025-05-07T19:54:36.5060212Z detected during instantiation of "void split_embedding_codegen_forward_weighted_v2_kernel(const emb_t *, const emb_t *, const cache_t *, const int32_t *, 
uint32_t, uint32_t, bool, uint32_t, fbgemm_gpu::FixedDivisor, const index_t *, const float *, const index_t *, const uint32_t *, const int64_t *, const int32_t *, output_t *) [with emb_t=c10::Half, cache_t=float, output_t=float, index_t=int32_t, USE_LXU_CACHE=false]" 2025-05-07T19:54:36.5063524Z (1059): here 2025-05-07T19:54:36.5063755Z 2025-05-07T19:54:36.5064999Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_forward_split_weighted_v2_kernel.cu(895): warning #68-D: integer conversion resulted in a change of sign 2025-05-07T19:54:36.5069407Z detected during instantiation of "void split_embedding_codegen_forward_weighted_v2_kernel(const emb_t *, const emb_t *, const cache_t *, const int32_t *, uint32_t, uint32_t, bool, uint32_t, fbgemm_gpu::FixedDivisor, const index_t *, const float *, const index_t *, const uint32_t *, const int64_t *, const int32_t *, output_t *) [with emb_t=c10::Half, cache_t=c10::Half, output_t=float, index_t=int32_t, USE_LXU_CACHE=false]" 2025-05-07T19:54:36.5072787Z (1111): here 2025-05-07T19:54:36.5073104Z 2025-05-07T19:54:36.5074387Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_forward_split_weighted_v2_kernel.cu(895): warning #68-D: integer conversion resulted in a change of sign 2025-05-07T19:54:36.5078909Z detected during instantiation of "void split_embedding_codegen_forward_weighted_v2_kernel(const emb_t *, const emb_t *, const cache_t *, const int32_t *, uint32_t, uint32_t, bool, uint32_t, fbgemm_gpu::FixedDivisor, const index_t *, const float *, const index_t *, const uint32_t *, const int64_t *, const int32_t *, output_t *) [with emb_t=float, cache_t=float, output_t=float, index_t=int64_t, USE_LXU_CACHE=false]" 2025-05-07T19:54:36.5082267Z (1163): here 2025-05-07T19:54:36.5082486Z 2025-05-07T19:54:36.5083827Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_forward_split_weighted_v2_kernel.cu(895): warning #68-D: integer 
conversion resulted in a change of sign 2025-05-07T19:54:36.5088366Z detected during instantiation of "void split_embedding_codegen_forward_weighted_v2_kernel(const emb_t *, const emb_t *, const cache_t *, const int32_t *, uint32_t, uint32_t, bool, uint32_t, fbgemm_gpu::FixedDivisor, const index_t *, const float *, const index_t *, const uint32_t *, const int64_t *, const int32_t *, output_t *) [with emb_t=float, cache_t=c10::Half, output_t=float, index_t=int64_t, USE_LXU_CACHE=false]" 2025-05-07T19:54:36.5091716Z (1215): here 2025-05-07T19:54:36.5091955Z 2025-05-07T19:54:36.5093281Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_forward_split_weighted_v2_kernel.cu(895): warning #68-D: integer conversion resulted in a change of sign 2025-05-07T19:54:36.5097802Z detected during instantiation of "void split_embedding_codegen_forward_weighted_v2_kernel(const emb_t *, const emb_t *, const cache_t *, const int32_t *, uint32_t, uint32_t, bool, uint32_t, fbgemm_gpu::FixedDivisor, const index_t *, const float *, const index_t *, const uint32_t *, const int64_t *, const int32_t *, output_t *) [with emb_t=c10::Half, cache_t=float, output_t=float, index_t=int64_t, USE_LXU_CACHE=false]" 2025-05-07T19:54:36.5101152Z (1267): here 2025-05-07T19:54:36.5101385Z 2025-05-07T19:54:36.5102971Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_forward_split_weighted_v2_kernel.cu(895): warning #68-D: integer conversion resulted in a change of sign 2025-05-07T19:54:36.5107980Z detected during instantiation of "void split_embedding_codegen_forward_weighted_v2_kernel(const emb_t *, const emb_t *, const cache_t *, const int32_t *, uint32_t, uint32_t, bool, uint32_t, fbgemm_gpu::FixedDivisor, const index_t *, const float *, const index_t *, const uint32_t *, const int64_t *, const int32_t *, output_t *) [with emb_t=c10::Half, cache_t=c10::Half, output_t=float, index_t=int64_t, USE_LXU_CACHE=false]" 
2025-05-07T19:54:36.5111303Z (1319): here 2025-05-07T19:54:36.5111530Z 2025-05-07T19:54:36.5112788Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_forward_split_weighted_v2_kernel.cu(895): warning #68-D: integer conversion resulted in a change of sign 2025-05-07T19:54:36.5117444Z detected during instantiation of "void split_embedding_codegen_forward_weighted_v2_kernel(const emb_t *, const emb_t *, const cache_t *, const int32_t *, uint32_t, uint32_t, bool, uint32_t, fbgemm_gpu::FixedDivisor, const index_t *, const float *, const index_t *, const uint32_t *, const int64_t *, const int32_t *, output_t *) [with emb_t=float, cache_t=float, output_t=c10::Half, index_t=int32_t, USE_LXU_CACHE=false]" 2025-05-07T19:54:36.5120862Z (1371): here 2025-05-07T19:54:36.5121081Z 2025-05-07T19:54:36.5122395Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_forward_split_weighted_v2_kernel.cu(895): warning #68-D: integer conversion resulted in a change of sign 2025-05-07T19:54:36.5127067Z detected during instantiation of "void split_embedding_codegen_forward_weighted_v2_kernel(const emb_t *, const emb_t *, const cache_t *, const int32_t *, uint32_t, uint32_t, bool, uint32_t, fbgemm_gpu::FixedDivisor, const index_t *, const float *, const index_t *, const uint32_t *, const int64_t *, const int32_t *, output_t *) [with emb_t=float, cache_t=c10::Half, output_t=c10::Half, index_t=int32_t, USE_LXU_CACHE=false]" 2025-05-07T19:54:36.5130482Z (1423): here 2025-05-07T19:54:36.5130705Z 2025-05-07T19:54:36.5132035Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_forward_split_weighted_v2_kernel.cu(895): warning #68-D: integer conversion resulted in a change of sign 2025-05-07T19:54:36.5136463Z detected during instantiation of "void split_embedding_codegen_forward_weighted_v2_kernel(const emb_t *, const emb_t *, const cache_t *, const int32_t *, uint32_t, uint32_t, bool, uint32_t, 
fbgemm_gpu::FixedDivisor, const index_t *, const float *, const index_t *, const uint32_t *, const int64_t *, const int32_t *, output_t *) [with emb_t=c10::Half, cache_t=float, output_t=c10::Half, index_t=int32_t, USE_LXU_CACHE=false]" 2025-05-07T19:54:36.5139627Z (1475): here 2025-05-07T19:54:36.5139865Z 2025-05-07T19:54:36.5141037Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_forward_split_weighted_v2_kernel.cu(895): warning #68-D: integer conversion resulted in a change of sign 2025-05-07T19:54:36.5145166Z detected during instantiation of "void split_embedding_codegen_forward_weighted_v2_kernel(const emb_t *, const emb_t *, const cache_t *, const int32_t *, uint32_t, uint32_t, bool, uint32_t, fbgemm_gpu::FixedDivisor, const index_t *, const float *, const index_t *, const uint32_t *, const int64_t *, const int32_t *, output_t *) [with emb_t=c10::Half, cache_t=c10::Half, output_t=c10::Half, index_t=int32_t, USE_LXU_CACHE=false]" 2025-05-07T19:54:36.5148092Z (1527): here 2025-05-07T19:54:36.5148302Z 2025-05-07T19:54:36.5149423Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_forward_split_weighted_v2_kernel.cu(895): warning #68-D: integer conversion resulted in a change of sign 2025-05-07T19:54:36.5153891Z detected during instantiation of "void split_embedding_codegen_forward_weighted_v2_kernel(const emb_t *, const emb_t *, const cache_t *, const int32_t *, uint32_t, uint32_t, bool, uint32_t, fbgemm_gpu::FixedDivisor, const index_t *, const float *, const index_t *, const uint32_t *, const int64_t *, const int32_t *, output_t *) [with emb_t=float, cache_t=float, output_t=c10::Half, index_t=int64_t, USE_LXU_CACHE=false]" 2025-05-07T19:54:36.5157041Z (1579): here 2025-05-07T19:54:36.5157247Z 2025-05-07T19:54:36.5158423Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_forward_split_weighted_v2_kernel.cu(895): warning #68-D: integer conversion resulted in a 
change of sign 2025-05-07T19:54:36.5162279Z detected during instantiation of "void split_embedding_codegen_forward_weighted_v2_kernel(const emb_t *, const emb_t *, const cache_t *, const int32_t *, uint32_t, uint32_t, bool, uint32_t, fbgemm_gpu::FixedDivisor, const index_t *, const float *, const index_t *, const uint32_t *, const int64_t *, const int32_t *, output_t *) [with emb_t=float, cache_t=c10::Half, output_t=c10::Half, index_t=int64_t, USE_LXU_CACHE=false]" 2025-05-07T19:54:36.5165013Z (1631): here 2025-05-07T19:54:36.5165195Z 2025-05-07T19:54:36.5166232Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_forward_split_weighted_v2_kernel.cu(895): warning #68-D: integer conversion resulted in a change of sign 2025-05-07T19:54:36.5170438Z detected during instantiation of "void split_embedding_codegen_forward_weighted_v2_kernel(const emb_t *, const emb_t *, const cache_t *, const int32_t *, uint32_t, uint32_t, bool, uint32_t, fbgemm_gpu::FixedDivisor, const index_t *, const float *, const index_t *, const uint32_t *, const int64_t *, const int32_t *, output_t *) [with emb_t=c10::Half, cache_t=float, output_t=c10::Half, index_t=int64_t, USE_LXU_CACHE=false]" 2025-05-07T19:54:36.5173734Z (1683): here 2025-05-07T19:54:36.5173952Z 2025-05-07T19:54:36.5175234Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_forward_split_weighted_v2_kernel.cu(895): warning #68-D: integer conversion resulted in a change of sign 2025-05-07T19:54:36.5179690Z detected during instantiation of "void split_embedding_codegen_forward_weighted_v2_kernel(const emb_t *, const emb_t *, const cache_t *, const int32_t *, uint32_t, uint32_t, bool, uint32_t, fbgemm_gpu::FixedDivisor, const index_t *, const float *, const index_t *, const uint32_t *, const int64_t *, const int32_t *, output_t *) [with emb_t=c10::Half, cache_t=c10::Half, output_t=c10::Half, index_t=int64_t, USE_LXU_CACHE=false]" 2025-05-07T19:54:36.5183028Z (1735): 
here 2025-05-07T19:54:36.5183259Z 2025-05-07T19:54:36.5184534Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_forward_split_weighted_v2_kernel.cu(895): warning #68-D: integer conversion resulted in a change of sign 2025-05-07T19:54:36.5188902Z detected during instantiation of "void split_embedding_codegen_forward_weighted_v2_kernel(const emb_t *, const emb_t *, const cache_t *, const int32_t *, uint32_t, uint32_t, bool, uint32_t, fbgemm_gpu::FixedDivisor, const index_t *, const float *, const index_t *, const uint32_t *, const int64_t *, const int32_t *, output_t *) [with emb_t=float, cache_t=float, output_t=c10::BFloat16, index_t=int32_t, USE_LXU_CACHE=false]" 2025-05-07T19:54:36.5192317Z (1787): here 2025-05-07T19:54:36.5192529Z 2025-05-07T19:54:36.5193822Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_forward_split_weighted_v2_kernel.cu(895): warning #68-D: integer conversion resulted in a change of sign 2025-05-07T19:54:36.5198283Z detected during instantiation of "void split_embedding_codegen_forward_weighted_v2_kernel(const emb_t *, const emb_t *, const cache_t *, const int32_t *, uint32_t, uint32_t, bool, uint32_t, fbgemm_gpu::FixedDivisor, const index_t *, const float *, const index_t *, const uint32_t *, const int64_t *, const int32_t *, output_t *) [with emb_t=float, cache_t=c10::Half, output_t=c10::BFloat16, index_t=int32_t, USE_LXU_CACHE=false]" 2025-05-07T19:54:36.5202228Z (1839): here 2025-05-07T19:54:36.5202455Z 2025-05-07T19:54:36.5204004Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_forward_split_weighted_v2_kernel.cu(895): warning #68-D: integer conversion resulted in a change of sign 2025-05-07T19:54:36.5208639Z detected during instantiation of "void split_embedding_codegen_forward_weighted_v2_kernel(const emb_t *, const emb_t *, const cache_t *, const int32_t *, uint32_t, uint32_t, bool, uint32_t, fbgemm_gpu::FixedDivisor, const index_t 
*, const float *, const index_t *, const uint32_t *, const int64_t *, const int32_t *, output_t *) [with emb_t=c10::Half, cache_t=float, output_t=c10::BFloat16, index_t=int32_t, USE_LXU_CACHE=false]" 2025-05-07T19:54:36.5212087Z (1891): here 2025-05-07T19:54:36.5212311Z 2025-05-07T19:54:36.5213656Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_forward_split_weighted_v2_kernel.cu(895): warning #68-D: integer conversion resulted in a change of sign 2025-05-07T19:54:36.5218378Z detected during instantiation of "void split_embedding_codegen_forward_weighted_v2_kernel(const emb_t *, const emb_t *, const cache_t *, const int32_t *, uint32_t, uint32_t, bool, uint32_t, fbgemm_gpu::FixedDivisor, const index_t *, const float *, const index_t *, const uint32_t *, const int64_t *, const int32_t *, output_t *) [with emb_t=c10::Half, cache_t=c10::Half, output_t=c10::BFloat16, index_t=int32_t, USE_LXU_CACHE=false]" 2025-05-07T19:54:36.5221843Z (1943): here 2025-05-07T19:54:36.5222076Z 2025-05-07T19:54:36.5223417Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_forward_split_weighted_v2_kernel.cu(895): warning #68-D: integer conversion resulted in a change of sign 2025-05-07T19:54:36.5228069Z detected during instantiation of "void split_embedding_codegen_forward_weighted_v2_kernel(const emb_t *, const emb_t *, const cache_t *, const int32_t *, uint32_t, uint32_t, bool, uint32_t, fbgemm_gpu::FixedDivisor, const index_t *, const float *, const index_t *, const uint32_t *, const int64_t *, const int32_t *, output_t *) [with emb_t=float, cache_t=float, output_t=c10::BFloat16, index_t=int64_t, USE_LXU_CACHE=false]" 2025-05-07T19:54:36.5231511Z (1995): here 2025-05-07T19:54:36.5231727Z 2025-05-07T19:54:36.5233173Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_forward_split_weighted_v2_kernel.cu(895): warning #68-D: integer conversion resulted in a change of sign 
2025-05-07T19:54:36.5237737Z detected during instantiation of "void split_embedding_codegen_forward_weighted_v2_kernel(const emb_t *, const emb_t *, const cache_t *, const int32_t *, uint32_t, uint32_t, bool, uint32_t, fbgemm_gpu::FixedDivisor, const index_t *, const float *, const index_t *, const uint32_t *, const int64_t *, const int32_t *, output_t *) [with emb_t=float, cache_t=c10::Half, output_t=c10::BFloat16, index_t=int64_t, USE_LXU_CACHE=false]" 2025-05-07T19:54:36.5241072Z (2047): here 2025-05-07T19:54:36.5241296Z 2025-05-07T19:54:36.5242619Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_forward_split_weighted_v2_kernel.cu(895): warning #68-D: integer conversion resulted in a change of sign 2025-05-07T19:54:36.5247305Z detected during instantiation of "void split_embedding_codegen_forward_weighted_v2_kernel(const emb_t *, const emb_t *, const cache_t *, const int32_t *, uint32_t, uint32_t, bool, uint32_t, fbgemm_gpu::FixedDivisor, const index_t *, const float *, const index_t *, const uint32_t *, const int64_t *, const int32_t *, output_t *) [with emb_t=c10::Half, cache_t=float, output_t=c10::BFloat16, index_t=int64_t, USE_LXU_CACHE=false]" 2025-05-07T19:54:36.5251119Z (2099): here 2025-05-07T19:54:36.5251350Z 2025-05-07T19:54:36.5252798Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_forward_split_weighted_v2_kernel.cu(895): warning #68-D: integer conversion resulted in a change of sign 2025-05-07T19:54:36.5257345Z detected during instantiation of "void split_embedding_codegen_forward_weighted_v2_kernel(const emb_t *, const emb_t *, const cache_t *, const int32_t *, uint32_t, uint32_t, bool, uint32_t, fbgemm_gpu::FixedDivisor, const index_t *, const float *, const index_t *, const uint32_t *, const int64_t *, const int32_t *, output_t *) [with emb_t=c10::Half, cache_t=c10::Half, output_t=c10::BFloat16, index_t=int64_t, USE_LXU_CACHE=false]" 2025-05-07T19:54:36.5260761Z (2151): here 
2025-05-07T19:54:36.5261003Z 2025-05-07T19:54:37.3224282Z [153/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_inference_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++17 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr 
-D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_tbe_inference.dir/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_nobag_fp16_codegen_cuda.cu.o -MF CMakeFiles/fbgemm_gpu_tbe_inference.dir/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_nobag_fp16_codegen_cuda.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_nobag_fp16_codegen_cuda.cu -o CMakeFiles/fbgemm_gpu_tbe_inference.dir/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_nobag_fp16_codegen_cuda.cu.o 2025-05-07T19:54:37.3245887Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:54:37.3247714Z 2025-05-07T19:54:37.3249326Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:54:37.3251655Z 2025-05-07T19:54:37.3253386Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_nobag_fp16_codegen_cuda.cu(216): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:54:37.3255123Z 2025-05-07T19:54:37.3256655Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_nobag_fp16_codegen_cuda.cu(233): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:54:37.3258385Z 2025-05-07T19:54:37.3259892Z 
/__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_nobag_fp16_codegen_cuda.cu(243): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:54:37.3261586Z 2025-05-07T19:54:37.3263059Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:54:37.3264758Z 2025-05-07T19:54:37.3266430Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:54:37.3267968Z 2025-05-07T19:54:37.3269235Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_nobag_fp16_codegen_cuda.cu(216): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:54:37.3270660Z 2025-05-07T19:54:37.3271995Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_nobag_fp16_codegen_cuda.cu(233): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:54:37.3273883Z 2025-05-07T19:54:37.3275464Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_nobag_fp16_codegen_cuda.cu(243): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:54:37.3277221Z 2025-05-07T19:54:37.3278795Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:54:37.3280560Z 
2025-05-07T19:54:37.3282141Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:54:37.3283931Z 2025-05-07T19:54:37.3285461Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_nobag_fp16_codegen_cuda.cu(216): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:54:37.3287182Z 2025-05-07T19:54:37.3288717Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_nobag_fp16_codegen_cuda.cu(233): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:54:37.3290407Z 2025-05-07T19:54:37.3291752Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_nobag_fp16_codegen_cuda.cu(243): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:54:37.3293202Z 2025-05-07T19:54:39.3175809Z [154/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_forward_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem 
/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++17 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_forward.dir/gen_embedding_forward_dense_unweighted_vbe_codegen_cuda.cu.o -MF CMakeFiles/fbgemm_gpu_tbe_training_forward.dir/gen_embedding_forward_dense_unweighted_vbe_codegen_cuda.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_forward_dense_unweighted_vbe_codegen_cuda.cu -o CMakeFiles/fbgemm_gpu_tbe_training_forward.dir/gen_embedding_forward_dense_unweighted_vbe_codegen_cuda.cu.o 2025-05-07T19:54:39.3198261Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:54:39.3199939Z 
2025-05-07T19:54:39.3201336Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:54:39.3203513Z 2025-05-07T19:54:39.3205192Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:54:39.3207084Z 2025-05-07T19:54:39.3208779Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:54:39.3210645Z 2025-05-07T19:54:39.3212318Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:54:39.3214090Z 2025-05-07T19:54:39.3215591Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:54:39.3217866Z 2025-05-07T19:54:39.4256836Z [155/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_inference_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include 
-I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++17 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_tbe_inference.dir/gen_embedding_forward_quantized_split_nbit_kernel_weighted_fp16_codegen_cuda.cu.o -MF CMakeFiles/fbgemm_gpu_tbe_inference.dir/gen_embedding_forward_quantized_split_nbit_kernel_weighted_fp16_codegen_cuda.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_weighted_fp16_codegen_cuda.cu -o 
CMakeFiles/fbgemm_gpu_tbe_inference.dir/gen_embedding_forward_quantized_split_nbit_kernel_weighted_fp16_codegen_cuda.cu.o 2025-05-07T19:54:39.4279102Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:54:39.4280901Z 2025-05-07T19:54:39.4282401Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:54:39.4284375Z 2025-05-07T19:54:39.4285978Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_weighted_fp16_codegen_cuda.cu(266): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:54:39.4287808Z 2025-05-07T19:54:39.4289345Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_weighted_fp16_codegen_cuda.cu(271): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:54:39.4290879Z 2025-05-07T19:54:39.4292297Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_weighted_fp16_codegen_cuda.cu(288): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:54:39.4294395Z 2025-05-07T19:54:39.4295983Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_weighted_fp16_codegen_cuda.cu(300): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:54:39.4297801Z 2025-05-07T19:54:39.4299650Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a 
function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:54:39.4301520Z 2025-05-07T19:54:39.4303166Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:54:39.4304657Z 2025-05-07T19:54:39.4306055Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_weighted_fp16_codegen_cuda.cu(266): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:54:39.4307656Z 2025-05-07T19:54:39.4308976Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_weighted_fp16_codegen_cuda.cu(271): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:54:39.4310543Z 2025-05-07T19:54:39.4311881Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_weighted_fp16_codegen_cuda.cu(288): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:54:39.4313618Z 2025-05-07T19:54:39.4315053Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_weighted_fp16_codegen_cuda.cu(300): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:54:39.4316730Z 2025-05-07T19:54:39.4318355Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:54:39.4320298Z 2025-05-07T19:54:39.4322006Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a 
function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:54:39.4323854Z 2025-05-07T19:54:39.4325246Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_weighted_fp16_codegen_cuda.cu(266): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:54:39.4326830Z 2025-05-07T19:54:39.4328277Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_weighted_fp16_codegen_cuda.cu(271): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:54:39.4329884Z 2025-05-07T19:54:39.4331369Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_weighted_fp16_codegen_cuda.cu(288): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:54:39.4333031Z 2025-05-07T19:54:39.4334518Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_weighted_fp16_codegen_cuda.cu(300): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:54:39.4336288Z 2025-05-07T19:54:39.6755565Z [156/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_inference_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include 
-I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++17 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_tbe_inference.dir/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_fp16_codegen_cuda.cu.o -MF CMakeFiles/fbgemm_gpu_tbe_inference.dir/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_fp16_codegen_cuda.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_fp16_codegen_cuda.cu -o CMakeFiles/fbgemm_gpu_tbe_inference.dir/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_fp16_codegen_cuda.cu.o 2025-05-07T19:54:39.6778242Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: 
__host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:54:39.6780006Z 2025-05-07T19:54:39.6781417Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:54:39.6783269Z 2025-05-07T19:54:39.6784758Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_fp16_codegen_cuda.cu(250): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:54:39.6786594Z 2025-05-07T19:54:39.6788261Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_fp16_codegen_cuda.cu(255): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:54:39.6790081Z 2025-05-07T19:54:39.6791670Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_fp16_codegen_cuda.cu(272): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:54:39.6793446Z 2025-05-07T19:54:39.6794868Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_fp16_codegen_cuda.cu(284): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:54:39.6796595Z 2025-05-07T19:54:39.6798274Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:54:39.6800483Z 2025-05-07T19:54:39.6802460Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: 
__device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:54:39.6804367Z 2025-05-07T19:54:39.6806004Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_fp16_codegen_cuda.cu(250): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:54:39.6807477Z 2025-05-07T19:54:39.6808803Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_fp16_codegen_cuda.cu(255): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:54:39.6810421Z 2025-05-07T19:54:39.6811812Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_fp16_codegen_cuda.cu(272): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:54:39.6813405Z 2025-05-07T19:54:39.6814858Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_fp16_codegen_cuda.cu(284): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:54:39.6816432Z 2025-05-07T19:54:39.6817929Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:54:39.6819766Z 2025-05-07T19:54:39.6821425Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:54:39.6823367Z 2025-05-07T19:54:39.6824968Z 
/__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_fp16_codegen_cuda.cu(250): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:54:39.6826660Z 2025-05-07T19:54:39.6828105Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_fp16_codegen_cuda.cu(255): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:54:39.6829791Z 2025-05-07T19:54:39.6831137Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_fp16_codegen_cuda.cu(272): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:54:39.6832974Z 2025-05-07T19:54:39.6834529Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_fp16_codegen_cuda.cu(284): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:54:39.6836189Z 2025-05-07T19:54:41.8771053Z [157/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_forward_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem 
/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++17 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_forward.dir/gen_embedding_forward_ssd_weighted_vbe_codegen_cuda.cu.o -MF CMakeFiles/fbgemm_gpu_tbe_training_forward.dir/gen_embedding_forward_ssd_weighted_vbe_codegen_cuda.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_forward_ssd_weighted_vbe_codegen_cuda.cu -o CMakeFiles/fbgemm_gpu_tbe_training_forward.dir/gen_embedding_forward_ssd_weighted_vbe_codegen_cuda.cu.o 2025-05-07T19:54:41.8791222Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:54:41.8792776Z 2025-05-07T19:54:41.8794396Z 
/__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:54:41.8796069Z 2025-05-07T19:54:41.8797523Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:54:41.8799244Z 2025-05-07T19:54:41.8800830Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:54:41.8802767Z 2025-05-07T19:54:41.8804324Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:54:41.8806016Z 2025-05-07T19:54:41.8807535Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:54:41.8809270Z 2025-05-07T19:54:43.0754856Z [158/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_forward_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include 
-I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++17 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_forward.dir/gen_embedding_forward_dense_unweighted_kernel.cu.o -MF CMakeFiles/fbgemm_gpu_tbe_training_forward.dir/gen_embedding_forward_dense_unweighted_kernel.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_forward_dense_unweighted_kernel.cu -o CMakeFiles/fbgemm_gpu_tbe_training_forward.dir/gen_embedding_forward_dense_unweighted_kernel.cu.o 2025-05-07T19:54:43.0777180Z 
/__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:54:43.0778826Z 2025-05-07T19:54:43.0780197Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:54:43.0781864Z 2025-05-07T19:54:43.0783492Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:54:43.0785192Z 2025-05-07T19:54:43.0786636Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:54:43.0788222Z 2025-05-07T19:54:43.0789781Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:54:43.0791360Z 2025-05-07T19:54:43.0792980Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:54:43.0794707Z 2025-05-07T19:54:44.7947889Z [159/608] /github/home/miniconda/envs/build_binary/bin/c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_pt2_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu 
-I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/lib -fopenmp=libgomp -stdlib=libstdc++ -I/github/home/miniconda/envs/build_binary/include -O3 -DNDEBUG -std=c++17 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -mavx2 -mf16c -mfma -fopenmp -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward_pt2.dir/gen_embedding_split_sgd_pt2_autograd.cpp.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward_pt2.dir/gen_embedding_split_sgd_pt2_autograd.cpp.o.d -o CMakeFiles/fbgemm_gpu_tbe_training_backward_pt2.dir/gen_embedding_split_sgd_pt2_autograd.cpp.o -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_split_sgd_pt2_autograd.cpp 2025-05-07T19:54:44.7965261Z clang-16: warning: argument unused during compilation: '-L/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/lib' [-Wunused-command-line-argument] 
2025-05-07T19:54:45.2719660Z [160/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_forward_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++17 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ 
-D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_forward.dir/gen_embedding_forward_split_unweighted_gwd_codegen_cuda.cu.o -MF CMakeFiles/fbgemm_gpu_tbe_training_forward.dir/gen_embedding_forward_split_unweighted_gwd_codegen_cuda.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_forward_split_unweighted_gwd_codegen_cuda.cu -o CMakeFiles/fbgemm_gpu_tbe_training_forward.dir/gen_embedding_forward_split_unweighted_gwd_codegen_cuda.cu.o 2025-05-07T19:54:45.2742206Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:54:45.2744543Z 2025-05-07T19:54:45.2746262Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:54:45.2748219Z 2025-05-07T19:54:45.2749912Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:54:45.2751831Z 2025-05-07T19:54:45.2753716Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:54:45.2755646Z 2025-05-07T19:54:45.2757353Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:54:45.2759244Z 
2025-05-07T19:54:45.2760948Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:54:45.2762898Z 2025-05-07T19:54:45.5006051Z [161/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_forward_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -Xcudafe 
--diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++17 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_forward.dir/gen_embedding_forward_dense_weighted_kernel.cu.o -MF CMakeFiles/fbgemm_gpu_tbe_training_forward.dir/gen_embedding_forward_dense_weighted_kernel.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_forward_dense_weighted_kernel.cu -o CMakeFiles/fbgemm_gpu_tbe_training_forward.dir/gen_embedding_forward_dense_weighted_kernel.cu.o 2025-05-07T19:54:45.5026795Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:54:45.5028618Z 2025-05-07T19:54:45.5030034Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:54:45.5031616Z 2025-05-07T19:54:45.5033211Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:54:45.5034796Z 2025-05-07T19:54:45.5036233Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ 
annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:54:45.5037900Z 2025-05-07T19:54:45.5039428Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:54:45.5041255Z 2025-05-07T19:54:45.5042841Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:54:45.5044494Z 2025-05-07T19:54:50.1541910Z [162/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_forward_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem 
/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++17 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_forward.dir/gen_embedding_forward_dense_unweighted_nobag_kernel_small.cu.o -MF CMakeFiles/fbgemm_gpu_tbe_training_forward.dir/gen_embedding_forward_dense_unweighted_nobag_kernel_small.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_forward_dense_unweighted_nobag_kernel_small.cu -o CMakeFiles/fbgemm_gpu_tbe_training_forward.dir/gen_embedding_forward_dense_unweighted_nobag_kernel_small.cu.o 2025-05-07T19:54:50.1564619Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:54:50.1566480Z 2025-05-07T19:54:50.1568108Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:54:50.1570046Z 2025-05-07T19:54:50.1571766Z 
/__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:54:50.1573593Z 2025-05-07T19:54:50.1575218Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:54:50.1577136Z 2025-05-07T19:54:50.1578835Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:54:50.1580775Z 2025-05-07T19:54:50.1582489Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:54:50.1584436Z 2025-05-07T19:54:50.3350515Z [163/608] /github/home/miniconda/envs/build_binary/bin/c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_pt2_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem 
/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/lib -fopenmp=libgomp -stdlib=libstdc++ -I/github/home/miniconda/envs/build_binary/include -O3 -DNDEBUG -std=c++17 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -mavx2 -mf16c -mfma -fopenmp -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward_pt2.dir/gen_embedding_split_approx_sgd_pt2_autograd.cpp.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward_pt2.dir/gen_embedding_split_approx_sgd_pt2_autograd.cpp.o.d -o CMakeFiles/fbgemm_gpu_tbe_training_backward_pt2.dir/gen_embedding_split_approx_sgd_pt2_autograd.cpp.o -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_split_approx_sgd_pt2_autograd.cpp 2025-05-07T19:54:50.3370708Z clang-16: warning: argument unused during compilation: '-L/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/lib' [-Wunused-command-line-argument] 2025-05-07T19:54:50.5667981Z [164/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_inference_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include 
-I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++17 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_tbe_inference.dir/gen_embedding_forward_quantized_split_nbit_kernel_weighted_int2_codegen_cuda.cu.o -MF CMakeFiles/fbgemm_gpu_tbe_inference.dir/gen_embedding_forward_quantized_split_nbit_kernel_weighted_int2_codegen_cuda.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_weighted_int2_codegen_cuda.cu -o 
CMakeFiles/fbgemm_gpu_tbe_inference.dir/gen_embedding_forward_quantized_split_nbit_kernel_weighted_int2_codegen_cuda.cu.o 2025-05-07T19:54:50.5689773Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:54:50.5691709Z 2025-05-07T19:54:50.5693386Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:54:50.5695152Z 2025-05-07T19:54:50.5696609Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_weighted_int2_codegen_cuda.cu(270): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:54:50.5698242Z 2025-05-07T19:54:50.5699723Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_weighted_int2_codegen_cuda.cu(275): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:54:50.5701934Z 2025-05-07T19:54:50.5703982Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_weighted_int2_codegen_cuda.cu(292): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:54:50.5705681Z 2025-05-07T19:54:50.5707569Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_weighted_int2_codegen_cuda.cu(304): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:54:50.5709365Z 2025-05-07T19:54:50.5710975Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a 
function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:54:50.5712695Z 2025-05-07T19:54:50.5714348Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:54:50.5716303Z 2025-05-07T19:54:50.5717928Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_weighted_int2_codegen_cuda.cu(270): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:54:50.5719756Z 2025-05-07T19:54:50.5721337Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_weighted_int2_codegen_cuda.cu(275): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:54:50.5723179Z 2025-05-07T19:54:50.5724757Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_weighted_int2_codegen_cuda.cu(292): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:54:50.5726562Z 2025-05-07T19:54:50.5728114Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_weighted_int2_codegen_cuda.cu(304): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:54:50.5729899Z 2025-05-07T19:54:50.5731555Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:54:50.5733433Z 2025-05-07T19:54:50.5735106Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a 
function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:54:50.5736884Z 2025-05-07T19:54:50.5738347Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_weighted_int2_codegen_cuda.cu(270): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:54:50.5739981Z 2025-05-07T19:54:50.5741481Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_weighted_int2_codegen_cuda.cu(275): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:54:50.5743151Z 2025-05-07T19:54:50.5744510Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_weighted_int2_codegen_cuda.cu(292): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:54:50.5746263Z 2025-05-07T19:54:50.5747739Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_weighted_int2_codegen_cuda.cu(304): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:54:50.5749537Z 2025-05-07T19:54:50.6005102Z [165/608] /github/home/miniconda/envs/build_binary/bin/c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_pt2_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include 
-I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/lib -fopenmp=libgomp -stdlib=libstdc++ -I/github/home/miniconda/envs/build_binary/include -O3 -DNDEBUG -std=c++17 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -mavx2 -mf16c -mfma -fopenmp -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward_pt2.dir/gen_embedding_backward_split_adam_pt2_cpu_wrapper.cpp.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward_pt2.dir/gen_embedding_backward_split_adam_pt2_cpu_wrapper.cpp.o.d -o CMakeFiles/fbgemm_gpu_tbe_training_backward_pt2.dir/gen_embedding_backward_split_adam_pt2_cpu_wrapper.cpp.o -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_backward_split_adam_pt2_cpu_wrapper.cpp 2025-05-07T19:54:50.6023143Z clang-16: warning: argument unused during compilation: '-L/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/lib' [-Wunused-command-line-argument] 2025-05-07T19:54:50.6414450Z [166/608] /github/home/miniconda/envs/build_binary/bin/c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_pt2_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include 
-I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/lib -fopenmp=libgomp -stdlib=libstdc++ -I/github/home/miniconda/envs/build_binary/include -O3 -DNDEBUG -std=c++17 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -mavx2 -mf16c -mfma -fopenmp -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward_pt2.dir/gen_embedding_backward_split_lamb_pt2_cpu_wrapper.cpp.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward_pt2.dir/gen_embedding_backward_split_lamb_pt2_cpu_wrapper.cpp.o.d -o CMakeFiles/fbgemm_gpu_tbe_training_backward_pt2.dir/gen_embedding_backward_split_lamb_pt2_cpu_wrapper.cpp.o -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_backward_split_lamb_pt2_cpu_wrapper.cpp 2025-05-07T19:54:50.6435283Z clang-16: warning: argument unused during compilation: '-L/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/lib' [-Wunused-command-line-argument] 
2025-05-07T19:54:50.6750716Z [167/608] /github/home/miniconda/envs/build_binary/bin/c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_pt2_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/lib -fopenmp=libgomp -stdlib=libstdc++ -I/github/home/miniconda/envs/build_binary/include -O3 -DNDEBUG -std=c++17 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -mavx2 -mf16c -mfma -fopenmp -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward_pt2.dir/gen_embedding_backward_split_lars_sgd_pt2_cpu_wrapper.cpp.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward_pt2.dir/gen_embedding_backward_split_lars_sgd_pt2_cpu_wrapper.cpp.o.d -o CMakeFiles/fbgemm_gpu_tbe_training_backward_pt2.dir/gen_embedding_backward_split_lars_sgd_pt2_cpu_wrapper.cpp.o 
-c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_backward_split_lars_sgd_pt2_cpu_wrapper.cpp 2025-05-07T19:54:50.6769111Z clang-16: warning: argument unused during compilation: '-L/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/lib' [-Wunused-command-line-argument] 2025-05-07T19:54:50.7085287Z [168/608] /github/home/miniconda/envs/build_binary/bin/c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_pt2_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/lib -fopenmp=libgomp -stdlib=libstdc++ -I/github/home/miniconda/envs/build_binary/include -O3 -DNDEBUG -std=c++17 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -mavx2 -mf16c -mfma -fopenmp -MD -MT 
CMakeFiles/fbgemm_gpu_tbe_training_backward_pt2.dir/gen_embedding_backward_split_partial_rowwise_adam_pt2_cpu_wrapper.cpp.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward_pt2.dir/gen_embedding_backward_split_partial_rowwise_adam_pt2_cpu_wrapper.cpp.o.d -o CMakeFiles/fbgemm_gpu_tbe_training_backward_pt2.dir/gen_embedding_backward_split_partial_rowwise_adam_pt2_cpu_wrapper.cpp.o -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_backward_split_partial_rowwise_adam_pt2_cpu_wrapper.cpp 2025-05-07T19:54:50.7104819Z clang-16: warning: argument unused during compilation: '-L/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/lib' [-Wunused-command-line-argument] 2025-05-07T19:54:50.7494810Z [169/608] /github/home/miniconda/envs/build_binary/bin/c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_pt2_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem 
/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/lib -fopenmp=libgomp -stdlib=libstdc++ -I/github/home/miniconda/envs/build_binary/include -O3 -DNDEBUG -std=c++17 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -mavx2 -mf16c -mfma -fopenmp -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward_pt2.dir/gen_embedding_backward_split_partial_rowwise_lamb_pt2_cpu_wrapper.cpp.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward_pt2.dir/gen_embedding_backward_split_partial_rowwise_lamb_pt2_cpu_wrapper.cpp.o.d -o CMakeFiles/fbgemm_gpu_tbe_training_backward_pt2.dir/gen_embedding_backward_split_partial_rowwise_lamb_pt2_cpu_wrapper.cpp.o -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_backward_split_partial_rowwise_lamb_pt2_cpu_wrapper.cpp 2025-05-07T19:54:50.7515581Z clang-16: warning: argument unused during compilation: '-L/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/lib' [-Wunused-command-line-argument] 2025-05-07T19:54:50.7903686Z [170/608] /github/home/miniconda/envs/build_binary/bin/c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_pt2_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem 
/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/lib -fopenmp=libgomp -stdlib=libstdc++ -I/github/home/miniconda/envs/build_binary/include -O3 -DNDEBUG -std=c++17 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -mavx2 -mf16c -mfma -fopenmp -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward_pt2.dir/gen_embedding_backward_split_none_pt2_cpu_wrapper.cpp.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward_pt2.dir/gen_embedding_backward_split_none_pt2_cpu_wrapper.cpp.o.d -o CMakeFiles/fbgemm_gpu_tbe_training_backward_pt2.dir/gen_embedding_backward_split_none_pt2_cpu_wrapper.cpp.o -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_backward_split_none_pt2_cpu_wrapper.cpp 2025-05-07T19:54:50.7923543Z clang-16: warning: argument unused during compilation: '-L/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/lib' [-Wunused-command-line-argument] 2025-05-07T19:54:50.8242777Z [171/608] /github/home/miniconda/envs/build_binary/bin/c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_pt2_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include 
-I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/lib -fopenmp=libgomp -stdlib=libstdc++ -I/github/home/miniconda/envs/build_binary/include -O3 -DNDEBUG -std=c++17 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -mavx2 -mf16c -mfma -fopenmp -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward_pt2.dir/gen_embedding_backward_split_rowwise_adagrad_with_counter_pt2_cpu_wrapper.cpp.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward_pt2.dir/gen_embedding_backward_split_rowwise_adagrad_with_counter_pt2_cpu_wrapper.cpp.o.d -o CMakeFiles/fbgemm_gpu_tbe_training_backward_pt2.dir/gen_embedding_backward_split_rowwise_adagrad_with_counter_pt2_cpu_wrapper.cpp.o -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_backward_split_rowwise_adagrad_with_counter_pt2_cpu_wrapper.cpp 2025-05-07T19:54:50.8262174Z clang-16: warning: argument unused during compilation: '-L/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/lib' [-Wunused-command-line-argument] 
2025-05-07T19:54:50.8446484Z [172/608] /github/home/miniconda/envs/build_binary/bin/c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_pt2_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/lib -fopenmp=libgomp -stdlib=libstdc++ -I/github/home/miniconda/envs/build_binary/include -O3 -DNDEBUG -std=c++17 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -mavx2 -mf16c -mfma -fopenmp -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward_pt2.dir/gen_embedding_split_approx_rowwise_adagrad_pt2_autograd.cpp.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward_pt2.dir/gen_embedding_split_approx_rowwise_adagrad_pt2_autograd.cpp.o.d -o 
CMakeFiles/fbgemm_gpu_tbe_training_backward_pt2.dir/gen_embedding_split_approx_rowwise_adagrad_pt2_autograd.cpp.o -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_split_approx_rowwise_adagrad_pt2_autograd.cpp 2025-05-07T19:54:50.8465417Z clang-16: warning: argument unused during compilation: '-L/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/lib' [-Wunused-command-line-argument] 2025-05-07T19:54:50.8648283Z [173/608] /github/home/miniconda/envs/build_binary/bin/c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_pt2_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/lib -fopenmp=libgomp -stdlib=libstdc++ -I/github/home/miniconda/envs/build_binary/include -O3 -DNDEBUG -std=c++17 -fPIC 
-Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -mavx2 -mf16c -mfma -fopenmp -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward_pt2.dir/gen_embedding_backward_split_approx_sgd_pt2_cpu_wrapper.cpp.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward_pt2.dir/gen_embedding_backward_split_approx_sgd_pt2_cpu_wrapper.cpp.o.d -o CMakeFiles/fbgemm_gpu_tbe_training_backward_pt2.dir/gen_embedding_backward_split_approx_sgd_pt2_cpu_wrapper.cpp.o -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_backward_split_approx_sgd_pt2_cpu_wrapper.cpp 2025-05-07T19:54:50.8668573Z clang-16: warning: argument unused during compilation: '-L/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/lib' [-Wunused-command-line-argument] 2025-05-07T19:54:50.8750208Z [174/608] /github/home/miniconda/envs/build_binary/bin/c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_pt2_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem 
/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/lib -fopenmp=libgomp -stdlib=libstdc++ -I/github/home/miniconda/envs/build_binary/include -O3 -DNDEBUG -std=c++17 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -mavx2 -mf16c -mfma -fopenmp -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward_pt2.dir/gen_embedding_backward_split_approx_rowwise_adagrad_pt2_cpu_wrapper.cpp.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward_pt2.dir/gen_embedding_backward_split_approx_rowwise_adagrad_pt2_cpu_wrapper.cpp.o.d -o CMakeFiles/fbgemm_gpu_tbe_training_backward_pt2.dir/gen_embedding_backward_split_approx_rowwise_adagrad_pt2_cpu_wrapper.cpp.o -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_backward_split_approx_rowwise_adagrad_pt2_cpu_wrapper.cpp 2025-05-07T19:54:50.8770444Z clang-16: warning: argument unused during compilation: '-L/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/lib' [-Wunused-command-line-argument] 2025-05-07T19:54:50.9046114Z [175/608] /github/home/miniconda/envs/build_binary/bin/c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_pt2_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem 
/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/lib -fopenmp=libgomp -stdlib=libstdc++ -I/github/home/miniconda/envs/build_binary/include -O3 -DNDEBUG -std=c++17 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -mavx2 -mf16c -mfma -fopenmp -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward_pt2.dir/gen_embedding_backward_split_approx_rowwise_adagrad_with_counter_pt2_cpu_wrapper.cpp.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward_pt2.dir/gen_embedding_backward_split_approx_rowwise_adagrad_with_counter_pt2_cpu_wrapper.cpp.o.d -o CMakeFiles/fbgemm_gpu_tbe_training_backward_pt2.dir/gen_embedding_backward_split_approx_rowwise_adagrad_with_counter_pt2_cpu_wrapper.cpp.o -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_backward_split_approx_rowwise_adagrad_with_counter_pt2_cpu_wrapper.cpp 2025-05-07T19:54:50.9066053Z clang-16: warning: argument unused during compilation: '-L/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/lib' [-Wunused-command-line-argument] 2025-05-07T19:54:50.9086169Z [176/608] /github/home/miniconda/envs/build_binary/bin/c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_pt2_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include 
-I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/lib -fopenmp=libgomp -stdlib=libstdc++ -I/github/home/miniconda/envs/build_binary/include -O3 -DNDEBUG -std=c++17 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -mavx2 -mf16c -mfma -fopenmp -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward_pt2.dir/gen_embedding_backward_split_approx_rowwise_adagrad_with_weight_decay_pt2_cpu_wrapper.cpp.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward_pt2.dir/gen_embedding_backward_split_approx_rowwise_adagrad_with_weight_decay_pt2_cpu_wrapper.cpp.o.d -o CMakeFiles/fbgemm_gpu_tbe_training_backward_pt2.dir/gen_embedding_backward_split_approx_rowwise_adagrad_with_weight_decay_pt2_cpu_wrapper.cpp.o -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_backward_split_approx_rowwise_adagrad_with_weight_decay_pt2_cpu_wrapper.cpp 2025-05-07T19:54:50.9106125Z clang-16: warning: argument unused 
during compilation: '-L/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/lib' [-Wunused-command-line-argument] 2025-05-07T19:54:50.9349301Z [177/608] /github/home/miniconda/envs/build_binary/bin/c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_pt2_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/lib -fopenmp=libgomp -stdlib=libstdc++ -I/github/home/miniconda/envs/build_binary/include -O3 -DNDEBUG -std=c++17 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -mavx2 -mf16c -mfma -fopenmp -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward_pt2.dir/gen_embedding_backward_split_rowwise_adagrad_with_weight_decay_pt2_cpu_wrapper.cpp.o -MF 
CMakeFiles/fbgemm_gpu_tbe_training_backward_pt2.dir/gen_embedding_backward_split_rowwise_adagrad_with_weight_decay_pt2_cpu_wrapper.cpp.o.d -o CMakeFiles/fbgemm_gpu_tbe_training_backward_pt2.dir/gen_embedding_backward_split_rowwise_adagrad_with_weight_decay_pt2_cpu_wrapper.cpp.o -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_backward_split_rowwise_adagrad_with_weight_decay_pt2_cpu_wrapper.cpp 2025-05-07T19:54:50.9369726Z clang-16: warning: argument unused during compilation: '-L/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/lib' [-Wunused-command-line-argument] 2025-05-07T19:54:50.9432770Z [178/608] /github/home/miniconda/envs/build_binary/bin/c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_pt2_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA 
-L/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/lib -fopenmp=libgomp -stdlib=libstdc++ -I/github/home/miniconda/envs/build_binary/include -O3 -DNDEBUG -std=c++17 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -mavx2 -mf16c -mfma -fopenmp -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward_pt2.dir/gen_embedding_backward_split_rowwise_weighted_adagrad_pt2_cpu_wrapper.cpp.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward_pt2.dir/gen_embedding_backward_split_rowwise_weighted_adagrad_pt2_cpu_wrapper.cpp.o.d -o CMakeFiles/fbgemm_gpu_tbe_training_backward_pt2.dir/gen_embedding_backward_split_rowwise_weighted_adagrad_pt2_cpu_wrapper.cpp.o -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_backward_split_rowwise_weighted_adagrad_pt2_cpu_wrapper.cpp 2025-05-07T19:54:50.9452696Z clang-16: warning: argument unused during compilation: '-L/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/lib' [-Wunused-command-line-argument] 2025-05-07T19:54:51.3290278Z [179/608] /github/home/miniconda/envs/build_binary/bin/c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_pt2_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem 
/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/lib -fopenmp=libgomp -stdlib=libstdc++ -I/github/home/miniconda/envs/build_binary/include -O3 -DNDEBUG -std=c++17 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -mavx2 -mf16c -mfma -fopenmp -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward_pt2.dir/gen_embedding_split_approx_rowwise_adagrad_with_counter_pt2_autograd.cpp.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward_pt2.dir/gen_embedding_split_approx_rowwise_adagrad_with_counter_pt2_autograd.cpp.o.d -o CMakeFiles/fbgemm_gpu_tbe_training_backward_pt2.dir/gen_embedding_split_approx_rowwise_adagrad_with_counter_pt2_autograd.cpp.o -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_split_approx_rowwise_adagrad_with_counter_pt2_autograd.cpp 2025-05-07T19:54:51.3310103Z clang-16: warning: argument unused during compilation: '-L/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/lib' [-Wunused-command-line-argument] 2025-05-07T19:54:52.0083249Z [180/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_forward_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include 
-I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++17 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_forward.dir/gen_embedding_forward_split_unweighted_nobag_kernel_small.cu.o -MF CMakeFiles/fbgemm_gpu_tbe_training_forward.dir/gen_embedding_forward_split_unweighted_nobag_kernel_small.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_forward_split_unweighted_nobag_kernel_small.cu -o 
CMakeFiles/fbgemm_gpu_tbe_training_forward.dir/gen_embedding_forward_split_unweighted_nobag_kernel_small.cu.o 2025-05-07T19:54:52.0105036Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:54:52.0106994Z 2025-05-07T19:54:52.0108668Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:54:52.0110528Z 2025-05-07T19:54:52.0112192Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:54:52.0114235Z 2025-05-07T19:54:52.0115827Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:54:52.0117309Z 2025-05-07T19:54:52.0118684Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:54:52.0120514Z 2025-05-07T19:54:52.0121967Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:54:52.0123739Z 2025-05-07T19:54:52.8517397Z [181/608] /github/home/miniconda/envs/build_binary/bin/c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC 
-DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_pt2_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/lib -fopenmp=libgomp -stdlib=libstdc++ -I/github/home/miniconda/envs/build_binary/include -O3 -DNDEBUG -std=c++17 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -mavx2 -mf16c -mfma -fopenmp -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward_pt2.dir/gen_embedding_split_rowwise_adagrad_pt2_autograd.cpp.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward_pt2.dir/gen_embedding_split_rowwise_adagrad_pt2_autograd.cpp.o.d -o CMakeFiles/fbgemm_gpu_tbe_training_backward_pt2.dir/gen_embedding_split_rowwise_adagrad_pt2_autograd.cpp.o -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_split_rowwise_adagrad_pt2_autograd.cpp 2025-05-07T19:54:52.8534463Z 
clang-16: warning: argument unused during compilation: '-L/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/lib' [-Wunused-command-line-argument] 2025-05-07T19:54:53.1607949Z [182/608] /github/home/miniconda/envs/build_binary/bin/c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_pt2_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/lib -fopenmp=libgomp -stdlib=libstdc++ -I/github/home/miniconda/envs/build_binary/include -O3 -DNDEBUG -std=c++17 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -mavx2 -mf16c -mfma -fopenmp -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward_pt2.dir/gen_embedding_split_approx_rowwise_adagrad_with_weight_decay_pt2_autograd.cpp.o -MF 
CMakeFiles/fbgemm_gpu_tbe_training_backward_pt2.dir/gen_embedding_split_approx_rowwise_adagrad_with_weight_decay_pt2_autograd.cpp.o.d -o CMakeFiles/fbgemm_gpu_tbe_training_backward_pt2.dir/gen_embedding_split_approx_rowwise_adagrad_with_weight_decay_pt2_autograd.cpp.o -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_split_approx_rowwise_adagrad_with_weight_decay_pt2_autograd.cpp 2025-05-07T19:54:53.1626587Z clang-16: warning: argument unused during compilation: '-L/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/lib' [-Wunused-command-line-argument] 2025-05-07T19:54:53.3122507Z [183/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_inference_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -isystem 
/github/home/miniconda/envs/build_binary/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++17 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_tbe_inference.dir/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_int2_codegen_cuda.cu.o -MF CMakeFiles/fbgemm_gpu_tbe_inference.dir/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_int2_codegen_cuda.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_int2_codegen_cuda.cu -o CMakeFiles/fbgemm_gpu_tbe_inference.dir/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_int2_codegen_cuda.cu.o 2025-05-07T19:54:53.3145891Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:54:53.3147751Z 2025-05-07T19:54:53.3149408Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:54:53.3151323Z 2025-05-07T19:54:53.3153038Z 
/__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_int2_codegen_cuda.cu(254): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:54:53.3154764Z 2025-05-07T19:54:53.3156379Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_int2_codegen_cuda.cu(259): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:54:53.3158233Z 2025-05-07T19:54:53.3159869Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_int2_codegen_cuda.cu(276): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:54:53.3161701Z 2025-05-07T19:54:53.3163334Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_int2_codegen_cuda.cu(288): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:54:53.3165164Z 2025-05-07T19:54:53.3166805Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:54:53.3169033Z 2025-05-07T19:54:53.3170711Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:54:53.3172617Z 2025-05-07T19:54:53.3174391Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_int2_codegen_cuda.cu(254): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:54:53.3176210Z 2025-05-07T19:54:53.3177644Z 
/__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_int2_codegen_cuda.cu(259): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:54:53.3179429Z 2025-05-07T19:54:53.3180918Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_int2_codegen_cuda.cu(276): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:54:53.3182794Z 2025-05-07T19:54:53.3184331Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_int2_codegen_cuda.cu(288): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:54:53.3186149Z 2025-05-07T19:54:53.3187822Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:54:53.3189685Z 2025-05-07T19:54:53.3191357Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:54:53.3193477Z 2025-05-07T19:54:53.3195035Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_int2_codegen_cuda.cu(254): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:54:53.3196860Z 2025-05-07T19:54:53.3198474Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_int2_codegen_cuda.cu(259): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:54:53.3200205Z 2025-05-07T19:54:53.3201788Z 
/__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_int2_codegen_cuda.cu(276): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:54:53.3203783Z 2025-05-07T19:54:53.3205315Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_int2_codegen_cuda.cu(288): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:54:53.3207091Z 2025-05-07T19:54:54.2647297Z [184/608] /github/home/miniconda/envs/build_binary/bin/c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_pt2_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/lib -fopenmp=libgomp 
-stdlib=libstdc++ -I/github/home/miniconda/envs/build_binary/include -O3 -DNDEBUG -std=c++17 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -mavx2 -mf16c -mfma -fopenmp -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward_pt2.dir/gen_embedding_split_rowwise_adagrad_with_weight_decay_pt2_autograd.cpp.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward_pt2.dir/gen_embedding_split_rowwise_adagrad_with_weight_decay_pt2_autograd.cpp.o.d -o CMakeFiles/fbgemm_gpu_tbe_training_backward_pt2.dir/gen_embedding_split_rowwise_adagrad_with_weight_decay_pt2_autograd.cpp.o -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_split_rowwise_adagrad_with_weight_decay_pt2_autograd.cpp 2025-05-07T19:54:54.2667284Z clang-16: warning: argument unused during compilation: '-L/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/lib' [-Wunused-command-line-argument] 2025-05-07T19:54:54.4360406Z [185/608] /github/home/miniconda/envs/build_binary/bin/c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_pt2_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include 
-I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/lib -fopenmp=libgomp -stdlib=libstdc++ -I/github/home/miniconda/envs/build_binary/include -O3 -DNDEBUG -std=c++17 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -mavx2 -mf16c -mfma -fopenmp -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward_pt2.dir/gen_embedding_split_lamb_pt2_autograd.cpp.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward_pt2.dir/gen_embedding_split_lamb_pt2_autograd.cpp.o.d -o CMakeFiles/fbgemm_gpu_tbe_training_backward_pt2.dir/gen_embedding_split_lamb_pt2_autograd.cpp.o -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_split_lamb_pt2_autograd.cpp 2025-05-07T19:54:54.4379418Z clang-16: warning: argument unused during compilation: '-L/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/lib' [-Wunused-command-line-argument] 2025-05-07T19:54:54.4508955Z [186/608] /github/home/miniconda/envs/build_binary/bin/c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_pt2_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include 
-I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/lib -fopenmp=libgomp -stdlib=libstdc++ -I/github/home/miniconda/envs/build_binary/include -O3 -DNDEBUG -std=c++17 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -mavx2 -mf16c -mfma -fopenmp -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward_pt2.dir/gen_embedding_split_lars_sgd_pt2_autograd.cpp.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward_pt2.dir/gen_embedding_split_lars_sgd_pt2_autograd.cpp.o.d -o CMakeFiles/fbgemm_gpu_tbe_training_backward_pt2.dir/gen_embedding_split_lars_sgd_pt2_autograd.cpp.o -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_split_lars_sgd_pt2_autograd.cpp 2025-05-07T19:54:54.4528053Z clang-16: warning: argument unused during compilation: '-L/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/lib' [-Wunused-command-line-argument] 2025-05-07T19:54:55.7402838Z [187/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_forward_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include 
-I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++17 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_forward.dir/gen_embedding_forward_split_unweighted_vbe_codegen_cuda.cu.o -MF CMakeFiles/fbgemm_gpu_tbe_training_forward.dir/gen_embedding_forward_split_unweighted_vbe_codegen_cuda.cu.o.d -x cu -c 
/__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_forward_split_unweighted_vbe_codegen_cuda.cu -o CMakeFiles/fbgemm_gpu_tbe_training_forward.dir/gen_embedding_forward_split_unweighted_vbe_codegen_cuda.cu.o 2025-05-07T19:54:55.7423005Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:54:55.7424807Z 2025-05-07T19:54:55.7426142Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:54:55.7427543Z 2025-05-07T19:54:55.7428887Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:54:55.7430560Z 2025-05-07T19:54:55.7432140Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:54:55.7433926Z 2025-05-07T19:54:55.7435362Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:54:55.7436965Z 2025-05-07T19:54:55.7438399Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:54:55.7439984Z 
2025-05-07T19:54:55.7577642Z [188/608] /github/home/miniconda/envs/build_binary/bin/c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_pt2_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/lib -fopenmp=libgomp -stdlib=libstdc++ -I/github/home/miniconda/envs/build_binary/include -O3 -DNDEBUG -std=c++17 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -mavx2 -mf16c -mfma -fopenmp -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward_pt2.dir/gen_embedding_split_rowwise_weighted_adagrad_pt2_autograd.cpp.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward_pt2.dir/gen_embedding_split_rowwise_weighted_adagrad_pt2_autograd.cpp.o.d -o 
CMakeFiles/fbgemm_gpu_tbe_training_backward_pt2.dir/gen_embedding_split_rowwise_weighted_adagrad_pt2_autograd.cpp.o -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_split_rowwise_weighted_adagrad_pt2_autograd.cpp 2025-05-07T19:54:55.7597068Z clang-16: warning: argument unused during compilation: '-L/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/lib' [-Wunused-command-line-argument] 2025-05-07T19:54:55.7739443Z [189/608] /github/home/miniconda/envs/build_binary/bin/c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_pt2_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/lib -fopenmp=libgomp -stdlib=libstdc++ -I/github/home/miniconda/envs/build_binary/include -O3 -DNDEBUG -std=c++17 
-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -mavx2 -mf16c -mfma -fopenmp -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward_pt2.dir/gen_embedding_backward_split_approx_sgd_pt2_cuda_wrapper.cpp.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward_pt2.dir/gen_embedding_backward_split_approx_sgd_pt2_cuda_wrapper.cpp.o.d -o CMakeFiles/fbgemm_gpu_tbe_training_backward_pt2.dir/gen_embedding_backward_split_approx_sgd_pt2_cuda_wrapper.cpp.o -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_backward_split_approx_sgd_pt2_cuda_wrapper.cpp 2025-05-07T19:54:55.7757645Z clang-16: warning: argument unused during compilation: '-L/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/lib' [-Wunused-command-line-argument] 2025-05-07T19:54:55.7890569Z [190/608] /github/home/miniconda/envs/build_binary/bin/c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_pt2_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include 
-isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/lib -fopenmp=libgomp -stdlib=libstdc++ -I/github/home/miniconda/envs/build_binary/include -O3 -DNDEBUG -std=c++17 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -mavx2 -mf16c -mfma -fopenmp -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward_pt2.dir/gen_embedding_backward_split_approx_rowwise_adagrad_pt2_cuda_wrapper.cpp.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward_pt2.dir/gen_embedding_backward_split_approx_rowwise_adagrad_pt2_cuda_wrapper.cpp.o.d -o CMakeFiles/fbgemm_gpu_tbe_training_backward_pt2.dir/gen_embedding_backward_split_approx_rowwise_adagrad_pt2_cuda_wrapper.cpp.o -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_backward_split_approx_rowwise_adagrad_pt2_cuda_wrapper.cpp 2025-05-07T19:54:55.7910312Z clang-16: warning: argument unused during compilation: '-L/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/lib' [-Wunused-command-line-argument] 2025-05-07T19:54:55.8152263Z [191/608] /github/home/miniconda/envs/build_binary/bin/c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_pt2_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include 
-I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/lib -fopenmp=libgomp -stdlib=libstdc++ -I/github/home/miniconda/envs/build_binary/include -O3 -DNDEBUG -std=c++17 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -mavx2 -mf16c -mfma -fopenmp -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward_pt2.dir/gen_embedding_backward_split_approx_rowwise_adagrad_with_counter_pt2_cuda_wrapper.cpp.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward_pt2.dir/gen_embedding_backward_split_approx_rowwise_adagrad_with_counter_pt2_cuda_wrapper.cpp.o.d -o CMakeFiles/fbgemm_gpu_tbe_training_backward_pt2.dir/gen_embedding_backward_split_approx_rowwise_adagrad_with_counter_pt2_cuda_wrapper.cpp.o -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_backward_split_approx_rowwise_adagrad_with_counter_pt2_cuda_wrapper.cpp 2025-05-07T19:54:55.8171830Z clang-16: warning: argument unused during compilation: '-L/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/lib' [-Wunused-command-line-argument] 2025-05-07T19:54:55.8191845Z [192/608] /github/home/miniconda/envs/build_binary/bin/c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_pt2_EXPORTS 
-I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/lib -fopenmp=libgomp -stdlib=libstdc++ -I/github/home/miniconda/envs/build_binary/include -O3 -DNDEBUG -std=c++17 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -mavx2 -mf16c -mfma -fopenmp -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward_pt2.dir/gen_embedding_backward_split_approx_rowwise_adagrad_with_weight_decay_pt2_cuda_wrapper.cpp.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward_pt2.dir/gen_embedding_backward_split_approx_rowwise_adagrad_with_weight_decay_pt2_cuda_wrapper.cpp.o.d -o CMakeFiles/fbgemm_gpu_tbe_training_backward_pt2.dir/gen_embedding_backward_split_approx_rowwise_adagrad_with_weight_decay_pt2_cuda_wrapper.cpp.o -c 
/__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_backward_split_approx_rowwise_adagrad_with_weight_decay_pt2_cuda_wrapper.cpp 2025-05-07T19:54:55.8211872Z clang-16: warning: argument unused during compilation: '-L/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/lib' [-Wunused-command-line-argument] 2025-05-07T19:54:55.8483309Z [193/608] /github/home/miniconda/envs/build_binary/bin/c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_pt2_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/lib -fopenmp=libgomp -stdlib=libstdc++ -I/github/home/miniconda/envs/build_binary/include -O3 -DNDEBUG -std=c++17 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -mavx2 -mf16c 
-mfma -fopenmp -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward_pt2.dir/gen_embedding_backward_split_rowwise_weighted_adagrad_pt2_cuda_wrapper.cpp.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward_pt2.dir/gen_embedding_backward_split_rowwise_weighted_adagrad_pt2_cuda_wrapper.cpp.o.d -o CMakeFiles/fbgemm_gpu_tbe_training_backward_pt2.dir/gen_embedding_backward_split_rowwise_weighted_adagrad_pt2_cuda_wrapper.cpp.o -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_backward_split_rowwise_weighted_adagrad_pt2_cuda_wrapper.cpp 2025-05-07T19:54:55.8503290Z clang-16: warning: argument unused during compilation: '-L/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/lib' [-Wunused-command-line-argument] 2025-05-07T19:54:55.8521593Z [194/608] /github/home/miniconda/envs/build_binary/bin/c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_pt2_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem 
/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/lib -fopenmp=libgomp -stdlib=libstdc++ -I/github/home/miniconda/envs/build_binary/include -O3 -DNDEBUG -std=c++17 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -mavx2 -mf16c -mfma -fopenmp -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward_pt2.dir/gen_embedding_backward_split_rowwise_adagrad_with_weight_decay_pt2_cuda_wrapper.cpp.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward_pt2.dir/gen_embedding_backward_split_rowwise_adagrad_with_weight_decay_pt2_cuda_wrapper.cpp.o.d -o CMakeFiles/fbgemm_gpu_tbe_training_backward_pt2.dir/gen_embedding_backward_split_rowwise_adagrad_with_weight_decay_pt2_cuda_wrapper.cpp.o -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_backward_split_rowwise_adagrad_with_weight_decay_pt2_cuda_wrapper.cpp 2025-05-07T19:54:55.8539632Z clang-16: warning: argument unused during compilation: '-L/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/lib' [-Wunused-command-line-argument] 2025-05-07T19:54:56.8470847Z [195/608] /github/home/miniconda/envs/build_binary/bin/c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_pt2_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include 
-I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/lib -fopenmp=libgomp -stdlib=libstdc++ -I/github/home/miniconda/envs/build_binary/include -O3 -DNDEBUG -std=c++17 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -mavx2 -mf16c -mfma -fopenmp -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward_pt2.dir/gen_embedding_split_none_pt2_autograd.cpp.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward_pt2.dir/gen_embedding_split_none_pt2_autograd.cpp.o.d -o CMakeFiles/fbgemm_gpu_tbe_training_backward_pt2.dir/gen_embedding_split_none_pt2_autograd.cpp.o -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_split_none_pt2_autograd.cpp 2025-05-07T19:54:57.1441066Z clang-16: warning: argument unused during compilation: '-L/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/lib' [-Wunused-command-line-argument] 2025-05-07T19:54:57.1463586Z [196/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_inference_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src 
-I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++17 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_tbe_inference.dir/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_nobag_fp8_codegen_cuda.cu.o -MF CMakeFiles/fbgemm_gpu_tbe_inference.dir/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_nobag_fp8_codegen_cuda.cu.o.d -x cu -c 
/__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_nobag_fp8_codegen_cuda.cu -o CMakeFiles/fbgemm_gpu_tbe_inference.dir/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_nobag_fp8_codegen_cuda.cu.o 2025-05-07T19:54:57.1486988Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:54:57.1488803Z 2025-05-07T19:54:57.1490549Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:54:57.1492452Z 2025-05-07T19:54:57.1494050Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_nobag_fp8_codegen_cuda.cu(219): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:54:57.1496362Z 2025-05-07T19:54:57.1498143Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_nobag_fp8_codegen_cuda.cu(236): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:54:57.1500011Z 2025-05-07T19:54:57.1501646Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_nobag_fp8_codegen_cuda.cu(246): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:54:57.1503821Z 2025-05-07T19:54:57.1505506Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 
2025-05-07T19:54:57.1507388Z 2025-05-07T19:54:57.1509031Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:54:57.1510875Z 2025-05-07T19:54:57.1512532Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_nobag_fp8_codegen_cuda.cu(219): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:54:57.1514509Z 2025-05-07T19:54:57.1516110Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_nobag_fp8_codegen_cuda.cu(236): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:54:57.1517999Z 2025-05-07T19:54:57.1519610Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_nobag_fp8_codegen_cuda.cu(246): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:54:57.1521491Z 2025-05-07T19:54:57.1523160Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:54:57.1525019Z 2025-05-07T19:54:57.1526641Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:54:57.1528575Z 2025-05-07T19:54:57.1530213Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_nobag_fp8_codegen_cuda.cu(219): warning #186-D: pointless comparison of 
unsigned integer with zero 2025-05-07T19:54:57.1531989Z 2025-05-07T19:54:57.1533641Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_nobag_fp8_codegen_cuda.cu(236): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:54:57.1535496Z 2025-05-07T19:54:57.1537118Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_nobag_fp8_codegen_cuda.cu(246): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:54:57.1538826Z 2025-05-07T19:54:57.3759837Z [197/608] /github/home/miniconda/envs/build_binary/bin/c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_pt2_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA 
-L/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/lib -fopenmp=libgomp -stdlib=libstdc++ -I/github/home/miniconda/envs/build_binary/include -O3 -DNDEBUG -std=c++17 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -mavx2 -mf16c -mfma -fopenmp -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward_pt2.dir/gen_embedding_split_partial_rowwise_adam_pt2_autograd.cpp.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward_pt2.dir/gen_embedding_split_partial_rowwise_adam_pt2_autograd.cpp.o.d -o CMakeFiles/fbgemm_gpu_tbe_training_backward_pt2.dir/gen_embedding_split_partial_rowwise_adam_pt2_autograd.cpp.o -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_split_partial_rowwise_adam_pt2_autograd.cpp 2025-05-07T19:54:57.3779969Z clang-16: warning: argument unused during compilation: '-L/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/lib' [-Wunused-command-line-argument] 2025-05-07T19:54:57.6375113Z [198/608] /github/home/miniconda/envs/build_binary/bin/c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_pt2_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include 
-I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/lib -fopenmp=libgomp -stdlib=libstdc++ -I/github/home/miniconda/envs/build_binary/include -O3 -DNDEBUG -std=c++17 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -mavx2 -mf16c -mfma -fopenmp -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward_pt2.dir/gen_embedding_backward_split_adagrad_pt2_cpu_wrapper.cpp.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward_pt2.dir/gen_embedding_backward_split_adagrad_pt2_cpu_wrapper.cpp.o.d -o CMakeFiles/fbgemm_gpu_tbe_training_backward_pt2.dir/gen_embedding_backward_split_adagrad_pt2_cpu_wrapper.cpp.o -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_backward_split_adagrad_pt2_cpu_wrapper.cpp 2025-05-07T19:54:57.6392397Z clang-16: warning: argument unused during compilation: '-L/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/lib' [-Wunused-command-line-argument] 2025-05-07T19:54:58.2635539Z [199/608] /github/home/miniconda/envs/build_binary/bin/c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_pt2_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include 
-I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/lib -fopenmp=libgomp -stdlib=libstdc++ -I/github/home/miniconda/envs/build_binary/include -O3 -DNDEBUG -std=c++17 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -mavx2 -mf16c -mfma -fopenmp -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward_pt2.dir/gen_embedding_split_partial_rowwise_lamb_pt2_autograd.cpp.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward_pt2.dir/gen_embedding_split_partial_rowwise_lamb_pt2_autograd.cpp.o.d -o CMakeFiles/fbgemm_gpu_tbe_training_backward_pt2.dir/gen_embedding_split_partial_rowwise_lamb_pt2_autograd.cpp.o -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_split_partial_rowwise_lamb_pt2_autograd.cpp 2025-05-07T19:54:58.2653565Z clang-16: warning: argument unused during compilation: '-L/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/lib' [-Wunused-command-line-argument] 2025-05-07T19:54:58.8777162Z [200/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_forward_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include 
-I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++17 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_forward.dir/gen_embedding_forward_ssd_unweighted_nobag_kernel_small.cu.o -MF CMakeFiles/fbgemm_gpu_tbe_training_forward.dir/gen_embedding_forward_ssd_unweighted_nobag_kernel_small.cu.o.d -x 
cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_forward_ssd_unweighted_nobag_kernel_small.cu -o CMakeFiles/fbgemm_gpu_tbe_training_forward.dir/gen_embedding_forward_ssd_unweighted_nobag_kernel_small.cu.o 2025-05-07T19:54:58.8800406Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:54:58.8802629Z 2025-05-07T19:54:58.8804412Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:54:58.8806326Z 2025-05-07T19:54:58.8807937Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:54:58.8809854Z 2025-05-07T19:54:58.8811468Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:54:58.8813247Z 2025-05-07T19:54:58.8814761Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:54:58.8816628Z 2025-05-07T19:54:58.8818310Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:54:58.8820146Z 
2025-05-07T19:54:59.3132248Z [201/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_forward_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++17 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ 
-D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_forward.dir/gen_embedding_forward_split_unweighted_vbe_gwd_codegen_cuda.cu.o -MF CMakeFiles/fbgemm_gpu_tbe_training_forward.dir/gen_embedding_forward_split_unweighted_vbe_gwd_codegen_cuda.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_forward_split_unweighted_vbe_gwd_codegen_cuda.cu -o CMakeFiles/fbgemm_gpu_tbe_training_forward.dir/gen_embedding_forward_split_unweighted_vbe_gwd_codegen_cuda.cu.o 2025-05-07T19:54:59.3154366Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:54:59.3156275Z 2025-05-07T19:54:59.3157960Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:54:59.3159892Z 2025-05-07T19:54:59.3161476Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:54:59.3163117Z 2025-05-07T19:54:59.3164824Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:54:59.3166797Z 2025-05-07T19:54:59.3168489Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 
2025-05-07T19:54:59.3170070Z 2025-05-07T19:54:59.3171302Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:54:59.3172684Z 2025-05-07T19:55:00.6629042Z [202/608] /github/home/miniconda/envs/build_binary/bin/c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_pt2_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/lib -fopenmp=libgomp -stdlib=libstdc++ -I/github/home/miniconda/envs/build_binary/include -O3 -DNDEBUG -std=c++17 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -mavx2 -mf16c -mfma -fopenmp -MD -MT 
CMakeFiles/fbgemm_gpu_tbe_training_backward_pt2.dir/gen_embedding_split_adam_pt2_autograd.cpp.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward_pt2.dir/gen_embedding_split_adam_pt2_autograd.cpp.o.d -o CMakeFiles/fbgemm_gpu_tbe_training_backward_pt2.dir/gen_embedding_split_adam_pt2_autograd.cpp.o -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_split_adam_pt2_autograd.cpp 2025-05-07T19:55:00.6647590Z clang-16: warning: argument unused during compilation: '-L/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/lib' [-Wunused-command-line-argument] 2025-05-07T19:55:02.8250012Z [203/608] /github/home/miniconda/envs/build_binary/bin/c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_pt2_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA 
-L/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/lib -fopenmp=libgomp -stdlib=libstdc++ -I/github/home/miniconda/envs/build_binary/include -O3 -DNDEBUG -std=c++17 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -mavx2 -mf16c -mfma -fopenmp -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward_pt2.dir/gen_embedding_backward_split_sgd_pt2_cpu_wrapper.cpp.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward_pt2.dir/gen_embedding_backward_split_sgd_pt2_cpu_wrapper.cpp.o.d -o CMakeFiles/fbgemm_gpu_tbe_training_backward_pt2.dir/gen_embedding_backward_split_sgd_pt2_cpu_wrapper.cpp.o -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_backward_split_sgd_pt2_cpu_wrapper.cpp 2025-05-07T19:55:02.8266398Z clang-16: warning: argument unused during compilation: '-L/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/lib' [-Wunused-command-line-argument] 2025-05-07T19:55:02.8744900Z [204/608] /github/home/miniconda/envs/build_binary/bin/c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_pt2_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include 
-I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/lib -fopenmp=libgomp -stdlib=libstdc++ -I/github/home/miniconda/envs/build_binary/include -O3 -DNDEBUG -std=c++17 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -mavx2 -mf16c -mfma -fopenmp -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward_pt2.dir/gen_embedding_backward_split_rowwise_adagrad_pt2_cpu_wrapper.cpp.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward_pt2.dir/gen_embedding_backward_split_rowwise_adagrad_pt2_cpu_wrapper.cpp.o.d -o CMakeFiles/fbgemm_gpu_tbe_training_backward_pt2.dir/gen_embedding_backward_split_rowwise_adagrad_pt2_cpu_wrapper.cpp.o -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_backward_split_rowwise_adagrad_pt2_cpu_wrapper.cpp 2025-05-07T19:55:02.8765132Z clang-16: warning: argument unused during compilation: '-L/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/lib' [-Wunused-command-line-argument] 2025-05-07T19:55:06.3711643Z [205/608] /github/home/miniconda/envs/build_binary/bin/c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_pt2_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include 
-I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/lib -fopenmp=libgomp -stdlib=libstdc++ -I/github/home/miniconda/envs/build_binary/include -O3 -DNDEBUG -std=c++17 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -mavx2 -mf16c -mfma -fopenmp -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward_pt2.dir/gen_embedding_backward_split_adagrad_pt2_cuda_wrapper.cpp.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward_pt2.dir/gen_embedding_backward_split_adagrad_pt2_cuda_wrapper.cpp.o.d -o CMakeFiles/fbgemm_gpu_tbe_training_backward_pt2.dir/gen_embedding_backward_split_adagrad_pt2_cuda_wrapper.cpp.o -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_backward_split_adagrad_pt2_cuda_wrapper.cpp 2025-05-07T19:55:07.4983311Z clang-16: warning: argument unused during compilation: '-L/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/lib' [-Wunused-command-line-argument] 2025-05-07T19:55:07.5006824Z [206/608] /github/home/miniconda/envs/build_binary/bin/c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_pt2_EXPORTS 
-I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/lib -fopenmp=libgomp -stdlib=libstdc++ -I/github/home/miniconda/envs/build_binary/include -O3 -DNDEBUG -std=c++17 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -mavx2 -mf16c -mfma -fopenmp -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward_pt2.dir/gen_embedding_backward_split_sgd_pt2_cuda_wrapper.cpp.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward_pt2.dir/gen_embedding_backward_split_sgd_pt2_cuda_wrapper.cpp.o.d -o CMakeFiles/fbgemm_gpu_tbe_training_backward_pt2.dir/gen_embedding_backward_split_sgd_pt2_cuda_wrapper.cpp.o -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_backward_split_sgd_pt2_cuda_wrapper.cpp 2025-05-07T19:55:07.5028504Z clang-16: warning: argument unused during compilation: 
'-L/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/lib' [-Wunused-command-line-argument] 2025-05-07T19:55:07.6156271Z [207/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_forward_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++17 -Xcompiler=-fPIC 
-Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_forward.dir/gen_embedding_forward_dense_weighted_vbe_kernel.cu.o -MF CMakeFiles/fbgemm_gpu_tbe_training_forward.dir/gen_embedding_forward_dense_weighted_vbe_kernel.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_forward_dense_weighted_vbe_kernel.cu -o CMakeFiles/fbgemm_gpu_tbe_training_forward.dir/gen_embedding_forward_dense_weighted_vbe_kernel.cu.o 2025-05-07T19:55:07.6179683Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:55:07.6181764Z 2025-05-07T19:55:07.6183571Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:55:07.6185653Z 2025-05-07T19:55:07.6187362Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:55:07.6189380Z 2025-05-07T19:55:07.6191199Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:55:07.6193421Z 2025-05-07T19:55:07.6195039Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a 
function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:55:07.6197084Z 2025-05-07T19:55:07.6198863Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:55:07.6200763Z 2025-05-07T19:55:08.1031865Z [208/608] /github/home/miniconda/envs/build_binary/bin/c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_pt2_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/lib -fopenmp=libgomp -stdlib=libstdc++ -I/github/home/miniconda/envs/build_binary/include -O3 -DNDEBUG -std=c++17 -fPIC 
-Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -mavx2 -mf16c -mfma -fopenmp -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward_pt2.dir/gen_embedding_backward_split_none_pt2_cuda_wrapper.cpp.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward_pt2.dir/gen_embedding_backward_split_none_pt2_cuda_wrapper.cpp.o.d -o CMakeFiles/fbgemm_gpu_tbe_training_backward_pt2.dir/gen_embedding_backward_split_none_pt2_cuda_wrapper.cpp.o -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_backward_split_none_pt2_cuda_wrapper.cpp 2025-05-07T19:55:08.1054006Z clang-16: warning: argument unused during compilation: '-L/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/lib' [-Wunused-command-line-argument] 2025-05-07T19:55:08.6758653Z [209/608] /github/home/miniconda/envs/build_binary/bin/c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_pt2_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem 
/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/lib -fopenmp=libgomp -stdlib=libstdc++ -I/github/home/miniconda/envs/build_binary/include -O3 -DNDEBUG -std=c++17 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -mavx2 -mf16c -mfma -fopenmp -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward_pt2.dir/gen_embedding_backward_split_lars_sgd_pt2_cuda_wrapper.cpp.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward_pt2.dir/gen_embedding_backward_split_lars_sgd_pt2_cuda_wrapper.cpp.o.d -o CMakeFiles/fbgemm_gpu_tbe_training_backward_pt2.dir/gen_embedding_backward_split_lars_sgd_pt2_cuda_wrapper.cpp.o -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_backward_split_lars_sgd_pt2_cuda_wrapper.cpp 2025-05-07T19:55:08.6779998Z clang-16: warning: argument unused during compilation: '-L/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/lib' [-Wunused-command-line-argument] 2025-05-07T19:55:09.2875508Z [210/608] /github/home/miniconda/envs/build_binary/bin/c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem 
/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/lib -fopenmp=libgomp -stdlib=libstdc++ -I/github/home/miniconda/envs/build_binary/include -O3 -DNDEBUG -std=c++17 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -mavx2 -mf16c -mfma -fopenmp -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_dense_split_cpu.cpp.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_dense_split_cpu.cpp.o.d -o CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_dense_split_cpu.cpp.o -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_backward_dense_split_cpu.cpp 2025-05-07T19:55:09.2893407Z clang-16: warning: argument unused during compilation: '-L/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/lib' [-Wunused-command-line-argument] 2025-05-07T19:55:09.7350735Z [211/608] /github/home/miniconda/envs/build_binary/bin/c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include 
-I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/lib -fopenmp=libgomp -stdlib=libstdc++ -I/github/home/miniconda/envs/build_binary/include -O3 -DNDEBUG -std=c++17 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -mavx2 -mf16c -mfma -fopenmp -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_split_adam_cpu.cpp.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_split_adam_cpu.cpp.o.d -o CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_split_adam_cpu.cpp.o -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_backward_split_adam_cpu.cpp 2025-05-07T19:55:09.7368919Z clang-16: warning: argument unused during compilation: '-L/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/lib' [-Wunused-command-line-argument] 2025-05-07T19:55:10.1684317Z [212/608] /github/home/miniconda/envs/build_binary/bin/c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_pt2_EXPORTS 
-I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/lib -fopenmp=libgomp -stdlib=libstdc++ -I/github/home/miniconda/envs/build_binary/include -O3 -DNDEBUG -std=c++17 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -mavx2 -mf16c -mfma -fopenmp -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward_pt2.dir/gen_embedding_backward_split_lamb_pt2_cuda_wrapper.cpp.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward_pt2.dir/gen_embedding_backward_split_lamb_pt2_cuda_wrapper.cpp.o.d -o CMakeFiles/fbgemm_gpu_tbe_training_backward_pt2.dir/gen_embedding_backward_split_lamb_pt2_cuda_wrapper.cpp.o -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_backward_split_lamb_pt2_cuda_wrapper.cpp 2025-05-07T19:55:10.1703239Z clang-16: warning: argument unused during compilation: 
'-L/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/lib' [-Wunused-command-line-argument] 2025-05-07T19:55:10.3654292Z [213/608] /github/home/miniconda/envs/build_binary/bin/c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_pt2_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/lib -fopenmp=libgomp -stdlib=libstdc++ -I/github/home/miniconda/envs/build_binary/include -O3 -DNDEBUG -std=c++17 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -mavx2 -mf16c -mfma -fopenmp -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward_pt2.dir/gen_embedding_split_rowwise_adagrad_with_counter_pt2_autograd.cpp.o -MF 
CMakeFiles/fbgemm_gpu_tbe_training_backward_pt2.dir/gen_embedding_split_rowwise_adagrad_with_counter_pt2_autograd.cpp.o.d -o CMakeFiles/fbgemm_gpu_tbe_training_backward_pt2.dir/gen_embedding_split_rowwise_adagrad_with_counter_pt2_autograd.cpp.o -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_split_rowwise_adagrad_with_counter_pt2_autograd.cpp 2025-05-07T19:55:10.3675404Z clang-16: warning: argument unused during compilation: '-L/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/lib' [-Wunused-command-line-argument] 2025-05-07T19:55:10.4226610Z [214/608] /github/home/miniconda/envs/build_binary/bin/c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA 
-L/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/lib -fopenmp=libgomp -stdlib=libstdc++ -I/github/home/miniconda/envs/build_binary/include -O3 -DNDEBUG -std=c++17 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -mavx2 -mf16c -mfma -fopenmp -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_split_lamb_cpu.cpp.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_split_lamb_cpu.cpp.o.d -o CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_split_lamb_cpu.cpp.o -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_backward_split_lamb_cpu.cpp 2025-05-07T19:55:10.4247272Z clang-16: warning: argument unused during compilation: '-L/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/lib' [-Wunused-command-line-argument] 2025-05-07T19:55:10.4648058Z [215/608] /github/home/miniconda/envs/build_binary/bin/c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_pt2_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include 
-I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/lib -fopenmp=libgomp -stdlib=libstdc++ -I/github/home/miniconda/envs/build_binary/include -O3 -DNDEBUG -std=c++17 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -mavx2 -mf16c -mfma -fopenmp -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward_pt2.dir/gen_embedding_backward_split_partial_rowwise_adam_pt2_cuda_wrapper.cpp.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward_pt2.dir/gen_embedding_backward_split_partial_rowwise_adam_pt2_cuda_wrapper.cpp.o.d -o CMakeFiles/fbgemm_gpu_tbe_training_backward_pt2.dir/gen_embedding_backward_split_partial_rowwise_adam_pt2_cuda_wrapper.cpp.o -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_backward_split_partial_rowwise_adam_pt2_cuda_wrapper.cpp 2025-05-07T19:55:10.4670272Z clang-16: warning: argument unused during compilation: '-L/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/lib' [-Wunused-command-line-argument] 2025-05-07T19:55:10.8019004Z [216/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_forward_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include 
-I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++17 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_forward.dir/gen_embedding_forward_ssd_unweighted_vbe_codegen_cuda.cu.o -MF CMakeFiles/fbgemm_gpu_tbe_training_forward.dir/gen_embedding_forward_ssd_unweighted_vbe_codegen_cuda.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_forward_ssd_unweighted_vbe_codegen_cuda.cu -o CMakeFiles/fbgemm_gpu_tbe_training_forward.dir/gen_embedding_forward_ssd_unweighted_vbe_codegen_cuda.cu.o 2025-05-07T19:55:10.8043988Z 
/__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:55:10.8046356Z 2025-05-07T19:55:10.8048191Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:55:10.8050236Z 2025-05-07T19:55:10.8052022Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:55:10.8054058Z 2025-05-07T19:55:10.8055851Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:55:10.8057898Z 2025-05-07T19:55:10.8060031Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:55:10.8062043Z 2025-05-07T19:55:10.8064037Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:55:10.8066083Z 2025-05-07T19:55:11.0041868Z [217/608] /github/home/miniconda/envs/build_binary/bin/c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu 
-I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/lib -fopenmp=libgomp -stdlib=libstdc++ -I/github/home/miniconda/envs/build_binary/include -O3 -DNDEBUG -std=c++17 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -mavx2 -mf16c -mfma -fopenmp -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_split_lars_sgd_cpu.cpp.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_split_lars_sgd_cpu.cpp.o.d -o CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_split_lars_sgd_cpu.cpp.o -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_backward_split_lars_sgd_cpu.cpp 2025-05-07T19:55:11.0062828Z clang-16: warning: argument unused during compilation: '-L/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/lib' [-Wunused-command-line-argument] 
2025-05-07T19:55:11.1512670Z [218/608] /github/home/miniconda/envs/build_binary/bin/c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/lib -fopenmp=libgomp -stdlib=libstdc++ -I/github/home/miniconda/envs/build_binary/include -O3 -DNDEBUG -std=c++17 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -mavx2 -mf16c -mfma -fopenmp -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/codegen/training/backward/embedding_backward_dense_host_cpu.cpp.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/codegen/training/backward/embedding_backward_dense_host_cpu.cpp.o.d -o 
CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/codegen/training/backward/embedding_backward_dense_host_cpu.cpp.o -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/codegen/training/backward/embedding_backward_dense_host_cpu.cpp 2025-05-07T19:55:11.1533932Z clang-16: warning: argument unused during compilation: '-L/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/lib' [-Wunused-command-line-argument] 2025-05-07T19:55:11.2094749Z [219/608] /github/home/miniconda/envs/build_binary/bin/c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_pt2_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/lib -fopenmp=libgomp -stdlib=libstdc++ -I/github/home/miniconda/envs/build_binary/include -O3 -DNDEBUG -std=c++17 -fPIC 
-Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -mavx2 -mf16c -mfma -fopenmp -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward_pt2.dir/gen_embedding_backward_split_rowwise_adagrad_pt2_cuda_wrapper.cpp.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward_pt2.dir/gen_embedding_backward_split_rowwise_adagrad_pt2_cuda_wrapper.cpp.o.d -o CMakeFiles/fbgemm_gpu_tbe_training_backward_pt2.dir/gen_embedding_backward_split_rowwise_adagrad_pt2_cuda_wrapper.cpp.o -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_backward_split_rowwise_adagrad_pt2_cuda_wrapper.cpp 2025-05-07T19:55:11.2116366Z clang-16: warning: argument unused during compilation: '-L/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/lib' [-Wunused-command-line-argument] 2025-05-07T19:55:11.2340724Z [220/608] /github/home/miniconda/envs/build_binary/bin/c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_pt2_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem 
/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/lib -fopenmp=libgomp -stdlib=libstdc++ -I/github/home/miniconda/envs/build_binary/include -O3 -DNDEBUG -std=c++17 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -mavx2 -mf16c -mfma -fopenmp -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward_pt2.dir/gen_embedding_ssd_rowwise_adagrad_pt2_autograd.cpp.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward_pt2.dir/gen_embedding_ssd_rowwise_adagrad_pt2_autograd.cpp.o.d -o CMakeFiles/fbgemm_gpu_tbe_training_backward_pt2.dir/gen_embedding_ssd_rowwise_adagrad_pt2_autograd.cpp.o -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_ssd_rowwise_adagrad_pt2_autograd.cpp 2025-05-07T19:55:11.2361924Z clang-16: warning: argument unused during compilation: '-L/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/lib' [-Wunused-command-line-argument] 2025-05-07T19:55:11.4462600Z [221/608] /github/home/miniconda/envs/build_binary/bin/c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_pt2_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include 
-isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/lib -fopenmp=libgomp -stdlib=libstdc++ -I/github/home/miniconda/envs/build_binary/include -O3 -DNDEBUG -std=c++17 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -mavx2 -mf16c -mfma -fopenmp -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward_pt2.dir/gen_embedding_backward_ssd_rowwise_adagrad_pt2_cuda_wrapper.cpp.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward_pt2.dir/gen_embedding_backward_ssd_rowwise_adagrad_pt2_cuda_wrapper.cpp.o.d -o CMakeFiles/fbgemm_gpu_tbe_training_backward_pt2.dir/gen_embedding_backward_ssd_rowwise_adagrad_pt2_cuda_wrapper.cpp.o -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_backward_ssd_rowwise_adagrad_pt2_cuda_wrapper.cpp 2025-05-07T19:55:11.4487831Z clang-16: warning: argument unused during compilation: '-L/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/lib' [-Wunused-command-line-argument] 2025-05-07T19:55:11.4884240Z [222/608] /github/home/miniconda/envs/build_binary/bin/c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_pt2_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src 
-I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/lib -fopenmp=libgomp -stdlib=libstdc++ -I/github/home/miniconda/envs/build_binary/include -O3 -DNDEBUG -std=c++17 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -mavx2 -mf16c -mfma -fopenmp -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward_pt2.dir/gen_embedding_backward_split_partial_rowwise_lamb_pt2_cuda_wrapper.cpp.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward_pt2.dir/gen_embedding_backward_split_partial_rowwise_lamb_pt2_cuda_wrapper.cpp.o.d -o CMakeFiles/fbgemm_gpu_tbe_training_backward_pt2.dir/gen_embedding_backward_split_partial_rowwise_lamb_pt2_cuda_wrapper.cpp.o -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_backward_split_partial_rowwise_lamb_pt2_cuda_wrapper.cpp 2025-05-07T19:55:11.4910039Z clang-16: warning: argument unused during compilation: '-L/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/lib' 
[-Wunused-command-line-argument] 2025-05-07T19:55:12.2589426Z [223/608] /github/home/miniconda/envs/build_binary/bin/c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/lib -fopenmp=libgomp -stdlib=libstdc++ -I/github/home/miniconda/envs/build_binary/include -O3 -DNDEBUG -std=c++17 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -mavx2 -mf16c -mfma -fopenmp -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_split_partial_rowwise_adam_cpu.cpp.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_split_partial_rowwise_adam_cpu.cpp.o.d -o 
CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_split_partial_rowwise_adam_cpu.cpp.o -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_backward_split_partial_rowwise_adam_cpu.cpp 2025-05-07T19:55:12.2611874Z clang-16: warning: argument unused during compilation: '-L/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/lib' [-Wunused-command-line-argument] 2025-05-07T19:55:12.4973752Z [224/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_inference_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -Xcudafe 
--diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++17 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_tbe_inference.dir/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_nobag_int4_codegen_cuda.cu.o -MF CMakeFiles/fbgemm_gpu_tbe_inference.dir/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_nobag_int4_codegen_cuda.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_nobag_int4_codegen_cuda.cu -o CMakeFiles/fbgemm_gpu_tbe_inference.dir/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_nobag_int4_codegen_cuda.cu.o 2025-05-07T19:55:12.4997553Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:55:12.4999496Z 2025-05-07T19:55:12.5001180Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:55:12.5003272Z 2025-05-07T19:55:12.5004873Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_nobag_int4_codegen_cuda.cu(220): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:55:12.5006822Z 
2025-05-07T19:55:12.5008444Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_nobag_int4_codegen_cuda.cu(237): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:55:12.5010549Z 2025-05-07T19:55:12.5012435Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_nobag_int4_codegen_cuda.cu(247): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:55:12.5014332Z 2025-05-07T19:55:12.5015983Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:55:12.5017871Z 2025-05-07T19:55:12.5019591Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:55:12.5021515Z 2025-05-07T19:55:12.5023146Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_nobag_int4_codegen_cuda.cu(220): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:55:12.5024981Z 2025-05-07T19:55:12.5026702Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_nobag_int4_codegen_cuda.cu(237): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:55:12.5028614Z 2025-05-07T19:55:12.5030256Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_nobag_int4_codegen_cuda.cu(247): warning #186-D: pointless comparison of unsigned integer with zero 
2025-05-07T19:55:12.5032095Z 2025-05-07T19:55:12.5033846Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:55:12.5035802Z 2025-05-07T19:55:12.5037556Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:55:12.5039529Z 2025-05-07T19:55:12.5041195Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_nobag_int4_codegen_cuda.cu(220): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:55:12.5043044Z 2025-05-07T19:55:12.5044665Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_nobag_int4_codegen_cuda.cu(237): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:55:12.5046556Z 2025-05-07T19:55:12.5048225Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_nobag_int4_codegen_cuda.cu(247): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:55:12.5050123Z 2025-05-07T19:55:14.2783869Z [225/608] /github/home/miniconda/envs/build_binary/bin/c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include 
-I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/lib -fopenmp=libgomp -stdlib=libstdc++ -I/github/home/miniconda/envs/build_binary/include -O3 -DNDEBUG -std=c++17 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -mavx2 -mf16c -mfma -fopenmp -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_split_adagrad_cpu.cpp.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_split_adagrad_cpu.cpp.o.d -o CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_split_adagrad_cpu.cpp.o -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_backward_split_adagrad_cpu.cpp 2025-05-07T19:55:14.2804941Z clang-16: warning: argument unused during compilation: '-L/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/lib' [-Wunused-command-line-argument] 2025-05-07T19:55:14.2888794Z [226/608] /github/home/miniconda/envs/build_binary/bin/c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_EXPORTS 
-I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/lib -fopenmp=libgomp -stdlib=libstdc++ -I/github/home/miniconda/envs/build_binary/include -O3 -DNDEBUG -std=c++17 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -mavx2 -mf16c -mfma -fopenmp -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_split_partial_rowwise_lamb_cpu.cpp.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_split_partial_rowwise_lamb_cpu.cpp.o.d -o CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_split_partial_rowwise_lamb_cpu.cpp.o -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_backward_split_partial_rowwise_lamb_cpu.cpp 2025-05-07T19:55:14.2909818Z clang-16: warning: argument unused during compilation: 
'-L/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/lib' [-Wunused-command-line-argument] 2025-05-07T19:55:14.3264688Z [227/608] /github/home/miniconda/envs/build_binary/bin/c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/lib -fopenmp=libgomp -stdlib=libstdc++ -I/github/home/miniconda/envs/build_binary/include -O3 -DNDEBUG -std=c++17 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -mavx2 -mf16c -mfma -fopenmp -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_split_none_cpu.cpp.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_split_none_cpu.cpp.o.d -o 
CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_split_none_cpu.cpp.o -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_backward_split_none_cpu.cpp 2025-05-07T19:55:14.3285348Z clang-16: warning: argument unused during compilation: '-L/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/lib' [-Wunused-command-line-argument] 2025-05-07T19:55:14.3828091Z [228/608] /github/home/miniconda/envs/build_binary/bin/c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/lib -fopenmp=libgomp -stdlib=libstdc++ -I/github/home/miniconda/envs/build_binary/include -O3 -DNDEBUG -std=c++17 -fPIC -Wno-deprecated-anon-enum-enum-conversion 
-Wno-deprecated-declarations -mavx2 -mf16c -mfma -fopenmp -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_split_sgd_cpu.cpp.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_split_sgd_cpu.cpp.o.d -o CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_split_sgd_cpu.cpp.o -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_backward_split_sgd_cpu.cpp 2025-05-07T19:55:15.0029211Z clang-16: warning: argument unused during compilation: '-L/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/lib' [-Wunused-command-line-argument] 2025-05-07T19:55:15.0046580Z [229/608] /github/home/miniconda/envs/build_binary/bin/c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -DTORCH_USE_CUDA_DSA 
-DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/lib -fopenmp=libgomp -stdlib=libstdc++ -I/github/home/miniconda/envs/build_binary/include -O3 -DNDEBUG -std=c++17 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -mavx2 -mf16c -mfma -fopenmp -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_split_rowwise_adagrad_cpu.cpp.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_split_rowwise_adagrad_cpu.cpp.o.d -o CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_split_rowwise_adagrad_cpu.cpp.o -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_backward_split_rowwise_adagrad_cpu.cpp 2025-05-07T19:55:15.0063559Z clang-16: warning: argument unused during compilation: '-L/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/lib' [-Wunused-command-line-argument] 2025-05-07T19:55:16.9883297Z [230/608] /github/home/miniconda/envs/build_binary/bin/c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_pt2_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include 
-I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/lib -fopenmp=libgomp -stdlib=libstdc++ -I/github/home/miniconda/envs/build_binary/include -O3 -DNDEBUG -std=c++17 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -mavx2 -mf16c -mfma -fopenmp -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward_pt2.dir/gen_embedding_backward_split_adam_pt2_cuda_wrapper.cpp.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward_pt2.dir/gen_embedding_backward_split_adam_pt2_cuda_wrapper.cpp.o.d -o CMakeFiles/fbgemm_gpu_tbe_training_backward_pt2.dir/gen_embedding_backward_split_adam_pt2_cuda_wrapper.cpp.o -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_backward_split_adam_pt2_cuda_wrapper.cpp 2025-05-07T19:55:16.9900473Z clang-16: warning: argument unused during compilation: '-L/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/lib' [-Wunused-command-line-argument] 2025-05-07T19:55:17.8983992Z [231/608] /github/home/miniconda/envs/build_binary/bin/c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include 
-I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/lib -fopenmp=libgomp -stdlib=libstdc++ -I/github/home/miniconda/envs/build_binary/include -O3 -DNDEBUG -std=c++17 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -mavx2 -mf16c -mfma -fopenmp -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_split_rowwise_adagrad_with_counter_cpu.cpp.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_split_rowwise_adagrad_with_counter_cpu.cpp.o.d -o CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_split_rowwise_adagrad_with_counter_cpu.cpp.o -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_backward_split_rowwise_adagrad_with_counter_cpu.cpp 2025-05-07T19:55:17.9001014Z clang-16: warning: argument unused during compilation: '-L/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/lib' [-Wunused-command-line-argument] 2025-05-07T19:55:18.7967597Z [232/608] /github/home/miniconda/envs/build_binary/bin/c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu 
-I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/lib -fopenmp=libgomp -stdlib=libstdc++ -I/github/home/miniconda/envs/build_binary/include -O3 -DNDEBUG -std=c++17 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -mavx2 -mf16c -mfma -fopenmp -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_split_approx_sgd_cpu.cpp.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_split_approx_sgd_cpu.cpp.o.d -o CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_split_approx_sgd_cpu.cpp.o -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_backward_split_approx_sgd_cpu.cpp 2025-05-07T19:55:18.7988060Z clang-16: warning: argument unused during compilation: '-L/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/lib' [-Wunused-command-line-argument] 
2025-05-07T19:55:19.1722811Z [233/608] /github/home/miniconda/envs/build_binary/bin/c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/lib -fopenmp=libgomp -stdlib=libstdc++ -I/github/home/miniconda/envs/build_binary/include -O3 -DNDEBUG -std=c++17 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -mavx2 -mf16c -mfma -fopenmp -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_split_approx_rowwise_adagrad_cpu.cpp.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_split_approx_rowwise_adagrad_cpu.cpp.o.d -o CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_split_approx_rowwise_adagrad_cpu.cpp.o -c 
/__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_backward_split_approx_rowwise_adagrad_cpu.cpp 2025-05-07T19:55:19.1743835Z clang-16: warning: argument unused during compilation: '-L/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/lib' [-Wunused-command-line-argument] 2025-05-07T19:55:19.6219192Z [234/608] /github/home/miniconda/envs/build_binary/bin/c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/lib -fopenmp=libgomp -stdlib=libstdc++ -I/github/home/miniconda/envs/build_binary/include -O3 -DNDEBUG -std=c++17 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -mavx2 -mf16c -mfma -fopenmp -MD -MT 
CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_split_approx_rowwise_adagrad_with_counter_cpu.cpp.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_split_approx_rowwise_adagrad_with_counter_cpu.cpp.o.d -o CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_split_approx_rowwise_adagrad_with_counter_cpu.cpp.o -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_backward_split_approx_rowwise_adagrad_with_counter_cpu.cpp 2025-05-07T19:55:19.6240897Z clang-16: warning: argument unused during compilation: '-L/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/lib' [-Wunused-command-line-argument] 2025-05-07T19:55:20.1345807Z [235/608] /github/home/miniconda/envs/build_binary/bin/c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem 
/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/lib -fopenmp=libgomp -stdlib=libstdc++ -I/github/home/miniconda/envs/build_binary/include -O3 -DNDEBUG -std=c++17 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -mavx2 -mf16c -mfma -fopenmp -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_split_approx_rowwise_adagrad_with_weight_decay_cpu.cpp.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_split_approx_rowwise_adagrad_with_weight_decay_cpu.cpp.o.d -o CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_split_approx_rowwise_adagrad_with_weight_decay_cpu.cpp.o -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_backward_split_approx_rowwise_adagrad_with_weight_decay_cpu.cpp 2025-05-07T19:55:20.1367563Z clang-16: warning: argument unused during compilation: '-L/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/lib' [-Wunused-command-line-argument] 2025-05-07T19:55:20.7470948Z [236/608] /github/home/miniconda/envs/build_binary/bin/c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include 
-isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/lib -fopenmp=libgomp -stdlib=libstdc++ -I/github/home/miniconda/envs/build_binary/include -O3 -DNDEBUG -std=c++17 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -mavx2 -mf16c -mfma -fopenmp -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_split_rowwise_adagrad_with_weight_decay_cpu.cpp.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_split_rowwise_adagrad_with_weight_decay_cpu.cpp.o.d -o CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_split_rowwise_adagrad_with_weight_decay_cpu.cpp.o -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_backward_split_rowwise_adagrad_with_weight_decay_cpu.cpp 2025-05-07T19:55:20.7486637Z clang-16: warning: argument unused during compilation: '-L/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/lib' [-Wunused-command-line-argument] 2025-05-07T19:55:20.8327311Z [237/608] /github/home/miniconda/envs/build_binary/bin/c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_pt2_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src 
-I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/lib -fopenmp=libgomp -stdlib=libstdc++ -I/github/home/miniconda/envs/build_binary/include -O3 -DNDEBUG -std=c++17 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -mavx2 -mf16c -mfma -fopenmp -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward_pt2.dir/gen_embedding_backward_split_rowwise_adagrad_with_counter_pt2_cuda_wrapper.cpp.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward_pt2.dir/gen_embedding_backward_split_rowwise_adagrad_with_counter_pt2_cuda_wrapper.cpp.o.d -o CMakeFiles/fbgemm_gpu_tbe_training_backward_pt2.dir/gen_embedding_backward_split_rowwise_adagrad_with_counter_pt2_cuda_wrapper.cpp.o -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_backward_split_rowwise_adagrad_with_counter_pt2_cuda_wrapper.cpp 2025-05-07T19:55:20.8345818Z clang-16: warning: argument unused during compilation: 
'-L/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/lib' [-Wunused-command-line-argument] 2025-05-07T19:55:21.0806980Z [238/608] /github/home/miniconda/envs/build_binary/bin/c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/lib -fopenmp=libgomp -stdlib=libstdc++ -I/github/home/miniconda/envs/build_binary/include -O3 -DNDEBUG -std=c++17 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -mavx2 -mf16c -mfma -fopenmp -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_split_rowwise_weighted_adagrad_cpu.cpp.o -MF 
CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_split_rowwise_weighted_adagrad_cpu.cpp.o.d -o CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_split_rowwise_weighted_adagrad_cpu.cpp.o -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_backward_split_rowwise_weighted_adagrad_cpu.cpp 2025-05-07T19:55:21.0827110Z clang-16: warning: argument unused during compilation: '-L/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/lib' [-Wunused-command-line-argument] 2025-05-07T19:55:21.6583045Z [239/608] : && /github/home/miniconda/envs/build_binary/bin/c++ -fPIC -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/lib -fopenmp=libgomp -stdlib=libstdc++ -I/github/home/miniconda/envs/build_binary/include -O3 -DNDEBUG -Wl,-O2 -Wl,--sort-common -Wl,--as-needed -Wl,-z,relro -Wl,-z,now -Wl,--disable-new-dtags -Wl,--gc-sections -Wl,--allow-shlib-undefined -Wl,-rpath,/github/home/miniconda/envs/build_binary/lib -Wl,-rpath-link,/github/home/miniconda/envs/build_binary/lib -L/github/home/miniconda/envs/build_binary/lib -s -shared -Wl,-soname,fbgemm_gpu_tbe_training_backward_pt2.so -o fbgemm_gpu_tbe_training_backward_pt2.so CMakeFiles/fbgemm_gpu_tbe_training_backward_pt2.dir/gen_embedding_split_adagrad_pt2_autograd.cpp.o CMakeFiles/fbgemm_gpu_tbe_training_backward_pt2.dir/gen_embedding_split_rowwise_adagrad_pt2_autograd.cpp.o CMakeFiles/fbgemm_gpu_tbe_training_backward_pt2.dir/gen_embedding_split_sgd_pt2_autograd.cpp.o CMakeFiles/fbgemm_gpu_tbe_training_backward_pt2.dir/gen_embedding_split_adam_pt2_autograd.cpp.o CMakeFiles/fbgemm_gpu_tbe_training_backward_pt2.dir/gen_embedding_split_lamb_pt2_autograd.cpp.o CMakeFiles/fbgemm_gpu_tbe_training_backward_pt2.dir/gen_embedding_split_lars_sgd_pt2_autograd.cpp.o 
CMakeFiles/fbgemm_gpu_tbe_training_backward_pt2.dir/gen_embedding_split_partial_rowwise_adam_pt2_autograd.cpp.o CMakeFiles/fbgemm_gpu_tbe_training_backward_pt2.dir/gen_embedding_split_partial_rowwise_lamb_pt2_autograd.cpp.o CMakeFiles/fbgemm_gpu_tbe_training_backward_pt2.dir/gen_embedding_split_none_pt2_autograd.cpp.o CMakeFiles/fbgemm_gpu_tbe_training_backward_pt2.dir/gen_embedding_split_rowwise_adagrad_with_counter_pt2_autograd.cpp.o CMakeFiles/fbgemm_gpu_tbe_training_backward_pt2.dir/gen_embedding_split_approx_sgd_pt2_autograd.cpp.o CMakeFiles/fbgemm_gpu_tbe_training_backward_pt2.dir/gen_embedding_split_approx_rowwise_adagrad_pt2_autograd.cpp.o CMakeFiles/fbgemm_gpu_tbe_training_backward_pt2.dir/gen_embedding_split_approx_rowwise_adagrad_with_counter_pt2_autograd.cpp.o CMakeFiles/fbgemm_gpu_tbe_training_backward_pt2.dir/gen_embedding_split_approx_rowwise_adagrad_with_weight_decay_pt2_autograd.cpp.o CMakeFiles/fbgemm_gpu_tbe_training_backward_pt2.dir/gen_embedding_split_rowwise_adagrad_with_weight_decay_pt2_autograd.cpp.o CMakeFiles/fbgemm_gpu_tbe_training_backward_pt2.dir/gen_embedding_split_rowwise_weighted_adagrad_pt2_autograd.cpp.o CMakeFiles/fbgemm_gpu_tbe_training_backward_pt2.dir/gen_embedding_ssd_rowwise_adagrad_pt2_autograd.cpp.o CMakeFiles/fbgemm_gpu_tbe_training_backward_pt2.dir/gen_embedding_backward_split_adagrad_pt2_cpu_wrapper.cpp.o CMakeFiles/fbgemm_gpu_tbe_training_backward_pt2.dir/gen_embedding_backward_split_rowwise_adagrad_pt2_cpu_wrapper.cpp.o CMakeFiles/fbgemm_gpu_tbe_training_backward_pt2.dir/gen_embedding_backward_split_sgd_pt2_cpu_wrapper.cpp.o CMakeFiles/fbgemm_gpu_tbe_training_backward_pt2.dir/gen_embedding_backward_split_adam_pt2_cpu_wrapper.cpp.o CMakeFiles/fbgemm_gpu_tbe_training_backward_pt2.dir/gen_embedding_backward_split_lamb_pt2_cpu_wrapper.cpp.o CMakeFiles/fbgemm_gpu_tbe_training_backward_pt2.dir/gen_embedding_backward_split_lars_sgd_pt2_cpu_wrapper.cpp.o 
CMakeFiles/fbgemm_gpu_tbe_training_backward_pt2.dir/gen_embedding_backward_split_partial_rowwise_adam_pt2_cpu_wrapper.cpp.o CMakeFiles/fbgemm_gpu_tbe_training_backward_pt2.dir/gen_embedding_backward_split_partial_rowwise_lamb_pt2_cpu_wrapper.cpp.o CMakeFiles/fbgemm_gpu_tbe_training_backward_pt2.dir/gen_embedding_backward_split_none_pt2_cpu_wrapper.cpp.o CMakeFiles/fbgemm_gpu_tbe_training_backward_pt2.dir/gen_embedding_backward_split_rowwise_adagrad_with_counter_pt2_cpu_wrapper.cpp.o CMakeFiles/fbgemm_gpu_tbe_training_backward_pt2.dir/gen_embedding_backward_split_approx_sgd_pt2_cpu_wrapper.cpp.o CMakeFiles/fbgemm_gpu_tbe_training_backward_pt2.dir/gen_embedding_backward_split_approx_rowwise_adagrad_pt2_cpu_wrapper.cpp.o CMakeFiles/fbgemm_gpu_tbe_training_backward_pt2.dir/gen_embedding_backward_split_approx_rowwise_adagrad_with_counter_pt2_cpu_wrapper.cpp.o CMakeFiles/fbgemm_gpu_tbe_training_backward_pt2.dir/gen_embedding_backward_split_approx_rowwise_adagrad_with_weight_decay_pt2_cpu_wrapper.cpp.o CMakeFiles/fbgemm_gpu_tbe_training_backward_pt2.dir/gen_embedding_backward_split_rowwise_adagrad_with_weight_decay_pt2_cpu_wrapper.cpp.o CMakeFiles/fbgemm_gpu_tbe_training_backward_pt2.dir/gen_embedding_backward_split_rowwise_weighted_adagrad_pt2_cpu_wrapper.cpp.o CMakeFiles/fbgemm_gpu_tbe_training_backward_pt2.dir/gen_embedding_backward_split_adagrad_pt2_cuda_wrapper.cpp.o CMakeFiles/fbgemm_gpu_tbe_training_backward_pt2.dir/gen_embedding_backward_split_rowwise_adagrad_pt2_cuda_wrapper.cpp.o CMakeFiles/fbgemm_gpu_tbe_training_backward_pt2.dir/gen_embedding_backward_split_sgd_pt2_cuda_wrapper.cpp.o CMakeFiles/fbgemm_gpu_tbe_training_backward_pt2.dir/gen_embedding_backward_split_adam_pt2_cuda_wrapper.cpp.o CMakeFiles/fbgemm_gpu_tbe_training_backward_pt2.dir/gen_embedding_backward_split_lamb_pt2_cuda_wrapper.cpp.o CMakeFiles/fbgemm_gpu_tbe_training_backward_pt2.dir/gen_embedding_backward_split_lars_sgd_pt2_cuda_wrapper.cpp.o 
CMakeFiles/fbgemm_gpu_tbe_training_backward_pt2.dir/gen_embedding_backward_split_partial_rowwise_adam_pt2_cuda_wrapper.cpp.o CMakeFiles/fbgemm_gpu_tbe_training_backward_pt2.dir/gen_embedding_backward_split_partial_rowwise_lamb_pt2_cuda_wrapper.cpp.o CMakeFiles/fbgemm_gpu_tbe_training_backward_pt2.dir/gen_embedding_backward_split_none_pt2_cuda_wrapper.cpp.o CMakeFiles/fbgemm_gpu_tbe_training_backward_pt2.dir/gen_embedding_backward_split_rowwise_adagrad_with_counter_pt2_cuda_wrapper.cpp.o CMakeFiles/fbgemm_gpu_tbe_training_backward_pt2.dir/gen_embedding_backward_split_approx_sgd_pt2_cuda_wrapper.cpp.o CMakeFiles/fbgemm_gpu_tbe_training_backward_pt2.dir/gen_embedding_backward_split_approx_rowwise_adagrad_pt2_cuda_wrapper.cpp.o CMakeFiles/fbgemm_gpu_tbe_training_backward_pt2.dir/gen_embedding_backward_split_approx_rowwise_adagrad_with_counter_pt2_cuda_wrapper.cpp.o CMakeFiles/fbgemm_gpu_tbe_training_backward_pt2.dir/gen_embedding_backward_split_approx_rowwise_adagrad_with_weight_decay_pt2_cuda_wrapper.cpp.o CMakeFiles/fbgemm_gpu_tbe_training_backward_pt2.dir/gen_embedding_backward_split_rowwise_adagrad_with_weight_decay_pt2_cuda_wrapper.cpp.o CMakeFiles/fbgemm_gpu_tbe_training_backward_pt2.dir/gen_embedding_backward_split_rowwise_weighted_adagrad_pt2_cuda_wrapper.cpp.o CMakeFiles/fbgemm_gpu_tbe_training_backward_pt2.dir/gen_embedding_backward_ssd_rowwise_adagrad_pt2_cuda_wrapper.cpp.o -L/lib/intel64 -L/lib/intel64_win -L/lib/win-x64 -Wl,-rpath,"\$ORIGIN" /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/lib/libc10.so /github/home/miniconda/envs/build_binary/lib/libnvrtc.so /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/lib/libc10_cuda.so /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/lib/libnccl.so.2 /github/home/miniconda/envs/build_binary/lib/stubs/libcuda.so fbgemm_gpu_tbe_cache.so fbgemm_gpu_tbe_common.so fbgemm_gpu_sparse_async_cumsum.so 
/github/home/miniconda/envs/build_binary/lib/stubs/libnvidia-ml.so -Wl,--no-as-needed,"/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/lib/libtorch.so" -Wl,--as-needed fbgemm.so fbgemm_gpu_config.so fbgemm_gpu_tbe_utils.so /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/lib/libtorch.so -Wl,--no-as-needed,"/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/lib/libtorch_cpu.so" -Wl,--as-needed -Wl,--no-as-needed,"/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/lib/libtorch_cuda.so" -Wl,--as-needed /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/lib/libc10_cuda.so /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/lib/libc10.so /github/home/miniconda/envs/build_binary/lib/libcudart.so -Wl,-rpath-link,/__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build && : 2025-05-07T19:55:21.9598534Z [240/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_inference_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include 
-I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++17 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_tbe_inference.dir/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_fp8_codegen_cuda.cu.o -MF CMakeFiles/fbgemm_gpu_tbe_inference.dir/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_fp8_codegen_cuda.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_fp8_codegen_cuda.cu -o CMakeFiles/fbgemm_gpu_tbe_inference.dir/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_fp8_codegen_cuda.cu.o 2025-05-07T19:55:21.9622445Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:55:21.9624407Z 2025-05-07T19:55:21.9626115Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ 
annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:55:21.9628057Z 2025-05-07T19:55:21.9629670Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_fp8_codegen_cuda.cu(253): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:55:21.9631482Z 2025-05-07T19:55:21.9633167Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_fp8_codegen_cuda.cu(258): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:55:21.9634963Z 2025-05-07T19:55:21.9636545Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_fp8_codegen_cuda.cu(275): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:55:21.9638336Z 2025-05-07T19:55:21.9639947Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_fp8_codegen_cuda.cu(287): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:55:21.9641739Z 2025-05-07T19:55:21.9643460Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:55:21.9645363Z 2025-05-07T19:55:21.9647066Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:55:21.9648992Z 2025-05-07T19:55:21.9650582Z 
/__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_fp8_codegen_cuda.cu(253): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:55:21.9652695Z 2025-05-07T19:55:21.9654289Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_fp8_codegen_cuda.cu(258): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:55:21.9656079Z 2025-05-07T19:55:21.9657799Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_fp8_codegen_cuda.cu(275): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:55:21.9659617Z 2025-05-07T19:55:21.9661193Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_fp8_codegen_cuda.cu(287): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:55:21.9663001Z 2025-05-07T19:55:21.9664682Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:55:21.9666593Z 2025-05-07T19:55:21.9668315Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:55:21.9670231Z 2025-05-07T19:55:21.9671825Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_fp8_codegen_cuda.cu(253): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:55:21.9673733Z 2025-05-07T19:55:21.9675290Z 
/__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_fp8_codegen_cuda.cu(258): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:55:21.9677103Z 2025-05-07T19:55:21.9678684Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_fp8_codegen_cuda.cu(275): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:55:21.9680481Z 2025-05-07T19:55:21.9682088Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_fp8_codegen_cuda.cu(287): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:55:21.9683886Z 2025-05-07T19:55:22.2357432Z [241/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_inference_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem 
/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++17 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_tbe_inference.dir/gen_embedding_forward_quantized_split_nbit_kernel_weighted_fp8_codegen_cuda.cu.o -MF CMakeFiles/fbgemm_gpu_tbe_inference.dir/gen_embedding_forward_quantized_split_nbit_kernel_weighted_fp8_codegen_cuda.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_weighted_fp8_codegen_cuda.cu -o CMakeFiles/fbgemm_gpu_tbe_inference.dir/gen_embedding_forward_quantized_split_nbit_kernel_weighted_fp8_codegen_cuda.cu.o 2025-05-07T19:55:22.2377454Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:55:22.2379118Z 2025-05-07T19:55:22.2380585Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first 
declaration 2025-05-07T19:55:22.2382201Z 2025-05-07T19:55:22.2383553Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_weighted_fp8_codegen_cuda.cu(269): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:55:22.2385053Z 2025-05-07T19:55:22.2386376Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_weighted_fp8_codegen_cuda.cu(274): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:55:22.2387886Z 2025-05-07T19:55:22.2389178Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_weighted_fp8_codegen_cuda.cu(291): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:55:22.2390783Z 2025-05-07T19:55:22.2392084Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_weighted_fp8_codegen_cuda.cu(303): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:55:22.2393706Z 2025-05-07T19:55:22.2395119Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:55:22.2396740Z 2025-05-07T19:55:22.2398164Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:55:22.2399785Z 2025-05-07T19:55:22.2401108Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_weighted_fp8_codegen_cuda.cu(269): warning #186-D: pointless comparison of unsigned integer with zero 
2025-05-07T19:55:22.2402909Z 2025-05-07T19:55:22.2404172Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_weighted_fp8_codegen_cuda.cu(274): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:55:22.2405675Z 2025-05-07T19:55:22.2407010Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_weighted_fp8_codegen_cuda.cu(291): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:55:22.2408813Z 2025-05-07T19:55:22.2410326Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_weighted_fp8_codegen_cuda.cu(303): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:55:22.2411859Z 2025-05-07T19:55:22.2413284Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:55:22.2414889Z 2025-05-07T19:55:22.2416352Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:55:22.2417985Z 2025-05-07T19:55:22.2419263Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_weighted_fp8_codegen_cuda.cu(269): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:55:22.2420764Z 2025-05-07T19:55:22.2422119Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_weighted_fp8_codegen_cuda.cu(274): warning #186-D: pointless comparison of unsigned integer with zero 
2025-05-07T19:55:22.2423678Z 2025-05-07T19:55:22.2424988Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_weighted_fp8_codegen_cuda.cu(291): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:55:22.2426441Z 2025-05-07T19:55:22.2427717Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_weighted_fp8_codegen_cuda.cu(303): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:55:22.2429238Z 2025-05-07T19:55:22.6975292Z [242/608] /github/home/miniconda/envs/build_binary/bin/c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA 
-L/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/lib -fopenmp=libgomp -stdlib=libstdc++ -I/github/home/miniconda/envs/build_binary/include -O3 -DNDEBUG -std=c++17 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -mavx2 -mf16c -mfma -fopenmp -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_adagrad_split_cpu.cpp.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_adagrad_split_cpu.cpp.o.d -o CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_adagrad_split_cpu.cpp.o -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_backward_adagrad_split_cpu.cpp 2025-05-07T19:55:22.6994493Z clang-16: warning: argument unused during compilation: '-L/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/lib' [-Wunused-command-line-argument] 2025-05-07T19:55:22.8877423Z [243/608] /github/home/miniconda/envs/build_binary/bin/c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include 
-I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/lib -fopenmp=libgomp -stdlib=libstdc++ -I/github/home/miniconda/envs/build_binary/include -O3 -DNDEBUG -std=c++17 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -mavx2 -mf16c -mfma -fopenmp -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_sgd_split_cpu.cpp.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_sgd_split_cpu.cpp.o.d -o CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_sgd_split_cpu.cpp.o -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_backward_sgd_split_cpu.cpp 2025-05-07T19:55:22.8894491Z clang-16: warning: argument unused during compilation: '-L/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/lib' [-Wunused-command-line-argument] 2025-05-07T19:55:23.1361092Z [244/608] /github/home/miniconda/envs/build_binary/bin/c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include 
-I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/lib -fopenmp=libgomp -stdlib=libstdc++ -I/github/home/miniconda/envs/build_binary/include -O3 -DNDEBUG -std=c++17 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -mavx2 -mf16c -mfma -fopenmp -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_rowwise_adagrad_split_cpu.cpp.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_rowwise_adagrad_split_cpu.cpp.o.d -o CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_rowwise_adagrad_split_cpu.cpp.o -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_backward_rowwise_adagrad_split_cpu.cpp 2025-05-07T19:55:23.1378969Z clang-16: warning: argument unused during compilation: '-L/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/lib' [-Wunused-command-line-argument] 2025-05-07T19:55:24.8303109Z [245/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_inference_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include 
-I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++17 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_tbe_inference.dir/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_int4_codegen_cuda.cu.o -MF CMakeFiles/fbgemm_gpu_tbe_inference.dir/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_int4_codegen_cuda.cu.o.d -x cu -c 
/__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_int4_codegen_cuda.cu -o CMakeFiles/fbgemm_gpu_tbe_inference.dir/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_int4_codegen_cuda.cu.o 2025-05-07T19:55:24.8319772Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:55:24.8321127Z 2025-05-07T19:55:24.8322321Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:55:24.8323649Z 2025-05-07T19:55:24.8324776Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_int4_codegen_cuda.cu(254): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:55:24.8326373Z 2025-05-07T19:55:24.8327478Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_int4_codegen_cuda.cu(259): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:55:24.8331991Z 2025-05-07T19:55:24.8333387Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_int4_codegen_cuda.cu(276): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:55:24.8334893Z 2025-05-07T19:55:24.8336162Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_int4_codegen_cuda.cu(288): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:55:24.8337684Z 
2025-05-07T19:55:24.8339081Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:55:24.8340692Z 2025-05-07T19:55:24.8342102Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:55:24.8343800Z 2025-05-07T19:55:24.8345134Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_int4_codegen_cuda.cu(254): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:55:24.8346619Z 2025-05-07T19:55:24.8347959Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_int4_codegen_cuda.cu(259): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:55:24.8349485Z 2025-05-07T19:55:24.8350787Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_int4_codegen_cuda.cu(276): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:55:24.8352387Z 2025-05-07T19:55:24.8353916Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_int4_codegen_cuda.cu(288): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:55:24.8355402Z 2025-05-07T19:55:24.8356828Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:55:24.8358431Z 
2025-05-07T19:55:24.8359891Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:55:24.8361553Z 2025-05-07T19:55:24.8362978Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_int4_codegen_cuda.cu(254): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:55:24.8364517Z 2025-05-07T19:55:24.8365851Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_int4_codegen_cuda.cu(259): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:55:24.8367317Z 2025-05-07T19:55:24.8368534Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_int4_codegen_cuda.cu(276): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:55:24.8370202Z 2025-05-07T19:55:24.8371495Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_int4_codegen_cuda.cu(288): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:55:24.8372989Z 2025-05-07T19:55:24.8623527Z [246/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_forward_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include 
-I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++17 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_forward.dir/gen_embedding_forward_split_unweighted_v2_kernel.cu.o -MF CMakeFiles/fbgemm_gpu_tbe_training_forward.dir/gen_embedding_forward_split_unweighted_v2_kernel.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_forward_split_unweighted_v2_kernel.cu -o CMakeFiles/fbgemm_gpu_tbe_training_forward.dir/gen_embedding_forward_split_unweighted_v2_kernel.cu.o 2025-05-07T19:55:24.8642769Z 
/__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:55:24.8644453Z 2025-05-07T19:55:24.8645878Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:55:24.8647467Z 2025-05-07T19:55:24.8648638Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_forward_split_unweighted_v2_kernel.cu(873): warning #68-D: integer conversion resulted in a change of sign 2025-05-07T19:55:24.8652419Z detected during instantiation of "void split_embedding_codegen_forward_unweighted_v2_kernel(const emb_t *, const emb_t *, const cache_t *, const int32_t *, uint32_t, uint32_t, __nv_bool, uint32_t, fbgemm_gpu::FixedDivisor, const index_t *, const index_t *, const uint32_t *, const int64_t *, const int32_t *, output_t *) [with emb_t=float, cache_t=float, output_t=float, index_t=int32_t, USE_LXU_CACHE=false]" 2025-05-07T19:55:24.8655579Z (946): here 2025-05-07T19:55:24.8655767Z 2025-05-07T19:55:24.8656879Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_forward_split_unweighted_v2_kernel.cu(873): warning #68-D: integer conversion resulted in a change of sign 2025-05-07T19:55:24.8660860Z detected during instantiation of "void split_embedding_codegen_forward_unweighted_v2_kernel(const emb_t *, const emb_t *, const cache_t *, const int32_t *, uint32_t, uint32_t, __nv_bool, uint32_t, fbgemm_gpu::FixedDivisor, const index_t *, const index_t *, const uint32_t *, const int64_t *, const int32_t *, output_t *) [with emb_t=float, cache_t=c10::Half, output_t=float, index_t=int32_t, USE_LXU_CACHE=false]" 2025-05-07T19:55:24.8663694Z (996): here 
2025-05-07T19:55:24.8663904Z 2025-05-07T19:55:24.8664988Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_forward_split_unweighted_v2_kernel.cu(873): warning #68-D: integer conversion resulted in a change of sign 2025-05-07T19:55:24.8668808Z detected during instantiation of "void split_embedding_codegen_forward_unweighted_v2_kernel(const emb_t *, const emb_t *, const cache_t *, const int32_t *, uint32_t, uint32_t, __nv_bool, uint32_t, fbgemm_gpu::FixedDivisor, const index_t *, const index_t *, const uint32_t *, const int64_t *, const int32_t *, output_t *) [with emb_t=c10::Half, cache_t=float, output_t=float, index_t=int32_t, USE_LXU_CACHE=false]" 2025-05-07T19:55:24.8671586Z (1046): here 2025-05-07T19:55:24.8671783Z 2025-05-07T19:55:24.8673099Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_forward_split_unweighted_v2_kernel.cu(873): warning #68-D: integer conversion resulted in a change of sign 2025-05-07T19:55:24.8676731Z detected during instantiation of "void split_embedding_codegen_forward_unweighted_v2_kernel(const emb_t *, const emb_t *, const cache_t *, const int32_t *, uint32_t, uint32_t, __nv_bool, uint32_t, fbgemm_gpu::FixedDivisor, const index_t *, const index_t *, const uint32_t *, const int64_t *, const int32_t *, output_t *) [with emb_t=c10::Half, cache_t=c10::Half, output_t=float, index_t=int32_t, USE_LXU_CACHE=false]" 2025-05-07T19:55:24.8679366Z (1096): here 2025-05-07T19:55:24.8679577Z 2025-05-07T19:55:24.8680669Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_forward_split_unweighted_v2_kernel.cu(873): warning #68-D: integer conversion resulted in a change of sign 2025-05-07T19:55:24.8684417Z detected during instantiation of "void split_embedding_codegen_forward_unweighted_v2_kernel(const emb_t *, const emb_t *, const cache_t *, const int32_t *, uint32_t, uint32_t, __nv_bool, uint32_t, fbgemm_gpu::FixedDivisor, const index_t *, const 
index_t *, const uint32_t *, const int64_t *, const int32_t *, output_t *) [with emb_t=float, cache_t=float, output_t=float, index_t=int64_t, USE_LXU_CACHE=false]" 2025-05-07T19:55:24.8687195Z (1146): here 2025-05-07T19:55:24.8687400Z 2025-05-07T19:55:24.8688527Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_forward_split_unweighted_v2_kernel.cu(873): warning #68-D: integer conversion resulted in a change of sign 2025-05-07T19:55:24.8692310Z detected during instantiation of "void split_embedding_codegen_forward_unweighted_v2_kernel(const emb_t *, const emb_t *, const cache_t *, const int32_t *, uint32_t, uint32_t, __nv_bool, uint32_t, fbgemm_gpu::FixedDivisor, const index_t *, const index_t *, const uint32_t *, const int64_t *, const int32_t *, output_t *) [with emb_t=float, cache_t=c10::Half, output_t=float, index_t=int64_t, USE_LXU_CACHE=false]" 2025-05-07T19:55:24.8695027Z (1196): here 2025-05-07T19:55:24.8695241Z 2025-05-07T19:55:24.8696346Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_forward_split_unweighted_v2_kernel.cu(873): warning #68-D: integer conversion resulted in a change of sign 2025-05-07T19:55:24.8700541Z detected during instantiation of "void split_embedding_codegen_forward_unweighted_v2_kernel(const emb_t *, const emb_t *, const cache_t *, const int32_t *, uint32_t, uint32_t, __nv_bool, uint32_t, fbgemm_gpu::FixedDivisor, const index_t *, const index_t *, const uint32_t *, const int64_t *, const int32_t *, output_t *) [with emb_t=c10::Half, cache_t=float, output_t=float, index_t=int64_t, USE_LXU_CACHE=false]" 2025-05-07T19:55:24.8703601Z (1246): here 2025-05-07T19:55:24.8703796Z 2025-05-07T19:55:24.8704871Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_forward_split_unweighted_v2_kernel.cu(873): warning #68-D: integer conversion resulted in a change of sign 2025-05-07T19:55:24.8708659Z detected during instantiation of "void 
split_embedding_codegen_forward_unweighted_v2_kernel(const emb_t *, const emb_t *, const cache_t *, const int32_t *, uint32_t, uint32_t, __nv_bool, uint32_t, fbgemm_gpu::FixedDivisor, const index_t *, const index_t *, const uint32_t *, const int64_t *, const int32_t *, output_t *) [with emb_t=c10::Half, cache_t=c10::Half, output_t=float, index_t=int64_t, USE_LXU_CACHE=false]" 2025-05-07T19:55:24.8711440Z (1296): here 2025-05-07T19:55:24.8711629Z 2025-05-07T19:55:24.8712862Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_forward_split_unweighted_v2_kernel.cu(873): warning #68-D: integer conversion resulted in a change of sign 2025-05-07T19:55:24.8716706Z detected during instantiation of "void split_embedding_codegen_forward_unweighted_v2_kernel(const emb_t *, const emb_t *, const cache_t *, const int32_t *, uint32_t, uint32_t, __nv_bool, uint32_t, fbgemm_gpu::FixedDivisor, const index_t *, const index_t *, const uint32_t *, const int64_t *, const int32_t *, output_t *) [with emb_t=float, cache_t=float, output_t=c10::Half, index_t=int32_t, USE_LXU_CACHE=false]" 2025-05-07T19:55:24.8719446Z (1346): here 2025-05-07T19:55:24.8719631Z 2025-05-07T19:55:24.8720646Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_forward_split_unweighted_v2_kernel.cu(873): warning #68-D: integer conversion resulted in a change of sign 2025-05-07T19:55:24.8724245Z detected during instantiation of "void split_embedding_codegen_forward_unweighted_v2_kernel(const emb_t *, const emb_t *, const cache_t *, const int32_t *, uint32_t, uint32_t, __nv_bool, uint32_t, fbgemm_gpu::FixedDivisor, const index_t *, const index_t *, const uint32_t *, const int64_t *, const int32_t *, output_t *) [with emb_t=float, cache_t=c10::Half, output_t=c10::Half, index_t=int32_t, USE_LXU_CACHE=false]" 2025-05-07T19:55:24.8727040Z (1396): here 2025-05-07T19:55:24.8727242Z 2025-05-07T19:55:24.8728333Z 
/__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_forward_split_unweighted_v2_kernel.cu(873): warning #68-D: integer conversion resulted in a change of sign 2025-05-07T19:55:24.8732124Z detected during instantiation of "void split_embedding_codegen_forward_unweighted_v2_kernel(const emb_t *, const emb_t *, const cache_t *, const int32_t *, uint32_t, uint32_t, __nv_bool, uint32_t, fbgemm_gpu::FixedDivisor, const index_t *, const index_t *, const uint32_t *, const int64_t *, const int32_t *, output_t *) [with emb_t=c10::Half, cache_t=float, output_t=c10::Half, index_t=int32_t, USE_LXU_CACHE=false]" 2025-05-07T19:55:24.8734868Z (1446): here 2025-05-07T19:55:24.8735080Z 2025-05-07T19:55:24.8736192Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_forward_split_unweighted_v2_kernel.cu(873): warning #68-D: integer conversion resulted in a change of sign 2025-05-07T19:55:24.8740583Z detected during instantiation of "void split_embedding_codegen_forward_unweighted_v2_kernel(const emb_t *, const emb_t *, const cache_t *, const int32_t *, uint32_t, uint32_t, __nv_bool, uint32_t, fbgemm_gpu::FixedDivisor, const index_t *, const index_t *, const uint32_t *, const int64_t *, const int32_t *, output_t *) [with emb_t=c10::Half, cache_t=c10::Half, output_t=c10::Half, index_t=int32_t, USE_LXU_CACHE=false]" 2025-05-07T19:55:24.8743396Z (1496): here 2025-05-07T19:55:24.8743597Z 2025-05-07T19:55:24.8744685Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_forward_split_unweighted_v2_kernel.cu(873): warning #68-D: integer conversion resulted in a change of sign 2025-05-07T19:55:24.8748446Z detected during instantiation of "void split_embedding_codegen_forward_unweighted_v2_kernel(const emb_t *, const emb_t *, const cache_t *, const int32_t *, uint32_t, uint32_t, __nv_bool, uint32_t, fbgemm_gpu::FixedDivisor, const index_t *, const index_t *, const uint32_t *, const int64_t *, const 
int32_t *, output_t *) [with emb_t=float, cache_t=float, output_t=c10::Half, index_t=int64_t, USE_LXU_CACHE=false]" 2025-05-07T19:55:24.8751315Z (1546): here 2025-05-07T19:55:24.8751515Z 2025-05-07T19:55:24.8752671Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_forward_split_unweighted_v2_kernel.cu(873): warning #68-D: integer conversion resulted in a change of sign 2025-05-07T19:55:24.8756636Z detected during instantiation of "void split_embedding_codegen_forward_unweighted_v2_kernel(const emb_t *, const emb_t *, const cache_t *, const int32_t *, uint32_t, uint32_t, __nv_bool, uint32_t, fbgemm_gpu::FixedDivisor, const index_t *, const index_t *, const uint32_t *, const int64_t *, const int32_t *, output_t *) [with emb_t=float, cache_t=c10::Half, output_t=c10::Half, index_t=int64_t, USE_LXU_CACHE=false]" 2025-05-07T19:55:24.8759512Z (1596): here 2025-05-07T19:55:24.8759726Z 2025-05-07T19:55:24.8760866Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_forward_split_unweighted_v2_kernel.cu(873): warning #68-D: integer conversion resulted in a change of sign 2025-05-07T19:55:24.8764478Z detected during instantiation of "void split_embedding_codegen_forward_unweighted_v2_kernel(const emb_t *, const emb_t *, const cache_t *, const int32_t *, uint32_t, uint32_t, __nv_bool, uint32_t, fbgemm_gpu::FixedDivisor, const index_t *, const index_t *, const uint32_t *, const int64_t *, const int32_t *, output_t *) [with emb_t=c10::Half, cache_t=float, output_t=c10::Half, index_t=int64_t, USE_LXU_CACHE=false]" 2025-05-07T19:55:24.8767148Z (1646): here 2025-05-07T19:55:24.8767381Z 2025-05-07T19:55:24.8768485Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_forward_split_unweighted_v2_kernel.cu(873): warning #68-D: integer conversion resulted in a change of sign 2025-05-07T19:55:24.8772316Z detected during instantiation of "void 
split_embedding_codegen_forward_unweighted_v2_kernel(const emb_t *, const emb_t *, const cache_t *, const int32_t *, uint32_t, uint32_t, __nv_bool, uint32_t, fbgemm_gpu::FixedDivisor, const index_t *, const index_t *, const uint32_t *, const int64_t *, const int32_t *, output_t *) [with emb_t=c10::Half, cache_t=c10::Half, output_t=c10::Half, index_t=int64_t, USE_LXU_CACHE=false]" 2025-05-07T19:55:24.8775111Z (1696): here 2025-05-07T19:55:24.8775330Z 2025-05-07T19:55:24.8776425Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_forward_split_unweighted_v2_kernel.cu(873): warning #68-D: integer conversion resulted in a change of sign 2025-05-07T19:55:24.8780487Z detected during instantiation of "void split_embedding_codegen_forward_unweighted_v2_kernel(const emb_t *, const emb_t *, const cache_t *, const int32_t *, uint32_t, uint32_t, __nv_bool, uint32_t, fbgemm_gpu::FixedDivisor, const index_t *, const index_t *, const uint32_t *, const int64_t *, const int32_t *, output_t *) [with emb_t=float, cache_t=float, output_t=c10::BFloat16, index_t=int32_t, USE_LXU_CACHE=false]" 2025-05-07T19:55:24.8783440Z (1746): here 2025-05-07T19:55:24.8783639Z 2025-05-07T19:55:24.8784811Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_forward_split_unweighted_v2_kernel.cu(873): warning #68-D: integer conversion resulted in a change of sign 2025-05-07T19:55:24.8788586Z detected during instantiation of "void split_embedding_codegen_forward_unweighted_v2_kernel(const emb_t *, const emb_t *, const cache_t *, const int32_t *, uint32_t, uint32_t, __nv_bool, uint32_t, fbgemm_gpu::FixedDivisor, const index_t *, const index_t *, const uint32_t *, const int64_t *, const int32_t *, output_t *) [with emb_t=float, cache_t=c10::Half, output_t=c10::BFloat16, index_t=int32_t, USE_LXU_CACHE=false]" 2025-05-07T19:55:24.8791391Z (1796): here 2025-05-07T19:55:24.8791577Z 2025-05-07T19:55:24.8792715Z 
/__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_forward_split_unweighted_v2_kernel.cu(873): warning #68-D: integer conversion resulted in a change of sign 2025-05-07T19:55:24.8796678Z detected during instantiation of "void split_embedding_codegen_forward_unweighted_v2_kernel(const emb_t *, const emb_t *, const cache_t *, const int32_t *, uint32_t, uint32_t, __nv_bool, uint32_t, fbgemm_gpu::FixedDivisor, const index_t *, const index_t *, const uint32_t *, const int64_t *, const int32_t *, output_t *) [with emb_t=c10::Half, cache_t=float, output_t=c10::BFloat16, index_t=int32_t, USE_LXU_CACHE=false]" 2025-05-07T19:55:24.8799531Z (1846): here 2025-05-07T19:55:24.8799750Z 2025-05-07T19:55:24.8800896Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_forward_split_unweighted_v2_kernel.cu(873): warning #68-D: integer conversion resulted in a change of sign 2025-05-07T19:55:24.8804957Z detected during instantiation of "void split_embedding_codegen_forward_unweighted_v2_kernel(const emb_t *, const emb_t *, const cache_t *, const int32_t *, uint32_t, uint32_t, __nv_bool, uint32_t, fbgemm_gpu::FixedDivisor, const index_t *, const index_t *, const uint32_t *, const int64_t *, const int32_t *, output_t *) [with emb_t=c10::Half, cache_t=c10::Half, output_t=c10::BFloat16, index_t=int32_t, USE_LXU_CACHE=false]" 2025-05-07T19:55:24.8824050Z (1896): here 2025-05-07T19:55:24.8824264Z 2025-05-07T19:55:24.8825382Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_forward_split_unweighted_v2_kernel.cu(873): warning #68-D: integer conversion resulted in a change of sign 2025-05-07T19:55:24.8829243Z detected during instantiation of "void split_embedding_codegen_forward_unweighted_v2_kernel(const emb_t *, const emb_t *, const cache_t *, const int32_t *, uint32_t, uint32_t, __nv_bool, uint32_t, fbgemm_gpu::FixedDivisor, const index_t *, const index_t *, const uint32_t *, const int64_t *, 
const int32_t *, output_t *) [with emb_t=float, cache_t=float, output_t=c10::BFloat16, index_t=int64_t, USE_LXU_CACHE=false]" 2025-05-07T19:55:24.8832005Z (1946): here 2025-05-07T19:55:24.8832197Z 2025-05-07T19:55:24.8833510Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_forward_split_unweighted_v2_kernel.cu(873): warning #68-D: integer conversion resulted in a change of sign 2025-05-07T19:55:24.8837318Z detected during instantiation of "void split_embedding_codegen_forward_unweighted_v2_kernel(const emb_t *, const emb_t *, const cache_t *, const int32_t *, uint32_t, uint32_t, __nv_bool, uint32_t, fbgemm_gpu::FixedDivisor, const index_t *, const index_t *, const uint32_t *, const int64_t *, const int32_t *, output_t *) [with emb_t=float, cache_t=c10::Half, output_t=c10::BFloat16, index_t=int64_t, USE_LXU_CACHE=false]" 2025-05-07T19:55:24.8840598Z (1996): here 2025-05-07T19:55:24.8840790Z 2025-05-07T19:55:24.8842147Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_forward_split_unweighted_v2_kernel.cu(873): warning #68-D: integer conversion resulted in a change of sign 2025-05-07T19:55:24.8846015Z detected during instantiation of "void split_embedding_codegen_forward_unweighted_v2_kernel(const emb_t *, const emb_t *, const cache_t *, const int32_t *, uint32_t, uint32_t, __nv_bool, uint32_t, fbgemm_gpu::FixedDivisor, const index_t *, const index_t *, const uint32_t *, const int64_t *, const int32_t *, output_t *) [with emb_t=c10::Half, cache_t=float, output_t=c10::BFloat16, index_t=int64_t, USE_LXU_CACHE=false]" 2025-05-07T19:55:24.8848680Z (2046): here 2025-05-07T19:55:24.8848870Z 2025-05-07T19:55:24.8849885Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_forward_split_unweighted_v2_kernel.cu(873): warning #68-D: integer conversion resulted in a change of sign 2025-05-07T19:55:24.8853660Z detected during instantiation of "void 
split_embedding_codegen_forward_unweighted_v2_kernel(const emb_t *, const emb_t *, const cache_t *, const int32_t *, uint32_t, uint32_t, __nv_bool, uint32_t, fbgemm_gpu::FixedDivisor, const index_t *, const index_t *, const uint32_t *, const int64_t *, const int32_t *, output_t *) [with emb_t=c10::Half, cache_t=c10::Half, output_t=c10::BFloat16, index_t=int64_t, USE_LXU_CACHE=false]" 2025-05-07T19:55:24.8856453Z (2096): here 2025-05-07T19:55:24.8856638Z 2025-05-07T19:55:24.8858013Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:55:24.8859665Z 2025-05-07T19:55:24.8861107Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:55:24.8862699Z 2025-05-07T19:55:24.8863836Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_forward_split_unweighted_v2_kernel.cu(873): warning #68-D: integer conversion resulted in a change of sign 2025-05-07T19:55:24.8867575Z detected during instantiation of "void split_embedding_codegen_forward_unweighted_v2_kernel(const emb_t *, const emb_t *, const cache_t *, const int32_t *, uint32_t, uint32_t, __nv_bool, uint32_t, fbgemm_gpu::FixedDivisor, const index_t *, const index_t *, const uint32_t *, const int64_t *, const int32_t *, output_t *) [with emb_t=float, cache_t=float, output_t=float, index_t=int32_t, USE_LXU_CACHE=false]" 2025-05-07T19:55:24.8870440Z (946): here 2025-05-07T19:55:24.8870641Z 2025-05-07T19:55:24.8871757Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_forward_split_unweighted_v2_kernel.cu(873): warning #68-D: integer conversion resulted in a change of sign 
2025-05-07T19:55:24.8875658Z detected during instantiation of "void split_embedding_codegen_forward_unweighted_v2_kernel(const emb_t *, const emb_t *, const cache_t *, const int32_t *, uint32_t, uint32_t, __nv_bool, uint32_t, fbgemm_gpu::FixedDivisor, const index_t *, const index_t *, const uint32_t *, const int64_t *, const int32_t *, output_t *) [with emb_t=float, cache_t=c10::Half, output_t=float, index_t=int32_t, USE_LXU_CACHE=false]" 2025-05-07T19:55:24.8878647Z (996): here 2025-05-07T19:55:24.8878855Z 2025-05-07T19:55:24.8879944Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_forward_split_unweighted_v2_kernel.cu(873): warning #68-D: integer conversion resulted in a change of sign 2025-05-07T19:55:24.8883941Z detected during instantiation of "void split_embedding_codegen_forward_unweighted_v2_kernel(const emb_t *, const emb_t *, const cache_t *, const int32_t *, uint32_t, uint32_t, __nv_bool, uint32_t, fbgemm_gpu::FixedDivisor, const index_t *, const index_t *, const uint32_t *, const int64_t *, const int32_t *, output_t *) [with emb_t=c10::Half, cache_t=float, output_t=float, index_t=int32_t, USE_LXU_CACHE=false]" 2025-05-07T19:55:24.8886760Z (1046): here 2025-05-07T19:55:24.8886951Z 2025-05-07T19:55:24.8888041Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_forward_split_unweighted_v2_kernel.cu(873): warning #68-D: integer conversion resulted in a change of sign 2025-05-07T19:55:24.8891655Z detected during instantiation of "void split_embedding_codegen_forward_unweighted_v2_kernel(const emb_t *, const emb_t *, const cache_t *, const int32_t *, uint32_t, uint32_t, __nv_bool, uint32_t, fbgemm_gpu::FixedDivisor, const index_t *, const index_t *, const uint32_t *, const int64_t *, const int32_t *, output_t *) [with emb_t=c10::Half, cache_t=c10::Half, output_t=float, index_t=int32_t, USE_LXU_CACHE=false]" 2025-05-07T19:55:24.8894297Z (1096): here 2025-05-07T19:55:24.8894516Z 
2025-05-07T19:55:24.8895621Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_forward_split_unweighted_v2_kernel.cu(873): warning #68-D: integer conversion resulted in a change of sign 2025-05-07T19:55:24.8899395Z detected during instantiation of "void split_embedding_codegen_forward_unweighted_v2_kernel(const emb_t *, const emb_t *, const cache_t *, const int32_t *, uint32_t, uint32_t, __nv_bool, uint32_t, fbgemm_gpu::FixedDivisor, const index_t *, const index_t *, const uint32_t *, const int64_t *, const int32_t *, output_t *) [with emb_t=float, cache_t=float, output_t=float, index_t=int64_t, USE_LXU_CACHE=false]" 2025-05-07T19:55:24.8902446Z (1146): here 2025-05-07T19:55:24.8902682Z 2025-05-07T19:55:24.8903791Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_forward_split_unweighted_v2_kernel.cu(873): warning #68-D: integer conversion resulted in a change of sign 2025-05-07T19:55:24.8907609Z detected during instantiation of "void split_embedding_codegen_forward_unweighted_v2_kernel(const emb_t *, const emb_t *, const cache_t *, const int32_t *, uint32_t, uint32_t, __nv_bool, uint32_t, fbgemm_gpu::FixedDivisor, const index_t *, const index_t *, const uint32_t *, const int64_t *, const int32_t *, output_t *) [with emb_t=float, cache_t=c10::Half, output_t=float, index_t=int64_t, USE_LXU_CACHE=false]" 2025-05-07T19:55:24.8910377Z (1196): here 2025-05-07T19:55:24.8910574Z 2025-05-07T19:55:24.8911673Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_forward_split_unweighted_v2_kernel.cu(873): warning #68-D: integer conversion resulted in a change of sign 2025-05-07T19:55:24.8915715Z detected during instantiation of "void split_embedding_codegen_forward_unweighted_v2_kernel(const emb_t *, const emb_t *, const cache_t *, const int32_t *, uint32_t, uint32_t, __nv_bool, uint32_t, fbgemm_gpu::FixedDivisor, const index_t *, const index_t *, const uint32_t *, const int64_t 
*, const int32_t *, output_t *) [with emb_t=c10::Half, cache_t=float, output_t=float, index_t=int64_t, USE_LXU_CACHE=false]" 2025-05-07T19:55:24.8918572Z (1246): here 2025-05-07T19:55:24.8918766Z 2025-05-07T19:55:24.8919883Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_forward_split_unweighted_v2_kernel.cu(873): warning #68-D: integer conversion resulted in a change of sign 2025-05-07T19:55:24.8924209Z detected during instantiation of "void split_embedding_codegen_forward_unweighted_v2_kernel(const emb_t *, const emb_t *, const cache_t *, const int32_t *, uint32_t, uint32_t, __nv_bool, uint32_t, fbgemm_gpu::FixedDivisor, const index_t *, const index_t *, const uint32_t *, const int64_t *, const int32_t *, output_t *) [with emb_t=c10::Half, cache_t=c10::Half, output_t=float, index_t=int64_t, USE_LXU_CACHE=false]" 2025-05-07T19:55:24.8927114Z (1296): here 2025-05-07T19:55:24.8927308Z 2025-05-07T19:55:24.8928449Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_forward_split_unweighted_v2_kernel.cu(873): warning #68-D: integer conversion resulted in a change of sign 2025-05-07T19:55:24.8932384Z detected during instantiation of "void split_embedding_codegen_forward_unweighted_v2_kernel(const emb_t *, const emb_t *, const cache_t *, const int32_t *, uint32_t, uint32_t, __nv_bool, uint32_t, fbgemm_gpu::FixedDivisor, const index_t *, const index_t *, const uint32_t *, const int64_t *, const int32_t *, output_t *) [with emb_t=float, cache_t=float, output_t=c10::Half, index_t=int32_t, USE_LXU_CACHE=false]" 2025-05-07T19:55:24.8935026Z (1346): here 2025-05-07T19:55:24.8935253Z 2025-05-07T19:55:24.8936299Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_forward_split_unweighted_v2_kernel.cu(873): warning #68-D: integer conversion resulted in a change of sign 2025-05-07T19:55:24.8940112Z detected during instantiation of "void 
split_embedding_codegen_forward_unweighted_v2_kernel(const emb_t *, const emb_t *, const cache_t *, const int32_t *, uint32_t, uint32_t, __nv_bool, uint32_t, fbgemm_gpu::FixedDivisor, const index_t *, const index_t *, const uint32_t *, const int64_t *, const int32_t *, output_t *) [with emb_t=float, cache_t=c10::Half, output_t=c10::Half, index_t=int32_t, USE_LXU_CACHE=false]" 2025-05-07T19:55:24.8942859Z (1396): here 2025-05-07T19:55:24.8943048Z 2025-05-07T19:55:24.8944200Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_forward_split_unweighted_v2_kernel.cu(873): warning #68-D: integer conversion resulted in a change of sign 2025-05-07T19:55:24.8948006Z detected during instantiation of "void split_embedding_codegen_forward_unweighted_v2_kernel(const emb_t *, const emb_t *, const cache_t *, const int32_t *, uint32_t, uint32_t, __nv_bool, uint32_t, fbgemm_gpu::FixedDivisor, const index_t *, const index_t *, const uint32_t *, const int64_t *, const int32_t *, output_t *) [with emb_t=c10::Half, cache_t=float, output_t=c10::Half, index_t=int32_t, USE_LXU_CACHE=false]" 2025-05-07T19:55:24.8950841Z (1446): here 2025-05-07T19:55:24.8951051Z 2025-05-07T19:55:24.8952169Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_forward_split_unweighted_v2_kernel.cu(873): warning #68-D: integer conversion resulted in a change of sign 2025-05-07T19:55:24.8956257Z detected during instantiation of "void split_embedding_codegen_forward_unweighted_v2_kernel(const emb_t *, const emb_t *, const cache_t *, const int32_t *, uint32_t, uint32_t, __nv_bool, uint32_t, fbgemm_gpu::FixedDivisor, const index_t *, const index_t *, const uint32_t *, const int64_t *, const int32_t *, output_t *) [with emb_t=c10::Half, cache_t=c10::Half, output_t=c10::Half, index_t=int32_t, USE_LXU_CACHE=false]" 2025-05-07T19:55:24.8959100Z (1496): here 2025-05-07T19:55:24.8959303Z 2025-05-07T19:55:24.8960442Z 
/__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_forward_split_unweighted_v2_kernel.cu(873): warning #68-D: integer conversion resulted in a change of sign 2025-05-07T19:55:24.8964606Z detected during instantiation of "void split_embedding_codegen_forward_unweighted_v2_kernel(const emb_t *, const emb_t *, const cache_t *, const int32_t *, uint32_t, uint32_t, __nv_bool, uint32_t, fbgemm_gpu::FixedDivisor, const index_t *, const index_t *, const uint32_t *, const int64_t *, const int32_t *, output_t *) [with emb_t=float, cache_t=float, output_t=c10::Half, index_t=int64_t, USE_LXU_CACHE=false]" 2025-05-07T19:55:24.8967406Z (1546): here 2025-05-07T19:55:24.8967639Z 2025-05-07T19:55:24.8968747Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_forward_split_unweighted_v2_kernel.cu(873): warning #68-D: integer conversion resulted in a change of sign 2025-05-07T19:55:24.8972644Z detected during instantiation of "void split_embedding_codegen_forward_unweighted_v2_kernel(const emb_t *, const emb_t *, const cache_t *, const int32_t *, uint32_t, uint32_t, __nv_bool, uint32_t, fbgemm_gpu::FixedDivisor, const index_t *, const index_t *, const uint32_t *, const int64_t *, const int32_t *, output_t *) [with emb_t=float, cache_t=c10::Half, output_t=c10::Half, index_t=int64_t, USE_LXU_CACHE=false]" 2025-05-07T19:55:24.8975427Z (1596): here 2025-05-07T19:55:24.8975618Z 2025-05-07T19:55:24.8976659Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_forward_split_unweighted_v2_kernel.cu(873): warning #68-D: integer conversion resulted in a change of sign 2025-05-07T19:55:24.8980265Z detected during instantiation of "void split_embedding_codegen_forward_unweighted_v2_kernel(const emb_t *, const emb_t *, const cache_t *, const int32_t *, uint32_t, uint32_t, __nv_bool, uint32_t, fbgemm_gpu::FixedDivisor, const index_t *, const index_t *, const uint32_t *, const int64_t *, const int32_t *, 
output_t *) [with emb_t=c10::Half, cache_t=float, output_t=c10::Half, index_t=int64_t, USE_LXU_CACHE=false]" 2025-05-07T19:55:24.8983071Z (1646): here 2025-05-07T19:55:24.8983271Z 2025-05-07T19:55:24.8984341Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_forward_split_unweighted_v2_kernel.cu(873): warning #68-D: integer conversion resulted in a change of sign 2025-05-07T19:55:24.8988131Z detected during instantiation of "void split_embedding_codegen_forward_unweighted_v2_kernel(const emb_t *, const emb_t *, const cache_t *, const int32_t *, uint32_t, uint32_t, __nv_bool, uint32_t, fbgemm_gpu::FixedDivisor, const index_t *, const index_t *, const uint32_t *, const int64_t *, const int32_t *, output_t *) [with emb_t=c10::Half, cache_t=c10::Half, output_t=c10::Half, index_t=int64_t, USE_LXU_CACHE=false]" 2025-05-07T19:55:24.8990924Z (1696): here 2025-05-07T19:55:24.8991137Z 2025-05-07T19:55:24.8992253Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_forward_split_unweighted_v2_kernel.cu(873): warning #68-D: integer conversion resulted in a change of sign 2025-05-07T19:55:24.8996232Z detected during instantiation of "void split_embedding_codegen_forward_unweighted_v2_kernel(const emb_t *, const emb_t *, const cache_t *, const int32_t *, uint32_t, uint32_t, __nv_bool, uint32_t, fbgemm_gpu::FixedDivisor, const index_t *, const index_t *, const uint32_t *, const int64_t *, const int32_t *, output_t *) [with emb_t=float, cache_t=float, output_t=c10::BFloat16, index_t=int32_t, USE_LXU_CACHE=false]" 2025-05-07T19:55:24.8999126Z (1746): here 2025-05-07T19:55:24.8999339Z 2025-05-07T19:55:24.9000464Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_forward_split_unweighted_v2_kernel.cu(873): warning #68-D: integer conversion resulted in a change of sign 2025-05-07T19:55:24.9004522Z detected during instantiation of "void 
split_embedding_codegen_forward_unweighted_v2_kernel(const emb_t *, const emb_t *, const cache_t *, const int32_t *, uint32_t, uint32_t, __nv_bool, uint32_t, fbgemm_gpu::FixedDivisor, const index_t *, const index_t *, const uint32_t *, const int64_t *, const int32_t *, output_t *) [with emb_t=float, cache_t=c10::Half, output_t=c10::BFloat16, index_t=int32_t, USE_LXU_CACHE=false]" 2025-05-07T19:55:24.9007847Z (1796): here 2025-05-07T19:55:24.9008098Z 2025-05-07T19:55:24.9009212Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_forward_split_unweighted_v2_kernel.cu(873): warning #68-D: integer conversion resulted in a change of sign 2025-05-07T19:55:24.9013071Z detected during instantiation of "void split_embedding_codegen_forward_unweighted_v2_kernel(const emb_t *, const emb_t *, const cache_t *, const int32_t *, uint32_t, uint32_t, __nv_bool, uint32_t, fbgemm_gpu::FixedDivisor, const index_t *, const index_t *, const uint32_t *, const int64_t *, const int32_t *, output_t *) [with emb_t=c10::Half, cache_t=float, output_t=c10::BFloat16, index_t=int32_t, USE_LXU_CACHE=false]" 2025-05-07T19:55:24.9015987Z (1846): here 2025-05-07T19:55:24.9016184Z 2025-05-07T19:55:24.9017302Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_forward_split_unweighted_v2_kernel.cu(873): warning #68-D: integer conversion resulted in a change of sign 2025-05-07T19:55:24.9020935Z detected during instantiation of "void split_embedding_codegen_forward_unweighted_v2_kernel(const emb_t *, const emb_t *, const cache_t *, const int32_t *, uint32_t, uint32_t, __nv_bool, uint32_t, fbgemm_gpu::FixedDivisor, const index_t *, const index_t *, const uint32_t *, const int64_t *, const int32_t *, output_t *) [with emb_t=c10::Half, cache_t=c10::Half, output_t=c10::BFloat16, index_t=int32_t, USE_LXU_CACHE=false]" 2025-05-07T19:55:24.9023681Z (1896): here 2025-05-07T19:55:24.9023888Z 2025-05-07T19:55:24.9024970Z 
/__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_forward_split_unweighted_v2_kernel.cu(873): warning #68-D: integer conversion resulted in a change of sign 2025-05-07T19:55:24.9028781Z detected during instantiation of "void split_embedding_codegen_forward_unweighted_v2_kernel(const emb_t *, const emb_t *, const cache_t *, const int32_t *, uint32_t, uint32_t, __nv_bool, uint32_t, fbgemm_gpu::FixedDivisor, const index_t *, const index_t *, const uint32_t *, const int64_t *, const int32_t *, output_t *) [with emb_t=float, cache_t=float, output_t=c10::BFloat16, index_t=int64_t, USE_LXU_CACHE=false]" 2025-05-07T19:55:24.9031563Z (1946): here 2025-05-07T19:55:24.9031780Z 2025-05-07T19:55:24.9033041Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_forward_split_unweighted_v2_kernel.cu(873): warning #68-D: integer conversion resulted in a change of sign 2025-05-07T19:55:24.9036852Z detected during instantiation of "void split_embedding_codegen_forward_unweighted_v2_kernel(const emb_t *, const emb_t *, const cache_t *, const int32_t *, uint32_t, uint32_t, __nv_bool, uint32_t, fbgemm_gpu::FixedDivisor, const index_t *, const index_t *, const uint32_t *, const int64_t *, const int32_t *, output_t *) [with emb_t=float, cache_t=c10::Half, output_t=c10::BFloat16, index_t=int64_t, USE_LXU_CACHE=false]" 2025-05-07T19:55:24.9039671Z (1996): here 2025-05-07T19:55:24.9039892Z 2025-05-07T19:55:24.9041048Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_forward_split_unweighted_v2_kernel.cu(873): warning #68-D: integer conversion resulted in a change of sign 2025-05-07T19:55:24.9044865Z detected during instantiation of "void split_embedding_codegen_forward_unweighted_v2_kernel(const emb_t *, const emb_t *, const cache_t *, const int32_t *, uint32_t, uint32_t, __nv_bool, uint32_t, fbgemm_gpu::FixedDivisor, const index_t *, const index_t *, const uint32_t *, const int64_t *, const 
int32_t *, output_t *) [with emb_t=c10::Half, cache_t=float, output_t=c10::BFloat16, index_t=int64_t, USE_LXU_CACHE=false]" 2025-05-07T19:55:24.9047907Z (2046): here 2025-05-07T19:55:24.9048133Z 2025-05-07T19:55:24.9049408Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_forward_split_unweighted_v2_kernel.cu(873): warning #68-D: integer conversion resulted in a change of sign 2025-05-07T19:55:24.9053291Z detected during instantiation of "void split_embedding_codegen_forward_unweighted_v2_kernel(const emb_t *, const emb_t *, const cache_t *, const int32_t *, uint32_t, uint32_t, __nv_bool, uint32_t, fbgemm_gpu::FixedDivisor, const index_t *, const index_t *, const uint32_t *, const int64_t *, const int32_t *, output_t *) [with emb_t=c10::Half, cache_t=c10::Half, output_t=c10::BFloat16, index_t=int64_t, USE_LXU_CACHE=false]" 2025-05-07T19:55:24.9056156Z (2096): here 2025-05-07T19:55:24.9056361Z 2025-05-07T19:55:24.9057789Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:55:24.9059447Z 2025-05-07T19:55:24.9060932Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:55:24.9062526Z 2025-05-07T19:55:24.9063628Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_forward_split_unweighted_v2_kernel.cu(873): warning #68-D: integer conversion resulted in a change of sign 2025-05-07T19:55:24.9067144Z detected during instantiation of "void split_embedding_codegen_forward_unweighted_v2_kernel(const emb_t *, const emb_t *, const cache_t *, const int32_t *, uint32_t, uint32_t, bool, uint32_t, fbgemm_gpu::FixedDivisor, const index_t 
*, const index_t *, const uint32_t *, const int64_t *, const int32_t *, output_t *) [with emb_t=float, cache_t=float, output_t=float, index_t=int32_t, USE_LXU_CACHE=false]" 2025-05-07T19:55:24.9069805Z (946): here 2025-05-07T19:55:24.9070026Z 2025-05-07T19:55:24.9071164Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_forward_split_unweighted_v2_kernel.cu(873): warning #68-D: integer conversion resulted in a change of sign 2025-05-07T19:55:24.9075109Z detected during instantiation of "void split_embedding_codegen_forward_unweighted_v2_kernel(const emb_t *, const emb_t *, const cache_t *, const int32_t *, uint32_t, uint32_t, bool, uint32_t, fbgemm_gpu::FixedDivisor, const index_t *, const index_t *, const uint32_t *, const int64_t *, const int32_t *, output_t *) [with emb_t=float, cache_t=c10::Half, output_t=float, index_t=int32_t, USE_LXU_CACHE=false]" 2025-05-07T19:55:24.9077975Z (996): here 2025-05-07T19:55:24.9078167Z 2025-05-07T19:55:24.9079288Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_forward_split_unweighted_v2_kernel.cu(873): warning #68-D: integer conversion resulted in a change of sign 2025-05-07T19:55:24.9083071Z detected during instantiation of "void split_embedding_codegen_forward_unweighted_v2_kernel(const emb_t *, const emb_t *, const cache_t *, const int32_t *, uint32_t, uint32_t, bool, uint32_t, fbgemm_gpu::FixedDivisor, const index_t *, const index_t *, const uint32_t *, const int64_t *, const int32_t *, output_t *) [with emb_t=c10::Half, cache_t=float, output_t=float, index_t=int32_t, USE_LXU_CACHE=false]" 2025-05-07T19:55:24.9085919Z (1046): here 2025-05-07T19:55:24.9086353Z 2025-05-07T19:55:24.9087399Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_forward_split_unweighted_v2_kernel.cu(873): warning #68-D: integer conversion resulted in a change of sign 2025-05-07T19:55:24.9091301Z detected during instantiation of "void 
split_embedding_codegen_forward_unweighted_v2_kernel(const emb_t *, const emb_t *, const cache_t *, const int32_t *, uint32_t, uint32_t, bool, uint32_t, fbgemm_gpu::FixedDivisor, const index_t *, const index_t *, const uint32_t *, const int64_t *, const int32_t *, output_t *) [with emb_t=c10::Half, cache_t=c10::Half, output_t=float, index_t=int32_t, USE_LXU_CACHE=false]" 2025-05-07T19:55:24.9094163Z (1096): here 2025-05-07T19:55:24.9094355Z 2025-05-07T19:55:24.9095501Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_forward_split_unweighted_v2_kernel.cu(873): warning #68-D: integer conversion resulted in a change of sign 2025-05-07T19:55:24.9099258Z detected during instantiation of "void split_embedding_codegen_forward_unweighted_v2_kernel(const emb_t *, const emb_t *, const cache_t *, const int32_t *, uint32_t, uint32_t, bool, uint32_t, fbgemm_gpu::FixedDivisor, const index_t *, const index_t *, const uint32_t *, const int64_t *, const int32_t *, output_t *) [with emb_t=float, cache_t=float, output_t=float, index_t=int64_t, USE_LXU_CACHE=false]" 2025-05-07T19:55:24.9102326Z (1146): here 2025-05-07T19:55:24.9102552Z 2025-05-07T19:55:24.9103735Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_forward_split_unweighted_v2_kernel.cu(873): warning #68-D: integer conversion resulted in a change of sign 2025-05-07T19:55:24.9107496Z detected during instantiation of "void split_embedding_codegen_forward_unweighted_v2_kernel(const emb_t *, const emb_t *, const cache_t *, const int32_t *, uint32_t, uint32_t, bool, uint32_t, fbgemm_gpu::FixedDivisor, const index_t *, const index_t *, const uint32_t *, const int64_t *, const int32_t *, output_t *) [with emb_t=float, cache_t=c10::Half, output_t=float, index_t=int64_t, USE_LXU_CACHE=false]" 2025-05-07T19:55:24.9110022Z (1196): here 2025-05-07T19:55:24.9110226Z 2025-05-07T19:55:24.9111300Z 
/__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_forward_split_unweighted_v2_kernel.cu(873): warning #68-D: integer conversion resulted in a change of sign 2025-05-07T19:55:24.9115159Z detected during instantiation of "void split_embedding_codegen_forward_unweighted_v2_kernel(const emb_t *, const emb_t *, const cache_t *, const int32_t *, uint32_t, uint32_t, bool, uint32_t, fbgemm_gpu::FixedDivisor, const index_t *, const index_t *, const uint32_t *, const int64_t *, const int32_t *, output_t *) [with emb_t=c10::Half, cache_t=float, output_t=float, index_t=int64_t, USE_LXU_CACHE=false]" 2025-05-07T19:55:24.9117933Z (1246): here 2025-05-07T19:55:24.9118127Z 2025-05-07T19:55:24.9119212Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_forward_split_unweighted_v2_kernel.cu(873): warning #68-D: integer conversion resulted in a change of sign 2025-05-07T19:55:24.9123018Z detected during instantiation of "void split_embedding_codegen_forward_unweighted_v2_kernel(const emb_t *, const emb_t *, const cache_t *, const int32_t *, uint32_t, uint32_t, bool, uint32_t, fbgemm_gpu::FixedDivisor, const index_t *, const index_t *, const uint32_t *, const int64_t *, const int32_t *, output_t *) [with emb_t=c10::Half, cache_t=c10::Half, output_t=float, index_t=int64_t, USE_LXU_CACHE=false]" 2025-05-07T19:55:24.9125851Z (1296): here 2025-05-07T19:55:24.9126021Z 2025-05-07T19:55:24.9127115Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_forward_split_unweighted_v2_kernel.cu(873): warning #68-D: integer conversion resulted in a change of sign 2025-05-07T19:55:24.9131546Z detected during instantiation of "void split_embedding_codegen_forward_unweighted_v2_kernel(const emb_t *, const emb_t *, const cache_t *, const int32_t *, uint32_t, uint32_t, bool, uint32_t, fbgemm_gpu::FixedDivisor, const index_t *, const index_t *, const uint32_t *, const int64_t *, const int32_t *, output_t *) 
[with emb_t=float, cache_t=float, output_t=c10::Half, index_t=int32_t, USE_LXU_CACHE=false]" 2025-05-07T19:55:24.9134310Z (1346): here 2025-05-07T19:55:24.9134512Z 2025-05-07T19:55:24.9135623Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_forward_split_unweighted_v2_kernel.cu(873): warning #68-D: integer conversion resulted in a change of sign 2025-05-07T19:55:24.9139465Z detected during instantiation of "void split_embedding_codegen_forward_unweighted_v2_kernel(const emb_t *, const emb_t *, const cache_t *, const int32_t *, uint32_t, uint32_t, bool, uint32_t, fbgemm_gpu::FixedDivisor, const index_t *, const index_t *, const uint32_t *, const int64_t *, const int32_t *, output_t *) [with emb_t=float, cache_t=c10::Half, output_t=c10::Half, index_t=int32_t, USE_LXU_CACHE=false]" 2025-05-07T19:55:24.9142274Z (1396): here 2025-05-07T19:55:24.9142487Z 2025-05-07T19:55:24.9143598Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_forward_split_unweighted_v2_kernel.cu(873): warning #68-D: integer conversion resulted in a change of sign 2025-05-07T19:55:24.9147400Z detected during instantiation of "void split_embedding_codegen_forward_unweighted_v2_kernel(const emb_t *, const emb_t *, const cache_t *, const int32_t *, uint32_t, uint32_t, bool, uint32_t, fbgemm_gpu::FixedDivisor, const index_t *, const index_t *, const uint32_t *, const int64_t *, const int32_t *, output_t *) [with emb_t=c10::Half, cache_t=float, output_t=c10::Half, index_t=int32_t, USE_LXU_CACHE=false]" 2025-05-07T19:55:24.9150295Z (1446): here 2025-05-07T19:55:24.9150515Z 2025-05-07T19:55:24.9151581Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_forward_split_unweighted_v2_kernel.cu(873): warning #68-D: integer conversion resulted in a change of sign 2025-05-07T19:55:24.9155360Z detected during instantiation of "void split_embedding_codegen_forward_unweighted_v2_kernel(const emb_t *, const emb_t *, 
const cache_t *, const int32_t *, uint32_t, uint32_t, bool, uint32_t, fbgemm_gpu::FixedDivisor, const index_t *, const index_t *, const uint32_t *, const int64_t *, const int32_t *, output_t *) [with emb_t=c10::Half, cache_t=c10::Half, output_t=c10::Half, index_t=int32_t, USE_LXU_CACHE=false]" 2025-05-07T19:55:24.9157967Z (1496): here 2025-05-07T19:55:24.9158151Z 2025-05-07T19:55:24.9159259Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_forward_split_unweighted_v2_kernel.cu(873): warning #68-D: integer conversion resulted in a change of sign 2025-05-07T19:55:24.9163040Z detected during instantiation of "void split_embedding_codegen_forward_unweighted_v2_kernel(const emb_t *, const emb_t *, const cache_t *, const int32_t *, uint32_t, uint32_t, bool, uint32_t, fbgemm_gpu::FixedDivisor, const index_t *, const index_t *, const uint32_t *, const int64_t *, const int32_t *, output_t *) [with emb_t=float, cache_t=float, output_t=c10::Half, index_t=int64_t, USE_LXU_CACHE=false]" 2025-05-07T19:55:24.9165785Z (1546): here 2025-05-07T19:55:24.9165980Z 2025-05-07T19:55:24.9167122Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_forward_split_unweighted_v2_kernel.cu(873): warning #68-D: integer conversion resulted in a change of sign 2025-05-07T19:55:24.9171223Z detected during instantiation of "void split_embedding_codegen_forward_unweighted_v2_kernel(const emb_t *, const emb_t *, const cache_t *, const int32_t *, uint32_t, uint32_t, bool, uint32_t, fbgemm_gpu::FixedDivisor, const index_t *, const index_t *, const uint32_t *, const int64_t *, const int32_t *, output_t *) [with emb_t=float, cache_t=c10::Half, output_t=c10::Half, index_t=int64_t, USE_LXU_CACHE=false]" 2025-05-07T19:55:24.9174075Z (1596): here 2025-05-07T19:55:24.9174273Z 2025-05-07T19:55:24.9175379Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_forward_split_unweighted_v2_kernel.cu(873): warning #68-D: 
integer conversion resulted in a change of sign 2025-05-07T19:55:24.9179233Z detected during instantiation of "void split_embedding_codegen_forward_unweighted_v2_kernel(const emb_t *, const emb_t *, const cache_t *, const int32_t *, uint32_t, uint32_t, bool, uint32_t, fbgemm_gpu::FixedDivisor, const index_t *, const index_t *, const uint32_t *, const int64_t *, const int32_t *, output_t *) [with emb_t=c10::Half, cache_t=float, output_t=c10::Half, index_t=int64_t, USE_LXU_CACHE=false]" 2025-05-07T19:55:24.9181995Z (1646): here 2025-05-07T19:55:24.9182189Z 2025-05-07T19:55:24.9183290Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_forward_split_unweighted_v2_kernel.cu(873): warning #68-D: integer conversion resulted in a change of sign 2025-05-07T19:55:24.9187176Z detected during instantiation of "void split_embedding_codegen_forward_unweighted_v2_kernel(const emb_t *, const emb_t *, const cache_t *, const int32_t *, uint32_t, uint32_t, bool, uint32_t, fbgemm_gpu::FixedDivisor, const index_t *, const index_t *, const uint32_t *, const int64_t *, const int32_t *, output_t *) [with emb_t=c10::Half, cache_t=c10::Half, output_t=c10::Half, index_t=int64_t, USE_LXU_CACHE=false]" 2025-05-07T19:55:24.9189977Z (1696): here 2025-05-07T19:55:24.9190200Z 2025-05-07T19:55:24.9191310Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_forward_split_unweighted_v2_kernel.cu(873): warning #68-D: integer conversion resulted in a change of sign 2025-05-07T19:55:24.9195364Z detected during instantiation of "void split_embedding_codegen_forward_unweighted_v2_kernel(const emb_t *, const emb_t *, const cache_t *, const int32_t *, uint32_t, uint32_t, bool, uint32_t, fbgemm_gpu::FixedDivisor, const index_t *, const index_t *, const uint32_t *, const int64_t *, const int32_t *, output_t *) [with emb_t=float, cache_t=float, output_t=c10::BFloat16, index_t=int32_t, USE_LXU_CACHE=false]" 2025-05-07T19:55:24.9198137Z (1746): here 
2025-05-07T19:55:24.9198363Z 2025-05-07T19:55:24.9199489Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_forward_split_unweighted_v2_kernel.cu(873): warning #68-D: integer conversion resulted in a change of sign 2025-05-07T19:55:24.9203319Z detected during instantiation of "void split_embedding_codegen_forward_unweighted_v2_kernel(const emb_t *, const emb_t *, const cache_t *, const int32_t *, uint32_t, uint32_t, bool, uint32_t, fbgemm_gpu::FixedDivisor, const index_t *, const index_t *, const uint32_t *, const int64_t *, const int32_t *, output_t *) [with emb_t=float, cache_t=c10::Half, output_t=c10::BFloat16, index_t=int32_t, USE_LXU_CACHE=false]" 2025-05-07T19:55:24.9206094Z (1796): here 2025-05-07T19:55:24.9206289Z 2025-05-07T19:55:24.9207398Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_forward_split_unweighted_v2_kernel.cu(873): warning #68-D: integer conversion resulted in a change of sign 2025-05-07T19:55:24.9211136Z detected during instantiation of "void split_embedding_codegen_forward_unweighted_v2_kernel(const emb_t *, const emb_t *, const cache_t *, const int32_t *, uint32_t, uint32_t, bool, uint32_t, fbgemm_gpu::FixedDivisor, const index_t *, const index_t *, const uint32_t *, const int64_t *, const int32_t *, output_t *) [with emb_t=c10::Half, cache_t=float, output_t=c10::BFloat16, index_t=int32_t, USE_LXU_CACHE=false]" 2025-05-07T19:55:24.9214246Z (1846): here 2025-05-07T19:55:24.9214453Z 2025-05-07T19:55:24.9215762Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_forward_split_unweighted_v2_kernel.cu(873): warning #68-D: integer conversion resulted in a change of sign 2025-05-07T19:55:24.9219566Z detected during instantiation of "void split_embedding_codegen_forward_unweighted_v2_kernel(const emb_t *, const emb_t *, const cache_t *, const int32_t *, uint32_t, uint32_t, bool, uint32_t, fbgemm_gpu::FixedDivisor, const index_t *, const index_t 
*, const uint32_t *, const int64_t *, const int32_t *, output_t *) [with emb_t=c10::Half, cache_t=c10::Half, output_t=c10::BFloat16, index_t=int32_t, USE_LXU_CACHE=false]" 2025-05-07T19:55:24.9222434Z (1896): here 2025-05-07T19:55:24.9222641Z 2025-05-07T19:55:24.9223805Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_forward_split_unweighted_v2_kernel.cu(873): warning #68-D: integer conversion resulted in a change of sign 2025-05-07T19:55:24.9227588Z detected during instantiation of "void split_embedding_codegen_forward_unweighted_v2_kernel(const emb_t *, const emb_t *, const cache_t *, const int32_t *, uint32_t, uint32_t, bool, uint32_t, fbgemm_gpu::FixedDivisor, const index_t *, const index_t *, const uint32_t *, const int64_t *, const int32_t *, output_t *) [with emb_t=float, cache_t=float, output_t=c10::BFloat16, index_t=int64_t, USE_LXU_CACHE=false]" 2025-05-07T19:55:24.9230360Z (1946): here 2025-05-07T19:55:24.9230591Z 2025-05-07T19:55:24.9231792Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_forward_split_unweighted_v2_kernel.cu(873): warning #68-D: integer conversion resulted in a change of sign 2025-05-07T19:55:24.9235778Z detected during instantiation of "void split_embedding_codegen_forward_unweighted_v2_kernel(const emb_t *, const emb_t *, const cache_t *, const int32_t *, uint32_t, uint32_t, bool, uint32_t, fbgemm_gpu::FixedDivisor, const index_t *, const index_t *, const uint32_t *, const int64_t *, const int32_t *, output_t *) [with emb_t=float, cache_t=c10::Half, output_t=c10::BFloat16, index_t=int64_t, USE_LXU_CACHE=false]" 2025-05-07T19:55:24.9238593Z (1996): here 2025-05-07T19:55:24.9238789Z 2025-05-07T19:55:24.9239883Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_forward_split_unweighted_v2_kernel.cu(873): warning #68-D: integer conversion resulted in a change of sign 2025-05-07T19:55:24.9243788Z detected during instantiation of "void 
split_embedding_codegen_forward_unweighted_v2_kernel(const emb_t *, const emb_t *, const cache_t *, const int32_t *, uint32_t, uint32_t, bool, uint32_t, fbgemm_gpu::FixedDivisor, const index_t *, const index_t *, const uint32_t *, const int64_t *, const int32_t *, output_t *) [with emb_t=c10::Half, cache_t=float, output_t=c10::BFloat16, index_t=int64_t, USE_LXU_CACHE=false]" 2025-05-07T19:55:24.9246558Z (2046): here 2025-05-07T19:55:24.9246747Z 2025-05-07T19:55:24.9247753Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_forward_split_unweighted_v2_kernel.cu(873): warning #68-D: integer conversion resulted in a change of sign 2025-05-07T19:55:24.9251450Z detected during instantiation of "void split_embedding_codegen_forward_unweighted_v2_kernel(const emb_t *, const emb_t *, const cache_t *, const int32_t *, uint32_t, uint32_t, bool, uint32_t, fbgemm_gpu::FixedDivisor, const index_t *, const index_t *, const uint32_t *, const int64_t *, const int32_t *, output_t *) [with emb_t=c10::Half, cache_t=c10::Half, output_t=c10::BFloat16, index_t=int64_t, USE_LXU_CACHE=false]" 2025-05-07T19:55:24.9254471Z (2096): here 2025-05-07T19:55:24.9254660Z 2025-05-07T19:55:25.8815023Z [247/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_forward_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include 
-I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++17 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_forward.dir/gen_embedding_forward_split_unweighted_kernel.cu.o -MF CMakeFiles/fbgemm_gpu_tbe_training_forward.dir/gen_embedding_forward_split_unweighted_kernel.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_forward_split_unweighted_kernel.cu -o CMakeFiles/fbgemm_gpu_tbe_training_forward.dir/gen_embedding_forward_split_unweighted_kernel.cu.o 2025-05-07T19:55:25.8834747Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its 
first declaration 2025-05-07T19:55:25.8836526Z 2025-05-07T19:55:25.8838087Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:55:25.8839939Z 2025-05-07T19:55:25.8841444Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:55:25.8843200Z 2025-05-07T19:55:25.8844808Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:55:25.8846589Z 2025-05-07T19:55:25.8848203Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:55:25.8850334Z 2025-05-07T19:55:25.8851916Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:55:25.8853748Z 2025-05-07T19:55:26.9774400Z [248/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_inference_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include 
-I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++17 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_tbe_inference.dir/gen_embedding_forward_quantized_split_nbit_kernel_weighted_int4_codegen_cuda.cu.o -MF CMakeFiles/fbgemm_gpu_tbe_inference.dir/gen_embedding_forward_quantized_split_nbit_kernel_weighted_int4_codegen_cuda.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_weighted_int4_codegen_cuda.cu -o 
CMakeFiles/fbgemm_gpu_tbe_inference.dir/gen_embedding_forward_quantized_split_nbit_kernel_weighted_int4_codegen_cuda.cu.o 2025-05-07T19:55:26.9795725Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:55:26.9797780Z 2025-05-07T19:55:26.9799450Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:55:26.9801301Z 2025-05-07T19:55:26.9803044Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_weighted_int4_codegen_cuda.cu(270): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:55:26.9804736Z 2025-05-07T19:55:26.9806261Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_weighted_int4_codegen_cuda.cu(275): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:55:26.9808216Z 2025-05-07T19:55:26.9809715Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_weighted_int4_codegen_cuda.cu(292): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:55:26.9811437Z 2025-05-07T19:55:26.9813225Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_weighted_int4_codegen_cuda.cu(304): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:55:26.9815032Z 2025-05-07T19:55:26.9816754Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a 
function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:55:26.9818639Z 2025-05-07T19:55:26.9820322Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:55:26.9822229Z 2025-05-07T19:55:26.9823734Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_weighted_int4_codegen_cuda.cu(270): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:55:26.9825354Z 2025-05-07T19:55:26.9826589Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_weighted_int4_codegen_cuda.cu(275): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:55:26.9828008Z 2025-05-07T19:55:26.9829330Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_weighted_int4_codegen_cuda.cu(292): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:55:26.9830850Z 2025-05-07T19:55:26.9832303Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_weighted_int4_codegen_cuda.cu(304): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:55:26.9834136Z 2025-05-07T19:55:26.9835737Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:55:26.9837258Z 2025-05-07T19:55:26.9838677Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a 
function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:55:26.9840374Z 2025-05-07T19:55:26.9841708Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_weighted_int4_codegen_cuda.cu(270): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:55:26.9843322Z 2025-05-07T19:55:26.9844819Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_weighted_int4_codegen_cuda.cu(275): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:55:26.9846537Z 2025-05-07T19:55:26.9848157Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_weighted_int4_codegen_cuda.cu(292): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:55:26.9849858Z 2025-05-07T19:55:26.9851464Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_weighted_int4_codegen_cuda.cu(304): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:55:26.9853154Z 2025-05-07T19:55:29.1186236Z [249/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_forward_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include 
-I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++17 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_forward.dir/gen_embedding_forward_split_weighted_kernel.cu.o -MF CMakeFiles/fbgemm_gpu_tbe_training_forward.dir/gen_embedding_forward_split_weighted_kernel.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_forward_split_weighted_kernel.cu -o CMakeFiles/fbgemm_gpu_tbe_training_forward.dir/gen_embedding_forward_split_weighted_kernel.cu.o 2025-05-07T19:55:29.1207480Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first 
declaration 2025-05-07T19:55:29.1209390Z 2025-05-07T19:55:29.1211028Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:55:29.1212901Z 2025-05-07T19:55:29.1214533Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:55:29.1216399Z 2025-05-07T19:55:29.1218190Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:55:29.1220257Z 2025-05-07T19:55:29.1221963Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:55:29.1223925Z 2025-05-07T19:55:29.1225633Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:55:29.1227707Z 2025-05-07T19:55:35.3390912Z [250/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_forward_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include 
-I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++17 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_forward.dir/gen_embedding_forward_dense_unweighted_vbe_kernel.cu.o -MF CMakeFiles/fbgemm_gpu_tbe_training_forward.dir/gen_embedding_forward_dense_unweighted_vbe_kernel.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_forward_dense_unweighted_vbe_kernel.cu -o 
CMakeFiles/fbgemm_gpu_tbe_training_forward.dir/gen_embedding_forward_dense_unweighted_vbe_kernel.cu.o 2025-05-07T19:55:35.3411648Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:55:35.3413439Z 2025-05-07T19:55:35.3414986Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:55:35.3416815Z 2025-05-07T19:55:35.3418454Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:55:35.3419938Z 2025-05-07T19:55:35.3421229Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:55:35.3423067Z 2025-05-07T19:55:35.3424527Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:55:35.3426231Z 2025-05-07T19:55:35.3427795Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:55:35.3430012Z 2025-05-07T19:55:47.4776938Z [251/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL 
-DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++17 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT 
CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_split_grad_embedding_ops.cu.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_split_grad_embedding_ops.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_backward_split_grad_embedding_ops.cu -o CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_split_grad_embedding_ops.cu.o 2025-05-07T19:55:47.4798924Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:55:47.4800926Z 2025-05-07T19:55:47.4802810Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:55:47.4804682Z 2025-05-07T19:55:47.4806353Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:55:47.4808177Z 2025-05-07T19:55:47.4809740Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:55:47.4811446Z 2025-05-07T19:55:47.4813037Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:55:47.4815308Z 2025-05-07T19:55:47.4816860Z 
/__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:55:47.4818625Z 2025-05-07T19:56:23.1680675Z [252/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -Xcudafe 
--diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++17 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_adagrad_split_weighted_kernel_warp.cu.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_adagrad_split_weighted_kernel_warp.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_backward_adagrad_split_weighted_kernel_warp.cu -o CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_adagrad_split_weighted_kernel_warp.cu.o 2025-05-07T19:56:23.1702303Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:56:23.1704162Z 2025-05-07T19:56:23.1705730Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:56:23.1707542Z 2025-05-07T19:56:23.1709143Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:56:23.1710919Z 2025-05-07T19:56:23.1712505Z 
/__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:56:23.1714515Z 2025-05-07T19:56:23.1716042Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:56:23.1718214Z 2025-05-07T19:56:23.1720022Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:56:23.1721833Z 2025-05-07T19:56:24.6282946Z [253/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem 
/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++17 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_adagrad_split_unweighted_nobag_kernel_warp.cu.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_adagrad_split_unweighted_nobag_kernel_warp.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_backward_adagrad_split_unweighted_nobag_kernel_warp.cu -o CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_adagrad_split_unweighted_nobag_kernel_warp.cu.o 2025-05-07T19:56:24.6305285Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:56:24.6307167Z 2025-05-07T19:56:24.6308940Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 
2025-05-07T19:56:24.6310568Z 2025-05-07T19:56:24.6311974Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:56:24.6313672Z 2025-05-07T19:56:24.6315167Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:56:24.6317428Z 2025-05-07T19:56:24.6319332Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:56:24.6320825Z 2025-05-07T19:56:24.6322253Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:56:24.6324109Z 2025-05-07T19:56:24.6817717Z [254/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include 
-I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++17 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_adagrad_split_unweighted_kernel_warp.cu.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_adagrad_split_unweighted_kernel_warp.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_backward_adagrad_split_unweighted_kernel_warp.cu -o CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_adagrad_split_unweighted_kernel_warp.cu.o 2025-05-07T19:56:24.6839368Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a 
function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:56:24.6841221Z 2025-05-07T19:56:24.6842797Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:56:24.6844574Z 2025-05-07T19:56:24.6846145Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:56:24.6848424Z 2025-05-07T19:56:24.6849996Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:56:24.6851803Z 2025-05-07T19:56:24.6853679Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:56:24.6855431Z 2025-05-07T19:56:24.6857013Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:56:24.6858794Z 2025-05-07T19:56:26.1737670Z [255/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src 
-I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++17 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_rowwise_adagrad_ssd_unweighted_nobag_cuda.cu.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_rowwise_adagrad_ssd_unweighted_nobag_cuda.cu.o.d -x cu -c 
/__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_backward_rowwise_adagrad_ssd_unweighted_nobag_cuda.cu -o CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_rowwise_adagrad_ssd_unweighted_nobag_cuda.cu.o 2025-05-07T19:56:26.1761180Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:56:26.1763107Z 2025-05-07T19:56:26.1764785Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:56:26.1766658Z 2025-05-07T19:56:26.1768776Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:56:26.1770643Z 2025-05-07T19:56:26.1772601Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:56:26.1774545Z 2025-05-07T19:56:26.1776271Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:56:26.1778133Z 2025-05-07T19:56:26.1779845Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:56:26.1781768Z 
2025-05-07T19:56:26.4524183Z [256/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++17 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ 
-D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_rowwise_adagrad_ssd_weighted_kernel_warp.cu.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_rowwise_adagrad_ssd_weighted_kernel_warp.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_backward_rowwise_adagrad_ssd_weighted_kernel_warp.cu -o CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_rowwise_adagrad_ssd_weighted_kernel_warp.cu.o 2025-05-07T19:56:26.4536067Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:56:26.4537051Z 2025-05-07T19:56:26.4537964Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:56:26.4539218Z 2025-05-07T19:56:26.4540240Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:56:26.4541241Z 2025-05-07T19:56:26.4542177Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:56:26.4543162Z 2025-05-07T19:56:26.4544053Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 
2025-05-07T19:56:26.4545029Z 2025-05-07T19:56:26.4545932Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:56:26.4546913Z 2025-05-07T19:56:27.8333929Z [257/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_inference_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -Xcudafe 
--diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++17 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_tbe_inference.dir/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_nobag_int8_codegen_cuda.cu.o -MF CMakeFiles/fbgemm_gpu_tbe_inference.dir/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_nobag_int8_codegen_cuda.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_nobag_int8_codegen_cuda.cu -o CMakeFiles/fbgemm_gpu_tbe_inference.dir/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_nobag_int8_codegen_cuda.cu.o 2025-05-07T19:56:27.8355429Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:56:27.8357626Z 2025-05-07T19:56:27.8359346Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:56:27.8361071Z 2025-05-07T19:56:27.8362500Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_nobag_int8_codegen_cuda.cu(220): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:56:27.8364255Z 
2025-05-07T19:56:27.8365786Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_nobag_int8_codegen_cuda.cu(237): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:56:27.8367425Z 2025-05-07T19:56:27.8368711Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_nobag_int8_codegen_cuda.cu(247): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:56:27.8370386Z 2025-05-07T19:56:27.8372023Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:56:27.8373800Z 2025-05-07T19:56:27.8375365Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:56:27.8376846Z 2025-05-07T19:56:27.8378120Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_nobag_int8_codegen_cuda.cu(220): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:56:27.8379769Z 2025-05-07T19:56:27.8381075Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_nobag_int8_codegen_cuda.cu(237): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:56:27.8382611Z 2025-05-07T19:56:27.8383958Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_nobag_int8_codegen_cuda.cu(247): warning #186-D: pointless comparison of unsigned integer with zero 
2025-05-07T19:56:27.8385529Z 2025-05-07T19:56:27.8386932Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:56:27.8388605Z 2025-05-07T19:56:27.8390048Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:56:27.8391625Z 2025-05-07T19:56:27.8393172Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_nobag_int8_codegen_cuda.cu(220): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:56:27.8394694Z 2025-05-07T19:56:27.8396080Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_nobag_int8_codegen_cuda.cu(237): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:56:27.8397654Z 2025-05-07T19:56:27.8398947Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_nobag_int8_codegen_cuda.cu(247): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:56:27.8400894Z 2025-05-07T19:56:27.8575688Z [258/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include 
-I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++17 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_rowwise_adagrad_ssd_unweighted_nobag_kernel_warp.cu.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_rowwise_adagrad_ssd_unweighted_nobag_kernel_warp.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_backward_rowwise_adagrad_ssd_unweighted_nobag_kernel_warp.cu -o 
CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_rowwise_adagrad_ssd_unweighted_nobag_kernel_warp.cu.o 2025-05-07T19:56:27.8597389Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:56:27.8599024Z 2025-05-07T19:56:27.8600447Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:56:27.8602318Z 2025-05-07T19:56:27.8603818Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:56:27.8605339Z 2025-05-07T19:56:27.8606804Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:56:27.8608459Z 2025-05-07T19:56:27.8609888Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:56:27.8611831Z 2025-05-07T19:56:27.8613348Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:56:27.8615049Z 2025-05-07T19:56:28.4838445Z [259/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler 
-DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++17 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT 
CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_rowwise_adagrad_ssd_unweighted_kernel_warp.cu.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_rowwise_adagrad_ssd_unweighted_kernel_warp.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_backward_rowwise_adagrad_ssd_unweighted_kernel_warp.cu -o CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_rowwise_adagrad_ssd_unweighted_kernel_warp.cu.o 2025-05-07T19:56:28.4859021Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:56:28.4860619Z 2025-05-07T19:56:28.4862101Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:56:28.4863742Z 2025-05-07T19:56:28.4865124Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:56:28.4866736Z 2025-05-07T19:56:28.4868229Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:56:28.4870351Z 2025-05-07T19:56:28.4871808Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:56:28.4873720Z 2025-05-07T19:56:28.4875374Z 
/__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:56:28.4877015Z 2025-05-07T19:56:28.9171111Z [260/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -Xcudafe 
--diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++17 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_sgd_split_weighted_kernel_warp.cu.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_sgd_split_weighted_kernel_warp.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_backward_sgd_split_weighted_kernel_warp.cu -o CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_sgd_split_weighted_kernel_warp.cu.o 2025-05-07T19:56:28.9191086Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:56:28.9192997Z 2025-05-07T19:56:28.9194595Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:56:28.9196335Z 2025-05-07T19:56:28.9197672Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:56:28.9199210Z 2025-05-07T19:56:28.9200665Z 
/__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:56:28.9203105Z 2025-05-07T19:56:28.9206816Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:56:28.9208572Z 2025-05-07T19:56:28.9210074Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:56:28.9211757Z 2025-05-07T19:56:28.9327320Z [261/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem 
/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++17 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_rowwise_adagrad_ssd_weighted_cuda.cu.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_rowwise_adagrad_ssd_weighted_cuda.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_backward_rowwise_adagrad_ssd_weighted_cuda.cu -o CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_rowwise_adagrad_ssd_weighted_cuda.cu.o 2025-05-07T19:56:28.9347252Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:56:28.9348960Z 2025-05-07T19:56:28.9350494Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:56:28.9352178Z 
2025-05-07T19:56:28.9353745Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:56:28.9355950Z 2025-05-07T19:56:28.9357470Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:56:28.9359223Z 2025-05-07T19:56:28.9360964Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:56:28.9362703Z 2025-05-07T19:56:28.9364206Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:56:28.9365798Z 2025-05-07T19:56:30.7102836Z [262/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem 
/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++17 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_rowwise_adagrad_ssd_unweighted_cuda.cu.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_rowwise_adagrad_ssd_unweighted_cuda.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_backward_rowwise_adagrad_ssd_unweighted_cuda.cu -o CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_rowwise_adagrad_ssd_unweighted_cuda.cu.o 2025-05-07T19:56:30.7123726Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 
2025-05-07T19:56:30.7125364Z 2025-05-07T19:56:30.7126803Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:56:30.7129106Z 2025-05-07T19:56:30.7130655Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:56:30.7132377Z 2025-05-07T19:56:30.7134239Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:56:30.7136016Z 2025-05-07T19:56:30.7137569Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:56:30.7139189Z 2025-05-07T19:56:30.7140527Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:56:30.7142150Z 2025-05-07T19:56:30.9855786Z [263/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include 
-I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++17 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_adagrad_split_unweighted_nobag_cuda.cu.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_adagrad_split_unweighted_nobag_cuda.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_backward_adagrad_split_unweighted_nobag_cuda.cu -o 
CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_adagrad_split_unweighted_nobag_cuda.cu.o 2025-05-07T19:56:30.9876605Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:56:30.9878429Z 2025-05-07T19:56:30.9879991Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:56:30.9882297Z 2025-05-07T19:56:30.9885597Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:56:30.9887277Z 2025-05-07T19:56:30.9888752Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:56:30.9890455Z 2025-05-07T19:56:30.9891971Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:56:30.9893674Z 2025-05-07T19:56:30.9895123Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:56:30.9896865Z 2025-05-07T19:56:31.1741310Z [264/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO 
-DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++17 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT 
CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_adagrad_split_weighted_cuda.cu.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_adagrad_split_weighted_cuda.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_backward_adagrad_split_weighted_cuda.cu -o CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_adagrad_split_weighted_cuda.cu.o 2025-05-07T19:56:31.1761820Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:56:31.1764099Z 2025-05-07T19:56:31.1765581Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:56:31.1767214Z 2025-05-07T19:56:31.1768946Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:56:31.1770742Z 2025-05-07T19:56:31.1772219Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:56:31.1773890Z 2025-05-07T19:56:31.1775327Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:56:31.1777001Z 2025-05-07T19:56:31.1778485Z 
/__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:56:31.1780132Z 2025-05-07T19:56:31.9284479Z [265/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_forward_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -Xcudafe 
--diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++17 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_forward.dir/gen_embedding_forward_split_weighted_gwd_kernel.cu.o -MF CMakeFiles/fbgemm_gpu_tbe_training_forward.dir/gen_embedding_forward_split_weighted_gwd_kernel.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_forward_split_weighted_gwd_kernel.cu -o CMakeFiles/fbgemm_gpu_tbe_training_forward.dir/gen_embedding_forward_split_weighted_gwd_kernel.cu.o 2025-05-07T19:56:31.9301591Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:56:31.9303761Z 2025-05-07T19:56:31.9305197Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:56:31.9306526Z 2025-05-07T19:56:31.9307455Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_forward_split_weighted_gwd_kernel.cu(240): warning #177-D: variable "offset_idx" was declared but never referenced 2025-05-07T19:56:31.9308527Z 2025-05-07T19:56:31.9309780Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored 
on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:56:31.9311074Z 2025-05-07T19:56:31.9312195Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:56:31.9313587Z 2025-05-07T19:56:31.9314500Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_forward_split_weighted_gwd_kernel.cu(240): warning #177-D: variable "offset_idx" was declared but never referenced 2025-05-07T19:56:31.9315605Z 2025-05-07T19:56:31.9316804Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:56:31.9318128Z 2025-05-07T19:56:31.9319396Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:56:31.9320854Z 2025-05-07T19:56:35.1901589Z [266/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include 
-I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++17 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_adagrad_split_unweighted_cuda.cu.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_adagrad_split_unweighted_cuda.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_backward_adagrad_split_unweighted_cuda.cu -o CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_adagrad_split_unweighted_cuda.cu.o 2025-05-07T19:56:35.1923877Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that 
is explicitly defaulted on its first declaration 2025-05-07T19:56:35.1925647Z 2025-05-07T19:56:35.1927155Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:56:35.1929008Z 2025-05-07T19:56:35.1930643Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:56:35.1932310Z 2025-05-07T19:56:35.1933872Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:56:35.1935674Z 2025-05-07T19:56:35.1937245Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:56:35.1939017Z 2025-05-07T19:56:35.1940585Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:56:35.1942394Z 2025-05-07T19:56:37.0279393Z [267/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src 
-I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++17 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_rowwise_adagrad_split_unweighted_nobag_cuda.cu.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_rowwise_adagrad_split_unweighted_nobag_cuda.cu.o.d -x cu -c 
/__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_backward_rowwise_adagrad_split_unweighted_nobag_cuda.cu -o CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_rowwise_adagrad_split_unweighted_nobag_cuda.cu.o 2025-05-07T19:56:37.0302419Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:56:37.0304208Z 2025-05-07T19:56:37.0305829Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:56:37.0307640Z 2025-05-07T19:56:37.0309356Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:56:37.0311268Z 2025-05-07T19:56:37.0313069Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:56:37.0314984Z 2025-05-07T19:56:37.0316692Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:56:37.0318567Z 2025-05-07T19:56:37.0320171Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 
2025-05-07T19:56:37.0321826Z 2025-05-07T19:56:37.8148800Z [268/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++17 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr 
-D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_rowwise_adagrad_split_weighted_kernel_warp.cu.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_rowwise_adagrad_split_weighted_kernel_warp.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_backward_rowwise_adagrad_split_weighted_kernel_warp.cu -o CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_rowwise_adagrad_split_weighted_kernel_warp.cu.o 2025-05-07T19:56:37.8172444Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:56:37.8174400Z 2025-05-07T19:56:37.8176203Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:56:37.8178166Z 2025-05-07T19:56:37.8179862Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:56:37.8181725Z 2025-05-07T19:56:37.8183425Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:56:37.8185080Z 2025-05-07T19:56:37.8186587Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly 
defaulted on its first declaration 2025-05-07T19:56:37.8188370Z 2025-05-07T19:56:37.8190226Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:56:37.8191976Z 2025-05-07T19:56:38.5778366Z [269/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -Xcudafe 
--diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++17 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_rowwise_adagrad_split_weighted_cuda.cu.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_rowwise_adagrad_split_weighted_cuda.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_backward_rowwise_adagrad_split_weighted_cuda.cu -o CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_rowwise_adagrad_split_weighted_cuda.cu.o 2025-05-07T19:56:38.5801625Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:56:38.5803811Z 2025-05-07T19:56:38.5805505Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:56:38.5807425Z 2025-05-07T19:56:38.5809094Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:56:38.5811001Z 2025-05-07T19:56:38.5812692Z 
/__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:56:38.5814591Z 2025-05-07T19:56:38.5816270Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:56:38.5818147Z 2025-05-07T19:56:38.5819825Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:56:38.5821737Z 2025-05-07T19:56:38.7159621Z [270/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem 
/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++17 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_adagrad_split_weighted_kernel_cta.cu.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_adagrad_split_weighted_kernel_cta.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_backward_adagrad_split_weighted_kernel_cta.cu -o CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_adagrad_split_weighted_kernel_cta.cu.o 2025-05-07T19:56:38.7171473Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:56:38.7172479Z 2025-05-07T19:56:38.7173346Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:56:38.7174335Z 
2025-05-07T19:56:38.7175214Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:56:38.7176181Z 2025-05-07T19:56:38.7177050Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:56:38.7178036Z 2025-05-07T19:56:38.7178890Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:56:38.7179869Z 2025-05-07T19:56:38.7180743Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:56:38.7181720Z 2025-05-07T19:56:38.8775988Z [271/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem 
/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++17 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_rowwise_adagrad_split_unweighted_nobag_kernel_warp.cu.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_rowwise_adagrad_split_unweighted_nobag_kernel_warp.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_backward_rowwise_adagrad_split_unweighted_nobag_kernel_warp.cu -o CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_rowwise_adagrad_split_unweighted_nobag_kernel_warp.cu.o 2025-05-07T19:56:38.8799593Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly 
defaulted on its first declaration 2025-05-07T19:56:38.8801560Z 2025-05-07T19:56:38.8803544Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:56:38.8805459Z 2025-05-07T19:56:38.8806934Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:56:38.8808808Z 2025-05-07T19:56:38.8810517Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:56:38.8812466Z 2025-05-07T19:56:38.8814069Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:56:38.8815864Z 2025-05-07T19:56:38.8817395Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:56:38.8819183Z 2025-05-07T19:56:39.6544060Z [272/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include 
-I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++17 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_rowwise_adagrad_split_unweighted_kernel_warp.cu.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_rowwise_adagrad_split_unweighted_kernel_warp.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_backward_rowwise_adagrad_split_unweighted_kernel_warp.cu -o 
CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_rowwise_adagrad_split_unweighted_kernel_warp.cu.o 2025-05-07T19:56:39.6567916Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:56:39.6569823Z 2025-05-07T19:56:39.6571478Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:56:39.6573355Z 2025-05-07T19:56:39.6575028Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:56:39.6576899Z 2025-05-07T19:56:39.6578598Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:56:39.6580507Z 2025-05-07T19:56:39.6582136Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:56:39.6584058Z 2025-05-07T19:56:39.6585728Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:56:39.6587609Z 2025-05-07T19:56:40.1595160Z [273/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO 
-DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++17 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT 
CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_rowwise_adagrad_split_unweighted_cuda.cu.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_rowwise_adagrad_split_unweighted_cuda.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_backward_rowwise_adagrad_split_unweighted_cuda.cu -o CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_rowwise_adagrad_split_unweighted_cuda.cu.o 2025-05-07T19:56:40.1618559Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:56:40.1620464Z 2025-05-07T19:56:40.1622206Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:56:40.1624040Z 2025-05-07T19:56:40.1625422Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:56:40.1627092Z 2025-05-07T19:56:40.1628752Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:56:40.1630671Z 2025-05-07T19:56:40.1632344Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:56:40.1634323Z 2025-05-07T19:56:40.1635996Z 
/__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:56:40.1637888Z 2025-05-07T19:56:41.0163871Z [274/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -Xcudafe 
--diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++17 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_adagrad_split_unweighted_nobag_kernel_cta.cu.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_adagrad_split_unweighted_nobag_kernel_cta.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_backward_adagrad_split_unweighted_nobag_kernel_cta.cu -o CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_adagrad_split_unweighted_nobag_kernel_cta.cu.o 2025-05-07T19:56:41.0185767Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:56:41.0187603Z 2025-05-07T19:56:41.0189184Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:56:41.0190989Z 2025-05-07T19:56:41.0192592Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:56:41.0194515Z 2025-05-07T19:56:41.0196133Z 
/__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:56:41.0197969Z 2025-05-07T19:56:41.0199545Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:56:41.0201359Z 2025-05-07T19:56:41.0203197Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:56:41.0205047Z 2025-05-07T19:56:42.5455958Z [275/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem 
/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++17 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_adagrad_split_unweighted_kernel_cta.cu.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_adagrad_split_unweighted_kernel_cta.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_backward_adagrad_split_unweighted_kernel_cta.cu -o CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_adagrad_split_unweighted_kernel_cta.cu.o 2025-05-07T19:56:42.5476461Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:56:42.5478226Z 2025-05-07T19:56:42.5479814Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:56:42.5481621Z 
2025-05-07T19:56:42.5483234Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:56:42.5485017Z 2025-05-07T19:56:42.5486590Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:56:42.5488289Z 2025-05-07T19:56:42.5489744Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:56:42.5491439Z 2025-05-07T19:56:42.5492951Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:56:42.5496584Z 2025-05-07T19:56:43.5830075Z [276/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem 
/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++17 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_sgd_split_weighted_kernel_cta.cu.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_sgd_split_weighted_kernel_cta.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_backward_sgd_split_weighted_kernel_cta.cu -o CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_sgd_split_weighted_kernel_cta.cu.o 2025-05-07T19:56:43.5851047Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:56:43.5852888Z 
2025-05-07T19:56:43.5854532Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:56:43.5856399Z 2025-05-07T19:56:43.5857815Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:56:43.5859698Z 2025-05-07T19:56:43.5861422Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:56:43.5863303Z 2025-05-07T19:56:43.5864962Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:56:43.5867235Z 2025-05-07T19:56:43.5868911Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:56:43.5870820Z 2025-05-07T19:56:44.6281531Z [277/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include 
-I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++17 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_rowwise_adagrad_ssd_weighted_kernel_cta.cu.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_rowwise_adagrad_ssd_weighted_kernel_cta.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_backward_rowwise_adagrad_ssd_weighted_kernel_cta.cu -o 
CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_rowwise_adagrad_ssd_weighted_kernel_cta.cu.o 2025-05-07T19:56:44.6305125Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:56:44.6307112Z 2025-05-07T19:56:44.6308775Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:56:44.6310676Z 2025-05-07T19:56:44.6312318Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:56:44.6314266Z 2025-05-07T19:56:44.6315923Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:56:44.6317790Z 2025-05-07T19:56:44.6319853Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:56:44.6321708Z 2025-05-07T19:56:44.6323514Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:56:44.6325401Z 2025-05-07T19:56:46.3015973Z [278/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO 
-DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++17 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT 
CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_sgd_split_weighted_cuda.cu.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_sgd_split_weighted_cuda.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_backward_sgd_split_weighted_cuda.cu -o CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_sgd_split_weighted_cuda.cu.o 2025-05-07T19:56:46.3036894Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:56:46.3038800Z 2025-05-07T19:56:46.3040510Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:56:46.3042204Z 2025-05-07T19:56:46.3043688Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:56:46.3045450Z 2025-05-07T19:56:46.3047131Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:56:46.3049525Z 2025-05-07T19:56:46.3051148Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:56:46.3053031Z 2025-05-07T19:56:46.3054907Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): 
warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:56:46.3056758Z 2025-05-07T19:56:46.3598461Z [279/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda 
-O3 -DNDEBUG -std=c++17 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_rowwise_adagrad_ssd_unweighted_kernel_cta.cu.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_rowwise_adagrad_ssd_unweighted_kernel_cta.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_backward_rowwise_adagrad_ssd_unweighted_kernel_cta.cu -o CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_rowwise_adagrad_ssd_unweighted_kernel_cta.cu.o 2025-05-07T19:56:46.3619503Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:56:46.3621199Z 2025-05-07T19:56:46.3622868Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:56:46.3624733Z 2025-05-07T19:56:46.3626355Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:56:46.3628557Z 2025-05-07T19:56:46.3630112Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:56:46.3631798Z 2025-05-07T19:56:46.3633660Z 
/__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:56:46.3635416Z 2025-05-07T19:56:46.3637101Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:56:46.3638918Z 2025-05-07T19:56:46.4138563Z [280/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/include -DONNX_NAMESPACE=onnx_c2 -gencode 
arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++17 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_sgd_split_unweighted_nobag_cuda.cu.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_sgd_split_unweighted_nobag_cuda.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_backward_sgd_split_unweighted_nobag_cuda.cu -o CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_sgd_split_unweighted_nobag_cuda.cu.o 2025-05-07T19:56:46.4159217Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:56:46.4160735Z 2025-05-07T19:56:46.4162105Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:56:46.4163719Z 2025-05-07T19:56:46.4165251Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:56:46.4167509Z 
2025-05-07T19:56:46.4169290Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:56:46.4171038Z 2025-05-07T19:56:46.4172593Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:56:46.4174182Z 2025-05-07T19:56:46.4175731Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:56:46.4177483Z 2025-05-07T19:56:47.2995192Z [281/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include 
-isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++17 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_rowwise_adagrad_ssd_unweighted_nobag_kernel_cta.cu.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_rowwise_adagrad_ssd_unweighted_nobag_kernel_cta.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_backward_rowwise_adagrad_ssd_unweighted_nobag_kernel_cta.cu -o CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_rowwise_adagrad_ssd_unweighted_nobag_kernel_cta.cu.o 2025-05-07T19:56:47.3015932Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:56:47.3017762Z 2025-05-07T19:56:47.3019267Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on 
its first declaration 2025-05-07T19:56:47.3021469Z 2025-05-07T19:56:47.3023302Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:56:47.3025056Z 2025-05-07T19:56:47.3026573Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:56:47.3028356Z 2025-05-07T19:56:47.3029764Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:56:47.3031468Z 2025-05-07T19:56:47.3033145Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:56:47.3034898Z 2025-05-07T19:56:50.0399081Z [282/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include 
-I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++17 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_dense_indice_weights_codegen_cuda.cu.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_dense_indice_weights_codegen_cuda.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_backward_dense_indice_weights_codegen_cuda.cu -o CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_dense_indice_weights_codegen_cuda.cu.o 2025-05-07T19:56:50.0421711Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a 
function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:56:50.0424204Z 2025-05-07T19:56:50.0425851Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:56:50.0427752Z 2025-05-07T19:56:50.0429640Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:56:50.0431527Z 2025-05-07T19:56:50.0433291Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:56:50.0435093Z 2025-05-07T19:56:50.0436694Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:56:50.0438474Z 2025-05-07T19:56:50.0440075Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:56:50.0441963Z 2025-05-07T19:56:50.0547606Z [283/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_forward_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src 
-I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++17 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_forward.dir/gen_embedding_forward_split_unweighted_gwd_kernel.cu.o -MF CMakeFiles/fbgemm_gpu_tbe_training_forward.dir/gen_embedding_forward_split_unweighted_gwd_kernel.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_forward_split_unweighted_gwd_kernel.cu -o 
CMakeFiles/fbgemm_gpu_tbe_training_forward.dir/gen_embedding_forward_split_unweighted_gwd_kernel.cu.o 2025-05-07T19:56:50.0570225Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:56:50.0572544Z 2025-05-07T19:56:50.0574346Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:56:50.0576260Z 2025-05-07T19:56:50.0577659Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_forward_split_unweighted_gwd_kernel.cu(231): warning #177-D: variable "offset_idx" was declared but never referenced 2025-05-07T19:56:50.0579283Z 2025-05-07T19:56:50.0580999Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:56:50.0582935Z 2025-05-07T19:56:50.0584653Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:56:50.0586556Z 2025-05-07T19:56:50.0587891Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_forward_split_unweighted_gwd_kernel.cu(231): warning #177-D: variable "offset_idx" was declared but never referenced 2025-05-07T19:56:50.0589449Z 2025-05-07T19:56:50.0590968Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is 
explicitly defaulted on its first declaration 2025-05-07T19:56:50.0592942Z 2025-05-07T19:56:50.0594470Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:56:50.0596279Z 2025-05-07T19:56:50.3714463Z [284/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -Xcudafe 
--diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++17 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_ssd_indice_weights_codegen_cuda.cu.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_ssd_indice_weights_codegen_cuda.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_backward_ssd_indice_weights_codegen_cuda.cu -o CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_ssd_indice_weights_codegen_cuda.cu.o 2025-05-07T19:56:50.3726383Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:56:50.3727393Z 2025-05-07T19:56:50.3728271Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:56:50.3729279Z 2025-05-07T19:56:50.3730027Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_backward_ssd_indice_weights_codegen_cuda.cu(288): warning #177-D: variable "error_code" was declared but never referenced 2025-05-07T19:56:50.3730869Z 2025-05-07T19:56:50.3731633Z 
/__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_backward_ssd_indice_weights_codegen_cuda.cu(289): warning #177-D: variable "error_value" was declared but never referenced 2025-05-07T19:56:50.3732473Z 2025-05-07T19:56:50.3733210Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_backward_ssd_indice_weights_codegen_cuda.cu(136): warning #177-D: variable "error_code" was declared but never referenced 2025-05-07T19:56:50.3734070Z 2025-05-07T19:56:50.3734807Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_backward_ssd_indice_weights_codegen_cuda.cu(137): warning #177-D: variable "error_value" was declared but never referenced 2025-05-07T19:56:50.3735653Z 2025-05-07T19:56:50.3736414Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_backward_ssd_indice_weights_codegen_cuda.cu(774): warning #177-D: variable "error_code" was declared but never referenced 2025-05-07T19:56:50.3737243Z 2025-05-07T19:56:50.3737983Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_backward_ssd_indice_weights_codegen_cuda.cu(775): warning #177-D: variable "error_value" was declared but never referenced 2025-05-07T19:56:50.3738854Z 2025-05-07T19:56:50.3739585Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_backward_ssd_indice_weights_codegen_cuda.cu(627): warning #177-D: variable "error_code" was declared but never referenced 2025-05-07T19:56:50.3740457Z 2025-05-07T19:56:50.3741190Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_backward_ssd_indice_weights_codegen_cuda.cu(628): warning #177-D: variable "error_value" was declared but never referenced 2025-05-07T19:56:50.3742032Z 2025-05-07T19:56:50.3742929Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a 
function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:56:50.3743900Z 2025-05-07T19:56:50.3744768Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:56:50.3745772Z 2025-05-07T19:56:50.3746509Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_backward_ssd_indice_weights_codegen_cuda.cu(288): warning #177-D: variable "error_code" was declared but never referenced 2025-05-07T19:56:50.3747428Z 2025-05-07T19:56:50.3748179Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_backward_ssd_indice_weights_codegen_cuda.cu(289): warning #177-D: variable "error_value" was declared but never referenced 2025-05-07T19:56:50.3749023Z 2025-05-07T19:56:50.3749842Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_backward_ssd_indice_weights_codegen_cuda.cu(136): warning #177-D: variable "error_code" was declared but never referenced 2025-05-07T19:56:50.3750686Z 2025-05-07T19:56:50.3751425Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_backward_ssd_indice_weights_codegen_cuda.cu(137): warning #177-D: variable "error_value" was declared but never referenced 2025-05-07T19:56:50.3752296Z 2025-05-07T19:56:50.3753141Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_backward_ssd_indice_weights_codegen_cuda.cu(774): warning #177-D: variable "error_code" was declared but never referenced 2025-05-07T19:56:50.3754217Z 2025-05-07T19:56:50.3755380Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_backward_ssd_indice_weights_codegen_cuda.cu(775): warning #177-D: variable "error_value" was declared but never referenced 2025-05-07T19:56:50.3756697Z 
2025-05-07T19:56:50.3757838Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_backward_ssd_indice_weights_codegen_cuda.cu(627): warning #177-D: variable "error_code" was declared but never referenced 2025-05-07T19:56:50.3759263Z 2025-05-07T19:56:50.3760574Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_backward_ssd_indice_weights_codegen_cuda.cu(628): warning #177-D: variable "error_value" was declared but never referenced 2025-05-07T19:56:50.3762080Z 2025-05-07T19:56:50.3763692Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:56:50.3765581Z 2025-05-07T19:56:50.3767307Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:56:50.3769325Z 2025-05-07T19:56:56.7548582Z [285/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem 
/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++17 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_rowwise_adagrad_split_weighted_kernel_cta.cu.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_rowwise_adagrad_split_weighted_kernel_cta.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_backward_rowwise_adagrad_split_weighted_kernel_cta.cu -o CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_rowwise_adagrad_split_weighted_kernel_cta.cu.o 2025-05-07T19:56:56.7569507Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 
2025-05-07T19:56:56.7571236Z 2025-05-07T19:56:56.7572787Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:56:56.7574641Z 2025-05-07T19:56:56.7576049Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:56:56.7577541Z 2025-05-07T19:56:56.7578982Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:56:56.7580691Z 2025-05-07T19:56:56.7582204Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:56:56.7583912Z 2025-05-07T19:56:56.7585410Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:56:56.7587020Z 2025-05-07T19:56:57.1672194Z [286/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include 
-I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++17 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_rowwise_adagrad_split_unweighted_nobag_kernel_cta.cu.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_rowwise_adagrad_split_unweighted_nobag_kernel_cta.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_backward_rowwise_adagrad_split_unweighted_nobag_kernel_cta.cu 
-o CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_rowwise_adagrad_split_unweighted_nobag_kernel_cta.cu.o 2025-05-07T19:56:57.1696187Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:56:57.1698139Z 2025-05-07T19:56:57.1699786Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:56:57.1701694Z 2025-05-07T19:56:57.1703610Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:56:57.1705528Z 2025-05-07T19:56:57.1707198Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:56:57.1709094Z 2025-05-07T19:56:57.1710791Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:56:57.1712806Z 2025-05-07T19:56:57.1714434Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:56:57.1716332Z 2025-05-07T19:56:57.9013823Z [287/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler 
-DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++17 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT 
CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_rowwise_adagrad_split_unweighted_kernel_cta.cu.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_rowwise_adagrad_split_unweighted_kernel_cta.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_backward_rowwise_adagrad_split_unweighted_kernel_cta.cu -o CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_rowwise_adagrad_split_unweighted_kernel_cta.cu.o 2025-05-07T19:56:57.9036050Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:56:57.9037950Z 2025-05-07T19:56:57.9039499Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:56:57.9041399Z 2025-05-07T19:56:57.9042977Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:56:57.9044784Z 2025-05-07T19:56:57.9046491Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:56:57.9048338Z 2025-05-07T19:56:57.9049894Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:56:57.9051650Z 2025-05-07T19:56:57.9053211Z 
/__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:56:57.9055053Z 2025-05-07T19:56:58.4886676Z [288/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_inference_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -Xcudafe 
--diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++17 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_tbe_inference.dir/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_int8_codegen_cuda.cu.o -MF CMakeFiles/fbgemm_gpu_tbe_inference.dir/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_int8_codegen_cuda.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_int8_codegen_cuda.cu -o CMakeFiles/fbgemm_gpu_tbe_inference.dir/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_int8_codegen_cuda.cu.o 2025-05-07T19:56:58.4907741Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:56:58.4909397Z 2025-05-07T19:56:58.4910856Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:56:58.4912272Z 2025-05-07T19:56:58.4913694Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_int8_codegen_cuda.cu(254): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:56:58.4915086Z 2025-05-07T19:56:58.4916315Z 
/__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_int8_codegen_cuda.cu(259): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:56:58.4918012Z 2025-05-07T19:56:58.4919312Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_int8_codegen_cuda.cu(276): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:56:58.4920932Z 2025-05-07T19:56:58.4922403Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_int8_codegen_cuda.cu(288): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:56:58.4923943Z 2025-05-07T19:56:58.4925454Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:56:58.4927198Z 2025-05-07T19:56:58.4928570Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:56:58.4930182Z 2025-05-07T19:56:58.4931626Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_int8_codegen_cuda.cu(254): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:56:58.4933620Z 2025-05-07T19:56:58.4935371Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_int8_codegen_cuda.cu(259): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:56:58.4937030Z 2025-05-07T19:56:58.4938499Z 
/__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_int8_codegen_cuda.cu(276): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:56:58.4940204Z 2025-05-07T19:56:58.4941702Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_int8_codegen_cuda.cu(288): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:56:58.4943420Z 2025-05-07T19:56:58.4944870Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:56:58.4946367Z 2025-05-07T19:56:58.4947869Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:56:58.4949576Z 2025-05-07T19:56:58.4951087Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_int8_codegen_cuda.cu(254): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:56:58.4953136Z 2025-05-07T19:56:58.4954727Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_int8_codegen_cuda.cu(259): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:56:58.4956582Z 2025-05-07T19:56:58.4958181Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_int8_codegen_cuda.cu(276): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:56:58.4960019Z 2025-05-07T19:56:58.4961636Z 
/__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_int8_codegen_cuda.cu(288): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:56:58.4963449Z 2025-05-07T19:57:03.5540402Z [289/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_inference_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -Xcudafe 
--diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++17 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_tbe_inference.dir/gen_embedding_forward_quantized_split_nbit_kernel_weighted_int8_codegen_cuda.cu.o -MF CMakeFiles/fbgemm_gpu_tbe_inference.dir/gen_embedding_forward_quantized_split_nbit_kernel_weighted_int8_codegen_cuda.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_weighted_int8_codegen_cuda.cu -o CMakeFiles/fbgemm_gpu_tbe_inference.dir/gen_embedding_forward_quantized_split_nbit_kernel_weighted_int8_codegen_cuda.cu.o 2025-05-07T19:57:03.5563013Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:57:03.5564841Z 2025-05-07T19:57:03.5566473Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:57:03.5568085Z 2025-05-07T19:57:03.5569492Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_weighted_int8_codegen_cuda.cu(270): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:57:03.5571184Z 2025-05-07T19:57:03.5572668Z 
/__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_weighted_int8_codegen_cuda.cu(275): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:57:03.5574383Z 2025-05-07T19:57:03.5575973Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_weighted_int8_codegen_cuda.cu(292): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:57:03.5577715Z 2025-05-07T19:57:03.5579246Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_weighted_int8_codegen_cuda.cu(304): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:57:03.5581015Z 2025-05-07T19:57:03.5582657Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:57:03.5584592Z 2025-05-07T19:57:03.5586154Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:57:03.5587799Z 2025-05-07T19:57:03.5589196Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_weighted_int8_codegen_cuda.cu(270): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:57:03.5590878Z 2025-05-07T19:57:03.5592340Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_weighted_int8_codegen_cuda.cu(275): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:57:03.5594510Z 2025-05-07T19:57:03.5595954Z 
/__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_weighted_int8_codegen_cuda.cu(292): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:57:03.5597548Z 2025-05-07T19:57:03.5599223Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_weighted_int8_codegen_cuda.cu(304): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:57:03.5600882Z 2025-05-07T19:57:03.5602835Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:57:03.5604778Z 2025-05-07T19:57:03.5606438Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:57:03.5608326Z 2025-05-07T19:57:03.5609747Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_weighted_int8_codegen_cuda.cu(270): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:57:03.5611417Z 2025-05-07T19:57:03.5612962Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_weighted_int8_codegen_cuda.cu(275): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:57:03.5614718Z 2025-05-07T19:57:03.5616247Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_weighted_int8_codegen_cuda.cu(292): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:57:03.5617947Z 2025-05-07T19:57:03.5619169Z 
/__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_weighted_int8_codegen_cuda.cu(304): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:57:03.5620677Z 2025-05-07T19:57:04.4267276Z [290/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -Xcudafe 
--diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++17 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_sgd_split_unweighted_nobag_kernel_cta.cu.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_sgd_split_unweighted_nobag_kernel_cta.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_backward_sgd_split_unweighted_nobag_kernel_cta.cu -o CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_sgd_split_unweighted_nobag_kernel_cta.cu.o 2025-05-07T19:57:04.4290173Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:57:04.4292106Z 2025-05-07T19:57:04.4293872Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:57:04.4295799Z 2025-05-07T19:57:04.4297214Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:57:04.4298974Z 2025-05-07T19:57:04.4300409Z 
/__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:57:04.4302328Z 2025-05-07T19:57:04.4303919Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:57:04.4305471Z 2025-05-07T19:57:04.4306909Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:57:04.4308613Z 2025-05-07T19:57:04.5553123Z [291/608] : && /github/home/miniconda/envs/build_binary/bin/c++ -fPIC -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/lib -fopenmp=libgomp -stdlib=libstdc++ -I/github/home/miniconda/envs/build_binary/include -O3 -DNDEBUG -Wl,-O2 -Wl,--sort-common -Wl,--as-needed -Wl,-z,relro -Wl,-z,now -Wl,--disable-new-dtags -Wl,--gc-sections -Wl,--allow-shlib-undefined -Wl,-rpath,/github/home/miniconda/envs/build_binary/lib -Wl,-rpath-link,/github/home/miniconda/envs/build_binary/lib -L/github/home/miniconda/envs/build_binary/lib -s -shared -Wl,-soname,fbgemm_gpu_tbe_inference.so -o fbgemm_gpu_tbe_inference.so CMakeFiles/fbgemm_gpu_tbe_inference.dir/codegen/inference/embedding_forward_quantized_host_cpu.cpp.o CMakeFiles/fbgemm_gpu_tbe_inference.dir/gen_embedding_forward_quantized_unweighted_codegen_cpu.cpp.o CMakeFiles/fbgemm_gpu_tbe_inference.dir/gen_embedding_forward_quantized_weighted_codegen_cpu.cpp.o CMakeFiles/fbgemm_gpu_tbe_inference.dir/codegen/inference/embedding_forward_quantized_host.cpp.o 
CMakeFiles/fbgemm_gpu_tbe_inference.dir/codegen/inference/embedding_forward_quantized_split_lookup.cu.o CMakeFiles/fbgemm_gpu_tbe_inference.dir/gen_embedding_forward_quantized_split_nbit_kernel_weighted_fp32_codegen_cuda.cu.o CMakeFiles/fbgemm_gpu_tbe_inference.dir/gen_embedding_forward_quantized_split_nbit_kernel_weighted_fp16_codegen_cuda.cu.o CMakeFiles/fbgemm_gpu_tbe_inference.dir/gen_embedding_forward_quantized_split_nbit_kernel_weighted_fp8_codegen_cuda.cu.o CMakeFiles/fbgemm_gpu_tbe_inference.dir/gen_embedding_forward_quantized_split_nbit_kernel_weighted_int8_codegen_cuda.cu.o CMakeFiles/fbgemm_gpu_tbe_inference.dir/gen_embedding_forward_quantized_split_nbit_kernel_weighted_int4_codegen_cuda.cu.o CMakeFiles/fbgemm_gpu_tbe_inference.dir/gen_embedding_forward_quantized_split_nbit_kernel_weighted_int2_codegen_cuda.cu.o CMakeFiles/fbgemm_gpu_tbe_inference.dir/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_nobag_fp32_codegen_cuda.cu.o CMakeFiles/fbgemm_gpu_tbe_inference.dir/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_nobag_fp16_codegen_cuda.cu.o CMakeFiles/fbgemm_gpu_tbe_inference.dir/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_nobag_fp8_codegen_cuda.cu.o CMakeFiles/fbgemm_gpu_tbe_inference.dir/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_nobag_int8_codegen_cuda.cu.o CMakeFiles/fbgemm_gpu_tbe_inference.dir/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_nobag_int4_codegen_cuda.cu.o CMakeFiles/fbgemm_gpu_tbe_inference.dir/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_nobag_int2_codegen_cuda.cu.o CMakeFiles/fbgemm_gpu_tbe_inference.dir/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_fp32_codegen_cuda.cu.o CMakeFiles/fbgemm_gpu_tbe_inference.dir/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_fp16_codegen_cuda.cu.o CMakeFiles/fbgemm_gpu_tbe_inference.dir/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_fp8_codegen_cuda.cu.o 
CMakeFiles/fbgemm_gpu_tbe_inference.dir/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_int8_codegen_cuda.cu.o CMakeFiles/fbgemm_gpu_tbe_inference.dir/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_int4_codegen_cuda.cu.o CMakeFiles/fbgemm_gpu_tbe_inference.dir/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_int2_codegen_cuda.cu.o CMakeFiles/fbgemm_gpu_tbe_inference.dir/gen_embedding_forward_quantized_split_nbit_host_weighted_codegen_cuda.cu.o CMakeFiles/fbgemm_gpu_tbe_inference.dir/gen_embedding_forward_quantized_split_nbit_host_unweighted_nobag_codegen_cuda.cu.o CMakeFiles/fbgemm_gpu_tbe_inference.dir/gen_embedding_forward_quantized_split_nbit_host_unweighted_codegen_cuda.cu.o -L/lib/intel64 -L/lib/intel64_win -L/lib/win-x64 -L/github/home/miniconda/envs/build_binary/lib/stubs -Wl,-rpath,"\$ORIGIN" /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/lib/libc10.so /github/home/miniconda/envs/build_binary/lib/libnvrtc.so /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/lib/libc10_cuda.so /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/lib/libnccl.so.2 /github/home/miniconda/envs/build_binary/lib/stubs/libcuda.so fbgemm.so fbgemm_gpu_tbe_cache.so fbgemm_gpu_config.so /github/home/miniconda/envs/build_binary/lib/stubs/libnvidia-ml.so -Wl,--no-as-needed,"/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/lib/libtorch.so" -Wl,--as-needed asmjit.so /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/lib/libtorch.so -Wl,--no-as-needed,"/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/lib/libtorch_cpu.so" -Wl,--as-needed -Wl,--no-as-needed,"/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/lib/libtorch_cuda.so" -Wl,--as-needed /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/lib/libc10_cuda.so 
/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/lib/libc10.so /github/home/miniconda/envs/build_binary/lib/libcudart.so -lcudadevrt -lcudart_static -ldl && : 2025-05-07T19:57:07.7473296Z [292/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr 
--expt-extended-lambda -O3 -DNDEBUG -std=c++17 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_split_indice_weights_codegen_cuda.cu.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_split_indice_weights_codegen_cuda.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_backward_split_indice_weights_codegen_cuda.cu -o CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_split_indice_weights_codegen_cuda.cu.o 2025-05-07T19:57:07.7495876Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:57:07.7497839Z 2025-05-07T19:57:07.7499525Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:57:07.7501306Z 2025-05-07T19:57:07.7503255Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:57:07.7505128Z 2025-05-07T19:57:07.7506757Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:57:07.7508622Z 2025-05-07T19:57:07.7510248Z 
/__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:57:07.7512091Z 2025-05-07T19:57:07.7513825Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:57:07.7515622Z 2025-05-07T19:57:12.6949212Z [293/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/include -DONNX_NAMESPACE=onnx_c2 -gencode 
arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++17 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_adam_split_weighted_kernel_warp.cu.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_adam_split_weighted_kernel_warp.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_backward_adam_split_weighted_kernel_warp.cu -o CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_adam_split_weighted_kernel_warp.cu.o 2025-05-07T19:57:12.6972385Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:57:12.6974258Z 2025-05-07T19:57:12.6975943Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:57:12.6977795Z 2025-05-07T19:57:12.6979468Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:57:12.6981361Z 
2025-05-07T19:57:12.6983016Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:57:12.6984917Z 2025-05-07T19:57:12.6986593Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:57:12.6988503Z 2025-05-07T19:57:12.6990184Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:57:12.6992044Z 2025-05-07T19:57:14.1051357Z [294/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include 
-isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++17 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_adam_split_unweighted_nobag_kernel_warp.cu.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_adam_split_unweighted_nobag_kernel_warp.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_backward_adam_split_unweighted_nobag_kernel_warp.cu -o CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_adam_split_unweighted_nobag_kernel_warp.cu.o 2025-05-07T19:57:14.1077314Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:57:14.1079198Z 2025-05-07T19:57:14.1080837Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 
2025-05-07T19:57:14.1082702Z 2025-05-07T19:57:14.1084372Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:57:14.1086217Z 2025-05-07T19:57:14.1087860Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:57:14.1089607Z 2025-05-07T19:57:14.1091223Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:57:14.1093059Z 2025-05-07T19:57:14.1094699Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:57:14.1096691Z 2025-05-07T19:57:16.5899719Z [295/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include 
-I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++17 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_adam_split_unweighted_kernel_warp.cu.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_adam_split_unweighted_kernel_warp.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_backward_adam_split_unweighted_kernel_warp.cu -o CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_adam_split_unweighted_kernel_warp.cu.o 2025-05-07T19:57:16.5924651Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a 
function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:57:16.5926790Z 2025-05-07T19:57:16.5928617Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:57:16.5930753Z 2025-05-07T19:57:16.5932565Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:57:16.5934612Z 2025-05-07T19:57:16.5936442Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:57:16.5938500Z 2025-05-07T19:57:16.5940313Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:57:16.5942828Z 2025-05-07T19:57:16.5944636Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:57:16.5946710Z 2025-05-07T19:57:16.6935806Z [296/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src 
-I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++17 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_adam_split_weighted_kernel_cta.cu.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_adam_split_weighted_kernel_cta.cu.o.d -x cu -c 
/__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_backward_adam_split_weighted_kernel_cta.cu -o CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_adam_split_weighted_kernel_cta.cu.o 2025-05-07T19:57:16.6957208Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:57:16.6959253Z 2025-05-07T19:57:16.6960958Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:57:16.6962815Z 2025-05-07T19:57:16.6964354Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:57:16.6966054Z 2025-05-07T19:57:16.6967454Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:57:16.6969105Z 2025-05-07T19:57:16.6970509Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:57:16.6972515Z 2025-05-07T19:57:16.6974226Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:57:16.6975932Z 
2025-05-07T19:57:17.7438823Z [297/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++17 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ 
-D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_adam_split_unweighted_nobag_kernel_cta.cu.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_adam_split_unweighted_nobag_kernel_cta.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_backward_adam_split_unweighted_nobag_kernel_cta.cu -o CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_adam_split_unweighted_nobag_kernel_cta.cu.o 2025-05-07T19:57:17.7464184Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:57:17.7465970Z 2025-05-07T19:57:17.7467642Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:57:17.7469387Z 2025-05-07T19:57:17.7470859Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:57:17.7472543Z 2025-05-07T19:57:17.7474213Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:57:17.7476259Z 2025-05-07T19:57:17.7477811Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 
2025-05-07T19:57:17.7479572Z 2025-05-07T19:57:17.7481429Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:57:17.7483227Z 2025-05-07T19:57:19.3970529Z [298/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -Xcudafe 
--diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++17 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_adam_split_unweighted_kernel_cta.cu.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_adam_split_unweighted_kernel_cta.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_backward_adam_split_unweighted_kernel_cta.cu -o CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_adam_split_unweighted_kernel_cta.cu.o 2025-05-07T19:57:19.3995429Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:57:19.3997593Z 2025-05-07T19:57:19.3999434Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:57:19.4001520Z 2025-05-07T19:57:19.4003571Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:57:19.4005836Z 2025-05-07T19:57:19.4007647Z 
/__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:57:19.4009735Z 2025-05-07T19:57:19.4011784Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:57:19.4013829Z 2025-05-07T19:57:19.4015626Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:57:19.4017685Z 2025-05-07T19:57:23.3439381Z [299/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem 
/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++17 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_lamb_split_weighted_kernel_warp.cu.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_lamb_split_weighted_kernel_warp.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_backward_lamb_split_weighted_kernel_warp.cu -o CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_lamb_split_weighted_kernel_warp.cu.o 2025-05-07T19:57:23.3464102Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:57:23.3466216Z 2025-05-07T19:57:23.3468044Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:57:23.3469948Z 
2025-05-07T19:57:23.3471779Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:57:23.3475452Z 2025-05-07T19:57:23.3477507Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:57:23.3479571Z 2025-05-07T19:57:23.3481335Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:57:23.3483268Z 2025-05-07T19:57:23.3485067Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:57:23.3487124Z 2025-05-07T19:57:24.0265154Z [300/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem 
/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++17 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_sgd_split_unweighted_nobag_kernel_warp.cu.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_sgd_split_unweighted_nobag_kernel_warp.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_backward_sgd_split_unweighted_nobag_kernel_warp.cu -o CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_sgd_split_unweighted_nobag_kernel_warp.cu.o 2025-05-07T19:57:24.0288936Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 
2025-05-07T19:57:24.0290895Z 2025-05-07T19:57:24.0292426Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:57:24.0294655Z 2025-05-07T19:57:24.0296378Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:57:24.0298292Z 2025-05-07T19:57:24.0300292Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:57:24.0302315Z 2025-05-07T19:57:24.0303989Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:57:24.0305923Z 2025-05-07T19:57:24.0307639Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:57:24.0309578Z 2025-05-07T19:57:24.8274570Z [301/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include 
-I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++17 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_sgd_split_unweighted_kernel_warp.cu.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_sgd_split_unweighted_kernel_warp.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_backward_sgd_split_unweighted_kernel_warp.cu -o 
CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_sgd_split_unweighted_kernel_warp.cu.o 2025-05-07T19:57:24.8297704Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:57:24.8299564Z 2025-05-07T19:57:24.8301478Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:57:24.8303607Z 2025-05-07T19:57:24.8305592Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:57:24.8307552Z 2025-05-07T19:57:24.8309180Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:57:24.8311094Z 2025-05-07T19:57:24.8312871Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:57:24.8314808Z 2025-05-07T19:57:24.8316580Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:57:24.8318449Z 2025-05-07T19:57:24.9680476Z [302/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO 
-DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++17 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT 
CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_lamb_split_unweighted_nobag_kernel_warp.cu.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_lamb_split_unweighted_nobag_kernel_warp.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_backward_lamb_split_unweighted_nobag_kernel_warp.cu -o CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_lamb_split_unweighted_nobag_kernel_warp.cu.o 2025-05-07T19:57:24.9704272Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:57:24.9706471Z 2025-05-07T19:57:24.9708378Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:57:24.9710317Z 2025-05-07T19:57:24.9711984Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:57:24.9714067Z 2025-05-07T19:57:24.9715800Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:57:24.9717794Z 2025-05-07T19:57:24.9719478Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:57:24.9721308Z 2025-05-07T19:57:24.9723006Z 
/__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:57:24.9724881Z 2025-05-07T19:57:25.3971646Z [303/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -Xcudafe 
--diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++17 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_lamb_split_weighted_kernel_cta.cu.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_lamb_split_weighted_kernel_cta.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_backward_lamb_split_weighted_kernel_cta.cu -o CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_lamb_split_weighted_kernel_cta.cu.o 2025-05-07T19:57:25.3989132Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:57:25.3990629Z 2025-05-07T19:57:25.3992337Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:57:25.3994169Z 2025-05-07T19:57:25.3995442Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:57:25.3997042Z 2025-05-07T19:57:25.3998370Z 
/__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:57:25.3999915Z 2025-05-07T19:57:25.4001330Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:57:25.4003101Z 2025-05-07T19:57:25.4004626Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:57:25.4006125Z 2025-05-07T19:57:26.3204015Z [304/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem 
/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++17 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_lamb_split_unweighted_kernel_warp.cu.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_lamb_split_unweighted_kernel_warp.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_backward_lamb_split_unweighted_kernel_warp.cu -o CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_lamb_split_unweighted_kernel_warp.cu.o 2025-05-07T19:57:26.3222543Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:57:26.3224061Z 2025-05-07T19:57:26.3225469Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:57:26.3226906Z 
2025-05-07T19:57:26.3228202Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:57:26.3229722Z 2025-05-07T19:57:26.3231042Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:57:26.3232467Z 2025-05-07T19:57:26.3233897Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:57:26.3235342Z 2025-05-07T19:57:26.3236639Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:57:26.3238098Z 2025-05-07T19:57:26.4255387Z [305/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem 
/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++17 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_lars_sgd_split_weighted_kernel_warp.cu.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_lars_sgd_split_weighted_kernel_warp.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_backward_lars_sgd_split_weighted_kernel_warp.cu -o CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_lars_sgd_split_weighted_kernel_warp.cu.o 2025-05-07T19:57:26.4275788Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 
2025-05-07T19:57:26.4277707Z 2025-05-07T19:57:26.4279365Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:57:26.4281274Z 2025-05-07T19:57:26.4282913Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:57:26.4284583Z 2025-05-07T19:57:26.4285912Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:57:26.4287324Z 2025-05-07T19:57:26.4288633Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:57:26.4290504Z 2025-05-07T19:57:26.4292177Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:57:26.4294096Z 2025-05-07T19:57:28.6739763Z [306/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include 
-I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++17 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_lamb_split_unweighted_nobag_kernel_cta.cu.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_lamb_split_unweighted_nobag_kernel_cta.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_backward_lamb_split_unweighted_nobag_kernel_cta.cu -o 
CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_lamb_split_unweighted_nobag_kernel_cta.cu.o 2025-05-07T19:57:28.6762176Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:57:28.6763996Z 2025-05-07T19:57:28.6765588Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:57:28.6767224Z 2025-05-07T19:57:28.6768639Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:57:28.6770221Z 2025-05-07T19:57:28.6771767Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:57:28.6773294Z 2025-05-07T19:57:28.6774890Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:57:28.6776815Z 2025-05-07T19:57:28.6778365Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:57:28.6780105Z 2025-05-07T19:57:28.7300225Z [307/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO 
-DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++17 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT 
CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_adam_split_unweighted_nobag_cuda.cu.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_adam_split_unweighted_nobag_cuda.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_backward_adam_split_unweighted_nobag_cuda.cu -o CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_adam_split_unweighted_nobag_cuda.cu.o 2025-05-07T19:57:28.7323338Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:57:28.7325279Z 2025-05-07T19:57:28.7326864Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:57:28.7328642Z 2025-05-07T19:57:28.7330135Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:57:28.7331710Z 2025-05-07T19:57:28.7333092Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:57:28.7334903Z 2025-05-07T19:57:28.7336479Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:57:28.7338160Z 2025-05-07T19:57:28.7339610Z 
/__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:57:28.7341177Z 2025-05-07T19:57:28.9885513Z [308/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -Xcudafe 
--diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++17 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_lamb_split_unweighted_kernel_cta.cu.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_lamb_split_unweighted_kernel_cta.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_backward_lamb_split_unweighted_kernel_cta.cu -o CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_lamb_split_unweighted_kernel_cta.cu.o 2025-05-07T19:57:28.9906706Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:57:28.9908467Z 2025-05-07T19:57:28.9909820Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:57:28.9911689Z 2025-05-07T19:57:28.9913452Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:57:28.9915218Z 2025-05-07T19:57:28.9916772Z 
/__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:57:28.9918558Z 2025-05-07T19:57:28.9920144Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:57:28.9921937Z 2025-05-07T19:57:28.9923572Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:57:28.9925481Z 2025-05-07T19:57:29.2931772Z [309/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem 
/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++17 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_lars_sgd_split_weighted_kernel_cta.cu.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_lars_sgd_split_weighted_kernel_cta.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_backward_lars_sgd_split_weighted_kernel_cta.cu -o CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_lars_sgd_split_weighted_kernel_cta.cu.o 2025-05-07T19:57:29.2953276Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:57:29.2955159Z 2025-05-07T19:57:29.2956572Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:57:29.2958248Z 
2025-05-07T19:57:29.2959811Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:57:29.2961503Z 2025-05-07T19:57:29.2963096Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:57:29.2964767Z 2025-05-07T19:57:29.2966317Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:57:29.2967926Z 2025-05-07T19:57:29.2969389Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:57:29.2971018Z 2025-05-07T19:57:29.5542221Z [310/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem 
/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++17 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_adam_split_weighted_cuda.cu.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_adam_split_weighted_cuda.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_backward_adam_split_weighted_cuda.cu -o CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_adam_split_weighted_cuda.cu.o 2025-05-07T19:57:29.5554205Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:57:29.5555208Z 2025-05-07T19:57:29.5556119Z 
/__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:57:29.5557101Z 2025-05-07T19:57:29.5557975Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:57:29.5558941Z 2025-05-07T19:57:29.5559811Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:57:29.5560823Z 2025-05-07T19:57:29.5561689Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:57:29.5562674Z 2025-05-07T19:57:29.5563537Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:57:29.5564577Z 2025-05-07T19:57:31.7266036Z [311/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include 
-I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++17 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_adam_split_unweighted_cuda.cu.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_adam_split_unweighted_cuda.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_backward_adam_split_unweighted_cuda.cu -o CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_adam_split_unweighted_cuda.cu.o 
2025-05-07T19:57:31.7288949Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:57:31.7290886Z 2025-05-07T19:57:31.7292620Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:57:31.7294559Z 2025-05-07T19:57:31.7296237Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:57:31.7298105Z 2025-05-07T19:57:31.7299821Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:57:31.7301775Z 2025-05-07T19:57:31.7303711Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:57:31.7305542Z 2025-05-07T19:57:31.7307231Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:57:31.7309436Z 2025-05-07T19:57:32.2437324Z [312/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_EXPORTS 
-I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++17 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_lars_sgd_split_unweighted_nobag_kernel_warp.cu.o -MF 
CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_lars_sgd_split_unweighted_nobag_kernel_warp.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_backward_lars_sgd_split_unweighted_nobag_kernel_warp.cu -o CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_lars_sgd_split_unweighted_nobag_kernel_warp.cu.o 2025-05-07T19:57:32.2461921Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:57:32.2463861Z 2025-05-07T19:57:32.2465462Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:57:32.2467348Z 2025-05-07T19:57:32.2469025Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:57:32.2470925Z 2025-05-07T19:57:32.2472788Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:57:32.2474696Z 2025-05-07T19:57:32.2476020Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:57:32.2477674Z 2025-05-07T19:57:32.2479084Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is 
ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:57:32.2480859Z 2025-05-07T19:57:32.7798497Z [313/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++17 -Xcompiler=-fPIC 
-Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_lars_sgd_split_unweighted_kernel_warp.cu.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_lars_sgd_split_unweighted_kernel_warp.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_backward_lars_sgd_split_unweighted_kernel_warp.cu -o CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_lars_sgd_split_unweighted_kernel_warp.cu.o 2025-05-07T19:57:32.7821675Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:57:32.7823616Z 2025-05-07T19:57:32.7825343Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:57:32.7827214Z 2025-05-07T19:57:32.7828890Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:57:32.7830822Z 2025-05-07T19:57:32.7832520Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:57:32.7834896Z 2025-05-07T19:57:32.7836487Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ 
annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:57:32.7838226Z 2025-05-07T19:57:32.7839956Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:57:32.7841899Z 2025-05-07T19:57:32.8899307Z [314/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -Xcudafe 
--diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++17 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_lars_sgd_split_unweighted_nobag_kernel_cta.cu.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_lars_sgd_split_unweighted_nobag_kernel_cta.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_backward_lars_sgd_split_unweighted_nobag_kernel_cta.cu -o CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_lars_sgd_split_unweighted_nobag_kernel_cta.cu.o 2025-05-07T19:57:32.8923109Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:57:32.8925146Z 2025-05-07T19:57:32.8926894Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:57:32.8928886Z 2025-05-07T19:57:32.8930520Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:57:32.8932450Z 2025-05-07T19:57:32.8934401Z 
/__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:57:32.8936317Z 2025-05-07T19:57:32.8937986Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:57:32.8939874Z 2025-05-07T19:57:32.8941628Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:57:32.8943601Z 2025-05-07T19:57:33.2070771Z [315/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem 
/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++17 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_lars_sgd_split_unweighted_kernel_cta.cu.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_lars_sgd_split_unweighted_kernel_cta.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_backward_lars_sgd_split_unweighted_kernel_cta.cu -o CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_lars_sgd_split_unweighted_kernel_cta.cu.o 2025-05-07T19:57:33.2094369Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:57:33.2096319Z 2025-05-07T19:57:33.2098014Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:57:33.2099982Z 
2025-05-07T19:57:33.2101718Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:57:33.2104195Z 2025-05-07T19:57:33.2105976Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:57:33.2107982Z 2025-05-07T19:57:33.2109718Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:57:33.2111607Z 2025-05-07T19:57:33.2113450Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:57:33.2115582Z 2025-05-07T19:57:33.7186312Z [316/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem 
/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++17 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_lamb_split_weighted_cuda.cu.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_lamb_split_weighted_cuda.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_backward_lamb_split_weighted_cuda.cu -o CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_lamb_split_weighted_cuda.cu.o 2025-05-07T19:57:33.7205994Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:57:33.7207626Z 2025-05-07T19:57:33.7209136Z 
/__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:57:33.7211010Z 2025-05-07T19:57:33.7212444Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:57:33.7214083Z 2025-05-07T19:57:33.7215540Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:57:33.7217140Z 2025-05-07T19:57:33.7218550Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:57:33.7220148Z 2025-05-07T19:57:33.7221605Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:57:33.7223426Z 2025-05-07T19:57:35.4467630Z [317/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_forward_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include 
-I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++17 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_forward.dir/gen_embedding_forward_ssd_unweighted_kernel.cu.o -MF CMakeFiles/fbgemm_gpu_tbe_training_forward.dir/gen_embedding_forward_ssd_unweighted_kernel.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_forward_ssd_unweighted_kernel.cu -o CMakeFiles/fbgemm_gpu_tbe_training_forward.dir/gen_embedding_forward_ssd_unweighted_kernel.cu.o 2025-05-07T19:57:35.4490580Z 
/__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:57:35.4492558Z 2025-05-07T19:57:35.4494183Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:57:35.4496341Z 2025-05-07T19:57:35.4498033Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:57:35.4499904Z 2025-05-07T19:57:35.4501637Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:57:35.4503825Z 2025-05-07T19:57:35.4505527Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:57:35.4507573Z 2025-05-07T19:57:35.4509288Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:57:35.4511163Z 2025-05-07T19:57:37.1460127Z [318/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include 
-I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++17 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_lamb_split_unweighted_nobag_cuda.cu.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_lamb_split_unweighted_nobag_cuda.cu.o.d -x 
cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_backward_lamb_split_unweighted_nobag_cuda.cu -o CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_lamb_split_unweighted_nobag_cuda.cu.o 2025-05-07T19:57:37.1481326Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:57:37.1483193Z 2025-05-07T19:57:37.1484631Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:57:37.1486265Z 2025-05-07T19:57:37.1487739Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:57:37.1489358Z 2025-05-07T19:57:37.1490763Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:57:37.1492361Z 2025-05-07T19:57:37.1493767Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:57:37.1495586Z 2025-05-07T19:57:37.1497137Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:57:37.1498743Z 
2025-05-07T19:57:38.3574885Z [319/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++17 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ 
-D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_sgd_split_unweighted_cuda.cu.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_sgd_split_unweighted_cuda.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_backward_sgd_split_unweighted_cuda.cu -o CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_sgd_split_unweighted_cuda.cu.o 2025-05-07T19:57:38.3595911Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:57:38.3598067Z 2025-05-07T19:57:38.3599646Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:57:38.3601365Z 2025-05-07T19:57:38.3603128Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:57:38.3604736Z 2025-05-07T19:57:38.3606376Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:57:38.3610487Z 2025-05-07T19:57:38.3611792Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:57:38.3613283Z 2025-05-07T19:57:38.3614803Z 
/__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:57:38.3616392Z 2025-05-07T19:57:39.4769770Z [320/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -Xcudafe 
--diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++17 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_lamb_split_unweighted_cuda.cu.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_lamb_split_unweighted_cuda.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_backward_lamb_split_unweighted_cuda.cu -o CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_lamb_split_unweighted_cuda.cu.o 2025-05-07T19:57:39.4792900Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:57:39.4794883Z 2025-05-07T19:57:39.4796690Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:57:39.4798635Z 2025-05-07T19:57:39.4800277Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:57:39.4802346Z 2025-05-07T19:57:39.4803948Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): 
warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:57:39.4806175Z 2025-05-07T19:57:39.4808046Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:57:39.4809981Z 2025-05-07T19:57:39.4811714Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:57:39.4813566Z 2025-05-07T19:57:39.7268481Z [321/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem 
/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++17 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_sgd_split_unweighted_kernel_cta.cu.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_sgd_split_unweighted_kernel_cta.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_backward_sgd_split_unweighted_kernel_cta.cu -o CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_sgd_split_unweighted_kernel_cta.cu.o 2025-05-07T19:57:39.7292059Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:57:39.7293966Z 2025-05-07T19:57:39.7295620Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:57:39.7297202Z 2025-05-07T19:57:39.7298822Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): 
warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:57:39.7300984Z 2025-05-07T19:57:39.7302584Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:57:39.7303989Z 2025-05-07T19:57:39.7305348Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:57:39.7306817Z 2025-05-07T19:57:39.7308011Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:57:39.7309540Z 2025-05-07T19:57:42.0910973Z [322/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem 
/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++17 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_lars_sgd_split_weighted_cuda.cu.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_lars_sgd_split_weighted_cuda.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_backward_lars_sgd_split_weighted_cuda.cu -o CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_lars_sgd_split_weighted_cuda.cu.o 2025-05-07T19:57:42.0934303Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:57:42.0936330Z 2025-05-07T19:57:42.0937981Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): 
warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:57:42.0939864Z 2025-05-07T19:57:42.0941825Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:57:42.0943410Z 2025-05-07T19:57:42.0945251Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:57:42.0947077Z 2025-05-07T19:57:42.0948776Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:57:42.0950265Z 2025-05-07T19:57:42.0951521Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:57:42.0952960Z 2025-05-07T19:57:43.1306115Z [323/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include 
-I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++17 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_lars_sgd_split_unweighted_nobag_cuda.cu.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_lars_sgd_split_unweighted_nobag_cuda.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_backward_lars_sgd_split_unweighted_nobag_cuda.cu -o CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_lars_sgd_split_unweighted_nobag_cuda.cu.o 2025-05-07T19:57:43.1329645Z 
/__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:57:43.1331574Z 2025-05-07T19:57:43.1333359Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:57:43.1335589Z 2025-05-07T19:57:43.1337386Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:57:43.1339256Z 2025-05-07T19:57:43.1341005Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:57:43.1342892Z 2025-05-07T19:57:43.1344561Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:57:43.1346523Z 2025-05-07T19:57:43.1348192Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:57:43.1350106Z 2025-05-07T19:57:46.3585347Z [324/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include 
-I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++17 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_lars_sgd_split_unweighted_cuda.cu.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_lars_sgd_split_unweighted_cuda.cu.o.d -x cu 
-c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_backward_lars_sgd_split_unweighted_cuda.cu -o CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_lars_sgd_split_unweighted_cuda.cu.o 2025-05-07T19:57:46.3608952Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:57:46.3611297Z 2025-05-07T19:57:46.3613048Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:57:46.3615029Z 2025-05-07T19:57:46.3616842Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:57:46.3618736Z 2025-05-07T19:57:46.3620406Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:57:46.3622318Z 2025-05-07T19:57:46.3623960Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:57:46.3625793Z 2025-05-07T19:57:46.3627450Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:57:46.3629310Z 
2025-05-07T19:57:51.1222792Z [325/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_forward_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++17 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ 
-D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_forward.dir/gen_embedding_forward_ssd_weighted_kernel.cu.o -MF CMakeFiles/fbgemm_gpu_tbe_training_forward.dir/gen_embedding_forward_ssd_weighted_kernel.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_forward_ssd_weighted_kernel.cu -o CMakeFiles/fbgemm_gpu_tbe_training_forward.dir/gen_embedding_forward_ssd_weighted_kernel.cu.o 2025-05-07T19:57:51.1244100Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:57:51.1246324Z 2025-05-07T19:57:51.1248203Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:57:51.1250133Z 2025-05-07T19:57:51.1251776Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:57:51.1253354Z 2025-05-07T19:57:51.1254950Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:57:51.1256754Z 2025-05-07T19:57:51.1258302Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:57:51.1260006Z 2025-05-07T19:57:51.1261631Z 
/__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:57:51.1263433Z 2025-05-07T19:58:18.3184537Z [326/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -Xcudafe 
--diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++17 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_none_split_weighted_kernel_warp.cu.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_none_split_weighted_kernel_warp.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_backward_none_split_weighted_kernel_warp.cu -o CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_none_split_weighted_kernel_warp.cu.o 2025-05-07T19:58:18.3206854Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:58:18.3208735Z 2025-05-07T19:58:18.3210739Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:58:18.3212656Z 2025-05-07T19:58:18.3214192Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:58:18.3215639Z 2025-05-07T19:58:18.3216961Z 
/__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:58:18.3218715Z 2025-05-07T19:58:18.3220273Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:58:18.3222097Z 2025-05-07T19:58:18.3223728Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:58:18.3225487Z 2025-05-07T19:58:19.2166066Z [327/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_forward_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem 
/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++17 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_forward.dir/gen_embedding_forward_split_weighted_vbe_kernel.cu.o -MF CMakeFiles/fbgemm_gpu_tbe_training_forward.dir/gen_embedding_forward_split_weighted_vbe_kernel.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_forward_split_weighted_vbe_kernel.cu -o CMakeFiles/fbgemm_gpu_tbe_training_forward.dir/gen_embedding_forward_split_weighted_vbe_kernel.cu.o 2025-05-07T19:58:19.2188440Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:58:19.2190240Z 2025-05-07T19:58:19.2191826Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:58:19.2193799Z 2025-05-07T19:58:19.2195386Z 
/__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:58:19.2197250Z 2025-05-07T19:58:19.2198938Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:58:19.2200861Z 2025-05-07T19:58:19.2202839Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:58:19.2204693Z 2025-05-07T19:58:19.2206210Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:58:19.2207951Z 2025-05-07T19:58:19.3938880Z [328/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem 
/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++17 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_none_split_unweighted_nobag_kernel_warp.cu.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_none_split_unweighted_nobag_kernel_warp.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_backward_none_split_unweighted_nobag_kernel_warp.cu -o CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_none_split_unweighted_nobag_kernel_warp.cu.o 2025-05-07T19:58:19.3962678Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 
2025-05-07T19:58:19.3964616Z 2025-05-07T19:58:19.3966323Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:58:19.3968251Z 2025-05-07T19:58:19.3969983Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:58:19.3971894Z 2025-05-07T19:58:19.3973632Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:58:19.3975554Z 2025-05-07T19:58:19.3977233Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:58:19.3979313Z 2025-05-07T19:58:19.3981021Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:58:19.3982951Z 2025-05-07T19:58:19.9402334Z [329/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include 
-I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++17 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_none_split_unweighted_kernel_warp.cu.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_none_split_unweighted_kernel_warp.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_backward_none_split_unweighted_kernel_warp.cu -o 
CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_none_split_unweighted_kernel_warp.cu.o 2025-05-07T19:58:19.9423481Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:58:19.9425051Z 2025-05-07T19:58:19.9426625Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:58:19.9428363Z 2025-05-07T19:58:19.9429851Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:58:19.9431642Z 2025-05-07T19:58:19.9433253Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:58:19.9434913Z 2025-05-07T19:58:19.9436280Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:58:19.9437798Z 2025-05-07T19:58:19.9439210Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:58:19.9440972Z 2025-05-07T19:58:25.6294991Z [330/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO 
-DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_forward_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++17 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT 
CMakeFiles/fbgemm_gpu_tbe_training_forward.dir/gen_embedding_forward_split_unweighted_vbe_kernel.cu.o -MF CMakeFiles/fbgemm_gpu_tbe_training_forward.dir/gen_embedding_forward_split_unweighted_vbe_kernel.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_forward_split_unweighted_vbe_kernel.cu -o CMakeFiles/fbgemm_gpu_tbe_training_forward.dir/gen_embedding_forward_split_unweighted_vbe_kernel.cu.o 2025-05-07T19:58:25.6317497Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:58:25.6319349Z 2025-05-07T19:58:25.6320856Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:58:25.6322709Z 2025-05-07T19:58:25.6324210Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:58:25.6325751Z 2025-05-07T19:58:25.6326999Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:58:25.6328379Z 2025-05-07T19:58:25.6329633Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:58:25.6331036Z 2025-05-07T19:58:25.6332308Z 
/__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:58:25.6333760Z 2025-05-07T19:58:32.7601224Z [331/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -Xcudafe 
--diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++17 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_none_split_weighted_kernel_cta.cu.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_none_split_weighted_kernel_cta.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_backward_none_split_weighted_kernel_cta.cu -o CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_none_split_weighted_kernel_cta.cu.o 2025-05-07T19:58:32.7620525Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:58:32.7622090Z 2025-05-07T19:58:32.7623453Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:58:32.7624987Z 2025-05-07T19:58:32.7626296Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:58:32.7627804Z 2025-05-07T19:58:32.7629171Z 
/__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:58:32.7630721Z 2025-05-07T19:58:32.7632002Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:58:32.7633480Z 2025-05-07T19:58:32.7634741Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:58:32.7636178Z 2025-05-07T19:58:32.9174056Z [332/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem 
/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++17 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_partial_rowwise_lamb_split_weighted_cuda.cu.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_partial_rowwise_lamb_split_weighted_cuda.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_backward_partial_rowwise_lamb_split_weighted_cuda.cu -o CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_partial_rowwise_lamb_split_weighted_cuda.cu.o 2025-05-07T19:58:32.9195199Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:58:32.9197031Z 2025-05-07T19:58:32.9198517Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 
2025-05-07T19:58:32.9200241Z 2025-05-07T19:58:32.9201719Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:58:32.9203696Z 2025-05-07T19:58:32.9205270Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:58:32.9207020Z 2025-05-07T19:58:32.9208530Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:58:32.9210245Z 2025-05-07T19:58:32.9211812Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:58:32.9213573Z 2025-05-07T19:58:33.3922960Z [333/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include 
-I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++17 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_partial_rowwise_lamb_split_unweighted_nobag_cuda.cu.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_partial_rowwise_lamb_split_unweighted_nobag_cuda.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_backward_partial_rowwise_lamb_split_unweighted_nobag_cuda.cu -o CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_partial_rowwise_lamb_split_unweighted_nobag_cuda.cu.o 2025-05-07T19:58:33.3946814Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: 
__host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:58:33.3948791Z 2025-05-07T19:58:33.3950536Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:58:33.3952497Z 2025-05-07T19:58:33.3954346Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:58:33.3956650Z 2025-05-07T19:58:33.3958415Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:58:33.3960380Z 2025-05-07T19:58:33.3962096Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:58:33.3964028Z 2025-05-07T19:58:33.3965781Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:58:33.3967926Z 2025-05-07T19:58:34.4372879Z [334/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include 
-I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++17 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_none_split_unweighted_nobag_kernel_cta.cu.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_none_split_unweighted_nobag_kernel_cta.cu.o.d -x cu -c 
/__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_backward_none_split_unweighted_nobag_kernel_cta.cu -o CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_none_split_unweighted_nobag_kernel_cta.cu.o 2025-05-07T19:58:34.4395884Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:58:34.4397881Z 2025-05-07T19:58:34.4399571Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:58:34.4401577Z 2025-05-07T19:58:34.4403484Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:58:34.4405369Z 2025-05-07T19:58:34.4406998Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:58:34.4408973Z 2025-05-07T19:58:34.4410673Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:58:34.4412592Z 2025-05-07T19:58:34.4414679Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:58:34.4416626Z 
2025-05-07T19:58:35.1440037Z [335/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++17 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ 
-D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_none_split_unweighted_kernel_cta.cu.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_none_split_unweighted_kernel_cta.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_backward_none_split_unweighted_kernel_cta.cu -o CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_none_split_unweighted_kernel_cta.cu.o 2025-05-07T19:58:35.1463157Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:58:35.1465130Z 2025-05-07T19:58:35.1466863Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:58:35.1468875Z 2025-05-07T19:58:35.1470551Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:58:35.1472459Z 2025-05-07T19:58:35.1474313Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:58:35.1476270Z 2025-05-07T19:58:35.1477944Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:58:35.1480208Z 
2025-05-07T19:58:35.1481969Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:58:35.1483922Z 2025-05-07T19:58:35.1599031Z [336/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -Xcudafe 
--diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++17 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_none_split_unweighted_nobag_cuda.cu.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_none_split_unweighted_nobag_cuda.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_backward_none_split_unweighted_nobag_cuda.cu -o CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_none_split_unweighted_nobag_cuda.cu.o 2025-05-07T19:58:35.1621799Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:58:35.1623817Z 2025-05-07T19:58:35.1625521Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:58:35.1627384Z 2025-05-07T19:58:35.1629054Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:58:35.1630882Z 2025-05-07T19:58:35.1632556Z 
/__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:58:35.1634878Z 2025-05-07T19:58:35.1636568Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:58:35.1638499Z 2025-05-07T19:58:35.1640159Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:58:35.1642043Z 2025-05-07T19:58:37.1936227Z [337/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem 
/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++17 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_none_split_weighted_cuda.cu.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_none_split_weighted_cuda.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_backward_none_split_weighted_cuda.cu -o CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_none_split_weighted_cuda.cu.o 2025-05-07T19:58:37.1957324Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:58:37.1958996Z 2025-05-07T19:58:37.1960573Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:58:37.1962352Z 2025-05-07T19:58:37.1963885Z 
/__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:58:37.1965595Z 2025-05-07T19:58:37.1967109Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:58:37.1969185Z 2025-05-07T19:58:37.1970713Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:58:37.1972402Z 2025-05-07T19:58:37.1973831Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:58:37.1975535Z 2025-05-07T19:58:37.7299439Z [338/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_forward_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem 
/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++17 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_forward.dir/gen_embedding_forward_ssd_weighted_vbe_kernel.cu.o -MF CMakeFiles/fbgemm_gpu_tbe_training_forward.dir/gen_embedding_forward_ssd_weighted_vbe_kernel.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_forward_ssd_weighted_vbe_kernel.cu -o CMakeFiles/fbgemm_gpu_tbe_training_forward.dir/gen_embedding_forward_ssd_weighted_vbe_kernel.cu.o 2025-05-07T19:58:37.7323956Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:58:37.7325736Z 2025-05-07T19:58:37.7327172Z 
/__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:58:37.7329095Z 2025-05-07T19:58:37.7330755Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:58:37.7332981Z 2025-05-07T19:58:37.7334678Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:58:37.7336570Z 2025-05-07T19:58:37.7338267Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:58:37.7340157Z 2025-05-07T19:58:37.7341856Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:58:37.7343777Z 2025-05-07T19:58:40.6371708Z [339/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include 
-I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++17 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_none_split_unweighted_cuda.cu.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_none_split_unweighted_cuda.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_backward_none_split_unweighted_cuda.cu -o CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_none_split_unweighted_cuda.cu.o 
2025-05-07T19:58:40.6392449Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:58:40.6394485Z 2025-05-07T19:58:40.6396098Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:58:40.6397756Z 2025-05-07T19:58:40.6399277Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:58:40.6401483Z 2025-05-07T19:58:40.6403283Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:58:40.6405067Z 2025-05-07T19:58:40.6406684Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:58:40.6408351Z 2025-05-07T19:58:40.6409792Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:58:40.6411799Z 2025-05-07T19:58:41.6405526Z [340/608] /github/home/miniconda/envs/build_binary/bin/c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_vbe_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include 
-I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/lib -fopenmp=libgomp -stdlib=libstdc++ -I/github/home/miniconda/envs/build_binary/include -O3 -DNDEBUG -std=c++17 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -mavx2 -mf16c -mfma -fopenmp -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward_vbe.dir/gen_embedding_backward_rowwise_adagrad_split_weighted_vbe_meta.cpp.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward_vbe.dir/gen_embedding_backward_rowwise_adagrad_split_weighted_vbe_meta.cpp.o.d -o CMakeFiles/fbgemm_gpu_tbe_training_backward_vbe.dir/gen_embedding_backward_rowwise_adagrad_split_weighted_vbe_meta.cpp.o -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_backward_rowwise_adagrad_split_weighted_vbe_meta.cpp 2025-05-07T19:58:41.6423890Z clang-16: warning: argument unused during compilation: 
'-L/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/lib' [-Wunused-command-line-argument] 2025-05-07T19:58:41.7200153Z [341/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++17 -Xcompiler=-fPIC 
-Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_partial_rowwise_lamb_split_weighted_kernel_warp.cu.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_partial_rowwise_lamb_split_weighted_kernel_warp.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_backward_partial_rowwise_lamb_split_weighted_kernel_warp.cu -o CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_partial_rowwise_lamb_split_weighted_kernel_warp.cu.o 2025-05-07T19:58:41.7222036Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:58:41.7223937Z 2025-05-07T19:58:41.7225516Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:58:41.7227118Z 2025-05-07T19:58:41.7228430Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:58:41.7230070Z 2025-05-07T19:58:41.7231628Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:58:41.7233588Z 2025-05-07T19:58:41.7234767Z 
/__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:58:41.7236252Z 2025-05-07T19:58:41.7237732Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:58:41.7239490Z 2025-05-07T19:58:41.8911055Z [342/608] /github/home/miniconda/envs/build_binary/bin/c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_vbe_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/lib -fopenmp=libgomp 
-stdlib=libstdc++ -I/github/home/miniconda/envs/build_binary/include -O3 -DNDEBUG -std=c++17 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -mavx2 -mf16c -mfma -fopenmp -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward_vbe.dir/gen_embedding_backward_rowwise_adagrad_ssd_weighted_vbe_meta.cpp.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward_vbe.dir/gen_embedding_backward_rowwise_adagrad_ssd_weighted_vbe_meta.cpp.o.d -o CMakeFiles/fbgemm_gpu_tbe_training_backward_vbe.dir/gen_embedding_backward_rowwise_adagrad_ssd_weighted_vbe_meta.cpp.o -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_backward_rowwise_adagrad_ssd_weighted_vbe_meta.cpp 2025-05-07T19:58:41.8929973Z clang-16: warning: argument unused during compilation: '-L/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/lib' [-Wunused-command-line-argument] 2025-05-07T19:58:42.8224942Z [343/608] /github/home/miniconda/envs/build_binary/bin/c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_vbe_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include 
-I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/lib -fopenmp=libgomp -stdlib=libstdc++ -I/github/home/miniconda/envs/build_binary/include -O3 -DNDEBUG -std=c++17 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -mavx2 -mf16c -mfma -fopenmp -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward_vbe.dir/gen_embedding_backward_rowwise_adagrad_split_unweighted_vbe_meta.cpp.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward_vbe.dir/gen_embedding_backward_rowwise_adagrad_split_unweighted_vbe_meta.cpp.o.d -o CMakeFiles/fbgemm_gpu_tbe_training_backward_vbe.dir/gen_embedding_backward_rowwise_adagrad_split_unweighted_vbe_meta.cpp.o -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_backward_rowwise_adagrad_split_unweighted_vbe_meta.cpp 2025-05-07T19:58:42.8244553Z clang-16: warning: argument unused during compilation: '-L/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/lib' [-Wunused-command-line-argument] 2025-05-07T19:58:43.3167177Z [344/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include 
-I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++17 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_partial_rowwise_lamb_split_unweighted_cuda.cu.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_partial_rowwise_lamb_split_unweighted_cuda.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_backward_partial_rowwise_lamb_split_unweighted_cuda.cu -o CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_partial_rowwise_lamb_split_unweighted_cuda.cu.o 2025-05-07T19:58:43.3186703Z 
/__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:58:43.3188342Z 2025-05-07T19:58:43.3189792Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:58:43.3191524Z 2025-05-07T19:58:43.3193200Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:58:43.3194934Z 2025-05-07T19:58:43.3196503Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:58:43.3198191Z 2025-05-07T19:58:43.3199749Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:58:43.3201712Z 2025-05-07T19:58:43.3203480Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:58:43.3205187Z 2025-05-07T19:58:43.4549034Z [345/608] /github/home/miniconda/envs/build_binary/bin/c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_vbe_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu 
-I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/lib -fopenmp=libgomp -stdlib=libstdc++ -I/github/home/miniconda/envs/build_binary/include -O3 -DNDEBUG -std=c++17 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -mavx2 -mf16c -mfma -fopenmp -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward_vbe.dir/gen_embedding_backward_rowwise_adagrad_with_counter_split_weighted_vbe_meta.cpp.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward_vbe.dir/gen_embedding_backward_rowwise_adagrad_with_counter_split_weighted_vbe_meta.cpp.o.d -o CMakeFiles/fbgemm_gpu_tbe_training_backward_vbe.dir/gen_embedding_backward_rowwise_adagrad_with_counter_split_weighted_vbe_meta.cpp.o -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_backward_rowwise_adagrad_with_counter_split_weighted_vbe_meta.cpp 2025-05-07T19:58:43.4566891Z clang-16: warning: argument unused 
during compilation: '-L/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/lib' [-Wunused-command-line-argument] 2025-05-07T19:58:43.5853660Z [346/608] /github/home/miniconda/envs/build_binary/bin/c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_vbe_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/lib -fopenmp=libgomp -stdlib=libstdc++ -I/github/home/miniconda/envs/build_binary/include -O3 -DNDEBUG -std=c++17 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -mavx2 -mf16c -mfma -fopenmp -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward_vbe.dir/gen_embedding_backward_rowwise_adagrad_ssd_unweighted_vbe_meta.cpp.o -MF 
CMakeFiles/fbgemm_gpu_tbe_training_backward_vbe.dir/gen_embedding_backward_rowwise_adagrad_ssd_unweighted_vbe_meta.cpp.o.d -o CMakeFiles/fbgemm_gpu_tbe_training_backward_vbe.dir/gen_embedding_backward_rowwise_adagrad_ssd_unweighted_vbe_meta.cpp.o -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_backward_rowwise_adagrad_ssd_unweighted_vbe_meta.cpp 2025-05-07T19:58:43.5872700Z clang-16: warning: argument unused during compilation: '-L/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/lib' [-Wunused-command-line-argument] 2025-05-07T19:58:45.1392597Z [347/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/include 
-DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++17 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_partial_rowwise_lamb_split_unweighted_nobag_kernel_warp.cu.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_partial_rowwise_lamb_split_unweighted_nobag_kernel_warp.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_backward_partial_rowwise_lamb_split_unweighted_nobag_kernel_warp.cu -o CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_partial_rowwise_lamb_split_unweighted_nobag_kernel_warp.cu.o 2025-05-07T19:58:45.1413970Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:58:45.1415663Z 2025-05-07T19:58:45.1417464Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:58:45.1419204Z 2025-05-07T19:58:45.1420729Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a 
function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:58:45.1422424Z 2025-05-07T19:58:45.1423922Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:58:45.1425630Z 2025-05-07T19:58:45.1427108Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:58:45.1429006Z 2025-05-07T19:58:45.1430470Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:58:45.1432447Z 2025-05-07T19:58:45.7801008Z [348/608] /github/home/miniconda/envs/build_binary/bin/c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_vbe_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include 
-I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/lib -fopenmp=libgomp -stdlib=libstdc++ -I/github/home/miniconda/envs/build_binary/include -O3 -DNDEBUG -std=c++17 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -mavx2 -mf16c -mfma -fopenmp -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward_vbe.dir/gen_embedding_backward_rowwise_adagrad_with_counter_split_unweighted_vbe_meta.cpp.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward_vbe.dir/gen_embedding_backward_rowwise_adagrad_with_counter_split_unweighted_vbe_meta.cpp.o.d -o CMakeFiles/fbgemm_gpu_tbe_training_backward_vbe.dir/gen_embedding_backward_rowwise_adagrad_with_counter_split_unweighted_vbe_meta.cpp.o -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_backward_rowwise_adagrad_with_counter_split_unweighted_vbe_meta.cpp 2025-05-07T19:58:45.7819125Z clang-16: warning: argument unused during compilation: '-L/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/lib' [-Wunused-command-line-argument] 2025-05-07T19:58:46.3040435Z [349/608] /github/home/miniconda/envs/build_binary/bin/c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_vbe_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include 
-I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/lib -fopenmp=libgomp -stdlib=libstdc++ -I/github/home/miniconda/envs/build_binary/include -O3 -DNDEBUG -std=c++17 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -mavx2 -mf16c -mfma -fopenmp -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward_vbe.dir/gen_embedding_backward_sgd_split_weighted_vbe_meta.cpp.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward_vbe.dir/gen_embedding_backward_sgd_split_weighted_vbe_meta.cpp.o.d -o CMakeFiles/fbgemm_gpu_tbe_training_backward_vbe.dir/gen_embedding_backward_sgd_split_weighted_vbe_meta.cpp.o -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_backward_sgd_split_weighted_vbe_meta.cpp 2025-05-07T19:58:46.3060158Z clang-16: warning: argument unused during compilation: '-L/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/lib' [-Wunused-command-line-argument] 2025-05-07T19:58:47.6538797Z [350/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED 
-DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_forward_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++17 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_forward.dir/gen_embedding_forward_ssd_unweighted_vbe_kernel.cu.o -MF 
CMakeFiles/fbgemm_gpu_tbe_training_forward.dir/gen_embedding_forward_ssd_unweighted_vbe_kernel.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_forward_ssd_unweighted_vbe_kernel.cu -o CMakeFiles/fbgemm_gpu_tbe_training_forward.dir/gen_embedding_forward_ssd_unweighted_vbe_kernel.cu.o 2025-05-07T19:58:47.6561670Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:58:47.6563486Z 2025-05-07T19:58:47.6565135Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:58:47.6567171Z 2025-05-07T19:58:47.6568822Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:58:47.6570728Z 2025-05-07T19:58:47.6572638Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:58:47.6574547Z 2025-05-07T19:58:47.6576243Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:58:47.6578148Z 2025-05-07T19:58:47.6579849Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is 
explicitly defaulted on its first declaration 2025-05-07T19:58:47.6581684Z 2025-05-07T19:58:48.2184028Z [351/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++17 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion 
-Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_partial_rowwise_lamb_split_unweighted_kernel_warp.cu.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_partial_rowwise_lamb_split_unweighted_kernel_warp.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_backward_partial_rowwise_lamb_split_unweighted_kernel_warp.cu -o CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_partial_rowwise_lamb_split_unweighted_kernel_warp.cu.o 2025-05-07T19:58:48.2204362Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:58:48.2206146Z 2025-05-07T19:58:48.2207500Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:58:48.2209029Z 2025-05-07T19:58:48.2210560Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:58:48.2212060Z 2025-05-07T19:58:48.2213425Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:58:48.2214935Z 2025-05-07T19:58:48.2216272Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ 
annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:58:48.2217804Z 2025-05-07T19:58:48.2219160Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:58:48.2220692Z 2025-05-07T19:58:49.3037592Z [352/608] /github/home/miniconda/envs/build_binary/bin/c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_vbe_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/lib -fopenmp=libgomp -stdlib=libstdc++ -I/github/home/miniconda/envs/build_binary/include -O3 -DNDEBUG -std=c++17 -fPIC 
-Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -mavx2 -mf16c -mfma -fopenmp -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward_vbe.dir/gen_embedding_backward_sgd_split_unweighted_vbe_meta.cpp.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward_vbe.dir/gen_embedding_backward_sgd_split_unweighted_vbe_meta.cpp.o.d -o CMakeFiles/fbgemm_gpu_tbe_training_backward_vbe.dir/gen_embedding_backward_sgd_split_unweighted_vbe_meta.cpp.o -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_backward_sgd_split_unweighted_vbe_meta.cpp 2025-05-07T19:58:49.3057052Z clang-16: warning: argument unused during compilation: '-L/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/lib' [-Wunused-command-line-argument] 2025-05-07T19:58:50.0020185Z [353/608] /github/home/miniconda/envs/build_binary/bin/c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_vbe_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem 
/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/lib -fopenmp=libgomp -stdlib=libstdc++ -I/github/home/miniconda/envs/build_binary/include -O3 -DNDEBUG -std=c++17 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -mavx2 -mf16c -mfma -fopenmp -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward_vbe.dir/gen_embedding_backward_dense_split_unweighted_vbe_meta.cpp.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward_vbe.dir/gen_embedding_backward_dense_split_unweighted_vbe_meta.cpp.o.d -o CMakeFiles/fbgemm_gpu_tbe_training_backward_vbe.dir/gen_embedding_backward_dense_split_unweighted_vbe_meta.cpp.o -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_backward_dense_split_unweighted_vbe_meta.cpp 2025-05-07T19:58:50.0040028Z clang-16: warning: argument unused during compilation: '-L/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/lib' [-Wunused-command-line-argument] 2025-05-07T19:58:50.0430056Z [354/608] /github/home/miniconda/envs/build_binary/bin/c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_vbe_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem 
/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/lib -fopenmp=libgomp -stdlib=libstdc++ -I/github/home/miniconda/envs/build_binary/include -O3 -DNDEBUG -std=c++17 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -mavx2 -mf16c -mfma -fopenmp -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward_vbe.dir/gen_embedding_backward_dense_split_weighted_vbe_meta.cpp.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward_vbe.dir/gen_embedding_backward_dense_split_weighted_vbe_meta.cpp.o.d -o CMakeFiles/fbgemm_gpu_tbe_training_backward_vbe.dir/gen_embedding_backward_dense_split_weighted_vbe_meta.cpp.o -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_backward_dense_split_weighted_vbe_meta.cpp 2025-05-07T19:58:50.2981721Z clang-16: warning: argument unused during compilation: '-L/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/lib' [-Wunused-command-line-argument] 2025-05-07T19:58:50.3001609Z [355/608] /github/home/miniconda/envs/build_binary/bin/c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_vbe_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include 
-I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/lib -fopenmp=libgomp -stdlib=libstdc++ -I/github/home/miniconda/envs/build_binary/include -O3 -DNDEBUG -std=c++17 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -mavx2 -mf16c -mfma -fopenmp -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward_vbe.dir/gen_embedding_backward_adam_split_weighted_vbe_meta.cpp.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward_vbe.dir/gen_embedding_backward_adam_split_weighted_vbe_meta.cpp.o.d -o CMakeFiles/fbgemm_gpu_tbe_training_backward_vbe.dir/gen_embedding_backward_adam_split_weighted_vbe_meta.cpp.o -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_backward_adam_split_weighted_vbe_meta.cpp 2025-05-07T19:58:50.3021313Z clang-16: warning: argument unused during compilation: '-L/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/lib' [-Wunused-command-line-argument] 2025-05-07T19:58:51.1908727Z [356/608] /github/home/miniconda/envs/build_binary/bin/c++ -DUSE_C10D_GLOO 
-DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_vbe_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/lib -fopenmp=libgomp -stdlib=libstdc++ -I/github/home/miniconda/envs/build_binary/include -O3 -DNDEBUG -std=c++17 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -mavx2 -mf16c -mfma -fopenmp -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward_vbe.dir/gen_embedding_backward_adam_split_unweighted_vbe_meta.cpp.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward_vbe.dir/gen_embedding_backward_adam_split_unweighted_vbe_meta.cpp.o.d -o CMakeFiles/fbgemm_gpu_tbe_training_backward_vbe.dir/gen_embedding_backward_adam_split_unweighted_vbe_meta.cpp.o -c 
/__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_backward_adam_split_unweighted_vbe_meta.cpp 2025-05-07T19:58:51.1928450Z clang-16: warning: argument unused during compilation: '-L/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/lib' [-Wunused-command-line-argument] 2025-05-07T19:58:54.7656457Z [357/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -Xcudafe 
--diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++17 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_rowwise_adagrad_with_counter_split_weighted_cuda.cu.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_rowwise_adagrad_with_counter_split_weighted_cuda.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_backward_rowwise_adagrad_with_counter_split_weighted_cuda.cu -o CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_rowwise_adagrad_with_counter_split_weighted_cuda.cu.o 2025-05-07T19:58:54.7680513Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:58:54.7682439Z 2025-05-07T19:58:54.7684101Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:58:54.7685974Z 2025-05-07T19:58:54.7687651Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:58:54.7689810Z 2025-05-07T19:58:54.7691650Z 
/__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:58:54.7693575Z 2025-05-07T19:58:54.7695198Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:58:54.7697044Z 2025-05-07T19:58:54.7698687Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:58:54.7700548Z 2025-05-07T19:58:57.4523945Z [358/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem 
/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++17 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_rowwise_adagrad_with_counter_split_unweighted_cuda.cu.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_rowwise_adagrad_with_counter_split_unweighted_cuda.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_backward_rowwise_adagrad_with_counter_split_unweighted_cuda.cu -o CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_rowwise_adagrad_with_counter_split_unweighted_cuda.cu.o 2025-05-07T19:58:57.4547658Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:58:57.4549592Z 2025-05-07T19:58:57.4551269Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on 
its first declaration 2025-05-07T19:58:57.4553640Z 2025-05-07T19:58:57.4555494Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:58:57.4557342Z 2025-05-07T19:58:57.4558997Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:58:57.4560895Z 2025-05-07T19:58:57.4562595Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:58:57.4564529Z 2025-05-07T19:58:57.4566238Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:58:57.4568165Z 2025-05-07T19:58:58.1619837Z [359/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_gwd_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include 
-I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++17 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward_gwd.dir/gen_embedding_backward_rowwise_adagrad_split_weighted_gwd_cuda.cu.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward_gwd.dir/gen_embedding_backward_rowwise_adagrad_split_weighted_gwd_cuda.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_backward_rowwise_adagrad_split_weighted_gwd_cuda.cu -o CMakeFiles/fbgemm_gpu_tbe_training_backward_gwd.dir/gen_embedding_backward_rowwise_adagrad_split_weighted_gwd_cuda.cu.o 2025-05-07T19:58:58.1643352Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is 
ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:58:58.1645555Z 2025-05-07T19:58:58.1647205Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:58:58.1649354Z 2025-05-07T19:58:58.1650978Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:58:58.1652805Z 2025-05-07T19:58:58.1654341Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:58:58.1656049Z 2025-05-07T19:58:58.1657702Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:58:58.1659542Z 2025-05-07T19:58:58.1661221Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:58:58.1663085Z 2025-05-07T19:58:58.3093967Z [360/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src 
-I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++17 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_rowwise_adagrad_with_counter_split_unweighted_nobag_cuda.cu.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_rowwise_adagrad_with_counter_split_unweighted_nobag_cuda.cu.o.d -x cu -c 
/__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_backward_rowwise_adagrad_with_counter_split_unweighted_nobag_cuda.cu -o CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_rowwise_adagrad_with_counter_split_unweighted_nobag_cuda.cu.o 2025-05-07T19:58:58.3118607Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:58:58.3120914Z 2025-05-07T19:58:58.3122634Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:58:58.3124574Z 2025-05-07T19:58:58.3126261Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:58:58.3128179Z 2025-05-07T19:58:58.3129918Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:58:58.3131856Z 2025-05-07T19:58:58.3133555Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:58:58.3135472Z 2025-05-07T19:58:58.3137176Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 
2025-05-07T19:58:58.3139084Z 2025-05-07T19:58:59.0403096Z [361/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++17 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr 
-D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_partial_rowwise_lamb_split_weighted_kernel_cta.cu.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_partial_rowwise_lamb_split_weighted_kernel_cta.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_backward_partial_rowwise_lamb_split_weighted_kernel_cta.cu -o CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_partial_rowwise_lamb_split_weighted_kernel_cta.cu.o 2025-05-07T19:58:59.0426441Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:58:59.0428405Z 2025-05-07T19:58:59.0430077Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:58:59.0431976Z 2025-05-07T19:58:59.0433725Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:58:59.0435582Z 2025-05-07T19:58:59.0437212Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:58:59.0439065Z 2025-05-07T19:58:59.0440676Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") 
that is explicitly defaulted on its first declaration 2025-05-07T19:58:59.0442505Z 2025-05-07T19:58:59.0444174Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:58:59.0445986Z 2025-05-07T19:59:01.0004254Z [362/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_gwd_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -Xcudafe 
--diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++17 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward_gwd.dir/gen_embedding_backward_rowwise_adagrad_split_unweighted_gwd_cuda.cu.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward_gwd.dir/gen_embedding_backward_rowwise_adagrad_split_unweighted_gwd_cuda.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_backward_rowwise_adagrad_split_unweighted_gwd_cuda.cu -o CMakeFiles/fbgemm_gpu_tbe_training_backward_gwd.dir/gen_embedding_backward_rowwise_adagrad_split_unweighted_gwd_cuda.cu.o 2025-05-07T19:59:01.0027575Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:59:01.0029462Z 2025-05-07T19:59:01.0031156Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:59:01.0033180Z 2025-05-07T19:59:01.0034820Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:59:01.0036672Z 2025-05-07T19:59:01.0038286Z 
/__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:59:01.0040195Z 2025-05-07T19:59:01.0041823Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:59:01.0043597Z 2025-05-07T19:59:01.0045195Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:59:01.0047045Z 2025-05-07T19:59:03.1122668Z [363/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_gwd_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem 
/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++17 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward_gwd.dir/gen_embedding_backward_rowwise_adagrad_split_weighted_gwd_kernel_warp.cu.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward_gwd.dir/gen_embedding_backward_rowwise_adagrad_split_weighted_gwd_kernel_warp.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_backward_rowwise_adagrad_split_weighted_gwd_kernel_warp.cu -o CMakeFiles/fbgemm_gpu_tbe_training_backward_gwd.dir/gen_embedding_backward_rowwise_adagrad_split_weighted_gwd_kernel_warp.cu.o 2025-05-07T19:59:03.1145437Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:59:03.1147358Z 2025-05-07T19:59:03.1148850Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its 
first declaration 2025-05-07T19:59:03.1150597Z 2025-05-07T19:59:03.1152110Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:59:03.1153923Z 2025-05-07T19:59:03.1155513Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:59:03.1157206Z 2025-05-07T19:59:03.1158906Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:59:03.1160785Z 2025-05-07T19:59:03.1162502Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:59:03.1164430Z 2025-05-07T19:59:03.4988403Z [364/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include 
-I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++17 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_partial_rowwise_lamb_split_unweighted_nobag_kernel_cta.cu.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_partial_rowwise_lamb_split_unweighted_nobag_kernel_cta.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_backward_partial_rowwise_lamb_split_unweighted_nobag_kernel_cta.cu -o CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_partial_rowwise_lamb_split_unweighted_nobag_kernel_cta.cu.o 2025-05-07T19:59:03.5011938Z 
/__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:59:03.5013887Z 2025-05-07T19:59:03.5015639Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:59:03.5017563Z 2025-05-07T19:59:03.5019253Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:59:03.5021167Z 2025-05-07T19:59:03.5022879Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:59:03.5024824Z 2025-05-07T19:59:03.5026523Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:59:03.5028423Z 2025-05-07T19:59:03.5030139Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:59:03.5032043Z 2025-05-07T19:59:05.0548482Z [365/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include 
-I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++17 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_partial_rowwise_adam_split_weighted_cuda.cu.o -MF 
CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_partial_rowwise_adam_split_weighted_cuda.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_backward_partial_rowwise_adam_split_weighted_cuda.cu -o CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_partial_rowwise_adam_split_weighted_cuda.cu.o 2025-05-07T19:59:05.0570974Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:59:05.0572776Z 2025-05-07T19:59:05.0574361Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:59:05.0576147Z 2025-05-07T19:59:05.0577674Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:59:05.0579465Z 2025-05-07T19:59:05.0581083Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:59:05.0582875Z 2025-05-07T19:59:05.0584448Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:59:05.0586215Z 2025-05-07T19:59:05.0588118Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on 
a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:59:05.0590161Z 2025-05-07T19:59:05.3384808Z [366/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++17 -Xcompiler=-fPIC 
-Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_partial_rowwise_adam_split_unweighted_nobag_cuda.cu.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_partial_rowwise_adam_split_unweighted_nobag_cuda.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_backward_partial_rowwise_adam_split_unweighted_nobag_cuda.cu -o CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_partial_rowwise_adam_split_unweighted_nobag_cuda.cu.o 2025-05-07T19:59:05.3406749Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:59:05.3408504Z 2025-05-07T19:59:05.3410037Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:59:05.3411787Z 2025-05-07T19:59:05.3413318Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:59:05.3415010Z 2025-05-07T19:59:05.3416540Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:59:05.3418237Z 2025-05-07T19:59:05.3419715Z 
/__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:59:05.3421409Z 2025-05-07T19:59:05.3422925Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:59:05.3424903Z 2025-05-07T19:59:07.4360497Z [367/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/include -DONNX_NAMESPACE=onnx_c2 -gencode 
arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++17 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_partial_rowwise_lamb_split_unweighted_kernel_cta.cu.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_partial_rowwise_lamb_split_unweighted_kernel_cta.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_backward_partial_rowwise_lamb_split_unweighted_kernel_cta.cu -o CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_partial_rowwise_lamb_split_unweighted_kernel_cta.cu.o 2025-05-07T19:59:07.4381137Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:59:07.4382945Z 2025-05-07T19:59:07.4384404Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:59:07.4386142Z 2025-05-07T19:59:07.4387585Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first 
declaration 2025-05-07T19:59:07.4389290Z 2025-05-07T19:59:07.4390872Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:59:07.4392658Z 2025-05-07T19:59:07.4394389Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:59:07.4396406Z 2025-05-07T19:59:07.4398004Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:59:07.4399772Z 2025-05-07T19:59:09.3463661Z [368/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include 
-I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++17 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_rowwise_adagrad_with_counter_split_weighted_kernel_warp.cu.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_rowwise_adagrad_with_counter_split_weighted_kernel_warp.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_backward_rowwise_adagrad_with_counter_split_weighted_kernel_warp.cu -o CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_rowwise_adagrad_with_counter_split_weighted_kernel_warp.cu.o 2025-05-07T19:59:09.3483871Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:59:09.3485547Z 2025-05-07T19:59:09.3487102Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): 
warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:59:09.3488836Z 2025-05-07T19:59:09.3490361Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:59:09.3492027Z 2025-05-07T19:59:09.3493577Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:59:09.3495663Z 2025-05-07T19:59:09.3497140Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:59:09.3498902Z 2025-05-07T19:59:09.3500400Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:59:09.3501923Z 2025-05-07T19:59:10.5368717Z [369/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include 
-I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++17 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_rowwise_adagrad_with_counter_split_unweighted_nobag_kernel_warp.cu.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_rowwise_adagrad_with_counter_split_unweighted_nobag_kernel_warp.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_backward_rowwise_adagrad_with_counter_split_unweighted_nobag_kernel_warp.cu -o 
CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_rowwise_adagrad_with_counter_split_unweighted_nobag_kernel_warp.cu.o 2025-05-07T19:59:10.5393042Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:59:10.5394997Z 2025-05-07T19:59:10.5396661Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:59:10.5398545Z 2025-05-07T19:59:10.5400167Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:59:10.5402454Z 2025-05-07T19:59:10.5404114Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:59:10.5406002Z 2025-05-07T19:59:10.5407644Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:59:10.5409519Z 2025-05-07T19:59:10.5411138Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:59:10.5413175Z 2025-05-07T19:59:10.6337585Z [370/608] /github/home/miniconda/envs/build_binary/bin/nvcc 
-forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_gwd_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++17 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT 
CMakeFiles/fbgemm_gpu_tbe_training_backward_gwd.dir/gen_embedding_backward_rowwise_adagrad_split_unweighted_gwd_kernel_warp.cu.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward_gwd.dir/gen_embedding_backward_rowwise_adagrad_split_unweighted_gwd_kernel_warp.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_backward_rowwise_adagrad_split_unweighted_gwd_kernel_warp.cu -o CMakeFiles/fbgemm_gpu_tbe_training_backward_gwd.dir/gen_embedding_backward_rowwise_adagrad_split_unweighted_gwd_kernel_warp.cu.o 2025-05-07T19:59:10.6360991Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:59:10.6362885Z 2025-05-07T19:59:10.6364566Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:59:10.6366710Z 2025-05-07T19:59:10.6368393Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:59:10.6370221Z 2025-05-07T19:59:10.6371879Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:59:10.6373810Z 2025-05-07T19:59:10.6375451Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:59:10.6377421Z 
2025-05-07T19:59:10.6379077Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:59:10.6380932Z 2025-05-07T19:59:12.6095443Z [371/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -Xcudafe 
--diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++17 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_rowwise_adagrad_with_counter_split_unweighted_kernel_warp.cu.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_rowwise_adagrad_with_counter_split_unweighted_kernel_warp.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_backward_rowwise_adagrad_with_counter_split_unweighted_kernel_warp.cu -o CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_rowwise_adagrad_with_counter_split_unweighted_kernel_warp.cu.o 2025-05-07T19:59:12.6119294Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:59:12.6121492Z 2025-05-07T19:59:12.6123157Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:59:12.6125050Z 2025-05-07T19:59:12.6126742Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 
2025-05-07T19:59:12.6128556Z 2025-05-07T19:59:12.6130173Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:59:12.6132246Z 2025-05-07T19:59:12.6133843Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:59:12.6135666Z 2025-05-07T19:59:12.6137415Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:59:12.6139310Z 2025-05-07T19:59:16.7015042Z [372/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include 
-I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++17 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_partial_rowwise_adam_split_unweighted_cuda.cu.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_partial_rowwise_adam_split_unweighted_cuda.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_backward_partial_rowwise_adam_split_unweighted_cuda.cu -o CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_partial_rowwise_adam_split_unweighted_cuda.cu.o 2025-05-07T19:59:16.7034507Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:59:16.7036081Z 2025-05-07T19:59:16.7037455Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a 
function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:59:16.7038973Z 2025-05-07T19:59:16.7040439Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:59:16.7042218Z 2025-05-07T19:59:16.7043758Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:59:16.7045681Z 2025-05-07T19:59:16.7047466Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:59:16.7049189Z 2025-05-07T19:59:16.7050852Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:59:16.7052741Z 2025-05-07T19:59:19.4776780Z [373/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_gwd_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include 
-I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++17 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward_gwd.dir/gen_embedding_backward_rowwise_adagrad_split_weighted_gwd_kernel_cta.cu.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward_gwd.dir/gen_embedding_backward_rowwise_adagrad_split_weighted_gwd_kernel_cta.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_backward_rowwise_adagrad_split_weighted_gwd_kernel_cta.cu -o CMakeFiles/fbgemm_gpu_tbe_training_backward_gwd.dir/gen_embedding_backward_rowwise_adagrad_split_weighted_gwd_kernel_cta.cu.o 2025-05-07T19:59:19.4800299Z 
/__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:59:19.4801848Z 2025-05-07T19:59:19.4803496Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:59:19.4805204Z 2025-05-07T19:59:19.4806626Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:59:19.4808542Z 2025-05-07T19:59:19.4810300Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:59:19.4812150Z 2025-05-07T19:59:19.4813806Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:59:19.4815647Z 2025-05-07T19:59:19.4817259Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:59:19.4819069Z 2025-05-07T19:59:24.7259732Z [374/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_gwd_EXPORTS 
-I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++17 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward_gwd.dir/gen_embedding_backward_rowwise_adagrad_split_unweighted_gwd_kernel_cta.cu.o -MF 
CMakeFiles/fbgemm_gpu_tbe_training_backward_gwd.dir/gen_embedding_backward_rowwise_adagrad_split_unweighted_gwd_kernel_cta.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_backward_rowwise_adagrad_split_unweighted_gwd_kernel_cta.cu -o CMakeFiles/fbgemm_gpu_tbe_training_backward_gwd.dir/gen_embedding_backward_rowwise_adagrad_split_unweighted_gwd_kernel_cta.cu.o 2025-05-07T19:59:24.7282884Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:59:24.7284825Z 2025-05-07T19:59:24.7286465Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:59:24.7288638Z 2025-05-07T19:59:24.7290474Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:59:24.7292236Z 2025-05-07T19:59:24.7293860Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:59:24.7295727Z 2025-05-07T19:59:24.7297455Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:59:24.7299373Z 2025-05-07T19:59:24.7301055Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: 
__device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:59:24.7303255Z 2025-05-07T19:59:29.7889963Z [375/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG 
-std=c++17 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_rowwise_adagrad_with_counter_split_weighted_kernel_cta.cu.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_rowwise_adagrad_with_counter_split_weighted_kernel_cta.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_backward_rowwise_adagrad_with_counter_split_weighted_kernel_cta.cu -o CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_rowwise_adagrad_with_counter_split_weighted_kernel_cta.cu.o 2025-05-07T19:59:29.7914435Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:59:29.7916633Z 2025-05-07T19:59:29.7918169Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:59:29.7922399Z 2025-05-07T19:59:29.7924162Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:59:29.7926018Z 2025-05-07T19:59:29.7927737Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:59:29.7929604Z 2025-05-07T19:59:29.7931295Z 
/__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:59:29.7933223Z 2025-05-07T19:59:29.7934940Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:59:29.7936836Z 2025-05-07T19:59:31.1555621Z [376/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/include -DONNX_NAMESPACE=onnx_c2 -gencode 
arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++17 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_rowwise_adagrad_with_counter_split_unweighted_nobag_kernel_cta.cu.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_rowwise_adagrad_with_counter_split_unweighted_nobag_kernel_cta.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_backward_rowwise_adagrad_with_counter_split_unweighted_nobag_kernel_cta.cu -o CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_rowwise_adagrad_with_counter_split_unweighted_nobag_kernel_cta.cu.o 2025-05-07T19:59:31.1580732Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:59:31.1582698Z 2025-05-07T19:59:31.1584400Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:59:31.1586333Z 2025-05-07T19:59:31.1588061Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a 
function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:59:31.1589938Z 2025-05-07T19:59:31.1591657Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:59:31.1593802Z 2025-05-07T19:59:31.1595519Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:59:31.1597441Z 2025-05-07T19:59:31.1599157Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:59:31.1601071Z 2025-05-07T19:59:33.3987020Z [377/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem 
/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++17 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_rowwise_adagrad_with_counter_split_unweighted_kernel_cta.cu.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_rowwise_adagrad_with_counter_split_unweighted_kernel_cta.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_backward_rowwise_adagrad_with_counter_split_unweighted_kernel_cta.cu -o CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_rowwise_adagrad_with_counter_split_unweighted_kernel_cta.cu.o 2025-05-07T19:59:33.4009202Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:59:33.4010999Z 
2025-05-07T19:59:33.4012609Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:59:33.4014429Z 2025-05-07T19:59:33.4015941Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:59:33.4017574Z 2025-05-07T19:59:33.4019015Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:59:33.4020790Z 2025-05-07T19:59:33.4022365Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:59:33.4024104Z 2025-05-07T19:59:33.4025699Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:59:33.4027404Z 2025-05-07T19:59:38.2628493Z [378/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_gwd_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include 
-I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++17 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward_gwd.dir/gen_embedding_backward_rowwise_adagrad_split_weighted_vbe_gwd_cuda.cu.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward_gwd.dir/gen_embedding_backward_rowwise_adagrad_split_weighted_vbe_gwd_cuda.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_backward_rowwise_adagrad_split_weighted_vbe_gwd_cuda.cu -o 
CMakeFiles/fbgemm_gpu_tbe_training_backward_gwd.dir/gen_embedding_backward_rowwise_adagrad_split_weighted_vbe_gwd_cuda.cu.o 2025-05-07T19:59:38.2650313Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:59:38.2651814Z 2025-05-07T19:59:38.2653121Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:59:38.2654738Z 2025-05-07T19:59:38.2656327Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:59:38.2657954Z 2025-05-07T19:59:38.2659328Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:59:38.2660955Z 2025-05-07T19:59:38.2662192Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:59:38.2663805Z 2025-05-07T19:59:38.2665149Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:59:38.2666792Z 2025-05-07T19:59:39.2391823Z [379/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler 
-DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_gwd_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++17 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT 
CMakeFiles/fbgemm_gpu_tbe_training_backward_gwd.dir/gen_embedding_backward_rowwise_adagrad_split_unweighted_vbe_gwd_cuda.cu.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward_gwd.dir/gen_embedding_backward_rowwise_adagrad_split_unweighted_vbe_gwd_cuda.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_backward_rowwise_adagrad_split_unweighted_vbe_gwd_cuda.cu -o CMakeFiles/fbgemm_gpu_tbe_training_backward_gwd.dir/gen_embedding_backward_rowwise_adagrad_split_unweighted_vbe_gwd_cuda.cu.o 2025-05-07T19:59:39.2415541Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:59:39.2417564Z 2025-05-07T19:59:39.2419305Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:59:39.2421309Z 2025-05-07T19:59:39.2423039Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:59:39.2424986Z 2025-05-07T19:59:39.2426724Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:59:39.2428698Z 2025-05-07T19:59:39.2430414Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:59:39.2432345Z 
2025-05-07T19:59:39.2434203Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:59:39.2436141Z 2025-05-07T19:59:40.9367825Z [380/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_gwd_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -Xcudafe 
--diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++17 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward_gwd.dir/gen_embedding_backward_rowwise_adagrad_split_weighted_vbe_gwd_kernel_warp.cu.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward_gwd.dir/gen_embedding_backward_rowwise_adagrad_split_weighted_vbe_gwd_kernel_warp.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_backward_rowwise_adagrad_split_weighted_vbe_gwd_kernel_warp.cu -o CMakeFiles/fbgemm_gpu_tbe_training_backward_gwd.dir/gen_embedding_backward_rowwise_adagrad_split_weighted_vbe_gwd_kernel_warp.cu.o 2025-05-07T19:59:40.9391424Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:59:40.9393598Z 2025-05-07T19:59:40.9395334Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:59:40.9397298Z 2025-05-07T19:59:40.9399011Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:59:40.9400957Z 
2025-05-07T19:59:40.9402943Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:59:40.9404885Z 2025-05-07T19:59:40.9406629Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:59:40.9408498Z 2025-05-07T19:59:40.9410218Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:59:40.9412202Z 2025-05-07T19:59:49.1012740Z [381/608] /github/home/miniconda/envs/build_binary/bin/c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_dense_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem 
/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/lib -fopenmp=libgomp -stdlib=libstdc++ -I/github/home/miniconda/envs/build_binary/include -O3 -DNDEBUG -std=c++17 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -mavx2 -mf16c -mfma -fopenmp -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward_dense.dir/gen_embedding_backward_dense_split_weighted_meta.cpp.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward_dense.dir/gen_embedding_backward_dense_split_weighted_meta.cpp.o.d -o CMakeFiles/fbgemm_gpu_tbe_training_backward_dense.dir/gen_embedding_backward_dense_split_weighted_meta.cpp.o -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_backward_dense_split_weighted_meta.cpp 2025-05-07T19:59:49.1033521Z clang-16: warning: argument unused during compilation: '-L/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/lib' [-Wunused-command-line-argument] 2025-05-07T19:59:53.1520439Z [382/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_gwd_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include 
-I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++17 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward_gwd.dir/gen_embedding_backward_rowwise_adagrad_split_unweighted_vbe_gwd_kernel_warp.cu.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward_gwd.dir/gen_embedding_backward_rowwise_adagrad_split_unweighted_vbe_gwd_kernel_warp.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_backward_rowwise_adagrad_split_unweighted_vbe_gwd_kernel_warp.cu -o CMakeFiles/fbgemm_gpu_tbe_training_backward_gwd.dir/gen_embedding_backward_rowwise_adagrad_split_unweighted_vbe_gwd_kernel_warp.cu.o 2025-05-07T19:59:53.1544511Z 
/__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:59:53.1546709Z 2025-05-07T19:59:53.1548353Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:59:53.1550234Z 2025-05-07T19:59:53.1552190Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:59:53.1554188Z 2025-05-07T19:59:53.1555853Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:59:53.1557554Z 2025-05-07T19:59:53.1559148Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:59:53.1560936Z 2025-05-07T19:59:53.1562548Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:59:53.1564311Z 2025-05-07T19:59:57.1469645Z [383/608] /github/home/miniconda/envs/build_binary/bin/c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_dense_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu 
-I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/lib -fopenmp=libgomp -stdlib=libstdc++ -I/github/home/miniconda/envs/build_binary/include -O3 -DNDEBUG -std=c++17 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -mavx2 -mf16c -mfma -fopenmp -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward_dense.dir/gen_embedding_backward_dense_split_unweighted_nobag_meta.cpp.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward_dense.dir/gen_embedding_backward_dense_split_unweighted_nobag_meta.cpp.o.d -o CMakeFiles/fbgemm_gpu_tbe_training_backward_dense.dir/gen_embedding_backward_dense_split_unweighted_nobag_meta.cpp.o -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_backward_dense_split_unweighted_nobag_meta.cpp 2025-05-07T19:59:57.1490081Z clang-16: warning: argument unused during compilation: 
'-L/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/lib' [-Wunused-command-line-argument] 2025-05-07T19:59:59.4118307Z [384/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_gwd_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++17 -Xcompiler=-fPIC 
-Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward_gwd.dir/gen_embedding_backward_rowwise_adagrad_split_weighted_vbe_gwd_kernel_cta.cu.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward_gwd.dir/gen_embedding_backward_rowwise_adagrad_split_weighted_vbe_gwd_kernel_cta.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_backward_rowwise_adagrad_split_weighted_vbe_gwd_kernel_cta.cu -o CMakeFiles/fbgemm_gpu_tbe_training_backward_gwd.dir/gen_embedding_backward_rowwise_adagrad_split_weighted_vbe_gwd_kernel_cta.cu.o 2025-05-07T19:59:59.4138593Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:59:59.4140232Z 2025-05-07T19:59:59.4141716Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:59:59.4143365Z 2025-05-07T19:59:59.4144767Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:59:59.4146885Z 2025-05-07T19:59:59.4148464Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:59:59.4150159Z 2025-05-07T19:59:59.4151619Z 
/__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:59:59.4153329Z 2025-05-07T19:59:59.4154744Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:59:59.4156309Z 2025-05-07T20:00:01.0916219Z [385/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_vbe_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/include -DONNX_NAMESPACE=onnx_c2 -gencode 
arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++17 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward_vbe.dir/gen_embedding_backward_rowwise_adagrad_split_weighted_vbe_kernel_warp.cu.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward_vbe.dir/gen_embedding_backward_rowwise_adagrad_split_weighted_vbe_kernel_warp.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_backward_rowwise_adagrad_split_weighted_vbe_kernel_warp.cu -o CMakeFiles/fbgemm_gpu_tbe_training_backward_vbe.dir/gen_embedding_backward_rowwise_adagrad_split_weighted_vbe_kernel_warp.cu.o 2025-05-07T20:00:01.0939978Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:00:01.0941941Z 2025-05-07T20:00:01.0943644Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:00:01.0945736Z 2025-05-07T20:00:01.0947459Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its 
first declaration 2025-05-07T20:00:01.0949346Z 2025-05-07T20:00:01.0951031Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:00:01.0953084Z 2025-05-07T20:00:01.0954678Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:00:01.0956349Z 2025-05-07T20:00:01.0957919Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:00:01.0959878Z 2025-05-07T20:00:01.3905597Z [386/608] /github/home/miniconda/envs/build_binary/bin/c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_dense_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include 
-I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/lib -fopenmp=libgomp -stdlib=libstdc++ -I/github/home/miniconda/envs/build_binary/include -O3 -DNDEBUG -std=c++17 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -mavx2 -mf16c -mfma -fopenmp -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward_dense.dir/gen_embedding_backward_dense_split_unweighted_meta.cpp.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward_dense.dir/gen_embedding_backward_dense_split_unweighted_meta.cpp.o.d -o CMakeFiles/fbgemm_gpu_tbe_training_backward_dense.dir/gen_embedding_backward_dense_split_unweighted_meta.cpp.o -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_backward_dense_split_unweighted_meta.cpp 2025-05-07T20:00:01.3921692Z clang-16: warning: argument unused during compilation: '-L/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/lib' [-Wunused-command-line-argument] 2025-05-07T20:00:02.1600304Z [387/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_vbe_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include 
-I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++17 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward_vbe.dir/gen_embedding_backward_rowwise_adagrad_split_weighted_vbe_cuda.cu.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward_vbe.dir/gen_embedding_backward_rowwise_adagrad_split_weighted_vbe_cuda.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_backward_rowwise_adagrad_split_weighted_vbe_cuda.cu -o CMakeFiles/fbgemm_gpu_tbe_training_backward_vbe.dir/gen_embedding_backward_rowwise_adagrad_split_weighted_vbe_cuda.cu.o 2025-05-07T20:00:02.1623496Z 
/__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:00:02.1625426Z 2025-05-07T20:00:02.1627165Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:00:02.1629137Z 2025-05-07T20:00:02.1630811Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:00:02.1632708Z 2025-05-07T20:00:02.1634498Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:00:02.1636457Z 2025-05-07T20:00:02.1638182Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:00:02.1640089Z 2025-05-07T20:00:02.1641768Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:00:02.1643579Z 2025-05-07T20:00:02.7767519Z [388/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_vbe_EXPORTS 
-I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++17 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward_vbe.dir/gen_embedding_backward_rowwise_adagrad_ssd_weighted_vbe_kernel_warp.cu.o -MF 
CMakeFiles/fbgemm_gpu_tbe_training_backward_vbe.dir/gen_embedding_backward_rowwise_adagrad_ssd_weighted_vbe_kernel_warp.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_backward_rowwise_adagrad_ssd_weighted_vbe_kernel_warp.cu -o CMakeFiles/fbgemm_gpu_tbe_training_backward_vbe.dir/gen_embedding_backward_rowwise_adagrad_ssd_weighted_vbe_kernel_warp.cu.o 2025-05-07T20:00:02.7788094Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:00:02.7789887Z 2025-05-07T20:00:02.7791501Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:00:02.7793502Z 2025-05-07T20:00:02.7794891Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:00:02.7796495Z 2025-05-07T20:00:02.7798069Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:00:02.7799828Z 2025-05-07T20:00:02.7801390Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:00:02.7803336Z 2025-05-07T20:00:02.7804920Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ 
annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:00:02.7806701Z 2025-05-07T20:00:03.1516194Z [389/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_vbe_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++17 
-Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward_vbe.dir/gen_embedding_backward_dense_split_weighted_vbe_kernel_warp.cu.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward_vbe.dir/gen_embedding_backward_dense_split_weighted_vbe_kernel_warp.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_backward_dense_split_weighted_vbe_kernel_warp.cu -o CMakeFiles/fbgemm_gpu_tbe_training_backward_vbe.dir/gen_embedding_backward_dense_split_weighted_vbe_kernel_warp.cu.o 2025-05-07T20:00:03.1537680Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:00:03.1539493Z 2025-05-07T20:00:03.1541079Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:00:03.1542800Z 2025-05-07T20:00:03.1544300Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:00:03.1546017Z 2025-05-07T20:00:03.1547365Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:00:03.1549034Z 2025-05-07T20:00:03.1550549Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning 
#20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:00:03.1552310Z 2025-05-07T20:00:03.1554140Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:00:03.1556253Z 2025-05-07T20:00:03.6871506Z [390/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_vbe_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -Xcudafe 
--diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++17 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward_vbe.dir/gen_embedding_backward_sgd_split_weighted_vbe_kernel_warp.cu.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward_vbe.dir/gen_embedding_backward_sgd_split_weighted_vbe_kernel_warp.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_backward_sgd_split_weighted_vbe_kernel_warp.cu -o CMakeFiles/fbgemm_gpu_tbe_training_backward_vbe.dir/gen_embedding_backward_sgd_split_weighted_vbe_kernel_warp.cu.o 2025-05-07T20:00:03.6893104Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:00:03.6894888Z 2025-05-07T20:00:03.6896411Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:00:03.6898151Z 2025-05-07T20:00:03.6899784Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:00:03.6901453Z 2025-05-07T20:00:03.6903233Z 
/__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:00:03.6905007Z 2025-05-07T20:00:03.6906584Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:00:03.6908615Z 2025-05-07T20:00:03.6910193Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:00:03.6911920Z 2025-05-07T20:00:03.7698229Z [391/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_vbe_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem 
/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++17 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward_vbe.dir/gen_embedding_backward_rowwise_adagrad_ssd_weighted_vbe_cuda.cu.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward_vbe.dir/gen_embedding_backward_rowwise_adagrad_ssd_weighted_vbe_cuda.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_backward_rowwise_adagrad_ssd_weighted_vbe_cuda.cu -o CMakeFiles/fbgemm_gpu_tbe_training_backward_vbe.dir/gen_embedding_backward_rowwise_adagrad_ssd_weighted_vbe_cuda.cu.o 2025-05-07T20:00:03.7720610Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:00:03.7722344Z 2025-05-07T20:00:03.7723796Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 
2025-05-07T20:00:03.7725661Z 2025-05-07T20:00:03.7727189Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:00:03.7728940Z 2025-05-07T20:00:03.7730421Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:00:03.7732122Z 2025-05-07T20:00:03.7733269Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:00:03.7734567Z 2025-05-07T20:00:03.7735989Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:00:03.7737581Z 2025-05-07T20:00:04.2319327Z [392/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_vbe_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include 
-I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++17 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward_vbe.dir/gen_embedding_backward_rowwise_adagrad_split_unweighted_vbe_kernel_warp.cu.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward_vbe.dir/gen_embedding_backward_rowwise_adagrad_split_unweighted_vbe_kernel_warp.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_backward_rowwise_adagrad_split_unweighted_vbe_kernel_warp.cu -o CMakeFiles/fbgemm_gpu_tbe_training_backward_vbe.dir/gen_embedding_backward_rowwise_adagrad_split_unweighted_vbe_kernel_warp.cu.o 2025-05-07T20:00:04.2341389Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning 
#20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:00:04.2343213Z 2025-05-07T20:00:04.2344733Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:00:04.2346413Z 2025-05-07T20:00:04.2348095Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:00:04.2349798Z 2025-05-07T20:00:04.2351568Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:00:04.2353454Z 2025-05-07T20:00:04.2355050Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:00:04.2356889Z 2025-05-07T20:00:04.2358562Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:00:04.2360451Z 2025-05-07T20:00:04.4722417Z [393/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_gwd_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include 
-I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++17 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward_gwd.dir/gen_embedding_backward_rowwise_adagrad_split_unweighted_vbe_gwd_kernel_cta.cu.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward_gwd.dir/gen_embedding_backward_rowwise_adagrad_split_unweighted_vbe_gwd_kernel_cta.cu.o.d -x cu -c 
/__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_backward_rowwise_adagrad_split_unweighted_vbe_gwd_kernel_cta.cu -o CMakeFiles/fbgemm_gpu_tbe_training_backward_gwd.dir/gen_embedding_backward_rowwise_adagrad_split_unweighted_vbe_gwd_kernel_cta.cu.o 2025-05-07T20:00:04.4743540Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:00:04.4745269Z 2025-05-07T20:00:04.4746873Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:00:04.4748704Z 2025-05-07T20:00:04.4750592Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:00:04.4752402Z 2025-05-07T20:00:04.4754157Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:00:04.4755970Z 2025-05-07T20:00:04.4757572Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:00:04.4759395Z 2025-05-07T20:00:04.4761068Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 
2025-05-07T20:00:04.4763096Z 2025-05-07T20:00:04.8632727Z [394/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_vbe_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++17 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr 
-D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward_vbe.dir/gen_embedding_backward_rowwise_adagrad_ssd_unweighted_vbe_kernel_warp.cu.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward_vbe.dir/gen_embedding_backward_rowwise_adagrad_ssd_unweighted_vbe_kernel_warp.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_backward_rowwise_adagrad_ssd_unweighted_vbe_kernel_warp.cu -o CMakeFiles/fbgemm_gpu_tbe_training_backward_vbe.dir/gen_embedding_backward_rowwise_adagrad_ssd_unweighted_vbe_kernel_warp.cu.o 2025-05-07T20:00:04.8655280Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:00:04.8657036Z 2025-05-07T20:00:04.8658665Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:00:04.8660511Z 2025-05-07T20:00:04.8661997Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:00:04.8663821Z 2025-05-07T20:00:04.8665392Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:00:04.8667186Z 2025-05-07T20:00:04.8668412Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a 
function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:00:04.8669907Z 2025-05-07T20:00:04.8671050Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:00:04.8672623Z 2025-05-07T20:00:04.8794437Z [395/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_vbe_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -Xcudafe 
--diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++17 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward_vbe.dir/gen_embedding_backward_rowwise_adagrad_split_unweighted_vbe_cuda.cu.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward_vbe.dir/gen_embedding_backward_rowwise_adagrad_split_unweighted_vbe_cuda.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_backward_rowwise_adagrad_split_unweighted_vbe_cuda.cu -o CMakeFiles/fbgemm_gpu_tbe_training_backward_vbe.dir/gen_embedding_backward_rowwise_adagrad_split_unweighted_vbe_cuda.cu.o 2025-05-07T20:00:04.8814600Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:00:04.8816617Z 2025-05-07T20:00:04.8818287Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:00:04.8820118Z 2025-05-07T20:00:04.8821694Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:00:04.8823750Z 2025-05-07T20:00:04.8825309Z 
/__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:00:04.8827255Z 2025-05-07T20:00:04.8828739Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:00:04.8830166Z 2025-05-07T20:00:04.8832019Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:00:04.8833892Z 2025-05-07T20:00:06.0717835Z [396/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_vbe_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem 
/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++17 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward_vbe.dir/gen_embedding_backward_sgd_split_unweighted_vbe_kernel_warp.cu.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward_vbe.dir/gen_embedding_backward_sgd_split_unweighted_vbe_kernel_warp.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_backward_sgd_split_unweighted_vbe_kernel_warp.cu -o CMakeFiles/fbgemm_gpu_tbe_training_backward_vbe.dir/gen_embedding_backward_sgd_split_unweighted_vbe_kernel_warp.cu.o 2025-05-07T20:00:06.0740349Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:00:06.0742248Z 2025-05-07T20:00:06.0743863Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 
2025-05-07T20:00:06.0745724Z 2025-05-07T20:00:06.0747221Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:00:06.0749013Z 2025-05-07T20:00:06.0750389Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:00:06.0752366Z 2025-05-07T20:00:06.0754243Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:00:06.0756019Z 2025-05-07T20:00:06.0757650Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:00:06.0759355Z 2025-05-07T20:00:08.8132008Z [397/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_vbe_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include 
-I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++17 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward_vbe.dir/gen_embedding_backward_rowwise_adagrad_ssd_unweighted_vbe_cuda.cu.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward_vbe.dir/gen_embedding_backward_rowwise_adagrad_ssd_unweighted_vbe_cuda.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_backward_rowwise_adagrad_ssd_unweighted_vbe_cuda.cu -o CMakeFiles/fbgemm_gpu_tbe_training_backward_vbe.dir/gen_embedding_backward_rowwise_adagrad_ssd_unweighted_vbe_cuda.cu.o 2025-05-07T20:00:08.8153766Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is 
ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:00:08.8155475Z 2025-05-07T20:00:08.8156865Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:00:08.8158574Z 2025-05-07T20:00:08.8160220Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:00:08.8161949Z 2025-05-07T20:00:08.8163444Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:00:08.8165092Z 2025-05-07T20:00:08.8166766Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:00:08.8168665Z 2025-05-07T20:00:08.8170340Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:00:08.8171961Z 2025-05-07T20:00:10.5166397Z [398/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_vbe_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include 
-I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++17 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward_vbe.dir/gen_embedding_backward_dense_split_unweighted_vbe_kernel_warp.cu.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward_vbe.dir/gen_embedding_backward_dense_split_unweighted_vbe_kernel_warp.cu.o.d -x cu -c 
/__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_backward_dense_split_unweighted_vbe_kernel_warp.cu -o CMakeFiles/fbgemm_gpu_tbe_training_backward_vbe.dir/gen_embedding_backward_dense_split_unweighted_vbe_kernel_warp.cu.o 2025-05-07T20:00:10.5190296Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:00:10.5192242Z 2025-05-07T20:00:10.5194169Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:00:10.5196299Z 2025-05-07T20:00:10.5197994Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:00:10.5199933Z 2025-05-07T20:00:10.5201824Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:00:10.5203898Z 2025-05-07T20:00:10.5205462Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:00:10.5207229Z 2025-05-07T20:00:10.5208920Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:00:10.5210885Z 
2025-05-07T20:00:15.6798418Z [399/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_vbe_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++17 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ 
-D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward_vbe.dir/gen_embedding_backward_rowwise_adagrad_with_counter_split_weighted_vbe_cuda.cu.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward_vbe.dir/gen_embedding_backward_rowwise_adagrad_with_counter_split_weighted_vbe_cuda.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_backward_rowwise_adagrad_with_counter_split_weighted_vbe_cuda.cu -o CMakeFiles/fbgemm_gpu_tbe_training_backward_vbe.dir/gen_embedding_backward_rowwise_adagrad_with_counter_split_weighted_vbe_cuda.cu.o 2025-05-07T20:00:15.6821569Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:00:15.6823689Z 2025-05-07T20:00:15.6825349Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:00:15.6827163Z 2025-05-07T20:00:15.6829006Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:00:15.6830793Z 2025-05-07T20:00:15.6832443Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:00:15.6834219Z 2025-05-07T20:00:15.6835910Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a 
function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:00:15.6837764Z 2025-05-07T20:00:15.6839373Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:00:15.6841048Z 2025-05-07T20:00:16.3887674Z [400/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_vbe_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -Xcudafe 
--diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++17 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward_vbe.dir/gen_embedding_backward_adam_split_weighted_vbe_kernel_warp.cu.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward_vbe.dir/gen_embedding_backward_adam_split_weighted_vbe_kernel_warp.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_backward_adam_split_weighted_vbe_kernel_warp.cu -o CMakeFiles/fbgemm_gpu_tbe_training_backward_vbe.dir/gen_embedding_backward_adam_split_weighted_vbe_kernel_warp.cu.o 2025-05-07T20:00:16.3908855Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:00:16.3910607Z 2025-05-07T20:00:16.3912421Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:00:16.3914286Z 2025-05-07T20:00:16.3915780Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:00:16.3917497Z 2025-05-07T20:00:16.3919037Z 
/__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:00:16.3920746Z 2025-05-07T20:00:16.3922170Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:00:16.3923893Z 2025-05-07T20:00:16.3925357Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:00:16.3927054Z 2025-05-07T20:00:16.6049846Z [401/608] /github/home/miniconda/envs/build_binary/bin/c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_dense_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem 
/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/lib -fopenmp=libgomp -stdlib=libstdc++ -I/github/home/miniconda/envs/build_binary/include -O3 -DNDEBUG -std=c++17 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -mavx2 -mf16c -mfma -fopenmp -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward_dense.dir/gen_embedding_backward_split_dense.cpp.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward_dense.dir/gen_embedding_backward_split_dense.cpp.o.d -o CMakeFiles/fbgemm_gpu_tbe_training_backward_dense.dir/gen_embedding_backward_split_dense.cpp.o -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_backward_split_dense.cpp 2025-05-07T20:00:16.6067187Z clang-16: warning: argument unused during compilation: '-L/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/lib' [-Wunused-command-line-argument] 2025-05-07T20:00:17.0459951Z [402/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_vbe_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include 
-isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++17 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward_vbe.dir/gen_embedding_backward_sgd_split_weighted_vbe_cuda.cu.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward_vbe.dir/gen_embedding_backward_sgd_split_weighted_vbe_cuda.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_backward_sgd_split_weighted_vbe_cuda.cu -o CMakeFiles/fbgemm_gpu_tbe_training_backward_vbe.dir/gen_embedding_backward_sgd_split_weighted_vbe_cuda.cu.o 2025-05-07T20:00:17.0482038Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:00:17.0483842Z 
2025-05-07T20:00:17.0485424Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:00:17.0487264Z 2025-05-07T20:00:17.0488879Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:00:17.0491054Z 2025-05-07T20:00:17.0492739Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:00:17.0494493Z 2025-05-07T20:00:17.0496137Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:00:17.0497954Z 2025-05-07T20:00:17.0499426Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:00:17.0501485Z 2025-05-07T20:00:18.1144688Z [403/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_vbe_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include 
-I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++17 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward_vbe.dir/gen_embedding_backward_dense_split_weighted_vbe_cuda.cu.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward_vbe.dir/gen_embedding_backward_dense_split_weighted_vbe_cuda.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_backward_dense_split_weighted_vbe_cuda.cu -o CMakeFiles/fbgemm_gpu_tbe_training_backward_vbe.dir/gen_embedding_backward_dense_split_weighted_vbe_cuda.cu.o 
2025-05-07T20:00:18.1166014Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:00:18.1167967Z 2025-05-07T20:00:18.1169672Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:00:18.1171897Z 2025-05-07T20:00:18.1173515Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:00:18.1175334Z 2025-05-07T20:00:18.1177015Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:00:18.1178817Z 2025-05-07T20:00:18.1180433Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:00:18.1182258Z 2025-05-07T20:00:18.1183840Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:00:18.1185758Z 2025-05-07T20:00:18.4672529Z [404/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_vbe_EXPORTS 
-I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++17 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward_vbe.dir/gen_embedding_backward_rowwise_adagrad_split_weighted_vbe_kernel_cta.cu.o -MF 
CMakeFiles/fbgemm_gpu_tbe_training_backward_vbe.dir/gen_embedding_backward_rowwise_adagrad_split_weighted_vbe_kernel_cta.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_backward_rowwise_adagrad_split_weighted_vbe_kernel_cta.cu -o CMakeFiles/fbgemm_gpu_tbe_training_backward_vbe.dir/gen_embedding_backward_rowwise_adagrad_split_weighted_vbe_kernel_cta.cu.o 2025-05-07T20:00:18.4693841Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:00:18.4695961Z 2025-05-07T20:00:18.4697570Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:00:18.4699311Z 2025-05-07T20:00:18.4700887Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:00:18.4702846Z 2025-05-07T20:00:18.4704380Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:00:18.4706162Z 2025-05-07T20:00:18.4707743Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:00:18.4709825Z 2025-05-07T20:00:18.4711664Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ 
annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:00:18.4713549Z 2025-05-07T20:00:18.5668037Z [405/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_vbe_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++17 
-Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward_vbe.dir/gen_embedding_backward_dense_split_weighted_vbe_kernel_cta.cu.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward_vbe.dir/gen_embedding_backward_dense_split_weighted_vbe_kernel_cta.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_backward_dense_split_weighted_vbe_kernel_cta.cu -o CMakeFiles/fbgemm_gpu_tbe_training_backward_vbe.dir/gen_embedding_backward_dense_split_weighted_vbe_kernel_cta.cu.o 2025-05-07T20:00:18.5688703Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:00:18.5690433Z 2025-05-07T20:00:18.5692077Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:00:18.5693644Z 2025-05-07T20:00:18.5695048Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:00:18.5696714Z 2025-05-07T20:00:18.5698093Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:00:18.5700082Z 2025-05-07T20:00:18.5704793Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning 
#20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:00:18.5706581Z 2025-05-07T20:00:18.5708063Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:00:18.5709791Z 2025-05-07T20:00:18.6035984Z [406/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_vbe_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -Xcudafe 
--diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++17 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward_vbe.dir/gen_embedding_backward_adam_split_weighted_vbe_kernel_cta.cu.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward_vbe.dir/gen_embedding_backward_adam_split_weighted_vbe_kernel_cta.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_backward_adam_split_weighted_vbe_kernel_cta.cu -o CMakeFiles/fbgemm_gpu_tbe_training_backward_vbe.dir/gen_embedding_backward_adam_split_weighted_vbe_kernel_cta.cu.o 2025-05-07T20:00:18.6057489Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:00:18.6059291Z 2025-05-07T20:00:18.6060920Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:00:18.6062720Z 2025-05-07T20:00:18.6064282Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:00:18.6066366Z 2025-05-07T20:00:18.6067988Z 
/__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:00:18.6069660Z 2025-05-07T20:00:18.6071254Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:00:18.6072928Z 2025-05-07T20:00:18.6074292Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:00:18.6075858Z 2025-05-07T20:00:19.4642375Z [407/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_vbe_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem 
/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++17 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward_vbe.dir/gen_embedding_backward_sgd_split_unweighted_vbe_cuda.cu.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward_vbe.dir/gen_embedding_backward_sgd_split_unweighted_vbe_cuda.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_backward_sgd_split_unweighted_vbe_cuda.cu -o CMakeFiles/fbgemm_gpu_tbe_training_backward_vbe.dir/gen_embedding_backward_sgd_split_unweighted_vbe_cuda.cu.o 2025-05-07T20:00:19.4663450Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:00:19.4665313Z 2025-05-07T20:00:19.4667064Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:00:19.4668938Z 
2025-05-07T20:00:19.4672452Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:00:19.4674417Z 2025-05-07T20:00:19.4676186Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:00:19.4678051Z 2025-05-07T20:00:19.4679481Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:00:19.4681138Z 2025-05-07T20:00:19.4682624Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:00:19.4684389Z 2025-05-07T20:00:19.7913310Z [408/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_vbe_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem 
/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++17 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward_vbe.dir/gen_embedding_backward_sgd_split_weighted_vbe_kernel_cta.cu.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward_vbe.dir/gen_embedding_backward_sgd_split_weighted_vbe_kernel_cta.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_backward_sgd_split_weighted_vbe_kernel_cta.cu -o CMakeFiles/fbgemm_gpu_tbe_training_backward_vbe.dir/gen_embedding_backward_sgd_split_weighted_vbe_kernel_cta.cu.o 2025-05-07T20:00:19.7936646Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 
2025-05-07T20:00:19.7938444Z 2025-05-07T20:00:19.7940152Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:00:19.7942350Z 2025-05-07T20:00:19.7944236Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:00:19.7946178Z 2025-05-07T20:00:19.7947901Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:00:19.7949838Z 2025-05-07T20:00:19.7951592Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:00:19.7953597Z 2025-05-07T20:00:19.7955331Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:00:19.7957273Z 2025-05-07T20:00:20.1323795Z [409/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_forward_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include 
-I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++17 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_forward.dir/gen_embedding_forward_split_weighted_vbe_gwd_kernel.cu.o -MF CMakeFiles/fbgemm_gpu_tbe_training_forward.dir/gen_embedding_forward_split_weighted_vbe_gwd_kernel.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_forward_split_weighted_vbe_gwd_kernel.cu -o 
CMakeFiles/fbgemm_gpu_tbe_training_forward.dir/gen_embedding_forward_split_weighted_vbe_gwd_kernel.cu.o 2025-05-07T20:00:20.1344023Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:00:20.1345847Z 2025-05-07T20:00:20.1347215Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:00:20.1348887Z 2025-05-07T20:00:20.1350442Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_forward_split_weighted_vbe_gwd_kernel.cu(245): warning #177-D: variable "offset_idx" was declared but never referenced 2025-05-07T20:00:20.1352083Z 2025-05-07T20:00:20.1353748Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:00:20.1355295Z 2025-05-07T20:00:20.1356761Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:00:20.1358265Z 2025-05-07T20:00:20.1359275Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_forward_split_weighted_vbe_gwd_kernel.cu(245): warning #177-D: variable "offset_idx" was declared but never referenced 2025-05-07T20:00:20.1360795Z 2025-05-07T20:00:20.1362250Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is 
explicitly defaulted on its first declaration 2025-05-07T20:00:20.1364109Z 2025-05-07T20:00:20.1365879Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:00:20.1367839Z 2025-05-07T20:00:20.2944525Z [410/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_vbe_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -Xcudafe 
--diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++17 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward_vbe.dir/gen_embedding_backward_rowwise_adagrad_ssd_weighted_vbe_kernel_cta.cu.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward_vbe.dir/gen_embedding_backward_rowwise_adagrad_ssd_weighted_vbe_kernel_cta.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_backward_rowwise_adagrad_ssd_weighted_vbe_kernel_cta.cu -o CMakeFiles/fbgemm_gpu_tbe_training_backward_vbe.dir/gen_embedding_backward_rowwise_adagrad_ssd_weighted_vbe_kernel_cta.cu.o 2025-05-07T20:00:20.2965191Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:00:20.2966862Z 2025-05-07T20:00:20.2968369Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:00:20.2969896Z 2025-05-07T20:00:20.2971219Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:00:20.2972810Z 2025-05-07T20:00:20.2974218Z 
/__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:00:20.2975857Z 2025-05-07T20:00:20.2977331Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:00:20.2979006Z 2025-05-07T20:00:20.2980482Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:00:20.2982211Z 2025-05-07T20:00:20.3969151Z [411/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_vbe_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem 
/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++17 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward_vbe.dir/gen_embedding_backward_sgd_split_unweighted_vbe_kernel_cta.cu.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward_vbe.dir/gen_embedding_backward_sgd_split_unweighted_vbe_kernel_cta.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_backward_sgd_split_unweighted_vbe_kernel_cta.cu -o CMakeFiles/fbgemm_gpu_tbe_training_backward_vbe.dir/gen_embedding_backward_sgd_split_unweighted_vbe_kernel_cta.cu.o 2025-05-07T20:00:20.3990578Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:00:20.3992373Z 2025-05-07T20:00:20.3993907Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 
2025-05-07T20:00:20.3995672Z 2025-05-07T20:00:20.3997289Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:00:20.3998951Z 2025-05-07T20:00:20.4000347Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:00:20.4002344Z 2025-05-07T20:00:20.4003832Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:00:20.4005574Z 2025-05-07T20:00:20.4007112Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:00:20.4008895Z 2025-05-07T20:00:21.0473817Z [412/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_vbe_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include 
-I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++17 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward_vbe.dir/gen_embedding_backward_rowwise_adagrad_with_counter_split_unweighted_vbe_cuda.cu.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward_vbe.dir/gen_embedding_backward_rowwise_adagrad_with_counter_split_unweighted_vbe_cuda.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_backward_rowwise_adagrad_with_counter_split_unweighted_vbe_cuda.cu -o CMakeFiles/fbgemm_gpu_tbe_training_backward_vbe.dir/gen_embedding_backward_rowwise_adagrad_with_counter_split_unweighted_vbe_cuda.cu.o 2025-05-07T20:00:21.0496991Z 
/__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:00:21.0498830Z 2025-05-07T20:00:21.0500146Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:00:21.0501794Z 2025-05-07T20:00:21.0503725Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:00:21.0505321Z 2025-05-07T20:00:21.0506781Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:00:21.0508688Z 2025-05-07T20:00:21.0510160Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:00:21.0512054Z 2025-05-07T20:00:21.0513670Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:00:21.0515362Z 2025-05-07T20:00:21.8759634Z [413/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_vbe_EXPORTS 
-I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++17 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward_vbe.dir/gen_embedding_backward_rowwise_adagrad_split_unweighted_vbe_kernel_cta.cu.o -MF 
CMakeFiles/fbgemm_gpu_tbe_training_backward_vbe.dir/gen_embedding_backward_rowwise_adagrad_split_unweighted_vbe_kernel_cta.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_backward_rowwise_adagrad_split_unweighted_vbe_kernel_cta.cu -o CMakeFiles/fbgemm_gpu_tbe_training_backward_vbe.dir/gen_embedding_backward_rowwise_adagrad_split_unweighted_vbe_kernel_cta.cu.o 2025-05-07T20:00:21.8781732Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:00:21.8783567Z 2025-05-07T20:00:21.8785247Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:00:21.8787052Z 2025-05-07T20:00:21.8788553Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:00:21.8790349Z 2025-05-07T20:00:21.8791934Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:00:21.8793866Z 2025-05-07T20:00:21.8795236Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:00:21.8797019Z 2025-05-07T20:00:21.8798467Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: 
__device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:00:21.8800123Z 2025-05-07T20:00:22.2375076Z [414/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_vbe_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG 
-std=c++17 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward_vbe.dir/gen_embedding_backward_dense_split_unweighted_vbe_cuda.cu.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward_vbe.dir/gen_embedding_backward_dense_split_unweighted_vbe_cuda.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_backward_dense_split_unweighted_vbe_cuda.cu -o CMakeFiles/fbgemm_gpu_tbe_training_backward_vbe.dir/gen_embedding_backward_dense_split_unweighted_vbe_cuda.cu.o 2025-05-07T20:00:22.2395325Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:00:22.2397185Z 2025-05-07T20:00:22.2398729Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:00:22.2400604Z 2025-05-07T20:00:22.2402428Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:00:22.2404204Z 2025-05-07T20:00:22.2405786Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:00:22.2407416Z 2025-05-07T20:00:22.2408941Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: 
__host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:00:22.2410673Z 2025-05-07T20:00:22.2412173Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:00:22.2414366Z 2025-05-07T20:00:23.2063220Z [415/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_vbe_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -Xcudafe 
--diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++17 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward_vbe.dir/gen_embedding_backward_rowwise_adagrad_ssd_unweighted_vbe_kernel_cta.cu.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward_vbe.dir/gen_embedding_backward_rowwise_adagrad_ssd_unweighted_vbe_kernel_cta.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_backward_rowwise_adagrad_ssd_unweighted_vbe_kernel_cta.cu -o CMakeFiles/fbgemm_gpu_tbe_training_backward_vbe.dir/gen_embedding_backward_rowwise_adagrad_ssd_unweighted_vbe_kernel_cta.cu.o 2025-05-07T20:00:23.2086402Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:00:23.2088289Z 2025-05-07T20:00:23.2089895Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:00:23.2091800Z 2025-05-07T20:00:23.2093465Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:00:23.2095330Z 2025-05-07T20:00:23.2097012Z 
/__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:00:23.2098822Z 2025-05-07T20:00:23.2100485Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:00:23.2102814Z 2025-05-07T20:00:23.2104465Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:00:23.2106302Z 2025-05-07T20:00:23.5131119Z [416/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_vbe_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem 
/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++17 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward_vbe.dir/gen_embedding_backward_dense_split_unweighted_vbe_kernel_cta.cu.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward_vbe.dir/gen_embedding_backward_dense_split_unweighted_vbe_kernel_cta.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_backward_dense_split_unweighted_vbe_kernel_cta.cu -o CMakeFiles/fbgemm_gpu_tbe_training_backward_vbe.dir/gen_embedding_backward_dense_split_unweighted_vbe_kernel_cta.cu.o 2025-05-07T20:00:23.5153565Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:00:23.5155517Z 2025-05-07T20:00:23.5157250Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 
2025-05-07T20:00:23.5159144Z 2025-05-07T20:00:23.5160727Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:00:23.5162563Z 2025-05-07T20:00:23.5164159Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:00:23.5166277Z 2025-05-07T20:00:23.5167851Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:00:23.5169707Z 2025-05-07T20:00:23.5171309Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:00:23.5173081Z 2025-05-07T20:00:23.6955441Z [417/608] /github/home/miniconda/envs/build_binary/bin/c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_split_host_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem 
/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/lib -fopenmp=libgomp -stdlib=libstdc++ -I/github/home/miniconda/envs/build_binary/include -O3 -DNDEBUG -std=c++17 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -mavx2 -mf16c -mfma -fopenmp -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward_split_host.dir/gen_embedding_backward_split_adagrad.cpp.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward_split_host.dir/gen_embedding_backward_split_adagrad.cpp.o.d -o CMakeFiles/fbgemm_gpu_tbe_training_backward_split_host.dir/gen_embedding_backward_split_adagrad.cpp.o -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_backward_split_adagrad.cpp 2025-05-07T20:00:23.6972073Z clang-16: warning: argument unused during compilation: '-L/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/lib' [-Wunused-command-line-argument] 2025-05-07T20:00:24.4020843Z [418/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_vbe_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include 
-I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++17 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward_vbe.dir/gen_embedding_backward_adam_split_unweighted_vbe_kernel_warp.cu.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward_vbe.dir/gen_embedding_backward_adam_split_unweighted_vbe_kernel_warp.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_backward_adam_split_unweighted_vbe_kernel_warp.cu -o 
CMakeFiles/fbgemm_gpu_tbe_training_backward_vbe.dir/gen_embedding_backward_adam_split_unweighted_vbe_kernel_warp.cu.o 2025-05-07T20:00:24.4044228Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:00:24.4046114Z 2025-05-07T20:00:24.4047767Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:00:24.4049591Z 2025-05-07T20:00:24.4051241Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:00:24.4053047Z 2025-05-07T20:00:24.4054695Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:00:24.4056532Z 2025-05-07T20:00:24.4058188Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:00:24.4060061Z 2025-05-07T20:00:24.4061747Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:00:24.4063645Z 2025-05-07T20:00:26.0740600Z [419/608] /github/home/miniconda/envs/build_binary/bin/c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED 
-DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_split_host_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/lib -fopenmp=libgomp -stdlib=libstdc++ -I/github/home/miniconda/envs/build_binary/include -O3 -DNDEBUG -std=c++17 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -mavx2 -mf16c -mfma -fopenmp -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward_split_host.dir/gen_embedding_backward_split_sgd.cpp.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward_split_host.dir/gen_embedding_backward_split_sgd.cpp.o.d -o CMakeFiles/fbgemm_gpu_tbe_training_backward_split_host.dir/gen_embedding_backward_split_sgd.cpp.o -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_backward_split_sgd.cpp 2025-05-07T20:00:26.0759857Z clang-16: warning: argument unused 
during compilation: '-L/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/lib' [-Wunused-command-line-argument] 2025-05-07T20:00:27.1673859Z [420/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_vbe_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++17 
-Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward_vbe.dir/gen_embedding_backward_adam_split_unweighted_vbe_kernel_cta.cu.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward_vbe.dir/gen_embedding_backward_adam_split_unweighted_vbe_kernel_cta.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_backward_adam_split_unweighted_vbe_kernel_cta.cu -o CMakeFiles/fbgemm_gpu_tbe_training_backward_vbe.dir/gen_embedding_backward_adam_split_unweighted_vbe_kernel_cta.cu.o 2025-05-07T20:00:27.1696313Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:00:27.1698403Z 2025-05-07T20:00:27.1699925Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:00:27.1701614Z 2025-05-07T20:00:27.1703389Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:00:27.1705083Z 2025-05-07T20:00:27.1706594Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:00:27.1708292Z 2025-05-07T20:00:27.1709753Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning 
#20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:00:27.1711672Z 2025-05-07T20:00:27.1713474Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:00:27.1715159Z 2025-05-07T20:00:28.4976763Z [421/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_vbe_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -Xcudafe 
--diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++17 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward_vbe.dir/gen_embedding_backward_adam_split_weighted_vbe_cuda.cu.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward_vbe.dir/gen_embedding_backward_adam_split_weighted_vbe_cuda.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_backward_adam_split_weighted_vbe_cuda.cu -o CMakeFiles/fbgemm_gpu_tbe_training_backward_vbe.dir/gen_embedding_backward_adam_split_weighted_vbe_cuda.cu.o 2025-05-07T20:00:28.4998739Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:00:28.5000935Z 2025-05-07T20:00:28.5002823Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:00:28.5004727Z 2025-05-07T20:00:28.5006419Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:00:28.5008302Z 2025-05-07T20:00:28.5010049Z 
/__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:00:28.5012209Z 2025-05-07T20:00:28.5013892Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:00:28.5015807Z 2025-05-07T20:00:28.5017681Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:00:28.5019557Z 2025-05-07T20:00:28.6758028Z [422/608] /github/home/miniconda/envs/build_binary/bin/c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_split_host_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem 
/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/lib -fopenmp=libgomp -stdlib=libstdc++ -I/github/home/miniconda/envs/build_binary/include -O3 -DNDEBUG -std=c++17 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -mavx2 -mf16c -mfma -fopenmp -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward_split_host.dir/gen_embedding_backward_adagrad_split_weighted_meta.cpp.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward_split_host.dir/gen_embedding_backward_adagrad_split_weighted_meta.cpp.o.d -o CMakeFiles/fbgemm_gpu_tbe_training_backward_split_host.dir/gen_embedding_backward_adagrad_split_weighted_meta.cpp.o -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_backward_adagrad_split_weighted_meta.cpp 2025-05-07T20:00:28.6774990Z clang-16: warning: argument unused during compilation: '-L/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/lib' [-Wunused-command-line-argument] 2025-05-07T20:00:28.7882089Z [423/608] /github/home/miniconda/envs/build_binary/bin/c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_split_host_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include 
-I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/lib -fopenmp=libgomp -stdlib=libstdc++ -I/github/home/miniconda/envs/build_binary/include -O3 -DNDEBUG -std=c++17 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -mavx2 -mf16c -mfma -fopenmp -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward_split_host.dir/gen_embedding_backward_adagrad_split_unweighted_meta.cpp.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward_split_host.dir/gen_embedding_backward_adagrad_split_unweighted_meta.cpp.o.d -o CMakeFiles/fbgemm_gpu_tbe_training_backward_split_host.dir/gen_embedding_backward_adagrad_split_unweighted_meta.cpp.o -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_backward_adagrad_split_unweighted_meta.cpp 2025-05-07T20:00:28.7901088Z clang-16: warning: argument unused during compilation: '-L/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/lib' [-Wunused-command-line-argument] 2025-05-07T20:00:29.2475854Z [424/608] /github/home/miniconda/envs/build_binary/bin/c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_split_host_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include 
-I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/lib -fopenmp=libgomp -stdlib=libstdc++ -I/github/home/miniconda/envs/build_binary/include -O3 -DNDEBUG -std=c++17 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -mavx2 -mf16c -mfma -fopenmp -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward_split_host.dir/gen_embedding_backward_rowwise_adagrad_split_weighted_meta.cpp.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward_split_host.dir/gen_embedding_backward_rowwise_adagrad_split_weighted_meta.cpp.o.d -o CMakeFiles/fbgemm_gpu_tbe_training_backward_split_host.dir/gen_embedding_backward_rowwise_adagrad_split_weighted_meta.cpp.o -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_backward_rowwise_adagrad_split_weighted_meta.cpp 2025-05-07T20:00:29.2494068Z clang-16: warning: argument unused during compilation: 
'-L/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/lib' [-Wunused-command-line-argument] 2025-05-07T20:00:29.5714662Z [425/608] /github/home/miniconda/envs/build_binary/bin/c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_split_host_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/lib -fopenmp=libgomp -stdlib=libstdc++ -I/github/home/miniconda/envs/build_binary/include -O3 -DNDEBUG -std=c++17 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -mavx2 -mf16c -mfma -fopenmp -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward_split_host.dir/gen_embedding_backward_split_lamb.cpp.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward_split_host.dir/gen_embedding_backward_split_lamb.cpp.o.d -o 
CMakeFiles/fbgemm_gpu_tbe_training_backward_split_host.dir/gen_embedding_backward_split_lamb.cpp.o -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_backward_split_lamb.cpp 2025-05-07T20:00:29.5733975Z clang-16: warning: argument unused during compilation: '-L/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/lib' [-Wunused-command-line-argument] 2025-05-07T20:00:29.6068370Z [426/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_forward_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -Xcudafe 
--diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++17 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_forward.dir/gen_embedding_forward_split_unweighted_vbe_gwd_kernel.cu.o -MF CMakeFiles/fbgemm_gpu_tbe_training_forward.dir/gen_embedding_forward_split_unweighted_vbe_gwd_kernel.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_forward_split_unweighted_vbe_gwd_kernel.cu -o CMakeFiles/fbgemm_gpu_tbe_training_forward.dir/gen_embedding_forward_split_unweighted_vbe_gwd_kernel.cu.o 2025-05-07T20:00:29.6091404Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:00:29.6093193Z 2025-05-07T20:00:29.6094903Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:00:29.6096737Z 2025-05-07T20:00:29.6098143Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_forward_split_unweighted_vbe_gwd_kernel.cu(236): warning #177-D: variable "offset_idx" was declared but never referenced 2025-05-07T20:00:29.6099697Z 2025-05-07T20:00:29.6101294Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: 
__host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:00:29.6103285Z 2025-05-07T20:00:29.6104844Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:00:29.6106703Z 2025-05-07T20:00:29.6107926Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_forward_split_unweighted_vbe_gwd_kernel.cu(236): warning #177-D: variable "offset_idx" was declared but never referenced 2025-05-07T20:00:29.6109483Z 2025-05-07T20:00:29.6111165Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:00:29.6113181Z 2025-05-07T20:00:29.6114892Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:00:29.6116815Z 2025-05-07T20:00:29.9486442Z [427/608] /github/home/miniconda/envs/build_binary/bin/c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_split_host_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include 
-I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/lib -fopenmp=libgomp -stdlib=libstdc++ -I/github/home/miniconda/envs/build_binary/include -O3 -DNDEBUG -std=c++17 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -mavx2 -mf16c -mfma -fopenmp -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward_split_host.dir/gen_embedding_backward_split_lars_sgd.cpp.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward_split_host.dir/gen_embedding_backward_split_lars_sgd.cpp.o.d -o CMakeFiles/fbgemm_gpu_tbe_training_backward_split_host.dir/gen_embedding_backward_split_lars_sgd.cpp.o -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_backward_split_lars_sgd.cpp 2025-05-07T20:00:29.9504315Z clang-16: warning: argument unused during compilation: '-L/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/lib' [-Wunused-command-line-argument] 2025-05-07T20:00:30.0623096Z [428/608] /github/home/miniconda/envs/build_binary/bin/c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_split_host_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src 
-I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/lib -fopenmp=libgomp -stdlib=libstdc++ -I/github/home/miniconda/envs/build_binary/include -O3 -DNDEBUG -std=c++17 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -mavx2 -mf16c -mfma -fopenmp -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward_split_host.dir/gen_embedding_backward_split_approx_rowwise_adagrad.cpp.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward_split_host.dir/gen_embedding_backward_split_approx_rowwise_adagrad.cpp.o.d -o CMakeFiles/fbgemm_gpu_tbe_training_backward_split_host.dir/gen_embedding_backward_split_approx_rowwise_adagrad.cpp.o -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_backward_split_approx_rowwise_adagrad.cpp 2025-05-07T20:00:30.0641770Z clang-16: warning: argument unused during compilation: '-L/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/lib' [-Wunused-command-line-argument] 2025-05-07T20:00:30.0897625Z 
[429/608] /github/home/miniconda/envs/build_binary/bin/c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_split_host_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/lib -fopenmp=libgomp -stdlib=libstdc++ -I/github/home/miniconda/envs/build_binary/include -O3 -DNDEBUG -std=c++17 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -mavx2 -mf16c -mfma -fopenmp -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward_split_host.dir/gen_embedding_backward_split_approx_sgd.cpp.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward_split_host.dir/gen_embedding_backward_split_approx_sgd.cpp.o.d -o CMakeFiles/fbgemm_gpu_tbe_training_backward_split_host.dir/gen_embedding_backward_split_approx_sgd.cpp.o -c 
/__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_backward_split_approx_sgd.cpp 2025-05-07T20:00:30.0917586Z clang-16: warning: argument unused during compilation: '-L/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/lib' [-Wunused-command-line-argument] 2025-05-07T20:00:30.1888127Z [430/608] /github/home/miniconda/envs/build_binary/bin/c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_split_host_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/lib -fopenmp=libgomp -stdlib=libstdc++ -I/github/home/miniconda/envs/build_binary/include -O3 -DNDEBUG -std=c++17 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -mavx2 -mf16c -mfma -fopenmp -MD -MT 
CMakeFiles/fbgemm_gpu_tbe_training_backward_split_host.dir/gen_embedding_backward_split_approx_rowwise_adagrad_with_counter.cpp.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward_split_host.dir/gen_embedding_backward_split_approx_rowwise_adagrad_with_counter.cpp.o.d -o CMakeFiles/fbgemm_gpu_tbe_training_backward_split_host.dir/gen_embedding_backward_split_approx_rowwise_adagrad_with_counter.cpp.o -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_backward_split_approx_rowwise_adagrad_with_counter.cpp 2025-05-07T20:00:30.1908669Z clang-16: warning: argument unused during compilation: '-L/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/lib' [-Wunused-command-line-argument] 2025-05-07T20:00:30.2908811Z [431/608] /github/home/miniconda/envs/build_binary/bin/c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_split_host_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem 
/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/lib -fopenmp=libgomp -stdlib=libstdc++ -I/github/home/miniconda/envs/build_binary/include -O3 -DNDEBUG -std=c++17 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -mavx2 -mf16c -mfma -fopenmp -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward_split_host.dir/gen_embedding_backward_rowwise_adagrad_split_unweighted_meta.cpp.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward_split_host.dir/gen_embedding_backward_rowwise_adagrad_split_unweighted_meta.cpp.o.d -o CMakeFiles/fbgemm_gpu_tbe_training_backward_split_host.dir/gen_embedding_backward_rowwise_adagrad_split_unweighted_meta.cpp.o -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_backward_rowwise_adagrad_split_unweighted_meta.cpp 2025-05-07T20:00:30.2929352Z clang-16: warning: argument unused during compilation: '-L/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/lib' [-Wunused-command-line-argument] 2025-05-07T20:00:30.3909889Z [432/608] /github/home/miniconda/envs/build_binary/bin/c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_split_host_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem 
/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/lib -fopenmp=libgomp -stdlib=libstdc++ -I/github/home/miniconda/envs/build_binary/include -O3 -DNDEBUG -std=c++17 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -mavx2 -mf16c -mfma -fopenmp -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward_split_host.dir/gen_embedding_backward_sgd_split_weighted_meta.cpp.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward_split_host.dir/gen_embedding_backward_sgd_split_weighted_meta.cpp.o.d -o CMakeFiles/fbgemm_gpu_tbe_training_backward_split_host.dir/gen_embedding_backward_sgd_split_weighted_meta.cpp.o -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_backward_sgd_split_weighted_meta.cpp 2025-05-07T20:00:30.3929712Z clang-16: warning: argument unused during compilation: '-L/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/lib' [-Wunused-command-line-argument] 2025-05-07T20:00:30.4064868Z [433/608] /github/home/miniconda/envs/build_binary/bin/c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_split_host_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include 
-I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/lib -fopenmp=libgomp -stdlib=libstdc++ -I/github/home/miniconda/envs/build_binary/include -O3 -DNDEBUG -std=c++17 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -mavx2 -mf16c -mfma -fopenmp -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward_split_host.dir/gen_embedding_backward_split_approx_rowwise_adagrad_with_weight_decay.cpp.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward_split_host.dir/gen_embedding_backward_split_approx_rowwise_adagrad_with_weight_decay.cpp.o.d -o CMakeFiles/fbgemm_gpu_tbe_training_backward_split_host.dir/gen_embedding_backward_split_approx_rowwise_adagrad_with_weight_decay.cpp.o -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_backward_split_approx_rowwise_adagrad_with_weight_decay.cpp 2025-05-07T20:00:30.4084138Z clang-16: warning: argument unused during compilation: '-L/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/lib' [-Wunused-command-line-argument] 
2025-05-07T20:00:30.8808735Z [434/608] : && /github/home/miniconda/envs/build_binary/bin/c++ -fPIC -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/lib -fopenmp=libgomp -stdlib=libstdc++ -I/github/home/miniconda/envs/build_binary/include -O3 -DNDEBUG -Wl,-O2 -Wl,--sort-common -Wl,--as-needed -Wl,-z,relro -Wl,-z,now -Wl,--disable-new-dtags -Wl,--gc-sections -Wl,--allow-shlib-undefined -Wl,-rpath,/github/home/miniconda/envs/build_binary/lib -Wl,-rpath-link,/github/home/miniconda/envs/build_binary/lib -L/github/home/miniconda/envs/build_binary/lib -s -shared -Wl,-soname,fbgemm_gpu_tbe_training_forward.so -o fbgemm_gpu_tbe_training_forward.so CMakeFiles/fbgemm_gpu_tbe_training_forward.dir/gen_embedding_forward_split_pt2_cpu_wrapper.cpp.o CMakeFiles/fbgemm_gpu_tbe_training_forward.dir/gen_embedding_forward_split_weighted_codegen_meta.cpp.o CMakeFiles/fbgemm_gpu_tbe_training_forward.dir/gen_embedding_forward_split_unweighted_codegen_meta.cpp.o CMakeFiles/fbgemm_gpu_tbe_training_forward.dir/gen_embedding_forward_dense_weighted_codegen_meta.cpp.o CMakeFiles/fbgemm_gpu_tbe_training_forward.dir/gen_embedding_forward_dense_unweighted_codegen_meta.cpp.o CMakeFiles/fbgemm_gpu_tbe_training_forward.dir/gen_embedding_forward_ssd_weighted_codegen_meta.cpp.o CMakeFiles/fbgemm_gpu_tbe_training_forward.dir/gen_embedding_forward_ssd_unweighted_codegen_meta.cpp.o CMakeFiles/fbgemm_gpu_tbe_training_forward.dir/gen_embedding_forward_split_weighted_vbe_codegen_meta.cpp.o CMakeFiles/fbgemm_gpu_tbe_training_forward.dir/gen_embedding_forward_ssd_weighted_vbe_codegen_meta.cpp.o CMakeFiles/fbgemm_gpu_tbe_training_forward.dir/gen_embedding_forward_split_unweighted_vbe_codegen_meta.cpp.o CMakeFiles/fbgemm_gpu_tbe_training_forward.dir/gen_embedding_forward_ssd_unweighted_vbe_codegen_meta.cpp.o CMakeFiles/fbgemm_gpu_tbe_training_forward.dir/gen_embedding_forward_split_pt2_cuda_wrapper.cpp.o 
CMakeFiles/fbgemm_gpu_tbe_training_forward.dir/gen_embedding_forward_ssd_pt2_cuda_wrapper.cpp.o CMakeFiles/fbgemm_gpu_tbe_training_forward.dir/gen_embedding_forward_split_weighted_kernel.cu.o CMakeFiles/fbgemm_gpu_tbe_training_forward.dir/gen_embedding_forward_dense_weighted_kernel.cu.o CMakeFiles/fbgemm_gpu_tbe_training_forward.dir/gen_embedding_forward_ssd_weighted_kernel.cu.o CMakeFiles/fbgemm_gpu_tbe_training_forward.dir/gen_embedding_forward_split_unweighted_nobag_kernel.cu.o CMakeFiles/fbgemm_gpu_tbe_training_forward.dir/gen_embedding_forward_dense_unweighted_nobag_kernel.cu.o CMakeFiles/fbgemm_gpu_tbe_training_forward.dir/gen_embedding_forward_ssd_unweighted_nobag_kernel.cu.o CMakeFiles/fbgemm_gpu_tbe_training_forward.dir/gen_embedding_forward_split_unweighted_kernel.cu.o CMakeFiles/fbgemm_gpu_tbe_training_forward.dir/gen_embedding_forward_dense_unweighted_kernel.cu.o CMakeFiles/fbgemm_gpu_tbe_training_forward.dir/gen_embedding_forward_ssd_unweighted_kernel.cu.o CMakeFiles/fbgemm_gpu_tbe_training_forward.dir/gen_embedding_forward_split_weighted_codegen_cuda.cu.o CMakeFiles/fbgemm_gpu_tbe_training_forward.dir/gen_embedding_forward_split_unweighted_codegen_cuda.cu.o CMakeFiles/fbgemm_gpu_tbe_training_forward.dir/gen_embedding_forward_dense_weighted_codegen_cuda.cu.o CMakeFiles/fbgemm_gpu_tbe_training_forward.dir/gen_embedding_forward_dense_unweighted_codegen_cuda.cu.o CMakeFiles/fbgemm_gpu_tbe_training_forward.dir/gen_embedding_forward_ssd_weighted_codegen_cuda.cu.o CMakeFiles/fbgemm_gpu_tbe_training_forward.dir/gen_embedding_forward_ssd_unweighted_codegen_cuda.cu.o CMakeFiles/fbgemm_gpu_tbe_training_forward.dir/gen_embedding_forward_split_weighted_vbe_codegen_cuda.cu.o CMakeFiles/fbgemm_gpu_tbe_training_forward.dir/gen_embedding_forward_split_weighted_vbe_kernel.cu.o CMakeFiles/fbgemm_gpu_tbe_training_forward.dir/gen_embedding_forward_split_weighted_v2_kernel.cu.o 
CMakeFiles/fbgemm_gpu_tbe_training_forward.dir/gen_embedding_forward_split_weighted_gwd_codegen_cuda.cu.o CMakeFiles/fbgemm_gpu_tbe_training_forward.dir/gen_embedding_forward_split_weighted_vbe_gwd_codegen_cuda.cu.o CMakeFiles/fbgemm_gpu_tbe_training_forward.dir/gen_embedding_forward_split_weighted_gwd_kernel.cu.o CMakeFiles/fbgemm_gpu_tbe_training_forward.dir/gen_embedding_forward_dense_weighted_vbe_codegen_cuda.cu.o CMakeFiles/fbgemm_gpu_tbe_training_forward.dir/gen_embedding_forward_dense_weighted_vbe_kernel.cu.o CMakeFiles/fbgemm_gpu_tbe_training_forward.dir/gen_embedding_forward_split_weighted_vbe_gwd_kernel.cu.o CMakeFiles/fbgemm_gpu_tbe_training_forward.dir/gen_embedding_forward_ssd_weighted_vbe_codegen_cuda.cu.o CMakeFiles/fbgemm_gpu_tbe_training_forward.dir/gen_embedding_forward_ssd_weighted_vbe_kernel.cu.o CMakeFiles/fbgemm_gpu_tbe_training_forward.dir/gen_embedding_forward_split_unweighted_vbe_codegen_cuda.cu.o CMakeFiles/fbgemm_gpu_tbe_training_forward.dir/gen_embedding_forward_split_unweighted_vbe_kernel.cu.o CMakeFiles/fbgemm_gpu_tbe_training_forward.dir/gen_embedding_forward_split_unweighted_v2_kernel.cu.o CMakeFiles/fbgemm_gpu_tbe_training_forward.dir/gen_embedding_forward_split_unweighted_gwd_codegen_cuda.cu.o CMakeFiles/fbgemm_gpu_tbe_training_forward.dir/gen_embedding_forward_split_unweighted_vbe_gwd_codegen_cuda.cu.o CMakeFiles/fbgemm_gpu_tbe_training_forward.dir/gen_embedding_forward_split_unweighted_gwd_kernel.cu.o CMakeFiles/fbgemm_gpu_tbe_training_forward.dir/gen_embedding_forward_dense_unweighted_vbe_codegen_cuda.cu.o CMakeFiles/fbgemm_gpu_tbe_training_forward.dir/gen_embedding_forward_dense_unweighted_vbe_kernel.cu.o CMakeFiles/fbgemm_gpu_tbe_training_forward.dir/gen_embedding_forward_split_unweighted_vbe_gwd_kernel.cu.o CMakeFiles/fbgemm_gpu_tbe_training_forward.dir/gen_embedding_forward_ssd_unweighted_vbe_codegen_cuda.cu.o CMakeFiles/fbgemm_gpu_tbe_training_forward.dir/gen_embedding_forward_ssd_unweighted_vbe_kernel.cu.o 
CMakeFiles/fbgemm_gpu_tbe_training_forward.dir/gen_embedding_forward_split_unweighted_nobag_kernel_small.cu.o CMakeFiles/fbgemm_gpu_tbe_training_forward.dir/gen_embedding_forward_dense_unweighted_nobag_kernel_small.cu.o CMakeFiles/fbgemm_gpu_tbe_training_forward.dir/gen_embedding_forward_ssd_unweighted_nobag_kernel_small.cu.o -L/lib/intel64 -L/lib/intel64_win -L/lib/win-x64 -L/github/home/miniconda/envs/build_binary/lib/stubs -Wl,-rpath,"\$ORIGIN" /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/lib/libc10.so /github/home/miniconda/envs/build_binary/lib/libnvrtc.so /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/lib/libc10_cuda.so /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/lib/libnccl.so.2 /github/home/miniconda/envs/build_binary/lib/stubs/libcuda.so fbgemm_gpu_tbe_common.so /github/home/miniconda/envs/build_binary/lib/stubs/libnvidia-ml.so -Wl,--no-as-needed,"/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/lib/libtorch.so" -Wl,--as-needed /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/lib/libtorch.so -Wl,--no-as-needed,"/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/lib/libtorch_cpu.so" -Wl,--as-needed -Wl,--no-as-needed,"/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/lib/libtorch_cuda.so" -Wl,--as-needed /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/lib/libc10_cuda.so /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/lib/libc10.so /github/home/miniconda/envs/build_binary/lib/libcudart.so -lcudadevrt -lcudart_static -ldl -Wl,-rpath-link,/__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build && : 2025-05-07T20:00:31.0617721Z [435/608] /github/home/miniconda/envs/build_binary/bin/c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE 
-Dfbgemm_gpu_tbe_training_backward_split_host_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/lib -fopenmp=libgomp -stdlib=libstdc++ -I/github/home/miniconda/envs/build_binary/include -O3 -DNDEBUG -std=c++17 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -mavx2 -mf16c -mfma -fopenmp -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward_split_host.dir/gen_embedding_backward_split_rowwise_adagrad_with_weight_decay.cpp.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward_split_host.dir/gen_embedding_backward_split_rowwise_adagrad_with_weight_decay.cpp.o.d -o CMakeFiles/fbgemm_gpu_tbe_training_backward_split_host.dir/gen_embedding_backward_split_rowwise_adagrad_with_weight_decay.cpp.o -c 
/__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_backward_split_rowwise_adagrad_with_weight_decay.cpp 2025-05-07T20:00:31.0638692Z clang-16: warning: argument unused during compilation: '-L/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/lib' [-Wunused-command-line-argument] 2025-05-07T20:00:31.6754352Z [436/608] /github/home/miniconda/envs/build_binary/bin/c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_split_host_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/lib -fopenmp=libgomp -stdlib=libstdc++ -I/github/home/miniconda/envs/build_binary/include -O3 -DNDEBUG -std=c++17 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -mavx2 -mf16c -mfma -fopenmp 
-MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward_split_host.dir/gen_embedding_backward_sgd_split_unweighted_meta.cpp.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward_split_host.dir/gen_embedding_backward_sgd_split_unweighted_meta.cpp.o.d -o CMakeFiles/fbgemm_gpu_tbe_training_backward_split_host.dir/gen_embedding_backward_sgd_split_unweighted_meta.cpp.o -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_backward_sgd_split_unweighted_meta.cpp 2025-05-07T20:00:31.6773518Z clang-16: warning: argument unused during compilation: '-L/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/lib' [-Wunused-command-line-argument] 2025-05-07T20:00:31.7649113Z [437/608] /github/home/miniconda/envs/build_binary/bin/c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_split_host_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include 
-DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/lib -fopenmp=libgomp -stdlib=libstdc++ -I/github/home/miniconda/envs/build_binary/include -O3 -DNDEBUG -std=c++17 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -mavx2 -mf16c -mfma -fopenmp -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward_split_host.dir/gen_embedding_backward_adam_split_weighted_meta.cpp.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward_split_host.dir/gen_embedding_backward_adam_split_weighted_meta.cpp.o.d -o CMakeFiles/fbgemm_gpu_tbe_training_backward_split_host.dir/gen_embedding_backward_adam_split_weighted_meta.cpp.o -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_backward_adam_split_weighted_meta.cpp 2025-05-07T20:00:31.7667696Z clang-16: warning: argument unused during compilation: '-L/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/lib' [-Wunused-command-line-argument] 2025-05-07T20:00:32.0788073Z [438/608] /github/home/miniconda/envs/build_binary/bin/c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_split_host_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem 
/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/lib -fopenmp=libgomp -stdlib=libstdc++ -I/github/home/miniconda/envs/build_binary/include -O3 -DNDEBUG -std=c++17 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -mavx2 -mf16c -mfma -fopenmp -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward_split_host.dir/gen_embedding_backward_adam_split_unweighted_meta.cpp.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward_split_host.dir/gen_embedding_backward_adam_split_unweighted_meta.cpp.o.d -o CMakeFiles/fbgemm_gpu_tbe_training_backward_split_host.dir/gen_embedding_backward_adam_split_unweighted_meta.cpp.o -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_backward_adam_split_unweighted_meta.cpp 2025-05-07T20:00:32.0808665Z clang-16: warning: argument unused during compilation: '-L/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/lib' [-Wunused-command-line-argument] 2025-05-07T20:00:32.1042428Z [439/608] /github/home/miniconda/envs/build_binary/bin/c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_split_host_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include 
-I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/lib -fopenmp=libgomp -stdlib=libstdc++ -I/github/home/miniconda/envs/build_binary/include -O3 -DNDEBUG -std=c++17 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -mavx2 -mf16c -mfma -fopenmp -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward_split_host.dir/gen_embedding_backward_split_rowwise_weighted_adagrad.cpp.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward_split_host.dir/gen_embedding_backward_split_rowwise_weighted_adagrad.cpp.o.d -o CMakeFiles/fbgemm_gpu_tbe_training_backward_split_host.dir/gen_embedding_backward_split_rowwise_weighted_adagrad.cpp.o -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_backward_split_rowwise_weighted_adagrad.cpp 2025-05-07T20:00:32.1062441Z clang-16: warning: argument unused during compilation: '-L/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/lib' [-Wunused-command-line-argument] 2025-05-07T20:00:32.2614367Z [440/608] /github/home/miniconda/envs/build_binary/bin/c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED 
-DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_split_host_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/lib -fopenmp=libgomp -stdlib=libstdc++ -I/github/home/miniconda/envs/build_binary/include -O3 -DNDEBUG -std=c++17 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -mavx2 -mf16c -mfma -fopenmp -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward_split_host.dir/gen_embedding_backward_split_adam.cpp.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward_split_host.dir/gen_embedding_backward_split_adam.cpp.o.d -o CMakeFiles/fbgemm_gpu_tbe_training_backward_split_host.dir/gen_embedding_backward_split_adam.cpp.o -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_backward_split_adam.cpp 2025-05-07T20:00:32.2633346Z clang-16: warning: argument 
unused during compilation: '-L/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/lib' [-Wunused-command-line-argument] 2025-05-07T20:00:33.5690530Z [441/608] /github/home/miniconda/envs/build_binary/bin/c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_split_host_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/lib -fopenmp=libgomp -stdlib=libstdc++ -I/github/home/miniconda/envs/build_binary/include -O3 -DNDEBUG -std=c++17 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -mavx2 -mf16c -mfma -fopenmp -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward_split_host.dir/gen_embedding_backward_lamb_split_weighted_meta.cpp.o -MF 
CMakeFiles/fbgemm_gpu_tbe_training_backward_split_host.dir/gen_embedding_backward_lamb_split_weighted_meta.cpp.o.d -o CMakeFiles/fbgemm_gpu_tbe_training_backward_split_host.dir/gen_embedding_backward_lamb_split_weighted_meta.cpp.o -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_backward_lamb_split_weighted_meta.cpp 2025-05-07T20:00:33.5709476Z clang-16: warning: argument unused during compilation: '-L/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/lib' [-Wunused-command-line-argument] 2025-05-07T20:00:33.8794904Z [442/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_vbe_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/include -DONNX_NAMESPACE=onnx_c2 -gencode 
arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++17 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward_vbe.dir/gen_embedding_backward_rowwise_adagrad_with_counter_split_weighted_vbe_kernel_warp.cu.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward_vbe.dir/gen_embedding_backward_rowwise_adagrad_with_counter_split_weighted_vbe_kernel_warp.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_backward_rowwise_adagrad_with_counter_split_weighted_vbe_kernel_warp.cu -o CMakeFiles/fbgemm_gpu_tbe_training_backward_vbe.dir/gen_embedding_backward_rowwise_adagrad_with_counter_split_weighted_vbe_kernel_warp.cu.o 2025-05-07T20:00:33.8819097Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:00:33.8821121Z 2025-05-07T20:00:33.8822792Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:00:33.8824495Z 2025-05-07T20:00:33.8826274Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a 
function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:00:33.8828223Z 2025-05-07T20:00:33.8829811Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:00:33.8831665Z 2025-05-07T20:00:33.8833476Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:00:33.8835373Z 2025-05-07T20:00:33.8837084Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:00:33.8838984Z 2025-05-07T20:00:34.1169198Z [443/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_vbe_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem 
/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++17 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward_vbe.dir/gen_embedding_backward_rowwise_adagrad_with_counter_split_unweighted_vbe_kernel_warp.cu.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward_vbe.dir/gen_embedding_backward_rowwise_adagrad_with_counter_split_unweighted_vbe_kernel_warp.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_backward_rowwise_adagrad_with_counter_split_unweighted_vbe_kernel_warp.cu -o CMakeFiles/fbgemm_gpu_tbe_training_backward_vbe.dir/gen_embedding_backward_rowwise_adagrad_with_counter_split_unweighted_vbe_kernel_warp.cu.o 2025-05-07T20:00:34.1193629Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 
2025-05-07T20:00:34.1195530Z 2025-05-07T20:00:34.1197198Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:00:34.1199047Z 2025-05-07T20:00:34.1200644Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:00:34.1202683Z 2025-05-07T20:00:34.1204298Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:00:34.1206252Z 2025-05-07T20:00:34.1207933Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:00:34.1209735Z 2025-05-07T20:00:34.1211355Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:00:34.1213109Z 2025-05-07T20:00:34.2365482Z [444/608] /github/home/miniconda/envs/build_binary/bin/c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_split_host_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include 
-I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/lib -fopenmp=libgomp -stdlib=libstdc++ -I/github/home/miniconda/envs/build_binary/include -O3 -DNDEBUG -std=c++17 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -mavx2 -mf16c -mfma -fopenmp -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward_split_host.dir/gen_embedding_backward_split_rowwise_adagrad.cpp.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward_split_host.dir/gen_embedding_backward_split_rowwise_adagrad.cpp.o.d -o CMakeFiles/fbgemm_gpu_tbe_training_backward_split_host.dir/gen_embedding_backward_split_rowwise_adagrad.cpp.o -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_backward_split_rowwise_adagrad.cpp 2025-05-07T20:00:34.2386102Z clang-16: warning: argument unused during compilation: '-L/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/lib' [-Wunused-command-line-argument] 2025-05-07T20:00:34.5924856Z [445/608] /github/home/miniconda/envs/build_binary/bin/c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE 
-Dfbgemm_gpu_tbe_training_backward_split_host_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/lib -fopenmp=libgomp -stdlib=libstdc++ -I/github/home/miniconda/envs/build_binary/include -O3 -DNDEBUG -std=c++17 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -mavx2 -mf16c -mfma -fopenmp -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward_split_host.dir/gen_embedding_backward_split_none.cpp.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward_split_host.dir/gen_embedding_backward_split_none.cpp.o.d -o CMakeFiles/fbgemm_gpu_tbe_training_backward_split_host.dir/gen_embedding_backward_split_none.cpp.o -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_backward_split_none.cpp 2025-05-07T20:00:34.5942937Z clang-16: warning: argument unused during compilation: 
'-L/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/lib' [-Wunused-command-line-argument] 2025-05-07T20:00:34.6062737Z [446/608] /github/home/miniconda/envs/build_binary/bin/c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_split_host_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/lib -fopenmp=libgomp -stdlib=libstdc++ -I/github/home/miniconda/envs/build_binary/include -O3 -DNDEBUG -std=c++17 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -mavx2 -mf16c -mfma -fopenmp -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward_split_host.dir/gen_embedding_backward_lamb_split_unweighted_meta.cpp.o -MF 
CMakeFiles/fbgemm_gpu_tbe_training_backward_split_host.dir/gen_embedding_backward_lamb_split_unweighted_meta.cpp.o.d -o CMakeFiles/fbgemm_gpu_tbe_training_backward_split_host.dir/gen_embedding_backward_lamb_split_unweighted_meta.cpp.o -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_backward_lamb_split_unweighted_meta.cpp 2025-05-07T20:00:34.6081553Z clang-16: warning: argument unused during compilation: '-L/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/lib' [-Wunused-command-line-argument] 2025-05-07T20:00:34.6400512Z [447/608] /github/home/miniconda/envs/build_binary/bin/c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_index_select_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/lib 
-fopenmp=libgomp -stdlib=libstdc++ -I/github/home/miniconda/envs/build_binary/include -O3 -DNDEBUG -std=c++17 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -mavx2 -mf16c -mfma -fopenmp -MD -MT CMakeFiles/fbgemm_gpu_tbe_index_select.dir/codegen/training/index_select/batch_index_select_dim0_ops.cpp.o -MF CMakeFiles/fbgemm_gpu_tbe_index_select.dir/codegen/training/index_select/batch_index_select_dim0_ops.cpp.o.d -o CMakeFiles/fbgemm_gpu_tbe_index_select.dir/codegen/training/index_select/batch_index_select_dim0_ops.cpp.o -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/codegen/training/index_select/batch_index_select_dim0_ops.cpp 2025-05-07T20:00:34.6418795Z clang-16: warning: argument unused during compilation: '-L/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/lib' [-Wunused-command-line-argument] 2025-05-07T20:00:35.6007456Z [448/608] /github/home/miniconda/envs/build_binary/bin/c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_split_host_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem 
/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/lib -fopenmp=libgomp -stdlib=libstdc++ -I/github/home/miniconda/envs/build_binary/include -O3 -DNDEBUG -std=c++17 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -mavx2 -mf16c -mfma -fopenmp -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward_split_host.dir/gen_embedding_backward_lars_sgd_split_weighted_meta.cpp.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward_split_host.dir/gen_embedding_backward_lars_sgd_split_weighted_meta.cpp.o.d -o CMakeFiles/fbgemm_gpu_tbe_training_backward_split_host.dir/gen_embedding_backward_lars_sgd_split_weighted_meta.cpp.o -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_backward_lars_sgd_split_weighted_meta.cpp 2025-05-07T20:00:35.6026869Z clang-16: warning: argument unused during compilation: '-L/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/lib' [-Wunused-command-line-argument] 2025-05-07T20:00:35.6534913Z [449/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_vbe_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include 
-I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++17 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward_vbe.dir/gen_embedding_backward_adam_split_unweighted_vbe_cuda.cu.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward_vbe.dir/gen_embedding_backward_adam_split_unweighted_vbe_cuda.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_backward_adam_split_unweighted_vbe_cuda.cu -o CMakeFiles/fbgemm_gpu_tbe_training_backward_vbe.dir/gen_embedding_backward_adam_split_unweighted_vbe_cuda.cu.o 2025-05-07T20:00:35.6557867Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a 
function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:00:35.6559754Z 2025-05-07T20:00:35.6561429Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:00:35.6563384Z 2025-05-07T20:00:35.6565079Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:00:35.6566949Z 2025-05-07T20:00:35.6568667Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:00:35.6570589Z 2025-05-07T20:00:35.6572257Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:00:35.6574163Z 2025-05-07T20:00:35.6575884Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:00:35.6577796Z 2025-05-07T20:00:35.9490451Z [450/608] /github/home/miniconda/envs/build_binary/bin/c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_split_host_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src 
-I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/lib -fopenmp=libgomp -stdlib=libstdc++ -I/github/home/miniconda/envs/build_binary/include -O3 -DNDEBUG -std=c++17 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -mavx2 -mf16c -mfma -fopenmp -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward_split_host.dir/gen_embedding_backward_split_partial_rowwise_adam.cpp.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward_split_host.dir/gen_embedding_backward_split_partial_rowwise_adam.cpp.o.d -o CMakeFiles/fbgemm_gpu_tbe_training_backward_split_host.dir/gen_embedding_backward_split_partial_rowwise_adam.cpp.o -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_backward_split_partial_rowwise_adam.cpp 2025-05-07T20:00:35.9509983Z clang-16: warning: argument unused during compilation: '-L/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/lib' [-Wunused-command-line-argument] 2025-05-07T20:00:36.7409960Z [451/608] 
/github/home/miniconda/envs/build_binary/bin/c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_split_host_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/lib -fopenmp=libgomp -stdlib=libstdc++ -I/github/home/miniconda/envs/build_binary/include -O3 -DNDEBUG -std=c++17 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -mavx2 -mf16c -mfma -fopenmp -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward_split_host.dir/gen_embedding_backward_lars_sgd_split_unweighted_meta.cpp.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward_split_host.dir/gen_embedding_backward_lars_sgd_split_unweighted_meta.cpp.o.d -o CMakeFiles/fbgemm_gpu_tbe_training_backward_split_host.dir/gen_embedding_backward_lars_sgd_split_unweighted_meta.cpp.o -c 
/__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_backward_lars_sgd_split_unweighted_meta.cpp 2025-05-07T20:00:36.7428881Z clang-16: warning: argument unused during compilation: '-L/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/lib' [-Wunused-command-line-argument] 2025-05-07T20:00:36.9019081Z [452/608] /github/home/miniconda/envs/build_binary/bin/c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_split_host_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/lib -fopenmp=libgomp -stdlib=libstdc++ -I/github/home/miniconda/envs/build_binary/include -O3 -DNDEBUG -std=c++17 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -mavx2 -mf16c -mfma -fopenmp -MD -MT 
CMakeFiles/fbgemm_gpu_tbe_training_backward_split_host.dir/gen_embedding_backward_partial_rowwise_adam_split_weighted_meta.cpp.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward_split_host.dir/gen_embedding_backward_partial_rowwise_adam_split_weighted_meta.cpp.o.d -o CMakeFiles/fbgemm_gpu_tbe_training_backward_split_host.dir/gen_embedding_backward_partial_rowwise_adam_split_weighted_meta.cpp.o -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_backward_partial_rowwise_adam_split_weighted_meta.cpp 2025-05-07T20:00:36.9038583Z clang-16: warning: argument unused during compilation: '-L/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/lib' [-Wunused-command-line-argument] 2025-05-07T20:00:36.9362359Z [453/608] /github/home/miniconda/envs/build_binary/bin/c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_split_host_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem 
/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/lib -fopenmp=libgomp -stdlib=libstdc++ -I/github/home/miniconda/envs/build_binary/include -O3 -DNDEBUG -std=c++17 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -mavx2 -mf16c -mfma -fopenmp -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward_split_host.dir/gen_embedding_backward_partial_rowwise_adam_split_unweighted_meta.cpp.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward_split_host.dir/gen_embedding_backward_partial_rowwise_adam_split_unweighted_meta.cpp.o.d -o CMakeFiles/fbgemm_gpu_tbe_training_backward_split_host.dir/gen_embedding_backward_partial_rowwise_adam_split_unweighted_meta.cpp.o -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_backward_partial_rowwise_adam_split_unweighted_meta.cpp 2025-05-07T20:00:36.9382577Z clang-16: warning: argument unused during compilation: '-L/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/lib' [-Wunused-command-line-argument] 2025-05-07T20:00:37.7125143Z [454/608] /github/home/miniconda/envs/build_binary/bin/c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_split_host_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include 
-I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/lib -fopenmp=libgomp -stdlib=libstdc++ -I/github/home/miniconda/envs/build_binary/include -O3 -DNDEBUG -std=c++17 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -mavx2 -mf16c -mfma -fopenmp -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward_split_host.dir/gen_embedding_backward_partial_rowwise_lamb_split_unweighted_meta.cpp.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward_split_host.dir/gen_embedding_backward_partial_rowwise_lamb_split_unweighted_meta.cpp.o.d -o CMakeFiles/fbgemm_gpu_tbe_training_backward_split_host.dir/gen_embedding_backward_partial_rowwise_lamb_split_unweighted_meta.cpp.o -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_backward_partial_rowwise_lamb_split_unweighted_meta.cpp 2025-05-07T20:00:37.7139691Z clang-16: warning: argument unused during compilation: '-L/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/lib' [-Wunused-command-line-argument] 2025-05-07T20:00:37.7958828Z [455/608] /github/home/miniconda/envs/build_binary/bin/c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_split_host_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu 
-I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/lib -fopenmp=libgomp -stdlib=libstdc++ -I/github/home/miniconda/envs/build_binary/include -O3 -DNDEBUG -std=c++17 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -mavx2 -mf16c -mfma -fopenmp -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward_split_host.dir/gen_embedding_backward_split_partial_rowwise_lamb.cpp.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward_split_host.dir/gen_embedding_backward_split_partial_rowwise_lamb.cpp.o.d -o CMakeFiles/fbgemm_gpu_tbe_training_backward_split_host.dir/gen_embedding_backward_split_partial_rowwise_lamb.cpp.o -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_backward_split_partial_rowwise_lamb.cpp 2025-05-07T20:00:37.7977382Z clang-16: warning: argument unused during compilation: 
'-L/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/lib' [-Wunused-command-line-argument] 2025-05-07T20:00:37.9792538Z [456/608] /github/home/miniconda/envs/build_binary/bin/c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_split_host_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/lib -fopenmp=libgomp -stdlib=libstdc++ -I/github/home/miniconda/envs/build_binary/include -O3 -DNDEBUG -std=c++17 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -mavx2 -mf16c -mfma -fopenmp -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward_split_host.dir/gen_embedding_backward_partial_rowwise_lamb_split_weighted_meta.cpp.o -MF 
CMakeFiles/fbgemm_gpu_tbe_training_backward_split_host.dir/gen_embedding_backward_partial_rowwise_lamb_split_weighted_meta.cpp.o.d -o CMakeFiles/fbgemm_gpu_tbe_training_backward_split_host.dir/gen_embedding_backward_partial_rowwise_lamb_split_weighted_meta.cpp.o -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_backward_partial_rowwise_lamb_split_weighted_meta.cpp 2025-05-07T20:00:37.9809765Z clang-16: warning: argument unused during compilation: '-L/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/lib' [-Wunused-command-line-argument] 2025-05-07T20:00:38.3101712Z [457/608] /github/home/miniconda/envs/build_binary/bin/c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_split_host_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA 
-L/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/lib -fopenmp=libgomp -stdlib=libstdc++ -I/github/home/miniconda/envs/build_binary/include -O3 -DNDEBUG -std=c++17 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -mavx2 -mf16c -mfma -fopenmp -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward_split_host.dir/gen_embedding_backward_rowwise_adagrad_with_counter_split_weighted_meta.cpp.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward_split_host.dir/gen_embedding_backward_rowwise_adagrad_with_counter_split_weighted_meta.cpp.o.d -o CMakeFiles/fbgemm_gpu_tbe_training_backward_split_host.dir/gen_embedding_backward_rowwise_adagrad_with_counter_split_weighted_meta.cpp.o -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_backward_rowwise_adagrad_with_counter_split_weighted_meta.cpp 2025-05-07T20:00:38.3120080Z clang-16: warning: argument unused during compilation: '-L/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/lib' [-Wunused-command-line-argument] 2025-05-07T20:00:38.4223074Z [458/608] /github/home/miniconda/envs/build_binary/bin/c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_split_host_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem 
/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/lib -fopenmp=libgomp -stdlib=libstdc++ -I/github/home/miniconda/envs/build_binary/include -O3 -DNDEBUG -std=c++17 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -mavx2 -mf16c -mfma -fopenmp -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward_split_host.dir/gen_embedding_backward_none_split_weighted_meta.cpp.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward_split_host.dir/gen_embedding_backward_none_split_weighted_meta.cpp.o.d -o CMakeFiles/fbgemm_gpu_tbe_training_backward_split_host.dir/gen_embedding_backward_none_split_weighted_meta.cpp.o -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_backward_none_split_weighted_meta.cpp 2025-05-07T20:00:38.4240248Z clang-16: warning: argument unused during compilation: '-L/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/lib' [-Wunused-command-line-argument] 2025-05-07T20:00:38.4689286Z [459/608] /github/home/miniconda/envs/build_binary/bin/c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_split_host_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include 
-I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/lib -fopenmp=libgomp -stdlib=libstdc++ -I/github/home/miniconda/envs/build_binary/include -O3 -DNDEBUG -std=c++17 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -mavx2 -mf16c -mfma -fopenmp -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward_split_host.dir/gen_embedding_backward_none_split_unweighted_meta.cpp.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward_split_host.dir/gen_embedding_backward_none_split_unweighted_meta.cpp.o.d -o CMakeFiles/fbgemm_gpu_tbe_training_backward_split_host.dir/gen_embedding_backward_none_split_unweighted_meta.cpp.o -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_backward_none_split_unweighted_meta.cpp 2025-05-07T20:00:38.4705914Z clang-16: warning: argument unused during compilation: '-L/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/lib' [-Wunused-command-line-argument] 2025-05-07T20:00:38.4904305Z [460/608] /github/home/miniconda/envs/build_binary/bin/c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC 
-DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_split_host_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/lib -fopenmp=libgomp -stdlib=libstdc++ -I/github/home/miniconda/envs/build_binary/include -O3 -DNDEBUG -std=c++17 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -mavx2 -mf16c -mfma -fopenmp -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward_split_host.dir/gen_embedding_backward_rowwise_adagrad_with_counter_split_unweighted_meta.cpp.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward_split_host.dir/gen_embedding_backward_rowwise_adagrad_with_counter_split_unweighted_meta.cpp.o.d -o CMakeFiles/fbgemm_gpu_tbe_training_backward_split_host.dir/gen_embedding_backward_rowwise_adagrad_with_counter_split_unweighted_meta.cpp.o -c 
/__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_backward_rowwise_adagrad_with_counter_split_unweighted_meta.cpp 2025-05-07T20:00:38.4922700Z clang-16: warning: argument unused during compilation: '-L/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/lib' [-Wunused-command-line-argument] 2025-05-07T20:00:41.3194669Z [461/608] /github/home/miniconda/envs/build_binary/bin/c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_py_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/lib -fopenmp=libgomp -stdlib=libstdc++ -I/github/home/miniconda/envs/build_binary/include -O3 -DNDEBUG -std=c++17 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -mavx2 -mf16c -mfma -fopenmp -MD -MT 
CMakeFiles/fbgemm_gpu_py.dir/src/memory_utils/memory_utils.cpp.o -MF CMakeFiles/fbgemm_gpu_py.dir/src/memory_utils/memory_utils.cpp.o.d -o CMakeFiles/fbgemm_gpu_py.dir/src/memory_utils/memory_utils.cpp.o -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/src/memory_utils/memory_utils.cpp 2025-05-07T20:00:41.3210284Z clang-16: warning: argument unused during compilation: '-L/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/lib' [-Wunused-command-line-argument] 2025-05-07T20:00:42.5814495Z [462/608] /github/home/miniconda/envs/build_binary/bin/c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_py_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/lib -fopenmp=libgomp -stdlib=libstdc++ -I/github/home/miniconda/envs/build_binary/include -O3 -DNDEBUG -std=c++17 
-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -mavx2 -mf16c -mfma -fopenmp -MD -MT CMakeFiles/fbgemm_gpu_py.dir/src/memory_utils/memory_utils_ops.cpp.o -MF CMakeFiles/fbgemm_gpu_py.dir/src/memory_utils/memory_utils_ops.cpp.o.d -o CMakeFiles/fbgemm_gpu_py.dir/src/memory_utils/memory_utils_ops.cpp.o -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/src/memory_utils/memory_utils_ops.cpp 2025-05-07T20:00:42.5830953Z clang-16: warning: argument unused during compilation: '-L/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/lib' [-Wunused-command-line-argument] 2025-05-07T20:00:42.9912509Z [463/608] /github/home/miniconda/envs/build_binary/bin/c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_py_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA 
-L/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/lib -fopenmp=libgomp -stdlib=libstdc++ -I/github/home/miniconda/envs/build_binary/include -O3 -DNDEBUG -std=c++17 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -mavx2 -mf16c -mfma -fopenmp -MD -MT CMakeFiles/fbgemm_gpu_py.dir/src/tbe/eeg/eeg_models.cpp.o -MF CMakeFiles/fbgemm_gpu_py.dir/src/tbe/eeg/eeg_models.cpp.o.d -o CMakeFiles/fbgemm_gpu_py.dir/src/tbe/eeg/eeg_models.cpp.o -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/src/tbe/eeg/eeg_models.cpp 2025-05-07T20:00:42.9935166Z clang-16: warning: argument unused during compilation: '-L/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/lib' [-Wunused-command-line-argument] 2025-05-07T20:00:43.3231609Z [464/608] /github/home/miniconda/envs/build_binary/bin/c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_split_host_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem 
/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/lib -fopenmp=libgomp -stdlib=libstdc++ -I/github/home/miniconda/envs/build_binary/include -O3 -DNDEBUG -std=c++17 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -mavx2 -mf16c -mfma -fopenmp -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward_split_host.dir/gen_embedding_backward_ssd_rowwise_adagrad.cpp.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward_split_host.dir/gen_embedding_backward_ssd_rowwise_adagrad.cpp.o.d -o CMakeFiles/fbgemm_gpu_tbe_training_backward_split_host.dir/gen_embedding_backward_ssd_rowwise_adagrad.cpp.o -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_backward_ssd_rowwise_adagrad.cpp 2025-05-07T20:00:43.3252955Z clang-16: warning: argument unused during compilation: '-L/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/lib' [-Wunused-command-line-argument] 2025-05-07T20:00:43.4511510Z [465/608] /github/home/miniconda/envs/build_binary/bin/c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_py_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem 
/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/lib -fopenmp=libgomp -stdlib=libstdc++ -I/github/home/miniconda/envs/build_binary/include -O3 -DNDEBUG -std=c++17 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -mavx2 -mf16c -mfma -fopenmp -MD -MT CMakeFiles/fbgemm_gpu_py.dir/src/merge_pooled_embedding_ops/merge_pooled_embedding_ops_cpu.cpp.o -MF CMakeFiles/fbgemm_gpu_py.dir/src/merge_pooled_embedding_ops/merge_pooled_embedding_ops_cpu.cpp.o.d -o CMakeFiles/fbgemm_gpu_py.dir/src/merge_pooled_embedding_ops/merge_pooled_embedding_ops_cpu.cpp.o -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/src/merge_pooled_embedding_ops/merge_pooled_embedding_ops_cpu.cpp 2025-05-07T20:00:43.4529989Z clang-16: warning: argument unused during compilation: '-L/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/lib' [-Wunused-command-line-argument] 2025-05-07T20:00:44.0615693Z [466/608] /github/home/miniconda/envs/build_binary/bin/c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_split_host_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include 
-I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/lib -fopenmp=libgomp -stdlib=libstdc++ -I/github/home/miniconda/envs/build_binary/include -O3 -DNDEBUG -std=c++17 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -mavx2 -mf16c -mfma -fopenmp -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward_split_host.dir/gen_embedding_backward_split_rowwise_adagrad_with_counter.cpp.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward_split_host.dir/gen_embedding_backward_split_rowwise_adagrad_with_counter.cpp.o.d -o CMakeFiles/fbgemm_gpu_tbe_training_backward_split_host.dir/gen_embedding_backward_split_rowwise_adagrad_with_counter.cpp.o -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_backward_split_rowwise_adagrad_with_counter.cpp 2025-05-07T20:00:44.0636047Z clang-16: warning: argument unused during compilation: '-L/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/lib' [-Wunused-command-line-argument] 2025-05-07T20:00:44.8028748Z [467/608] : && /github/home/miniconda/envs/build_binary/bin/c++ -fPIC -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA 
-L/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/lib -fopenmp=libgomp -stdlib=libstdc++ -I/github/home/miniconda/envs/build_binary/include -O3 -DNDEBUG -Wl,-O2 -Wl,--sort-common -Wl,--as-needed -Wl,-z,relro -Wl,-z,now -Wl,--disable-new-dtags -Wl,--gc-sections -Wl,--allow-shlib-undefined -Wl,-rpath,/github/home/miniconda/envs/build_binary/lib -Wl,-rpath-link,/github/home/miniconda/envs/build_binary/lib -L/github/home/miniconda/envs/build_binary/lib -s -shared -Wl,-soname,fbgemm_gpu_tbe_training_backward_split_host.so -o fbgemm_gpu_tbe_training_backward_split_host.so CMakeFiles/fbgemm_gpu_tbe_training_backward_split_host.dir/gen_embedding_backward_split_adagrad.cpp.o CMakeFiles/fbgemm_gpu_tbe_training_backward_split_host.dir/gen_embedding_backward_split_rowwise_adagrad.cpp.o CMakeFiles/fbgemm_gpu_tbe_training_backward_split_host.dir/gen_embedding_backward_split_sgd.cpp.o CMakeFiles/fbgemm_gpu_tbe_training_backward_split_host.dir/gen_embedding_backward_split_adam.cpp.o CMakeFiles/fbgemm_gpu_tbe_training_backward_split_host.dir/gen_embedding_backward_split_lamb.cpp.o CMakeFiles/fbgemm_gpu_tbe_training_backward_split_host.dir/gen_embedding_backward_split_lars_sgd.cpp.o CMakeFiles/fbgemm_gpu_tbe_training_backward_split_host.dir/gen_embedding_backward_split_partial_rowwise_adam.cpp.o CMakeFiles/fbgemm_gpu_tbe_training_backward_split_host.dir/gen_embedding_backward_split_partial_rowwise_lamb.cpp.o CMakeFiles/fbgemm_gpu_tbe_training_backward_split_host.dir/gen_embedding_backward_split_none.cpp.o CMakeFiles/fbgemm_gpu_tbe_training_backward_split_host.dir/gen_embedding_backward_split_rowwise_adagrad_with_counter.cpp.o CMakeFiles/fbgemm_gpu_tbe_training_backward_split_host.dir/gen_embedding_backward_split_approx_sgd.cpp.o CMakeFiles/fbgemm_gpu_tbe_training_backward_split_host.dir/gen_embedding_backward_split_approx_rowwise_adagrad.cpp.o 
CMakeFiles/fbgemm_gpu_tbe_training_backward_split_host.dir/gen_embedding_backward_split_approx_rowwise_adagrad_with_counter.cpp.o CMakeFiles/fbgemm_gpu_tbe_training_backward_split_host.dir/gen_embedding_backward_split_approx_rowwise_adagrad_with_weight_decay.cpp.o CMakeFiles/fbgemm_gpu_tbe_training_backward_split_host.dir/gen_embedding_backward_split_rowwise_adagrad_with_weight_decay.cpp.o CMakeFiles/fbgemm_gpu_tbe_training_backward_split_host.dir/gen_embedding_backward_split_rowwise_weighted_adagrad.cpp.o CMakeFiles/fbgemm_gpu_tbe_training_backward_split_host.dir/gen_embedding_backward_ssd_rowwise_adagrad.cpp.o CMakeFiles/fbgemm_gpu_tbe_training_backward_split_host.dir/gen_embedding_backward_adagrad_split_weighted_meta.cpp.o CMakeFiles/fbgemm_gpu_tbe_training_backward_split_host.dir/gen_embedding_backward_adagrad_split_unweighted_meta.cpp.o CMakeFiles/fbgemm_gpu_tbe_training_backward_split_host.dir/gen_embedding_backward_rowwise_adagrad_split_weighted_meta.cpp.o CMakeFiles/fbgemm_gpu_tbe_training_backward_split_host.dir/gen_embedding_backward_rowwise_adagrad_split_unweighted_meta.cpp.o CMakeFiles/fbgemm_gpu_tbe_training_backward_split_host.dir/gen_embedding_backward_sgd_split_weighted_meta.cpp.o CMakeFiles/fbgemm_gpu_tbe_training_backward_split_host.dir/gen_embedding_backward_sgd_split_unweighted_meta.cpp.o CMakeFiles/fbgemm_gpu_tbe_training_backward_split_host.dir/gen_embedding_backward_adam_split_weighted_meta.cpp.o CMakeFiles/fbgemm_gpu_tbe_training_backward_split_host.dir/gen_embedding_backward_adam_split_unweighted_meta.cpp.o CMakeFiles/fbgemm_gpu_tbe_training_backward_split_host.dir/gen_embedding_backward_lamb_split_weighted_meta.cpp.o CMakeFiles/fbgemm_gpu_tbe_training_backward_split_host.dir/gen_embedding_backward_lamb_split_unweighted_meta.cpp.o CMakeFiles/fbgemm_gpu_tbe_training_backward_split_host.dir/gen_embedding_backward_lars_sgd_split_weighted_meta.cpp.o 
CMakeFiles/fbgemm_gpu_tbe_training_backward_split_host.dir/gen_embedding_backward_lars_sgd_split_unweighted_meta.cpp.o CMakeFiles/fbgemm_gpu_tbe_training_backward_split_host.dir/gen_embedding_backward_partial_rowwise_adam_split_weighted_meta.cpp.o CMakeFiles/fbgemm_gpu_tbe_training_backward_split_host.dir/gen_embedding_backward_partial_rowwise_adam_split_unweighted_meta.cpp.o CMakeFiles/fbgemm_gpu_tbe_training_backward_split_host.dir/gen_embedding_backward_partial_rowwise_lamb_split_weighted_meta.cpp.o CMakeFiles/fbgemm_gpu_tbe_training_backward_split_host.dir/gen_embedding_backward_partial_rowwise_lamb_split_unweighted_meta.cpp.o CMakeFiles/fbgemm_gpu_tbe_training_backward_split_host.dir/gen_embedding_backward_none_split_weighted_meta.cpp.o CMakeFiles/fbgemm_gpu_tbe_training_backward_split_host.dir/gen_embedding_backward_none_split_unweighted_meta.cpp.o CMakeFiles/fbgemm_gpu_tbe_training_backward_split_host.dir/gen_embedding_backward_rowwise_adagrad_with_counter_split_weighted_meta.cpp.o CMakeFiles/fbgemm_gpu_tbe_training_backward_split_host.dir/gen_embedding_backward_rowwise_adagrad_with_counter_split_unweighted_meta.cpp.o -L/lib/intel64 -L/lib/intel64_win -L/lib/win-x64 -Wl,-rpath,"\$ORIGIN" /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/lib/libc10.so /github/home/miniconda/envs/build_binary/lib/libnvrtc.so /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/lib/libc10_cuda.so /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/lib/libnccl.so.2 /github/home/miniconda/envs/build_binary/lib/stubs/libcuda.so fbgemm_gpu_config.so fbgemm_gpu_tbe_utils.so /github/home/miniconda/envs/build_binary/lib/stubs/libnvidia-ml.so -Wl,--no-as-needed,"/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/lib/libtorch.so" -Wl,--as-needed /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/lib/libtorch.so 
-Wl,--no-as-needed,"/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/lib/libtorch_cpu.so" -Wl,--as-needed -Wl,--no-as-needed,"/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/lib/libtorch_cuda.so" -Wl,--as-needed /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/lib/libc10_cuda.so /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/lib/libc10.so /github/home/miniconda/envs/build_binary/lib/libcudart.so && : 2025-05-07T20:00:45.0758082Z [468/608] /github/home/miniconda/envs/build_binary/bin/c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_py_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/lib -fopenmp=libgomp -stdlib=libstdc++ 
-I/github/home/miniconda/envs/build_binary/include -O3 -DNDEBUG -std=c++17 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -mavx2 -mf16c -mfma -fopenmp -MD -MT CMakeFiles/fbgemm_gpu_py.dir/src/permute_multi_embedding_ops/permute_multi_embedding_function.cpp.o -MF CMakeFiles/fbgemm_gpu_py.dir/src/permute_multi_embedding_ops/permute_multi_embedding_function.cpp.o.d -o CMakeFiles/fbgemm_gpu_py.dir/src/permute_multi_embedding_ops/permute_multi_embedding_function.cpp.o -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/src/permute_multi_embedding_ops/permute_multi_embedding_function.cpp 2025-05-07T20:00:45.0782674Z clang-16: warning: argument unused during compilation: '-L/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/lib' [-Wunused-command-line-argument] 2025-05-07T20:00:45.9961307Z [469/608] /github/home/miniconda/envs/build_binary/bin/c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_py_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem 
/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/lib -fopenmp=libgomp -stdlib=libstdc++ -I/github/home/miniconda/envs/build_binary/include -O3 -DNDEBUG -std=c++17 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -mavx2 -mf16c -mfma -fopenmp -MD -MT CMakeFiles/fbgemm_gpu_py.dir/src/permute_pooled_embedding_ops/permute_pooled_embedding_function.cpp.o -MF CMakeFiles/fbgemm_gpu_py.dir/src/permute_pooled_embedding_ops/permute_pooled_embedding_function.cpp.o.d -o CMakeFiles/fbgemm_gpu_py.dir/src/permute_pooled_embedding_ops/permute_pooled_embedding_function.cpp.o -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/src/permute_pooled_embedding_ops/permute_pooled_embedding_function.cpp 2025-05-07T20:00:45.9979146Z clang-16: warning: argument unused during compilation: '-L/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/lib' [-Wunused-command-line-argument] 2025-05-07T20:00:46.0786669Z [470/608] /github/home/miniconda/envs/build_binary/bin/c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_py_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem 
/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/lib -fopenmp=libgomp -stdlib=libstdc++ -I/github/home/miniconda/envs/build_binary/include -O3 -DNDEBUG -std=c++17 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -mavx2 -mf16c -mfma -fopenmp -MD -MT CMakeFiles/fbgemm_gpu_py.dir/src/layout_transform_ops/layout_transform_ops_cpu.cpp.o -MF CMakeFiles/fbgemm_gpu_py.dir/src/layout_transform_ops/layout_transform_ops_cpu.cpp.o.d -o CMakeFiles/fbgemm_gpu_py.dir/src/layout_transform_ops/layout_transform_ops_cpu.cpp.o -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/src/layout_transform_ops/layout_transform_ops_cpu.cpp 2025-05-07T20:00:46.0803797Z clang-16: warning: argument unused during compilation: '-L/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/lib' [-Wunused-command-line-argument] 2025-05-07T20:00:47.1599320Z [471/608] /github/home/miniconda/envs/build_binary/bin/c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_index_select_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include 
-I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/lib -fopenmp=libgomp -stdlib=libstdc++ -I/github/home/miniconda/envs/build_binary/include -O3 -DNDEBUG -std=c++17 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -mavx2 -mf16c -mfma -fopenmp -MD -MT CMakeFiles/fbgemm_gpu_tbe_index_select.dir/codegen/training/index_select/batch_index_select_dim0_cpu_host.cpp.o -MF CMakeFiles/fbgemm_gpu_tbe_index_select.dir/codegen/training/index_select/batch_index_select_dim0_cpu_host.cpp.o.d -o CMakeFiles/fbgemm_gpu_tbe_index_select.dir/codegen/training/index_select/batch_index_select_dim0_cpu_host.cpp.o -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/codegen/training/index_select/batch_index_select_dim0_cpu_host.cpp 2025-05-07T20:00:47.1618367Z clang-16: warning: argument unused during compilation: '-L/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/lib' [-Wunused-command-line-argument] 2025-05-07T20:00:47.4821170Z [472/608] /github/home/miniconda/envs/build_binary/bin/c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_py_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include 
-I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/lib -fopenmp=libgomp -stdlib=libstdc++ -I/github/home/miniconda/envs/build_binary/include -O3 -DNDEBUG -std=c++17 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -mavx2 -mf16c -mfma -fopenmp -MD -MT CMakeFiles/fbgemm_gpu_py.dir/src/jagged_tensor_ops/jagged_tensor_ops_meta.cpp.o -MF CMakeFiles/fbgemm_gpu_py.dir/src/jagged_tensor_ops/jagged_tensor_ops_meta.cpp.o.d -o CMakeFiles/fbgemm_gpu_py.dir/src/jagged_tensor_ops/jagged_tensor_ops_meta.cpp.o -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/src/jagged_tensor_ops/jagged_tensor_ops_meta.cpp 2025-05-07T20:00:47.4839751Z clang-16: warning: argument unused during compilation: '-L/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/lib' [-Wunused-command-line-argument] 2025-05-07T20:00:47.5864040Z [473/608] /github/home/miniconda/envs/build_binary/bin/c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL 
-DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_index_select_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/lib -fopenmp=libgomp -stdlib=libstdc++ -I/github/home/miniconda/envs/build_binary/include -O3 -DNDEBUG -std=c++17 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -mavx2 -mf16c -mfma -fopenmp -MD -MT CMakeFiles/fbgemm_gpu_tbe_index_select.dir/codegen/training/index_select/batch_index_select_dim0_host.cpp.o -MF CMakeFiles/fbgemm_gpu_tbe_index_select.dir/codegen/training/index_select/batch_index_select_dim0_host.cpp.o.d -o CMakeFiles/fbgemm_gpu_tbe_index_select.dir/codegen/training/index_select/batch_index_select_dim0_host.cpp.o -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/codegen/training/index_select/batch_index_select_dim0_host.cpp 2025-05-07T20:00:47.8712239Z clang-16: 
warning: argument unused during compilation: '-L/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/lib' [-Wunused-command-line-argument] 2025-05-07T20:00:47.8729361Z [474/608] /github/home/miniconda/envs/build_binary/bin/c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_py_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/lib -fopenmp=libgomp -stdlib=libstdc++ -I/github/home/miniconda/envs/build_binary/include -O3 -DNDEBUG -std=c++17 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -mavx2 -mf16c -mfma -fopenmp -MD -MT CMakeFiles/fbgemm_gpu_py.dir/src/quantize_ops/quantize_ops_meta.cpp.o -MF CMakeFiles/fbgemm_gpu_py.dir/src/quantize_ops/quantize_ops_meta.cpp.o.d -o 
CMakeFiles/fbgemm_gpu_py.dir/src/quantize_ops/quantize_ops_meta.cpp.o -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/src/quantize_ops/quantize_ops_meta.cpp 2025-05-07T20:00:47.8746427Z clang-16: warning: argument unused during compilation: '-L/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/lib' [-Wunused-command-line-argument] 2025-05-07T20:00:48.0441800Z [475/608] /github/home/miniconda/envs/build_binary/bin/c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_py_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/lib -fopenmp=libgomp -stdlib=libstdc++ -I/github/home/miniconda/envs/build_binary/include -O3 -DNDEBUG -std=c++17 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -mavx2 -mf16c -mfma -fopenmp -MD -MT 
CMakeFiles/fbgemm_gpu_py.dir/src/sparse_ops/sparse_async_batched_cumsum.cpp.o -MF CMakeFiles/fbgemm_gpu_py.dir/src/sparse_ops/sparse_async_batched_cumsum.cpp.o.d -o CMakeFiles/fbgemm_gpu_py.dir/src/sparse_ops/sparse_async_batched_cumsum.cpp.o -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/src/sparse_ops/sparse_async_batched_cumsum.cpp 2025-05-07T20:00:48.0457528Z clang-16: warning: argument unused during compilation: '-L/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/lib' [-Wunused-command-line-argument] 2025-05-07T20:00:48.3740860Z [476/608] /github/home/miniconda/envs/build_binary/bin/c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_embedding_inplace_ops_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/lib -fopenmp=libgomp -stdlib=libstdc++ 
-I/github/home/miniconda/envs/build_binary/include -O3 -DNDEBUG -std=c++17 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -mavx2 -mf16c -mfma -fopenmp -MD -MT CMakeFiles/fbgemm_gpu_embedding_inplace_ops.dir/src/embedding_inplace_ops/embedding_inplace_update_cpu.cpp.o -MF CMakeFiles/fbgemm_gpu_embedding_inplace_ops.dir/src/embedding_inplace_ops/embedding_inplace_update_cpu.cpp.o.d -o CMakeFiles/fbgemm_gpu_embedding_inplace_ops.dir/src/embedding_inplace_ops/embedding_inplace_update_cpu.cpp.o -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/src/embedding_inplace_ops/embedding_inplace_update_cpu.cpp 2025-05-07T20:00:48.3758535Z clang-16: warning: argument unused during compilation: '-L/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/lib' [-Wunused-command-line-argument] 2025-05-07T20:00:48.5106270Z [477/608] /github/home/miniconda/envs/build_binary/bin/c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_py_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem 
/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/lib -fopenmp=libgomp -stdlib=libstdc++ -I/github/home/miniconda/envs/build_binary/include -O3 -DNDEBUG -std=c++17 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -mavx2 -mf16c -mfma -fopenmp -MD -MT CMakeFiles/fbgemm_gpu_py.dir/src/quantize_ops/quantize_ops_cpu.cpp.o -MF CMakeFiles/fbgemm_gpu_py.dir/src/quantize_ops/quantize_ops_cpu.cpp.o.d -o CMakeFiles/fbgemm_gpu_py.dir/src/quantize_ops/quantize_ops_cpu.cpp.o -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/src/quantize_ops/quantize_ops_cpu.cpp 2025-05-07T20:00:48.5122618Z clang-16: warning: argument unused during compilation: '-L/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/lib' [-Wunused-command-line-argument] 2025-05-07T20:00:48.5873068Z [478/608] /github/home/miniconda/envs/build_binary/bin/c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_embedding_inplace_ops_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include 
-I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/lib -fopenmp=libgomp -stdlib=libstdc++ -I/github/home/miniconda/envs/build_binary/include -O3 -DNDEBUG -std=c++17 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -mavx2 -mf16c -mfma -fopenmp -MD -MT CMakeFiles/fbgemm_gpu_embedding_inplace_ops.dir/src/embedding_inplace_ops/embedding_inplace_update_gpu.cpp.o -MF CMakeFiles/fbgemm_gpu_embedding_inplace_ops.dir/src/embedding_inplace_ops/embedding_inplace_update_gpu.cpp.o.d -o CMakeFiles/fbgemm_gpu_embedding_inplace_ops.dir/src/embedding_inplace_ops/embedding_inplace_update_gpu.cpp.o -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/src/embedding_inplace_ops/embedding_inplace_update_gpu.cpp 2025-05-07T20:00:48.5890136Z clang-16: warning: argument unused during compilation: '-L/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/lib' [-Wunused-command-line-argument] 2025-05-07T20:00:48.8778578Z [479/608] /github/home/miniconda/envs/build_binary/bin/c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_py_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include 
-I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/lib -fopenmp=libgomp -stdlib=libstdc++ -I/github/home/miniconda/envs/build_binary/include -O3 -DNDEBUG -std=c++17 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -mavx2 -mf16c -mfma -fopenmp -MD -MT CMakeFiles/fbgemm_gpu_py.dir/src/permute_pooled_embedding_ops/permute_pooled_embedding_ops_cpu.cpp.o -MF CMakeFiles/fbgemm_gpu_py.dir/src/permute_pooled_embedding_ops/permute_pooled_embedding_ops_cpu.cpp.o.d -o CMakeFiles/fbgemm_gpu_py.dir/src/permute_pooled_embedding_ops/permute_pooled_embedding_ops_cpu.cpp.o -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/src/permute_pooled_embedding_ops/permute_pooled_embedding_ops_cpu.cpp 2025-05-07T20:00:48.8795787Z clang-16: warning: argument unused during compilation: '-L/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/lib' [-Wunused-command-line-argument] 2025-05-07T20:00:50.3514615Z [480/608] /github/home/miniconda/envs/build_binary/bin/c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_py_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include 
-I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/lib -fopenmp=libgomp -stdlib=libstdc++ -I/github/home/miniconda/envs/build_binary/include -O3 -DNDEBUG -std=c++17 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -mavx2 -mf16c -mfma -fopenmp -MD -MT CMakeFiles/fbgemm_gpu_py.dir/src/input_combine_ops/input_combine_cpu.cpp.o -MF CMakeFiles/fbgemm_gpu_py.dir/src/input_combine_ops/input_combine_cpu.cpp.o.d -o CMakeFiles/fbgemm_gpu_py.dir/src/input_combine_ops/input_combine_cpu.cpp.o -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/src/input_combine_ops/input_combine_cpu.cpp 2025-05-07T20:00:50.3530388Z clang-16: warning: argument unused during compilation: '-L/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/lib' [-Wunused-command-line-argument] 2025-05-07T20:00:50.8928869Z [481/608] /github/home/miniconda/envs/build_binary/bin/c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_py_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu 
-I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/lib -fopenmp=libgomp -stdlib=libstdc++ -I/github/home/miniconda/envs/build_binary/include -O3 -DNDEBUG -std=c++17 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -mavx2 -mf16c -mfma -fopenmp -MD -MT CMakeFiles/fbgemm_gpu_py.dir/src/topology_utils.cpp.o -MF CMakeFiles/fbgemm_gpu_py.dir/src/topology_utils.cpp.o.d -o CMakeFiles/fbgemm_gpu_py.dir/src/topology_utils.cpp.o -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/src/topology_utils.cpp 2025-05-07T20:00:50.8945510Z clang-16: warning: argument unused during compilation: '-L/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/lib' [-Wunused-command-line-argument] 2025-05-07T20:00:51.0355187Z [482/608] /github/home/miniconda/envs/build_binary/bin/c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_py_EXPORTS 
-I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/lib -fopenmp=libgomp -stdlib=libstdc++ -I/github/home/miniconda/envs/build_binary/include -O3 -DNDEBUG -std=c++17 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -mavx2 -mf16c -mfma -fopenmp -MD -MT CMakeFiles/fbgemm_gpu_py.dir/src/permute_multi_embedding_ops/permute_multi_embedding_ops_cpu.cpp.o -MF CMakeFiles/fbgemm_gpu_py.dir/src/permute_multi_embedding_ops/permute_multi_embedding_ops_cpu.cpp.o.d -o CMakeFiles/fbgemm_gpu_py.dir/src/permute_multi_embedding_ops/permute_multi_embedding_ops_cpu.cpp.o -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/src/permute_multi_embedding_ops/permute_multi_embedding_ops_cpu.cpp 2025-05-07T20:00:51.0373310Z clang-16: warning: argument unused during compilation: 
'-L/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/lib' [-Wunused-command-line-argument] 2025-05-07T20:00:51.5548213Z [483/608] /github/home/miniconda/envs/build_binary/bin/c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_py_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/lib -fopenmp=libgomp -stdlib=libstdc++ -I/github/home/miniconda/envs/build_binary/include -O3 -DNDEBUG -std=c++17 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -mavx2 -mf16c -mfma -fopenmp -MD -MT CMakeFiles/fbgemm_gpu_py.dir/src/sparse_ops/sparse_ops_meta.cpp.o -MF CMakeFiles/fbgemm_gpu_py.dir/src/sparse_ops/sparse_ops_meta.cpp.o.d -o CMakeFiles/fbgemm_gpu_py.dir/src/sparse_ops/sparse_ops_meta.cpp.o -c 
/__w/FBGEMM/FBGEMM/fbgemm_gpu/src/sparse_ops/sparse_ops_meta.cpp 2025-05-07T20:00:51.5564085Z clang-16: warning: argument unused during compilation: '-L/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/lib' [-Wunused-command-line-argument] 2025-05-07T20:00:52.1219952Z [484/608] /github/home/miniconda/envs/build_binary/bin/c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_py_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/lib -fopenmp=libgomp -stdlib=libstdc++ -I/github/home/miniconda/envs/build_binary/include -O3 -DNDEBUG -std=c++17 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -mavx2 -mf16c -mfma -fopenmp -MD -MT 
CMakeFiles/fbgemm_gpu_py.dir/src/permute_pooled_embedding_ops/permute_pooled_embedding_ops_split_cpu.cpp.o -MF CMakeFiles/fbgemm_gpu_py.dir/src/permute_pooled_embedding_ops/permute_pooled_embedding_ops_split_cpu.cpp.o.d -o CMakeFiles/fbgemm_gpu_py.dir/src/permute_pooled_embedding_ops/permute_pooled_embedding_ops_split_cpu.cpp.o -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/src/permute_pooled_embedding_ops/permute_pooled_embedding_ops_split_cpu.cpp 2025-05-07T20:00:52.1238230Z clang-16: warning: argument unused during compilation: '-L/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/lib' [-Wunused-command-line-argument] 2025-05-07T20:00:52.6769233Z [485/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_vbe_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -isystem 
/github/home/miniconda/envs/build_binary/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++17 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward_vbe.dir/gen_embedding_backward_rowwise_adagrad_with_counter_split_weighted_vbe_kernel_cta.cu.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward_vbe.dir/gen_embedding_backward_rowwise_adagrad_with_counter_split_weighted_vbe_kernel_cta.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_backward_rowwise_adagrad_with_counter_split_weighted_vbe_kernel_cta.cu -o CMakeFiles/fbgemm_gpu_tbe_training_backward_vbe.dir/gen_embedding_backward_rowwise_adagrad_with_counter_split_weighted_vbe_kernel_cta.cu.o 2025-05-07T20:00:52.6793400Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:00:52.6795312Z 2025-05-07T20:00:52.6796963Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:00:52.6799124Z 2025-05-07T20:00:52.6800774Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): 
warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:00:52.6802850Z 2025-05-07T20:00:52.6804548Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:00:52.6806408Z 2025-05-07T20:00:52.6808097Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:00:52.6809963Z 2025-05-07T20:00:52.6811625Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:00:52.6813707Z 2025-05-07T20:00:53.9504942Z [486/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_dense_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem 
/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++17 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward_dense.dir/gen_embedding_backward_dense_split_weighted_kernel_warp.cu.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward_dense.dir/gen_embedding_backward_dense_split_weighted_kernel_warp.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_backward_dense_split_weighted_kernel_warp.cu -o CMakeFiles/fbgemm_gpu_tbe_training_backward_dense.dir/gen_embedding_backward_dense_split_weighted_kernel_warp.cu.o 2025-05-07T20:00:53.9526641Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:00:53.9528583Z 2025-05-07T20:00:53.9530207Z 
/__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:00:53.9532256Z 2025-05-07T20:00:53.9533807Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:00:53.9535577Z 2025-05-07T20:00:53.9537150Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:00:53.9539001Z 2025-05-07T20:00:53.9540599Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:00:53.9542529Z 2025-05-07T20:00:53.9544070Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:00:53.9545922Z 2025-05-07T20:00:55.3170472Z [487/608] /github/home/miniconda/envs/build_binary/bin/c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_py_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include 
-I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/lib -fopenmp=libgomp -stdlib=libstdc++ -I/github/home/miniconda/envs/build_binary/include -O3 -DNDEBUG -std=c++17 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -mavx2 -mf16c -mfma -fopenmp -MD -MT CMakeFiles/fbgemm_gpu_py.dir/src/layout_transform_ops/layout_transform_ops_gpu.cpp.o -MF CMakeFiles/fbgemm_gpu_py.dir/src/layout_transform_ops/layout_transform_ops_gpu.cpp.o.d -o CMakeFiles/fbgemm_gpu_py.dir/src/layout_transform_ops/layout_transform_ops_gpu.cpp.o -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/src/layout_transform_ops/layout_transform_ops_gpu.cpp 2025-05-07T20:00:55.3186987Z clang-16: warning: argument unused during compilation: '-L/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/lib' [-Wunused-command-line-argument] 2025-05-07T20:00:55.6427186Z [488/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_dense_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu 
-I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++17 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward_dense.dir/gen_embedding_backward_dense_split_unweighted_kernel_warp.cu.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward_dense.dir/gen_embedding_backward_dense_split_unweighted_kernel_warp.cu.o.d -x cu -c 
/__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_backward_dense_split_unweighted_kernel_warp.cu -o CMakeFiles/fbgemm_gpu_tbe_training_backward_dense.dir/gen_embedding_backward_dense_split_unweighted_kernel_warp.cu.o 2025-05-07T20:00:55.6450730Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:00:55.6452729Z 2025-05-07T20:00:55.6454465Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:00:55.6456371Z 2025-05-07T20:00:55.6458057Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:00:55.6459924Z 2025-05-07T20:00:55.6461604Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:00:55.6463525Z 2025-05-07T20:00:55.6465178Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:00:55.6467070Z 2025-05-07T20:00:55.6468741Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:00:55.6470617Z 
2025-05-07T20:00:55.8066712Z [489/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_dense_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++17 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ 
-D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward_dense.dir/gen_embedding_backward_dense_split_unweighted_nobag_kernel_warp.cu.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward_dense.dir/gen_embedding_backward_dense_split_unweighted_nobag_kernel_warp.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_backward_dense_split_unweighted_nobag_kernel_warp.cu -o CMakeFiles/fbgemm_gpu_tbe_training_backward_dense.dir/gen_embedding_backward_dense_split_unweighted_nobag_kernel_warp.cu.o 2025-05-07T20:00:55.8087922Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:00:55.8089553Z 2025-05-07T20:00:55.8091057Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:00:55.8092789Z 2025-05-07T20:00:55.8094334Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:00:55.8096094Z 2025-05-07T20:00:55.8097642Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:00:55.8099346Z 2025-05-07T20:00:55.8100847Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its 
first declaration 2025-05-07T20:00:55.8102757Z 2025-05-07T20:00:55.8104244Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:00:55.8105895Z 2025-05-07T20:00:55.8563637Z [490/608] /github/home/miniconda/envs/build_binary/bin/c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_py_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/lib -fopenmp=libgomp -stdlib=libstdc++ -I/github/home/miniconda/envs/build_binary/include -O3 -DNDEBUG -std=c++17 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -mavx2 -mf16c -mfma -fopenmp -MD -MT 
CMakeFiles/fbgemm_gpu_py.dir/src/metric_ops/metric_ops_host.cpp.o -MF CMakeFiles/fbgemm_gpu_py.dir/src/metric_ops/metric_ops_host.cpp.o.d -o CMakeFiles/fbgemm_gpu_py.dir/src/metric_ops/metric_ops_host.cpp.o -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/src/metric_ops/metric_ops_host.cpp 2025-05-07T20:00:55.8580520Z clang-16: warning: argument unused during compilation: '-L/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/lib' [-Wunused-command-line-argument] 2025-05-07T20:00:56.8256966Z [491/608] /github/home/miniconda/envs/build_binary/bin/c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_py_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/lib -fopenmp=libgomp -stdlib=libstdc++ -I/github/home/miniconda/envs/build_binary/include -O3 -DNDEBUG -std=c++17 
-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -mavx2 -mf16c -mfma -fopenmp -MD -MT CMakeFiles/fbgemm_gpu_py.dir/src/input_combine_ops/input_combine_gpu.cpp.o -MF CMakeFiles/fbgemm_gpu_py.dir/src/input_combine_ops/input_combine_gpu.cpp.o.d -o CMakeFiles/fbgemm_gpu_py.dir/src/input_combine_ops/input_combine_gpu.cpp.o -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/src/input_combine_ops/input_combine_gpu.cpp 2025-05-07T20:00:56.8275051Z clang-16: warning: argument unused during compilation: '-L/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/lib' [-Wunused-command-line-argument] 2025-05-07T20:00:56.8803548Z [492/608] /github/home/miniconda/envs/build_binary/bin/c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_py_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA 
-L/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/lib -fopenmp=libgomp -stdlib=libstdc++ -I/github/home/miniconda/envs/build_binary/include -O3 -DNDEBUG -std=c++17 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -mavx2 -mf16c -mfma -fopenmp -MD -MT CMakeFiles/fbgemm_gpu_py.dir/src/tbe/eeg/eeg_utils.cpp.o -MF CMakeFiles/fbgemm_gpu_py.dir/src/tbe/eeg/eeg_utils.cpp.o.d -o CMakeFiles/fbgemm_gpu_py.dir/src/tbe/eeg/eeg_utils.cpp.o -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/src/tbe/eeg/eeg_utils.cpp 2025-05-07T20:00:56.8821551Z clang-16: warning: argument unused during compilation: '-L/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/lib' [-Wunused-command-line-argument] 2025-05-07T20:00:57.0993962Z [493/608] /github/home/miniconda/envs/build_binary/bin/c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_py_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem 
/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/lib -fopenmp=libgomp -stdlib=libstdc++ -I/github/home/miniconda/envs/build_binary/include -O3 -DNDEBUG -std=c++17 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -mavx2 -mf16c -mfma -fopenmp -MD -MT CMakeFiles/fbgemm_gpu_py.dir/src/quantize_ops/quantize_ops_gpu.cpp.o -MF CMakeFiles/fbgemm_gpu_py.dir/src/quantize_ops/quantize_ops_gpu.cpp.o.d -o CMakeFiles/fbgemm_gpu_py.dir/src/quantize_ops/quantize_ops_gpu.cpp.o -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/src/quantize_ops/quantize_ops_gpu.cpp 2025-05-07T20:00:57.1008837Z clang-16: warning: argument unused during compilation: '-L/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/lib' [-Wunused-command-line-argument] 2025-05-07T20:00:57.7466390Z [494/608] /github/home/miniconda/envs/build_binary/bin/c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_py_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include 
-I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/lib -fopenmp=libgomp -stdlib=libstdc++ -I/github/home/miniconda/envs/build_binary/include -O3 -DNDEBUG -std=c++17 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -mavx2 -mf16c -mfma -fopenmp -MD -MT CMakeFiles/fbgemm_gpu_py.dir/src/tbe/eeg/indices_estimator_ops.cpp.o -MF CMakeFiles/fbgemm_gpu_py.dir/src/tbe/eeg/indices_estimator_ops.cpp.o.d -o CMakeFiles/fbgemm_gpu_py.dir/src/tbe/eeg/indices_estimator_ops.cpp.o -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/src/tbe/eeg/indices_estimator_ops.cpp 2025-05-07T20:00:57.7483890Z clang-16: warning: argument unused during compilation: '-L/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/lib' [-Wunused-command-line-argument] 2025-05-07T20:00:57.8144358Z [495/608] /github/home/miniconda/envs/build_binary/bin/c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_py_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem 
/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/lib -fopenmp=libgomp -stdlib=libstdc++ -I/github/home/miniconda/envs/build_binary/include -O3 -DNDEBUG -std=c++17 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -mavx2 -mf16c -mfma -fopenmp -MD -MT CMakeFiles/fbgemm_gpu_py.dir/src/tbe/eeg/indices_estimator.cpp.o -MF CMakeFiles/fbgemm_gpu_py.dir/src/tbe/eeg/indices_estimator.cpp.o.d -o CMakeFiles/fbgemm_gpu_py.dir/src/tbe/eeg/indices_estimator.cpp.o -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/src/tbe/eeg/indices_estimator.cpp 2025-05-07T20:00:57.8162324Z clang-16: warning: argument unused during compilation: '-L/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/lib' [-Wunused-command-line-argument] 2025-05-07T20:00:58.5790780Z [496/608] /github/home/miniconda/envs/build_binary/bin/c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_py_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include 
-I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/lib -fopenmp=libgomp -stdlib=libstdc++ -I/github/home/miniconda/envs/build_binary/include -O3 -DNDEBUG -std=c++17 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -mavx2 -mf16c -mfma -fopenmp -MD -MT CMakeFiles/fbgemm_gpu_py.dir/src/merge_pooled_embedding_ops/merge_pooled_embedding_ops_gpu.cpp.o -MF CMakeFiles/fbgemm_gpu_py.dir/src/merge_pooled_embedding_ops/merge_pooled_embedding_ops_gpu.cpp.o.d -o CMakeFiles/fbgemm_gpu_py.dir/src/merge_pooled_embedding_ops/merge_pooled_embedding_ops_gpu.cpp.o -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/src/merge_pooled_embedding_ops/merge_pooled_embedding_ops_gpu.cpp 2025-05-07T20:00:58.5809116Z clang-16: warning: argument unused during compilation: '-L/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/lib' [-Wunused-command-line-argument] 2025-05-07T20:00:59.3354767Z [497/608] /github/home/miniconda/envs/build_binary/bin/c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_py_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src 
-I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/lib -fopenmp=libgomp -stdlib=libstdc++ -I/github/home/miniconda/envs/build_binary/include -O3 -DNDEBUG -std=c++17 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -mavx2 -mf16c -mfma -fopenmp -MD -MT CMakeFiles/fbgemm_gpu_py.dir/src/tbe/eeg/indices_generator_ops.cpp.o -MF CMakeFiles/fbgemm_gpu_py.dir/src/tbe/eeg/indices_generator_ops.cpp.o.d -o CMakeFiles/fbgemm_gpu_py.dir/src/tbe/eeg/indices_generator_ops.cpp.o -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/src/tbe/eeg/indices_generator_ops.cpp 2025-05-07T20:00:59.3371880Z clang-16: warning: argument unused during compilation: '-L/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/lib' [-Wunused-command-line-argument] 2025-05-07T20:00:59.4321918Z [498/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE 
-Dfbgemm_gpu_tbe_training_backward_vbe_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++17 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT 
CMakeFiles/fbgemm_gpu_tbe_training_backward_vbe.dir/gen_embedding_backward_rowwise_adagrad_with_counter_split_unweighted_vbe_kernel_cta.cu.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward_vbe.dir/gen_embedding_backward_rowwise_adagrad_with_counter_split_unweighted_vbe_kernel_cta.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_backward_rowwise_adagrad_with_counter_split_unweighted_vbe_kernel_cta.cu -o CMakeFiles/fbgemm_gpu_tbe_training_backward_vbe.dir/gen_embedding_backward_rowwise_adagrad_with_counter_split_unweighted_vbe_kernel_cta.cu.o 2025-05-07T20:00:59.4344481Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:00:59.4346448Z 2025-05-07T20:00:59.4348057Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:00:59.4349923Z 2025-05-07T20:00:59.4351367Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:00:59.4353455Z 2025-05-07T20:00:59.4355063Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:00:59.4356891Z 2025-05-07T20:00:59.4358502Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its 
first declaration 2025-05-07T20:00:59.4360245Z 2025-05-07T20:00:59.4361761Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:00:59.4363559Z 2025-05-07T20:00:59.6885369Z [499/608] /github/home/miniconda/envs/build_binary/bin/c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_py_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/lib -fopenmp=libgomp -stdlib=libstdc++ -I/github/home/miniconda/envs/build_binary/include -O3 -DNDEBUG -std=c++17 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -mavx2 -mf16c -mfma -fopenmp -MD -MT 
CMakeFiles/fbgemm_gpu_py.dir/src/tbe/eeg/indices_generator.cpp.o -MF CMakeFiles/fbgemm_gpu_py.dir/src/tbe/eeg/indices_generator.cpp.o.d -o CMakeFiles/fbgemm_gpu_py.dir/src/tbe/eeg/indices_generator.cpp.o -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/src/tbe/eeg/indices_generator.cpp 2025-05-07T20:00:59.6902478Z clang-16: warning: argument unused during compilation: '-L/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/lib' [-Wunused-command-line-argument] 2025-05-07T20:01:01.3607644Z [500/608] /github/home/miniconda/envs/build_binary/bin/c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_py_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/lib -fopenmp=libgomp -stdlib=libstdc++ -I/github/home/miniconda/envs/build_binary/include -O3 -DNDEBUG -std=c++17 
-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -mavx2 -mf16c -mfma -fopenmp -MD -MT CMakeFiles/fbgemm_gpu_py.dir/src/intraining_embedding_pruning_ops/intraining_embedding_pruning_gpu.cpp.o -MF CMakeFiles/fbgemm_gpu_py.dir/src/intraining_embedding_pruning_ops/intraining_embedding_pruning_gpu.cpp.o.d -o CMakeFiles/fbgemm_gpu_py.dir/src/intraining_embedding_pruning_ops/intraining_embedding_pruning_gpu.cpp.o -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/src/intraining_embedding_pruning_ops/intraining_embedding_pruning_gpu.cpp 2025-05-07T20:01:01.3626819Z clang-16: warning: argument unused during compilation: '-L/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/lib' [-Wunused-command-line-argument] 2025-05-07T20:01:01.6077932Z [501/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem 
/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++17 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_partial_rowwise_adam_split_unweighted_nobag_kernel_warp.cu.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_partial_rowwise_adam_split_unweighted_nobag_kernel_warp.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_backward_partial_rowwise_adam_split_unweighted_nobag_kernel_warp.cu -o CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_partial_rowwise_adam_split_unweighted_nobag_kernel_warp.cu.o 2025-05-07T20:01:01.6101464Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:01:01.6103909Z 2025-05-07T20:01:01.6105617Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:01:01.6107541Z 
2025-05-07T20:01:01.6109228Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:01:01.6111172Z 2025-05-07T20:01:01.6112966Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:01:01.6114899Z 2025-05-07T20:01:01.6116610Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:01:01.6118680Z 2025-05-07T20:01:01.6120589Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:01:01.6122506Z 2025-05-07T20:01:01.7965168Z [502/608] /github/home/miniconda/envs/build_binary/bin/c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_py_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem 
/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/lib -fopenmp=libgomp -stdlib=libstdc++ -I/github/home/miniconda/envs/build_binary/include -O3 -DNDEBUG -std=c++17 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -mavx2 -mf16c -mfma -fopenmp -MD -MT CMakeFiles/fbgemm_gpu_py.dir/src/permute_pooled_embedding_ops/permute_pooled_embedding_ops_gpu.cpp.o -MF CMakeFiles/fbgemm_gpu_py.dir/src/permute_pooled_embedding_ops/permute_pooled_embedding_ops_gpu.cpp.o.d -o CMakeFiles/fbgemm_gpu_py.dir/src/permute_pooled_embedding_ops/permute_pooled_embedding_ops_gpu.cpp.o -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/src/permute_pooled_embedding_ops/permute_pooled_embedding_ops_gpu.cpp 2025-05-07T20:01:01.7982995Z clang-16: warning: argument unused during compilation: '-L/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/lib' [-Wunused-command-line-argument] 2025-05-07T20:01:03.2269671Z [503/608] /github/home/miniconda/envs/build_binary/bin/c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_py_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include 
-I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/lib -fopenmp=libgomp -stdlib=libstdc++ -I/github/home/miniconda/envs/build_binary/include -O3 -DNDEBUG -std=c++17 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -mavx2 -mf16c -mfma -fopenmp -MD -MT CMakeFiles/fbgemm_gpu_py.dir/src/permute_pooled_embedding_ops/permute_pooled_embedding_ops_split_gpu.cpp.o -MF CMakeFiles/fbgemm_gpu_py.dir/src/permute_pooled_embedding_ops/permute_pooled_embedding_ops_split_gpu.cpp.o.d -o CMakeFiles/fbgemm_gpu_py.dir/src/permute_pooled_embedding_ops/permute_pooled_embedding_ops_split_gpu.cpp.o -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/src/permute_pooled_embedding_ops/permute_pooled_embedding_ops_split_gpu.cpp 2025-05-07T20:01:03.2289210Z clang-16: warning: argument unused during compilation: '-L/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/lib' [-Wunused-command-line-argument] 2025-05-07T20:01:03.3519866Z [504/608] /github/home/miniconda/envs/build_binary/bin/c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_py_EXPORTS 
-I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/lib -fopenmp=libgomp -stdlib=libstdc++ -I/github/home/miniconda/envs/build_binary/include -O3 -DNDEBUG -std=c++17 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -mavx2 -mf16c -mfma -fopenmp -MD -MT CMakeFiles/fbgemm_gpu_py.dir/src/jagged_tensor_ops/jagged_tensor_ops_autograd.cpp.o -MF CMakeFiles/fbgemm_gpu_py.dir/src/jagged_tensor_ops/jagged_tensor_ops_autograd.cpp.o.d -o CMakeFiles/fbgemm_gpu_py.dir/src/jagged_tensor_ops/jagged_tensor_ops_autograd.cpp.o -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/src/jagged_tensor_ops/jagged_tensor_ops_autograd.cpp 2025-05-07T20:01:03.3537150Z clang-16: warning: argument unused during compilation: '-L/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/lib' [-Wunused-command-line-argument] 
2025-05-07T20:01:04.8825556Z [505/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++17 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ 
-D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_partial_rowwise_adam_split_weighted_kernel_warp.cu.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_partial_rowwise_adam_split_weighted_kernel_warp.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_backward_partial_rowwise_adam_split_weighted_kernel_warp.cu -o CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_partial_rowwise_adam_split_weighted_kernel_warp.cu.o 2025-05-07T20:01:04.8848975Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:01:04.8850872Z 2025-05-07T20:01:04.8852567Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:01:04.8854458Z 2025-05-07T20:01:04.8856104Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:01:04.8858011Z 2025-05-07T20:01:04.8859666Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:01:04.8861568Z 2025-05-07T20:01:04.8863204Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly 
defaulted on its first declaration 2025-05-07T20:01:04.8865043Z 2025-05-07T20:01:04.8866719Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:01:04.8868802Z 2025-05-07T20:01:05.2976045Z [506/608] /github/home/miniconda/envs/build_binary/bin/c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_py_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/lib -fopenmp=libgomp -stdlib=libstdc++ -I/github/home/miniconda/envs/build_binary/include -O3 -DNDEBUG -std=c++17 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -mavx2 -mf16c -mfma -fopenmp -MD -MT 
CMakeFiles/fbgemm_gpu_py.dir/src/sparse_ops/sparse_ops_gpu.cpp.o -MF CMakeFiles/fbgemm_gpu_py.dir/src/sparse_ops/sparse_ops_gpu.cpp.o.d -o CMakeFiles/fbgemm_gpu_py.dir/src/sparse_ops/sparse_ops_gpu.cpp.o -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/src/sparse_ops/sparse_ops_gpu.cpp 2025-05-07T20:01:05.2993717Z clang-16: warning: argument unused during compilation: '-L/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/lib' [-Wunused-command-line-argument] 2025-05-07T20:01:07.5955800Z [507/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_dense_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -Xcudafe 
--diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++17 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward_dense.dir/gen_embedding_backward_dense_split_weighted_cuda.cu.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward_dense.dir/gen_embedding_backward_dense_split_weighted_cuda.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_backward_dense_split_weighted_cuda.cu -o CMakeFiles/fbgemm_gpu_tbe_training_backward_dense.dir/gen_embedding_backward_dense_split_weighted_cuda.cu.o 2025-05-07T20:01:07.5980352Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:01:07.5982657Z 2025-05-07T20:01:07.5984272Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:01:07.5986137Z 2025-05-07T20:01:07.5987865Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:01:07.5989606Z 2025-05-07T20:01:07.5991206Z 
/__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:01:07.5993187Z 2025-05-07T20:01:07.5994645Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:01:07.5996289Z 2025-05-07T20:01:07.5997580Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:01:07.5999295Z 2025-05-07T20:01:08.1830067Z [508/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_dense_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem 
/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++17 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward_dense.dir/gen_embedding_backward_dense_split_unweighted_nobag_cuda.cu.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward_dense.dir/gen_embedding_backward_dense_split_unweighted_nobag_cuda.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_backward_dense_split_unweighted_nobag_cuda.cu -o CMakeFiles/fbgemm_gpu_tbe_training_backward_dense.dir/gen_embedding_backward_dense_split_unweighted_nobag_cuda.cu.o 2025-05-07T20:01:08.1865335Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:01:08.1867122Z 2025-05-07T20:01:08.1868976Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 
2025-05-07T20:01:08.1870760Z 2025-05-07T20:01:08.1872281Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:01:08.1874185Z 2025-05-07T20:01:08.1875696Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:01:08.1877456Z 2025-05-07T20:01:08.1878966Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:01:08.1880558Z 2025-05-07T20:01:08.1882104Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:01:08.1883835Z 2025-05-07T20:01:08.7203656Z [509/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include 
-I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++17 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_partial_rowwise_adam_split_unweighted_kernel_warp.cu.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_partial_rowwise_adam_split_unweighted_kernel_warp.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_backward_partial_rowwise_adam_split_unweighted_kernel_warp.cu -o CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_partial_rowwise_adam_split_unweighted_kernel_warp.cu.o 2025-05-07T20:01:08.7225202Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: 
__host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:01:08.7227027Z 2025-05-07T20:01:08.7228633Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:01:08.7230297Z 2025-05-07T20:01:08.7231802Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:01:08.7233651Z 2025-05-07T20:01:08.7235135Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:01:08.7236839Z 2025-05-07T20:01:08.7238365Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:01:08.7240021Z 2025-05-07T20:01:08.7241601Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:01:08.7243327Z 2025-05-07T20:01:09.1509201Z [510/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_dense_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include 
-I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++17 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward_dense.dir/gen_embedding_backward_dense_split_weighted_kernel_cta.cu.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward_dense.dir/gen_embedding_backward_dense_split_weighted_kernel_cta.cu.o.d -x cu -c 
/__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_backward_dense_split_weighted_kernel_cta.cu -o CMakeFiles/fbgemm_gpu_tbe_training_backward_dense.dir/gen_embedding_backward_dense_split_weighted_kernel_cta.cu.o 2025-05-07T20:01:09.1530961Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:01:09.1532753Z 2025-05-07T20:01:09.1534366Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:01:09.1536100Z 2025-05-07T20:01:09.1537614Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:01:09.1539376Z 2025-05-07T20:01:09.1540943Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:01:09.1542754Z 2025-05-07T20:01:09.1544352Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:01:09.1546036Z 2025-05-07T20:01:09.1547644Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:01:09.1549467Z 
2025-05-07T20:01:09.1691385Z [511/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_index_select_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++17 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ 
-D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_tbe_index_select.dir/gen_embedding_backward_split_grad_index_select.cu.o -MF CMakeFiles/fbgemm_gpu_tbe_index_select.dir/gen_embedding_backward_split_grad_index_select.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_backward_split_grad_index_select.cu -o CMakeFiles/fbgemm_gpu_tbe_index_select.dir/gen_embedding_backward_split_grad_index_select.cu.o 2025-05-07T20:01:09.1712997Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:01:09.1714839Z 2025-05-07T20:01:09.1716362Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:01:09.1718203Z 2025-05-07T20:01:09.1719747Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:01:09.1721548Z 2025-05-07T20:01:09.1723160Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:01:09.1724956Z 2025-05-07T20:01:09.1726518Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:01:09.1728304Z 2025-05-07T20:01:09.1729931Z 
/__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:01:09.1731753Z 2025-05-07T20:01:10.2170809Z [512/608] /github/home/miniconda/envs/build_binary/bin/c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_py_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/lib -fopenmp=libgomp -stdlib=libstdc++ -I/github/home/miniconda/envs/build_binary/include -O3 -DNDEBUG -std=c++17 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -mavx2 -mf16c -mfma -fopenmp -MD -MT CMakeFiles/fbgemm_gpu_py.dir/src/sparse_ops/sparse_ops_cpu.cpp.o -MF 
CMakeFiles/fbgemm_gpu_py.dir/src/sparse_ops/sparse_ops_cpu.cpp.o.d -o CMakeFiles/fbgemm_gpu_py.dir/src/sparse_ops/sparse_ops_cpu.cpp.o -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/src/sparse_ops/sparse_ops_cpu.cpp 2025-05-07T20:01:10.2184443Z clang-16: warning: argument unused during compilation: '-L/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/lib' [-Wunused-command-line-argument] 2025-05-07T20:01:10.7103989Z [513/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_dense_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -Xcudafe 
--diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++17 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward_dense.dir/gen_embedding_backward_dense_split_unweighted_kernel_cta.cu.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward_dense.dir/gen_embedding_backward_dense_split_unweighted_kernel_cta.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_backward_dense_split_unweighted_kernel_cta.cu -o CMakeFiles/fbgemm_gpu_tbe_training_backward_dense.dir/gen_embedding_backward_dense_split_unweighted_kernel_cta.cu.o 2025-05-07T20:01:10.7121951Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:01:10.7123644Z 2025-05-07T20:01:10.7124918Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:01:10.7126350Z 2025-05-07T20:01:10.7127648Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:01:10.7129081Z 2025-05-07T20:01:10.7130367Z 
/__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:01:10.7131812Z 2025-05-07T20:01:10.7133057Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:01:10.7134643Z 2025-05-07T20:01:10.7136037Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:01:10.7137459Z 2025-05-07T20:01:10.7795287Z [514/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_dense_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem 
/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++17 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward_dense.dir/gen_embedding_backward_dense_split_unweighted_nobag_kernel_cta.cu.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward_dense.dir/gen_embedding_backward_dense_split_unweighted_nobag_kernel_cta.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_backward_dense_split_unweighted_nobag_kernel_cta.cu -o CMakeFiles/fbgemm_gpu_tbe_training_backward_dense.dir/gen_embedding_backward_dense_split_unweighted_nobag_kernel_cta.cu.o 2025-05-07T20:01:10.7817964Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:01:10.7819860Z 2025-05-07T20:01:10.7821507Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 
2025-05-07T20:01:10.7823346Z 2025-05-07T20:01:10.7824941Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:01:10.7826810Z 2025-05-07T20:01:10.7828393Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:01:10.7830467Z 2025-05-07T20:01:10.7832256Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:01:10.7834189Z 2025-05-07T20:01:10.7835817Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:01:10.7837571Z 2025-05-07T20:01:11.1185320Z [515/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_dense_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include 
-I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++17 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward_dense.dir/gen_embedding_backward_dense_split_unweighted_cuda.cu.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward_dense.dir/gen_embedding_backward_dense_split_unweighted_cuda.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_backward_dense_split_unweighted_cuda.cu -o CMakeFiles/fbgemm_gpu_tbe_training_backward_dense.dir/gen_embedding_backward_dense_split_unweighted_cuda.cu.o 2025-05-07T20:01:11.1203097Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a 
function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:01:11.1204546Z 2025-05-07T20:01:11.1205932Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:01:11.1207423Z 2025-05-07T20:01:11.1208639Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:01:11.1210245Z 2025-05-07T20:01:11.1211544Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:01:11.1212957Z 2025-05-07T20:01:11.1214418Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:01:11.1215798Z 2025-05-07T20:01:11.1216994Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:01:11.1218437Z 2025-05-07T20:01:12.9624153Z [516/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_index_select_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src 
-I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++17 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_tbe_index_select.dir/gen_batch_index_select_dim0_forward_codegen_cuda.cu.o -MF CMakeFiles/fbgemm_gpu_tbe_index_select.dir/gen_batch_index_select_dim0_forward_codegen_cuda.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_batch_index_select_dim0_forward_codegen_cuda.cu -o 
CMakeFiles/fbgemm_gpu_tbe_index_select.dir/gen_batch_index_select_dim0_forward_codegen_cuda.cu.o 2025-05-07T20:01:12.9645808Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:01:12.9647684Z 2025-05-07T20:01:12.9649239Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:01:12.9651142Z 2025-05-07T20:01:12.9652690Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:01:12.9654813Z 2025-05-07T20:01:12.9656697Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:01:12.9658475Z 2025-05-07T20:01:12.9659984Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:01:12.9661813Z 2025-05-07T20:01:12.9663437Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:01:12.9665229Z 2025-05-07T20:01:14.0068462Z [517/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL 
-DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_index_select_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++17 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_tbe_index_select.dir/gen_batch_index_select_dim0_forward_kernel.cu.o -MF 
CMakeFiles/fbgemm_gpu_tbe_index_select.dir/gen_batch_index_select_dim0_forward_kernel.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_batch_index_select_dim0_forward_kernel.cu -o CMakeFiles/fbgemm_gpu_tbe_index_select.dir/gen_batch_index_select_dim0_forward_kernel.cu.o 2025-05-07T20:01:14.0089821Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:01:14.0091603Z 2025-05-07T20:01:14.0093266Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:01:14.0095417Z 2025-05-07T20:01:14.0096989Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:01:14.0098819Z 2025-05-07T20:01:14.0100727Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:01:14.0102662Z 2025-05-07T20:01:14.0104211Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:01:14.0105990Z 2025-05-07T20:01:14.0107471Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on 
its first declaration 2025-05-07T20:01:14.0109090Z 2025-05-07T20:01:20.5006708Z [518/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_index_select_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++17 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations 
--expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_tbe_index_select.dir/gen_batch_index_select_dim0_forward_kernel_small.cu.o -MF CMakeFiles/fbgemm_gpu_tbe_index_select.dir/gen_batch_index_select_dim0_forward_kernel_small.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_batch_index_select_dim0_forward_kernel_small.cu -o CMakeFiles/fbgemm_gpu_tbe_index_select.dir/gen_batch_index_select_dim0_forward_kernel_small.cu.o 2025-05-07T20:01:20.5023479Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:01:20.5024894Z 2025-05-07T20:01:20.5026150Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:01:20.5027754Z 2025-05-07T20:01:20.5029171Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:01:20.5030585Z 2025-05-07T20:01:20.5031875Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:01:20.5033409Z 2025-05-07T20:01:20.5034616Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 
2025-05-07T20:01:20.5036006Z 2025-05-07T20:01:20.5037238Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:01:20.5038603Z 2025-05-07T20:01:20.7938387Z [519/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_py_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -Xcudafe 
--diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++17 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_py.dir/src/input_combine_ops/input_combine.cu.o -MF CMakeFiles/fbgemm_gpu_py.dir/src/input_combine_ops/input_combine.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/src/input_combine_ops/input_combine.cu -o CMakeFiles/fbgemm_gpu_py.dir/src/input_combine_ops/input_combine.cu.o 2025-05-07T20:01:22.7623056Z [520/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_py_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem 
/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++17 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_py.dir/src/memory_utils/memory_utils.cu.o -MF CMakeFiles/fbgemm_gpu_py.dir/src/memory_utils/memory_utils.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/src/memory_utils/memory_utils.cu -o CMakeFiles/fbgemm_gpu_py.dir/src/memory_utils/memory_utils.cu.o 2025-05-07T20:01:27.8271981Z [521/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_py_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include 
-I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++17 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_py.dir/src/memory_utils/memory_utils_ops.cu.o -MF CMakeFiles/fbgemm_gpu_py.dir/src/memory_utils/memory_utils_ops.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/src/memory_utils/memory_utils_ops.cu -o CMakeFiles/fbgemm_gpu_py.dir/src/memory_utils/memory_utils_ops.cu.o 2025-05-07T20:01:28.5211352Z [522/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_index_select_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include 
-I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++17 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_tbe_index_select.dir/gen_batch_index_select_dim0_backward_kernel_warp.cu.o -MF CMakeFiles/fbgemm_gpu_tbe_index_select.dir/gen_batch_index_select_dim0_backward_kernel_warp.cu.o.d -x cu -c 
/__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_batch_index_select_dim0_backward_kernel_warp.cu -o CMakeFiles/fbgemm_gpu_tbe_index_select.dir/gen_batch_index_select_dim0_backward_kernel_warp.cu.o 2025-05-07T20:01:28.5230961Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:01:28.5232674Z 2025-05-07T20:01:28.5234227Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:01:28.5236048Z 2025-05-07T20:01:28.5237515Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:01:28.5239147Z 2025-05-07T20:01:28.5240607Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:01:28.5242308Z 2025-05-07T20:01:28.5243815Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:01:28.5245388Z 2025-05-07T20:01:28.5246791Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:01:28.5248505Z 2025-05-07T20:01:36.2921442Z [523/608] 
/github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_py_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++17 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD 
-MT CMakeFiles/fbgemm_gpu_py.dir/src/histogram_binning_calibration_ops.cu.o -MF CMakeFiles/fbgemm_gpu_py.dir/src/histogram_binning_calibration_ops.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/src/histogram_binning_calibration_ops.cu -o CMakeFiles/fbgemm_gpu_py.dir/src/histogram_binning_calibration_ops.cu.o 2025-05-07T20:01:38.4340611Z [524/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_py_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -Xcudafe 
--diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++17 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_py.dir/src/permute_pooled_embedding_ops/permute_pooled_embedding_ops_split.cu.o -MF CMakeFiles/fbgemm_gpu_py.dir/src/permute_pooled_embedding_ops/permute_pooled_embedding_ops_split.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/src/permute_pooled_embedding_ops/permute_pooled_embedding_ops_split.cu -o CMakeFiles/fbgemm_gpu_py.dir/src/permute_pooled_embedding_ops/permute_pooled_embedding_ops_split.cu.o 2025-05-07T20:01:39.6373177Z [525/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_index_select_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem 
/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++17 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_tbe_index_select.dir/gen_batch_index_select_dim0_backward_codegen_cuda.cu.o -MF CMakeFiles/fbgemm_gpu_tbe_index_select.dir/gen_batch_index_select_dim0_backward_codegen_cuda.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_batch_index_select_dim0_backward_codegen_cuda.cu -o CMakeFiles/fbgemm_gpu_tbe_index_select.dir/gen_batch_index_select_dim0_backward_codegen_cuda.cu.o 2025-05-07T20:01:39.6393791Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:01:39.6395567Z 2025-05-07T20:01:39.6397172Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ 
annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:01:39.6399088Z 2025-05-07T20:01:39.6400807Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:01:39.6403264Z 2025-05-07T20:01:39.6405125Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:01:39.6407097Z 2025-05-07T20:01:39.6408818Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:01:39.6410741Z 2025-05-07T20:01:39.6412495Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:01:39.6414444Z 2025-05-07T20:01:40.5207153Z [526/608] /github/home/miniconda/envs/build_binary/bin/c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_py_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include 
-I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/lib -fopenmp=libgomp -stdlib=libstdc++ -I/github/home/miniconda/envs/build_binary/include -O3 -DNDEBUG -std=c++17 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -mavx2 -mf16c -mfma -fopenmp -MD -MT CMakeFiles/fbgemm_gpu_py.dir/src/jagged_tensor_ops/jagged_tensor_ops_cpu.cpp.o -MF CMakeFiles/fbgemm_gpu_py.dir/src/jagged_tensor_ops/jagged_tensor_ops_cpu.cpp.o.d -o CMakeFiles/fbgemm_gpu_py.dir/src/jagged_tensor_ops/jagged_tensor_ops_cpu.cpp.o -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/src/jagged_tensor_ops/jagged_tensor_ops_cpu.cpp 2025-05-07T20:01:40.5226008Z clang-16: warning: argument unused during compilation: '-L/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/lib' [-Wunused-command-line-argument] 2025-05-07T20:01:40.6698231Z [527/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_index_select_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include 
-I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++17 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_tbe_index_select.dir/gen_batch_index_select_dim0_backward_kernel_cta.cu.o -MF CMakeFiles/fbgemm_gpu_tbe_index_select.dir/gen_batch_index_select_dim0_backward_kernel_cta.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_batch_index_select_dim0_backward_kernel_cta.cu -o 
CMakeFiles/fbgemm_gpu_tbe_index_select.dir/gen_batch_index_select_dim0_backward_kernel_cta.cu.o 2025-05-07T20:01:40.6720150Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:01:40.6722209Z 2025-05-07T20:01:40.6723948Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:01:40.6725916Z 2025-05-07T20:01:40.6727649Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:01:40.6729561Z 2025-05-07T20:01:40.6731308Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:01:40.6733252Z 2025-05-07T20:01:40.6734960Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:01:40.6737151Z 2025-05-07T20:01:40.6738875Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:01:40.6740809Z 2025-05-07T20:01:41.3542734Z [528/608] : && /github/home/miniconda/envs/build_binary/bin/c++ -fPIC -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA 
-L/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/lib -fopenmp=libgomp -stdlib=libstdc++ -I/github/home/miniconda/envs/build_binary/include -O3 -DNDEBUG -Wl,-O2 -Wl,--sort-common -Wl,--as-needed -Wl,-z,relro -Wl,-z,now -Wl,--disable-new-dtags -Wl,--gc-sections -Wl,--allow-shlib-undefined -Wl,-rpath,/github/home/miniconda/envs/build_binary/lib -Wl,-rpath-link,/github/home/miniconda/envs/build_binary/lib -L/github/home/miniconda/envs/build_binary/lib -s -shared -Wl,-soname,fbgemm_gpu_tbe_index_select.so -o fbgemm_gpu_tbe_index_select.so CMakeFiles/fbgemm_gpu_tbe_index_select.dir/codegen/training/index_select/batch_index_select_dim0_cpu_host.cpp.o CMakeFiles/fbgemm_gpu_tbe_index_select.dir/codegen/training/index_select/batch_index_select_dim0_ops.cpp.o CMakeFiles/fbgemm_gpu_tbe_index_select.dir/codegen/training/index_select/batch_index_select_dim0_host.cpp.o CMakeFiles/fbgemm_gpu_tbe_index_select.dir/gen_batch_index_select_dim0_forward_codegen_cuda.cu.o CMakeFiles/fbgemm_gpu_tbe_index_select.dir/gen_batch_index_select_dim0_forward_kernel.cu.o CMakeFiles/fbgemm_gpu_tbe_index_select.dir/gen_batch_index_select_dim0_forward_kernel_small.cu.o CMakeFiles/fbgemm_gpu_tbe_index_select.dir/gen_batch_index_select_dim0_backward_codegen_cuda.cu.o CMakeFiles/fbgemm_gpu_tbe_index_select.dir/gen_batch_index_select_dim0_backward_kernel_cta.cu.o CMakeFiles/fbgemm_gpu_tbe_index_select.dir/gen_batch_index_select_dim0_backward_kernel_warp.cu.o CMakeFiles/fbgemm_gpu_tbe_index_select.dir/gen_embedding_backward_split_grad_index_select.cu.o -L/lib/intel64 -L/lib/intel64_win -L/lib/win-x64 -L/github/home/miniconda/envs/build_binary/lib/stubs -Wl,-rpath,"\$ORIGIN" /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/lib/libc10.so /github/home/miniconda/envs/build_binary/lib/libnvrtc.so /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/lib/libc10_cuda.so 
/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/lib/libnccl.so.2 /github/home/miniconda/envs/build_binary/lib/stubs/libcuda.so fbgemm_gpu_sparse_async_cumsum.so fbgemm_gpu_tbe_utils.so /github/home/miniconda/envs/build_binary/lib/stubs/libnvidia-ml.so -Wl,--no-as-needed,"/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/lib/libtorch.so" -Wl,--as-needed /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/lib/libtorch.so -Wl,--no-as-needed,"/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/lib/libtorch_cpu.so" -Wl,--as-needed -Wl,--no-as-needed,"/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/lib/libtorch_cuda.so" -Wl,--as-needed /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/lib/libc10_cuda.so /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/lib/libc10.so /github/home/miniconda/envs/build_binary/lib/libcudart.so -lcudadevrt -lcudart_static -ldl && : 2025-05-07T20:01:42.1151309Z [529/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_py_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem 
/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++17 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_py.dir/src/jagged_tensor_ops/batched_dense_vec_jagged_2d_mul_backward.cu.o -MF CMakeFiles/fbgemm_gpu_py.dir/src/jagged_tensor_ops/batched_dense_vec_jagged_2d_mul_backward.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/src/jagged_tensor_ops/batched_dense_vec_jagged_2d_mul_backward.cu -o CMakeFiles/fbgemm_gpu_py.dir/src/jagged_tensor_ops/batched_dense_vec_jagged_2d_mul_backward.cu.o 2025-05-07T20:01:43.0802501Z [530/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_py_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include 
-I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++17 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_py.dir/src/jagged_tensor_ops/batched_dense_vec_jagged_2d_mul_forward.cu.o -MF CMakeFiles/fbgemm_gpu_py.dir/src/jagged_tensor_ops/batched_dense_vec_jagged_2d_mul_forward.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/src/jagged_tensor_ops/batched_dense_vec_jagged_2d_mul_forward.cu -o CMakeFiles/fbgemm_gpu_py.dir/src/jagged_tensor_ops/batched_dense_vec_jagged_2d_mul_forward.cu.o 
2025-05-07T20:01:43.5117154Z [531/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_py_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++17 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ 
-D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_py.dir/src/metric_ops/metric_ops.cu.o -MF CMakeFiles/fbgemm_gpu_py.dir/src/metric_ops/metric_ops.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/src/metric_ops/metric_ops.cu -o CMakeFiles/fbgemm_gpu_py.dir/src/metric_ops/metric_ops.cu.o 2025-05-07T20:01:46.0215602Z [532/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_py_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -Xcudafe 
--diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++17 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_py.dir/src/jagged_tensor_ops/jagged_dense_bmm_forward.cu.o -MF CMakeFiles/fbgemm_gpu_py.dir/src/jagged_tensor_ops/jagged_dense_bmm_forward.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/src/jagged_tensor_ops/jagged_dense_bmm_forward.cu -o CMakeFiles/fbgemm_gpu_py.dir/src/jagged_tensor_ops/jagged_dense_bmm_forward.cu.o 2025-05-07T20:01:46.3774888Z [533/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_py_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include 
-I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++17 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_py.dir/src/jagged_tensor_ops/jagged_index_add_2d_forward.cu.o -MF CMakeFiles/fbgemm_gpu_py.dir/src/jagged_tensor_ops/jagged_index_add_2d_forward.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/src/jagged_tensor_ops/jagged_index_add_2d_forward.cu -o CMakeFiles/fbgemm_gpu_py.dir/src/jagged_tensor_ops/jagged_index_add_2d_forward.cu.o 2025-05-07T20:01:46.7325504Z [534/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_py_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include 
-I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++17 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_py.dir/src/jagged_tensor_ops/jagged_index_select_2d_forward.cu.o -MF CMakeFiles/fbgemm_gpu_py.dir/src/jagged_tensor_ops/jagged_index_select_2d_forward.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/src/jagged_tensor_ops/jagged_index_select_2d_forward.cu -o CMakeFiles/fbgemm_gpu_py.dir/src/jagged_tensor_ops/jagged_index_select_2d_forward.cu.o 2025-05-07T20:01:47.3610783Z [535/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED 
-DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_py_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++17 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_py.dir/src/jagged_tensor_ops/jagged_softmax_backward.cu.o -MF 
CMakeFiles/fbgemm_gpu_py.dir/src/jagged_tensor_ops/jagged_softmax_backward.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/src/jagged_tensor_ops/jagged_softmax_backward.cu -o CMakeFiles/fbgemm_gpu_py.dir/src/jagged_tensor_ops/jagged_softmax_backward.cu.o 2025-05-07T20:01:47.3704336Z [536/608] cd /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build && bash /__w/FBGEMM/FBGEMM/fbgemm_gpu/../.github/scripts/fbgemm_gpu_postbuild.bash /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/asmjit.so 2025-05-07T20:01:47.3706815Z ################################################################################ 2025-05-07T20:01:47.3707408Z [CMAKE] Running post-build script ... 2025-05-07T20:01:47.3708297Z Target file: /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/asmjit.so 2025-05-07T20:01:47.3709175Z Removing all RPATHs ... 2025-05-07T20:01:47.3709685Z ################################################################################ 2025-05-07T20:01:47.3949787Z [537/608] cd /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build && bash /__w/FBGEMM/FBGEMM/fbgemm_gpu/../.github/scripts/fbgemm_gpu_postbuild.bash /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/fbgemm.so 1 2025-05-07T20:01:47.3951905Z ################################################################################ 2025-05-07T20:01:47.3952543Z [CMAKE] Running post-build script ... 2025-05-07T20:01:47.3953526Z Target file: /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/fbgemm.so 2025-05-07T20:01:47.3954759Z Resetting RPATH to $ORIGIN ... 
2025-05-07T20:01:47.3955358Z 0x000000000000000f (RPATH) Library rpath: [$ORIGIN] 2025-05-07T20:01:47.3956063Z ################################################################################ 2025-05-07T20:01:47.4463403Z [538/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_py_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 
-DNDEBUG -std=c++17 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_py.dir/src/jagged_tensor_ops/jagged_jagged_bmm_forward.cu.o -MF CMakeFiles/fbgemm_gpu_py.dir/src/jagged_tensor_ops/jagged_jagged_bmm_forward.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/src/jagged_tensor_ops/jagged_jagged_bmm_forward.cu -o CMakeFiles/fbgemm_gpu_py.dir/src/jagged_tensor_ops/jagged_jagged_bmm_forward.cu.o 2025-05-07T20:01:47.4483791Z [539/608] cd /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build && bash /__w/FBGEMM/FBGEMM/fbgemm_gpu/../.github/scripts/fbgemm_gpu_postbuild.bash /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/fbgemm_gpu_tbe_cache.so 2025-05-07T20:01:47.4485772Z ################################################################################ 2025-05-07T20:01:47.4486631Z [CMAKE] Running post-build script ... 2025-05-07T20:01:47.4487550Z Target file: /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/fbgemm_gpu_tbe_cache.so 2025-05-07T20:01:47.4488525Z Removing all RPATHs ... 2025-05-07T20:01:47.4488934Z ################################################################################ 2025-05-07T20:01:47.4562257Z [540/608] cd /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build && bash /__w/FBGEMM/FBGEMM/fbgemm_gpu/../.github/scripts/fbgemm_gpu_postbuild.bash /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/fbgemm_gpu_config.so 2025-05-07T20:01:47.4564547Z ################################################################################ 2025-05-07T20:01:47.4565157Z [CMAKE] Running post-build script ... 2025-05-07T20:01:47.4566066Z Target file: /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/fbgemm_gpu_config.so 2025-05-07T20:01:47.4567097Z Removing all RPATHs ... 
2025-05-07T20:01:47.4567819Z ################################################################################ 2025-05-07T20:01:47.4845830Z [541/608] cd /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build && bash /__w/FBGEMM/FBGEMM/fbgemm_gpu/../.github/scripts/fbgemm_gpu_postbuild.bash /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/fbgemm_gpu_tbe_utils.so 2025-05-07T20:01:47.4848216Z ################################################################################ 2025-05-07T20:01:47.4848834Z [CMAKE] Running post-build script ... 2025-05-07T20:01:47.4849712Z Target file: /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/fbgemm_gpu_tbe_utils.so 2025-05-07T20:01:47.4850688Z Removing all RPATHs ... 2025-05-07T20:01:47.4851288Z ################################################################################ 2025-05-07T20:01:47.4945666Z [542/608] cd /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build && bash /__w/FBGEMM/FBGEMM/fbgemm_gpu/../.github/scripts/fbgemm_gpu_postbuild.bash /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/fbgemm_gpu_sparse_async_cumsum.so 1 2025-05-07T20:01:47.4948218Z ################################################################################ 2025-05-07T20:01:47.4948832Z [CMAKE] Running post-build script ... 2025-05-07T20:01:47.4949915Z Target file: /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/fbgemm_gpu_sparse_async_cumsum.so 2025-05-07T20:01:47.4951022Z Resetting RPATH to $ORIGIN ... 
2025-05-07T20:01:47.4951657Z 0x000000000000000f (RPATH) Library rpath: [$ORIGIN] 2025-05-07T20:01:47.4952370Z ################################################################################ 2025-05-07T20:01:47.5057438Z [543/608] cd /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build && bash /__w/FBGEMM/FBGEMM/fbgemm_gpu/../.github/scripts/fbgemm_gpu_postbuild.bash /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/fbgemm_gpu_tbe_common.so 1 2025-05-07T20:01:47.5059799Z ################################################################################ 2025-05-07T20:01:47.5060425Z [CMAKE] Running post-build script ... 2025-05-07T20:01:47.5061360Z Target file: /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/fbgemm_gpu_tbe_common.so 2025-05-07T20:01:47.5062261Z Resetting RPATH to $ORIGIN ... 2025-05-07T20:01:47.5062886Z 0x000000000000000f (RPATH) Library rpath: [$ORIGIN] 2025-05-07T20:01:47.5063533Z ################################################################################ 2025-05-07T20:01:47.5219697Z [544/608] cd /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build && bash /__w/FBGEMM/FBGEMM/fbgemm_gpu/../.github/scripts/fbgemm_gpu_postbuild.bash /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/fbgemm_gpu_tbe_optimizers.so 2025-05-07T20:01:47.5222102Z ################################################################################ 2025-05-07T20:01:47.5222669Z [CMAKE] Running post-build script ... 2025-05-07T20:01:47.5223927Z Target file: /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/fbgemm_gpu_tbe_optimizers.so 2025-05-07T20:01:47.5224887Z Removing all RPATHs ... 
2025-05-07T20:01:47.5225320Z ################################################################################ 2025-05-07T20:01:47.6062972Z [545/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_py_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++17 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion 
-Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_py.dir/src/jagged_tensor_ops/jagged_tensor_ops.cu.o -MF CMakeFiles/fbgemm_gpu_py.dir/src/jagged_tensor_ops/jagged_tensor_ops.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/src/jagged_tensor_ops/jagged_tensor_ops.cu -o CMakeFiles/fbgemm_gpu_py.dir/src/jagged_tensor_ops/jagged_tensor_ops.cu.o 2025-05-07T20:01:47.6420145Z [546/608] cd /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build && bash /__w/FBGEMM/FBGEMM/fbgemm_gpu/../.github/scripts/fbgemm_gpu_postbuild.bash /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/fbgemm_gpu_tbe_training_backward_pt2.so 1 2025-05-07T20:01:47.6422584Z ################################################################################ 2025-05-07T20:01:47.6423279Z [CMAKE] Running post-build script ... 2025-05-07T20:01:47.6424400Z Target file: /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/fbgemm_gpu_tbe_training_backward_pt2.so 2025-05-07T20:01:47.6425582Z Resetting RPATH to $ORIGIN ... 2025-05-07T20:01:47.6426266Z 0x000000000000000f (RPATH) Library rpath: [$ORIGIN] 2025-05-07T20:01:47.6427029Z ################################################################################ 2025-05-07T20:01:47.6612948Z [547/608] cd /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build && bash /__w/FBGEMM/FBGEMM/fbgemm_gpu/../.github/scripts/fbgemm_gpu_postbuild.bash /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/fbgemm_gpu_tbe_training_backward_split_host.so 1 2025-05-07T20:01:47.6615281Z ################################################################################ 2025-05-07T20:01:47.7265279Z [CMAKE] Running post-build script ... 
2025-05-07T20:01:47.7266107Z Target file: /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/fbgemm_gpu_tbe_training_backward_split_host.so 2025-05-07T20:01:47.7267001Z Resetting RPATH to $ORIGIN ... 2025-05-07T20:01:47.7267411Z 0x000000000000000f (RPATH) Library rpath: [$ORIGIN] 2025-05-07T20:01:47.7267834Z ################################################################################ 2025-05-07T20:01:47.7269098Z [548/608] cd /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build && bash /__w/FBGEMM/FBGEMM/fbgemm_gpu/../.github/scripts/fbgemm_gpu_postbuild.bash /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/fbgemm_gpu_tbe_index_select.so 1 2025-05-07T20:01:47.7270351Z ################################################################################ 2025-05-07T20:01:47.7270706Z [CMAKE] Running post-build script ... 2025-05-07T20:01:47.7271319Z Target file: /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/fbgemm_gpu_tbe_index_select.so 2025-05-07T20:01:47.7271989Z Resetting RPATH to $ORIGIN ... 
2025-05-07T20:01:47.7272390Z 0x000000000000000f (RPATH) Library rpath: [$ORIGIN] 2025-05-07T20:01:47.7272958Z ################################################################################ 2025-05-07T20:01:48.2555742Z [549/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_py_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 
-DNDEBUG -std=c++17 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_py.dir/src/jagged_tensor_ops/jagged_softmax_forward.cu.o -MF CMakeFiles/fbgemm_gpu_py.dir/src/jagged_tensor_ops/jagged_softmax_forward.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/src/jagged_tensor_ops/jagged_softmax_forward.cu -o CMakeFiles/fbgemm_gpu_py.dir/src/jagged_tensor_ops/jagged_softmax_forward.cu.o 2025-05-07T20:01:48.7421119Z [550/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_py_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/include -DONNX_NAMESPACE=onnx_c2 -gencode 
arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++17 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_py.dir/src/layout_transform_ops/layout_transform_ops.cu.o -MF CMakeFiles/fbgemm_gpu_py.dir/src/layout_transform_ops/layout_transform_ops.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/src/layout_transform_ops/layout_transform_ops.cu -o CMakeFiles/fbgemm_gpu_py.dir/src/layout_transform_ops/layout_transform_ops.cu.o 2025-05-07T20:01:48.9525230Z [551/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_py_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include 
-I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++17 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_py.dir/src/jagged_tensor_ops/jagged_unique_indices.cu.o -MF CMakeFiles/fbgemm_gpu_py.dir/src/jagged_tensor_ops/jagged_unique_indices.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/src/jagged_tensor_ops/jagged_unique_indices.cu -o CMakeFiles/fbgemm_gpu_py.dir/src/jagged_tensor_ops/jagged_unique_indices.cu.o 2025-05-07T20:01:48.9543328Z In file included from tmpxft_000097c8_00000000-6_jagged_unique_indices.compute_80.cudafe1.stub.c:1: 2025-05-07T20:01:48.9545448Z /tmp/tmpxft_000097c8_00000000-6_jagged_unique_indices.compute_80.cudafe1.stub.c:45:1022: warning: integer literal is too large to be represented in a signed integer type, interpreting as unsigned [-Wimplicitly-unsigned-literal] 2025-05-07T20:01:48.9555740Z static void __device_stub__ZN10fbgemm_gpu28unique_indices_length_kernelIlLl9223372036854775807ELln9223372036854775808EEEvN2at27GenericPackedTensorAccessorIT_Lm1ENS1_17RestrictPtrTraitsEiEES5_S5_S5_(const 
_ZN2at22PackedTensorAccessor32IlLm1ENS_17RestrictPtrTraitsEEE&__par0, const _ZN2at22PackedTensorAccessor32IlLm1ENS_17RestrictPtrTraitsEEE&__par1, const _ZN2at22PackedTensorAccessor32IlLm1ENS_17RestrictPtrTraitsEEE&__par2, _ZN2at22PackedTensorAccessor32IlLm1ENS_17RestrictPtrTraitsEEE&__par3){__cudaLaunchPrologue(4);__cudaSetupArg(__par0, 0UL);__cudaSetupArg(__par1, 16UL);__cudaSetupArg(__par2, 32UL);__cudaSetupArg(__par3, 48UL);__cudaLaunch(((char *)((void ( *)(const _ZN2at22PackedTensorAccessor32IlLm1ENS_17RestrictPtrTraitsEEE, const _ZN2at22PackedTensorAccessor32IlLm1ENS_17RestrictPtrTraitsEEE, const _ZN2at22PackedTensorAccessor32IlLm1ENS_17RestrictPtrTraitsEEE, _ZN2at22PackedTensorAccessor32IlLm1ENS_17RestrictPtrTraitsEEE))fbgemm_gpu::unique_indices_length_kernel )));}namespace fbgemm_gpu{ 2025-05-07T20:01:48.9565276Z ^ 2025-05-07T20:01:48.9567558Z /tmp/tmpxft_000097c8_00000000-6_jagged_unique_indices.compute_80.cudafe1.stub.c:45:1022: warning: integer literal is too large to be represented in a signed integer type, interpreting as unsigned [-Wimplicitly-unsigned-literal] 2025-05-07T20:01:48.9570522Z /tmp/tmpxft_000097c8_00000000-6_jagged_unique_indices.compute_80.cudafe1.stub.c:45:1022: warning: integer literal is too large to be represented in a signed integer type, interpreting as unsigned [-Wimplicitly-unsigned-literal] 2025-05-07T20:01:48.9573249Z /tmp/tmpxft_000097c8_00000000-6_jagged_unique_indices.compute_80.cudafe1.stub.c:51:860: warning: integer literal is too large to be represented in a signed integer type, interpreting as unsigned [-Wimplicitly-unsigned-literal] 2025-05-07T20:01:48.9581609Z static void __device_stub__ZN10fbgemm_gpu24compute_hash_size_kernelIlLln9223372036854775808EEEvN2at27GenericPackedTensorAccessorIT_Lm1ENS1_17RestrictPtrTraitsEiEES5_lS5_(const _ZN2at22PackedTensorAccessor32IlLm1ENS_17RestrictPtrTraitsEEE&__par0, const _ZN2at22PackedTensorAccessor32IlLm1ENS_17RestrictPtrTraitsEEE&__par1, const int64_t __par2, 
_ZN2at22PackedTensorAccessor32IlLm1ENS_17RestrictPtrTraitsEEE&__par3){__cudaLaunchPrologue(4);__cudaSetupArg(__par0, 0UL);__cudaSetupArg(__par1, 16UL);__cudaSetupArgSimple(__par2, 32UL);__cudaSetupArg(__par3, 40UL);__cudaLaunch(((char *)((void ( *)(const _ZN2at22PackedTensorAccessor32IlLm1ENS_17RestrictPtrTraitsEEE, const _ZN2at22PackedTensorAccessor32IlLm1ENS_17RestrictPtrTraitsEEE, const int64_t, _ZN2at22PackedTensorAccessor32IlLm1ENS_17RestrictPtrTraitsEEE))fbgemm_gpu::compute_hash_size_kernel )));}namespace fbgemm_gpu{ 2025-05-07T20:01:48.9589232Z ^ 2025-05-07T20:01:48.9591700Z /tmp/tmpxft_000097c8_00000000-6_jagged_unique_indices.compute_80.cudafe1.stub.c:51:860: warning: integer literal is too large to be represented in a signed integer type, interpreting as unsigned [-Wimplicitly-unsigned-literal] 2025-05-07T20:01:48.9594644Z /tmp/tmpxft_000097c8_00000000-6_jagged_unique_indices.compute_80.cudafe1.stub.c:51:860: warning: integer literal is too large to be represented in a signed integer type, interpreting as unsigned [-Wimplicitly-unsigned-literal] 2025-05-07T20:01:48.9597670Z /tmp/tmpxft_000097c8_00000000-6_jagged_unique_indices.compute_80.cudafe1.stub.c:54:445: warning: integer literal is too large to be represented in a signed integer type, interpreting as unsigned [-Wimplicitly-unsigned-literal] 2025-05-07T20:01:48.9600501Z /tmp/tmpxft_000097c8_00000000-6_jagged_unique_indices.compute_80.cudafe1.stub.c:54:1476: warning: integer literal is too large to be represented in a signed integer type, interpreting as unsigned [-Wimplicitly-unsigned-literal] 2025-05-07T20:01:48.9602448Z 8 warnings generated. 
2025-05-07T20:01:49.7035872Z [552/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_py_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++17 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ 
-D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_py.dir/src/quantize_ops/quantize_mx.cu.o -MF CMakeFiles/fbgemm_gpu_py.dir/src/quantize_ops/quantize_mx.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/src/quantize_ops/quantize_mx.cu -o CMakeFiles/fbgemm_gpu_py.dir/src/quantize_ops/quantize_mx.cu.o 2025-05-07T20:01:49.9439448Z [553/608] cd /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build && bash /__w/FBGEMM/FBGEMM/fbgemm_gpu/../.github/scripts/fbgemm_gpu_postbuild.bash /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/fbgemm_gpu_tbe_training_forward.so 1 2025-05-07T20:01:49.9441320Z ################################################################################ 2025-05-07T20:01:49.9441848Z [CMAKE] Running post-build script ... 2025-05-07T20:01:49.9443092Z Target file: /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/fbgemm_gpu_tbe_training_forward.so 2025-05-07T20:01:49.9443961Z Resetting RPATH to $ORIGIN ... 
2025-05-07T20:01:49.9444505Z 0x000000000000000f (RPATH) Library rpath: [$ORIGIN] 2025-05-07T20:01:49.9445099Z ################################################################################ 2025-05-07T20:01:50.0101886Z [554/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_py_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 
-DNDEBUG -std=c++17 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_py.dir/src/jagged_tensor_ops/jagged_dense_elementwise_mul_forward.cu.o -MF CMakeFiles/fbgemm_gpu_py.dir/src/jagged_tensor_ops/jagged_dense_elementwise_mul_forward.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/src/jagged_tensor_ops/jagged_dense_elementwise_mul_forward.cu -o CMakeFiles/fbgemm_gpu_py.dir/src/jagged_tensor_ops/jagged_dense_elementwise_mul_forward.cu.o 2025-05-07T20:01:50.0636855Z [555/608] cd /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build && bash /__w/FBGEMM/FBGEMM/fbgemm_gpu/../.github/scripts/fbgemm_gpu_postbuild.bash /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/fbgemm_gpu_tbe_inference.so 1 2025-05-07T20:01:50.0639227Z ################################################################################ 2025-05-07T20:01:50.0639802Z [CMAKE] Running post-build script ... 2025-05-07T20:01:50.0640783Z Target file: /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/fbgemm_gpu_tbe_inference.so 2025-05-07T20:01:50.0641784Z Resetting RPATH to $ORIGIN ... 
2025-05-07T20:01:50.0642413Z 0x000000000000000f (RPATH) Library rpath: [$ORIGIN] 2025-05-07T20:01:50.0643099Z ################################################################################ 2025-05-07T20:01:50.1472656Z [556/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_py_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 
-DNDEBUG -std=c++17 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_py.dir/src/permute_pooled_embedding_ops/permute_pooled_embedding_ops.cu.o -MF CMakeFiles/fbgemm_gpu_py.dir/src/permute_pooled_embedding_ops/permute_pooled_embedding_ops.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/src/permute_pooled_embedding_ops/permute_pooled_embedding_ops.cu -o CMakeFiles/fbgemm_gpu_py.dir/src/permute_pooled_embedding_ops/permute_pooled_embedding_ops.cu.o 2025-05-07T20:01:50.8707993Z [557/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_py_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -isystem 
/github/home/miniconda/envs/build_binary/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++17 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_py.dir/src/jagged_tensor_ops/dense_to_jagged_forward.cu.o -MF CMakeFiles/fbgemm_gpu_py.dir/src/jagged_tensor_ops/dense_to_jagged_forward.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/src/jagged_tensor_ops/dense_to_jagged_forward.cu -o CMakeFiles/fbgemm_gpu_py.dir/src/jagged_tensor_ops/dense_to_jagged_forward.cu.o 2025-05-07T20:01:51.9548681Z [558/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_py_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem 
/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++17 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_py.dir/src/jagged_tensor_ops/jagged_dense_elementwise_mul_backward.cu.o -MF CMakeFiles/fbgemm_gpu_py.dir/src/jagged_tensor_ops/jagged_dense_elementwise_mul_backward.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/src/jagged_tensor_ops/jagged_dense_elementwise_mul_backward.cu -o CMakeFiles/fbgemm_gpu_py.dir/src/jagged_tensor_ops/jagged_dense_elementwise_mul_backward.cu.o 2025-05-07T20:01:52.4693451Z [559/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_py_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include 
-I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++17 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_py.dir/src/jagged_tensor_ops/jagged_to_padded_dense_backward.cu.o -MF CMakeFiles/fbgemm_gpu_py.dir/src/jagged_tensor_ops/jagged_to_padded_dense_backward.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/src/jagged_tensor_ops/jagged_to_padded_dense_backward.cu -o CMakeFiles/fbgemm_gpu_py.dir/src/jagged_tensor_ops/jagged_to_padded_dense_backward.cu.o 2025-05-07T20:01:53.1000675Z [560/608] 
/github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_py_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++17 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD 
-MT CMakeFiles/fbgemm_gpu_py.dir/src/jagged_tensor_ops/jagged_dense_dense_elementwise_add_jagged_output_forward.cu.o -MF CMakeFiles/fbgemm_gpu_py.dir/src/jagged_tensor_ops/jagged_dense_dense_elementwise_add_jagged_output_forward.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/src/jagged_tensor_ops/jagged_dense_dense_elementwise_add_jagged_output_forward.cu -o CMakeFiles/fbgemm_gpu_py.dir/src/jagged_tensor_ops/jagged_dense_dense_elementwise_add_jagged_output_forward.cu.o 2025-05-07T20:01:53.8052735Z [561/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_py_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -Xcudafe 
--diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++17 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_py.dir/src/permute_multi_embedding_ops/permute_multi_embedding_ops.cu.o -MF CMakeFiles/fbgemm_gpu_py.dir/src/permute_multi_embedding_ops/permute_multi_embedding_ops.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/src/permute_multi_embedding_ops/permute_multi_embedding_ops.cu -o CMakeFiles/fbgemm_gpu_py.dir/src/permute_multi_embedding_ops/permute_multi_embedding_ops.cu.o 2025-05-07T20:01:54.1342833Z [562/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_py_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include 
-I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++17 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_py.dir/src/quantize_ops/quantize_fp8_rowwise.cu.o -MF CMakeFiles/fbgemm_gpu_py.dir/src/quantize_ops/quantize_fp8_rowwise.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/src/quantize_ops/quantize_fp8_rowwise.cu -o CMakeFiles/fbgemm_gpu_py.dir/src/quantize_ops/quantize_fp8_rowwise.cu.o 2025-05-07T20:01:54.1358354Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/src/quantize_ops/quantize_fp8_rowwise.cu(19): warning #20050-D: inline qualifier ignored for "__global__" function 2025-05-07T20:01:54.1359222Z 2025-05-07T20:01:54.1360078Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/src/quantize_ops/quantize_fp8_rowwise.cu(61): warning #20050-D: inline qualifier ignored for "__global__" function 2025-05-07T20:01:54.1360956Z 2025-05-07T20:01:54.1361652Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/src/quantize_ops/quantize_fp8_rowwise.cu(121): warning #20050-D: inline qualifier ignored for "__global__" function 2025-05-07T20:01:54.1362509Z 2025-05-07T20:01:54.1363379Z 
/__w/FBGEMM/FBGEMM/fbgemm_gpu/src/quantize_ops/quantize_fp8_rowwise.cu(161): warning #20050-D: inline qualifier ignored for "__global__" function 2025-05-07T20:01:54.1364239Z 2025-05-07T20:01:54.1364927Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/src/quantize_ops/quantize_fp8_rowwise.cu(19): warning #20050-D: inline qualifier ignored for "__global__" function 2025-05-07T20:01:54.1365801Z 2025-05-07T20:01:54.1366488Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/src/quantize_ops/quantize_fp8_rowwise.cu(61): warning #20050-D: inline qualifier ignored for "__global__" function 2025-05-07T20:01:54.1367329Z 2025-05-07T20:01:54.1368044Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/src/quantize_ops/quantize_fp8_rowwise.cu(121): warning #20050-D: inline qualifier ignored for "__global__" function 2025-05-07T20:01:54.1368896Z 2025-05-07T20:01:54.1369591Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/src/quantize_ops/quantize_fp8_rowwise.cu(161): warning #20050-D: inline qualifier ignored for "__global__" function 2025-05-07T20:01:54.1370451Z 2025-05-07T20:01:54.1371145Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/src/quantize_ops/quantize_fp8_rowwise.cu(19): warning #20050-D: inline qualifier ignored for "__global__" function 2025-05-07T20:01:54.1371992Z 2025-05-07T20:01:54.1372697Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/src/quantize_ops/quantize_fp8_rowwise.cu(61): warning #20050-D: inline qualifier ignored for "__global__" function 2025-05-07T20:01:54.1373566Z 2025-05-07T20:01:54.1374259Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/src/quantize_ops/quantize_fp8_rowwise.cu(121): warning #20050-D: inline qualifier ignored for "__global__" function 2025-05-07T20:01:54.1375128Z 2025-05-07T20:01:54.1375832Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/src/quantize_ops/quantize_fp8_rowwise.cu(161): warning #20050-D: inline qualifier ignored for "__global__" function 2025-05-07T20:01:54.1376687Z 2025-05-07T20:01:55.0228620Z [563/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC 
-DUSE_TENSORPIPE -Dfbgemm_gpu_py_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++17 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_py.dir/src/quantize_ops/quantize_bfloat16.cu.o -MF 
CMakeFiles/fbgemm_gpu_py.dir/src/quantize_ops/quantize_bfloat16.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/src/quantize_ops/quantize_bfloat16.cu -o CMakeFiles/fbgemm_gpu_py.dir/src/quantize_ops/quantize_bfloat16.cu.o 2025-05-07T20:01:55.5552737Z [564/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_py_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr 
--expt-extended-lambda -O3 -DNDEBUG -std=c++17 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_py.dir/src/quantize_ops/quantize_fused_nbit_rowwise.cu.o -MF CMakeFiles/fbgemm_gpu_py.dir/src/quantize_ops/quantize_fused_nbit_rowwise.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/src/quantize_ops/quantize_fused_nbit_rowwise.cu -o CMakeFiles/fbgemm_gpu_py.dir/src/quantize_ops/quantize_fused_nbit_rowwise.cu.o 2025-05-07T20:01:55.5573759Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/src/quantize_ops/quantize_fused_nbit_rowwise.cu(19): warning #20050-D: inline qualifier ignored for "__global__" function 2025-05-07T20:01:55.5574969Z 2025-05-07T20:01:55.5576047Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/src/quantize_ops/quantize_fused_nbit_rowwise.cu(78): warning #20050-D: inline qualifier ignored for "__global__" function 2025-05-07T20:01:55.5577284Z 2025-05-07T20:01:55.5578321Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/src/quantize_ops/quantize_fused_nbit_rowwise.cu(19): warning #20050-D: inline qualifier ignored for "__global__" function 2025-05-07T20:01:55.5579564Z 2025-05-07T20:01:55.5580584Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/src/quantize_ops/quantize_fused_nbit_rowwise.cu(78): warning #20050-D: inline qualifier ignored for "__global__" function 2025-05-07T20:01:55.5581844Z 2025-05-07T20:01:55.5582855Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/src/quantize_ops/quantize_fused_nbit_rowwise.cu(19): warning #20050-D: inline qualifier ignored for "__global__" function 2025-05-07T20:01:55.5584339Z 2025-05-07T20:01:55.5585360Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/src/quantize_ops/quantize_fused_nbit_rowwise.cu(78): warning #20050-D: inline qualifier ignored for "__global__" function 2025-05-07T20:01:55.5586573Z 2025-05-07T20:01:55.7099978Z [565/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO 
-DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_py_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++17 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_py.dir/src/quantize_ops/quantize_fused_8bit_rowwise.cu.o -MF 
CMakeFiles/fbgemm_gpu_py.dir/src/quantize_ops/quantize_fused_8bit_rowwise.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/src/quantize_ops/quantize_fused_8bit_rowwise.cu -o CMakeFiles/fbgemm_gpu_py.dir/src/quantize_ops/quantize_fused_8bit_rowwise.cu.o 2025-05-07T20:01:55.7119969Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/src/quantize_ops/quantize_fused_8bit_rowwise.cu(19): warning #20050-D: inline qualifier ignored for "__global__" function 2025-05-07T20:01:55.7121117Z 2025-05-07T20:01:55.7122051Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/src/quantize_ops/quantize_fused_8bit_rowwise.cu(52): warning #20050-D: inline qualifier ignored for "__global__" function 2025-05-07T20:01:55.7123340Z 2025-05-07T20:01:55.7124288Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/src/quantize_ops/quantize_fused_8bit_rowwise.cu(118): warning #20050-D: inline qualifier ignored for "__global__" function 2025-05-07T20:01:55.7125303Z 2025-05-07T20:01:55.7126171Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/src/quantize_ops/quantize_fused_8bit_rowwise.cu(154): warning #20050-D: inline qualifier ignored for "__global__" function 2025-05-07T20:01:55.7127289Z 2025-05-07T20:01:55.7128221Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/src/quantize_ops/quantize_fused_8bit_rowwise.cu(195): warning #20050-D: inline qualifier ignored for "__global__" function 2025-05-07T20:01:55.7129362Z 2025-05-07T20:01:55.7130296Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/src/quantize_ops/quantize_fused_8bit_rowwise.cu(19): warning #20050-D: inline qualifier ignored for "__global__" function 2025-05-07T20:01:55.7131438Z 2025-05-07T20:01:55.7132364Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/src/quantize_ops/quantize_fused_8bit_rowwise.cu(52): warning #20050-D: inline qualifier ignored for "__global__" function 2025-05-07T20:01:55.7133617Z 2025-05-07T20:01:55.7134549Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/src/quantize_ops/quantize_fused_8bit_rowwise.cu(118): warning #20050-D: inline qualifier ignored for "__global__" function 2025-05-07T20:01:55.7135680Z 2025-05-07T20:01:55.7136691Z 
/__w/FBGEMM/FBGEMM/fbgemm_gpu/src/quantize_ops/quantize_fused_8bit_rowwise.cu(154): warning #20050-D: inline qualifier ignored for "__global__" function 2025-05-07T20:01:55.7137816Z 2025-05-07T20:01:55.7138771Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/src/quantize_ops/quantize_fused_8bit_rowwise.cu(195): warning #20050-D: inline qualifier ignored for "__global__" function 2025-05-07T20:01:55.7139924Z 2025-05-07T20:01:55.7140931Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/src/quantize_ops/quantize_fused_8bit_rowwise.cu(19): warning #20050-D: inline qualifier ignored for "__global__" function 2025-05-07T20:01:55.7142068Z 2025-05-07T20:01:55.7142999Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/src/quantize_ops/quantize_fused_8bit_rowwise.cu(52): warning #20050-D: inline qualifier ignored for "__global__" function 2025-05-07T20:01:55.7144110Z 2025-05-07T20:01:55.7145072Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/src/quantize_ops/quantize_fused_8bit_rowwise.cu(118): warning #20050-D: inline qualifier ignored for "__global__" function 2025-05-07T20:01:55.7146213Z 2025-05-07T20:01:55.7147153Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/src/quantize_ops/quantize_fused_8bit_rowwise.cu(154): warning #20050-D: inline qualifier ignored for "__global__" function 2025-05-07T20:01:55.7148306Z 2025-05-07T20:01:55.7149262Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/src/quantize_ops/quantize_fused_8bit_rowwise.cu(195): warning #20050-D: inline qualifier ignored for "__global__" function 2025-05-07T20:01:55.7150439Z 2025-05-07T20:01:56.3697909Z [566/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_py_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include 
-I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++17 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_py.dir/src/quantize_ops/quantize_padded_fp8_rowwise.cu.o -MF CMakeFiles/fbgemm_gpu_py.dir/src/quantize_ops/quantize_padded_fp8_rowwise.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/src/quantize_ops/quantize_padded_fp8_rowwise.cu -o CMakeFiles/fbgemm_gpu_py.dir/src/quantize_ops/quantize_padded_fp8_rowwise.cu.o 2025-05-07T20:01:56.3719626Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/src/quantize_ops/quantize_padded_fp8_rowwise.cu(19): warning #20050-D: inline 
qualifier ignored for "__global__" function 2025-05-07T20:01:56.3720906Z 2025-05-07T20:01:56.3722137Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/src/quantize_ops/quantize_padded_fp8_rowwise.cu(94): warning #20050-D: inline qualifier ignored for "__global__" function 2025-05-07T20:01:56.3723401Z 2025-05-07T20:01:56.3724484Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/src/quantize_ops/quantize_padded_fp8_rowwise.cu(110): warning #20050-D: inline qualifier ignored for "__global__" function 2025-05-07T20:01:56.3725765Z 2025-05-07T20:01:56.3726915Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/src/quantize_ops/quantize_padded_fp8_rowwise.cu(137): warning #20050-D: inline qualifier ignored for "__global__" function 2025-05-07T20:01:56.3728161Z 2025-05-07T20:01:56.3729214Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/src/quantize_ops/quantize_padded_fp8_rowwise.cu(166): warning #20050-D: inline qualifier ignored for "__global__" function 2025-05-07T20:01:56.3730478Z 2025-05-07T20:01:56.3731506Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/src/quantize_ops/quantize_padded_fp8_rowwise.cu(19): warning #20050-D: inline qualifier ignored for "__global__" function 2025-05-07T20:01:56.3732763Z 2025-05-07T20:01:56.3733799Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/src/quantize_ops/quantize_padded_fp8_rowwise.cu(94): warning #20050-D: inline qualifier ignored for "__global__" function 2025-05-07T20:01:56.3735047Z 2025-05-07T20:01:56.3736092Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/src/quantize_ops/quantize_padded_fp8_rowwise.cu(110): warning #20050-D: inline qualifier ignored for "__global__" function 2025-05-07T20:01:56.3737333Z 2025-05-07T20:01:56.3738375Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/src/quantize_ops/quantize_padded_fp8_rowwise.cu(137): warning #20050-D: inline qualifier ignored for "__global__" function 2025-05-07T20:01:56.3739655Z 2025-05-07T20:01:56.3740693Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/src/quantize_ops/quantize_padded_fp8_rowwise.cu(166): warning #20050-D: inline qualifier ignored for "__global__" function 2025-05-07T20:01:56.3741946Z 2025-05-07T20:01:56.3742987Z 
/__w/FBGEMM/FBGEMM/fbgemm_gpu/src/quantize_ops/quantize_padded_fp8_rowwise.cu(19): warning #20050-D: inline qualifier ignored for "__global__" function 2025-05-07T20:01:56.3744238Z 2025-05-07T20:01:56.3745264Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/src/quantize_ops/quantize_padded_fp8_rowwise.cu(94): warning #20050-D: inline qualifier ignored for "__global__" function 2025-05-07T20:01:56.3746530Z 2025-05-07T20:01:56.3747572Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/src/quantize_ops/quantize_padded_fp8_rowwise.cu(110): warning #20050-D: inline qualifier ignored for "__global__" function 2025-05-07T20:01:56.3748819Z 2025-05-07T20:01:56.3749842Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/src/quantize_ops/quantize_padded_fp8_rowwise.cu(137): warning #20050-D: inline qualifier ignored for "__global__" function 2025-05-07T20:01:56.3751219Z 2025-05-07T20:01:56.3752244Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/src/quantize_ops/quantize_padded_fp8_rowwise.cu(166): warning #20050-D: inline qualifier ignored for "__global__" function 2025-05-07T20:01:56.3753603Z 2025-05-07T20:01:58.1737025Z [567/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_py_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem 
/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++17 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_py.dir/src/quantize_ops/quantize_msfp.cu.o -MF CMakeFiles/fbgemm_gpu_py.dir/src/quantize_ops/quantize_msfp.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/src/quantize_ops/quantize_msfp.cu -o CMakeFiles/fbgemm_gpu_py.dir/src/quantize_ops/quantize_msfp.cu.o 2025-05-07T20:01:58.1757485Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/src/quantize_ops/quantize_msfp.cu(73): warning #20050-D: inline qualifier ignored for "__global__" function 2025-05-07T20:01:58.1758662Z 2025-05-07T20:01:58.1759607Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/src/quantize_ops/quantize_msfp.cu(73): warning #20050-D: inline qualifier ignored for "__global__" function 2025-05-07T20:01:58.1760787Z 2025-05-07T20:01:58.1761714Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/src/quantize_ops/quantize_msfp.cu(73): warning #20050-D: inline qualifier ignored for "__global__" function 2025-05-07T20:01:58.1762880Z 
2025-05-07T20:01:59.3235140Z [568/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_py_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++17 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ 
-D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_py.dir/src/quantize_ops/quantize_hfp8.cu.o -MF CMakeFiles/fbgemm_gpu_py.dir/src/quantize_ops/quantize_hfp8.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/src/quantize_ops/quantize_hfp8.cu -o CMakeFiles/fbgemm_gpu_py.dir/src/quantize_ops/quantize_hfp8.cu.o 2025-05-07T20:02:02.5596760Z [569/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_py_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -Xcudafe 
--diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++17 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_py.dir/src/sparse_ops/sparse_async_batched_cumsum.cu.o -MF CMakeFiles/fbgemm_gpu_py.dir/src/sparse_ops/sparse_async_batched_cumsum.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/src/sparse_ops/sparse_async_batched_cumsum.cu -o CMakeFiles/fbgemm_gpu_py.dir/src/sparse_ops/sparse_async_batched_cumsum.cu.o 2025-05-07T20:02:02.5618281Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:02:02.5620252Z 2025-05-07T20:02:02.5621960Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:02:02.5624087Z 2025-05-07T20:02:02.5625152Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/src/sparse_ops/sparse_async_batched_cumsum.cu(16): warning #177-D: variable "kMaxThreads" was declared but never referenced 2025-05-07T20:02:02.5626440Z 2025-05-07T20:02:02.5628128Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:02:02.5629994Z 2025-05-07T20:02:02.5631708Z 
/__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:02:02.5633671Z 2025-05-07T20:02:02.5634728Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/src/sparse_ops/sparse_async_batched_cumsum.cu(16): warning #177-D: variable "kMaxThreads" was declared but never referenced 2025-05-07T20:02:02.5636156Z 2025-05-07T20:02:02.5637949Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:02:02.5639811Z 2025-05-07T20:02:02.5641493Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:02:02.5643366Z 2025-05-07T20:02:02.5753944Z [570/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_py_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem 
/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++17 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_py.dir/src/sparse_ops/sparse_bucketize_features.cu.o -MF CMakeFiles/fbgemm_gpu_py.dir/src/sparse_ops/sparse_bucketize_features.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/src/sparse_ops/sparse_bucketize_features.cu -o CMakeFiles/fbgemm_gpu_py.dir/src/sparse_ops/sparse_bucketize_features.cu.o 2025-05-07T20:02:02.5775512Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:02:02.5777436Z 2025-05-07T20:02:02.5779142Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first 
declaration 2025-05-07T20:02:02.5781069Z 2025-05-07T20:02:02.5782714Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:02:02.5784577Z 2025-05-07T20:02:02.5786278Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:02:02.5788420Z 2025-05-07T20:02:02.5790183Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:02:02.5791918Z 2025-05-07T20:02:02.5793646Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:02:02.5795518Z 2025-05-07T20:02:03.1945518Z [571/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_py_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include 
-I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++17 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_py.dir/src/sparse_ops/sparse_batched_unary_embeddings.cu.o -MF CMakeFiles/fbgemm_gpu_py.dir/src/sparse_ops/sparse_batched_unary_embeddings.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/src/sparse_ops/sparse_batched_unary_embeddings.cu -o CMakeFiles/fbgemm_gpu_py.dir/src/sparse_ops/sparse_batched_unary_embeddings.cu.o 2025-05-07T20:02:03.1967438Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:02:03.1969323Z 2025-05-07T20:02:03.1971104Z 
/__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:02:03.1973028Z 2025-05-07T20:02:03.1974725Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:02:03.1976558Z 2025-05-07T20:02:03.1978199Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:02:03.1980395Z 2025-05-07T20:02:03.1982212Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:02:03.1984081Z 2025-05-07T20:02:03.1985784Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:02:03.1987751Z 2025-05-07T20:02:07.1130892Z [572/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_py_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include 
-I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++17 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_py.dir/src/jagged_tensor_ops/keyed_jagged_index_select_dim1.cu.o -MF CMakeFiles/fbgemm_gpu_py.dir/src/jagged_tensor_ops/keyed_jagged_index_select_dim1.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/src/jagged_tensor_ops/keyed_jagged_index_select_dim1.cu -o CMakeFiles/fbgemm_gpu_py.dir/src/jagged_tensor_ops/keyed_jagged_index_select_dim1.cu.o 2025-05-07T20:02:08.0517330Z [573/608] /github/home/miniconda/envs/build_binary/bin/nvcc 
-forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_py_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++17 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT 
CMakeFiles/fbgemm_gpu_py.dir/src/sparse_ops/sparse_compute_frequency_sequence.cu.o -MF CMakeFiles/fbgemm_gpu_py.dir/src/sparse_ops/sparse_compute_frequency_sequence.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/src/sparse_ops/sparse_compute_frequency_sequence.cu -o CMakeFiles/fbgemm_gpu_py.dir/src/sparse_ops/sparse_compute_frequency_sequence.cu.o 2025-05-07T20:02:08.0538830Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:02:08.0540726Z 2025-05-07T20:02:08.0542427Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:02:08.0544327Z 2025-05-07T20:02:08.0546028Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:02:08.0547905Z 2025-05-07T20:02:08.0549579Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:02:08.0551481Z 2025-05-07T20:02:08.0553243Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:02:08.0555141Z 2025-05-07T20:02:08.0556869Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a 
function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:02:08.0558975Z 2025-05-07T20:02:08.2647029Z [574/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_embedding_inplace_ops_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++17 -Xcompiler=-fPIC 
-Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_embedding_inplace_ops.dir/src/embedding_inplace_ops/embedding_inplace_update.cu.o -MF CMakeFiles/fbgemm_gpu_embedding_inplace_ops.dir/src/embedding_inplace_ops/embedding_inplace_update.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/src/embedding_inplace_ops/embedding_inplace_update.cu -o CMakeFiles/fbgemm_gpu_embedding_inplace_ops.dir/src/embedding_inplace_ops/embedding_inplace_update.cu.o 2025-05-07T20:02:08.8572511Z [575/608] : && /github/home/miniconda/envs/build_binary/bin/c++ -fPIC -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/lib -fopenmp=libgomp -stdlib=libstdc++ -I/github/home/miniconda/envs/build_binary/include -O3 -DNDEBUG -Wl,-O2 -Wl,--sort-common -Wl,--as-needed -Wl,-z,relro -Wl,-z,now -Wl,--disable-new-dtags -Wl,--gc-sections -Wl,--allow-shlib-undefined -Wl,-rpath,/github/home/miniconda/envs/build_binary/lib -Wl,-rpath-link,/github/home/miniconda/envs/build_binary/lib -L/github/home/miniconda/envs/build_binary/lib -s -shared -Wl,-soname,fbgemm_gpu_embedding_inplace_ops.so -o fbgemm_gpu_embedding_inplace_ops.so CMakeFiles/fbgemm_gpu_embedding_inplace_ops.dir/src/embedding_inplace_ops/embedding_inplace_update_cpu.cpp.o CMakeFiles/fbgemm_gpu_embedding_inplace_ops.dir/src/embedding_inplace_ops/embedding_inplace_update_gpu.cpp.o CMakeFiles/fbgemm_gpu_embedding_inplace_ops.dir/src/embedding_inplace_ops/embedding_inplace_update.cu.o -L/lib/intel64 -L/lib/intel64_win -L/lib/win-x64 -L/github/home/miniconda/envs/build_binary/lib/stubs -Wl,-rpath,/lib/intel64:/lib/intel64_win:/lib/win-x64:/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/lib:/github/home/miniconda/envs/build_binary/lib/stubs: 
/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/lib/libtorch.so /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/lib/libc10.so /github/home/miniconda/envs/build_binary/lib/libnvrtc.so /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/lib/libc10_cuda.so /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/lib/libnccl.so.2 /github/home/miniconda/envs/build_binary/lib/stubs/libcuda.so /github/home/miniconda/envs/build_binary/lib/stubs/libnvidia-ml.so -Wl,--no-as-needed,"/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/lib/libtorch_cpu.so" -Wl,--as-needed -Wl,--no-as-needed,"/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/lib/libtorch_cuda.so" -Wl,--as-needed /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/lib/libc10_cuda.so /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/lib/libc10.so /github/home/miniconda/envs/build_binary/lib/libcudart.so -Wl,--no-as-needed,"/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/lib/libtorch.so" -Wl,--as-needed -lcudadevrt -lcudart_static -ldl && : 2025-05-07T20:02:08.8646192Z [576/608] cd /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build && bash /__w/FBGEMM/FBGEMM/fbgemm_gpu/../.github/scripts/fbgemm_gpu_postbuild.bash /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/fbgemm_gpu_embedding_inplace_ops.so 2025-05-07T20:02:08.8648595Z ################################################################################ 2025-05-07T20:02:08.8649230Z [CMAKE] Running post-build script ... 2025-05-07T20:02:08.8650393Z Target file: /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/fbgemm_gpu_embedding_inplace_ops.so 2025-05-07T20:02:08.8651538Z Removing all RPATHs ... 
2025-05-07T20:02:08.8652148Z ################################################################################ 2025-05-07T20:02:09.0194239Z [577/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_py_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++17 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion 
-Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_py.dir/src/sparse_ops/sparse_expand_into_jagged_permute.cu.o -MF CMakeFiles/fbgemm_gpu_py.dir/src/sparse_ops/sparse_expand_into_jagged_permute.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/src/sparse_ops/sparse_expand_into_jagged_permute.cu -o CMakeFiles/fbgemm_gpu_py.dir/src/sparse_ops/sparse_expand_into_jagged_permute.cu.o 2025-05-07T20:02:09.0216502Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:02:09.0218425Z 2025-05-07T20:02:09.0220114Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:02:09.0222025Z 2025-05-07T20:02:09.0223727Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:02:09.0225788Z 2025-05-07T20:02:09.0227459Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:02:09.0229288Z 2025-05-07T20:02:09.0231091Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:02:09.0233080Z 2025-05-07T20:02:09.0234868Z 
/__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:02:09.0236791Z 2025-05-07T20:02:10.8935884Z [578/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_py_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -Xcudafe 
--diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++17 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_py.dir/src/sparse_ops/sparse_group_index.cu.o -MF CMakeFiles/fbgemm_gpu_py.dir/src/sparse_ops/sparse_group_index.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/src/sparse_ops/sparse_group_index.cu -o CMakeFiles/fbgemm_gpu_py.dir/src/sparse_ops/sparse_group_index.cu.o 2025-05-07T20:02:10.8957285Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:02:10.8959169Z 2025-05-07T20:02:10.8960784Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:02:10.8962706Z 2025-05-07T20:02:10.8964429Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:02:10.8966488Z 2025-05-07T20:02:10.8968189Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 
2025-05-07T20:02:10.8969998Z 2025-05-07T20:02:10.8971771Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:02:10.8973562Z 2025-05-07T20:02:10.8975311Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:02:10.8977108Z 2025-05-07T20:02:11.5784355Z [579/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_py_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/include 
-DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++17 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_py.dir/src/sparse_ops/sparse_block_bucketize_features.cu.o -MF CMakeFiles/fbgemm_gpu_py.dir/src/sparse_ops/sparse_block_bucketize_features.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/src/sparse_ops/sparse_block_bucketize_features.cu -o CMakeFiles/fbgemm_gpu_py.dir/src/sparse_ops/sparse_block_bucketize_features.cu.o 2025-05-07T20:02:11.5806337Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:02:11.5808249Z 2025-05-07T20:02:11.5809907Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:02:11.5811667Z 2025-05-07T20:02:11.5813345Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:02:11.5815447Z 2025-05-07T20:02:11.5817130Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning 
#20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:02:11.5819034Z 2025-05-07T20:02:11.5820857Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:02:11.5822743Z 2025-05-07T20:02:11.5824528Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:02:11.5826411Z 2025-05-07T20:02:14.1710232Z [580/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_py_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem 
/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++17 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_py.dir/src/jagged_tensor_ops/jagged_to_padded_dense_forward.cu.o -MF CMakeFiles/fbgemm_gpu_py.dir/src/jagged_tensor_ops/jagged_to_padded_dense_forward.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/src/jagged_tensor_ops/jagged_to_padded_dense_forward.cu -o CMakeFiles/fbgemm_gpu_py.dir/src/jagged_tensor_ops/jagged_to_padded_dense_forward.cu.o 2025-05-07T20:02:16.0288670Z [581/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_py_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include 
-I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++17 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_py.dir/src/sparse_ops/sparse_index_add.cu.o -MF CMakeFiles/fbgemm_gpu_py.dir/src/sparse_ops/sparse_index_add.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/src/sparse_ops/sparse_index_add.cu -o CMakeFiles/fbgemm_gpu_py.dir/src/sparse_ops/sparse_index_add.cu.o 2025-05-07T20:02:16.0299587Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:02:16.0300567Z 2025-05-07T20:02:16.0301442Z 
/__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:02:16.0302603Z 2025-05-07T20:02:16.0303456Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:02:16.0304432Z 2025-05-07T20:02:16.0305293Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:02:16.0306264Z 2025-05-07T20:02:16.0307139Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:02:16.0308095Z 2025-05-07T20:02:16.0308959Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:02:16.0310012Z 2025-05-07T20:02:17.0517558Z [582/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_py_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include 
-I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++17 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_py.dir/src/sparse_ops/sparse_index_select.cu.o -MF CMakeFiles/fbgemm_gpu_py.dir/src/sparse_ops/sparse_index_select.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/src/sparse_ops/sparse_index_select.cu -o CMakeFiles/fbgemm_gpu_py.dir/src/sparse_ops/sparse_index_select.cu.o 2025-05-07T20:02:17.0537009Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a 
function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:02:17.0538669Z 2025-05-07T20:02:17.0539889Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:02:17.0541444Z 2025-05-07T20:02:17.0542553Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:02:17.0544122Z 2025-05-07T20:02:17.0545560Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:02:17.0547250Z 2025-05-07T20:02:17.0548873Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:02:17.0550625Z 2025-05-07T20:02:17.0552266Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:02:17.0554376Z 2025-05-07T20:02:23.4632907Z [583/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_py_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src 
-I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++17 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_py.dir/src/sparse_ops/sparse_invert_permute.cu.o -MF CMakeFiles/fbgemm_gpu_py.dir/src/sparse_ops/sparse_invert_permute.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/src/sparse_ops/sparse_invert_permute.cu -o CMakeFiles/fbgemm_gpu_py.dir/src/sparse_ops/sparse_invert_permute.cu.o 2025-05-07T20:02:23.4652395Z 
/__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:02:23.4654081Z 2025-05-07T20:02:23.4655601Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:02:23.4657392Z 2025-05-07T20:02:23.4658878Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:02:23.4660592Z 2025-05-07T20:02:23.4662191Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:02:23.4663944Z 2025-05-07T20:02:23.4665442Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:02:23.4667103Z 2025-05-07T20:02:23.4668575Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:02:23.4670528Z 2025-05-07T20:02:26.5805323Z [584/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_py_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include 
-I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++17 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_py.dir/src/sparse_ops/sparse_pack_segments_backward.cu.o -MF CMakeFiles/fbgemm_gpu_py.dir/src/sparse_ops/sparse_pack_segments_backward.cu.o.d -x cu -c 
/__w/FBGEMM/FBGEMM/fbgemm_gpu/src/sparse_ops/sparse_pack_segments_backward.cu -o CMakeFiles/fbgemm_gpu_py.dir/src/sparse_ops/sparse_pack_segments_backward.cu.o 2025-05-07T20:02:26.5824745Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:02:26.5826446Z 2025-05-07T20:02:26.5827993Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:02:26.5829719Z 2025-05-07T20:02:26.5831235Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:02:26.5833021Z 2025-05-07T20:02:26.5834444Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:02:26.5836039Z 2025-05-07T20:02:26.5837518Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:02:26.5839149Z 2025-05-07T20:02:26.5840672Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:02:26.5842518Z 2025-05-07T20:02:28.5374828Z [585/608] /github/home/miniconda/envs/build_binary/bin/nvcc 
-forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_py_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++17 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT 
CMakeFiles/fbgemm_gpu_py.dir/src/sparse_ops/sparse_permute102.cu.o -MF CMakeFiles/fbgemm_gpu_py.dir/src/sparse_ops/sparse_permute102.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/src/sparse_ops/sparse_permute102.cu -o CMakeFiles/fbgemm_gpu_py.dir/src/sparse_ops/sparse_permute102.cu.o 2025-05-07T20:02:28.5393753Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:02:28.5395525Z 2025-05-07T20:02:28.5397056Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:02:28.5398805Z 2025-05-07T20:02:28.5400338Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:02:28.5402253Z 2025-05-07T20:02:28.5403779Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:02:28.5405511Z 2025-05-07T20:02:28.5407011Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:02:28.5408704Z 2025-05-07T20:02:28.5410214Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 
2025-05-07T20:02:28.5413524Z 2025-05-07T20:02:29.8764378Z [586/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_py_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++17 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ 
-D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_py.dir/src/sparse_ops/sparse_permute_1d.cu.o -MF CMakeFiles/fbgemm_gpu_py.dir/src/sparse_ops/sparse_permute_1d.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/src/sparse_ops/sparse_permute_1d.cu -o CMakeFiles/fbgemm_gpu_py.dir/src/sparse_ops/sparse_permute_1d.cu.o 2025-05-07T20:02:29.8783959Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:02:29.8785766Z 2025-05-07T20:02:29.8787344Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:02:29.8789162Z 2025-05-07T20:02:29.8790663Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:02:29.8792387Z 2025-05-07T20:02:29.8794042Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:02:29.8795781Z 2025-05-07T20:02:29.8797311Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:02:29.8799180Z 2025-05-07T20:02:29.8800816Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a 
function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:02:29.8802967Z 2025-05-07T20:02:30.1647238Z [587/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_py_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++17 -Xcompiler=-fPIC 
-Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_py.dir/src/sparse_ops/sparse_permute_embeddings.cu.o -MF CMakeFiles/fbgemm_gpu_py.dir/src/sparse_ops/sparse_permute_embeddings.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/src/sparse_ops/sparse_permute_embeddings.cu -o CMakeFiles/fbgemm_gpu_py.dir/src/sparse_ops/sparse_permute_embeddings.cu.o 2025-05-07T20:02:30.1666850Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:02:30.1668644Z 2025-05-07T20:02:30.1670200Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:02:30.1671983Z 2025-05-07T20:02:30.1673720Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:02:30.1675460Z 2025-05-07T20:02:30.1677048Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:02:30.1678884Z 2025-05-07T20:02:30.1680424Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:02:30.1682190Z 2025-05-07T20:02:30.1683825Z 
/__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:02:30.1685882Z 2025-05-07T20:02:30.5489957Z [588/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_py_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -Xcudafe 
--diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++17 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_py.dir/src/sparse_ops/sparse_range.cu.o -MF CMakeFiles/fbgemm_gpu_py.dir/src/sparse_ops/sparse_range.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/src/sparse_ops/sparse_range.cu -o CMakeFiles/fbgemm_gpu_py.dir/src/sparse_ops/sparse_range.cu.o 2025-05-07T20:02:30.5500702Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:02:30.5501647Z 2025-05-07T20:02:30.5502857Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:02:30.5503857Z 2025-05-07T20:02:30.5504713Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:02:30.5505674Z 2025-05-07T20:02:30.5506545Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:02:30.5507519Z 
2025-05-07T20:02:30.5508395Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:02:30.5509353Z 2025-05-07T20:02:30.5510212Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:02:30.5511283Z 2025-05-07T20:02:31.0808205Z [589/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_py_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/include -DONNX_NAMESPACE=onnx_c2 -gencode 
arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++17 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_py.dir/src/sparse_ops/sparse_pack_segments_forward.cu.o -MF CMakeFiles/fbgemm_gpu_py.dir/src/sparse_ops/sparse_pack_segments_forward.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/src/sparse_ops/sparse_pack_segments_forward.cu -o CMakeFiles/fbgemm_gpu_py.dir/src/sparse_ops/sparse_pack_segments_forward.cu.o 2025-05-07T20:02:31.0819375Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:02:31.0820324Z 2025-05-07T20:02:31.0821187Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:02:31.0822128Z 2025-05-07T20:02:31.0822954Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:02:31.0823901Z 2025-05-07T20:02:31.0824736Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on 
a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:02:31.0825689Z 2025-05-07T20:02:31.0826519Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:02:31.0827454Z 2025-05-07T20:02:31.0828313Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:02:31.0829313Z 2025-05-07T20:02:31.8547113Z [590/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_py_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem 
/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++17 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_py.dir/src/sparse_ops/sparse_permute_2d.cu.o -MF CMakeFiles/fbgemm_gpu_py.dir/src/sparse_ops/sparse_permute_2d.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/src/sparse_ops/sparse_permute_2d.cu -o CMakeFiles/fbgemm_gpu_py.dir/src/sparse_ops/sparse_permute_2d.cu.o 2025-05-07T20:02:31.8558382Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:02:31.8559376Z 2025-05-07T20:02:31.8560245Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:02:31.8561228Z 2025-05-07T20:02:31.8562107Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:02:31.8563076Z 
2025-05-07T20:02:31.8563951Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:02:31.8564912Z 2025-05-07T20:02:31.8565864Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:02:31.8566808Z 2025-05-07T20:02:31.8567642Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:02:31.8568660Z 2025-05-07T20:02:32.5091329Z [591/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_py_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem 
/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++17 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_py.dir/src/sparse_ops/sparse_zipf.cu.o -MF CMakeFiles/fbgemm_gpu_py.dir/src/sparse_ops/sparse_zipf.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/src/sparse_ops/sparse_zipf.cu -o CMakeFiles/fbgemm_gpu_py.dir/src/sparse_ops/sparse_zipf.cu.o 2025-05-07T20:02:32.5102294Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:02:32.5103439Z 2025-05-07T20:02:32.5104333Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:02:32.5105316Z 2025-05-07T20:02:32.5106185Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly 
defaulted on its first declaration 2025-05-07T20:02:32.5107159Z 2025-05-07T20:02:32.5108024Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:02:32.5108993Z 2025-05-07T20:02:32.5109856Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:02:32.5110804Z 2025-05-07T20:02:32.5111664Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:02:32.5112722Z 2025-05-07T20:02:34.0991316Z [592/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_py_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include 
-I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++17 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_py.dir/src/sparse_ops/sparse_segment_sum_csr.cu.o -MF CMakeFiles/fbgemm_gpu_py.dir/src/sparse_ops/sparse_segment_sum_csr.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/src/sparse_ops/sparse_segment_sum_csr.cu -o CMakeFiles/fbgemm_gpu_py.dir/src/sparse_ops/sparse_segment_sum_csr.cu.o 2025-05-07T20:02:34.1002652Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:02:34.1003630Z 2025-05-07T20:02:34.1004519Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:02:34.1005493Z 2025-05-07T20:02:34.1006363Z 
/__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:02:34.1007320Z 2025-05-07T20:02:34.1008186Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:02:34.1009173Z 2025-05-07T20:02:34.1010024Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:02:34.1010977Z 2025-05-07T20:02:34.1011849Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:02:34.1012889Z 2025-05-07T20:02:35.0204609Z [593/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_py_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem 
/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++17 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_py.dir/src/sparse_ops/sparse_reorder_batched_ad.cu.o -MF CMakeFiles/fbgemm_gpu_py.dir/src/sparse_ops/sparse_reorder_batched_ad.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/src/sparse_ops/sparse_reorder_batched_ad.cu -o CMakeFiles/fbgemm_gpu_py.dir/src/sparse_ops/sparse_reorder_batched_ad.cu.o 2025-05-07T20:02:35.0215677Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:02:35.0216635Z 2025-05-07T20:02:35.0217480Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on 
a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:02:35.0218431Z 2025-05-07T20:02:35.0219268Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:02:35.0220199Z 2025-05-07T20:02:35.0221042Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:02:35.0222001Z 2025-05-07T20:02:35.0222839Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:02:35.0223777Z 2025-05-07T20:02:35.0224620Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:02:35.0225626Z 2025-05-07T20:02:37.0490559Z [594/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include 
-I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++17 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_partial_rowwise_adam_split_unweighted_nobag_kernel_cta.cu.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_partial_rowwise_adam_split_unweighted_nobag_kernel_cta.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_backward_partial_rowwise_adam_split_unweighted_nobag_kernel_cta.cu -o CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_partial_rowwise_adam_split_unweighted_nobag_kernel_cta.cu.o 2025-05-07T20:02:37.0502984Z 
/__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:02:37.0503969Z 2025-05-07T20:02:37.0504839Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:02:37.0505845Z 2025-05-07T20:02:37.0506700Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:02:37.0507662Z 2025-05-07T20:02:37.0508554Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:02:37.0509526Z 2025-05-07T20:02:37.0510388Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:02:37.0511412Z 2025-05-07T20:02:37.0512278Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:02:37.0513335Z 2025-05-07T20:02:38.0566898Z [595/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include 
-I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++17 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_partial_rowwise_adam_split_weighted_kernel_cta.cu.o -MF 
CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_partial_rowwise_adam_split_weighted_kernel_cta.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_backward_partial_rowwise_adam_split_weighted_kernel_cta.cu -o CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_partial_rowwise_adam_split_weighted_kernel_cta.cu.o 2025-05-07T20:02:38.0578080Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:02:38.0578981Z 2025-05-07T20:02:38.0579800Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:02:38.0580702Z 2025-05-07T20:02:38.0581502Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:02:38.0582392Z 2025-05-07T20:02:38.0583191Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:02:38.0584158Z 2025-05-07T20:02:38.0584944Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:02:38.0585827Z 2025-05-07T20:02:38.0586642Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ 
annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:02:38.0587529Z 2025-05-07T20:02:40.4353982Z [596/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++17 
-Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_partial_rowwise_adam_split_unweighted_kernel_cta.cu.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_partial_rowwise_adam_split_unweighted_kernel_cta.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_backward_partial_rowwise_adam_split_unweighted_kernel_cta.cu -o CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_partial_rowwise_adam_split_unweighted_kernel_cta.cu.o 2025-05-07T20:02:40.4366202Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:02:40.4367110Z 2025-05-07T20:02:40.4367910Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:02:40.4368818Z 2025-05-07T20:02:40.4369607Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:02:40.4370496Z 2025-05-07T20:02:40.4371364Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:02:40.4372254Z 2025-05-07T20:02:40.4373064Z 
/__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:02:40.4373948Z 2025-05-07T20:02:40.4374746Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:02:40.4375650Z 2025-05-07T20:02:41.7657717Z [597/608] : && /github/home/miniconda/envs/build_binary/bin/c++ -fPIC -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/lib -fopenmp=libgomp -stdlib=libstdc++ -I/github/home/miniconda/envs/build_binary/include -O3 -DNDEBUG -Wl,-O2 -Wl,--sort-common -Wl,--as-needed -Wl,-z,relro -Wl,-z,now -Wl,--disable-new-dtags -Wl,--gc-sections -Wl,--allow-shlib-undefined -Wl,-rpath,/github/home/miniconda/envs/build_binary/lib -Wl,-rpath-link,/github/home/miniconda/envs/build_binary/lib -L/github/home/miniconda/envs/build_binary/lib -s -shared -Wl,-soname,fbgemm_gpu_tbe_training_backward.so -o fbgemm_gpu_tbe_training_backward.so CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/codegen/training/backward/embedding_backward_dense_host_cpu.cpp.o CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_dense_split_cpu.cpp.o CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_split_adagrad_cpu.cpp.o CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_split_rowwise_adagrad_cpu.cpp.o CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_split_sgd_cpu.cpp.o CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_split_adam_cpu.cpp.o CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_split_lamb_cpu.cpp.o 
CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_split_lars_sgd_cpu.cpp.o CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_split_partial_rowwise_adam_cpu.cpp.o CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_split_partial_rowwise_lamb_cpu.cpp.o CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_split_none_cpu.cpp.o CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_split_rowwise_adagrad_with_counter_cpu.cpp.o CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_split_approx_sgd_cpu.cpp.o CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_split_approx_rowwise_adagrad_cpu.cpp.o CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_split_approx_rowwise_adagrad_with_counter_cpu.cpp.o CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_split_approx_rowwise_adagrad_with_weight_decay_cpu.cpp.o CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_split_rowwise_adagrad_with_weight_decay_cpu.cpp.o CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_split_rowwise_weighted_adagrad_cpu.cpp.o CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_adagrad_split_cpu.cpp.o CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_rowwise_adagrad_split_cpu.cpp.o CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_sgd_split_cpu.cpp.o CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_split_grad_embedding_ops.cu.o CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_split_indice_weights_codegen_cuda.cu.o CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_dense_indice_weights_codegen_cuda.cu.o CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_ssd_indice_weights_codegen_cuda.cu.o 
CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_rowwise_adagrad_ssd_weighted_cuda.cu.o CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_rowwise_adagrad_ssd_weighted_kernel_cta.cu.o CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_rowwise_adagrad_ssd_weighted_kernel_warp.cu.o CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_rowwise_adagrad_ssd_unweighted_nobag_cuda.cu.o CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_rowwise_adagrad_ssd_unweighted_nobag_kernel_cta.cu.o CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_rowwise_adagrad_ssd_unweighted_nobag_kernel_warp.cu.o CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_rowwise_adagrad_ssd_unweighted_cuda.cu.o CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_rowwise_adagrad_ssd_unweighted_kernel_cta.cu.o CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_rowwise_adagrad_ssd_unweighted_kernel_warp.cu.o CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_adagrad_split_weighted_cuda.cu.o CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_adagrad_split_weighted_kernel_cta.cu.o CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_adagrad_split_weighted_kernel_warp.cu.o CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_adagrad_split_unweighted_nobag_cuda.cu.o CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_adagrad_split_unweighted_nobag_kernel_cta.cu.o CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_adagrad_split_unweighted_nobag_kernel_warp.cu.o CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_adagrad_split_unweighted_cuda.cu.o CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_adagrad_split_unweighted_kernel_cta.cu.o 
CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_adagrad_split_unweighted_kernel_warp.cu.o CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_rowwise_adagrad_split_weighted_cuda.cu.o CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_rowwise_adagrad_split_weighted_kernel_cta.cu.o CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_rowwise_adagrad_split_weighted_kernel_warp.cu.o CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_rowwise_adagrad_split_unweighted_nobag_cuda.cu.o CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_rowwise_adagrad_split_unweighted_nobag_kernel_cta.cu.o CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_rowwise_adagrad_split_unweighted_nobag_kernel_warp.cu.o CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_rowwise_adagrad_split_unweighted_cuda.cu.o CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_rowwise_adagrad_split_unweighted_kernel_cta.cu.o CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_rowwise_adagrad_split_unweighted_kernel_warp.cu.o CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_sgd_split_weighted_cuda.cu.o CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_sgd_split_weighted_kernel_cta.cu.o CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_sgd_split_weighted_kernel_warp.cu.o CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_sgd_split_unweighted_nobag_cuda.cu.o CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_sgd_split_unweighted_nobag_kernel_cta.cu.o CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_sgd_split_unweighted_nobag_kernel_warp.cu.o CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_sgd_split_unweighted_cuda.cu.o 
CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_sgd_split_unweighted_kernel_cta.cu.o CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_sgd_split_unweighted_kernel_warp.cu.o CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_adam_split_weighted_cuda.cu.o CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_adam_split_weighted_kernel_cta.cu.o CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_adam_split_weighted_kernel_warp.cu.o CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_adam_split_unweighted_nobag_cuda.cu.o CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_adam_split_unweighted_nobag_kernel_cta.cu.o CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_adam_split_unweighted_nobag_kernel_warp.cu.o CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_adam_split_unweighted_cuda.cu.o CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_adam_split_unweighted_kernel_cta.cu.o CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_adam_split_unweighted_kernel_warp.cu.o CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_lamb_split_weighted_cuda.cu.o CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_lamb_split_weighted_kernel_cta.cu.o CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_lamb_split_weighted_kernel_warp.cu.o CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_lamb_split_unweighted_nobag_cuda.cu.o CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_lamb_split_unweighted_nobag_kernel_cta.cu.o CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_lamb_split_unweighted_nobag_kernel_warp.cu.o CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_lamb_split_unweighted_cuda.cu.o 
CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_lamb_split_unweighted_kernel_cta.cu.o CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_lamb_split_unweighted_kernel_warp.cu.o CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_lars_sgd_split_weighted_cuda.cu.o CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_lars_sgd_split_weighted_kernel_cta.cu.o CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_lars_sgd_split_weighted_kernel_warp.cu.o CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_lars_sgd_split_unweighted_nobag_cuda.cu.o CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_lars_sgd_split_unweighted_nobag_kernel_cta.cu.o CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_lars_sgd_split_unweighted_nobag_kernel_warp.cu.o CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_lars_sgd_split_unweighted_cuda.cu.o CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_lars_sgd_split_unweighted_kernel_cta.cu.o CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_lars_sgd_split_unweighted_kernel_warp.cu.o CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_partial_rowwise_adam_split_weighted_cuda.cu.o CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_partial_rowwise_adam_split_weighted_kernel_cta.cu.o CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_partial_rowwise_adam_split_weighted_kernel_warp.cu.o CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_partial_rowwise_adam_split_unweighted_nobag_cuda.cu.o CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_partial_rowwise_adam_split_unweighted_nobag_kernel_cta.cu.o CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_partial_rowwise_adam_split_unweighted_nobag_kernel_warp.cu.o 
CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_partial_rowwise_adam_split_unweighted_cuda.cu.o CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_partial_rowwise_adam_split_unweighted_kernel_cta.cu.o CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_partial_rowwise_adam_split_unweighted_kernel_warp.cu.o CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_partial_rowwise_lamb_split_weighted_cuda.cu.o CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_partial_rowwise_lamb_split_weighted_kernel_cta.cu.o CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_partial_rowwise_lamb_split_weighted_kernel_warp.cu.o CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_partial_rowwise_lamb_split_unweighted_nobag_cuda.cu.o CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_partial_rowwise_lamb_split_unweighted_nobag_kernel_cta.cu.o CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_partial_rowwise_lamb_split_unweighted_nobag_kernel_warp.cu.o CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_partial_rowwise_lamb_split_unweighted_cuda.cu.o CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_partial_rowwise_lamb_split_unweighted_kernel_cta.cu.o CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_partial_rowwise_lamb_split_unweighted_kernel_warp.cu.o CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_none_split_weighted_cuda.cu.o CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_none_split_weighted_kernel_cta.cu.o CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_none_split_weighted_kernel_warp.cu.o CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_none_split_unweighted_nobag_cuda.cu.o 
CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_none_split_unweighted_nobag_kernel_cta.cu.o CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_none_split_unweighted_nobag_kernel_warp.cu.o CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_none_split_unweighted_cuda.cu.o CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_none_split_unweighted_kernel_cta.cu.o CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_none_split_unweighted_kernel_warp.cu.o CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_rowwise_adagrad_with_counter_split_weighted_cuda.cu.o CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_rowwise_adagrad_with_counter_split_weighted_kernel_cta.cu.o CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_rowwise_adagrad_with_counter_split_weighted_kernel_warp.cu.o CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_rowwise_adagrad_with_counter_split_unweighted_nobag_cuda.cu.o CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_rowwise_adagrad_with_counter_split_unweighted_nobag_kernel_cta.cu.o CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_rowwise_adagrad_with_counter_split_unweighted_nobag_kernel_warp.cu.o CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_rowwise_adagrad_with_counter_split_unweighted_cuda.cu.o CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_rowwise_adagrad_with_counter_split_unweighted_kernel_cta.cu.o CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_rowwise_adagrad_with_counter_split_unweighted_kernel_warp.cu.o -L/lib/intel64 -L/lib/intel64_win -L/lib/win-x64 -L/github/home/miniconda/envs/build_binary/lib/stubs -Wl,-rpath,"\$ORIGIN" /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/lib/libc10.so 
/github/home/miniconda/envs/build_binary/lib/libnvrtc.so /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/lib/libc10_cuda.so /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/lib/libnccl.so.2 /github/home/miniconda/envs/build_binary/lib/stubs/libcuda.so fbgemm_gpu_tbe_cache.so fbgemm_gpu_tbe_common.so fbgemm_gpu_sparse_async_cumsum.so /github/home/miniconda/envs/build_binary/lib/stubs/libnvidia-ml.so -Wl,--no-as-needed,"/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/lib/libtorch.so" -Wl,--as-needed fbgemm.so fbgemm_gpu_config.so fbgemm_gpu_tbe_utils.so /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/lib/libtorch.so -Wl,--no-as-needed,"/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/lib/libtorch_cpu.so" -Wl,--as-needed -Wl,--no-as-needed,"/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/lib/libtorch_cuda.so" -Wl,--as-needed /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/lib/libc10_cuda.so /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/lib/libc10.so /github/home/miniconda/envs/build_binary/lib/libcudart.so -lcudadevrt -lcudart_static -ldl -Wl,-rpath-link,/__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build && : 2025-05-07T20:02:42.4093354Z [598/608] : && /github/home/miniconda/envs/build_binary/bin/c++ -fPIC -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/lib -fopenmp=libgomp -stdlib=libstdc++ -I/github/home/miniconda/envs/build_binary/include -O3 -DNDEBUG -Wl,-O2 -Wl,--sort-common -Wl,--as-needed -Wl,-z,relro -Wl,-z,now -Wl,--disable-new-dtags -Wl,--gc-sections -Wl,--allow-shlib-undefined -Wl,-rpath,/github/home/miniconda/envs/build_binary/lib -Wl,-rpath-link,/github/home/miniconda/envs/build_binary/lib -L/github/home/miniconda/envs/build_binary/lib 
-s -shared -Wl,-soname,fbgemm_gpu_tbe_training_backward_dense.so -o fbgemm_gpu_tbe_training_backward_dense.so CMakeFiles/fbgemm_gpu_tbe_training_backward_dense.dir/gen_embedding_backward_dense_split_weighted_meta.cpp.o CMakeFiles/fbgemm_gpu_tbe_training_backward_dense.dir/gen_embedding_backward_dense_split_unweighted_nobag_meta.cpp.o CMakeFiles/fbgemm_gpu_tbe_training_backward_dense.dir/gen_embedding_backward_dense_split_unweighted_meta.cpp.o CMakeFiles/fbgemm_gpu_tbe_training_backward_dense.dir/gen_embedding_backward_split_dense.cpp.o CMakeFiles/fbgemm_gpu_tbe_training_backward_dense.dir/gen_embedding_backward_dense_split_weighted_cuda.cu.o CMakeFiles/fbgemm_gpu_tbe_training_backward_dense.dir/gen_embedding_backward_dense_split_weighted_kernel_cta.cu.o CMakeFiles/fbgemm_gpu_tbe_training_backward_dense.dir/gen_embedding_backward_dense_split_weighted_kernel_warp.cu.o CMakeFiles/fbgemm_gpu_tbe_training_backward_dense.dir/gen_embedding_backward_dense_split_unweighted_nobag_cuda.cu.o CMakeFiles/fbgemm_gpu_tbe_training_backward_dense.dir/gen_embedding_backward_dense_split_unweighted_nobag_kernel_cta.cu.o CMakeFiles/fbgemm_gpu_tbe_training_backward_dense.dir/gen_embedding_backward_dense_split_unweighted_nobag_kernel_warp.cu.o CMakeFiles/fbgemm_gpu_tbe_training_backward_dense.dir/gen_embedding_backward_dense_split_unweighted_cuda.cu.o CMakeFiles/fbgemm_gpu_tbe_training_backward_dense.dir/gen_embedding_backward_dense_split_unweighted_kernel_cta.cu.o CMakeFiles/fbgemm_gpu_tbe_training_backward_dense.dir/gen_embedding_backward_dense_split_unweighted_kernel_warp.cu.o -L/lib/intel64 -L/lib/intel64_win -L/lib/win-x64 -L/github/home/miniconda/envs/build_binary/lib/stubs -Wl,-rpath,"\$ORIGIN" /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/lib/libc10.so /github/home/miniconda/envs/build_binary/lib/libnvrtc.so /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/lib/libc10_cuda.so 
/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/lib/libnccl.so.2 /github/home/miniconda/envs/build_binary/lib/stubs/libcuda.so fbgemm_gpu_tbe_training_backward.so /github/home/miniconda/envs/build_binary/lib/stubs/libnvidia-ml.so -Wl,--no-as-needed,"/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/lib/libtorch.so" -Wl,--as-needed /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/lib/libtorch.so -Wl,--no-as-needed,"/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/lib/libtorch_cpu.so" -Wl,--as-needed -Wl,--no-as-needed,"/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/lib/libtorch_cuda.so" -Wl,--as-needed /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/lib/libc10_cuda.so /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/lib/libc10.so /github/home/miniconda/envs/build_binary/lib/libcudart.so -lcudadevrt -lcudart_static -ldl -Wl,-rpath-link,/__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build && : 2025-05-07T20:02:42.4757468Z [599/608] cd /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build && bash /__w/FBGEMM/FBGEMM/fbgemm_gpu/../.github/scripts/fbgemm_gpu_postbuild.bash /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/fbgemm_gpu_tbe_training_backward.so 1 2025-05-07T20:02:42.4758752Z ################################################################################ 2025-05-07T20:02:42.4759128Z [CMAKE] Running post-build script ... 2025-05-07T20:02:42.4759938Z Target file: /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/fbgemm_gpu_tbe_training_backward.so 2025-05-07T20:02:42.4760562Z Resetting RPATH to $ORIGIN ... 
2025-05-07T20:02:42.4760938Z 0x000000000000000f (RPATH) Library rpath: [$ORIGIN] 2025-05-07T20:02:42.4761416Z ################################################################################ 2025-05-07T20:02:42.5620078Z [600/608] cd /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build && bash /__w/FBGEMM/FBGEMM/fbgemm_gpu/../.github/scripts/fbgemm_gpu_postbuild.bash /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/fbgemm_gpu_tbe_training_backward_dense.so 1 2025-05-07T20:02:42.5621518Z ################################################################################ 2025-05-07T20:02:42.5621888Z [CMAKE] Running post-build script ... 2025-05-07T20:02:42.5622533Z Target file: /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/fbgemm_gpu_tbe_training_backward_dense.so 2025-05-07T20:02:42.5623176Z Resetting RPATH to $ORIGIN ... 2025-05-07T20:02:42.5623553Z 0x000000000000000f (RPATH) Library rpath: [$ORIGIN] 2025-05-07T20:02:42.5623955Z ################################################################################ 2025-05-07T20:02:42.5982651Z [601/608] : && /github/home/miniconda/envs/build_binary/bin/c++ -fPIC -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/lib -fopenmp=libgomp -stdlib=libstdc++ -I/github/home/miniconda/envs/build_binary/include -O3 -DNDEBUG -Wl,-O2 -Wl,--sort-common -Wl,--as-needed -Wl,-z,relro -Wl,-z,now -Wl,--disable-new-dtags -Wl,--gc-sections -Wl,--allow-shlib-undefined -Wl,-rpath,/github/home/miniconda/envs/build_binary/lib -Wl,-rpath-link,/github/home/miniconda/envs/build_binary/lib -L/github/home/miniconda/envs/build_binary/lib -s -shared -Wl,-soname,fbgemm_gpu_tbe_training_backward_vbe.so -o fbgemm_gpu_tbe_training_backward_vbe.so CMakeFiles/fbgemm_gpu_tbe_training_backward_vbe.dir/gen_embedding_backward_rowwise_adagrad_split_weighted_vbe_meta.cpp.o 
CMakeFiles/fbgemm_gpu_tbe_training_backward_vbe.dir/gen_embedding_backward_rowwise_adagrad_ssd_weighted_vbe_meta.cpp.o CMakeFiles/fbgemm_gpu_tbe_training_backward_vbe.dir/gen_embedding_backward_rowwise_adagrad_split_unweighted_vbe_meta.cpp.o CMakeFiles/fbgemm_gpu_tbe_training_backward_vbe.dir/gen_embedding_backward_rowwise_adagrad_ssd_unweighted_vbe_meta.cpp.o CMakeFiles/fbgemm_gpu_tbe_training_backward_vbe.dir/gen_embedding_backward_rowwise_adagrad_with_counter_split_weighted_vbe_meta.cpp.o CMakeFiles/fbgemm_gpu_tbe_training_backward_vbe.dir/gen_embedding_backward_rowwise_adagrad_with_counter_split_unweighted_vbe_meta.cpp.o CMakeFiles/fbgemm_gpu_tbe_training_backward_vbe.dir/gen_embedding_backward_sgd_split_weighted_vbe_meta.cpp.o CMakeFiles/fbgemm_gpu_tbe_training_backward_vbe.dir/gen_embedding_backward_sgd_split_unweighted_vbe_meta.cpp.o CMakeFiles/fbgemm_gpu_tbe_training_backward_vbe.dir/gen_embedding_backward_dense_split_weighted_vbe_meta.cpp.o CMakeFiles/fbgemm_gpu_tbe_training_backward_vbe.dir/gen_embedding_backward_dense_split_unweighted_vbe_meta.cpp.o CMakeFiles/fbgemm_gpu_tbe_training_backward_vbe.dir/gen_embedding_backward_adam_split_weighted_vbe_meta.cpp.o CMakeFiles/fbgemm_gpu_tbe_training_backward_vbe.dir/gen_embedding_backward_adam_split_unweighted_vbe_meta.cpp.o CMakeFiles/fbgemm_gpu_tbe_training_backward_vbe.dir/gen_embedding_backward_rowwise_adagrad_split_weighted_vbe_cuda.cu.o CMakeFiles/fbgemm_gpu_tbe_training_backward_vbe.dir/gen_embedding_backward_rowwise_adagrad_split_weighted_vbe_kernel_cta.cu.o CMakeFiles/fbgemm_gpu_tbe_training_backward_vbe.dir/gen_embedding_backward_rowwise_adagrad_split_weighted_vbe_kernel_warp.cu.o CMakeFiles/fbgemm_gpu_tbe_training_backward_vbe.dir/gen_embedding_backward_rowwise_adagrad_ssd_weighted_vbe_cuda.cu.o CMakeFiles/fbgemm_gpu_tbe_training_backward_vbe.dir/gen_embedding_backward_rowwise_adagrad_ssd_weighted_vbe_kernel_cta.cu.o 
CMakeFiles/fbgemm_gpu_tbe_training_backward_vbe.dir/gen_embedding_backward_rowwise_adagrad_ssd_weighted_vbe_kernel_warp.cu.o CMakeFiles/fbgemm_gpu_tbe_training_backward_vbe.dir/gen_embedding_backward_rowwise_adagrad_split_unweighted_vbe_cuda.cu.o CMakeFiles/fbgemm_gpu_tbe_training_backward_vbe.dir/gen_embedding_backward_rowwise_adagrad_split_unweighted_vbe_kernel_cta.cu.o CMakeFiles/fbgemm_gpu_tbe_training_backward_vbe.dir/gen_embedding_backward_rowwise_adagrad_split_unweighted_vbe_kernel_warp.cu.o CMakeFiles/fbgemm_gpu_tbe_training_backward_vbe.dir/gen_embedding_backward_rowwise_adagrad_ssd_unweighted_vbe_cuda.cu.o CMakeFiles/fbgemm_gpu_tbe_training_backward_vbe.dir/gen_embedding_backward_rowwise_adagrad_ssd_unweighted_vbe_kernel_cta.cu.o CMakeFiles/fbgemm_gpu_tbe_training_backward_vbe.dir/gen_embedding_backward_rowwise_adagrad_ssd_unweighted_vbe_kernel_warp.cu.o CMakeFiles/fbgemm_gpu_tbe_training_backward_vbe.dir/gen_embedding_backward_rowwise_adagrad_with_counter_split_weighted_vbe_cuda.cu.o CMakeFiles/fbgemm_gpu_tbe_training_backward_vbe.dir/gen_embedding_backward_rowwise_adagrad_with_counter_split_weighted_vbe_kernel_cta.cu.o CMakeFiles/fbgemm_gpu_tbe_training_backward_vbe.dir/gen_embedding_backward_rowwise_adagrad_with_counter_split_weighted_vbe_kernel_warp.cu.o CMakeFiles/fbgemm_gpu_tbe_training_backward_vbe.dir/gen_embedding_backward_rowwise_adagrad_with_counter_split_unweighted_vbe_cuda.cu.o CMakeFiles/fbgemm_gpu_tbe_training_backward_vbe.dir/gen_embedding_backward_rowwise_adagrad_with_counter_split_unweighted_vbe_kernel_cta.cu.o CMakeFiles/fbgemm_gpu_tbe_training_backward_vbe.dir/gen_embedding_backward_rowwise_adagrad_with_counter_split_unweighted_vbe_kernel_warp.cu.o CMakeFiles/fbgemm_gpu_tbe_training_backward_vbe.dir/gen_embedding_backward_sgd_split_weighted_vbe_cuda.cu.o CMakeFiles/fbgemm_gpu_tbe_training_backward_vbe.dir/gen_embedding_backward_sgd_split_weighted_vbe_kernel_cta.cu.o 
CMakeFiles/fbgemm_gpu_tbe_training_backward_vbe.dir/gen_embedding_backward_sgd_split_weighted_vbe_kernel_warp.cu.o CMakeFiles/fbgemm_gpu_tbe_training_backward_vbe.dir/gen_embedding_backward_sgd_split_unweighted_vbe_cuda.cu.o CMakeFiles/fbgemm_gpu_tbe_training_backward_vbe.dir/gen_embedding_backward_sgd_split_unweighted_vbe_kernel_cta.cu.o CMakeFiles/fbgemm_gpu_tbe_training_backward_vbe.dir/gen_embedding_backward_sgd_split_unweighted_vbe_kernel_warp.cu.o CMakeFiles/fbgemm_gpu_tbe_training_backward_vbe.dir/gen_embedding_backward_dense_split_weighted_vbe_cuda.cu.o CMakeFiles/fbgemm_gpu_tbe_training_backward_vbe.dir/gen_embedding_backward_dense_split_weighted_vbe_kernel_cta.cu.o CMakeFiles/fbgemm_gpu_tbe_training_backward_vbe.dir/gen_embedding_backward_dense_split_weighted_vbe_kernel_warp.cu.o CMakeFiles/fbgemm_gpu_tbe_training_backward_vbe.dir/gen_embedding_backward_dense_split_unweighted_vbe_cuda.cu.o CMakeFiles/fbgemm_gpu_tbe_training_backward_vbe.dir/gen_embedding_backward_dense_split_unweighted_vbe_kernel_cta.cu.o CMakeFiles/fbgemm_gpu_tbe_training_backward_vbe.dir/gen_embedding_backward_dense_split_unweighted_vbe_kernel_warp.cu.o CMakeFiles/fbgemm_gpu_tbe_training_backward_vbe.dir/gen_embedding_backward_adam_split_weighted_vbe_cuda.cu.o CMakeFiles/fbgemm_gpu_tbe_training_backward_vbe.dir/gen_embedding_backward_adam_split_weighted_vbe_kernel_cta.cu.o CMakeFiles/fbgemm_gpu_tbe_training_backward_vbe.dir/gen_embedding_backward_adam_split_weighted_vbe_kernel_warp.cu.o CMakeFiles/fbgemm_gpu_tbe_training_backward_vbe.dir/gen_embedding_backward_adam_split_unweighted_vbe_cuda.cu.o CMakeFiles/fbgemm_gpu_tbe_training_backward_vbe.dir/gen_embedding_backward_adam_split_unweighted_vbe_kernel_cta.cu.o CMakeFiles/fbgemm_gpu_tbe_training_backward_vbe.dir/gen_embedding_backward_adam_split_unweighted_vbe_kernel_warp.cu.o -L/lib/intel64 -L/lib/intel64_win -L/lib/win-x64 -L/github/home/miniconda/envs/build_binary/lib/stubs -Wl,-rpath,"\$ORIGIN" 
/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/lib/libc10.so /github/home/miniconda/envs/build_binary/lib/libnvrtc.so /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/lib/libc10_cuda.so /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/lib/libnccl.so.2 /github/home/miniconda/envs/build_binary/lib/stubs/libcuda.so fbgemm_gpu_tbe_training_backward.so /github/home/miniconda/envs/build_binary/lib/stubs/libnvidia-ml.so -Wl,--no-as-needed,"/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/lib/libtorch.so" -Wl,--as-needed /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/lib/libtorch.so -Wl,--no-as-needed,"/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/lib/libtorch_cpu.so" -Wl,--as-needed -Wl,--no-as-needed,"/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/lib/libtorch_cuda.so" -Wl,--as-needed /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/lib/libc10_cuda.so /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/lib/libc10.so /github/home/miniconda/envs/build_binary/lib/libcudart.so -lcudadevrt -lcudart_static -ldl -Wl,-rpath-link,/__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build && : 2025-05-07T20:02:46.2350467Z [602/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_py_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include 
-I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++17 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_py.dir/src/intraining_embedding_pruning_ops/intraining_embedding_pruning.cu.o -MF CMakeFiles/fbgemm_gpu_py.dir/src/intraining_embedding_pruning_ops/intraining_embedding_pruning.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/src/intraining_embedding_pruning_ops/intraining_embedding_pruning.cu -o CMakeFiles/fbgemm_gpu_py.dir/src/intraining_embedding_pruning_ops/intraining_embedding_pruning.cu.o 2025-05-07T20:02:46.4159949Z [603/608] cd /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build && bash 
/__w/FBGEMM/FBGEMM/fbgemm_gpu/../.github/scripts/fbgemm_gpu_postbuild.bash /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/fbgemm_gpu_tbe_training_backward_vbe.so 1 2025-05-07T20:02:46.4161403Z ################################################################################ 2025-05-07T20:02:46.4161770Z [CMAKE] Running post-build script ... 2025-05-07T20:02:46.4162628Z Target file: /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/fbgemm_gpu_tbe_training_backward_vbe.so 2025-05-07T20:02:46.4163392Z Resetting RPATH to $ORIGIN ... 2025-05-07T20:02:46.4163753Z 0x000000000000000f (RPATH) Library rpath: [$ORIGIN] 2025-05-07T20:02:46.4164208Z ################################################################################ 2025-05-07T20:02:47.0105663Z [604/608] : && /github/home/miniconda/envs/build_binary/bin/c++ -fPIC -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/lib -fopenmp=libgomp -stdlib=libstdc++ -I/github/home/miniconda/envs/build_binary/include -O3 -DNDEBUG -Wl,-O2 -Wl,--sort-common -Wl,--as-needed -Wl,-z,relro -Wl,-z,now -Wl,--disable-new-dtags -Wl,--gc-sections -Wl,--allow-shlib-undefined -Wl,-rpath,/github/home/miniconda/envs/build_binary/lib -Wl,-rpath-link,/github/home/miniconda/envs/build_binary/lib -L/github/home/miniconda/envs/build_binary/lib -s -shared -Wl,-soname,fbgemm_gpu_py.so -o fbgemm_gpu_py.so CMakeFiles/fbgemm_gpu_py.dir/src/memory_utils/memory_utils.cpp.o CMakeFiles/fbgemm_gpu_py.dir/src/memory_utils/memory_utils_ops.cpp.o CMakeFiles/fbgemm_gpu_py.dir/src/merge_pooled_embedding_ops/merge_pooled_embedding_ops_cpu.cpp.o CMakeFiles/fbgemm_gpu_py.dir/src/permute_multi_embedding_ops/permute_multi_embedding_function.cpp.o CMakeFiles/fbgemm_gpu_py.dir/src/permute_multi_embedding_ops/permute_multi_embedding_ops_cpu.cpp.o CMakeFiles/fbgemm_gpu_py.dir/src/permute_pooled_embedding_ops/permute_pooled_embedding_function.cpp.o 
CMakeFiles/fbgemm_gpu_py.dir/src/permute_pooled_embedding_ops/permute_pooled_embedding_ops_cpu.cpp.o CMakeFiles/fbgemm_gpu_py.dir/src/permute_pooled_embedding_ops/permute_pooled_embedding_ops_split_cpu.cpp.o CMakeFiles/fbgemm_gpu_py.dir/src/jagged_tensor_ops/jagged_tensor_ops_autograd.cpp.o CMakeFiles/fbgemm_gpu_py.dir/src/jagged_tensor_ops/jagged_tensor_ops_meta.cpp.o CMakeFiles/fbgemm_gpu_py.dir/src/jagged_tensor_ops/jagged_tensor_ops_cpu.cpp.o CMakeFiles/fbgemm_gpu_py.dir/src/input_combine_ops/input_combine_cpu.cpp.o CMakeFiles/fbgemm_gpu_py.dir/src/layout_transform_ops/layout_transform_ops_cpu.cpp.o CMakeFiles/fbgemm_gpu_py.dir/src/quantize_ops/quantize_ops_cpu.cpp.o CMakeFiles/fbgemm_gpu_py.dir/src/quantize_ops/quantize_ops_meta.cpp.o CMakeFiles/fbgemm_gpu_py.dir/src/sparse_ops/sparse_async_batched_cumsum.cpp.o CMakeFiles/fbgemm_gpu_py.dir/src/sparse_ops/sparse_ops_cpu.cpp.o CMakeFiles/fbgemm_gpu_py.dir/src/sparse_ops/sparse_ops_meta.cpp.o CMakeFiles/fbgemm_gpu_py.dir/src/tbe/eeg/eeg_models.cpp.o CMakeFiles/fbgemm_gpu_py.dir/src/tbe/eeg/eeg_utils.cpp.o CMakeFiles/fbgemm_gpu_py.dir/src/tbe/eeg/indices_estimator_ops.cpp.o CMakeFiles/fbgemm_gpu_py.dir/src/tbe/eeg/indices_estimator.cpp.o CMakeFiles/fbgemm_gpu_py.dir/src/tbe/eeg/indices_generator_ops.cpp.o CMakeFiles/fbgemm_gpu_py.dir/src/tbe/eeg/indices_generator.cpp.o CMakeFiles/fbgemm_gpu_py.dir/src/intraining_embedding_pruning_ops/intraining_embedding_pruning_gpu.cpp.o CMakeFiles/fbgemm_gpu_py.dir/src/layout_transform_ops/layout_transform_ops_gpu.cpp.o CMakeFiles/fbgemm_gpu_py.dir/src/permute_pooled_embedding_ops/permute_pooled_embedding_ops_gpu.cpp.o CMakeFiles/fbgemm_gpu_py.dir/src/permute_pooled_embedding_ops/permute_pooled_embedding_ops_split_gpu.cpp.o CMakeFiles/fbgemm_gpu_py.dir/src/quantize_ops/quantize_ops_gpu.cpp.o CMakeFiles/fbgemm_gpu_py.dir/src/sparse_ops/sparse_ops_gpu.cpp.o CMakeFiles/fbgemm_gpu_py.dir/src/metric_ops/metric_ops_host.cpp.o 
CMakeFiles/fbgemm_gpu_py.dir/src/input_combine_ops/input_combine_gpu.cpp.o CMakeFiles/fbgemm_gpu_py.dir/src/merge_pooled_embedding_ops/merge_pooled_embedding_ops_gpu.cpp.o CMakeFiles/fbgemm_gpu_py.dir/src/topology_utils.cpp.o CMakeFiles/fbgemm_gpu_py.dir/src/histogram_binning_calibration_ops.cu.o CMakeFiles/fbgemm_gpu_py.dir/src/input_combine_ops/input_combine.cu.o CMakeFiles/fbgemm_gpu_py.dir/src/intraining_embedding_pruning_ops/intraining_embedding_pruning.cu.o CMakeFiles/fbgemm_gpu_py.dir/src/memory_utils/memory_utils.cu.o CMakeFiles/fbgemm_gpu_py.dir/src/memory_utils/memory_utils_ops.cu.o CMakeFiles/fbgemm_gpu_py.dir/src/jagged_tensor_ops/batched_dense_vec_jagged_2d_mul_backward.cu.o CMakeFiles/fbgemm_gpu_py.dir/src/jagged_tensor_ops/batched_dense_vec_jagged_2d_mul_forward.cu.o CMakeFiles/fbgemm_gpu_py.dir/src/jagged_tensor_ops/dense_to_jagged_forward.cu.o CMakeFiles/fbgemm_gpu_py.dir/src/jagged_tensor_ops/jagged_dense_bmm_forward.cu.o CMakeFiles/fbgemm_gpu_py.dir/src/jagged_tensor_ops/jagged_dense_dense_elementwise_add_jagged_output_forward.cu.o CMakeFiles/fbgemm_gpu_py.dir/src/jagged_tensor_ops/jagged_dense_elementwise_mul_backward.cu.o CMakeFiles/fbgemm_gpu_py.dir/src/jagged_tensor_ops/jagged_dense_elementwise_mul_forward.cu.o CMakeFiles/fbgemm_gpu_py.dir/src/jagged_tensor_ops/jagged_index_add_2d_forward.cu.o CMakeFiles/fbgemm_gpu_py.dir/src/jagged_tensor_ops/jagged_index_select_2d_forward.cu.o CMakeFiles/fbgemm_gpu_py.dir/src/jagged_tensor_ops/jagged_jagged_bmm_forward.cu.o CMakeFiles/fbgemm_gpu_py.dir/src/jagged_tensor_ops/jagged_softmax_backward.cu.o CMakeFiles/fbgemm_gpu_py.dir/src/jagged_tensor_ops/jagged_softmax_forward.cu.o CMakeFiles/fbgemm_gpu_py.dir/src/jagged_tensor_ops/jagged_tensor_ops.cu.o CMakeFiles/fbgemm_gpu_py.dir/src/jagged_tensor_ops/jagged_to_padded_dense_backward.cu.o CMakeFiles/fbgemm_gpu_py.dir/src/jagged_tensor_ops/jagged_to_padded_dense_forward.cu.o CMakeFiles/fbgemm_gpu_py.dir/src/jagged_tensor_ops/jagged_unique_indices.cu.o 
CMakeFiles/fbgemm_gpu_py.dir/src/jagged_tensor_ops/keyed_jagged_index_select_dim1.cu.o CMakeFiles/fbgemm_gpu_py.dir/src/layout_transform_ops/layout_transform_ops.cu.o CMakeFiles/fbgemm_gpu_py.dir/src/metric_ops/metric_ops.cu.o CMakeFiles/fbgemm_gpu_py.dir/src/permute_pooled_embedding_ops/permute_pooled_embedding_ops_split.cu.o CMakeFiles/fbgemm_gpu_py.dir/src/permute_pooled_embedding_ops/permute_pooled_embedding_ops.cu.o CMakeFiles/fbgemm_gpu_py.dir/src/permute_multi_embedding_ops/permute_multi_embedding_ops.cu.o CMakeFiles/fbgemm_gpu_py.dir/src/quantize_ops/quantize_bfloat16.cu.o CMakeFiles/fbgemm_gpu_py.dir/src/quantize_ops/quantize_fp8_rowwise.cu.o CMakeFiles/fbgemm_gpu_py.dir/src/quantize_ops/quantize_fused_8bit_rowwise.cu.o CMakeFiles/fbgemm_gpu_py.dir/src/quantize_ops/quantize_fused_nbit_rowwise.cu.o CMakeFiles/fbgemm_gpu_py.dir/src/quantize_ops/quantize_hfp8.cu.o CMakeFiles/fbgemm_gpu_py.dir/src/quantize_ops/quantize_msfp.cu.o CMakeFiles/fbgemm_gpu_py.dir/src/quantize_ops/quantize_padded_fp8_rowwise.cu.o CMakeFiles/fbgemm_gpu_py.dir/src/quantize_ops/quantize_mx.cu.o CMakeFiles/fbgemm_gpu_py.dir/src/sparse_ops/sparse_async_batched_cumsum.cu.o CMakeFiles/fbgemm_gpu_py.dir/src/sparse_ops/sparse_block_bucketize_features.cu.o CMakeFiles/fbgemm_gpu_py.dir/src/sparse_ops/sparse_bucketize_features.cu.o CMakeFiles/fbgemm_gpu_py.dir/src/sparse_ops/sparse_batched_unary_embeddings.cu.o CMakeFiles/fbgemm_gpu_py.dir/src/sparse_ops/sparse_compute_frequency_sequence.cu.o CMakeFiles/fbgemm_gpu_py.dir/src/sparse_ops/sparse_expand_into_jagged_permute.cu.o CMakeFiles/fbgemm_gpu_py.dir/src/sparse_ops/sparse_group_index.cu.o CMakeFiles/fbgemm_gpu_py.dir/src/sparse_ops/sparse_index_add.cu.o CMakeFiles/fbgemm_gpu_py.dir/src/sparse_ops/sparse_index_select.cu.o CMakeFiles/fbgemm_gpu_py.dir/src/sparse_ops/sparse_invert_permute.cu.o CMakeFiles/fbgemm_gpu_py.dir/src/sparse_ops/sparse_pack_segments_backward.cu.o 
CMakeFiles/fbgemm_gpu_py.dir/src/sparse_ops/sparse_pack_segments_forward.cu.o CMakeFiles/fbgemm_gpu_py.dir/src/sparse_ops/sparse_permute_1d.cu.o CMakeFiles/fbgemm_gpu_py.dir/src/sparse_ops/sparse_permute_2d.cu.o CMakeFiles/fbgemm_gpu_py.dir/src/sparse_ops/sparse_permute102.cu.o CMakeFiles/fbgemm_gpu_py.dir/src/sparse_ops/sparse_permute_embeddings.cu.o CMakeFiles/fbgemm_gpu_py.dir/src/sparse_ops/sparse_range.cu.o CMakeFiles/fbgemm_gpu_py.dir/src/sparse_ops/sparse_reorder_batched_ad.cu.o CMakeFiles/fbgemm_gpu_py.dir/src/sparse_ops/sparse_segment_sum_csr.cu.o CMakeFiles/fbgemm_gpu_py.dir/src/sparse_ops/sparse_zipf.cu.o -L/lib/intel64 -L/lib/intel64_win -L/lib/win-x64 -L/github/home/miniconda/envs/build_binary/lib/stubs -Wl,-rpath,"\$ORIGIN" /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/lib/libc10.so /github/home/miniconda/envs/build_binary/lib/libnvrtc.so /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/lib/libc10_cuda.so /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/lib/libnccl.so.2 /github/home/miniconda/envs/build_binary/lib/stubs/libcuda.so fbgemm.so fbgemm_gpu_embedding_inplace_ops.so fbgemm_gpu_tbe_index_select.so fbgemm_gpu_tbe_cache.so fbgemm_gpu_tbe_optimizers.so /github/home/miniconda/envs/build_binary/lib/stubs/libnvidia-ml.so -Wl,--no-as-needed,"/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/lib/libtorch.so" -Wl,--as-needed fbgemm_gpu_sparse_async_cumsum.so fbgemm_gpu_tbe_utils.so /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/lib/libtorch.so -Wl,--no-as-needed,"/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/lib/libtorch_cpu.so" -Wl,--as-needed -Wl,--no-as-needed,"/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/lib/libtorch_cuda.so" -Wl,--as-needed /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/lib/libc10_cuda.so 
/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/lib/libc10.so /github/home/miniconda/envs/build_binary/lib/libcudart.so -lcudadevrt -lcudart_static -ldl -Wl,-rpath-link,/__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build && : 2025-05-07T20:02:47.0453363Z [605/608] : && /github/home/miniconda/envs/build_binary/bin/c++ -fPIC -Wl,-O2 -Wl,--sort-common -Wl,--as-needed -Wl,-z,relro -Wl,-z,now -Wl,--disable-new-dtags -Wl,--gc-sections -Wl,--allow-shlib-undefined -Wl,-rpath,/github/home/miniconda/envs/build_binary/lib -Wl,-rpath-link,/github/home/miniconda/envs/build_binary/lib -L/github/home/miniconda/envs/build_binary/lib -s -shared -Wl,-soname,fbgemm_gpu_tbe_training_backward_gwd.so -o fbgemm_gpu_tbe_training_backward_gwd.so CMakeFiles/fbgemm_gpu_tbe_training_backward_gwd.dir/gen_embedding_backward_rowwise_adagrad_split_weighted_gwd_cuda.cu.o CMakeFiles/fbgemm_gpu_tbe_training_backward_gwd.dir/gen_embedding_backward_rowwise_adagrad_split_weighted_gwd_kernel_cta.cu.o CMakeFiles/fbgemm_gpu_tbe_training_backward_gwd.dir/gen_embedding_backward_rowwise_adagrad_split_weighted_gwd_kernel_warp.cu.o CMakeFiles/fbgemm_gpu_tbe_training_backward_gwd.dir/gen_embedding_backward_rowwise_adagrad_split_unweighted_gwd_cuda.cu.o CMakeFiles/fbgemm_gpu_tbe_training_backward_gwd.dir/gen_embedding_backward_rowwise_adagrad_split_unweighted_gwd_kernel_cta.cu.o CMakeFiles/fbgemm_gpu_tbe_training_backward_gwd.dir/gen_embedding_backward_rowwise_adagrad_split_unweighted_gwd_kernel_warp.cu.o CMakeFiles/fbgemm_gpu_tbe_training_backward_gwd.dir/gen_embedding_backward_rowwise_adagrad_split_weighted_vbe_gwd_cuda.cu.o CMakeFiles/fbgemm_gpu_tbe_training_backward_gwd.dir/gen_embedding_backward_rowwise_adagrad_split_weighted_vbe_gwd_kernel_cta.cu.o CMakeFiles/fbgemm_gpu_tbe_training_backward_gwd.dir/gen_embedding_backward_rowwise_adagrad_split_weighted_vbe_gwd_kernel_warp.cu.o 
CMakeFiles/fbgemm_gpu_tbe_training_backward_gwd.dir/gen_embedding_backward_rowwise_adagrad_split_unweighted_vbe_gwd_cuda.cu.o CMakeFiles/fbgemm_gpu_tbe_training_backward_gwd.dir/gen_embedding_backward_rowwise_adagrad_split_unweighted_vbe_gwd_kernel_cta.cu.o CMakeFiles/fbgemm_gpu_tbe_training_backward_gwd.dir/gen_embedding_backward_rowwise_adagrad_split_unweighted_vbe_gwd_kernel_warp.cu.o -L/lib/intel64 -L/lib/intel64_win -L/lib/win-x64 -Wl,-rpath,"\$ORIGIN" /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/lib/libc10.so /github/home/miniconda/envs/build_binary/lib/libnvrtc.so /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/lib/libc10_cuda.so /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/lib/libnccl.so.2 /github/home/miniconda/envs/build_binary/lib/stubs/libcuda.so fbgemm_gpu_tbe_training_backward.so /github/home/miniconda/envs/build_binary/lib/stubs/libnvidia-ml.so -Wl,--no-as-needed,"/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/lib/libtorch.so" -Wl,--as-needed /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/lib/libtorch.so -Wl,--no-as-needed,"/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/lib/libtorch_cpu.so" -Wl,--as-needed -Wl,--no-as-needed,"/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/lib/libtorch_cuda.so" -Wl,--as-needed /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/lib/libc10_cuda.so /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/lib/libc10.so /github/home/miniconda/envs/build_binary/lib/libcudart.so -lcudadevrt -lcudart_static -lrt -lpthread -ldl -Wl,-rpath-link,/__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build -L"/github/home/miniconda/envs/build_binary/lib/stubs" -L"/github/home/miniconda/envs/build_binary/lib" && : 2025-05-07T20:02:47.5498092Z [606/608] cd 
/__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build && bash /__w/FBGEMM/FBGEMM/fbgemm_gpu/../.github/scripts/fbgemm_gpu_postbuild.bash /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/fbgemm_gpu_py.so 1 2025-05-07T20:02:47.5501638Z ################################################################################ 2025-05-07T20:02:47.5503078Z [CMAKE] Running post-build script ... 2025-05-07T20:02:47.5504189Z Target file: /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/fbgemm_gpu_py.so 2025-05-07T20:02:47.5504755Z Resetting RPATH to $ORIGIN ... 2025-05-07T20:02:47.5505164Z 0x000000000000000f (RPATH) Library rpath: [$ORIGIN] 2025-05-07T20:02:47.5505588Z ################################################################################ 2025-05-07T20:02:47.9718899Z [607/608] cd /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build && bash /__w/FBGEMM/FBGEMM/fbgemm_gpu/../.github/scripts/fbgemm_gpu_postbuild.bash /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/fbgemm_gpu_tbe_training_backward_gwd.so 1 2025-05-07T20:02:47.9720216Z ################################################################################ 2025-05-07T20:02:47.9720563Z [CMAKE] Running post-build script ... 2025-05-07T20:02:47.9721320Z Target file: /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/fbgemm_gpu_tbe_training_backward_gwd.so 2025-05-07T20:02:47.9721941Z Resetting RPATH to $ORIGIN ... 
2025-05-07T20:02:47.9722292Z 0x000000000000000f (RPATH) Library rpath: [$ORIGIN] 2025-05-07T20:02:47.9722699Z ################################################################################ 2025-05-07T20:02:47.9723622Z [607/608] cd /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build && /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/cmake/data/bin/cmake -P cmake_install.cmake 2025-05-07T20:02:47.9767399Z -- Install configuration: "Release" 2025-05-07T20:02:47.9768683Z -- Installing: /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-install/fbgemm_gpu/asmjit.so 2025-05-07T20:02:48.0014600Z -- Installing: /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-install/fbgemm_gpu/fbgemm.so 2025-05-07T20:02:48.0017334Z -- Installing: /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-install/fbgemm_gpu/fbgemm_gpu_tbe_cache.so 2025-05-07T20:02:48.0035982Z -- Installing: /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-install/fbgemm_gpu/fbgemm_gpu_tbe_inference.so 2025-05-07T20:02:48.0038882Z -- Installing: /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-install/fbgemm_gpu/fbgemm_gpu_config.so 2025-05-07T20:02:48.0063376Z -- Installing: /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-install/fbgemm_gpu/fbgemm_gpu_tbe_utils.so 2025-05-07T20:02:48.0087803Z -- Installing: /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-install/fbgemm_gpu/fbgemm_gpu_sparse_async_cumsum.so 2025-05-07T20:02:48.0091178Z -- Installing: /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-install/fbgemm_gpu/fbgemm_gpu_tbe_common.so 2025-05-07T20:02:48.0093967Z -- Installing: /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-install/fbgemm_gpu/fbgemm_gpu_tbe_optimizers.so 2025-05-07T20:02:48.0119463Z -- Installing: /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-install/fbgemm_gpu/fbgemm_gpu_tbe_training_forward.so 
2025-05-07T20:02:48.0122668Z -- Installing: /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-install/fbgemm_gpu/fbgemm_gpu_tbe_training_backward_pt2.so 2025-05-07T20:02:48.0124106Z -- Installing: /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-install/fbgemm_gpu/fbgemm_gpu_tbe_training_backward.so 2025-05-07T20:02:48.0125160Z -- Installing: /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-install/fbgemm_gpu/fbgemm_gpu_tbe_training_backward_gwd.so 2025-05-07T20:02:48.0126356Z -- Installing: /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-install/fbgemm_gpu/fbgemm_gpu_tbe_training_backward_vbe.so 2025-05-07T20:02:48.0127414Z -- Installing: /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-install/fbgemm_gpu/fbgemm_gpu_tbe_training_backward_dense.so 2025-05-07T20:02:48.0128522Z -- Installing: /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-install/fbgemm_gpu/fbgemm_gpu_tbe_training_backward_split_host.so 2025-05-07T20:02:48.0129579Z -- Installing: /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-install/fbgemm_gpu/fbgemm_gpu_tbe_index_select.so 2025-05-07T20:02:48.0130685Z -- Installing: /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-install/fbgemm_gpu/split_embedding_codegen_lookup_invokers/lookup_adagrad.py 2025-05-07T20:02:48.0131917Z -- Installing: /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-install/fbgemm_gpu/split_embedding_codegen_lookup_invokers/lookup_rowwise_adagrad.py 2025-05-07T20:02:48.0133152Z -- Installing: /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-install/fbgemm_gpu/split_embedding_codegen_lookup_invokers/lookup_sgd.py 2025-05-07T20:02:48.0134317Z -- Installing: /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-install/fbgemm_gpu/split_embedding_codegen_lookup_invokers/lookup_adam.py 2025-05-07T20:02:48.0135525Z -- Installing: 
/__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-install/fbgemm_gpu/split_embedding_codegen_lookup_invokers/lookup_lamb.py 2025-05-07T20:02:48.0136721Z -- Installing: /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-install/fbgemm_gpu/split_embedding_codegen_lookup_invokers/lookup_lars_sgd.py 2025-05-07T20:02:48.0137966Z -- Installing: /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-install/fbgemm_gpu/split_embedding_codegen_lookup_invokers/lookup_partial_rowwise_adam.py 2025-05-07T20:02:48.0139276Z -- Installing: /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-install/fbgemm_gpu/split_embedding_codegen_lookup_invokers/lookup_partial_rowwise_lamb.py 2025-05-07T20:02:48.0140606Z -- Installing: /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-install/fbgemm_gpu/split_embedding_codegen_lookup_invokers/lookup_none.py 2025-05-07T20:02:48.0141896Z -- Installing: /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-install/fbgemm_gpu/split_embedding_codegen_lookup_invokers/lookup_rowwise_adagrad_with_counter.py 2025-05-07T20:02:48.0143279Z -- Installing: /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-install/fbgemm_gpu/split_embedding_codegen_lookup_invokers/lookup_rowwise_adagrad_ssd.py 2025-05-07T20:02:48.0144551Z -- Installing: /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-install/fbgemm_gpu/split_embedding_codegen_lookup_invokers/__init__.py 2025-05-07T20:02:48.0145746Z -- Installing: /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-install/fbgemm_gpu/split_embedding_codegen_lookup_invokers/lookup_args.py 2025-05-07T20:02:48.0147033Z -- Installing: /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-install/fbgemm_gpu/split_embedding_codegen_lookup_invokers/lookup_args_ssd.py 2025-05-07T20:02:48.0148406Z -- Installing: 
/__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-install/fbgemm_gpu/split_embedding_optimizer_codegen/split_embedding_optimizer_rowwise_adagrad.py 2025-05-07T20:02:48.0149697Z -- Installing: /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-install/fbgemm_gpu/split_embedding_optimizer_codegen/optimizer_args.py 2025-05-07T20:02:48.0150834Z -- Installing: /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-install/fbgemm_gpu/fbgemm_gpu_embedding_inplace_ops.so 2025-05-07T20:02:48.0161719Z -- Installing: /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-install/fbgemm_gpu/fbgemm_gpu_py.so 2025-05-07T20:02:48.0213846Z 2025-05-07T20:02:48.0252569Z 2025-05-07T20:02:48.0253349Z copying fbgemm_gpu/__init__.py -> _skbuild/linux-x86_64-3.13/cmake-install/fbgemm_gpu/__init__.py 2025-05-07T20:02:48.0254477Z copying fbgemm_gpu/batched_unary_embeddings_ops.py -> _skbuild/linux-x86_64-3.13/cmake-install/fbgemm_gpu/batched_unary_embeddings_ops.py 2025-05-07T20:02:48.0255504Z copying fbgemm_gpu/enums.py -> _skbuild/linux-x86_64-3.13/cmake-install/fbgemm_gpu/enums.py 2025-05-07T20:02:48.0256392Z copying fbgemm_gpu/metrics.py -> _skbuild/linux-x86_64-3.13/cmake-install/fbgemm_gpu/metrics.py 2025-05-07T20:02:48.0257467Z copying fbgemm_gpu/permute_pooled_embedding_modules.py -> _skbuild/linux-x86_64-3.13/cmake-install/fbgemm_gpu/permute_pooled_embedding_modules.py 2025-05-07T20:02:48.0258784Z copying fbgemm_gpu/permute_pooled_embedding_modules_split.py -> _skbuild/linux-x86_64-3.13/cmake-install/fbgemm_gpu/permute_pooled_embedding_modules_split.py 2025-05-07T20:02:48.0260077Z copying fbgemm_gpu/quantize_comm.py -> _skbuild/linux-x86_64-3.13/cmake-install/fbgemm_gpu/quantize_comm.py 2025-05-07T20:02:48.0260994Z copying fbgemm_gpu/quantize_utils.py -> _skbuild/linux-x86_64-3.13/cmake-install/fbgemm_gpu/quantize_utils.py 2025-05-07T20:02:48.0261994Z copying fbgemm_gpu/runtime_monitor.py -> 
_skbuild/linux-x86_64-3.13/cmake-install/fbgemm_gpu/runtime_monitor.py 2025-05-07T20:02:48.0277388Z copying fbgemm_gpu/sparse_ops.py -> _skbuild/linux-x86_64-3.13/cmake-install/fbgemm_gpu/sparse_ops.py 2025-05-07T20:02:48.0278384Z copying fbgemm_gpu/split_embedding_configs.py -> _skbuild/linux-x86_64-3.13/cmake-install/fbgemm_gpu/split_embedding_configs.py 2025-05-07T20:02:48.0279703Z copying fbgemm_gpu/split_embedding_inference_converter.py -> _skbuild/linux-x86_64-3.13/cmake-install/fbgemm_gpu/split_embedding_inference_converter.py 2025-05-07T20:02:48.0281028Z copying fbgemm_gpu/split_embedding_optimizer_ops.py -> _skbuild/linux-x86_64-3.13/cmake-install/fbgemm_gpu/split_embedding_optimizer_ops.py 2025-05-07T20:02:48.0282166Z copying fbgemm_gpu/split_embedding_utils.py -> _skbuild/linux-x86_64-3.13/cmake-install/fbgemm_gpu/split_embedding_utils.py 2025-05-07T20:02:48.0283589Z copying fbgemm_gpu/split_table_batched_embeddings_ops.py -> _skbuild/linux-x86_64-3.13/cmake-install/fbgemm_gpu/split_table_batched_embeddings_ops.py 2025-05-07T20:02:48.0284944Z copying fbgemm_gpu/split_table_batched_embeddings_ops_common.py -> _skbuild/linux-x86_64-3.13/cmake-install/fbgemm_gpu/split_table_batched_embeddings_ops_common.py 2025-05-07T20:02:48.0286397Z copying fbgemm_gpu/split_table_batched_embeddings_ops_inference.py -> _skbuild/linux-x86_64-3.13/cmake-install/fbgemm_gpu/split_table_batched_embeddings_ops_inference.py 2025-05-07T20:02:48.0287869Z copying fbgemm_gpu/split_table_batched_embeddings_ops_training.py -> _skbuild/linux-x86_64-3.13/cmake-install/fbgemm_gpu/split_table_batched_embeddings_ops_training.py 2025-05-07T20:02:48.0289413Z copying fbgemm_gpu/split_table_batched_embeddings_ops_training_common.py -> _skbuild/linux-x86_64-3.13/cmake-install/fbgemm_gpu/split_table_batched_embeddings_ops_training_common.py 2025-05-07T20:02:48.0290968Z copying fbgemm_gpu/ssd_split_table_batched_embeddings_ops.py -> 
_skbuild/linux-x86_64-3.13/cmake-install/fbgemm_gpu/ssd_split_table_batched_embeddings_ops.py 2025-05-07T20:02:48.0292227Z copying fbgemm_gpu/tbe_input_multiplexer.py -> _skbuild/linux-x86_64-3.13/cmake-install/fbgemm_gpu/tbe_input_multiplexer.py 2025-05-07T20:02:48.0293370Z copying fbgemm_gpu/uvm.py -> _skbuild/linux-x86_64-3.13/cmake-install/fbgemm_gpu/uvm.py 2025-05-07T20:02:48.0294085Z creating directory _skbuild/linux-x86_64-3.13/cmake-install/fbgemm_gpu/config 2025-05-07T20:02:48.0294893Z copying fbgemm_gpu/config/__init__.py -> _skbuild/linux-x86_64-3.13/cmake-install/fbgemm_gpu/config/__init__.py 2025-05-07T20:02:48.0295900Z copying fbgemm_gpu/config/feature_list.py -> _skbuild/linux-x86_64-3.13/cmake-install/fbgemm_gpu/config/feature_list.py 2025-05-07T20:02:48.0296809Z creating directory _skbuild/linux-x86_64-3.13/cmake-install/fbgemm_gpu/docs 2025-05-07T20:02:48.0297598Z copying fbgemm_gpu/docs/__init__.py -> _skbuild/linux-x86_64-3.13/cmake-install/fbgemm_gpu/docs/__init__.py 2025-05-07T20:02:48.0298567Z copying fbgemm_gpu/docs/common.py -> _skbuild/linux-x86_64-3.13/cmake-install/fbgemm_gpu/docs/common.py 2025-05-07T20:02:48.0299546Z copying fbgemm_gpu/docs/examples.py -> _skbuild/linux-x86_64-3.13/cmake-install/fbgemm_gpu/docs/examples.py 2025-05-07T20:02:48.0300561Z copying fbgemm_gpu/docs/jagged_tensor_ops.py -> _skbuild/linux-x86_64-3.13/cmake-install/fbgemm_gpu/docs/jagged_tensor_ops.py 2025-05-07T20:02:48.0301711Z copying fbgemm_gpu/docs/merge_pooled_embedding_ops.py -> _skbuild/linux-x86_64-3.13/cmake-install/fbgemm_gpu/docs/merge_pooled_embedding_ops.py 2025-05-07T20:02:48.0303184Z copying fbgemm_gpu/docs/permute_pooled_embedding_ops.py -> _skbuild/linux-x86_64-3.13/cmake-install/fbgemm_gpu/docs/permute_pooled_embedding_ops.py 2025-05-07T20:02:48.0304317Z copying fbgemm_gpu/docs/quantize_ops.py -> _skbuild/linux-x86_64-3.13/cmake-install/fbgemm_gpu/docs/quantize_ops.py 2025-05-07T20:02:48.0305307Z copying fbgemm_gpu/docs/sparse_ops.py -> 
_skbuild/linux-x86_64-3.13/cmake-install/fbgemm_gpu/docs/sparse_ops.py 2025-05-07T20:02:48.0306202Z copying fbgemm_gpu/docs/version.py -> _skbuild/linux-x86_64-3.13/cmake-install/fbgemm_gpu/docs/version.py 2025-05-07T20:02:48.0307016Z creating directory _skbuild/linux-x86_64-3.13/cmake-install/fbgemm_gpu/quantize 2025-05-07T20:02:48.0307900Z copying fbgemm_gpu/quantize/__init__.py -> _skbuild/linux-x86_64-3.13/cmake-install/fbgemm_gpu/quantize/__init__.py 2025-05-07T20:02:48.0308988Z copying fbgemm_gpu/quantize/quantize_ops.py -> _skbuild/linux-x86_64-3.13/cmake-install/fbgemm_gpu/quantize/quantize_ops.py 2025-05-07T20:02:48.0309899Z creating directory _skbuild/linux-x86_64-3.13/cmake-install/fbgemm_gpu/sll 2025-05-07T20:02:48.0310630Z copying fbgemm_gpu/sll/__init__.py -> _skbuild/linux-x86_64-3.13/cmake-install/fbgemm_gpu/sll/__init__.py 2025-05-07T20:02:48.0311436Z creating directory _skbuild/linux-x86_64-3.13/cmake-install/fbgemm_gpu/tbe 2025-05-07T20:02:48.0312213Z copying fbgemm_gpu/tbe/__init__.py -> _skbuild/linux-x86_64-3.13/cmake-install/fbgemm_gpu/tbe/__init__.py 2025-05-07T20:02:48.0313189Z creating directory _skbuild/linux-x86_64-3.13/cmake-install/fbgemm_gpu/triton 2025-05-07T20:02:48.0314032Z copying fbgemm_gpu/triton/__init__.py -> _skbuild/linux-x86_64-3.13/cmake-install/fbgemm_gpu/triton/__init__.py 2025-05-07T20:02:48.0314962Z copying fbgemm_gpu/triton/common.py -> _skbuild/linux-x86_64-3.13/cmake-install/fbgemm_gpu/triton/common.py 2025-05-07T20:02:48.0315930Z copying fbgemm_gpu/triton/quantize.py -> _skbuild/linux-x86_64-3.13/cmake-install/fbgemm_gpu/triton/quantize.py 2025-05-07T20:02:48.0316881Z copying fbgemm_gpu/triton/quantize_ref.py -> _skbuild/linux-x86_64-3.13/cmake-install/fbgemm_gpu/triton/quantize_ref.py 2025-05-07T20:02:48.0317771Z creating directory _skbuild/linux-x86_64-3.13/cmake-install/fbgemm_gpu/utils 2025-05-07T20:02:48.0318561Z copying fbgemm_gpu/utils/__init__.py -> 
_skbuild/linux-x86_64-3.13/cmake-install/fbgemm_gpu/utils/__init__.py 2025-05-07T20:02:48.0319580Z copying fbgemm_gpu/utils/filestore.py -> _skbuild/linux-x86_64-3.13/cmake-install/fbgemm_gpu/utils/filestore.py 2025-05-07T20:02:48.0320517Z copying fbgemm_gpu/utils/loader.py -> _skbuild/linux-x86_64-3.13/cmake-install/fbgemm_gpu/utils/loader.py 2025-05-07T20:02:48.0321491Z copying fbgemm_gpu/utils/torch_library.py -> _skbuild/linux-x86_64-3.13/cmake-install/fbgemm_gpu/utils/torch_library.py 2025-05-07T20:02:48.0322458Z creating directory _skbuild/linux-x86_64-3.13/cmake-install/fbgemm_gpu/sll/cpu 2025-05-07T20:02:48.0323299Z copying fbgemm_gpu/sll/cpu/__init__.py -> _skbuild/linux-x86_64-3.13/cmake-install/fbgemm_gpu/sll/cpu/__init__.py 2025-05-07T20:02:48.0324299Z copying fbgemm_gpu/sll/cpu/cpu_sll.py -> _skbuild/linux-x86_64-3.13/cmake-install/fbgemm_gpu/sll/cpu/cpu_sll.py 2025-05-07T20:02:48.0325221Z creating directory _skbuild/linux-x86_64-3.13/cmake-install/fbgemm_gpu/sll/meta 2025-05-07T20:02:48.0326099Z copying fbgemm_gpu/sll/meta/__init__.py -> _skbuild/linux-x86_64-3.13/cmake-install/fbgemm_gpu/sll/meta/__init__.py 2025-05-07T20:02:48.0327103Z copying fbgemm_gpu/sll/meta/meta_sll.py -> _skbuild/linux-x86_64-3.13/cmake-install/fbgemm_gpu/sll/meta/meta_sll.py 2025-05-07T20:02:48.0328001Z creating directory _skbuild/linux-x86_64-3.13/cmake-install/fbgemm_gpu/sll/triton 2025-05-07T20:02:48.0328898Z copying fbgemm_gpu/sll/triton/__init__.py -> _skbuild/linux-x86_64-3.13/cmake-install/fbgemm_gpu/sll/triton/__init__.py 2025-05-07T20:02:48.0329904Z copying fbgemm_gpu/sll/triton/common.py -> _skbuild/linux-x86_64-3.13/cmake-install/fbgemm_gpu/sll/triton/common.py 2025-05-07T20:02:48.0331179Z copying fbgemm_gpu/sll/triton/triton_dense_jagged_cat_jagged_out.py -> _skbuild/linux-x86_64-3.13/cmake-install/fbgemm_gpu/sll/triton/triton_dense_jagged_cat_jagged_out.py 2025-05-07T20:02:48.0332641Z copying fbgemm_gpu/sll/triton/triton_jagged2_to_padded_dense.py -> 
_skbuild/linux-x86_64-3.13/cmake-install/fbgemm_gpu/sll/triton/triton_jagged2_to_padded_dense.py 2025-05-07T20:02:48.0333978Z copying fbgemm_gpu/sll/triton/triton_jagged_bmm.py -> _skbuild/linux-x86_64-3.13/cmake-install/fbgemm_gpu/sll/triton/triton_jagged_bmm.py 2025-05-07T20:02:48.0335282Z copying fbgemm_gpu/sll/triton/triton_jagged_bmm_jagged_out.py -> _skbuild/linux-x86_64-3.13/cmake-install/fbgemm_gpu/sll/triton/triton_jagged_bmm_jagged_out.py 2025-05-07T20:02:48.0336778Z copying fbgemm_gpu/sll/triton/triton_jagged_dense_elementwise_add.py -> _skbuild/linux-x86_64-3.13/cmake-install/fbgemm_gpu/sll/triton/triton_jagged_dense_elementwise_add.py 2025-05-07T20:02:48.0338447Z copying fbgemm_gpu/sll/triton/triton_jagged_dense_elementwise_mul_jagged_out.py -> _skbuild/linux-x86_64-3.13/cmake-install/fbgemm_gpu/sll/triton/triton_jagged_dense_elementwise_mul_jagged_out.py 2025-05-07T20:02:48.0340034Z copying fbgemm_gpu/sll/triton/triton_jagged_dense_flash_attention.py -> _skbuild/linux-x86_64-3.13/cmake-install/fbgemm_gpu/sll/triton/triton_jagged_dense_flash_attention.py 2025-05-07T20:02:48.0341497Z copying fbgemm_gpu/sll/triton/triton_jagged_flash_attention_basic.py -> _skbuild/linux-x86_64-3.13/cmake-install/fbgemm_gpu/sll/triton/triton_jagged_flash_attention_basic.py 2025-05-07T20:02:48.0343121Z copying fbgemm_gpu/sll/triton/triton_jagged_self_substraction_jagged_out.py -> _skbuild/linux-x86_64-3.13/cmake-install/fbgemm_gpu/sll/triton/triton_jagged_self_substraction_jagged_out.py 2025-05-07T20:02:48.0344473Z copying fbgemm_gpu/sll/triton/triton_jagged_softmax.py -> _skbuild/linux-x86_64-3.13/cmake-install/fbgemm_gpu/sll/triton/triton_jagged_softmax.py 2025-05-07T20:02:48.0345829Z copying fbgemm_gpu/sll/triton/triton_multi_head_jagged_flash_attention.py -> _skbuild/linux-x86_64-3.13/cmake-install/fbgemm_gpu/sll/triton/triton_multi_head_jagged_flash_attention.py 2025-05-07T20:02:48.0346914Z creating directory 
_skbuild/linux-x86_64-3.13/cmake-install/fbgemm_gpu/tbe/bench 2025-05-07T20:02:48.0347696Z copying fbgemm_gpu/tbe/bench/__init__.py -> _skbuild/linux-x86_64-3.13/cmake-install/fbgemm_gpu/tbe/bench/__init__.py 2025-05-07T20:02:48.0348653Z copying fbgemm_gpu/tbe/bench/bench_config.py -> _skbuild/linux-x86_64-3.13/cmake-install/fbgemm_gpu/tbe/bench/bench_config.py 2025-05-07T20:02:48.0349668Z copying fbgemm_gpu/tbe/bench/bench_runs.py -> _skbuild/linux-x86_64-3.13/cmake-install/fbgemm_gpu/tbe/bench/bench_runs.py 2025-05-07T20:02:48.0350626Z copying fbgemm_gpu/tbe/bench/eeg_cli.py -> _skbuild/linux-x86_64-3.13/cmake-install/fbgemm_gpu/tbe/bench/eeg_cli.py 2025-05-07T20:02:48.0351749Z copying fbgemm_gpu/tbe/bench/embedding_ops_common_config.py -> _skbuild/linux-x86_64-3.13/cmake-install/fbgemm_gpu/tbe/bench/embedding_ops_common_config.py 2025-05-07T20:02:48.0352968Z copying fbgemm_gpu/tbe/bench/eval_compression.py -> _skbuild/linux-x86_64-3.13/cmake-install/fbgemm_gpu/tbe/bench/eval_compression.py 2025-05-07T20:02:48.0354042Z copying fbgemm_gpu/tbe/bench/reporter.py -> _skbuild/linux-x86_64-3.13/cmake-install/fbgemm_gpu/tbe/bench/reporter.py 2025-05-07T20:02:48.0355070Z copying fbgemm_gpu/tbe/bench/tbe_data_config.py -> _skbuild/linux-x86_64-3.13/cmake-install/fbgemm_gpu/tbe/bench/tbe_data_config.py 2025-05-07T20:02:48.0356147Z copying fbgemm_gpu/tbe/bench/tbe_data_config_loader.py -> _skbuild/linux-x86_64-3.13/cmake-install/fbgemm_gpu/tbe/bench/tbe_data_config_loader.py 2025-05-07T20:02:48.0357380Z copying fbgemm_gpu/tbe/bench/tbe_data_config_param_models.py -> _skbuild/linux-x86_64-3.13/cmake-install/fbgemm_gpu/tbe/bench/tbe_data_config_param_models.py 2025-05-07T20:02:48.0358447Z copying fbgemm_gpu/tbe/bench/utils.py -> _skbuild/linux-x86_64-3.13/cmake-install/fbgemm_gpu/tbe/bench/utils.py 2025-05-07T20:02:48.0359248Z creating directory _skbuild/linux-x86_64-3.13/cmake-install/fbgemm_gpu/tbe/cache 2025-05-07T20:02:48.0360024Z copying fbgemm_gpu/tbe/cache/__init__.py 
-> _skbuild/linux-x86_64-3.13/cmake-install/fbgemm_gpu/tbe/cache/__init__.py 2025-05-07T20:02:48.0361081Z copying fbgemm_gpu/tbe/cache/split_embeddings_cache_ops.py -> _skbuild/linux-x86_64-3.13/cmake-install/fbgemm_gpu/tbe/cache/split_embeddings_cache_ops.py 2025-05-07T20:02:48.0362028Z creating directory _skbuild/linux-x86_64-3.13/cmake-install/fbgemm_gpu/tbe/ssd 2025-05-07T20:02:48.0362802Z copying fbgemm_gpu/tbe/ssd/__init__.py -> _skbuild/linux-x86_64-3.13/cmake-install/fbgemm_gpu/tbe/ssd/__init__.py 2025-05-07T20:02:48.0363664Z copying fbgemm_gpu/tbe/ssd/common.py -> _skbuild/linux-x86_64-3.13/cmake-install/fbgemm_gpu/tbe/ssd/common.py 2025-05-07T20:02:48.0364571Z copying fbgemm_gpu/tbe/ssd/inference.py -> _skbuild/linux-x86_64-3.13/cmake-install/fbgemm_gpu/tbe/ssd/inference.py 2025-05-07T20:02:48.0365604Z copying fbgemm_gpu/tbe/ssd/training.py -> _skbuild/linux-x86_64-3.13/cmake-install/fbgemm_gpu/tbe/ssd/training.py 2025-05-07T20:02:48.0366384Z creating directory _skbuild/linux-x86_64-3.13/cmake-install/fbgemm_gpu/tbe/utils 2025-05-07T20:02:48.0367140Z copying fbgemm_gpu/tbe/utils/__init__.py -> _skbuild/linux-x86_64-3.13/cmake-install/fbgemm_gpu/tbe/utils/__init__.py 2025-05-07T20:02:48.0368009Z copying fbgemm_gpu/tbe/utils/common.py -> _skbuild/linux-x86_64-3.13/cmake-install/fbgemm_gpu/tbe/utils/common.py 2025-05-07T20:02:48.0368950Z copying fbgemm_gpu/tbe/utils/offsets.py -> _skbuild/linux-x86_64-3.13/cmake-install/fbgemm_gpu/tbe/utils/offsets.py 2025-05-07T20:02:48.0369858Z copying fbgemm_gpu/tbe/utils/quantize.py -> _skbuild/linux-x86_64-3.13/cmake-install/fbgemm_gpu/tbe/utils/quantize.py 2025-05-07T20:02:48.0370815Z copying fbgemm_gpu/tbe/utils/requests.py -> _skbuild/linux-x86_64-3.13/cmake-install/fbgemm_gpu/tbe/utils/requests.py 2025-05-07T20:02:48.0371612Z creating directory _skbuild/linux-x86_64-3.13/cmake-install/fbgemm_gpu/tbe/stats 2025-05-07T20:02:48.0372380Z copying fbgemm_gpu/tbe/stats/__init__.py -> 
_skbuild/linux-x86_64-3.13/cmake-install/fbgemm_gpu/tbe/stats/__init__.py 2025-05-07T20:02:48.0373380Z copying fbgemm_gpu/tbe/stats/bench_params_reporter.py -> _skbuild/linux-x86_64-3.13/cmake-install/fbgemm_gpu/tbe/stats/bench_params_reporter.py 2025-05-07T20:02:48.0374251Z creating directory _skbuild/linux-x86_64-3.13/cmake-install/fbgemm_gpu/tbe/ssd/utils 2025-05-07T20:02:48.0375095Z copying fbgemm_gpu/tbe/ssd/utils/__init__.py -> _skbuild/linux-x86_64-3.13/cmake-install/fbgemm_gpu/tbe/ssd/utils/__init__.py 2025-05-07T20:02:48.0376252Z copying fbgemm_gpu/tbe/ssd/utils/partially_materialized_tensor.py -> _skbuild/linux-x86_64-3.13/cmake-install/fbgemm_gpu/tbe/ssd/utils/partially_materialized_tensor.py 2025-05-07T20:02:48.0377287Z creating directory _skbuild/linux-x86_64-3.13/cmake-install/fbgemm_gpu/triton/jagged 2025-05-07T20:02:48.0378094Z copying fbgemm_gpu/triton/jagged/__init__.py -> _skbuild/linux-x86_64-3.13/cmake-install/fbgemm_gpu/triton/jagged/__init__.py 2025-05-07T20:02:48.0379175Z copying fbgemm_gpu/triton/jagged/triton_jagged_tensor_ops.py -> _skbuild/linux-x86_64-3.13/cmake-install/fbgemm_gpu/triton/jagged/triton_jagged_tensor_ops.py 2025-05-07T20:02:48.0379919Z 2025-05-07T20:02:48.0470387Z INFO:root:running bdist_wheel 2025-05-07T20:02:48.0515252Z INFO:root:running build 2025-05-07T20:02:48.0515577Z INFO:root:running build_py 2025-05-07T20:02:48.0521057Z INFO:root:creating _skbuild/linux-x86_64-3.13/setuptools/lib.linux-x86_64-cpython-313/fbgemm_gpu 2025-05-07T20:02:48.0522644Z INFO:root:copying _skbuild/linux-x86_64-3.13/cmake-install/fbgemm_gpu/__init__.py -> _skbuild/linux-x86_64-3.13/setuptools/lib.linux-x86_64-cpython-313/fbgemm_gpu 2025-05-07T20:02:48.0524647Z INFO:root:copying _skbuild/linux-x86_64-3.13/cmake-install/fbgemm_gpu/batched_unary_embeddings_ops.py -> _skbuild/linux-x86_64-3.13/setuptools/lib.linux-x86_64-cpython-313/fbgemm_gpu 2025-05-07T20:02:48.0525999Z INFO:root:copying 
_skbuild/linux-x86_64-3.13/cmake-install/fbgemm_gpu/enums.py -> _skbuild/linux-x86_64-3.13/setuptools/lib.linux-x86_64-cpython-313/fbgemm_gpu 2025-05-07T20:02:48.0527400Z INFO:root:copying _skbuild/linux-x86_64-3.13/cmake-install/fbgemm_gpu/metrics.py -> _skbuild/linux-x86_64-3.13/setuptools/lib.linux-x86_64-cpython-313/fbgemm_gpu 2025-05-07T20:02:48.0529178Z INFO:root:copying _skbuild/linux-x86_64-3.13/cmake-install/fbgemm_gpu/permute_pooled_embedding_modules.py -> _skbuild/linux-x86_64-3.13/setuptools/lib.linux-x86_64-cpython-313/fbgemm_gpu 2025-05-07T20:02:48.0530925Z INFO:root:copying _skbuild/linux-x86_64-3.13/cmake-install/fbgemm_gpu/permute_pooled_embedding_modules_split.py -> _skbuild/linux-x86_64-3.13/setuptools/lib.linux-x86_64-cpython-313/fbgemm_gpu 2025-05-07T20:02:48.0532485Z INFO:root:copying _skbuild/linux-x86_64-3.13/cmake-install/fbgemm_gpu/quantize_comm.py -> _skbuild/linux-x86_64-3.13/setuptools/lib.linux-x86_64-cpython-313/fbgemm_gpu 2025-05-07T20:02:48.0533879Z INFO:root:copying _skbuild/linux-x86_64-3.13/cmake-install/fbgemm_gpu/quantize_utils.py -> _skbuild/linux-x86_64-3.13/setuptools/lib.linux-x86_64-cpython-313/fbgemm_gpu 2025-05-07T20:02:48.0535321Z INFO:root:copying _skbuild/linux-x86_64-3.13/cmake-install/fbgemm_gpu/runtime_monitor.py -> _skbuild/linux-x86_64-3.13/setuptools/lib.linux-x86_64-cpython-313/fbgemm_gpu 2025-05-07T20:02:48.0536751Z INFO:root:copying _skbuild/linux-x86_64-3.13/cmake-install/fbgemm_gpu/sparse_ops.py -> _skbuild/linux-x86_64-3.13/setuptools/lib.linux-x86_64-cpython-313/fbgemm_gpu 2025-05-07T20:02:48.0538282Z INFO:root:copying _skbuild/linux-x86_64-3.13/cmake-install/fbgemm_gpu/split_embedding_configs.py -> _skbuild/linux-x86_64-3.13/setuptools/lib.linux-x86_64-cpython-313/fbgemm_gpu 2025-05-07T20:02:48.0539836Z INFO:root:copying _skbuild/linux-x86_64-3.13/cmake-install/fbgemm_gpu/split_embedding_inference_converter.py -> _skbuild/linux-x86_64-3.13/setuptools/lib.linux-x86_64-cpython-313/fbgemm_gpu 
2025-05-07T20:02:48.0541464Z INFO:root:copying _skbuild/linux-x86_64-3.13/cmake-install/fbgemm_gpu/split_embedding_optimizer_ops.py -> _skbuild/linux-x86_64-3.13/setuptools/lib.linux-x86_64-cpython-313/fbgemm_gpu 2025-05-07T20:02:48.0542968Z INFO:root:copying _skbuild/linux-x86_64-3.13/cmake-install/fbgemm_gpu/split_embedding_utils.py -> _skbuild/linux-x86_64-3.13/setuptools/lib.linux-x86_64-cpython-313/fbgemm_gpu 2025-05-07T20:02:48.0544516Z INFO:root:copying _skbuild/linux-x86_64-3.13/cmake-install/fbgemm_gpu/split_table_batched_embeddings_ops.py -> _skbuild/linux-x86_64-3.13/setuptools/lib.linux-x86_64-cpython-313/fbgemm_gpu 2025-05-07T20:02:48.0546198Z INFO:root:copying _skbuild/linux-x86_64-3.13/cmake-install/fbgemm_gpu/split_table_batched_embeddings_ops_common.py -> _skbuild/linux-x86_64-3.13/setuptools/lib.linux-x86_64-cpython-313/fbgemm_gpu 2025-05-07T20:02:48.0547843Z INFO:root:copying _skbuild/linux-x86_64-3.13/cmake-install/fbgemm_gpu/split_table_batched_embeddings_ops_inference.py -> _skbuild/linux-x86_64-3.13/setuptools/lib.linux-x86_64-cpython-313/fbgemm_gpu 2025-05-07T20:02:48.0549442Z INFO:root:copying _skbuild/linux-x86_64-3.13/cmake-install/fbgemm_gpu/split_table_batched_embeddings_ops_training.py -> _skbuild/linux-x86_64-3.13/setuptools/lib.linux-x86_64-cpython-313/fbgemm_gpu 2025-05-07T20:02:48.0551608Z INFO:root:copying _skbuild/linux-x86_64-3.13/cmake-install/fbgemm_gpu/split_table_batched_embeddings_ops_training_common.py -> _skbuild/linux-x86_64-3.13/setuptools/lib.linux-x86_64-cpython-313/fbgemm_gpu 2025-05-07T20:02:48.0553360Z INFO:root:copying _skbuild/linux-x86_64-3.13/cmake-install/fbgemm_gpu/ssd_split_table_batched_embeddings_ops.py -> _skbuild/linux-x86_64-3.13/setuptools/lib.linux-x86_64-cpython-313/fbgemm_gpu 2025-05-07T20:02:48.0554904Z INFO:root:copying _skbuild/linux-x86_64-3.13/cmake-install/fbgemm_gpu/tbe_input_multiplexer.py -> _skbuild/linux-x86_64-3.13/setuptools/lib.linux-x86_64-cpython-313/fbgemm_gpu 
2025-05-07T20:02:48.0556310Z INFO:root:copying _skbuild/linux-x86_64-3.13/cmake-install/fbgemm_gpu/uvm.py -> _skbuild/linux-x86_64-3.13/setuptools/lib.linux-x86_64-cpython-313/fbgemm_gpu 2025-05-07T20:02:48.0557659Z INFO:root:creating _skbuild/linux-x86_64-3.13/setuptools/lib.linux-x86_64-cpython-313/fbgemm_gpu/config 2025-05-07T20:02:48.0558910Z INFO:root:copying _skbuild/linux-x86_64-3.13/cmake-install/fbgemm_gpu/config/__init__.py -> _skbuild/linux-x86_64-3.13/setuptools/lib.linux-x86_64-cpython-313/fbgemm_gpu/config 2025-05-07T20:02:48.0560439Z INFO:root:copying _skbuild/linux-x86_64-3.13/cmake-install/fbgemm_gpu/config/feature_list.py -> _skbuild/linux-x86_64-3.13/setuptools/lib.linux-x86_64-cpython-313/fbgemm_gpu/config 2025-05-07T20:02:48.0562417Z INFO:root:creating _skbuild/linux-x86_64-3.13/setuptools/lib.linux-x86_64-cpython-313/fbgemm_gpu/docs 2025-05-07T20:02:48.0563526Z INFO:root:copying _skbuild/linux-x86_64-3.13/cmake-install/fbgemm_gpu/docs/__init__.py -> _skbuild/linux-x86_64-3.13/setuptools/lib.linux-x86_64-cpython-313/fbgemm_gpu/docs 2025-05-07T20:02:48.0564871Z INFO:root:copying _skbuild/linux-x86_64-3.13/cmake-install/fbgemm_gpu/docs/common.py -> _skbuild/linux-x86_64-3.13/setuptools/lib.linux-x86_64-cpython-313/fbgemm_gpu/docs 2025-05-07T20:02:48.0566262Z INFO:root:copying _skbuild/linux-x86_64-3.13/cmake-install/fbgemm_gpu/docs/examples.py -> _skbuild/linux-x86_64-3.13/setuptools/lib.linux-x86_64-cpython-313/fbgemm_gpu/docs 2025-05-07T20:02:48.0567680Z INFO:root:copying _skbuild/linux-x86_64-3.13/cmake-install/fbgemm_gpu/docs/jagged_tensor_ops.py -> _skbuild/linux-x86_64-3.13/setuptools/lib.linux-x86_64-cpython-313/fbgemm_gpu/docs 2025-05-07T20:02:48.0569270Z INFO:root:copying _skbuild/linux-x86_64-3.13/cmake-install/fbgemm_gpu/docs/merge_pooled_embedding_ops.py -> _skbuild/linux-x86_64-3.13/setuptools/lib.linux-x86_64-cpython-313/fbgemm_gpu/docs 2025-05-07T20:02:48.0570820Z INFO:root:copying 
_skbuild/linux-x86_64-3.13/cmake-install/fbgemm_gpu/docs/permute_pooled_embedding_ops.py -> _skbuild/linux-x86_64-3.13/setuptools/lib.linux-x86_64-cpython-313/fbgemm_gpu/docs 2025-05-07T20:02:48.0572314Z INFO:root:copying _skbuild/linux-x86_64-3.13/cmake-install/fbgemm_gpu/docs/quantize_ops.py -> _skbuild/linux-x86_64-3.13/setuptools/lib.linux-x86_64-cpython-313/fbgemm_gpu/docs 2025-05-07T20:02:48.0573716Z INFO:root:copying _skbuild/linux-x86_64-3.13/cmake-install/fbgemm_gpu/docs/sparse_ops.py -> _skbuild/linux-x86_64-3.13/setuptools/lib.linux-x86_64-cpython-313/fbgemm_gpu/docs 2025-05-07T20:02:48.0575083Z INFO:root:copying _skbuild/linux-x86_64-3.13/cmake-install/fbgemm_gpu/docs/version.py -> _skbuild/linux-x86_64-3.13/setuptools/lib.linux-x86_64-cpython-313/fbgemm_gpu/docs 2025-05-07T20:02:48.0576328Z INFO:root:creating _skbuild/linux-x86_64-3.13/setuptools/lib.linux-x86_64-cpython-313/fbgemm_gpu/quantize 2025-05-07T20:02:48.0577651Z INFO:root:copying _skbuild/linux-x86_64-3.13/cmake-install/fbgemm_gpu/quantize/__init__.py -> _skbuild/linux-x86_64-3.13/setuptools/lib.linux-x86_64-cpython-313/fbgemm_gpu/quantize 2025-05-07T20:02:48.0579137Z INFO:root:copying _skbuild/linux-x86_64-3.13/cmake-install/fbgemm_gpu/quantize/quantize_ops.py -> _skbuild/linux-x86_64-3.13/setuptools/lib.linux-x86_64-cpython-313/fbgemm_gpu/quantize 2025-05-07T20:02:48.0580612Z INFO:root:creating _skbuild/linux-x86_64-3.13/setuptools/lib.linux-x86_64-cpython-313/fbgemm_gpu/sll 2025-05-07T20:02:48.0581758Z INFO:root:copying _skbuild/linux-x86_64-3.13/cmake-install/fbgemm_gpu/sll/__init__.py -> _skbuild/linux-x86_64-3.13/setuptools/lib.linux-x86_64-cpython-313/fbgemm_gpu/sll 2025-05-07T20:02:48.0583473Z INFO:root:creating _skbuild/linux-x86_64-3.13/setuptools/lib.linux-x86_64-cpython-313/fbgemm_gpu/tbe 2025-05-07T20:02:48.0584569Z INFO:root:copying _skbuild/linux-x86_64-3.13/cmake-install/fbgemm_gpu/tbe/__init__.py -> 
_skbuild/linux-x86_64-3.13/setuptools/lib.linux-x86_64-cpython-313/fbgemm_gpu/tbe 2025-05-07T20:02:48.0586416Z INFO:root:creating _skbuild/linux-x86_64-3.13/setuptools/lib.linux-x86_64-cpython-313/fbgemm_gpu/triton 2025-05-07T20:02:48.0587514Z INFO:root:copying _skbuild/linux-x86_64-3.13/cmake-install/fbgemm_gpu/triton/__init__.py -> _skbuild/linux-x86_64-3.13/setuptools/lib.linux-x86_64-cpython-313/fbgemm_gpu/triton 2025-05-07T20:02:48.0588904Z INFO:root:copying _skbuild/linux-x86_64-3.13/cmake-install/fbgemm_gpu/triton/common.py -> _skbuild/linux-x86_64-3.13/setuptools/lib.linux-x86_64-cpython-313/fbgemm_gpu/triton 2025-05-07T20:02:48.0590312Z INFO:root:copying _skbuild/linux-x86_64-3.13/cmake-install/fbgemm_gpu/triton/quantize.py -> _skbuild/linux-x86_64-3.13/setuptools/lib.linux-x86_64-cpython-313/fbgemm_gpu/triton 2025-05-07T20:02:48.0591889Z INFO:root:copying _skbuild/linux-x86_64-3.13/cmake-install/fbgemm_gpu/triton/quantize_ref.py -> _skbuild/linux-x86_64-3.13/setuptools/lib.linux-x86_64-cpython-313/fbgemm_gpu/triton 2025-05-07T20:02:48.0594099Z INFO:root:creating _skbuild/linux-x86_64-3.13/setuptools/lib.linux-x86_64-cpython-313/fbgemm_gpu/utils 2025-05-07T20:02:48.0595206Z INFO:root:copying _skbuild/linux-x86_64-3.13/cmake-install/fbgemm_gpu/utils/__init__.py -> _skbuild/linux-x86_64-3.13/setuptools/lib.linux-x86_64-cpython-313/fbgemm_gpu/utils 2025-05-07T20:02:48.0596814Z INFO:root:copying _skbuild/linux-x86_64-3.13/cmake-install/fbgemm_gpu/utils/filestore.py -> _skbuild/linux-x86_64-3.13/setuptools/lib.linux-x86_64-cpython-313/fbgemm_gpu/utils 2025-05-07T20:02:48.0598302Z INFO:root:copying _skbuild/linux-x86_64-3.13/cmake-install/fbgemm_gpu/utils/loader.py -> _skbuild/linux-x86_64-3.13/setuptools/lib.linux-x86_64-cpython-313/fbgemm_gpu/utils 2025-05-07T20:02:48.0599819Z INFO:root:copying _skbuild/linux-x86_64-3.13/cmake-install/fbgemm_gpu/utils/torch_library.py -> _skbuild/linux-x86_64-3.13/setuptools/lib.linux-x86_64-cpython-313/fbgemm_gpu/utils 
2025-05-07T20:02:48.0601332Z INFO:root:creating _skbuild/linux-x86_64-3.13/setuptools/lib.linux-x86_64-cpython-313/fbgemm_gpu/sll/cpu 2025-05-07T20:02:48.0602733Z INFO:root:copying _skbuild/linux-x86_64-3.13/cmake-install/fbgemm_gpu/sll/cpu/__init__.py -> _skbuild/linux-x86_64-3.13/setuptools/lib.linux-x86_64-cpython-313/fbgemm_gpu/sll/cpu 2025-05-07T20:02:48.0604284Z INFO:root:copying _skbuild/linux-x86_64-3.13/cmake-install/fbgemm_gpu/sll/cpu/cpu_sll.py -> _skbuild/linux-x86_64-3.13/setuptools/lib.linux-x86_64-cpython-313/fbgemm_gpu/sll/cpu 2025-05-07T20:02:48.0606129Z INFO:root:creating _skbuild/linux-x86_64-3.13/setuptools/lib.linux-x86_64-cpython-313/fbgemm_gpu/sll/meta 2025-05-07T20:02:48.0607433Z INFO:root:copying _skbuild/linux-x86_64-3.13/cmake-install/fbgemm_gpu/sll/meta/__init__.py -> _skbuild/linux-x86_64-3.13/setuptools/lib.linux-x86_64-cpython-313/fbgemm_gpu/sll/meta 2025-05-07T20:02:48.0609033Z INFO:root:copying _skbuild/linux-x86_64-3.13/cmake-install/fbgemm_gpu/sll/meta/meta_sll.py -> _skbuild/linux-x86_64-3.13/setuptools/lib.linux-x86_64-cpython-313/fbgemm_gpu/sll/meta 2025-05-07T20:02:48.0610741Z INFO:root:creating _skbuild/linux-x86_64-3.13/setuptools/lib.linux-x86_64-cpython-313/fbgemm_gpu/sll/triton 2025-05-07T20:02:48.0612394Z INFO:root:copying _skbuild/linux-x86_64-3.13/cmake-install/fbgemm_gpu/sll/triton/__init__.py -> _skbuild/linux-x86_64-3.13/setuptools/lib.linux-x86_64-cpython-313/fbgemm_gpu/sll/triton 2025-05-07T20:02:48.0613966Z INFO:root:copying _skbuild/linux-x86_64-3.13/cmake-install/fbgemm_gpu/sll/triton/common.py -> _skbuild/linux-x86_64-3.13/setuptools/lib.linux-x86_64-cpython-313/fbgemm_gpu/sll/triton 2025-05-07T20:02:48.0615697Z INFO:root:copying _skbuild/linux-x86_64-3.13/cmake-install/fbgemm_gpu/sll/triton/triton_dense_jagged_cat_jagged_out.py -> _skbuild/linux-x86_64-3.13/setuptools/lib.linux-x86_64-cpython-313/fbgemm_gpu/sll/triton 2025-05-07T20:02:48.0617529Z INFO:root:copying 
_skbuild/linux-x86_64-3.13/cmake-install/fbgemm_gpu/sll/triton/triton_jagged2_to_padded_dense.py -> _skbuild/linux-x86_64-3.13/setuptools/lib.linux-x86_64-cpython-313/fbgemm_gpu/sll/triton 2025-05-07T20:02:48.0619200Z INFO:root:copying _skbuild/linux-x86_64-3.13/cmake-install/fbgemm_gpu/sll/triton/triton_jagged_bmm.py -> _skbuild/linux-x86_64-3.13/setuptools/lib.linux-x86_64-cpython-313/fbgemm_gpu/sll/triton 2025-05-07T20:02:48.0620917Z INFO:root:copying _skbuild/linux-x86_64-3.13/cmake-install/fbgemm_gpu/sll/triton/triton_jagged_bmm_jagged_out.py -> _skbuild/linux-x86_64-3.13/setuptools/lib.linux-x86_64-cpython-313/fbgemm_gpu/sll/triton 2025-05-07T20:02:48.0622664Z INFO:root:copying _skbuild/linux-x86_64-3.13/cmake-install/fbgemm_gpu/sll/triton/triton_jagged_dense_elementwise_add.py -> _skbuild/linux-x86_64-3.13/setuptools/lib.linux-x86_64-cpython-313/fbgemm_gpu/sll/triton 2025-05-07T20:02:48.0624496Z INFO:root:copying _skbuild/linux-x86_64-3.13/cmake-install/fbgemm_gpu/sll/triton/triton_jagged_dense_elementwise_mul_jagged_out.py -> _skbuild/linux-x86_64-3.13/setuptools/lib.linux-x86_64-cpython-313/fbgemm_gpu/sll/triton 2025-05-07T20:02:48.0626326Z INFO:root:copying _skbuild/linux-x86_64-3.13/cmake-install/fbgemm_gpu/sll/triton/triton_jagged_dense_flash_attention.py -> _skbuild/linux-x86_64-3.13/setuptools/lib.linux-x86_64-cpython-313/fbgemm_gpu/sll/triton 2025-05-07T20:02:48.0628145Z INFO:root:copying _skbuild/linux-x86_64-3.13/cmake-install/fbgemm_gpu/sll/triton/triton_jagged_flash_attention_basic.py -> _skbuild/linux-x86_64-3.13/setuptools/lib.linux-x86_64-cpython-313/fbgemm_gpu/sll/triton 2025-05-07T20:02:48.0629909Z INFO:root:copying _skbuild/linux-x86_64-3.13/cmake-install/fbgemm_gpu/sll/triton/triton_jagged_self_substraction_jagged_out.py -> _skbuild/linux-x86_64-3.13/setuptools/lib.linux-x86_64-cpython-313/fbgemm_gpu/sll/triton 2025-05-07T20:02:48.0631700Z INFO:root:copying 
_skbuild/linux-x86_64-3.13/cmake-install/fbgemm_gpu/sll/triton/triton_jagged_softmax.py -> _skbuild/linux-x86_64-3.13/setuptools/lib.linux-x86_64-cpython-313/fbgemm_gpu/sll/triton 2025-05-07T20:02:48.0633601Z INFO:root:copying _skbuild/linux-x86_64-3.13/cmake-install/fbgemm_gpu/sll/triton/triton_multi_head_jagged_flash_attention.py -> _skbuild/linux-x86_64-3.13/setuptools/lib.linux-x86_64-cpython-313/fbgemm_gpu/sll/triton 2025-05-07T20:02:48.0635026Z INFO:root:creating _skbuild/linux-x86_64-3.13/setuptools/lib.linux-x86_64-cpython-313/fbgemm_gpu/tbe/bench 2025-05-07T20:02:48.0636190Z INFO:root:copying _skbuild/linux-x86_64-3.13/cmake-install/fbgemm_gpu/tbe/bench/__init__.py -> _skbuild/linux-x86_64-3.13/setuptools/lib.linux-x86_64-cpython-313/fbgemm_gpu/tbe/bench 2025-05-07T20:02:48.0637799Z INFO:root:copying _skbuild/linux-x86_64-3.13/cmake-install/fbgemm_gpu/tbe/bench/bench_config.py -> _skbuild/linux-x86_64-3.13/setuptools/lib.linux-x86_64-cpython-313/fbgemm_gpu/tbe/bench 2025-05-07T20:02:48.0639367Z INFO:root:copying _skbuild/linux-x86_64-3.13/cmake-install/fbgemm_gpu/tbe/bench/bench_runs.py -> _skbuild/linux-x86_64-3.13/setuptools/lib.linux-x86_64-cpython-313/fbgemm_gpu/tbe/bench 2025-05-07T20:02:48.0640971Z INFO:root:copying _skbuild/linux-x86_64-3.13/cmake-install/fbgemm_gpu/tbe/bench/eeg_cli.py -> _skbuild/linux-x86_64-3.13/setuptools/lib.linux-x86_64-cpython-313/fbgemm_gpu/tbe/bench 2025-05-07T20:02:48.0642705Z INFO:root:copying _skbuild/linux-x86_64-3.13/cmake-install/fbgemm_gpu/tbe/bench/embedding_ops_common_config.py -> _skbuild/linux-x86_64-3.13/setuptools/lib.linux-x86_64-cpython-313/fbgemm_gpu/tbe/bench 2025-05-07T20:02:48.0644387Z INFO:root:copying _skbuild/linux-x86_64-3.13/cmake-install/fbgemm_gpu/tbe/bench/eval_compression.py -> _skbuild/linux-x86_64-3.13/setuptools/lib.linux-x86_64-cpython-313/fbgemm_gpu/tbe/bench 2025-05-07T20:02:48.0646018Z INFO:root:copying _skbuild/linux-x86_64-3.13/cmake-install/fbgemm_gpu/tbe/bench/reporter.py -> 
_skbuild/linux-x86_64-3.13/setuptools/lib.linux-x86_64-cpython-313/fbgemm_gpu/tbe/bench 2025-05-07T20:02:48.0647567Z INFO:root:copying _skbuild/linux-x86_64-3.13/cmake-install/fbgemm_gpu/tbe/bench/tbe_data_config.py -> _skbuild/linux-x86_64-3.13/setuptools/lib.linux-x86_64-cpython-313/fbgemm_gpu/tbe/bench 2025-05-07T20:02:48.0649249Z INFO:root:copying _skbuild/linux-x86_64-3.13/cmake-install/fbgemm_gpu/tbe/bench/tbe_data_config_loader.py -> _skbuild/linux-x86_64-3.13/setuptools/lib.linux-x86_64-cpython-313/fbgemm_gpu/tbe/bench 2025-05-07T20:02:48.0650951Z INFO:root:copying _skbuild/linux-x86_64-3.13/cmake-install/fbgemm_gpu/tbe/bench/tbe_data_config_param_models.py -> _skbuild/linux-x86_64-3.13/setuptools/lib.linux-x86_64-cpython-313/fbgemm_gpu/tbe/bench 2025-05-07T20:02:48.0652511Z INFO:root:copying _skbuild/linux-x86_64-3.13/cmake-install/fbgemm_gpu/tbe/bench/utils.py -> _skbuild/linux-x86_64-3.13/setuptools/lib.linux-x86_64-cpython-313/fbgemm_gpu/tbe/bench 2025-05-07T20:02:48.0653749Z INFO:root:creating _skbuild/linux-x86_64-3.13/setuptools/lib.linux-x86_64-cpython-313/fbgemm_gpu/tbe/cache 2025-05-07T20:02:48.0654946Z INFO:root:copying _skbuild/linux-x86_64-3.13/cmake-install/fbgemm_gpu/tbe/cache/__init__.py -> _skbuild/linux-x86_64-3.13/setuptools/lib.linux-x86_64-cpython-313/fbgemm_gpu/tbe/cache 2025-05-07T20:02:48.0656627Z INFO:root:copying _skbuild/linux-x86_64-3.13/cmake-install/fbgemm_gpu/tbe/cache/split_embeddings_cache_ops.py -> _skbuild/linux-x86_64-3.13/setuptools/lib.linux-x86_64-cpython-313/fbgemm_gpu/tbe/cache 2025-05-07T20:02:48.0657925Z INFO:root:creating _skbuild/linux-x86_64-3.13/setuptools/lib.linux-x86_64-cpython-313/fbgemm_gpu/tbe/ssd 2025-05-07T20:02:48.0659044Z INFO:root:copying _skbuild/linux-x86_64-3.13/cmake-install/fbgemm_gpu/tbe/ssd/__init__.py -> _skbuild/linux-x86_64-3.13/setuptools/lib.linux-x86_64-cpython-313/fbgemm_gpu/tbe/ssd 2025-05-07T20:02:48.0660508Z INFO:root:copying 
_skbuild/linux-x86_64-3.13/cmake-install/fbgemm_gpu/tbe/ssd/common.py -> _skbuild/linux-x86_64-3.13/setuptools/lib.linux-x86_64-cpython-313/fbgemm_gpu/tbe/ssd 2025-05-07T20:02:48.0662069Z INFO:root:copying _skbuild/linux-x86_64-3.13/cmake-install/fbgemm_gpu/tbe/ssd/inference.py -> _skbuild/linux-x86_64-3.13/setuptools/lib.linux-x86_64-cpython-313/fbgemm_gpu/tbe/ssd 2025-05-07T20:02:48.0663650Z INFO:root:copying _skbuild/linux-x86_64-3.13/cmake-install/fbgemm_gpu/tbe/ssd/training.py -> _skbuild/linux-x86_64-3.13/setuptools/lib.linux-x86_64-cpython-313/fbgemm_gpu/tbe/ssd 2025-05-07T20:02:48.0664868Z INFO:root:creating _skbuild/linux-x86_64-3.13/setuptools/lib.linux-x86_64-cpython-313/fbgemm_gpu/tbe/utils 2025-05-07T20:02:48.0666081Z INFO:root:copying _skbuild/linux-x86_64-3.13/cmake-install/fbgemm_gpu/tbe/utils/__init__.py -> _skbuild/linux-x86_64-3.13/setuptools/lib.linux-x86_64-cpython-313/fbgemm_gpu/tbe/utils 2025-05-07T20:02:48.0667610Z INFO:root:copying _skbuild/linux-x86_64-3.13/cmake-install/fbgemm_gpu/tbe/utils/common.py -> _skbuild/linux-x86_64-3.13/setuptools/lib.linux-x86_64-cpython-313/fbgemm_gpu/tbe/utils 2025-05-07T20:02:48.0669092Z INFO:root:copying _skbuild/linux-x86_64-3.13/cmake-install/fbgemm_gpu/tbe/utils/offsets.py -> _skbuild/linux-x86_64-3.13/setuptools/lib.linux-x86_64-cpython-313/fbgemm_gpu/tbe/utils 2025-05-07T20:02:48.0670742Z INFO:root:copying _skbuild/linux-x86_64-3.13/cmake-install/fbgemm_gpu/tbe/utils/quantize.py -> _skbuild/linux-x86_64-3.13/setuptools/lib.linux-x86_64-cpython-313/fbgemm_gpu/tbe/utils 2025-05-07T20:02:48.0672343Z INFO:root:copying _skbuild/linux-x86_64-3.13/cmake-install/fbgemm_gpu/tbe/utils/requests.py -> _skbuild/linux-x86_64-3.13/setuptools/lib.linux-x86_64-cpython-313/fbgemm_gpu/tbe/utils 2025-05-07T20:02:48.0673625Z INFO:root:creating _skbuild/linux-x86_64-3.13/setuptools/lib.linux-x86_64-cpython-313/fbgemm_gpu/tbe/stats 2025-05-07T20:02:48.0674949Z INFO:root:copying 
_skbuild/linux-x86_64-3.13/cmake-install/fbgemm_gpu/tbe/stats/__init__.py -> _skbuild/linux-x86_64-3.13/setuptools/lib.linux-x86_64-cpython-313/fbgemm_gpu/tbe/stats 2025-05-07T20:02:48.0676565Z INFO:root:copying _skbuild/linux-x86_64-3.13/cmake-install/fbgemm_gpu/tbe/stats/bench_params_reporter.py -> _skbuild/linux-x86_64-3.13/setuptools/lib.linux-x86_64-cpython-313/fbgemm_gpu/tbe/stats 2025-05-07T20:02:48.0677859Z INFO:root:creating _skbuild/linux-x86_64-3.13/setuptools/lib.linux-x86_64-cpython-313/fbgemm_gpu/tbe/ssd/utils 2025-05-07T20:02:48.0679166Z INFO:root:copying _skbuild/linux-x86_64-3.13/cmake-install/fbgemm_gpu/tbe/ssd/utils/__init__.py -> _skbuild/linux-x86_64-3.13/setuptools/lib.linux-x86_64-cpython-313/fbgemm_gpu/tbe/ssd/utils 2025-05-07T20:02:48.0680871Z INFO:root:copying _skbuild/linux-x86_64-3.13/cmake-install/fbgemm_gpu/tbe/ssd/utils/partially_materialized_tensor.py -> _skbuild/linux-x86_64-3.13/setuptools/lib.linux-x86_64-cpython-313/fbgemm_gpu/tbe/ssd/utils 2025-05-07T20:02:48.0682167Z INFO:root:creating _skbuild/linux-x86_64-3.13/setuptools/lib.linux-x86_64-cpython-313/fbgemm_gpu/triton/jagged 2025-05-07T20:02:48.0683347Z INFO:root:copying _skbuild/linux-x86_64-3.13/cmake-install/fbgemm_gpu/triton/jagged/__init__.py -> _skbuild/linux-x86_64-3.13/setuptools/lib.linux-x86_64-cpython-313/fbgemm_gpu/triton/jagged 2025-05-07T20:02:48.0684936Z INFO:root:copying _skbuild/linux-x86_64-3.13/cmake-install/fbgemm_gpu/triton/jagged/triton_jagged_tensor_ops.py -> _skbuild/linux-x86_64-3.13/setuptools/lib.linux-x86_64-cpython-313/fbgemm_gpu/triton/jagged 2025-05-07T20:02:48.0718579Z INFO:root:copying _skbuild/linux-x86_64-3.13/cmake-install/fbgemm_gpu/asmjit.so -> _skbuild/linux-x86_64-3.13/setuptools/lib.linux-x86_64-cpython-313/fbgemm_gpu 2025-05-07T20:02:48.0764282Z INFO:root:copying _skbuild/linux-x86_64-3.13/cmake-install/fbgemm_gpu/fbgemm.so -> _skbuild/linux-x86_64-3.13/setuptools/lib.linux-x86_64-cpython-313/fbgemm_gpu 2025-05-07T20:02:48.1093766Z 
INFO:root:copying _skbuild/linux-x86_64-3.13/cmake-install/fbgemm_gpu/fbgemm_gpu_tbe_cache.so -> _skbuild/linux-x86_64-3.13/setuptools/lib.linux-x86_64-cpython-313/fbgemm_gpu 2025-05-07T20:02:48.1695720Z INFO:root:copying _skbuild/linux-x86_64-3.13/cmake-install/fbgemm_gpu/fbgemm_gpu_tbe_inference.so -> _skbuild/linux-x86_64-3.13/setuptools/lib.linux-x86_64-cpython-313/fbgemm_gpu 2025-05-07T20:02:49.8150751Z INFO:root:copying _skbuild/linux-x86_64-3.13/cmake-install/fbgemm_gpu/fbgemm_gpu_config.so -> _skbuild/linux-x86_64-3.13/setuptools/lib.linux-x86_64-cpython-313/fbgemm_gpu 2025-05-07T20:02:49.8154460Z INFO:root:copying _skbuild/linux-x86_64-3.13/cmake-install/fbgemm_gpu/fbgemm_gpu_tbe_utils.so -> _skbuild/linux-x86_64-3.13/setuptools/lib.linux-x86_64-cpython-313/fbgemm_gpu 2025-05-07T20:02:49.8780735Z INFO:root:copying _skbuild/linux-x86_64-3.13/cmake-install/fbgemm_gpu/fbgemm_gpu_sparse_async_cumsum.so -> _skbuild/linux-x86_64-3.13/setuptools/lib.linux-x86_64-cpython-313/fbgemm_gpu 2025-05-07T20:02:49.8839655Z INFO:root:copying _skbuild/linux-x86_64-3.13/cmake-install/fbgemm_gpu/fbgemm_gpu_tbe_common.so -> _skbuild/linux-x86_64-3.13/setuptools/lib.linux-x86_64-cpython-313/fbgemm_gpu 2025-05-07T20:02:49.8955745Z INFO:root:copying _skbuild/linux-x86_64-3.13/cmake-install/fbgemm_gpu/fbgemm_gpu_tbe_optimizers.so -> _skbuild/linux-x86_64-3.13/setuptools/lib.linux-x86_64-cpython-313/fbgemm_gpu 2025-05-07T20:02:49.9312101Z INFO:root:copying _skbuild/linux-x86_64-3.13/cmake-install/fbgemm_gpu/fbgemm_gpu_tbe_training_forward.so -> _skbuild/linux-x86_64-3.13/setuptools/lib.linux-x86_64-cpython-313/fbgemm_gpu 2025-05-07T20:02:51.4354872Z INFO:root:copying _skbuild/linux-x86_64-3.13/cmake-install/fbgemm_gpu/fbgemm_gpu_tbe_training_backward_pt2.so -> _skbuild/linux-x86_64-3.13/setuptools/lib.linux-x86_64-cpython-313/fbgemm_gpu 2025-05-07T20:02:51.5182692Z INFO:root:copying _skbuild/linux-x86_64-3.13/cmake-install/fbgemm_gpu/fbgemm_gpu_tbe_training_backward.so -> 
_skbuild/linux-x86_64-3.13/setuptools/lib.linux-x86_64-cpython-313/fbgemm_gpu 2025-05-07T20:02:55.4465937Z INFO:root:copying _skbuild/linux-x86_64-3.13/cmake-install/fbgemm_gpu/fbgemm_gpu_tbe_training_backward_gwd.so -> _skbuild/linux-x86_64-3.13/setuptools/lib.linux-x86_64-cpython-313/fbgemm_gpu 2025-05-07T20:02:56.0535291Z INFO:root:copying _skbuild/linux-x86_64-3.13/cmake-install/fbgemm_gpu/fbgemm_gpu_tbe_training_backward_vbe.so -> _skbuild/linux-x86_64-3.13/setuptools/lib.linux-x86_64-cpython-313/fbgemm_gpu 2025-05-07T20:02:57.4512310Z INFO:root:copying _skbuild/linux-x86_64-3.13/cmake-install/fbgemm_gpu/fbgemm_gpu_tbe_training_backward_dense.so -> _skbuild/linux-x86_64-3.13/setuptools/lib.linux-x86_64-cpython-313/fbgemm_gpu 2025-05-07T20:02:57.6916157Z INFO:root:copying _skbuild/linux-x86_64-3.13/cmake-install/fbgemm_gpu/fbgemm_gpu_tbe_training_backward_split_host.so -> _skbuild/linux-x86_64-3.13/setuptools/lib.linux-x86_64-cpython-313/fbgemm_gpu 2025-05-07T20:02:57.7283646Z INFO:root:copying _skbuild/linux-x86_64-3.13/cmake-install/fbgemm_gpu/fbgemm_gpu_tbe_index_select.so -> _skbuild/linux-x86_64-3.13/setuptools/lib.linux-x86_64-cpython-313/fbgemm_gpu 2025-05-07T20:02:57.8702250Z INFO:root:creating _skbuild/linux-x86_64-3.13/setuptools/lib.linux-x86_64-cpython-313/fbgemm_gpu/split_embedding_codegen_lookup_invokers 2025-05-07T20:02:57.8703907Z INFO:root:copying _skbuild/linux-x86_64-3.13/cmake-install/fbgemm_gpu/split_embedding_codegen_lookup_invokers/lookup_adagrad.py -> _skbuild/linux-x86_64-3.13/setuptools/lib.linux-x86_64-cpython-313/fbgemm_gpu/split_embedding_codegen_lookup_invokers 2025-05-07T20:02:57.8705976Z INFO:root:copying _skbuild/linux-x86_64-3.13/cmake-install/fbgemm_gpu/split_embedding_codegen_lookup_invokers/lookup_rowwise_adagrad.py -> _skbuild/linux-x86_64-3.13/setuptools/lib.linux-x86_64-cpython-313/fbgemm_gpu/split_embedding_codegen_lookup_invokers 2025-05-07T20:02:57.8712957Z INFO:root:copying 
_skbuild/linux-x86_64-3.13/cmake-install/fbgemm_gpu/split_embedding_codegen_lookup_invokers/lookup_sgd.py -> _skbuild/linux-x86_64-3.13/setuptools/lib.linux-x86_64-cpython-313/fbgemm_gpu/split_embedding_codegen_lookup_invokers 2025-05-07T20:02:57.8719035Z INFO:root:copying _skbuild/linux-x86_64-3.13/cmake-install/fbgemm_gpu/split_embedding_codegen_lookup_invokers/lookup_adam.py -> _skbuild/linux-x86_64-3.13/setuptools/lib.linux-x86_64-cpython-313/fbgemm_gpu/split_embedding_codegen_lookup_invokers 2025-05-07T20:02:57.8725238Z INFO:root:copying _skbuild/linux-x86_64-3.13/cmake-install/fbgemm_gpu/split_embedding_codegen_lookup_invokers/lookup_lamb.py -> _skbuild/linux-x86_64-3.13/setuptools/lib.linux-x86_64-cpython-313/fbgemm_gpu/split_embedding_codegen_lookup_invokers 2025-05-07T20:02:57.8731082Z INFO:root:copying _skbuild/linux-x86_64-3.13/cmake-install/fbgemm_gpu/split_embedding_codegen_lookup_invokers/lookup_lars_sgd.py -> _skbuild/linux-x86_64-3.13/setuptools/lib.linux-x86_64-cpython-313/fbgemm_gpu/split_embedding_codegen_lookup_invokers 2025-05-07T20:02:57.8737073Z INFO:root:copying _skbuild/linux-x86_64-3.13/cmake-install/fbgemm_gpu/split_embedding_codegen_lookup_invokers/lookup_partial_rowwise_adam.py -> _skbuild/linux-x86_64-3.13/setuptools/lib.linux-x86_64-cpython-313/fbgemm_gpu/split_embedding_codegen_lookup_invokers 2025-05-07T20:02:57.8743039Z INFO:root:copying _skbuild/linux-x86_64-3.13/cmake-install/fbgemm_gpu/split_embedding_codegen_lookup_invokers/lookup_partial_rowwise_lamb.py -> _skbuild/linux-x86_64-3.13/setuptools/lib.linux-x86_64-cpython-313/fbgemm_gpu/split_embedding_codegen_lookup_invokers 2025-05-07T20:02:57.8748230Z INFO:root:copying _skbuild/linux-x86_64-3.13/cmake-install/fbgemm_gpu/split_embedding_codegen_lookup_invokers/lookup_none.py -> _skbuild/linux-x86_64-3.13/setuptools/lib.linux-x86_64-cpython-313/fbgemm_gpu/split_embedding_codegen_lookup_invokers 2025-05-07T20:02:57.8754462Z INFO:root:copying 
_skbuild/linux-x86_64-3.13/cmake-install/fbgemm_gpu/split_embedding_codegen_lookup_invokers/lookup_rowwise_adagrad_with_counter.py -> _skbuild/linux-x86_64-3.13/setuptools/lib.linux-x86_64-cpython-313/fbgemm_gpu/split_embedding_codegen_lookup_invokers 2025-05-07T20:02:57.8759595Z INFO:root:copying _skbuild/linux-x86_64-3.13/cmake-install/fbgemm_gpu/split_embedding_codegen_lookup_invokers/lookup_rowwise_adagrad_ssd.py -> _skbuild/linux-x86_64-3.13/setuptools/lib.linux-x86_64-cpython-313/fbgemm_gpu/split_embedding_codegen_lookup_invokers 2025-05-07T20:02:57.8765148Z INFO:root:copying _skbuild/linux-x86_64-3.13/cmake-install/fbgemm_gpu/split_embedding_codegen_lookup_invokers/__init__.py -> _skbuild/linux-x86_64-3.13/setuptools/lib.linux-x86_64-cpython-313/fbgemm_gpu/split_embedding_codegen_lookup_invokers 2025-05-07T20:02:57.8775907Z INFO:root:copying _skbuild/linux-x86_64-3.13/cmake-install/fbgemm_gpu/split_embedding_codegen_lookup_invokers/lookup_args.py -> _skbuild/linux-x86_64-3.13/setuptools/lib.linux-x86_64-cpython-313/fbgemm_gpu/split_embedding_codegen_lookup_invokers 2025-05-07T20:02:57.8781371Z INFO:root:copying _skbuild/linux-x86_64-3.13/cmake-install/fbgemm_gpu/split_embedding_codegen_lookup_invokers/lookup_args_ssd.py -> _skbuild/linux-x86_64-3.13/setuptools/lib.linux-x86_64-cpython-313/fbgemm_gpu/split_embedding_codegen_lookup_invokers 2025-05-07T20:02:57.8785584Z INFO:root:creating _skbuild/linux-x86_64-3.13/setuptools/lib.linux-x86_64-cpython-313/fbgemm_gpu/split_embedding_optimizer_codegen 2025-05-07T20:02:57.8787133Z INFO:root:copying _skbuild/linux-x86_64-3.13/cmake-install/fbgemm_gpu/split_embedding_optimizer_codegen/split_embedding_optimizer_rowwise_adagrad.py -> _skbuild/linux-x86_64-3.13/setuptools/lib.linux-x86_64-cpython-313/fbgemm_gpu/split_embedding_optimizer_codegen 2025-05-07T20:02:57.8791622Z INFO:root:copying _skbuild/linux-x86_64-3.13/cmake-install/fbgemm_gpu/split_embedding_optimizer_codegen/optimizer_args.py -> 
_skbuild/linux-x86_64-3.13/setuptools/lib.linux-x86_64-cpython-313/fbgemm_gpu/split_embedding_optimizer_codegen 2025-05-07T20:02:57.8800617Z INFO:root:copying _skbuild/linux-x86_64-3.13/cmake-install/fbgemm_gpu/fbgemm_gpu_embedding_inplace_ops.so -> _skbuild/linux-x86_64-3.13/setuptools/lib.linux-x86_64-cpython-313/fbgemm_gpu 2025-05-07T20:02:57.8826781Z INFO:root:copying _skbuild/linux-x86_64-3.13/cmake-install/fbgemm_gpu/fbgemm_gpu_py.so -> _skbuild/linux-x86_64-3.13/setuptools/lib.linux-x86_64-cpython-313/fbgemm_gpu 2025-05-07T20:02:58.1877051Z INFO:root:copying _skbuild/linux-x86_64-3.13/cmake-install/fbgemm_gpu/__init__.py -> _skbuild/linux-x86_64-3.13/setuptools/lib.linux-x86_64-cpython-313/fbgemm_gpu 2025-05-07T20:02:58.1878615Z INFO:root:copying _skbuild/linux-x86_64-3.13/cmake-install/fbgemm_gpu/batched_unary_embeddings_ops.py -> _skbuild/linux-x86_64-3.13/setuptools/lib.linux-x86_64-cpython-313/fbgemm_gpu 2025-05-07T20:02:58.1879933Z INFO:root:copying _skbuild/linux-x86_64-3.13/cmake-install/fbgemm_gpu/enums.py -> _skbuild/linux-x86_64-3.13/setuptools/lib.linux-x86_64-cpython-313/fbgemm_gpu 2025-05-07T20:02:58.1881182Z INFO:root:copying _skbuild/linux-x86_64-3.13/cmake-install/fbgemm_gpu/metrics.py -> _skbuild/linux-x86_64-3.13/setuptools/lib.linux-x86_64-cpython-313/fbgemm_gpu 2025-05-07T20:02:58.1882533Z INFO:root:copying _skbuild/linux-x86_64-3.13/cmake-install/fbgemm_gpu/permute_pooled_embedding_modules.py -> _skbuild/linux-x86_64-3.13/setuptools/lib.linux-x86_64-cpython-313/fbgemm_gpu 2025-05-07T20:02:58.1884010Z INFO:root:copying _skbuild/linux-x86_64-3.13/cmake-install/fbgemm_gpu/permute_pooled_embedding_modules_split.py -> _skbuild/linux-x86_64-3.13/setuptools/lib.linux-x86_64-cpython-313/fbgemm_gpu 2025-05-07T20:02:58.1885499Z INFO:root:copying _skbuild/linux-x86_64-3.13/cmake-install/fbgemm_gpu/quantize_comm.py -> _skbuild/linux-x86_64-3.13/setuptools/lib.linux-x86_64-cpython-313/fbgemm_gpu 2025-05-07T20:02:58.1886799Z INFO:root:copying 
_skbuild/linux-x86_64-3.13/cmake-install/fbgemm_gpu/quantize_utils.py -> _skbuild/linux-x86_64-3.13/setuptools/lib.linux-x86_64-cpython-313/fbgemm_gpu 2025-05-07T20:02:58.1888182Z INFO:root:copying _skbuild/linux-x86_64-3.13/cmake-install/fbgemm_gpu/runtime_monitor.py -> _skbuild/linux-x86_64-3.13/setuptools/lib.linux-x86_64-cpython-313/fbgemm_gpu 2025-05-07T20:02:58.1889483Z INFO:root:copying _skbuild/linux-x86_64-3.13/cmake-install/fbgemm_gpu/sparse_ops.py -> _skbuild/linux-x86_64-3.13/setuptools/lib.linux-x86_64-cpython-313/fbgemm_gpu 2025-05-07T20:02:58.1891796Z INFO:root:copying _skbuild/linux-x86_64-3.13/cmake-install/fbgemm_gpu/split_embedding_configs.py -> _skbuild/linux-x86_64-3.13/setuptools/lib.linux-x86_64-cpython-313/fbgemm_gpu 2025-05-07T20:02:58.1893230Z INFO:root:copying _skbuild/linux-x86_64-3.13/cmake-install/fbgemm_gpu/split_embedding_inference_converter.py -> _skbuild/linux-x86_64-3.13/setuptools/lib.linux-x86_64-cpython-313/fbgemm_gpu 2025-05-07T20:02:58.1894701Z INFO:root:copying _skbuild/linux-x86_64-3.13/cmake-install/fbgemm_gpu/split_embedding_optimizer_ops.py -> _skbuild/linux-x86_64-3.13/setuptools/lib.linux-x86_64-cpython-313/fbgemm_gpu 2025-05-07T20:02:58.1896103Z INFO:root:copying _skbuild/linux-x86_64-3.13/cmake-install/fbgemm_gpu/split_embedding_utils.py -> _skbuild/linux-x86_64-3.13/setuptools/lib.linux-x86_64-cpython-313/fbgemm_gpu 2025-05-07T20:02:58.1897502Z INFO:root:copying _skbuild/linux-x86_64-3.13/cmake-install/fbgemm_gpu/split_table_batched_embeddings_ops.py -> _skbuild/linux-x86_64-3.13/setuptools/lib.linux-x86_64-cpython-313/fbgemm_gpu 2025-05-07T20:02:58.1899283Z INFO:root:copying _skbuild/linux-x86_64-3.13/cmake-install/fbgemm_gpu/split_table_batched_embeddings_ops_common.py -> _skbuild/linux-x86_64-3.13/setuptools/lib.linux-x86_64-cpython-313/fbgemm_gpu 2025-05-07T20:02:58.1900866Z INFO:root:copying _skbuild/linux-x86_64-3.13/cmake-install/fbgemm_gpu/split_table_batched_embeddings_ops_inference.py -> 
_skbuild/linux-x86_64-3.13/setuptools/lib.linux-x86_64-cpython-313/fbgemm_gpu 2025-05-07T20:02:58.1902648Z INFO:root:copying _skbuild/linux-x86_64-3.13/cmake-install/fbgemm_gpu/split_table_batched_embeddings_ops_training.py -> _skbuild/linux-x86_64-3.13/setuptools/lib.linux-x86_64-cpython-313/fbgemm_gpu 2025-05-07T20:02:58.1906141Z INFO:root:copying _skbuild/linux-x86_64-3.13/cmake-install/fbgemm_gpu/split_table_batched_embeddings_ops_training_common.py -> _skbuild/linux-x86_64-3.13/setuptools/lib.linux-x86_64-cpython-313/fbgemm_gpu 2025-05-07T20:02:58.1907699Z INFO:root:copying _skbuild/linux-x86_64-3.13/cmake-install/fbgemm_gpu/ssd_split_table_batched_embeddings_ops.py -> _skbuild/linux-x86_64-3.13/setuptools/lib.linux-x86_64-cpython-313/fbgemm_gpu 2025-05-07T20:02:58.1909113Z INFO:root:copying _skbuild/linux-x86_64-3.13/cmake-install/fbgemm_gpu/tbe_input_multiplexer.py -> _skbuild/linux-x86_64-3.13/setuptools/lib.linux-x86_64-cpython-313/fbgemm_gpu 2025-05-07T20:02:58.1910533Z INFO:root:copying _skbuild/linux-x86_64-3.13/cmake-install/fbgemm_gpu/uvm.py -> _skbuild/linux-x86_64-3.13/setuptools/lib.linux-x86_64-cpython-313/fbgemm_gpu 2025-05-07T20:02:58.1911894Z INFO:root:copying _skbuild/linux-x86_64-3.13/cmake-install/fbgemm_gpu/config/__init__.py -> _skbuild/linux-x86_64-3.13/setuptools/lib.linux-x86_64-cpython-313/fbgemm_gpu/config 2025-05-07T20:02:58.1913453Z INFO:root:copying _skbuild/linux-x86_64-3.13/cmake-install/fbgemm_gpu/config/feature_list.py -> _skbuild/linux-x86_64-3.13/setuptools/lib.linux-x86_64-cpython-313/fbgemm_gpu/config 2025-05-07T20:02:58.1914974Z INFO:root:copying _skbuild/linux-x86_64-3.13/cmake-install/fbgemm_gpu/docs/__init__.py -> _skbuild/linux-x86_64-3.13/setuptools/lib.linux-x86_64-cpython-313/fbgemm_gpu/docs 2025-05-07T20:02:58.1916511Z INFO:root:copying _skbuild/linux-x86_64-3.13/cmake-install/fbgemm_gpu/docs/common.py -> _skbuild/linux-x86_64-3.13/setuptools/lib.linux-x86_64-cpython-313/fbgemm_gpu/docs 2025-05-07T20:02:58.1917951Z 
INFO:root:copying _skbuild/linux-x86_64-3.13/cmake-install/fbgemm_gpu/docs/examples.py -> _skbuild/linux-x86_64-3.13/setuptools/lib.linux-x86_64-cpython-313/fbgemm_gpu/docs 2025-05-07T20:02:58.1919433Z INFO:root:copying _skbuild/linux-x86_64-3.13/cmake-install/fbgemm_gpu/docs/jagged_tensor_ops.py -> _skbuild/linux-x86_64-3.13/setuptools/lib.linux-x86_64-cpython-313/fbgemm_gpu/docs 2025-05-07T20:02:58.1920893Z INFO:root:copying _skbuild/linux-x86_64-3.13/cmake-install/fbgemm_gpu/docs/merge_pooled_embedding_ops.py -> _skbuild/linux-x86_64-3.13/setuptools/lib.linux-x86_64-cpython-313/fbgemm_gpu/docs 2025-05-07T20:02:58.1922881Z INFO:root:copying _skbuild/linux-x86_64-3.13/cmake-install/fbgemm_gpu/docs/permute_pooled_embedding_ops.py -> _skbuild/linux-x86_64-3.13/setuptools/lib.linux-x86_64-cpython-313/fbgemm_gpu/docs 2025-05-07T20:02:58.1924526Z INFO:root:copying _skbuild/linux-x86_64-3.13/cmake-install/fbgemm_gpu/docs/quantize_ops.py -> _skbuild/linux-x86_64-3.13/setuptools/lib.linux-x86_64-cpython-313/fbgemm_gpu/docs 2025-05-07T20:02:58.1925953Z INFO:root:copying _skbuild/linux-x86_64-3.13/cmake-install/fbgemm_gpu/docs/sparse_ops.py -> _skbuild/linux-x86_64-3.13/setuptools/lib.linux-x86_64-cpython-313/fbgemm_gpu/docs 2025-05-07T20:02:58.1927548Z INFO:root:copying _skbuild/linux-x86_64-3.13/cmake-install/fbgemm_gpu/docs/version.py -> _skbuild/linux-x86_64-3.13/setuptools/lib.linux-x86_64-cpython-313/fbgemm_gpu/docs 2025-05-07T20:02:58.1929018Z INFO:root:copying _skbuild/linux-x86_64-3.13/cmake-install/fbgemm_gpu/quantize/__init__.py -> _skbuild/linux-x86_64-3.13/setuptools/lib.linux-x86_64-cpython-313/fbgemm_gpu/quantize 2025-05-07T20:02:58.1932372Z INFO:root:copying _skbuild/linux-x86_64-3.13/cmake-install/fbgemm_gpu/quantize/quantize_ops.py -> _skbuild/linux-x86_64-3.13/setuptools/lib.linux-x86_64-cpython-313/fbgemm_gpu/quantize 2025-05-07T20:02:58.1933835Z INFO:root:copying _skbuild/linux-x86_64-3.13/cmake-install/fbgemm_gpu/sll/__init__.py -> 
_skbuild/linux-x86_64-3.13/setuptools/lib.linux-x86_64-cpython-313/fbgemm_gpu/sll 2025-05-07T20:02:58.1935594Z INFO:root:copying _skbuild/linux-x86_64-3.13/cmake-install/fbgemm_gpu/tbe/__init__.py -> _skbuild/linux-x86_64-3.13/setuptools/lib.linux-x86_64-cpython-313/fbgemm_gpu/tbe 2025-05-07T20:02:58.1937114Z INFO:root:copying _skbuild/linux-x86_64-3.13/cmake-install/fbgemm_gpu/triton/__init__.py -> _skbuild/linux-x86_64-3.13/setuptools/lib.linux-x86_64-cpython-313/fbgemm_gpu/triton 2025-05-07T20:02:58.1938574Z INFO:root:copying _skbuild/linux-x86_64-3.13/cmake-install/fbgemm_gpu/triton/common.py -> _skbuild/linux-x86_64-3.13/setuptools/lib.linux-x86_64-cpython-313/fbgemm_gpu/triton 2025-05-07T20:02:58.1939957Z INFO:root:copying _skbuild/linux-x86_64-3.13/cmake-install/fbgemm_gpu/triton/quantize.py -> _skbuild/linux-x86_64-3.13/setuptools/lib.linux-x86_64-cpython-313/fbgemm_gpu/triton 2025-05-07T20:02:58.1941583Z INFO:root:copying _skbuild/linux-x86_64-3.13/cmake-install/fbgemm_gpu/triton/quantize_ref.py -> _skbuild/linux-x86_64-3.13/setuptools/lib.linux-x86_64-cpython-313/fbgemm_gpu/triton 2025-05-07T20:02:58.1943061Z INFO:root:copying _skbuild/linux-x86_64-3.13/cmake-install/fbgemm_gpu/utils/__init__.py -> _skbuild/linux-x86_64-3.13/setuptools/lib.linux-x86_64-cpython-313/fbgemm_gpu/utils 2025-05-07T20:02:58.1944512Z INFO:root:copying _skbuild/linux-x86_64-3.13/cmake-install/fbgemm_gpu/utils/filestore.py -> _skbuild/linux-x86_64-3.13/setuptools/lib.linux-x86_64-cpython-313/fbgemm_gpu/utils 2025-05-07T20:02:58.1945874Z INFO:root:copying _skbuild/linux-x86_64-3.13/cmake-install/fbgemm_gpu/utils/loader.py -> _skbuild/linux-x86_64-3.13/setuptools/lib.linux-x86_64-cpython-313/fbgemm_gpu/utils 2025-05-07T20:02:58.1947255Z INFO:root:copying _skbuild/linux-x86_64-3.13/cmake-install/fbgemm_gpu/utils/torch_library.py -> _skbuild/linux-x86_64-3.13/setuptools/lib.linux-x86_64-cpython-313/fbgemm_gpu/utils 2025-05-07T20:02:58.1948705Z INFO:root:copying 
_skbuild/linux-x86_64-3.13/cmake-install/fbgemm_gpu/sll/cpu/__init__.py -> _skbuild/linux-x86_64-3.13/setuptools/lib.linux-x86_64-cpython-313/fbgemm_gpu/sll/cpu 2025-05-07T20:02:58.1950110Z INFO:root:copying _skbuild/linux-x86_64-3.13/cmake-install/fbgemm_gpu/sll/cpu/cpu_sll.py -> _skbuild/linux-x86_64-3.13/setuptools/lib.linux-x86_64-cpython-313/fbgemm_gpu/sll/cpu 2025-05-07T20:02:58.1951802Z INFO:root:copying _skbuild/linux-x86_64-3.13/cmake-install/fbgemm_gpu/sll/meta/__init__.py -> _skbuild/linux-x86_64-3.13/setuptools/lib.linux-x86_64-cpython-313/fbgemm_gpu/sll/meta 2025-05-07T20:02:58.1953305Z INFO:root:copying _skbuild/linux-x86_64-3.13/cmake-install/fbgemm_gpu/sll/meta/meta_sll.py -> _skbuild/linux-x86_64-3.13/setuptools/lib.linux-x86_64-cpython-313/fbgemm_gpu/sll/meta 2025-05-07T20:02:58.1955037Z INFO:root:copying _skbuild/linux-x86_64-3.13/cmake-install/fbgemm_gpu/sll/triton/__init__.py -> _skbuild/linux-x86_64-3.13/setuptools/lib.linux-x86_64-cpython-313/fbgemm_gpu/sll/triton 2025-05-07T20:02:58.1956541Z INFO:root:copying _skbuild/linux-x86_64-3.13/cmake-install/fbgemm_gpu/sll/triton/common.py -> _skbuild/linux-x86_64-3.13/setuptools/lib.linux-x86_64-cpython-313/fbgemm_gpu/sll/triton 2025-05-07T20:02:58.1958193Z INFO:root:copying _skbuild/linux-x86_64-3.13/cmake-install/fbgemm_gpu/sll/triton/triton_dense_jagged_cat_jagged_out.py -> _skbuild/linux-x86_64-3.13/setuptools/lib.linux-x86_64-cpython-313/fbgemm_gpu/sll/triton 2025-05-07T20:02:58.1959827Z INFO:root:copying _skbuild/linux-x86_64-3.13/cmake-install/fbgemm_gpu/sll/triton/triton_jagged2_to_padded_dense.py -> _skbuild/linux-x86_64-3.13/setuptools/lib.linux-x86_64-cpython-313/fbgemm_gpu/sll/triton 2025-05-07T20:02:58.1961388Z INFO:root:copying _skbuild/linux-x86_64-3.13/cmake-install/fbgemm_gpu/sll/triton/triton_jagged_bmm.py -> _skbuild/linux-x86_64-3.13/setuptools/lib.linux-x86_64-cpython-313/fbgemm_gpu/sll/triton 2025-05-07T20:02:58.1962943Z INFO:root:copying 
_skbuild/linux-x86_64-3.13/cmake-install/fbgemm_gpu/sll/triton/triton_jagged_bmm_jagged_out.py -> _skbuild/linux-x86_64-3.13/setuptools/lib.linux-x86_64-cpython-313/fbgemm_gpu/sll/triton 2025-05-07T20:02:58.1964593Z INFO:root:copying _skbuild/linux-x86_64-3.13/cmake-install/fbgemm_gpu/sll/triton/triton_jagged_dense_elementwise_add.py -> _skbuild/linux-x86_64-3.13/setuptools/lib.linux-x86_64-cpython-313/fbgemm_gpu/sll/triton 2025-05-07T20:02:58.1966303Z INFO:root:copying _skbuild/linux-x86_64-3.13/cmake-install/fbgemm_gpu/sll/triton/triton_jagged_dense_elementwise_mul_jagged_out.py -> _skbuild/linux-x86_64-3.13/setuptools/lib.linux-x86_64-cpython-313/fbgemm_gpu/sll/triton 2025-05-07T20:02:58.1968023Z INFO:root:copying _skbuild/linux-x86_64-3.13/cmake-install/fbgemm_gpu/sll/triton/triton_jagged_dense_flash_attention.py -> _skbuild/linux-x86_64-3.13/setuptools/lib.linux-x86_64-cpython-313/fbgemm_gpu/sll/triton 2025-05-07T20:02:58.1969687Z INFO:root:copying _skbuild/linux-x86_64-3.13/cmake-install/fbgemm_gpu/sll/triton/triton_jagged_flash_attention_basic.py -> _skbuild/linux-x86_64-3.13/setuptools/lib.linux-x86_64-cpython-313/fbgemm_gpu/sll/triton 2025-05-07T20:02:58.1971442Z INFO:root:copying _skbuild/linux-x86_64-3.13/cmake-install/fbgemm_gpu/sll/triton/triton_jagged_self_substraction_jagged_out.py -> _skbuild/linux-x86_64-3.13/setuptools/lib.linux-x86_64-cpython-313/fbgemm_gpu/sll/triton 2025-05-07T20:02:58.1973082Z INFO:root:copying _skbuild/linux-x86_64-3.13/cmake-install/fbgemm_gpu/sll/triton/triton_jagged_softmax.py -> _skbuild/linux-x86_64-3.13/setuptools/lib.linux-x86_64-cpython-313/fbgemm_gpu/sll/triton 2025-05-07T20:02:58.1974719Z INFO:root:copying _skbuild/linux-x86_64-3.13/cmake-install/fbgemm_gpu/sll/triton/triton_multi_head_jagged_flash_attention.py -> _skbuild/linux-x86_64-3.13/setuptools/lib.linux-x86_64-cpython-313/fbgemm_gpu/sll/triton 2025-05-07T20:02:58.1976281Z INFO:root:copying 
_skbuild/linux-x86_64-3.13/cmake-install/fbgemm_gpu/tbe/bench/__init__.py -> _skbuild/linux-x86_64-3.13/setuptools/lib.linux-x86_64-cpython-313/fbgemm_gpu/tbe/bench 2025-05-07T20:02:58.1977701Z INFO:root:copying _skbuild/linux-x86_64-3.13/cmake-install/fbgemm_gpu/tbe/bench/bench_config.py -> _skbuild/linux-x86_64-3.13/setuptools/lib.linux-x86_64-cpython-313/fbgemm_gpu/tbe/bench 2025-05-07T20:02:58.1979197Z INFO:root:copying _skbuild/linux-x86_64-3.13/cmake-install/fbgemm_gpu/tbe/bench/bench_runs.py -> _skbuild/linux-x86_64-3.13/setuptools/lib.linux-x86_64-cpython-313/fbgemm_gpu/tbe/bench 2025-05-07T20:02:58.1980658Z INFO:root:copying _skbuild/linux-x86_64-3.13/cmake-install/fbgemm_gpu/tbe/bench/eeg_cli.py -> _skbuild/linux-x86_64-3.13/setuptools/lib.linux-x86_64-cpython-313/fbgemm_gpu/tbe/bench 2025-05-07T20:02:58.1982206Z INFO:root:copying _skbuild/linux-x86_64-3.13/cmake-install/fbgemm_gpu/tbe/bench/embedding_ops_common_config.py -> _skbuild/linux-x86_64-3.13/setuptools/lib.linux-x86_64-cpython-313/fbgemm_gpu/tbe/bench 2025-05-07T20:02:58.1983763Z INFO:root:copying _skbuild/linux-x86_64-3.13/cmake-install/fbgemm_gpu/tbe/bench/eval_compression.py -> _skbuild/linux-x86_64-3.13/setuptools/lib.linux-x86_64-cpython-313/fbgemm_gpu/tbe/bench 2025-05-07T20:02:58.1985666Z INFO:root:copying _skbuild/linux-x86_64-3.13/cmake-install/fbgemm_gpu/tbe/bench/reporter.py -> _skbuild/linux-x86_64-3.13/setuptools/lib.linux-x86_64-cpython-313/fbgemm_gpu/tbe/bench 2025-05-07T20:02:58.1987215Z INFO:root:copying _skbuild/linux-x86_64-3.13/cmake-install/fbgemm_gpu/tbe/bench/tbe_data_config.py -> _skbuild/linux-x86_64-3.13/setuptools/lib.linux-x86_64-cpython-313/fbgemm_gpu/tbe/bench 2025-05-07T20:02:58.1988738Z INFO:root:copying _skbuild/linux-x86_64-3.13/cmake-install/fbgemm_gpu/tbe/bench/tbe_data_config_loader.py -> _skbuild/linux-x86_64-3.13/setuptools/lib.linux-x86_64-cpython-313/fbgemm_gpu/tbe/bench 2025-05-07T20:02:58.1990286Z INFO:root:copying 
_skbuild/linux-x86_64-3.13/cmake-install/fbgemm_gpu/tbe/bench/tbe_data_config_param_models.py -> _skbuild/linux-x86_64-3.13/setuptools/lib.linux-x86_64-cpython-313/fbgemm_gpu/tbe/bench 2025-05-07T20:02:58.1991787Z INFO:root:copying _skbuild/linux-x86_64-3.13/cmake-install/fbgemm_gpu/tbe/bench/utils.py -> _skbuild/linux-x86_64-3.13/setuptools/lib.linux-x86_64-cpython-313/fbgemm_gpu/tbe/bench 2025-05-07T20:02:58.1993264Z INFO:root:copying _skbuild/linux-x86_64-3.13/cmake-install/fbgemm_gpu/tbe/cache/__init__.py -> _skbuild/linux-x86_64-3.13/setuptools/lib.linux-x86_64-cpython-313/fbgemm_gpu/tbe/cache 2025-05-07T20:02:58.1994750Z INFO:root:copying _skbuild/linux-x86_64-3.13/cmake-install/fbgemm_gpu/tbe/cache/split_embeddings_cache_ops.py -> _skbuild/linux-x86_64-3.13/setuptools/lib.linux-x86_64-cpython-313/fbgemm_gpu/tbe/cache 2025-05-07T20:02:58.1996233Z INFO:root:copying _skbuild/linux-x86_64-3.13/cmake-install/fbgemm_gpu/tbe/ssd/__init__.py -> _skbuild/linux-x86_64-3.13/setuptools/lib.linux-x86_64-cpython-313/fbgemm_gpu/tbe/ssd 2025-05-07T20:02:58.1997617Z INFO:root:copying _skbuild/linux-x86_64-3.13/cmake-install/fbgemm_gpu/tbe/ssd/common.py -> _skbuild/linux-x86_64-3.13/setuptools/lib.linux-x86_64-cpython-313/fbgemm_gpu/tbe/ssd 2025-05-07T20:02:58.1999014Z INFO:root:copying _skbuild/linux-x86_64-3.13/cmake-install/fbgemm_gpu/tbe/ssd/inference.py -> _skbuild/linux-x86_64-3.13/setuptools/lib.linux-x86_64-cpython-313/fbgemm_gpu/tbe/ssd 2025-05-07T20:02:58.2000601Z INFO:root:copying _skbuild/linux-x86_64-3.13/cmake-install/fbgemm_gpu/tbe/ssd/training.py -> _skbuild/linux-x86_64-3.13/setuptools/lib.linux-x86_64-cpython-313/fbgemm_gpu/tbe/ssd 2025-05-07T20:02:58.2003083Z INFO:root:copying _skbuild/linux-x86_64-3.13/cmake-install/fbgemm_gpu/tbe/utils/__init__.py -> _skbuild/linux-x86_64-3.13/setuptools/lib.linux-x86_64-cpython-313/fbgemm_gpu/tbe/utils 2025-05-07T20:02:58.2004901Z INFO:root:copying _skbuild/linux-x86_64-3.13/cmake-install/fbgemm_gpu/tbe/utils/common.py 
-> _skbuild/linux-x86_64-3.13/setuptools/lib.linux-x86_64-cpython-313/fbgemm_gpu/tbe/utils 2025-05-07T20:02:58.2006331Z INFO:root:copying _skbuild/linux-x86_64-3.13/cmake-install/fbgemm_gpu/tbe/utils/offsets.py -> _skbuild/linux-x86_64-3.13/setuptools/lib.linux-x86_64-cpython-313/fbgemm_gpu/tbe/utils 2025-05-07T20:02:58.2007775Z INFO:root:copying _skbuild/linux-x86_64-3.13/cmake-install/fbgemm_gpu/tbe/utils/quantize.py -> _skbuild/linux-x86_64-3.13/setuptools/lib.linux-x86_64-cpython-313/fbgemm_gpu/tbe/utils 2025-05-07T20:02:58.2009518Z INFO:root:copying _skbuild/linux-x86_64-3.13/cmake-install/fbgemm_gpu/tbe/utils/requests.py -> _skbuild/linux-x86_64-3.13/setuptools/lib.linux-x86_64-cpython-313/fbgemm_gpu/tbe/utils 2025-05-07T20:02:58.2011114Z INFO:root:copying _skbuild/linux-x86_64-3.13/cmake-install/fbgemm_gpu/tbe/stats/__init__.py -> _skbuild/linux-x86_64-3.13/setuptools/lib.linux-x86_64-cpython-313/fbgemm_gpu/tbe/stats 2025-05-07T20:02:58.2012652Z INFO:root:copying _skbuild/linux-x86_64-3.13/cmake-install/fbgemm_gpu/tbe/stats/bench_params_reporter.py -> _skbuild/linux-x86_64-3.13/setuptools/lib.linux-x86_64-cpython-313/fbgemm_gpu/tbe/stats 2025-05-07T20:02:58.2014211Z INFO:root:copying _skbuild/linux-x86_64-3.13/cmake-install/fbgemm_gpu/tbe/ssd/utils/__init__.py -> _skbuild/linux-x86_64-3.13/setuptools/lib.linux-x86_64-cpython-313/fbgemm_gpu/tbe/ssd/utils 2025-05-07T20:02:58.2015797Z INFO:root:copying _skbuild/linux-x86_64-3.13/cmake-install/fbgemm_gpu/tbe/ssd/utils/partially_materialized_tensor.py -> _skbuild/linux-x86_64-3.13/setuptools/lib.linux-x86_64-cpython-313/fbgemm_gpu/tbe/ssd/utils 2025-05-07T20:02:58.2017410Z INFO:root:copying _skbuild/linux-x86_64-3.13/cmake-install/fbgemm_gpu/triton/jagged/__init__.py -> _skbuild/linux-x86_64-3.13/setuptools/lib.linux-x86_64-cpython-313/fbgemm_gpu/triton/jagged 2025-05-07T20:02:58.2018977Z INFO:root:copying _skbuild/linux-x86_64-3.13/cmake-install/fbgemm_gpu/triton/jagged/triton_jagged_tensor_ops.py -> 
_skbuild/linux-x86_64-3.13/setuptools/lib.linux-x86_64-cpython-313/fbgemm_gpu/triton/jagged 2025-05-07T20:02:58.2030827Z INFO:skbuild:copied 90 files 2025-05-07T20:02:58.2031141Z INFO:root:running build_ext 2025-05-07T20:02:58.2033650Z INFO:root:installing to _skbuild/linux-x86_64-3.13/setuptools/bdist.linux-x86_64/wheel 2025-05-07T20:02:58.2034130Z INFO:root:running install 2025-05-07T20:02:58.2092464Z INFO:root:running install_lib 2025-05-07T20:02:58.2093032Z INFO:root:creating _skbuild/linux-x86_64-3.13/setuptools/bdist.linux-x86_64/wheel 2025-05-07T20:02:58.2093716Z INFO:root:creating _skbuild/linux-x86_64-3.13/setuptools/bdist.linux-x86_64/wheel/fbgemm_gpu 2025-05-07T20:02:58.2094455Z INFO:root:creating _skbuild/linux-x86_64-3.13/setuptools/bdist.linux-x86_64/wheel/fbgemm_gpu/config 2025-05-07T20:02:58.2095621Z INFO:root:copying _skbuild/linux-x86_64-3.13/setuptools/lib.linux-x86_64-cpython-313/fbgemm_gpu/config/__init__.py -> _skbuild/linux-x86_64-3.13/setuptools/bdist.linux-x86_64/wheel/./fbgemm_gpu/config 2025-05-07T20:02:58.2097287Z INFO:root:copying _skbuild/linux-x86_64-3.13/setuptools/lib.linux-x86_64-cpython-313/fbgemm_gpu/config/feature_list.py -> _skbuild/linux-x86_64-3.13/setuptools/bdist.linux-x86_64/wheel/./fbgemm_gpu/config 2025-05-07T20:02:58.2098471Z INFO:root:creating _skbuild/linux-x86_64-3.13/setuptools/bdist.linux-x86_64/wheel/fbgemm_gpu/docs 2025-05-07T20:02:58.2099595Z INFO:root:copying _skbuild/linux-x86_64-3.13/setuptools/lib.linux-x86_64-cpython-313/fbgemm_gpu/docs/__init__.py -> _skbuild/linux-x86_64-3.13/setuptools/bdist.linux-x86_64/wheel/./fbgemm_gpu/docs 2025-05-07T20:02:58.2101468Z INFO:root:copying _skbuild/linux-x86_64-3.13/setuptools/lib.linux-x86_64-cpython-313/fbgemm_gpu/docs/common.py -> _skbuild/linux-x86_64-3.13/setuptools/bdist.linux-x86_64/wheel/./fbgemm_gpu/docs 2025-05-07T20:02:58.2103184Z INFO:root:copying _skbuild/linux-x86_64-3.13/setuptools/lib.linux-x86_64-cpython-313/fbgemm_gpu/docs/examples.py -> 
_skbuild/linux-x86_64-3.13/setuptools/bdist.linux-x86_64/wheel/./fbgemm_gpu/docs 2025-05-07T20:02:58.2104761Z INFO:root:copying _skbuild/linux-x86_64-3.13/setuptools/lib.linux-x86_64-cpython-313/fbgemm_gpu/docs/jagged_tensor_ops.py -> _skbuild/linux-x86_64-3.13/setuptools/bdist.linux-x86_64/wheel/./fbgemm_gpu/docs 2025-05-07T20:02:58.2106408Z INFO:root:copying _skbuild/linux-x86_64-3.13/setuptools/lib.linux-x86_64-cpython-313/fbgemm_gpu/docs/merge_pooled_embedding_ops.py -> _skbuild/linux-x86_64-3.13/setuptools/bdist.linux-x86_64/wheel/./fbgemm_gpu/docs 2025-05-07T20:02:58.2108164Z INFO:root:copying _skbuild/linux-x86_64-3.13/setuptools/lib.linux-x86_64-cpython-313/fbgemm_gpu/docs/permute_pooled_embedding_ops.py -> _skbuild/linux-x86_64-3.13/setuptools/bdist.linux-x86_64/wheel/./fbgemm_gpu/docs 2025-05-07T20:02:58.2109831Z INFO:root:copying _skbuild/linux-x86_64-3.13/setuptools/lib.linux-x86_64-cpython-313/fbgemm_gpu/docs/quantize_ops.py -> _skbuild/linux-x86_64-3.13/setuptools/bdist.linux-x86_64/wheel/./fbgemm_gpu/docs 2025-05-07T20:02:58.2111404Z INFO:root:copying _skbuild/linux-x86_64-3.13/setuptools/lib.linux-x86_64-cpython-313/fbgemm_gpu/docs/sparse_ops.py -> _skbuild/linux-x86_64-3.13/setuptools/bdist.linux-x86_64/wheel/./fbgemm_gpu/docs 2025-05-07T20:02:58.2113071Z INFO:root:copying _skbuild/linux-x86_64-3.13/setuptools/lib.linux-x86_64-cpython-313/fbgemm_gpu/docs/version.py -> _skbuild/linux-x86_64-3.13/setuptools/bdist.linux-x86_64/wheel/./fbgemm_gpu/docs 2025-05-07T20:02:58.2114225Z INFO:root:creating _skbuild/linux-x86_64-3.13/setuptools/bdist.linux-x86_64/wheel/fbgemm_gpu/quantize 2025-05-07T20:02:58.2115425Z INFO:root:copying _skbuild/linux-x86_64-3.13/setuptools/lib.linux-x86_64-cpython-313/fbgemm_gpu/quantize/__init__.py -> _skbuild/linux-x86_64-3.13/setuptools/bdist.linux-x86_64/wheel/./fbgemm_gpu/quantize 2025-05-07T20:02:58.2117046Z INFO:root:copying 
_skbuild/linux-x86_64-3.13/setuptools/lib.linux-x86_64-cpython-313/fbgemm_gpu/quantize/quantize_ops.py -> _skbuild/linux-x86_64-3.13/setuptools/bdist.linux-x86_64/wheel/./fbgemm_gpu/quantize 2025-05-07T20:02:58.2118233Z INFO:root:creating _skbuild/linux-x86_64-3.13/setuptools/bdist.linux-x86_64/wheel/fbgemm_gpu/sll 2025-05-07T20:02:58.2119001Z INFO:root:creating _skbuild/linux-x86_64-3.13/setuptools/bdist.linux-x86_64/wheel/fbgemm_gpu/sll/cpu 2025-05-07T20:02:58.2120173Z INFO:root:copying _skbuild/linux-x86_64-3.13/setuptools/lib.linux-x86_64-cpython-313/fbgemm_gpu/sll/cpu/__init__.py -> _skbuild/linux-x86_64-3.13/setuptools/bdist.linux-x86_64/wheel/./fbgemm_gpu/sll/cpu 2025-05-07T20:02:58.2121739Z INFO:root:copying _skbuild/linux-x86_64-3.13/setuptools/lib.linux-x86_64-cpython-313/fbgemm_gpu/sll/cpu/cpu_sll.py -> _skbuild/linux-x86_64-3.13/setuptools/bdist.linux-x86_64/wheel/./fbgemm_gpu/sll/cpu 2025-05-07T20:02:58.2122917Z INFO:root:creating _skbuild/linux-x86_64-3.13/setuptools/bdist.linux-x86_64/wheel/fbgemm_gpu/sll/meta 2025-05-07T20:02:58.2124105Z INFO:root:copying _skbuild/linux-x86_64-3.13/setuptools/lib.linux-x86_64-cpython-313/fbgemm_gpu/sll/meta/__init__.py -> _skbuild/linux-x86_64-3.13/setuptools/bdist.linux-x86_64/wheel/./fbgemm_gpu/sll/meta 2025-05-07T20:02:58.2125668Z INFO:root:copying _skbuild/linux-x86_64-3.13/setuptools/lib.linux-x86_64-cpython-313/fbgemm_gpu/sll/meta/meta_sll.py -> _skbuild/linux-x86_64-3.13/setuptools/bdist.linux-x86_64/wheel/./fbgemm_gpu/sll/meta 2025-05-07T20:02:58.2126867Z INFO:root:creating _skbuild/linux-x86_64-3.13/setuptools/bdist.linux-x86_64/wheel/fbgemm_gpu/sll/triton 2025-05-07T20:02:58.2128125Z INFO:root:copying _skbuild/linux-x86_64-3.13/setuptools/lib.linux-x86_64-cpython-313/fbgemm_gpu/sll/triton/__init__.py -> _skbuild/linux-x86_64-3.13/setuptools/bdist.linux-x86_64/wheel/./fbgemm_gpu/sll/triton 2025-05-07T20:02:58.2129742Z INFO:root:copying 
_skbuild/linux-x86_64-3.13/setuptools/lib.linux-x86_64-cpython-313/fbgemm_gpu/sll/triton/common.py -> _skbuild/linux-x86_64-3.13/setuptools/bdist.linux-x86_64/wheel/./fbgemm_gpu/sll/triton 2025-05-07T20:02:58.2131498Z INFO:root:copying _skbuild/linux-x86_64-3.13/setuptools/lib.linux-x86_64-cpython-313/fbgemm_gpu/sll/triton/triton_dense_jagged_cat_jagged_out.py -> _skbuild/linux-x86_64-3.13/setuptools/bdist.linux-x86_64/wheel/./fbgemm_gpu/sll/triton 2025-05-07T20:02:58.2133344Z INFO:root:copying _skbuild/linux-x86_64-3.13/setuptools/lib.linux-x86_64-cpython-313/fbgemm_gpu/sll/triton/triton_jagged2_to_padded_dense.py -> _skbuild/linux-x86_64-3.13/setuptools/bdist.linux-x86_64/wheel/./fbgemm_gpu/sll/triton 2025-05-07T20:02:58.2135104Z INFO:root:copying _skbuild/linux-x86_64-3.13/setuptools/lib.linux-x86_64-cpython-313/fbgemm_gpu/sll/triton/triton_jagged_bmm.py -> _skbuild/linux-x86_64-3.13/setuptools/bdist.linux-x86_64/wheel/./fbgemm_gpu/sll/triton 2025-05-07T20:02:58.2136941Z INFO:root:copying _skbuild/linux-x86_64-3.13/setuptools/lib.linux-x86_64-cpython-313/fbgemm_gpu/sll/triton/triton_jagged_bmm_jagged_out.py -> _skbuild/linux-x86_64-3.13/setuptools/bdist.linux-x86_64/wheel/./fbgemm_gpu/sll/triton 2025-05-07T20:02:58.2138768Z INFO:root:copying _skbuild/linux-x86_64-3.13/setuptools/lib.linux-x86_64-cpython-313/fbgemm_gpu/sll/triton/triton_jagged_dense_elementwise_add.py -> _skbuild/linux-x86_64-3.13/setuptools/bdist.linux-x86_64/wheel/./fbgemm_gpu/sll/triton 2025-05-07T20:02:58.2140683Z INFO:root:copying _skbuild/linux-x86_64-3.13/setuptools/lib.linux-x86_64-cpython-313/fbgemm_gpu/sll/triton/triton_jagged_dense_elementwise_mul_jagged_out.py -> _skbuild/linux-x86_64-3.13/setuptools/bdist.linux-x86_64/wheel/./fbgemm_gpu/sll/triton 2025-05-07T20:02:58.2142594Z INFO:root:copying _skbuild/linux-x86_64-3.13/setuptools/lib.linux-x86_64-cpython-313/fbgemm_gpu/sll/triton/triton_jagged_dense_flash_attention.py -> 
_skbuild/linux-x86_64-3.13/setuptools/bdist.linux-x86_64/wheel/./fbgemm_gpu/sll/triton 2025-05-07T20:02:58.2144480Z INFO:root:copying _skbuild/linux-x86_64-3.13/setuptools/lib.linux-x86_64-cpython-313/fbgemm_gpu/sll/triton/triton_jagged_flash_attention_basic.py -> _skbuild/linux-x86_64-3.13/setuptools/bdist.linux-x86_64/wheel/./fbgemm_gpu/sll/triton 2025-05-07T20:02:58.2146375Z INFO:root:copying _skbuild/linux-x86_64-3.13/setuptools/lib.linux-x86_64-cpython-313/fbgemm_gpu/sll/triton/triton_jagged_self_substraction_jagged_out.py -> _skbuild/linux-x86_64-3.13/setuptools/bdist.linux-x86_64/wheel/./fbgemm_gpu/sll/triton 2025-05-07T20:02:58.2148221Z INFO:root:copying _skbuild/linux-x86_64-3.13/setuptools/lib.linux-x86_64-cpython-313/fbgemm_gpu/sll/triton/triton_jagged_softmax.py -> _skbuild/linux-x86_64-3.13/setuptools/bdist.linux-x86_64/wheel/./fbgemm_gpu/sll/triton 2025-05-07T20:02:58.2150069Z INFO:root:copying _skbuild/linux-x86_64-3.13/setuptools/lib.linux-x86_64-cpython-313/fbgemm_gpu/sll/triton/triton_multi_head_jagged_flash_attention.py -> _skbuild/linux-x86_64-3.13/setuptools/bdist.linux-x86_64/wheel/./fbgemm_gpu/sll/triton 2025-05-07T20:02:58.2151761Z INFO:root:copying _skbuild/linux-x86_64-3.13/setuptools/lib.linux-x86_64-cpython-313/fbgemm_gpu/sll/__init__.py -> _skbuild/linux-x86_64-3.13/setuptools/bdist.linux-x86_64/wheel/./fbgemm_gpu/sll 2025-05-07T20:02:58.2152921Z INFO:root:creating _skbuild/linux-x86_64-3.13/setuptools/bdist.linux-x86_64/wheel/fbgemm_gpu/tbe 2025-05-07T20:02:58.2153703Z INFO:root:creating _skbuild/linux-x86_64-3.13/setuptools/bdist.linux-x86_64/wheel/fbgemm_gpu/tbe/bench 2025-05-07T20:02:58.2154896Z INFO:root:copying _skbuild/linux-x86_64-3.13/setuptools/lib.linux-x86_64-cpython-313/fbgemm_gpu/tbe/bench/__init__.py -> _skbuild/linux-x86_64-3.13/setuptools/bdist.linux-x86_64/wheel/./fbgemm_gpu/tbe/bench 2025-05-07T20:02:58.2156536Z INFO:root:copying 
_skbuild/linux-x86_64-3.13/setuptools/lib.linux-x86_64-cpython-313/fbgemm_gpu/tbe/bench/bench_config.py -> _skbuild/linux-x86_64-3.13/setuptools/bdist.linux-x86_64/wheel/./fbgemm_gpu/tbe/bench 2025-05-07T20:02:58.2158185Z INFO:root:copying _skbuild/linux-x86_64-3.13/setuptools/lib.linux-x86_64-cpython-313/fbgemm_gpu/tbe/bench/bench_runs.py -> _skbuild/linux-x86_64-3.13/setuptools/bdist.linux-x86_64/wheel/./fbgemm_gpu/tbe/bench 2025-05-07T20:02:58.2159806Z INFO:root:copying _skbuild/linux-x86_64-3.13/setuptools/lib.linux-x86_64-cpython-313/fbgemm_gpu/tbe/bench/eeg_cli.py -> _skbuild/linux-x86_64-3.13/setuptools/bdist.linux-x86_64/wheel/./fbgemm_gpu/tbe/bench 2025-05-07T20:02:58.2161497Z INFO:root:copying _skbuild/linux-x86_64-3.13/setuptools/lib.linux-x86_64-cpython-313/fbgemm_gpu/tbe/bench/embedding_ops_common_config.py -> _skbuild/linux-x86_64-3.13/setuptools/bdist.linux-x86_64/wheel/./fbgemm_gpu/tbe/bench 2025-05-07T20:02:58.2163238Z INFO:root:copying _skbuild/linux-x86_64-3.13/setuptools/lib.linux-x86_64-cpython-313/fbgemm_gpu/tbe/bench/eval_compression.py -> _skbuild/linux-x86_64-3.13/setuptools/bdist.linux-x86_64/wheel/./fbgemm_gpu/tbe/bench 2025-05-07T20:02:58.2164930Z INFO:root:copying _skbuild/linux-x86_64-3.13/setuptools/lib.linux-x86_64-cpython-313/fbgemm_gpu/tbe/bench/reporter.py -> _skbuild/linux-x86_64-3.13/setuptools/bdist.linux-x86_64/wheel/./fbgemm_gpu/tbe/bench 2025-05-07T20:02:58.2166617Z INFO:root:copying _skbuild/linux-x86_64-3.13/setuptools/lib.linux-x86_64-cpython-313/fbgemm_gpu/tbe/bench/tbe_data_config.py -> _skbuild/linux-x86_64-3.13/setuptools/bdist.linux-x86_64/wheel/./fbgemm_gpu/tbe/bench 2025-05-07T20:02:58.2168342Z INFO:root:copying _skbuild/linux-x86_64-3.13/setuptools/lib.linux-x86_64-cpython-313/fbgemm_gpu/tbe/bench/tbe_data_config_loader.py -> _skbuild/linux-x86_64-3.13/setuptools/bdist.linux-x86_64/wheel/./fbgemm_gpu/tbe/bench 2025-05-07T20:02:58.2170089Z INFO:root:copying 
_skbuild/linux-x86_64-3.13/setuptools/lib.linux-x86_64-cpython-313/fbgemm_gpu/tbe/bench/tbe_data_config_param_models.py -> _skbuild/linux-x86_64-3.13/setuptools/bdist.linux-x86_64/wheel/./fbgemm_gpu/tbe/bench 2025-05-07T20:02:58.2171769Z INFO:root:copying _skbuild/linux-x86_64-3.13/setuptools/lib.linux-x86_64-cpython-313/fbgemm_gpu/tbe/bench/utils.py -> _skbuild/linux-x86_64-3.13/setuptools/bdist.linux-x86_64/wheel/./fbgemm_gpu/tbe/bench 2025-05-07T20:02:58.2172978Z INFO:root:creating _skbuild/linux-x86_64-3.13/setuptools/bdist.linux-x86_64/wheel/fbgemm_gpu/tbe/cache 2025-05-07T20:02:58.2174187Z INFO:root:copying _skbuild/linux-x86_64-3.13/setuptools/lib.linux-x86_64-cpython-313/fbgemm_gpu/tbe/cache/__init__.py -> _skbuild/linux-x86_64-3.13/setuptools/bdist.linux-x86_64/wheel/./fbgemm_gpu/tbe/cache 2025-05-07T20:02:58.2175872Z INFO:root:copying _skbuild/linux-x86_64-3.13/setuptools/lib.linux-x86_64-cpython-313/fbgemm_gpu/tbe/cache/split_embeddings_cache_ops.py -> _skbuild/linux-x86_64-3.13/setuptools/bdist.linux-x86_64/wheel/./fbgemm_gpu/tbe/cache 2025-05-07T20:02:58.2177133Z INFO:root:creating _skbuild/linux-x86_64-3.13/setuptools/bdist.linux-x86_64/wheel/fbgemm_gpu/tbe/ssd 2025-05-07T20:02:58.2177944Z INFO:root:creating _skbuild/linux-x86_64-3.13/setuptools/bdist.linux-x86_64/wheel/fbgemm_gpu/tbe/ssd/utils 2025-05-07T20:02:58.2179185Z INFO:root:copying _skbuild/linux-x86_64-3.13/setuptools/lib.linux-x86_64-cpython-313/fbgemm_gpu/tbe/ssd/utils/__init__.py -> _skbuild/linux-x86_64-3.13/setuptools/bdist.linux-x86_64/wheel/./fbgemm_gpu/tbe/ssd/utils 2025-05-07T20:02:58.2180981Z INFO:root:copying _skbuild/linux-x86_64-3.13/setuptools/lib.linux-x86_64-cpython-313/fbgemm_gpu/tbe/ssd/utils/partially_materialized_tensor.py -> _skbuild/linux-x86_64-3.13/setuptools/bdist.linux-x86_64/wheel/./fbgemm_gpu/tbe/ssd/utils 2025-05-07T20:02:58.2182716Z INFO:root:copying _skbuild/linux-x86_64-3.13/setuptools/lib.linux-x86_64-cpython-313/fbgemm_gpu/tbe/ssd/__init__.py -> 
_skbuild/linux-x86_64-3.13/setuptools/bdist.linux-x86_64/wheel/./fbgemm_gpu/tbe/ssd 2025-05-07T20:02:58.2184285Z INFO:root:copying _skbuild/linux-x86_64-3.13/setuptools/lib.linux-x86_64-cpython-313/fbgemm_gpu/tbe/ssd/common.py -> _skbuild/linux-x86_64-3.13/setuptools/bdist.linux-x86_64/wheel/./fbgemm_gpu/tbe/ssd 2025-05-07T20:02:58.2185919Z INFO:root:copying _skbuild/linux-x86_64-3.13/setuptools/lib.linux-x86_64-cpython-313/fbgemm_gpu/tbe/ssd/inference.py -> _skbuild/linux-x86_64-3.13/setuptools/bdist.linux-x86_64/wheel/./fbgemm_gpu/tbe/ssd 2025-05-07T20:02:58.2187533Z INFO:root:copying _skbuild/linux-x86_64-3.13/setuptools/lib.linux-x86_64-cpython-313/fbgemm_gpu/tbe/ssd/training.py -> _skbuild/linux-x86_64-3.13/setuptools/bdist.linux-x86_64/wheel/./fbgemm_gpu/tbe/ssd 2025-05-07T20:02:58.2188713Z INFO:root:creating _skbuild/linux-x86_64-3.13/setuptools/bdist.linux-x86_64/wheel/fbgemm_gpu/tbe/utils 2025-05-07T20:02:58.2189916Z INFO:root:copying _skbuild/linux-x86_64-3.13/setuptools/lib.linux-x86_64-cpython-313/fbgemm_gpu/tbe/utils/__init__.py -> _skbuild/linux-x86_64-3.13/setuptools/bdist.linux-x86_64/wheel/./fbgemm_gpu/tbe/utils 2025-05-07T20:02:58.2191553Z INFO:root:copying _skbuild/linux-x86_64-3.13/setuptools/lib.linux-x86_64-cpython-313/fbgemm_gpu/tbe/utils/common.py -> _skbuild/linux-x86_64-3.13/setuptools/bdist.linux-x86_64/wheel/./fbgemm_gpu/tbe/utils 2025-05-07T20:02:58.2193254Z INFO:root:copying _skbuild/linux-x86_64-3.13/setuptools/lib.linux-x86_64-cpython-313/fbgemm_gpu/tbe/utils/offsets.py -> _skbuild/linux-x86_64-3.13/setuptools/bdist.linux-x86_64/wheel/./fbgemm_gpu/tbe/utils 2025-05-07T20:02:58.2194889Z INFO:root:copying _skbuild/linux-x86_64-3.13/setuptools/lib.linux-x86_64-cpython-313/fbgemm_gpu/tbe/utils/quantize.py -> _skbuild/linux-x86_64-3.13/setuptools/bdist.linux-x86_64/wheel/./fbgemm_gpu/tbe/utils 2025-05-07T20:02:58.2196574Z INFO:root:copying _skbuild/linux-x86_64-3.13/setuptools/lib.linux-x86_64-cpython-313/fbgemm_gpu/tbe/utils/requests.py 
-> _skbuild/linux-x86_64-3.13/setuptools/bdist.linux-x86_64/wheel/./fbgemm_gpu/tbe/utils 2025-05-07T20:02:58.2197769Z INFO:root:creating _skbuild/linux-x86_64-3.13/setuptools/bdist.linux-x86_64/wheel/fbgemm_gpu/tbe/stats 2025-05-07T20:02:58.2198983Z INFO:root:copying _skbuild/linux-x86_64-3.13/setuptools/lib.linux-x86_64-cpython-313/fbgemm_gpu/tbe/stats/__init__.py -> _skbuild/linux-x86_64-3.13/setuptools/bdist.linux-x86_64/wheel/./fbgemm_gpu/tbe/stats 2025-05-07T20:02:58.2200656Z INFO:root:copying _skbuild/linux-x86_64-3.13/setuptools/lib.linux-x86_64-cpython-313/fbgemm_gpu/tbe/stats/bench_params_reporter.py -> _skbuild/linux-x86_64-3.13/setuptools/bdist.linux-x86_64/wheel/./fbgemm_gpu/tbe/stats 2025-05-07T20:02:58.2202406Z INFO:root:copying _skbuild/linux-x86_64-3.13/setuptools/lib.linux-x86_64-cpython-313/fbgemm_gpu/tbe/__init__.py -> _skbuild/linux-x86_64-3.13/setuptools/bdist.linux-x86_64/wheel/./fbgemm_gpu/tbe 2025-05-07T20:02:58.2203545Z INFO:root:creating _skbuild/linux-x86_64-3.13/setuptools/bdist.linux-x86_64/wheel/fbgemm_gpu/triton 2025-05-07T20:02:58.2204348Z INFO:root:creating _skbuild/linux-x86_64-3.13/setuptools/bdist.linux-x86_64/wheel/fbgemm_gpu/triton/jagged 2025-05-07T20:02:58.2205594Z INFO:root:copying _skbuild/linux-x86_64-3.13/setuptools/lib.linux-x86_64-cpython-313/fbgemm_gpu/triton/jagged/__init__.py -> _skbuild/linux-x86_64-3.13/setuptools/bdist.linux-x86_64/wheel/./fbgemm_gpu/triton/jagged 2025-05-07T20:02:58.2207346Z INFO:root:copying _skbuild/linux-x86_64-3.13/setuptools/lib.linux-x86_64-cpython-313/fbgemm_gpu/triton/jagged/triton_jagged_tensor_ops.py -> _skbuild/linux-x86_64-3.13/setuptools/bdist.linux-x86_64/wheel/./fbgemm_gpu/triton/jagged 2025-05-07T20:02:58.2209042Z INFO:root:copying _skbuild/linux-x86_64-3.13/setuptools/lib.linux-x86_64-cpython-313/fbgemm_gpu/triton/__init__.py -> _skbuild/linux-x86_64-3.13/setuptools/bdist.linux-x86_64/wheel/./fbgemm_gpu/triton 2025-05-07T20:02:58.2210594Z INFO:root:copying 
_skbuild/linux-x86_64-3.13/setuptools/lib.linux-x86_64-cpython-313/fbgemm_gpu/triton/common.py -> _skbuild/linux-x86_64-3.13/setuptools/bdist.linux-x86_64/wheel/./fbgemm_gpu/triton 2025-05-07T20:02:58.2212172Z INFO:root:copying _skbuild/linux-x86_64-3.13/setuptools/lib.linux-x86_64-cpython-313/fbgemm_gpu/triton/quantize.py -> _skbuild/linux-x86_64-3.13/setuptools/bdist.linux-x86_64/wheel/./fbgemm_gpu/triton 2025-05-07T20:02:58.2214026Z INFO:root:copying _skbuild/linux-x86_64-3.13/setuptools/lib.linux-x86_64-cpython-313/fbgemm_gpu/triton/quantize_ref.py -> _skbuild/linux-x86_64-3.13/setuptools/bdist.linux-x86_64/wheel/./fbgemm_gpu/triton 2025-05-07T20:02:58.2215101Z INFO:root:creating _skbuild/linux-x86_64-3.13/setuptools/bdist.linux-x86_64/wheel/fbgemm_gpu/utils 2025-05-07T20:02:58.2216168Z INFO:root:copying _skbuild/linux-x86_64-3.13/setuptools/lib.linux-x86_64-cpython-313/fbgemm_gpu/utils/__init__.py -> _skbuild/linux-x86_64-3.13/setuptools/bdist.linux-x86_64/wheel/./fbgemm_gpu/utils 2025-05-07T20:02:58.2217622Z INFO:root:copying _skbuild/linux-x86_64-3.13/setuptools/lib.linux-x86_64-cpython-313/fbgemm_gpu/utils/filestore.py -> _skbuild/linux-x86_64-3.13/setuptools/bdist.linux-x86_64/wheel/./fbgemm_gpu/utils 2025-05-07T20:02:58.2219049Z INFO:root:copying _skbuild/linux-x86_64-3.13/setuptools/lib.linux-x86_64-cpython-313/fbgemm_gpu/utils/loader.py -> _skbuild/linux-x86_64-3.13/setuptools/bdist.linux-x86_64/wheel/./fbgemm_gpu/utils 2025-05-07T20:02:58.2220547Z INFO:root:copying _skbuild/linux-x86_64-3.13/setuptools/lib.linux-x86_64-cpython-313/fbgemm_gpu/utils/torch_library.py -> _skbuild/linux-x86_64-3.13/setuptools/bdist.linux-x86_64/wheel/./fbgemm_gpu/utils 2025-05-07T20:02:58.2221986Z INFO:root:copying _skbuild/linux-x86_64-3.13/setuptools/lib.linux-x86_64-cpython-313/fbgemm_gpu/asmjit.so -> _skbuild/linux-x86_64-3.13/setuptools/bdist.linux-x86_64/wheel/./fbgemm_gpu 2025-05-07T20:02:58.2223333Z INFO:root:copying 
_skbuild/linux-x86_64-3.13/setuptools/lib.linux-x86_64-cpython-313/fbgemm_gpu/fbgemm.so -> _skbuild/linux-x86_64-3.13/setuptools/bdist.linux-x86_64/wheel/./fbgemm_gpu 2025-05-07T20:02:58.2231386Z INFO:root:copying _skbuild/linux-x86_64-3.13/setuptools/lib.linux-x86_64-cpython-313/fbgemm_gpu/fbgemm_gpu_tbe_cache.so -> _skbuild/linux-x86_64-3.13/setuptools/bdist.linux-x86_64/wheel/./fbgemm_gpu 2025-05-07T20:02:58.2299911Z INFO:root:copying _skbuild/linux-x86_64-3.13/setuptools/lib.linux-x86_64-cpython-313/fbgemm_gpu/fbgemm_gpu_tbe_inference.so -> _skbuild/linux-x86_64-3.13/setuptools/bdist.linux-x86_64/wheel/./fbgemm_gpu 2025-05-07T20:02:58.3637517Z INFO:root:copying _skbuild/linux-x86_64-3.13/setuptools/lib.linux-x86_64-cpython-313/fbgemm_gpu/fbgemm_gpu_config.so -> _skbuild/linux-x86_64-3.13/setuptools/bdist.linux-x86_64/wheel/./fbgemm_gpu 2025-05-07T20:02:58.3639120Z INFO:root:copying _skbuild/linux-x86_64-3.13/setuptools/lib.linux-x86_64-cpython-313/fbgemm_gpu/fbgemm_gpu_tbe_utils.so -> _skbuild/linux-x86_64-3.13/setuptools/bdist.linux-x86_64/wheel/./fbgemm_gpu 2025-05-07T20:02:58.3693859Z INFO:root:copying _skbuild/linux-x86_64-3.13/setuptools/lib.linux-x86_64-cpython-313/fbgemm_gpu/fbgemm_gpu_sparse_async_cumsum.so -> _skbuild/linux-x86_64-3.13/setuptools/bdist.linux-x86_64/wheel/./fbgemm_gpu 2025-05-07T20:02:58.3697015Z INFO:root:copying _skbuild/linux-x86_64-3.13/setuptools/lib.linux-x86_64-cpython-313/fbgemm_gpu/fbgemm_gpu_tbe_common.so -> _skbuild/linux-x86_64-3.13/setuptools/bdist.linux-x86_64/wheel/./fbgemm_gpu 2025-05-07T20:02:58.3709215Z INFO:root:copying _skbuild/linux-x86_64-3.13/setuptools/lib.linux-x86_64-cpython-313/fbgemm_gpu/fbgemm_gpu_tbe_optimizers.so -> _skbuild/linux-x86_64-3.13/setuptools/bdist.linux-x86_64/wheel/./fbgemm_gpu 2025-05-07T20:02:58.3743373Z INFO:root:copying _skbuild/linux-x86_64-3.13/setuptools/lib.linux-x86_64-cpython-313/fbgemm_gpu/fbgemm_gpu_tbe_training_forward.so -> 
_skbuild/linux-x86_64-3.13/setuptools/bdist.linux-x86_64/wheel/./fbgemm_gpu 2025-05-07T20:02:58.4911960Z INFO:root:copying _skbuild/linux-x86_64-3.13/setuptools/lib.linux-x86_64-cpython-313/fbgemm_gpu/fbgemm_gpu_tbe_training_backward_pt2.so -> _skbuild/linux-x86_64-3.13/setuptools/bdist.linux-x86_64/wheel/./fbgemm_gpu 2025-05-07T20:02:58.4976791Z INFO:root:copying _skbuild/linux-x86_64-3.13/setuptools/lib.linux-x86_64-cpython-313/fbgemm_gpu/fbgemm_gpu_tbe_training_backward.so -> _skbuild/linux-x86_64-3.13/setuptools/bdist.linux-x86_64/wheel/./fbgemm_gpu 2025-05-07T20:02:58.7995490Z INFO:root:copying _skbuild/linux-x86_64-3.13/setuptools/lib.linux-x86_64-cpython-313/fbgemm_gpu/fbgemm_gpu_tbe_training_backward_gwd.so -> _skbuild/linux-x86_64-3.13/setuptools/bdist.linux-x86_64/wheel/./fbgemm_gpu 2025-05-07T20:02:58.8462734Z INFO:root:copying _skbuild/linux-x86_64-3.13/setuptools/lib.linux-x86_64-cpython-313/fbgemm_gpu/fbgemm_gpu_tbe_training_backward_vbe.so -> _skbuild/linux-x86_64-3.13/setuptools/bdist.linux-x86_64/wheel/./fbgemm_gpu 2025-05-07T20:02:58.9537783Z INFO:root:copying _skbuild/linux-x86_64-3.13/setuptools/lib.linux-x86_64-cpython-313/fbgemm_gpu/fbgemm_gpu_tbe_training_backward_dense.so -> _skbuild/linux-x86_64-3.13/setuptools/bdist.linux-x86_64/wheel/./fbgemm_gpu 2025-05-07T20:02:58.9722516Z INFO:root:copying _skbuild/linux-x86_64-3.13/setuptools/lib.linux-x86_64-cpython-313/fbgemm_gpu/fbgemm_gpu_tbe_training_backward_split_host.so -> _skbuild/linux-x86_64-3.13/setuptools/bdist.linux-x86_64/wheel/./fbgemm_gpu 2025-05-07T20:02:58.9755194Z INFO:root:copying _skbuild/linux-x86_64-3.13/setuptools/lib.linux-x86_64-cpython-313/fbgemm_gpu/fbgemm_gpu_tbe_index_select.so -> _skbuild/linux-x86_64-3.13/setuptools/bdist.linux-x86_64/wheel/./fbgemm_gpu 2025-05-07T20:02:58.9867451Z INFO:root:creating _skbuild/linux-x86_64-3.13/setuptools/bdist.linux-x86_64/wheel/fbgemm_gpu/split_embedding_codegen_lookup_invokers 2025-05-07T20:02:58.9869336Z INFO:root:copying 
_skbuild/linux-x86_64-3.13/setuptools/lib.linux-x86_64-cpython-313/fbgemm_gpu/split_embedding_codegen_lookup_invokers/lookup_adagrad.py -> _skbuild/linux-x86_64-3.13/setuptools/bdist.linux-x86_64/wheel/./fbgemm_gpu/split_embedding_codegen_lookup_invokers 2025-05-07T20:02:58.9871824Z INFO:root:copying _skbuild/linux-x86_64-3.13/setuptools/lib.linux-x86_64-cpython-313/fbgemm_gpu/split_embedding_codegen_lookup_invokers/lookup_rowwise_adagrad.py -> _skbuild/linux-x86_64-3.13/setuptools/bdist.linux-x86_64/wheel/./fbgemm_gpu/split_embedding_codegen_lookup_invokers 2025-05-07T20:02:58.9874242Z INFO:root:copying _skbuild/linux-x86_64-3.13/setuptools/lib.linux-x86_64-cpython-313/fbgemm_gpu/split_embedding_codegen_lookup_invokers/lookup_sgd.py -> _skbuild/linux-x86_64-3.13/setuptools/bdist.linux-x86_64/wheel/./fbgemm_gpu/split_embedding_codegen_lookup_invokers 2025-05-07T20:02:58.9876600Z INFO:root:copying _skbuild/linux-x86_64-3.13/setuptools/lib.linux-x86_64-cpython-313/fbgemm_gpu/split_embedding_codegen_lookup_invokers/lookup_adam.py -> _skbuild/linux-x86_64-3.13/setuptools/bdist.linux-x86_64/wheel/./fbgemm_gpu/split_embedding_codegen_lookup_invokers 2025-05-07T20:02:58.9878986Z INFO:root:copying _skbuild/linux-x86_64-3.13/setuptools/lib.linux-x86_64-cpython-313/fbgemm_gpu/split_embedding_codegen_lookup_invokers/lookup_lamb.py -> _skbuild/linux-x86_64-3.13/setuptools/bdist.linux-x86_64/wheel/./fbgemm_gpu/split_embedding_codegen_lookup_invokers 2025-05-07T20:02:58.9881281Z INFO:root:copying _skbuild/linux-x86_64-3.13/setuptools/lib.linux-x86_64-cpython-313/fbgemm_gpu/split_embedding_codegen_lookup_invokers/lookup_lars_sgd.py -> _skbuild/linux-x86_64-3.13/setuptools/bdist.linux-x86_64/wheel/./fbgemm_gpu/split_embedding_codegen_lookup_invokers 2025-05-07T20:02:58.9883676Z INFO:root:copying _skbuild/linux-x86_64-3.13/setuptools/lib.linux-x86_64-cpython-313/fbgemm_gpu/split_embedding_codegen_lookup_invokers/lookup_partial_rowwise_adam.py -> 
_skbuild/linux-x86_64-3.13/setuptools/bdist.linux-x86_64/wheel/./fbgemm_gpu/split_embedding_codegen_lookup_invokers 2025-05-07T20:02:58.9886131Z INFO:root:copying _skbuild/linux-x86_64-3.13/setuptools/lib.linux-x86_64-cpython-313/fbgemm_gpu/split_embedding_codegen_lookup_invokers/lookup_partial_rowwise_lamb.py -> _skbuild/linux-x86_64-3.13/setuptools/bdist.linux-x86_64/wheel/./fbgemm_gpu/split_embedding_codegen_lookup_invokers 2025-05-07T20:02:58.9888334Z INFO:root:copying _skbuild/linux-x86_64-3.13/setuptools/lib.linux-x86_64-cpython-313/fbgemm_gpu/split_embedding_codegen_lookup_invokers/lookup_none.py -> _skbuild/linux-x86_64-3.13/setuptools/bdist.linux-x86_64/wheel/./fbgemm_gpu/split_embedding_codegen_lookup_invokers 2025-05-07T20:02:58.9890772Z INFO:root:copying _skbuild/linux-x86_64-3.13/setuptools/lib.linux-x86_64-cpython-313/fbgemm_gpu/split_embedding_codegen_lookup_invokers/lookup_rowwise_adagrad_with_counter.py -> _skbuild/linux-x86_64-3.13/setuptools/bdist.linux-x86_64/wheel/./fbgemm_gpu/split_embedding_codegen_lookup_invokers 2025-05-07T20:02:58.9893070Z INFO:root:copying _skbuild/linux-x86_64-3.13/setuptools/lib.linux-x86_64-cpython-313/fbgemm_gpu/split_embedding_codegen_lookup_invokers/lookup_rowwise_adagrad_ssd.py -> _skbuild/linux-x86_64-3.13/setuptools/bdist.linux-x86_64/wheel/./fbgemm_gpu/split_embedding_codegen_lookup_invokers 2025-05-07T20:02:58.9895235Z INFO:root:copying _skbuild/linux-x86_64-3.13/setuptools/lib.linux-x86_64-cpython-313/fbgemm_gpu/split_embedding_codegen_lookup_invokers/__init__.py -> _skbuild/linux-x86_64-3.13/setuptools/bdist.linux-x86_64/wheel/./fbgemm_gpu/split_embedding_codegen_lookup_invokers 2025-05-07T20:02:58.9897380Z INFO:root:copying _skbuild/linux-x86_64-3.13/setuptools/lib.linux-x86_64-cpython-313/fbgemm_gpu/split_embedding_codegen_lookup_invokers/lookup_args.py -> _skbuild/linux-x86_64-3.13/setuptools/bdist.linux-x86_64/wheel/./fbgemm_gpu/split_embedding_codegen_lookup_invokers 2025-05-07T20:02:58.9899555Z 
INFO:root:copying _skbuild/linux-x86_64-3.13/setuptools/lib.linux-x86_64-cpython-313/fbgemm_gpu/split_embedding_codegen_lookup_invokers/lookup_args_ssd.py -> _skbuild/linux-x86_64-3.13/setuptools/bdist.linux-x86_64/wheel/./fbgemm_gpu/split_embedding_codegen_lookup_invokers 2025-05-07T20:02:58.9901128Z INFO:root:creating _skbuild/linux-x86_64-3.13/setuptools/bdist.linux-x86_64/wheel/fbgemm_gpu/split_embedding_optimizer_codegen 2025-05-07T20:02:58.9902976Z INFO:root:copying _skbuild/linux-x86_64-3.13/setuptools/lib.linux-x86_64-cpython-313/fbgemm_gpu/split_embedding_optimizer_codegen/split_embedding_optimizer_rowwise_adagrad.py -> _skbuild/linux-x86_64-3.13/setuptools/bdist.linux-x86_64/wheel/./fbgemm_gpu/split_embedding_optimizer_codegen 2025-05-07T20:02:58.9905180Z INFO:root:copying _skbuild/linux-x86_64-3.13/setuptools/lib.linux-x86_64-cpython-313/fbgemm_gpu/split_embedding_optimizer_codegen/optimizer_args.py -> _skbuild/linux-x86_64-3.13/setuptools/bdist.linux-x86_64/wheel/./fbgemm_gpu/split_embedding_optimizer_codegen 2025-05-07T20:02:58.9907043Z INFO:root:copying _skbuild/linux-x86_64-3.13/setuptools/lib.linux-x86_64-cpython-313/fbgemm_gpu/fbgemm_gpu_embedding_inplace_ops.so -> _skbuild/linux-x86_64-3.13/setuptools/bdist.linux-x86_64/wheel/./fbgemm_gpu 2025-05-07T20:02:58.9908605Z INFO:root:copying _skbuild/linux-x86_64-3.13/setuptools/lib.linux-x86_64-cpython-313/fbgemm_gpu/fbgemm_gpu_py.so -> _skbuild/linux-x86_64-3.13/setuptools/bdist.linux-x86_64/wheel/./fbgemm_gpu 2025-05-07T20:02:59.0136357Z INFO:root:copying _skbuild/linux-x86_64-3.13/setuptools/lib.linux-x86_64-cpython-313/fbgemm_gpu/__init__.py -> _skbuild/linux-x86_64-3.13/setuptools/bdist.linux-x86_64/wheel/./fbgemm_gpu 2025-05-07T20:02:59.0138125Z INFO:root:copying _skbuild/linux-x86_64-3.13/setuptools/lib.linux-x86_64-cpython-313/fbgemm_gpu/batched_unary_embeddings_ops.py -> _skbuild/linux-x86_64-3.13/setuptools/bdist.linux-x86_64/wheel/./fbgemm_gpu 2025-05-07T20:02:59.0139824Z INFO:root:copying 
_skbuild/linux-x86_64-3.13/setuptools/lib.linux-x86_64-cpython-313/fbgemm_gpu/enums.py -> _skbuild/linux-x86_64-3.13/setuptools/bdist.linux-x86_64/wheel/./fbgemm_gpu 2025-05-07T20:02:59.0141401Z INFO:root:copying _skbuild/linux-x86_64-3.13/setuptools/lib.linux-x86_64-cpython-313/fbgemm_gpu/metrics.py -> _skbuild/linux-x86_64-3.13/setuptools/bdist.linux-x86_64/wheel/./fbgemm_gpu 2025-05-07T20:02:59.0143092Z INFO:root:copying _skbuild/linux-x86_64-3.13/setuptools/lib.linux-x86_64-cpython-313/fbgemm_gpu/permute_pooled_embedding_modules.py -> _skbuild/linux-x86_64-3.13/setuptools/bdist.linux-x86_64/wheel/./fbgemm_gpu 2025-05-07T20:02:59.0144914Z INFO:root:copying _skbuild/linux-x86_64-3.13/setuptools/lib.linux-x86_64-cpython-313/fbgemm_gpu/permute_pooled_embedding_modules_split.py -> _skbuild/linux-x86_64-3.13/setuptools/bdist.linux-x86_64/wheel/./fbgemm_gpu 2025-05-07T20:02:59.0146822Z INFO:root:copying _skbuild/linux-x86_64-3.13/setuptools/lib.linux-x86_64-cpython-313/fbgemm_gpu/quantize_comm.py -> _skbuild/linux-x86_64-3.13/setuptools/bdist.linux-x86_64/wheel/./fbgemm_gpu 2025-05-07T20:02:59.0148451Z INFO:root:copying _skbuild/linux-x86_64-3.13/setuptools/lib.linux-x86_64-cpython-313/fbgemm_gpu/quantize_utils.py -> _skbuild/linux-x86_64-3.13/setuptools/bdist.linux-x86_64/wheel/./fbgemm_gpu 2025-05-07T20:02:59.0150071Z INFO:root:copying _skbuild/linux-x86_64-3.13/setuptools/lib.linux-x86_64-cpython-313/fbgemm_gpu/runtime_monitor.py -> _skbuild/linux-x86_64-3.13/setuptools/bdist.linux-x86_64/wheel/./fbgemm_gpu 2025-05-07T20:02:59.0151607Z INFO:root:copying _skbuild/linux-x86_64-3.13/setuptools/lib.linux-x86_64-cpython-313/fbgemm_gpu/sparse_ops.py -> _skbuild/linux-x86_64-3.13/setuptools/bdist.linux-x86_64/wheel/./fbgemm_gpu 2025-05-07T20:02:59.0153409Z INFO:root:copying _skbuild/linux-x86_64-3.13/setuptools/lib.linux-x86_64-cpython-313/fbgemm_gpu/split_embedding_configs.py -> _skbuild/linux-x86_64-3.13/setuptools/bdist.linux-x86_64/wheel/./fbgemm_gpu 
2025-05-07T20:02:59.0155303Z INFO:root:copying _skbuild/linux-x86_64-3.13/setuptools/lib.linux-x86_64-cpython-313/fbgemm_gpu/split_embedding_inference_converter.py -> _skbuild/linux-x86_64-3.13/setuptools/bdist.linux-x86_64/wheel/./fbgemm_gpu 2025-05-07T20:02:59.0157085Z INFO:root:copying _skbuild/linux-x86_64-3.13/setuptools/lib.linux-x86_64-cpython-313/fbgemm_gpu/split_embedding_optimizer_ops.py -> _skbuild/linux-x86_64-3.13/setuptools/bdist.linux-x86_64/wheel/./fbgemm_gpu 2025-05-07T20:02:59.0158736Z INFO:root:copying _skbuild/linux-x86_64-3.13/setuptools/lib.linux-x86_64-cpython-313/fbgemm_gpu/split_embedding_utils.py -> _skbuild/linux-x86_64-3.13/setuptools/bdist.linux-x86_64/wheel/./fbgemm_gpu 2025-05-07T20:02:59.0160344Z INFO:root:copying _skbuild/linux-x86_64-3.13/setuptools/lib.linux-x86_64-cpython-313/fbgemm_gpu/split_table_batched_embeddings_ops.py -> _skbuild/linux-x86_64-3.13/setuptools/bdist.linux-x86_64/wheel/./fbgemm_gpu 2025-05-07T20:02:59.0162048Z INFO:root:copying _skbuild/linux-x86_64-3.13/setuptools/lib.linux-x86_64-cpython-313/fbgemm_gpu/split_table_batched_embeddings_ops_common.py -> _skbuild/linux-x86_64-3.13/setuptools/bdist.linux-x86_64/wheel/./fbgemm_gpu 2025-05-07T20:02:59.0163754Z INFO:root:copying _skbuild/linux-x86_64-3.13/setuptools/lib.linux-x86_64-cpython-313/fbgemm_gpu/split_table_batched_embeddings_ops_inference.py -> _skbuild/linux-x86_64-3.13/setuptools/bdist.linux-x86_64/wheel/./fbgemm_gpu 2025-05-07T20:02:59.0165496Z INFO:root:copying _skbuild/linux-x86_64-3.13/setuptools/lib.linux-x86_64-cpython-313/fbgemm_gpu/split_table_batched_embeddings_ops_training.py -> _skbuild/linux-x86_64-3.13/setuptools/bdist.linux-x86_64/wheel/./fbgemm_gpu 2025-05-07T20:02:59.0167248Z INFO:root:copying _skbuild/linux-x86_64-3.13/setuptools/lib.linux-x86_64-cpython-313/fbgemm_gpu/split_table_batched_embeddings_ops_training_common.py -> _skbuild/linux-x86_64-3.13/setuptools/bdist.linux-x86_64/wheel/./fbgemm_gpu 2025-05-07T20:02:59.0168987Z 
INFO:root:copying _skbuild/linux-x86_64-3.13/setuptools/lib.linux-x86_64-cpython-313/fbgemm_gpu/ssd_split_table_batched_embeddings_ops.py -> _skbuild/linux-x86_64-3.13/setuptools/bdist.linux-x86_64/wheel/./fbgemm_gpu 2025-05-07T20:02:59.0170601Z INFO:root:copying _skbuild/linux-x86_64-3.13/setuptools/lib.linux-x86_64-cpython-313/fbgemm_gpu/tbe_input_multiplexer.py -> _skbuild/linux-x86_64-3.13/setuptools/bdist.linux-x86_64/wheel/./fbgemm_gpu 2025-05-07T20:02:59.0172103Z INFO:root:copying _skbuild/linux-x86_64-3.13/setuptools/lib.linux-x86_64-cpython-313/fbgemm_gpu/uvm.py -> _skbuild/linux-x86_64-3.13/setuptools/bdist.linux-x86_64/wheel/./fbgemm_gpu 2025-05-07T20:02:59.0172976Z INFO:skbuild:copied 125 files 2025-05-07T20:02:59.0173260Z INFO:root:running install_egg_info 2025-05-07T20:02:59.0210949Z INFO:root:running egg_info 2025-05-07T20:02:59.0250434Z INFO:root:writing fbgemm_gpu_nightly.egg-info/PKG-INFO 2025-05-07T20:02:59.0252522Z INFO:root:writing dependency_links to fbgemm_gpu_nightly.egg-info/dependency_links.txt 2025-05-07T20:02:59.0254817Z INFO:root:writing requirements to fbgemm_gpu_nightly.egg-info/requires.txt 2025-05-07T20:02:59.0255779Z INFO:root:writing top-level names to fbgemm_gpu_nightly.egg-info/top_level.txt 2025-05-07T20:02:59.0347422Z INFO:root:reading manifest file 'fbgemm_gpu_nightly.egg-info/SOURCES.txt' 2025-05-07T20:02:59.0379277Z INFO:root:writing manifest file 'fbgemm_gpu_nightly.egg-info/SOURCES.txt' 2025-05-07T20:02:59.0380207Z INFO:root:Copying fbgemm_gpu_nightly.egg-info to _skbuild/linux-x86_64-3.13/setuptools/bdist.linux-x86_64/wheel/./fbgemm_gpu_nightly-2025.5.7-py3.13.egg-info 2025-05-07T20:02:59.0384935Z INFO:root:running install_scripts 2025-05-07T20:02:59.0385703Z INFO:skbuild:copied 0 files 2025-05-07T20:03:01.6864396Z INFO:root:creating _skbuild/linux-x86_64-3.13/setuptools/bdist.linux-x86_64/wheel/fbgemm_gpu_nightly-2025.5.7.dist-info/WHEEL 2025-05-07T20:03:01.6866103Z INFO:wheel:creating 
'/__w/FBGEMM/FBGEMM/fbgemm_gpu/dist/.tmp-q69au0_e/fbgemm_gpu_nightly-2025.5.7-cp313-cp313-manylinux_2_28_x86_64.whl' and adding '_skbuild/linux-x86_64-3.13/setuptools/bdist.linux-x86_64/wheel' to it 2025-05-07T20:03:01.6868347Z INFO:wheel:adding 'fbgemm_gpu/__init__.py' 2025-05-07T20:03:01.7131319Z INFO:wheel:adding 'fbgemm_gpu/asmjit.so' 2025-05-07T20:03:01.7143830Z INFO:wheel:adding 'fbgemm_gpu/batched_unary_embeddings_ops.py' 2025-05-07T20:03:01.7144301Z INFO:wheel:adding 'fbgemm_gpu/enums.py' 2025-05-07T20:03:01.9155234Z INFO:wheel:adding 'fbgemm_gpu/fbgemm.so' 2025-05-07T20:03:01.9287041Z INFO:wheel:adding 'fbgemm_gpu/fbgemm_gpu_config.so' 2025-05-07T20:03:01.9394979Z INFO:wheel:adding 'fbgemm_gpu/fbgemm_gpu_embedding_inplace_ops.so' 2025-05-07T20:03:02.9020571Z INFO:wheel:adding 'fbgemm_gpu/fbgemm_gpu_py.so' 2025-05-07T20:03:03.0115778Z INFO:wheel:adding 'fbgemm_gpu/fbgemm_gpu_sparse_async_cumsum.so' 2025-05-07T20:03:03.3733601Z INFO:wheel:adding 'fbgemm_gpu/fbgemm_gpu_tbe_cache.so' 2025-05-07T20:03:03.4331545Z INFO:wheel:adding 'fbgemm_gpu/fbgemm_gpu_tbe_common.so' 2025-05-07T20:03:03.7463670Z INFO:wheel:adding 'fbgemm_gpu/fbgemm_gpu_tbe_index_select.so' 2025-05-07T20:03:12.1999725Z INFO:wheel:adding 'fbgemm_gpu/fbgemm_gpu_tbe_inference.so' 2025-05-07T20:03:12.8017490Z INFO:wheel:adding 'fbgemm_gpu/fbgemm_gpu_tbe_optimizers.so' 2025-05-07T20:03:27.0909718Z INFO:wheel:adding 'fbgemm_gpu/fbgemm_gpu_tbe_training_backward.so' 2025-05-07T20:03:28.6459462Z INFO:wheel:adding 'fbgemm_gpu/fbgemm_gpu_tbe_training_backward_dense.so' 2025-05-07T20:03:30.6035255Z INFO:wheel:adding 'fbgemm_gpu/fbgemm_gpu_tbe_training_backward_gwd.so' 2025-05-07T20:03:31.1600604Z INFO:wheel:adding 'fbgemm_gpu/fbgemm_gpu_tbe_training_backward_pt2.so' 2025-05-07T20:03:31.3827829Z INFO:wheel:adding 'fbgemm_gpu/fbgemm_gpu_tbe_training_backward_split_host.so' 2025-05-07T20:03:35.9459447Z INFO:wheel:adding 'fbgemm_gpu/fbgemm_gpu_tbe_training_backward_vbe.so' 2025-05-07T20:03:41.8552195Z 
INFO:wheel:adding 'fbgemm_gpu/fbgemm_gpu_tbe_training_forward.so' 2025-05-07T20:03:42.6155663Z INFO:wheel:adding 'fbgemm_gpu/fbgemm_gpu_tbe_utils.so' 2025-05-07T20:03:42.6331866Z INFO:wheel:adding 'fbgemm_gpu/metrics.py' 2025-05-07T20:03:42.6335601Z INFO:wheel:adding 'fbgemm_gpu/permute_pooled_embedding_modules.py' 2025-05-07T20:03:42.6336252Z INFO:wheel:adding 'fbgemm_gpu/permute_pooled_embedding_modules_split.py' 2025-05-07T20:03:42.6336718Z INFO:wheel:adding 'fbgemm_gpu/quantize_comm.py' 2025-05-07T20:03:42.6338456Z INFO:wheel:adding 'fbgemm_gpu/quantize_utils.py' 2025-05-07T20:03:42.6341363Z INFO:wheel:adding 'fbgemm_gpu/runtime_monitor.py' 2025-05-07T20:03:42.6352057Z INFO:wheel:adding 'fbgemm_gpu/sparse_ops.py' 2025-05-07T20:03:42.6355874Z INFO:wheel:adding 'fbgemm_gpu/split_embedding_configs.py' 2025-05-07T20:03:42.6358525Z INFO:wheel:adding 'fbgemm_gpu/split_embedding_inference_converter.py' 2025-05-07T20:03:42.6359993Z INFO:wheel:adding 'fbgemm_gpu/split_embedding_optimizer_ops.py' 2025-05-07T20:03:42.6361271Z INFO:wheel:adding 'fbgemm_gpu/split_embedding_utils.py' 2025-05-07T20:03:42.6362999Z INFO:wheel:adding 'fbgemm_gpu/split_table_batched_embeddings_ops.py' 2025-05-07T20:03:42.6366050Z INFO:wheel:adding 'fbgemm_gpu/split_table_batched_embeddings_ops_common.py' 2025-05-07T20:03:42.6388981Z INFO:wheel:adding 'fbgemm_gpu/split_table_batched_embeddings_ops_inference.py' 2025-05-07T20:03:42.6431071Z INFO:wheel:adding 'fbgemm_gpu/split_table_batched_embeddings_ops_training.py' 2025-05-07T20:03:42.6434404Z INFO:wheel:adding 'fbgemm_gpu/split_table_batched_embeddings_ops_training_common.py' 2025-05-07T20:03:42.6435689Z INFO:wheel:adding 'fbgemm_gpu/ssd_split_table_batched_embeddings_ops.py' 2025-05-07T20:03:42.6437568Z INFO:wheel:adding 'fbgemm_gpu/tbe_input_multiplexer.py' 2025-05-07T20:03:42.6438928Z INFO:wheel:adding 'fbgemm_gpu/uvm.py' 2025-05-07T20:03:42.6440922Z INFO:wheel:adding 'fbgemm_gpu/config/__init__.py' 2025-05-07T20:03:42.6442619Z 
INFO:wheel:adding 'fbgemm_gpu/config/feature_list.py' 2025-05-07T20:03:42.6444233Z INFO:wheel:adding 'fbgemm_gpu/docs/__init__.py' 2025-05-07T20:03:42.6445466Z INFO:wheel:adding 'fbgemm_gpu/docs/common.py' 2025-05-07T20:03:42.6447196Z INFO:wheel:adding 'fbgemm_gpu/docs/examples.py' 2025-05-07T20:03:42.6449495Z INFO:wheel:adding 'fbgemm_gpu/docs/jagged_tensor_ops.py' 2025-05-07T20:03:42.6451205Z INFO:wheel:adding 'fbgemm_gpu/docs/merge_pooled_embedding_ops.py' 2025-05-07T20:03:42.6453202Z INFO:wheel:adding 'fbgemm_gpu/docs/permute_pooled_embedding_ops.py' 2025-05-07T20:03:42.6454745Z INFO:wheel:adding 'fbgemm_gpu/docs/quantize_ops.py' 2025-05-07T20:03:42.6460319Z INFO:wheel:adding 'fbgemm_gpu/docs/sparse_ops.py' 2025-05-07T20:03:42.6462051Z INFO:wheel:adding 'fbgemm_gpu/docs/version.py' 2025-05-07T20:03:42.6463695Z INFO:wheel:adding 'fbgemm_gpu/quantize/__init__.py' 2025-05-07T20:03:42.6465235Z INFO:wheel:adding 'fbgemm_gpu/quantize/quantize_ops.py' 2025-05-07T20:03:42.6467148Z INFO:wheel:adding 'fbgemm_gpu/sll/__init__.py' 2025-05-07T20:03:42.6469111Z INFO:wheel:adding 'fbgemm_gpu/sll/cpu/__init__.py' 2025-05-07T20:03:42.6475318Z INFO:wheel:adding 'fbgemm_gpu/sll/cpu/cpu_sll.py' 2025-05-07T20:03:42.6477717Z INFO:wheel:adding 'fbgemm_gpu/sll/meta/__init__.py' 2025-05-07T20:03:42.6480067Z INFO:wheel:adding 'fbgemm_gpu/sll/meta/meta_sll.py' 2025-05-07T20:03:42.6482307Z INFO:wheel:adding 'fbgemm_gpu/sll/triton/__init__.py' 2025-05-07T20:03:42.6483719Z INFO:wheel:adding 'fbgemm_gpu/sll/triton/common.py' 2025-05-07T20:03:42.6485409Z INFO:wheel:adding 'fbgemm_gpu/sll/triton/triton_dense_jagged_cat_jagged_out.py' 2025-05-07T20:03:42.6487648Z INFO:wheel:adding 'fbgemm_gpu/sll/triton/triton_jagged2_to_padded_dense.py' 2025-05-07T20:03:42.6491047Z INFO:wheel:adding 'fbgemm_gpu/sll/triton/triton_jagged_bmm.py' 2025-05-07T20:03:42.6494876Z INFO:wheel:adding 'fbgemm_gpu/sll/triton/triton_jagged_bmm_jagged_out.py' 2025-05-07T20:03:42.6496772Z INFO:wheel:adding 
'fbgemm_gpu/sll/triton/triton_jagged_dense_elementwise_add.py' 2025-05-07T20:03:42.6498855Z INFO:wheel:adding 'fbgemm_gpu/sll/triton/triton_jagged_dense_elementwise_mul_jagged_out.py' 2025-05-07T20:03:42.6504503Z INFO:wheel:adding 'fbgemm_gpu/sll/triton/triton_jagged_dense_flash_attention.py' 2025-05-07T20:03:42.6509712Z INFO:wheel:adding 'fbgemm_gpu/sll/triton/triton_jagged_flash_attention_basic.py' 2025-05-07T20:03:42.6511801Z INFO:wheel:adding 'fbgemm_gpu/sll/triton/triton_jagged_self_substraction_jagged_out.py' 2025-05-07T20:03:42.6515526Z INFO:wheel:adding 'fbgemm_gpu/sll/triton/triton_jagged_softmax.py' 2025-05-07T20:03:42.6520685Z INFO:wheel:adding 'fbgemm_gpu/sll/triton/triton_multi_head_jagged_flash_attention.py' 2025-05-07T20:03:42.6523235Z INFO:wheel:adding 'fbgemm_gpu/split_embedding_codegen_lookup_invokers/__init__.py' 2025-05-07T20:03:42.6526041Z INFO:wheel:adding 'fbgemm_gpu/split_embedding_codegen_lookup_invokers/lookup_adagrad.py' 2025-05-07T20:03:42.6529433Z INFO:wheel:adding 'fbgemm_gpu/split_embedding_codegen_lookup_invokers/lookup_adam.py' 2025-05-07T20:03:42.6531452Z INFO:wheel:adding 'fbgemm_gpu/split_embedding_codegen_lookup_invokers/lookup_args.py' 2025-05-07T20:03:42.6533231Z INFO:wheel:adding 'fbgemm_gpu/split_embedding_codegen_lookup_invokers/lookup_args_ssd.py' 2025-05-07T20:03:42.6536029Z INFO:wheel:adding 'fbgemm_gpu/split_embedding_codegen_lookup_invokers/lookup_lamb.py' 2025-05-07T20:03:42.6539055Z INFO:wheel:adding 'fbgemm_gpu/split_embedding_codegen_lookup_invokers/lookup_lars_sgd.py' 2025-05-07T20:03:42.6541800Z INFO:wheel:adding 'fbgemm_gpu/split_embedding_codegen_lookup_invokers/lookup_none.py' 2025-05-07T20:03:42.6544795Z INFO:wheel:adding 'fbgemm_gpu/split_embedding_codegen_lookup_invokers/lookup_partial_rowwise_adam.py' 2025-05-07T20:03:42.6547706Z INFO:wheel:adding 'fbgemm_gpu/split_embedding_codegen_lookup_invokers/lookup_partial_rowwise_lamb.py' 2025-05-07T20:03:42.6550620Z INFO:wheel:adding 
'fbgemm_gpu/split_embedding_codegen_lookup_invokers/lookup_rowwise_adagrad.py' 2025-05-07T20:03:42.6553789Z INFO:wheel:adding 'fbgemm_gpu/split_embedding_codegen_lookup_invokers/lookup_rowwise_adagrad_ssd.py' 2025-05-07T20:03:42.6557029Z INFO:wheel:adding 'fbgemm_gpu/split_embedding_codegen_lookup_invokers/lookup_rowwise_adagrad_with_counter.py' 2025-05-07T20:03:42.6559756Z INFO:wheel:adding 'fbgemm_gpu/split_embedding_codegen_lookup_invokers/lookup_sgd.py' 2025-05-07T20:03:42.6561619Z INFO:wheel:adding 'fbgemm_gpu/split_embedding_optimizer_codegen/optimizer_args.py' 2025-05-07T20:03:42.6564060Z INFO:wheel:adding 'fbgemm_gpu/split_embedding_optimizer_codegen/split_embedding_optimizer_rowwise_adagrad.py' 2025-05-07T20:03:42.6565231Z INFO:wheel:adding 'fbgemm_gpu/tbe/__init__.py' 2025-05-07T20:03:42.6567211Z INFO:wheel:adding 'fbgemm_gpu/tbe/bench/__init__.py' 2025-05-07T20:03:42.6569178Z INFO:wheel:adding 'fbgemm_gpu/tbe/bench/bench_config.py' 2025-05-07T20:03:42.6573941Z INFO:wheel:adding 'fbgemm_gpu/tbe/bench/bench_runs.py' 2025-05-07T20:03:42.6576225Z INFO:wheel:adding 'fbgemm_gpu/tbe/bench/eeg_cli.py' 2025-05-07T20:03:42.6578374Z INFO:wheel:adding 'fbgemm_gpu/tbe/bench/embedding_ops_common_config.py' 2025-05-07T20:03:42.6580096Z INFO:wheel:adding 'fbgemm_gpu/tbe/bench/eval_compression.py' 2025-05-07T20:03:42.6581518Z INFO:wheel:adding 'fbgemm_gpu/tbe/bench/reporter.py' 2025-05-07T20:03:42.6584518Z INFO:wheel:adding 'fbgemm_gpu/tbe/bench/tbe_data_config.py' 2025-05-07T20:03:42.6587048Z INFO:wheel:adding 'fbgemm_gpu/tbe/bench/tbe_data_config_loader.py' 2025-05-07T20:03:42.6589354Z INFO:wheel:adding 'fbgemm_gpu/tbe/bench/tbe_data_config_param_models.py' 2025-05-07T20:03:42.6590927Z INFO:wheel:adding 'fbgemm_gpu/tbe/bench/utils.py' 2025-05-07T20:03:42.6592377Z INFO:wheel:adding 'fbgemm_gpu/tbe/cache/__init__.py' 2025-05-07T20:03:42.6594077Z INFO:wheel:adding 'fbgemm_gpu/tbe/cache/split_embeddings_cache_ops.py' 2025-05-07T20:03:42.6595413Z INFO:wheel:adding 
'fbgemm_gpu/tbe/ssd/__init__.py' 2025-05-07T20:03:42.6596645Z INFO:wheel:adding 'fbgemm_gpu/tbe/ssd/common.py' 2025-05-07T20:03:42.6603091Z INFO:wheel:adding 'fbgemm_gpu/tbe/ssd/inference.py' 2025-05-07T20:03:42.6628795Z INFO:wheel:adding 'fbgemm_gpu/tbe/ssd/training.py' 2025-05-07T20:03:42.6631329Z INFO:wheel:adding 'fbgemm_gpu/tbe/ssd/utils/__init__.py' 2025-05-07T20:03:42.6634090Z INFO:wheel:adding 'fbgemm_gpu/tbe/ssd/utils/partially_materialized_tensor.py' 2025-05-07T20:03:42.6635501Z INFO:wheel:adding 'fbgemm_gpu/tbe/stats/__init__.py' 2025-05-07T20:03:42.6638051Z INFO:wheel:adding 'fbgemm_gpu/tbe/stats/bench_params_reporter.py' 2025-05-07T20:03:42.6639655Z INFO:wheel:adding 'fbgemm_gpu/tbe/utils/__init__.py' 2025-05-07T20:03:42.6641060Z INFO:wheel:adding 'fbgemm_gpu/tbe/utils/common.py' 2025-05-07T20:03:42.6642573Z INFO:wheel:adding 'fbgemm_gpu/tbe/utils/offsets.py' 2025-05-07T20:03:42.6644940Z INFO:wheel:adding 'fbgemm_gpu/tbe/utils/quantize.py' 2025-05-07T20:03:42.6650312Z INFO:wheel:adding 'fbgemm_gpu/tbe/utils/requests.py' 2025-05-07T20:03:42.6652279Z INFO:wheel:adding 'fbgemm_gpu/triton/__init__.py' 2025-05-07T20:03:42.6653896Z INFO:wheel:adding 'fbgemm_gpu/triton/common.py' 2025-05-07T20:03:42.6661246Z INFO:wheel:adding 'fbgemm_gpu/triton/quantize.py' 2025-05-07T20:03:42.6665677Z INFO:wheel:adding 'fbgemm_gpu/triton/quantize_ref.py' 2025-05-07T20:03:42.6667443Z INFO:wheel:adding 'fbgemm_gpu/triton/jagged/__init__.py' 2025-05-07T20:03:42.6675334Z INFO:wheel:adding 'fbgemm_gpu/triton/jagged/triton_jagged_tensor_ops.py' 2025-05-07T20:03:42.6677441Z INFO:wheel:adding 'fbgemm_gpu/utils/__init__.py' 2025-05-07T20:03:42.6679541Z INFO:wheel:adding 'fbgemm_gpu/utils/filestore.py' 2025-05-07T20:03:42.6681007Z INFO:wheel:adding 'fbgemm_gpu/utils/loader.py' 2025-05-07T20:03:42.6683015Z INFO:wheel:adding 'fbgemm_gpu/utils/torch_library.py' 2025-05-07T20:03:42.6685409Z INFO:wheel:adding 'fbgemm_gpu_nightly-2025.5.7.dist-info/METADATA' 2025-05-07T20:03:42.6686386Z 
INFO:wheel:adding 'fbgemm_gpu_nightly-2025.5.7.dist-info/WHEEL' 2025-05-07T20:03:42.6713260Z INFO:wheel:adding 'fbgemm_gpu_nightly-2025.5.7.dist-info/top_level.txt' 2025-05-07T20:03:42.6719405Z INFO:wheel:adding 'fbgemm_gpu_nightly-2025.5.7.dist-info/RECORD' 2025-05-07T20:03:42.6722247Z INFO:root:removing _skbuild/linux-x86_64-3.13/setuptools/bdist.linux-x86_64/wheel 2025-05-07T20:03:42.7553924Z ╒════════════════════════════╤════════════════════════════════════════════════╕ 2025-05-07T20:03:42.7554476Z │ │ Version │ 2025-05-07T20:03:42.7554993Z ╞════════════════════════════╪════════════════════════════════════════════════╡ 2025-05-07T20:03:42.7555674Z │ PyTorch │ 2.8.0.dev20250507+cu118 │ 2025-05-07T20:03:42.7556232Z ├────────────────────────────┼────────────────────────────────────────────────┤ 2025-05-07T20:03:42.7556772Z │ CUDA (Declared by PyTorch) │ 11.8 │ 2025-05-07T20:03:42.7557420Z ├────────────────────────────┼────────────────────────────────────────────────┤ 2025-05-07T20:03:42.7557926Z │ CUDA (Actual) │ nvcc: NVIDIA (R) Cuda compiler driver │ 2025-05-07T20:03:42.7558464Z │ │ Copyright (c) 2005-2022 NVIDIA Corporation │ 2025-05-07T20:03:42.7558965Z │ │ Built on Wed_Sep_21_10:33:58_PDT_2022 │ 2025-05-07T20:03:42.7559442Z │ │ Cuda compilation tools, release 11.8, V11.8.89 │ 2025-05-07T20:03:42.7559926Z │ │ Build cuda_11.8.r11.8/compiler.31833905_0 │ 2025-05-07T20:03:42.7560449Z ╘════════════════════════════╧════════════════════════════════════════════════╛ 2025-05-07T20:03:43.0542486Z Successfully built fbgemm_gpu_nightly-2025.5.7-cp313-cp313-manylinux_2_28_x86_64.whl 2025-05-07T20:03:43.1434407Z 2025-05-07T20:03:43.1584893Z ################################################################################ 2025-05-07T20:03:43.1585436Z [CHECK] BUILT LIBRARY: ./_skbuild/linux-x86_64-3.13/cmake-build/asmjit.so 2025-05-07T20:03:43.1585880Z [CHECK] Listing out library size: 2025-05-07T20:03:43.1586286Z + du -h --block-size=1M 
./_skbuild/linux-x86_64-3.13/cmake-build/asmjit.so 2025-05-07T20:03:43.1586652Z 2025-05-07T20:03:43.1595340Z 1 ./_skbuild/linux-x86_64-3.13/cmake-build/asmjit.so 2025-05-07T20:03:43.1596087Z 2025-05-07T20:03:43.1600319Z [CHECK] Listing out the GLIBC versions referenced by: ./_skbuild/linux-x86_64-3.13/cmake-build/asmjit.so 2025-05-07T20:03:43.1602714Z + objdump -TC ./_skbuild/linux-x86_64-3.13/cmake-build/asmjit.so | grep GLIBC_ | sed 's/.*GLIBC_\([.0-9]*\).*/GLIBC_\1/g' | sort -Vu | cat 2025-05-07T20:03:43.1603286Z 2025-05-07T20:03:43.1669073Z GLIBC_2.2.5 2025-05-07T20:03:43.1669850Z GLIBC_2.14 2025-05-07T20:03:43.1670235Z 2025-05-07T20:03:43.1670247Z 2025-05-07T20:03:43.1671284Z [CHECK] Listing out the GLIBCXX versions referenced by: ./_skbuild/linux-x86_64-3.13/cmake-build/asmjit.so 2025-05-07T20:03:43.1672821Z + objdump -TC ./_skbuild/linux-x86_64-3.13/cmake-build/asmjit.so | grep GLIBCXX_ | sed 's/.*GLIBCXX_\([.0-9]*\).*/GLIBCXX_\1/g' | sort -Vu | cat 2025-05-07T20:03:43.1673510Z 2025-05-07T20:03:43.1734756Z GLIBCXX_3.4 2025-05-07T20:03:43.1735165Z 2025-05-07T20:03:43.1741071Z 2025-05-07T20:03:43.1755982Z + nm -gDC ./_skbuild/linux-x86_64-3.13/cmake-build/asmjit.so > /tmp/tmp.PAAbI5HYVe.symbols.txt 2025-05-07T20:03:43.1757223Z 2025-05-07T20:03:43.1785288Z 2025-05-07T20:03:43.1810562Z [CHECK] Total Number of symbols: 841 2025-05-07T20:03:43.1825923Z [CHECK] Number of fbgemm symbols: 0 2025-05-07T20:03:43.1840062Z + nm -gDCu ./_skbuild/linux-x86_64-3.13/cmake-build/asmjit.so > /tmp/tmp.zMoTOvUN7O.usymbols.txt 2025-05-07T20:03:43.1841325Z 2025-05-07T20:03:43.1860532Z 2025-05-07T20:03:43.1895169Z [CHECK] Listing out undefined symbols (51 total): 2025-05-07T20:03:43.1908082Z U _Unwind_Resume@GCC_3.0 2025-05-07T20:03:43.1908437Z U __cxa_begin_catch@CXXABI_1.3 2025-05-07T20:03:43.1908780Z U __cxa_guard_acquire@CXXABI_1.3 2025-05-07T20:03:43.1909143Z U __cxa_guard_release@CXXABI_1.3 2025-05-07T20:03:43.1909465Z U __errno_location@GLIBC_2.2.5 
2025-05-07T20:03:43.1909810Z U __gxx_personality_v0@CXXABI_1.3 2025-05-07T20:03:43.1910150Z U abort@GLIBC_2.2.5 2025-05-07T20:03:43.1910443Z U bcmp@GLIBC_2.2.5 2025-05-07T20:03:43.1910744Z U close@GLIBC_2.2.5 2025-05-07T20:03:43.1911028Z U fputs@GLIBC_2.2.5 2025-05-07T20:03:43.1911327Z U free@GLIBC_2.2.5 2025-05-07T20:03:43.1911767Z U ftruncate64@GLIBC_2.2.5 2025-05-07T20:03:43.1912082Z U fwrite@GLIBC_2.2.5 2025-05-07T20:03:43.1912364Z U getenv@GLIBC_2.2.5 2025-05-07T20:03:43.1912672Z U getpagesize@GLIBC_2.2.5 2025-05-07T20:03:43.1913088Z U madvise@GLIBC_2.2.5 2025-05-07T20:03:43.1913372Z U malloc@GLIBC_2.2.5 2025-05-07T20:03:43.1913757Z U memcmp@GLIBC_2.2.5 2025-05-07T20:03:43.1914037Z U memcpy@GLIBC_2.14 2025-05-07T20:03:43.1914327Z U memmove@GLIBC_2.2.5 2025-05-07T20:03:43.1914612Z U memset@GLIBC_2.2.5 2025-05-07T20:03:43.1914912Z U mmap@GLIBC_2.2.5 2025-05-07T20:03:43.1915191Z U mprotect@GLIBC_2.2.5 2025-05-07T20:03:43.1915492Z U munmap@GLIBC_2.2.5 2025-05-07T20:03:43.1915774Z U open64@GLIBC_2.2.5 2025-05-07T20:03:43.1916097Z U operator delete(void*)@GLIBCXX_3.4 2025-05-07T20:03:43.1916454Z U pthread_mutex_destroy@GLIBC_2.2.5 2025-05-07T20:03:43.1916789Z U pthread_mutex_lock@GLIBC_2.2.5 2025-05-07T20:03:43.1917139Z U pthread_mutex_unlock@GLIBC_2.2.5 2025-05-07T20:03:43.1917454Z U read@GLIBC_2.2.5 2025-05-07T20:03:43.1917748Z U realloc@GLIBC_2.2.5 2025-05-07T20:03:43.1918038Z U shm_open@GLIBC_2.2.5 2025-05-07T20:03:43.1918344Z U shm_unlink@GLIBC_2.2.5 2025-05-07T20:03:43.1918641Z U snprintf@GLIBC_2.2.5 2025-05-07T20:03:43.1918979Z U std::terminate()@GLIBCXX_3.4 2025-05-07T20:03:43.1919303Z U stderr@GLIBC_2.2.5 2025-05-07T20:03:43.1919586Z U strcmp@GLIBC_2.2.5 2025-05-07T20:03:43.1919883Z U strlen@GLIBC_2.2.5 2025-05-07T20:03:43.1920164Z U strtol@GLIBC_2.2.5 2025-05-07T20:03:43.1920460Z U syscall@GLIBC_2.2.5 2025-05-07T20:03:43.1920749Z U sysconf@GLIBC_2.2.5 2025-05-07T20:03:43.1921050Z U uname@GLIBC_2.2.5 2025-05-07T20:03:43.1921330Z U unlink@GLIBC_2.2.5 
2025-05-07T20:03:43.1921634Z U vsnprintf@GLIBC_2.2.5 2025-05-07T20:03:43.1922005Z U vtable for __cxxabiv1::__class_type_info@CXXABI_1.3 2025-05-07T20:03:43.1922433Z U vtable for __cxxabiv1::__si_class_type_info@CXXABI_1.3 2025-05-07T20:03:43.1922882Z U vtable for __cxxabiv1::__vmi_class_type_info@CXXABI_1.3 2025-05-07T20:03:43.1923269Z w _ITM_deregisterTMCloneTable 2025-05-07T20:03:43.1923702Z w _ITM_registerTMCloneTable 2025-05-07T20:03:43.1924012Z w __cxa_finalize@GLIBC_2.2.5 2025-05-07T20:03:43.1924331Z w __gmon_start__ 2025-05-07T20:03:43.1924681Z [CHECK] Listing out external shared libraries linked: 2025-05-07T20:03:43.1925178Z + ldd ./_skbuild/linux-x86_64-3.13/cmake-build/asmjit.so 2025-05-07T20:03:43.1925416Z 2025-05-07T20:03:43.1955616Z linux-vdso.so.1 (0x00007ffe165fd000) 2025-05-07T20:03:43.1955973Z libtorch_cpu.so => not found 2025-05-07T20:03:43.1956270Z libtorch_cuda.so => not found 2025-05-07T20:03:43.1956542Z libtorch.so => not found 2025-05-07T20:03:43.1956883Z libstdc++.so.6 => /lib64/libstdc++.so.6 (0x00007f15bb6ca000) 2025-05-07T20:03:43.1957324Z libgomp.so.1 => /lib64/libgomp.so.1 (0x00007f15bb674000) 2025-05-07T20:03:43.1957726Z librt.so.1 => /lib64/librt.so.1 (0x00007f15bb66d000) 2025-05-07T20:03:43.1958133Z libgcc_s.so.1 => /lib64/libgcc_s.so.1 (0x00007f15bb63f000) 2025-05-07T20:03:43.1958570Z libpthread.so.0 => /lib64/libpthread.so.0 (0x00007f15bb63a000) 2025-05-07T20:03:43.1959003Z libc.so.6 => /lib64/libc.so.6 (0x00007f15bb432000) 2025-05-07T20:03:43.1959357Z libm.so.6 => /lib64/libm.so.6 (0x00007f15bb357000) 2025-05-07T20:03:43.1959730Z /lib64/ld-linux-x86-64.so.2 (0x00007f15bb9aa000) 2025-05-07T20:03:43.1959971Z 2025-05-07T20:03:43.1960096Z [CHECK] Displaying ELF information: 2025-05-07T20:03:43.1960579Z + readelf -d ./_skbuild/linux-x86_64-3.13/cmake-build/asmjit.so 2025-05-07T20:03:43.1960880Z 2025-05-07T20:03:43.1994988Z 2025-05-07T20:03:43.1995354Z Dynamic section at offset 0x74dd0 contains 35 entries: 2025-05-07T20:03:43.1995750Z 
Tag Type Name/Value 2025-05-07T20:03:43.1996224Z 0x0000000000000001 (NEEDED) Shared library: [libtorch_cpu.so] 2025-05-07T20:03:43.1996886Z 0x0000000000000001 (NEEDED) Shared library: [libtorch_cuda.so] 2025-05-07T20:03:43.1997417Z 0x0000000000000001 (NEEDED) Shared library: [libtorch.so] 2025-05-07T20:03:43.1997956Z 0x0000000000000001 (NEEDED) Shared library: [libstdc++.so.6] 2025-05-07T20:03:43.1998476Z 0x0000000000000001 (NEEDED) Shared library: [libgomp.so.1] 2025-05-07T20:03:43.1998996Z 0x0000000000000001 (NEEDED) Shared library: [librt.so.1] 2025-05-07T20:03:43.1999520Z 0x0000000000000001 (NEEDED) Shared library: [libgcc_s.so.1] 2025-05-07T20:03:43.2000041Z 0x0000000000000001 (NEEDED) Shared library: [libpthread.so.0] 2025-05-07T20:03:43.2000573Z 0x0000000000000001 (NEEDED) Shared library: [libc.so.6] 2025-05-07T20:03:43.2001067Z 0x000000000000000e (SONAME) Library soname: [asmjit.so] 2025-05-07T20:03:43.2001497Z 0x000000000000000c (INIT) 0x19000 2025-05-07T20:03:43.2001840Z 0x000000000000000d (FINI) 0x56a1c 2025-05-07T20:03:43.2002397Z 0x0000000000000019 (INIT_ARRAY) 0x74ff8 2025-05-07T20:03:43.2002782Z 0x000000000000001b (INIT_ARRAYSZ) 8 (bytes) 2025-05-07T20:03:43.2003153Z 0x000000000000001a (FINI_ARRAY) 0x75000 2025-05-07T20:03:43.2003535Z 0x000000000000001c (FINI_ARRAYSZ) 8 (bytes) 2025-05-07T20:03:43.2003898Z 0x000000006ffffef5 (GNU_HASH) 0x200 2025-05-07T20:03:43.2004269Z 0x0000000000000005 (STRTAB) 0x7120 2025-05-07T20:03:43.2004612Z 0x0000000000000006 (SYMTAB) 0x2230 2025-05-07T20:03:43.2005008Z 0x000000000000000a (STRSZ) 48790 (bytes) 2025-05-07T20:03:43.2005417Z 0x000000000000000b (SYMENT) 24 (bytes) 2025-05-07T20:03:43.2005780Z 0x0000000000000003 (PLTGOT) 0x76050 2025-05-07T20:03:43.2006181Z 0x0000000000000002 (PLTRELSZ) 8472 (bytes) 2025-05-07T20:03:43.2006551Z 0x0000000000000014 (PLTREL) RELA 2025-05-07T20:03:43.2006924Z 0x0000000000000017 (JMPREL) 0x16a58 2025-05-07T20:03:43.2007276Z 0x0000000000000007 (RELA) 0x13710 
2025-05-07T20:03:43.2007672Z 0x0000000000000008 (RELASZ) 13128 (bytes) 2025-05-07T20:03:43.2008191Z 0x0000000000000009 (RELAENT) 24 (bytes) 2025-05-07T20:03:43.2008571Z 0x0000000000000018 (BIND_NOW) 2025-05-07T20:03:43.2008916Z 0x000000006ffffffb (FLAGS_1) Flags: NOW 2025-05-07T20:03:43.2009265Z 0x000000006ffffffe (VERNEED) 0x13650 2025-05-07T20:03:43.2009632Z 0x000000006fffffff (VERNEEDNUM) 5 2025-05-07T20:03:43.2009980Z 0x000000006ffffff0 (VERSYM) 0x12fb6 2025-05-07T20:03:43.2010360Z 0x000000006ffffff9 (RELACOUNT) 3 2025-05-07T20:03:43.2010686Z 0x0000000000000000 (NULL) 0x0 2025-05-07T20:03:43.2010935Z 2025-05-07T20:03:43.2011061Z ################################################################################ 2025-05-07T20:03:43.2011300Z 2025-05-07T20:03:43.2011305Z 2025-05-07T20:03:43.2011457Z ################################################################################ 2025-05-07T20:03:43.2011956Z [CHECK] BUILT LIBRARY: ./_skbuild/linux-x86_64-3.13/cmake-build/fbgemm_gpu_config.so 2025-05-07T20:03:43.2012477Z [CHECK] Listing out library size: 2025-05-07T20:03:43.2012938Z + du -h --block-size=1M ./_skbuild/linux-x86_64-3.13/cmake-build/fbgemm_gpu_config.so 2025-05-07T20:03:43.2013332Z 2025-05-07T20:03:43.2013521Z 1 ./_skbuild/linux-x86_64-3.13/cmake-build/fbgemm_gpu_config.so 2025-05-07T20:03:43.2013822Z 2025-05-07T20:03:43.2014270Z [CHECK] Listing out the GLIBC versions referenced by: ./_skbuild/linux-x86_64-3.13/cmake-build/fbgemm_gpu_config.so 2025-05-07T20:03:43.2015246Z + objdump -TC ./_skbuild/linux-x86_64-3.13/cmake-build/fbgemm_gpu_config.so | grep GLIBC_ | sed 's/.*GLIBC_\([.0-9]*\).*/GLIBC_\1/g' | sort -Vu | cat 2025-05-07T20:03:43.2015854Z 2025-05-07T20:03:43.2071039Z GLIBC_2.2.5 2025-05-07T20:03:43.2071383Z GLIBC_2.14 2025-05-07T20:03:43.2071531Z 2025-05-07T20:03:43.2071695Z 2025-05-07T20:03:43.2072126Z [CHECK] Listing out the GLIBCXX versions referenced by: ./_skbuild/linux-x86_64-3.13/cmake-build/fbgemm_gpu_config.so 
2025-05-07T20:03:43.2073294Z + objdump -TC ./_skbuild/linux-x86_64-3.13/cmake-build/fbgemm_gpu_config.so | grep GLIBCXX_ | sed 's/.*GLIBCXX_\([.0-9]*\).*/GLIBCXX_\1/g' | sort -Vu | cat 2025-05-07T20:03:43.2073926Z 2025-05-07T20:03:43.2130853Z GLIBCXX_3.4 2025-05-07T20:03:43.2131237Z GLIBCXX_3.4.9 2025-05-07T20:03:43.2131801Z GLIBCXX_3.4.21 2025-05-07T20:03:43.2131997Z 2025-05-07T20:03:43.2132001Z 2025-05-07T20:03:43.2155978Z + nm -gDC ./_skbuild/linux-x86_64-3.13/cmake-build/fbgemm_gpu_config.so > /tmp/tmp.mSkaLx5kan.symbols.txt 2025-05-07T20:03:43.2156470Z 2025-05-07T20:03:43.2177684Z 2025-05-07T20:03:43.2203194Z [CHECK] Total Number of symbols: 116 2025-05-07T20:03:43.2216569Z [CHECK] Number of fbgemm symbols: 4 2025-05-07T20:03:43.2234315Z + nm -gDCu ./_skbuild/linux-x86_64-3.13/cmake-build/fbgemm_gpu_config.so > /tmp/tmp.8M7jbWExQN.usymbols.txt 2025-05-07T20:03:43.2246370Z 2025-05-07T20:03:43.2249947Z 2025-05-07T20:03:43.2278907Z [CHECK] Listing out undefined symbols (59 total): 2025-05-07T20:03:43.2298280Z U VTT for std::__cxx11::basic_ostringstream, std::allocator >@GLIBCXX_3.4.21 2025-05-07T20:03:43.2298977Z U _Unwind_Resume@GCC_3.0 2025-05-07T20:03:43.2299446Z U __cxa_atexit@GLIBC_2.2.5 2025-05-07T20:03:43.2299788Z U __cxa_begin_catch@CXXABI_1.3 2025-05-07T20:03:43.2300131Z U __cxa_end_catch@CXXABI_1.3 2025-05-07T20:03:43.2300468Z U __cxa_guard_acquire@CXXABI_1.3 2025-05-07T20:03:43.2300793Z U __cxa_guard_release@CXXABI_1.3 2025-05-07T20:03:43.2301147Z U __cxa_rethrow@CXXABI_1.3 2025-05-07T20:03:43.2301478Z U __errno_location@GLIBC_2.2.5 2025-05-07T20:03:43.2301832Z U __gxx_personality_v0@CXXABI_1.3 2025-05-07T20:03:43.2302393Z U c10::BoolType::get() 2025-05-07T20:03:43.2302723Z U c10::StringType::get() 2025-05-07T20:03:43.2303070Z U c10::UndefinedTensorImpl::_singleton 2025-05-07T20:03:43.2303850Z U c10::detail::infer_schema::make_function_schema(c10::ArrayRef, c10::ArrayRef) 2025-05-07T20:03:43.2305321Z U c10::detail::torchInternalAssertFail(char const*, 
char const*, unsigned int, char const*, std::__cxx11::basic_string, std::allocator > const&) 2025-05-07T20:03:43.2306132Z U getenv@GLIBC_2.2.5 2025-05-07T20:03:43.2306475Z U memcmp@GLIBC_2.2.5 2025-05-07T20:03:43.2306809Z U memcpy@GLIBC_2.14 2025-05-07T20:03:43.2307109Z U memset@GLIBC_2.2.5 2025-05-07T20:03:43.2307453Z U operator delete(void*)@GLIBCXX_3.4 2025-05-07T20:03:43.2307805Z U operator new(unsigned long)@GLIBCXX_3.4 2025-05-07T20:03:43.2308362Z U std::_Rb_tree_decrement(std::_Rb_tree_node_base*)@GLIBCXX_3.4 2025-05-07T20:03:43.2309112Z U std::_Rb_tree_insert_and_rebalance(bool, std::_Rb_tree_node_base*, std::_Rb_tree_node_base*, std::_Rb_tree_node_base&)@GLIBCXX_3.4 2025-05-07T20:03:43.2309910Z U std::__cxx11::basic_ostringstream, std::allocator >::basic_ostringstream() 2025-05-07T20:03:43.2310753Z U std::__cxx11::basic_ostringstream, std::allocator >::~basic_ostringstream()@GLIBCXX_3.4.21 2025-05-07T20:03:43.2311605Z U std::__cxx11::basic_string, std::allocator >::_M_append(char const*, unsigned long)@GLIBCXX_3.4.21 2025-05-07T20:03:43.2312670Z U std::__cxx11::basic_string, std::allocator >::_M_assign(std::__cxx11::basic_string, std::allocator > const&) 2025-05-07T20:03:43.2314021Z U std::__cxx11::basic_string, std::allocator >::_M_construct(unsigned long, char)@GLIBCXX_3.4.21 2025-05-07T20:03:43.2314961Z U std::__cxx11::basic_string, std::allocator >::_M_create(unsigned long&, unsigned long)@GLIBCXX_3.4.21 2025-05-07T20:03:43.2316029Z U std::__cxx11::basic_string, std::allocator >::_M_replace(unsigned long, unsigned long, char const*, unsigned long)@GLIBCXX_3.4.21 2025-05-07T20:03:43.2317074Z U std::__cxx11::basic_string, std::allocator >::reserve(unsigned long)@GLIBCXX_3.4.21 2025-05-07T20:03:43.2317727Z U std::__throw_invalid_argument(char const*)@GLIBCXX_3.4 2025-05-07T20:03:43.2318179Z U std::__throw_length_error(char const*)@GLIBCXX_3.4 2025-05-07T20:03:43.2318598Z U std::__throw_logic_error(char const*)@GLIBCXX_3.4 2025-05-07T20:03:43.2318993Z U 
std::__throw_out_of_range(char const*)@GLIBCXX_3.4 2025-05-07T20:03:43.2319611Z U std::basic_ios >::clear(std::_Ios_Iostate)@GLIBCXX_3.4 2025-05-07T20:03:43.2320493Z U std::basic_ostream >& std::__ostream_insert >(std::basic_ostream >&, char const*, long)@GLIBCXX_3.4.9 2025-05-07T20:03:43.2321263Z U std::ios_base::Init::Init()@GLIBCXX_3.4 2025-05-07T20:03:43.2321613Z U std::ios_base::Init::~Init()@GLIBCXX_3.4 2025-05-07T20:03:43.2321943Z U std::ios_base::~ios_base()@GLIBCXX_3.4 2025-05-07T20:03:43.2322288Z U std::locale::~locale()@GLIBCXX_3.4 2025-05-07T20:03:43.2322595Z U std::terminate()@GLIBCXX_3.4 2025-05-07T20:03:43.2322910Z U strlen@GLIBC_2.2.5 2025-05-07T20:03:43.2323227Z U strtol@GLIBC_2.2.5 2025-05-07T20:03:43.2323532Z U torch::CppFunction::~CppFunction() 2025-05-07T20:03:43.2324350Z U torch::Library::Library(torch::Library::Kind, std::__cxx11::basic_string, std::allocator >, std::optional, char const*, unsigned int) 2025-05-07T20:03:43.2325568Z U torch::Library::_def(std::variant&&, torch::CppFunction&&, std::vector > const&) & 2025-05-07T20:03:43.2326575Z U torch::jit::parseSchemaOrName(std::__cxx11::basic_string, std::allocator > const&, bool) 2025-05-07T20:03:43.2327220Z U typeinfo for std::invalid_argument@GLIBCXX_3.4 2025-05-07T20:03:43.2327627Z U vtable for __cxxabiv1::__class_type_info@CXXABI_1.3 2025-05-07T20:03:43.2328081Z U vtable for __cxxabiv1::__function_type_info@CXXABI_1.3 2025-05-07T20:03:43.2328521Z U vtable for __cxxabiv1::__si_class_type_info@CXXABI_1.3 2025-05-07T20:03:43.2329099Z U vtable for std::__cxx11::basic_stringbuf, std::allocator >@GLIBCXX_3.4.21 2025-05-07T20:03:43.2329746Z U vtable for std::basic_streambuf >@GLIBCXX_3.4 2025-05-07T20:03:43.2330171Z w _ITM_deregisterTMCloneTable 2025-05-07T20:03:43.2330510Z w _ITM_registerTMCloneTable 2025-05-07T20:03:43.2330821Z w __cxa_finalize@GLIBC_2.2.5 2025-05-07T20:03:43.2331216Z w __gmon_start__ 2025-05-07T20:03:43.2331513Z w __pthread_key_create@GLIBC_2.2.5 
2025-05-07T20:03:43.2331868Z [CHECK] Listing out external shared libraries linked: 2025-05-07T20:03:43.2332326Z + ldd ./_skbuild/linux-x86_64-3.13/cmake-build/fbgemm_gpu_config.so 2025-05-07T20:03:43.2332619Z 2025-05-07T20:03:43.2349227Z linux-vdso.so.1 (0x00007ffd9af98000) 2025-05-07T20:03:43.2350259Z libtorch.so => not found 2025-05-07T20:03:43.2351005Z libc10.so => not found 2025-05-07T20:03:43.2351701Z libtorch_cpu.so => not found 2025-05-07T20:03:43.2352516Z libtorch_cuda.so => not found 2025-05-07T20:03:43.2353743Z libstdc++.so.6 => /lib64/libstdc++.so.6 (0x00007f5b18901000) 2025-05-07T20:03:43.2355007Z libgomp.so.1 => /lib64/libgomp.so.1 (0x00007f5b188a9000) 2025-05-07T20:03:43.2356205Z libgcc_s.so.1 => /lib64/libgcc_s.so.1 (0x00007f5b1887b000) 2025-05-07T20:03:43.2357481Z libpthread.so.0 => /lib64/libpthread.so.0 (0x00007f5b18876000) 2025-05-07T20:03:43.2358670Z libc.so.6 => /lib64/libc.so.6 (0x00007f5b1866e000) 2025-05-07T20:03:43.2359432Z libm.so.6 => /lib64/libm.so.6 (0x00007f5b18593000) 2025-05-07T20:03:43.2359828Z /lib64/ld-linux-x86-64.so.2 (0x00007f5b18b74000) 2025-05-07T20:03:43.2360081Z 2025-05-07T20:03:43.2360203Z [CHECK] Displaying ELF information: 2025-05-07T20:03:43.2360666Z + readelf -d ./_skbuild/linux-x86_64-3.13/cmake-build/fbgemm_gpu_config.so 2025-05-07T20:03:43.2361013Z 2025-05-07T20:03:43.2390864Z 2025-05-07T20:03:43.2391603Z Dynamic section at offset 0x8aa8 contains 35 entries: 2025-05-07T20:03:43.2392990Z Tag Type Name/Value 2025-05-07T20:03:43.2394336Z 0x0000000000000001 (NEEDED) Shared library: [libtorch.so] 2025-05-07T20:03:43.2395804Z 0x0000000000000001 (NEEDED) Shared library: [libc10.so] 2025-05-07T20:03:43.2397325Z 0x0000000000000001 (NEEDED) Shared library: [libtorch_cpu.so] 2025-05-07T20:03:43.2398899Z 0x0000000000000001 (NEEDED) Shared library: [libtorch_cuda.so] 2025-05-07T20:03:43.2400276Z 0x0000000000000001 (NEEDED) Shared library: [libstdc++.so.6] 2025-05-07T20:03:43.2400835Z 0x0000000000000001 (NEEDED) Shared library: 
[libgomp.so.1] 2025-05-07T20:03:43.2401345Z 0x0000000000000001 (NEEDED) Shared library: [libgcc_s.so.1] 2025-05-07T20:03:43.2401881Z 0x0000000000000001 (NEEDED) Shared library: [libpthread.so.0] 2025-05-07T20:03:43.2402606Z 0x0000000000000001 (NEEDED) Shared library: [libc.so.6] 2025-05-07T20:03:43.2403165Z 0x000000000000000e (SONAME) Library soname: [fbgemm_gpu_config.so] 2025-05-07T20:03:43.2403819Z 0x000000000000000c (INIT) 0x4000 2025-05-07T20:03:43.2404161Z 0x000000000000000d (FINI) 0x6890 2025-05-07T20:03:43.2404527Z 0x0000000000000019 (INIT_ARRAY) 0x99c0 2025-05-07T20:03:43.2404877Z 0x000000000000001b (INIT_ARRAYSZ) 16 (bytes) 2025-05-07T20:03:43.2405243Z 0x000000000000001a (FINI_ARRAY) 0x99d0 2025-05-07T20:03:43.2405583Z 0x000000000000001c (FINI_ARRAYSZ) 8 (bytes) 2025-05-07T20:03:43.2405943Z 0x000000006ffffef5 (GNU_HASH) 0x200 2025-05-07T20:03:43.2406268Z 0x0000000000000005 (STRTAB) 0xff0 2025-05-07T20:03:43.2406593Z 0x0000000000000006 (SYMTAB) 0x4f8 2025-05-07T20:03:43.2406950Z 0x000000000000000a (STRSZ) 7890 (bytes) 2025-05-07T20:03:43.2407307Z 0x000000000000000b (SYMENT) 24 (bytes) 2025-05-07T20:03:43.2407659Z 0x0000000000000003 (PLTGOT) 0x9d28 2025-05-07T20:03:43.2408007Z 0x0000000000000002 (PLTRELSZ) 1632 (bytes) 2025-05-07T20:03:43.2408371Z 0x0000000000000014 (PLTREL) RELA 2025-05-07T20:03:43.2408691Z 0x0000000000000017 (JMPREL) 0x3520 2025-05-07T20:03:43.2409026Z 0x0000000000000007 (RELA) 0x3070 2025-05-07T20:03:43.2409369Z 0x0000000000000008 (RELASZ) 1200 (bytes) 2025-05-07T20:03:43.2409781Z 0x0000000000000009 (RELAENT) 24 (bytes) 2025-05-07T20:03:43.2410127Z 0x0000000000000018 (BIND_NOW) 2025-05-07T20:03:43.2410452Z 0x000000006ffffffb (FLAGS_1) Flags: NOW 2025-05-07T20:03:43.2410818Z 0x000000006ffffffe (VERNEED) 0x2fb0 2025-05-07T20:03:43.2411147Z 0x000000006fffffff (VERNEEDNUM) 4 2025-05-07T20:03:43.2411481Z 0x000000006ffffff0 (VERSYM) 0x2ec2 2025-05-07T20:03:43.2413342Z 0x000000006ffffff9 (RELACOUNT) 4 2025-05-07T20:03:43.2413673Z 
0x0000000000000000 (NULL) 0x0 2025-05-07T20:03:43.2413894Z 2025-05-07T20:03:43.2414025Z ################################################################################ 2025-05-07T20:03:43.2414263Z 2025-05-07T20:03:43.2414268Z 2025-05-07T20:03:43.2414381Z ################################################################################ 2025-05-07T20:03:43.2414892Z [CHECK] BUILT LIBRARY: ./_skbuild/linux-x86_64-3.13/cmake-build/fbgemm_gpu_tbe_cache.so 2025-05-07T20:03:43.2415376Z [CHECK] Listing out library size: 2025-05-07T20:03:43.2415862Z + du -h --block-size=1M ./_skbuild/linux-x86_64-3.13/cmake-build/fbgemm_gpu_tbe_cache.so 2025-05-07T20:03:43.2416229Z 2025-05-07T20:03:43.2416421Z 11 ./_skbuild/linux-x86_64-3.13/cmake-build/fbgemm_gpu_tbe_cache.so 2025-05-07T20:03:43.2416738Z 2025-05-07T20:03:43.2417127Z [CHECK] Listing out the GLIBC versions referenced by: ./_skbuild/linux-x86_64-3.13/cmake-build/fbgemm_gpu_tbe_cache.so 2025-05-07T20:03:43.2418105Z + objdump -TC ./_skbuild/linux-x86_64-3.13/cmake-build/fbgemm_gpu_tbe_cache.so | grep GLIBC_ | sed 's/.*GLIBC_\([.0-9]*\).*/GLIBC_\1/g' | sort -Vu | cat 2025-05-07T20:03:43.2418696Z 2025-05-07T20:03:43.2495335Z GLIBC_2.2.5 2025-05-07T20:03:43.2495691Z GLIBC_2.14 2025-05-07T20:03:43.2495817Z 2025-05-07T20:03:43.2495822Z 2025-05-07T20:03:43.2496241Z [CHECK] Listing out the GLIBCXX versions referenced by: ./_skbuild/linux-x86_64-3.13/cmake-build/fbgemm_gpu_tbe_cache.so 2025-05-07T20:03:43.2497288Z + objdump -TC ./_skbuild/linux-x86_64-3.13/cmake-build/fbgemm_gpu_tbe_cache.so | grep GLIBCXX_ | sed 's/.*GLIBCXX_\([.0-9]*\).*/GLIBCXX_\1/g' | sort -Vu | cat 2025-05-07T20:03:43.2497896Z 2025-05-07T20:03:43.2581907Z GLIBCXX_3.4 2025-05-07T20:03:43.2582307Z GLIBCXX_3.4.9 2025-05-07T20:03:43.2582649Z GLIBCXX_3.4.11 2025-05-07T20:03:43.2582889Z GLIBCXX_3.4.20 2025-05-07T20:03:43.2583099Z GLIBCXX_3.4.21 2025-05-07T20:03:43.2584862Z 2025-05-07T20:03:43.2584923Z 2025-05-07T20:03:43.2600947Z + nm -gDC 
./_skbuild/linux-x86_64-3.13/cmake-build/fbgemm_gpu_tbe_cache.so > /tmp/tmp.Qstr1DPXAG.symbols.txt 2025-05-07T20:03:43.2601445Z 2025-05-07T20:03:43.2650636Z 2025-05-07T20:03:43.2674767Z [CHECK] Total Number of symbols: 819 2025-05-07T20:03:43.2691261Z [CHECK] Number of fbgemm symbols: 73 2025-05-07T20:03:43.2707014Z + nm -gDCu ./_skbuild/linux-x86_64-3.13/cmake-build/fbgemm_gpu_tbe_cache.so > /tmp/tmp.lfuEL5SDJl.usymbols.txt 2025-05-07T20:03:43.2708488Z 2025-05-07T20:03:43.2722465Z 2025-05-07T20:03:43.2750501Z [CHECK] Listing out undefined symbols (152 total): 2025-05-07T20:03:43.2772479Z U VTT for std::__cxx11::basic_ostringstream, std::allocator >@GLIBCXX_3.4.21 2025-05-07T20:03:43.2773133Z U _Unwind_Resume@GCC_3.0 2025-05-07T20:03:43.2773519Z U __cudaPopCallConfiguration@libcudart.so.11.0 2025-05-07T20:03:43.2773925Z U __cudaPushCallConfiguration@libcudart.so.11.0 2025-05-07T20:03:43.2774338Z U __cudaRegisterFatBinary@libcudart.so.11.0 2025-05-07T20:03:43.2774722Z U __cudaRegisterFatBinaryEnd@libcudart.so.11.0 2025-05-07T20:03:43.2775131Z U __cudaRegisterFunction@libcudart.so.11.0 2025-05-07T20:03:43.2775492Z U __cudaRegisterVar@libcudart.so.11.0 2025-05-07T20:03:43.2775892Z U __cudaUnregisterFatBinary@libcudart.so.11.0 2025-05-07T20:03:43.2776265Z U __cxa_atexit@GLIBC_2.2.5 2025-05-07T20:03:43.2776581Z U __cxa_begin_catch@CXXABI_1.3 2025-05-07T20:03:43.2777062Z U __cxa_end_catch@CXXABI_1.3 2025-05-07T20:03:43.2777397Z U __cxa_guard_abort@CXXABI_1.3 2025-05-07T20:03:43.2777743Z U __cxa_guard_acquire@CXXABI_1.3 2025-05-07T20:03:43.2778080Z U __cxa_guard_release@CXXABI_1.3 2025-05-07T20:03:43.2778431Z U __cxa_rethrow@CXXABI_1.3 2025-05-07T20:03:43.2778797Z U __gxx_personality_v0@CXXABI_1.3 2025-05-07T20:03:43.2779177Z U at::CUDAGeneratorImpl::device_type() 2025-05-07T20:03:43.2779826Z U at::CUDAGeneratorImpl::philox_cuda_state(unsigned long) 2025-05-07T20:03:43.2780544Z U at::_ops::arange::call(c10::Scalar const&, std::optional, std::optional, std::optional, 
std::optional) 2025-05-07T20:03:43.2781649Z U at::_ops::empty_like::call(at::Tensor const&, std::optional, std::optional, std::optional, std::optional, std::optional) 2025-05-07T20:03:43.2782948Z U at::_ops::empty_memory_format::call(c10::ArrayRef, std::optional, std::optional, std::optional, std::optional, std::optional) 2025-05-07T20:03:43.2783914Z U at::_ops::fill__Scalar::call(at::Tensor&, c10::Scalar const&) 2025-05-07T20:03:43.2784788Z U at::_ops::full_like::call(at::Tensor const&, c10::Scalar const&, std::optional, std::optional, std::optional, std::optional, std::optional) 2025-05-07T20:03:43.2785727Z U at::_ops::index_select::call(at::Tensor const&, long, at::Tensor const&) 2025-05-07T20:03:43.2786388Z U at::_ops::slice_Tensor::call(at::Tensor const&, long, std::optional, std::optional, c10::SymInt) 2025-05-07T20:03:43.2787291Z U at::_ops::zeros::call(c10::ArrayRef, std::optional, std::optional, std::optional, std::optional) 2025-05-07T20:03:43.2788412Z U at::_ops::zeros_like::call(at::Tensor const&, std::optional, std::optional, std::optional, std::optional, std::optional) 2025-05-07T20:03:43.2789258Z U at::cuda::detail::getDefaultCUDAGenerator(signed char) 2025-05-07T20:03:43.2789660Z U at::cuda::getDeviceProperties(signed char) 2025-05-07T20:03:43.2790015Z U c10::BoolType::get() 2025-05-07T20:03:43.2790443Z U c10::DeviceTypeName[abi:cxx11](c10::DeviceType, bool) 2025-05-07T20:03:43.2790851Z U c10::GeneratorImpl::device() const 2025-05-07T20:03:43.2791247Z U c10::Half* at::TensorBase::mutable_data_ptr() const 2025-05-07T20:03:43.2791701Z U c10::IValue::reportToTensorTypeError() const 2025-05-07T20:03:43.2792078Z U c10::IntType::get() 2025-05-07T20:03:43.2792489Z U c10::OptionalType::get(c10::Type::SingletonOrSharedTypePtr) 2025-05-07T20:03:43.2793107Z U c10::StorageImpl::throw_data_ptr_access_error() const 2025-05-07T20:03:43.2793703Z U c10::SymBool::guard_bool(char const*, long) const 2025-05-07T20:03:43.2794428Z U c10::SymInt::SymInt(c10::intrusive_ptr 
>) 2025-05-07T20:03:43.2795138Z U c10::SymInt::guard_int(char const*, long) const 2025-05-07T20:03:43.2795529Z U c10::SymInt::promote_to_negative() 2025-05-07T20:03:43.2795946Z U c10::TensorImpl::throw_data_ptr_access_error() const 2025-05-07T20:03:43.2796321Z U c10::TensorType::get() 2025-05-07T20:03:43.2796676Z U c10::UndefinedTensorImpl::_singleton 2025-05-07T20:03:43.2797807Z U c10::Warning::Warning(std::variant, c10::SourceLocation const&, std::__cxx11::basic_string, std::allocator >, bool) 2025-05-07T20:03:43.2798806Z U c10::cuda::CUDACachingAllocator::allocator 2025-05-07T20:03:43.2799269Z U c10::cuda::CUDAStream::stream() const 2025-05-07T20:03:43.2799637Z U c10::cuda::ExchangeDevice(signed char) 2025-05-07T20:03:43.2800018Z U c10::cuda::GetDevice(signed char*) 2025-05-07T20:03:43.2800413Z U c10::cuda::MaybeSetDevice(signed char) 2025-05-07T20:03:43.2800780Z U c10::cuda::SetDevice(signed char) 2025-05-07T20:03:43.2801301Z U c10::cuda::c10_cuda_check_implementation(int, char const*, char const*, int, bool) 2025-05-07T20:03:43.2801800Z U c10::cuda::current_device() 2025-05-07T20:03:43.2802347Z U c10::cuda::device_count() 2025-05-07T20:03:43.2802715Z U c10::cuda::getCurrentCUDAStream(signed char) 2025-05-07T20:03:43.2803117Z U c10::cuda::getDefaultCUDAStream(signed char) 2025-05-07T20:03:43.2803535Z U c10::cuda::getStreamFromPool(bool, signed char) 2025-05-07T20:03:43.2803932Z U c10::cuda::getStreamFromPool(int, signed char) 2025-05-07T20:03:43.2804367Z U c10::cuda::setCurrentCUDAStream(c10::cuda::CUDAStream) 2025-05-07T20:03:43.2804753Z U c10::cuda::warn_or_error_on_sync() 2025-05-07T20:03:43.2805544Z U c10::detail::infer_schema::make_function_schema(c10::ArrayRef, c10::ArrayRef) 2025-05-07T20:03:43.2806476Z U c10::detail::torchCheckFail(char const*, char const*, unsigned int, char const*) 2025-05-07T20:03:43.2807372Z U c10::detail::torchCheckFail(char const*, char const*, unsigned int, std::__cxx11::basic_string, std::allocator > const&) 
2025-05-07T20:03:43.2808361Z U c10::detail::torchInternalAssertFail(char const*, char const*, unsigned int, char const*, char const*) 2025-05-07T20:03:43.2809452Z U c10::detail::torchInternalAssertFail(char const*, char const*, unsigned int, char const*, std::__cxx11::basic_string, std::allocator > const&) 2025-05-07T20:03:43.2810290Z U c10::impl::GPUTrace::gpuTraceState 2025-05-07T20:03:43.2810660Z U c10::impl::GPUTrace::haveState 2025-05-07T20:03:43.2811161Z U c10::impl::cow::is_cow_data_ptr(c10::DataPtr const&) 2025-05-07T20:03:43.2811607Z U c10::impl::cow::materialize_cow_storage(c10::StorageImpl&) 2025-05-07T20:03:43.2812045Z U c10::impl::device_guard_impl_registry 2025-05-07T20:03:43.2812417Z U c10::operator<<(std::ostream&, c10::Device const&) 2025-05-07T20:03:43.2812827Z U c10::operator<<(std::ostream&, c10::DeviceType) 2025-05-07T20:03:43.2813203Z U c10::throwNullDataPtrError() 2025-05-07T20:03:43.2813566Z U c10::warn(c10::Warning const&) 2025-05-07T20:03:43.2813925Z U c10::warnDeprecatedDataPtr() 2025-05-07T20:03:43.2814665Z U caffe2::TypeMeta::error_unsupported_typemeta(caffe2::TypeMeta) 2025-05-07T20:03:43.2815111Z U caffe2::TypeMeta::typeMetaDatas() 2025-05-07T20:03:43.2815473Z U cudaDeviceGetAttribute@libcudart.so.11.0 2025-05-07T20:03:43.2815884Z U cudaDeviceSynchronize@libcudart.so.11.0 2025-05-07T20:03:43.2816267Z U cudaEventCreateWithFlags@libcudart.so.11.0 2025-05-07T20:03:43.2816678Z U cudaEventDestroy@libcudart.so.11.0 2025-05-07T20:03:43.2817228Z U cudaEventElapsedTime@libcudart.so.11.0 2025-05-07T20:03:43.2817628Z U cudaEventQuery@libcudart.so.11.0 2025-05-07T20:03:43.2818009Z U cudaEventRecord@libcudart.so.11.0 2025-05-07T20:03:43.2818360Z U cudaEventSynchronize@libcudart.so.11.0 2025-05-07T20:03:43.2818746Z U cudaFuncGetAttributes@libcudart.so.11.0 2025-05-07T20:03:43.2819111Z U cudaGetDevice@libcudart.so.11.0 2025-05-07T20:03:43.2819511Z U cudaGetDeviceCount@libcudart.so.11.0 2025-05-07T20:03:43.2819891Z U 
cudaGetErrorString@libcudart.so.11.0 2025-05-07T20:03:43.2820254Z U cudaGetLastError@libcudart.so.11.0 2025-05-07T20:03:43.2820635Z U cudaLaunchKernel@libcudart.so.11.0 2025-05-07T20:03:43.2820993Z U cudaMemsetAsync@libcudart.so.11.0 2025-05-07T20:03:43.2821540Z U cudaOccupancyMaxActiveBlocksPerMultiprocessorWithFlags@libcudart.so.11.0 2025-05-07T20:03:43.2822099Z U cudaPeekAtLastError@libcudart.so.11.0 2025-05-07T20:03:43.2822485Z U cudaSetDevice@libcudart.so.11.0 2025-05-07T20:03:43.2822831Z U cudaStreamQuery@libcudart.so.11.0 2025-05-07T20:03:43.2823198Z U cudaStreamSynchronize@libcudart.so.11.0 2025-05-07T20:03:43.2823580Z U cudaStreamWaitEvent@libcudart.so.11.0 2025-05-07T20:03:43.2823987Z U float* at::TensorBase::mutable_data_ptr() const 2025-05-07T20:03:43.2824428Z U int* at::TensorBase::data_ptr() const 2025-05-07T20:03:43.2824820Z U int* at::TensorBase::mutable_data_ptr() const 2025-05-07T20:03:43.2825203Z U log2f@GLIBC_2.2.5 2025-05-07T20:03:43.2825585Z U long c10::detail::maybe_wrap_dim_slow(long, long, bool) 2025-05-07T20:03:43.2826052Z U long* at::TensorBase::data_ptr() const 2025-05-07T20:03:43.2826481Z U long* at::TensorBase::mutable_data_ptr() const 2025-05-07T20:03:43.2826856Z U memcpy@GLIBC_2.14 2025-05-07T20:03:43.2827203Z U memset@GLIBC_2.2.5 2025-05-07T20:03:43.2827548Z U operator delete(void*)@GLIBCXX_3.4 2025-05-07T20:03:43.2827904Z U operator new(unsigned long)@GLIBCXX_3.4 2025-05-07T20:03:43.2828303Z U printf@GLIBC_2.2.5 2025-05-07T20:03:43.2828585Z U puts@GLIBC_2.2.5 2025-05-07T20:03:43.2829156Z U std::__cxx11::basic_ostringstream, std::allocator >::basic_ostringstream() 2025-05-07T20:03:43.2830053Z U std::__cxx11::basic_ostringstream, std::allocator >::~basic_ostringstream()@GLIBCXX_3.4.21 2025-05-07T20:03:43.2831128Z U std::__cxx11::basic_string, std::allocator >::_M_append(char const*, unsigned long)@GLIBCXX_3.4.21 2025-05-07T20:03:43.2832188Z U std::__cxx11::basic_string, std::allocator >::_M_assign(std::__cxx11::basic_string, 
std::allocator > const&) 2025-05-07T20:03:43.2833495Z U std::__cxx11::basic_string, std::allocator >::_M_construct(unsigned long, char)@GLIBCXX_3.4.21 2025-05-07T20:03:43.2834422Z U std::__cxx11::basic_string, std::allocator >::_M_create(unsigned long&, unsigned long)@GLIBCXX_3.4.21 2025-05-07T20:03:43.2835425Z U std::__cxx11::basic_string, std::allocator >::_M_mutate(unsigned long, unsigned long, char const*, unsigned long) 2025-05-07T20:03:43.2836531Z U std::__cxx11::basic_string, std::allocator >::_M_replace(unsigned long, unsigned long, char const*, unsigned long)@GLIBCXX_3.4.21 2025-05-07T20:03:43.2837305Z U std::__throw_length_error(char const*)@GLIBCXX_3.4 2025-05-07T20:03:43.2837756Z U std::__throw_logic_error(char const*)@GLIBCXX_3.4 2025-05-07T20:03:43.2838180Z U std::__throw_out_of_range_fmt(char const*, ...)@GLIBCXX_3.4.20 2025-05-07T20:03:43.2838630Z U std::__throw_system_error(int)@GLIBCXX_3.4.11 2025-05-07T20:03:43.2839159Z U std::basic_ios >::clear(std::_Ios_Iostate)@GLIBCXX_3.4 2025-05-07T20:03:43.2840139Z U std::basic_ostream >& std::__ostream_insert >(std::basic_ostream >&, char const*, long)@GLIBCXX_3.4.9 2025-05-07T20:03:43.2840978Z U std::ios_base::Init::Init()@GLIBCXX_3.4 2025-05-07T20:03:43.2841368Z U std::ios_base::Init::~Init()@GLIBCXX_3.4 2025-05-07T20:03:43.2841723Z U std::ios_base::~ios_base()@GLIBCXX_3.4 2025-05-07T20:03:43.2842067Z U std::locale::~locale()@GLIBCXX_3.4 2025-05-07T20:03:43.2842476Z U std::ostream& std::ostream::_M_insert(long)@GLIBCXX_3.4.9 2025-05-07T20:03:43.2843043Z U std::ostream& std::ostream::_M_insert(unsigned long)@GLIBCXX_3.4.9 2025-05-07T20:03:43.2843553Z U std::terminate()@GLIBCXX_3.4 2025-05-07T20:03:43.2843883Z U strlen@GLIBC_2.2.5 2025-05-07T20:03:43.2844227Z U torch::CppFunction::~CppFunction() 2025-05-07T20:03:43.2845075Z U torch::Library::Library(torch::Library::Kind, std::__cxx11::basic_string, std::allocator >, std::optional, char const*, unsigned int) 2025-05-07T20:03:43.2846338Z U 
torch::Library::_def(c10::FunctionSchema&&, c10::OperatorName*, std::vector > const&, torch::_RegisterOrVerify) & 2025-05-07T20:03:43.2847154Z U torch::Library::_impl(char const*, torch::CppFunction&&, torch::_RegisterOrVerify) & 2025-05-07T20:03:43.2847853Z U torch::jit::parseSchema(std::__cxx11::basic_string, std::allocator > const&, bool) 2025-05-07T20:03:43.2848527Z U unsigned char* at::TensorBase::mutable_data_ptr() const 2025-05-07T20:03:43.2848999Z U vtable for __cxxabiv1::__class_type_info@CXXABI_1.3 2025-05-07T20:03:43.2849411Z U vtable for __cxxabiv1::__function_type_info@CXXABI_1.3 2025-05-07T20:03:43.2850016Z U vtable for __cxxabiv1::__si_class_type_info@CXXABI_1.3 2025-05-07T20:03:43.2850623Z U vtable for std::__cxx11::basic_stringbuf, std::allocator >@GLIBCXX_3.4.21 2025-05-07T20:03:43.2851538Z U vtable for std::basic_streambuf >@GLIBCXX_3.4 2025-05-07T20:03:43.2852017Z w _ITM_deregisterTMCloneTable 2025-05-07T20:03:43.2852341Z w _ITM_registerTMCloneTable 2025-05-07T20:03:43.2852666Z w __cxa_finalize@GLIBC_2.2.5 2025-05-07T20:03:43.2852967Z w __gmon_start__ 2025-05-07T20:03:43.2853258Z w __pthread_key_create 2025-05-07T20:03:43.2853608Z w pthread_mutex_lock@GLIBC_2.2.5 2025-05-07T20:03:43.2853949Z w pthread_mutex_unlock@GLIBC_2.2.5 2025-05-07T20:03:43.2854335Z [CHECK] Listing out external shared libraries linked: 2025-05-07T20:03:43.2854796Z + ldd ./_skbuild/linux-x86_64-3.13/cmake-build/fbgemm_gpu_tbe_cache.so 2025-05-07T20:03:43.2855109Z 2025-05-07T20:03:43.2855233Z linux-vdso.so.1 (0x00007ffc3c5cd000) 2025-05-07T20:03:43.2855527Z libtorch.so => not found 2025-05-07T20:03:43.2855795Z libc10.so => not found 2025-05-07T20:03:43.2856036Z libc10_cuda.so => not found 2025-05-07T20:03:43.2856312Z libtorch_cpu.so => not found 2025-05-07T20:03:43.2856581Z libtorch_cuda.so => not found 2025-05-07T20:03:43.2856863Z libcudart.so.11.0 => not found 2025-05-07T20:03:43.2857208Z libstdc++.so.6 => /lib64/libstdc++.so.6 (0x00007fc15819c000) 
2025-05-07T20:03:43.2857624Z libm.so.6 => /lib64/libm.so.6 (0x00007fc15914c000) 2025-05-07T20:03:43.2858021Z libgomp.so.1 => /lib64/libgomp.so.1 (0x00007fc1590f6000) 2025-05-07T20:03:43.2858418Z libgcc_s.so.1 => /lib64/libgcc_s.so.1 (0x00007fc1590c8000) 2025-05-07T20:03:43.2858810Z libc.so.6 => /lib64/libc.so.6 (0x00007fc157f94000) 2025-05-07T20:03:43.2859188Z /lib64/ld-linux-x86-64.so.2 (0x00007fc15922d000) 2025-05-07T20:03:43.2859436Z 2025-05-07T20:03:43.2859542Z [CHECK] Displaying ELF information: 2025-05-07T20:03:43.2859979Z + readelf -d ./_skbuild/linux-x86_64-3.13/cmake-build/fbgemm_gpu_tbe_cache.so 2025-05-07T20:03:43.2860326Z 2025-05-07T20:03:43.2860358Z 2025-05-07T20:03:43.2860513Z Dynamic section at offset 0xa76868 contains 37 entries: 2025-05-07T20:03:43.2860896Z Tag Type Name/Value 2025-05-07T20:03:43.2861312Z 0x0000000000000001 (NEEDED) Shared library: [libtorch.so] 2025-05-07T20:03:43.2861827Z 0x0000000000000001 (NEEDED) Shared library: [libc10.so] 2025-05-07T20:03:43.2862342Z 0x0000000000000001 (NEEDED) Shared library: [libc10_cuda.so] 2025-05-07T20:03:43.2862855Z 0x0000000000000001 (NEEDED) Shared library: [libtorch_cpu.so] 2025-05-07T20:03:43.2863387Z 0x0000000000000001 (NEEDED) Shared library: [libtorch_cuda.so] 2025-05-07T20:03:43.2863915Z 0x0000000000000001 (NEEDED) Shared library: [libcudart.so.11.0] 2025-05-07T20:03:43.2864456Z 0x0000000000000001 (NEEDED) Shared library: [libstdc++.so.6] 2025-05-07T20:03:43.2865084Z 0x0000000000000001 (NEEDED) Shared library: [libm.so.6] 2025-05-07T20:03:43.2865680Z 0x0000000000000001 (NEEDED) Shared library: [libgomp.so.1] 2025-05-07T20:03:43.2866344Z 0x0000000000000001 (NEEDED) Shared library: [libgcc_s.so.1] 2025-05-07T20:03:43.2866832Z 0x0000000000000001 (NEEDED) Shared library: [libc.so.6] 2025-05-07T20:03:43.2867354Z 0x000000000000000e (SONAME) Library soname: [fbgemm_gpu_tbe_cache.so] 2025-05-07T20:03:43.2867959Z 0x000000000000000c (INIT) 0x2e000 2025-05-07T20:03:43.2868297Z 0x000000000000000d (FINI) 
0xc47fc 2025-05-07T20:03:43.2868625Z 0x0000000000000019 (INIT_ARRAY) 0xa75ea0 2025-05-07T20:03:43.2868990Z 0x000000000000001b (INIT_ARRAYSZ) 208 (bytes) 2025-05-07T20:03:43.2869347Z 0x000000000000001a (FINI_ARRAY) 0xa75f70 2025-05-07T20:03:43.2869686Z 0x000000000000001c (FINI_ARRAYSZ) 8 (bytes) 2025-05-07T20:03:43.2870037Z 0x000000006ffffef5 (GNU_HASH) 0x200 2025-05-07T20:03:43.2870360Z 0x0000000000000005 (STRTAB) 0x6b50 2025-05-07T20:03:43.2870741Z 0x0000000000000006 (SYMTAB) 0x1e70 2025-05-07T20:03:43.2871089Z 0x000000000000000a (STRSZ) 120164 (bytes) 2025-05-07T20:03:43.2871465Z 0x000000000000000b (SYMENT) 24 (bytes) 2025-05-07T20:03:43.2871817Z 0x0000000000000003 (PLTGOT) 0xa77b08 2025-05-07T20:03:43.2872172Z 0x0000000000000002 (PLTRELSZ) 10416 (bytes) 2025-05-07T20:03:43.2872529Z 0x0000000000000014 (PLTREL) RELA 2025-05-07T20:03:43.2872928Z 0x0000000000000017 (JMPREL) 0x2aa30 2025-05-07T20:03:43.2873273Z 0x0000000000000007 (RELA) 0x24820 2025-05-07T20:03:43.2873619Z 0x0000000000000008 (RELASZ) 25104 (bytes) 2025-05-07T20:03:43.2874032Z 0x0000000000000009 (RELAENT) 24 (bytes) 2025-05-07T20:03:43.2874356Z 0x0000000000000018 (BIND_NOW) 2025-05-07T20:03:43.2874696Z 0x000000006ffffffb (FLAGS_1) Flags: NOW 2025-05-07T20:03:43.2875058Z 0x000000006ffffffe (VERNEED) 0x24720 2025-05-07T20:03:43.2875391Z 0x000000006fffffff (VERNEEDNUM) 5 2025-05-07T20:03:43.2875722Z 0x000000006ffffff0 (VERSYM) 0x240b4 2025-05-07T20:03:43.2876049Z 0x000000006ffffff9 (RELACOUNT) 176 2025-05-07T20:03:43.2876364Z 0x0000000000000000 (NULL) 0x0 2025-05-07T20:03:43.2876568Z 2025-05-07T20:03:43.2876707Z ################################################################################ 2025-05-07T20:03:43.2876947Z 2025-05-07T20:03:43.2876951Z 2025-05-07T20:03:43.2877062Z ################################################################################ 2025-05-07T20:03:43.2877577Z [CHECK] BUILT LIBRARY: ./_skbuild/linux-x86_64-3.13/cmake-build/fbgemm_gpu_tbe_optimizers.so 2025-05-07T20:03:43.2878100Z 
[CHECK] Listing out library size: 2025-05-07T20:03:43.2878574Z + du -h --block-size=1M ./_skbuild/linux-x86_64-3.13/cmake-build/fbgemm_gpu_tbe_optimizers.so 2025-05-07T20:03:43.2878958Z 2025-05-07T20:03:43.2879179Z 5 ./_skbuild/linux-x86_64-3.13/cmake-build/fbgemm_gpu_tbe_optimizers.so 2025-05-07T20:03:43.2879509Z 2025-05-07T20:03:43.2879909Z [CHECK] Listing out the GLIBC versions referenced by: ./_skbuild/linux-x86_64-3.13/cmake-build/fbgemm_gpu_tbe_optimizers.so 2025-05-07T20:03:43.2880945Z + objdump -TC ./_skbuild/linux-x86_64-3.13/cmake-build/fbgemm_gpu_tbe_optimizers.so | grep GLIBC_ | sed 's/.*GLIBC_\([.0-9]*\).*/GLIBC_\1/g' | sort -Vu | cat 2025-05-07T20:03:43.2881547Z 2025-05-07T20:03:43.2923973Z GLIBC_2.2.5 2025-05-07T20:03:43.2924226Z GLIBC_2.3 2025-05-07T20:03:43.2924420Z GLIBC_2.14 2025-05-07T20:03:43.2925567Z 2025-05-07T20:03:43.2925712Z 2025-05-07T20:03:43.2926295Z [CHECK] Listing out the GLIBCXX versions referenced by: ./_skbuild/linux-x86_64-3.13/cmake-build/fbgemm_gpu_tbe_optimizers.so 2025-05-07T20:03:43.2927406Z + objdump -TC ./_skbuild/linux-x86_64-3.13/cmake-build/fbgemm_gpu_tbe_optimizers.so | grep GLIBCXX_ | sed 's/.*GLIBCXX_\([.0-9]*\).*/GLIBCXX_\1/g' | sort -Vu | cat 2025-05-07T20:03:43.2928045Z 2025-05-07T20:03:43.2989894Z GLIBCXX_3.4 2025-05-07T20:03:43.2990539Z GLIBCXX_3.4.9 2025-05-07T20:03:43.2991117Z GLIBCXX_3.4.11 2025-05-07T20:03:43.2991702Z GLIBCXX_3.4.18 2025-05-07T20:03:43.2992275Z GLIBCXX_3.4.21 2025-05-07T20:03:43.2992642Z 2025-05-07T20:03:43.2992655Z 2025-05-07T20:03:43.3007496Z + nm -gDC ./_skbuild/linux-x86_64-3.13/cmake-build/fbgemm_gpu_tbe_optimizers.so > /tmp/tmp.pgdSAORtu8.symbols.txt 2025-05-07T20:03:43.3008006Z 2025-05-07T20:03:43.3038474Z 2025-05-07T20:03:43.3064274Z [CHECK] Total Number of symbols: 338 2025-05-07T20:03:43.3075442Z [CHECK] Number of fbgemm symbols: 16 2025-05-07T20:03:43.3095895Z + nm -gDCu ./_skbuild/linux-x86_64-3.13/cmake-build/fbgemm_gpu_tbe_optimizers.so > /tmp/tmp.vmpSk5SXtp.usymbols.txt 
2025-05-07T20:03:43.3096588Z 2025-05-07T20:03:43.3111977Z 2025-05-07T20:03:43.3137357Z [CHECK] Listing out undefined symbols (128 total): 2025-05-07T20:03:43.3155016Z U VTT for std::__cxx11::basic_ostringstream, std::allocator >@GLIBCXX_3.4.21 2025-05-07T20:03:43.3156101Z U VTT for std::__cxx11::basic_stringstream, std::allocator >@GLIBCXX_3.4.21 2025-05-07T20:03:43.3156666Z U _Unwind_Resume@GCC_3.0 2025-05-07T20:03:43.3157040Z U __cudaPopCallConfiguration@libcudart.so.11.0 2025-05-07T20:03:43.3157449Z U __cudaPushCallConfiguration@libcudart.so.11.0 2025-05-07T20:03:43.3157856Z U __cudaRegisterFatBinary@libcudart.so.11.0 2025-05-07T20:03:43.3158240Z U __cudaRegisterFatBinaryEnd@libcudart.so.11.0 2025-05-07T20:03:43.3158637Z U __cudaRegisterFunction@libcudart.so.11.0 2025-05-07T20:03:43.3158994Z U __cudaRegisterVar@libcudart.so.11.0 2025-05-07T20:03:43.3159378Z U __cudaUnregisterFatBinary@libcudart.so.11.0 2025-05-07T20:03:43.3159740Z U __cxa_atexit@GLIBC_2.2.5 2025-05-07T20:03:43.3160050Z U __cxa_begin_catch@CXXABI_1.3 2025-05-07T20:03:43.3160385Z U __cxa_end_catch@CXXABI_1.3 2025-05-07T20:03:43.3160689Z U __cxa_rethrow@CXXABI_1.3 2025-05-07T20:03:43.3161014Z U __cxa_thread_atexit@CXXABI_1.3.7 2025-05-07T20:03:43.3161343Z U __gxx_personality_v0@CXXABI_1.3 2025-05-07T20:03:43.3161674Z U __tls_get_addr@GLIBC_2.3 2025-05-07T20:03:43.3162159Z U at::CUDAGeneratorImpl::device_type() 2025-05-07T20:03:43.3162565Z U at::CUDAGeneratorImpl::philox_cuda_state(unsigned long) 2025-05-07T20:03:43.3163034Z U at::_ops::flatten_using_ints::call(at::Tensor const&, long, long) 2025-05-07T20:03:43.3163488Z U at::cuda::detail::getDefaultCUDAGenerator(signed char) 2025-05-07T20:03:43.3163899Z U c10::BoolType::get() 2025-05-07T20:03:43.3164245Z U c10::DeviceTypeName[abi:cxx11](c10::DeviceType, bool) 2025-05-07T20:03:43.3164614Z U c10::FloatType::get() 2025-05-07T20:03:43.3164934Z U c10::GeneratorImpl::device() const 2025-05-07T20:03:43.3165318Z U c10::Half* 
at::TensorBase::mutable_data_ptr() const 2025-05-07T20:03:43.3165744Z U c10::IValue::reportToTensorTypeError() const 2025-05-07T20:03:43.3166080Z U c10::IntType::get() 2025-05-07T20:03:43.3166446Z U c10::MessageLogger::MessageLogger(char const*, int, int) 2025-05-07T20:03:43.3166833Z U c10::MessageLogger::~MessageLogger() 2025-05-07T20:03:43.3167205Z U c10::SymBool::guard_bool(char const*, long) const 2025-05-07T20:03:43.3167610Z U c10::SymFloat::guard_float(char const*, long) const 2025-05-07T20:03:43.3168266Z U c10::SymInt::SymInt(c10::intrusive_ptr >) 2025-05-07T20:03:43.3168923Z U c10::SymInt::guard_int(char const*, long) const 2025-05-07T20:03:43.3169278Z U c10::SymInt::promote_to_negative() 2025-05-07T20:03:43.3169600Z U c10::TensorType::get() 2025-05-07T20:03:43.3169929Z U c10::UndefinedTensorImpl::_singleton 2025-05-07T20:03:43.3170868Z U c10::Warning::Warning(std::variant, c10::SourceLocation const&, std::__cxx11::basic_string, std::allocator >, bool) 2025-05-07T20:03:43.3171896Z U c10::cuda::CUDACachingAllocator::allocator 2025-05-07T20:03:43.3172265Z U c10::cuda::CUDAStream::stream() const 2025-05-07T20:03:43.3172599Z U c10::cuda::ExchangeDevice(signed char) 2025-05-07T20:03:43.3172952Z U c10::cuda::GetDevice(signed char*) 2025-05-07T20:03:43.3173286Z U c10::cuda::MaybeSetDevice(signed char) 2025-05-07T20:03:43.3173637Z U c10::cuda::SetDevice(signed char) 2025-05-07T20:03:43.3174185Z U c10::cuda::c10_cuda_check_implementation(int, char const*, char const*, int, bool) 2025-05-07T20:03:43.3174650Z U c10::cuda::device_count() 2025-05-07T20:03:43.3175002Z U c10::cuda::getCurrentCUDAStream(signed char) 2025-05-07T20:03:43.3175375Z U c10::cuda::getDefaultCUDAStream(signed char) 2025-05-07T20:03:43.3175772Z U c10::cuda::getStreamFromPool(bool, signed char) 2025-05-07T20:03:43.3176167Z U c10::cuda::getStreamFromPool(int, signed char) 2025-05-07T20:03:43.3176563Z U c10::cuda::setCurrentCUDAStream(c10::cuda::CUDAStream) 2025-05-07T20:03:43.3176951Z U 
c10::cuda::warn_or_error_on_sync() 2025-05-07T20:03:43.3177681Z U c10::detail::infer_schema::make_function_schema(c10::ArrayRef, c10::ArrayRef) 2025-05-07T20:03:43.3178562Z U c10::detail::torchCheckFail(char const*, char const*, unsigned int, char const*) 2025-05-07T20:03:43.3179428Z U c10::detail::torchCheckFail(char const*, char const*, unsigned int, std::__cxx11::basic_string, std::allocator > const&) 2025-05-07T20:03:43.3180384Z U c10::detail::torchInternalAssertFail(char const*, char const*, unsigned int, char const*, char const*) 2025-05-07T20:03:43.3181421Z U c10::detail::torchInternalAssertFail(char const*, char const*, unsigned int, char const*, std::__cxx11::basic_string, std::allocator > const&) 2025-05-07T20:03:43.3182229Z U c10::impl::GPUTrace::gpuTraceState 2025-05-07T20:03:43.3182577Z U c10::impl::GPUTrace::haveState 2025-05-07T20:03:43.3182922Z U c10::impl::device_guard_impl_registry 2025-05-07T20:03:43.3183278Z U c10::operator<<(std::ostream&, c10::Device const&) 2025-05-07T20:03:43.3183674Z U c10::operator<<(std::ostream&, c10::DeviceType) 2025-05-07T20:03:43.3184005Z U c10::warn(c10::Warning const&) 2025-05-07T20:03:43.3184422Z U caffe2::TypeMeta::error_unsupported_typemeta(caffe2::TypeMeta) 2025-05-07T20:03:43.3184864Z U cudaDeviceSynchronize@libcudart.so.11.0 2025-05-07T20:03:43.3185232Z U cudaEventCreateWithFlags@libcudart.so.11.0 2025-05-07T20:03:43.3185604Z U cudaEventDestroy@libcudart.so.11.0 2025-05-07T20:03:43.3185947Z U cudaEventElapsedTime@libcudart.so.11.0 2025-05-07T20:03:43.3186298Z U cudaEventQuery@libcudart.so.11.0 2025-05-07T20:03:43.3186629Z U cudaEventRecord@libcudart.so.11.0 2025-05-07T20:03:43.3186984Z U cudaEventSynchronize@libcudart.so.11.0 2025-05-07T20:03:43.3187355Z U cudaGetDeviceProperties@libcudart.so.11.0 2025-05-07T20:03:43.3187713Z U cudaGetErrorString@libcudart.so.11.0 2025-05-07T20:03:43.3188068Z U cudaGetLastError@libcudart.so.11.0 2025-05-07T20:03:43.3188398Z U cudaLaunchKernel@libcudart.so.11.0 
2025-05-07T20:03:43.3188738Z U cudaStreamQuery@libcudart.so.11.0 2025-05-07T20:03:43.3189079Z U cudaStreamSynchronize@libcudart.so.11.0 2025-05-07T20:03:43.3189447Z U cudaStreamWaitEvent@libcudart.so.11.0 2025-05-07T20:03:43.3189799Z U float at::Tensor::item() const 2025-05-07T20:03:43.3190172Z U float* at::TensorBase::mutable_data_ptr() const 2025-05-07T20:03:43.3190592Z U int* at::TensorBase::mutable_data_ptr() const 2025-05-07T20:03:43.3190987Z U long* at::TensorBase::mutable_data_ptr() const 2025-05-07T20:03:43.3191350Z U memcpy@GLIBC_2.14 2025-05-07T20:03:43.3191625Z U memset@GLIBC_2.2.5 2025-05-07T20:03:43.3191930Z U operator delete(void*)@GLIBCXX_3.4 2025-05-07T20:03:43.3192327Z U operator new(unsigned long)@GLIBCXX_3.4 2025-05-07T20:03:43.3192981Z U std::__cxx11::basic_ostringstream, std::allocator >::basic_ostringstream() 2025-05-07T20:03:43.3194014Z U std::__cxx11::basic_ostringstream, std::allocator >::~basic_ostringstream()@GLIBCXX_3.4.21 2025-05-07T20:03:43.3194975Z U std::__cxx11::basic_string, std::allocator >::_M_append(char const*, unsigned long)@GLIBCXX_3.4.21 2025-05-07T20:03:43.3196056Z U std::__cxx11::basic_string, std::allocator >::_M_assign(std::__cxx11::basic_string, std::allocator > const&) 2025-05-07T20:03:43.3197123Z U std::__cxx11::basic_string, std::allocator >::_M_construct(unsigned long, char)@GLIBCXX_3.4.21 2025-05-07T20:03:43.3198055Z U std::__cxx11::basic_string, std::allocator >::_M_create(unsigned long&, unsigned long)@GLIBCXX_3.4.21 2025-05-07T20:03:43.3199092Z U std::__cxx11::basic_string, std::allocator >::_M_replace(unsigned long, unsigned long, char const*, unsigned long)@GLIBCXX_3.4.21 2025-05-07T20:03:43.3200104Z U std::__cxx11::basic_stringstream, std::allocator >::basic_stringstream() 2025-05-07T20:03:43.3200949Z U std::__detail::_Prime_rehash_policy::_M_need_rehash(unsigned long, unsigned long, unsigned long) const@GLIBCXX_3.4.18 2025-05-07T20:03:43.3201550Z U std::__throw_bad_alloc()@GLIBCXX_3.4 
2025-05-07T20:03:43.3201926Z U std::__throw_bad_array_new_length() 2025-05-07T20:03:43.3202467Z U std::__throw_length_error(char const*)@GLIBCXX_3.4 2025-05-07T20:03:43.3202883Z U std::__throw_logic_error(char const*)@GLIBCXX_3.4 2025-05-07T20:03:43.3203269Z U std::__throw_system_error(int)@GLIBCXX_3.4.11 2025-05-07T20:03:43.3203767Z U std::basic_ios >::clear(std::_Ios_Iostate)@GLIBCXX_3.4 2025-05-07T20:03:43.3204727Z U std::basic_ostream >& std::__ostream_insert >(std::basic_ostream >&, char const*, long)@GLIBCXX_3.4.9 2025-05-07T20:03:43.3205541Z U std::ios_base::Init::Init()@GLIBCXX_3.4 2025-05-07T20:03:43.3205914Z U std::ios_base::Init::~Init()@GLIBCXX_3.4 2025-05-07T20:03:43.3206274Z U std::ios_base::~ios_base()@GLIBCXX_3.4 2025-05-07T20:03:43.3206614Z U std::locale::~locale()@GLIBCXX_3.4 2025-05-07T20:03:43.3207034Z U std::ostream& std::ostream::_M_insert(long)@GLIBCXX_3.4.9 2025-05-07T20:03:43.3207578Z U std::ostream& std::ostream::_M_insert(unsigned long)@GLIBCXX_3.4.9 2025-05-07T20:03:43.3208086Z U std::ostream::operator<<(int)@GLIBCXX_3.4 2025-05-07T20:03:43.3208442Z U std::terminate()@GLIBCXX_3.4 2025-05-07T20:03:43.3208752Z U strlen@GLIBC_2.2.5 2025-05-07T20:03:43.3209078Z U torch::CppFunction::~CppFunction() 2025-05-07T20:03:43.3209915Z U torch::Library::Library(torch::Library::Kind, std::__cxx11::basic_string, std::allocator >, std::optional, char const*, unsigned int) 2025-05-07T20:03:43.3211114Z U torch::Library::_def(c10::FunctionSchema&&, c10::OperatorName*, std::vector > const&, torch::_RegisterOrVerify) & 2025-05-07T20:03:43.3211975Z U torch::Library::_impl(char const*, torch::CppFunction&&, torch::_RegisterOrVerify) & 2025-05-07T20:03:43.3212723Z U torch::jit::parseSchema(std::__cxx11::basic_string, std::allocator > const&, bool) 2025-05-07T20:03:43.3213483Z U vtable for __cxxabiv1::__class_type_info@CXXABI_1.3 2025-05-07T20:03:43.3213904Z U vtable for __cxxabiv1::__function_type_info@CXXABI_1.3 2025-05-07T20:03:43.3214348Z U vtable for 
__cxxabiv1::__si_class_type_info@CXXABI_1.3 2025-05-07T20:03:43.3215060Z U vtable for std::__cxx11::basic_stringbuf, std::allocator >@GLIBCXX_3.4.21 2025-05-07T20:03:43.3215683Z U vtable for std::basic_streambuf >@GLIBCXX_3.4 2025-05-07T20:03:43.3216111Z w _ITM_deregisterTMCloneTable 2025-05-07T20:03:43.3216411Z w _ITM_registerTMCloneTable 2025-05-07T20:03:43.3216713Z w __cxa_finalize@GLIBC_2.2.5 2025-05-07T20:03:43.3216998Z w __gmon_start__ 2025-05-07T20:03:43.3217248Z w __pthread_key_create 2025-05-07T20:03:43.3217544Z w pthread_mutex_lock@GLIBC_2.2.5 2025-05-07T20:03:43.3217848Z w pthread_mutex_unlock@GLIBC_2.2.5 2025-05-07T20:03:43.3218217Z [CHECK] Listing out external shared libraries linked: 2025-05-07T20:03:43.3218653Z + ldd ./_skbuild/linux-x86_64-3.13/cmake-build/fbgemm_gpu_tbe_optimizers.so 2025-05-07T20:03:43.3218973Z 2025-05-07T20:03:43.3219107Z linux-vdso.so.1 (0x00007ffc58d6b000) 2025-05-07T20:03:43.3219394Z libtorch.so => not found 2025-05-07T20:03:43.3219619Z libc10.so => not found 2025-05-07T20:03:43.3219850Z libc10_cuda.so => not found 2025-05-07T20:03:43.3220091Z libtorch_cpu.so => not found 2025-05-07T20:03:43.3220348Z libtorch_cuda.so => not found 2025-05-07T20:03:43.3220628Z libcudart.so.11.0 => not found 2025-05-07T20:03:43.3220957Z libstdc++.so.6 => /lib64/libstdc++.so.6 (0x00007fbbbad9c000) 2025-05-07T20:03:43.3221346Z libgomp.so.1 => /lib64/libgomp.so.1 (0x00007fbbbb61e000) 2025-05-07T20:03:43.3221744Z libgcc_s.so.1 => /lib64/libgcc_s.so.1 (0x00007fbbbb5f0000) 2025-05-07T20:03:43.3222114Z libc.so.6 => /lib64/libc.so.6 (0x00007fbbbab94000) 2025-05-07T20:03:43.3222453Z /lib64/ld-linux-x86-64.so.2 (0x00007fbbbb67a000) 2025-05-07T20:03:43.3222803Z libm.so.6 => /lib64/libm.so.6 (0x00007fbbbb515000) 2025-05-07T20:03:43.3223020Z 2025-05-07T20:03:43.3223119Z [CHECK] Displaying ELF information: 2025-05-07T20:03:43.3223547Z + readelf -d ./_skbuild/linux-x86_64-3.13/cmake-build/fbgemm_gpu_tbe_optimizers.so 2025-05-07T20:03:43.3223880Z 
2025-05-07T20:03:43.3231826Z 2025-05-07T20:03:43.3232359Z Dynamic section at offset 0x467450 contains 37 entries: 2025-05-07T20:03:43.3233632Z Tag Type Name/Value 2025-05-07T20:03:43.3234491Z 0x0000000000000001 (NEEDED) Shared library: [libtorch.so] 2025-05-07T20:03:43.3234995Z 0x0000000000000001 (NEEDED) Shared library: [libc10.so] 2025-05-07T20:03:43.3235510Z 0x0000000000000001 (NEEDED) Shared library: [libc10_cuda.so] 2025-05-07T20:03:43.3236039Z 0x0000000000000001 (NEEDED) Shared library: [libtorch_cpu.so] 2025-05-07T20:03:43.3236558Z 0x0000000000000001 (NEEDED) Shared library: [libtorch_cuda.so] 2025-05-07T20:03:43.3237099Z 0x0000000000000001 (NEEDED) Shared library: [libcudart.so.11.0] 2025-05-07T20:03:43.3237620Z 0x0000000000000001 (NEEDED) Shared library: [libstdc++.so.6] 2025-05-07T20:03:43.3238145Z 0x0000000000000001 (NEEDED) Shared library: [libgomp.so.1] 2025-05-07T20:03:43.3238654Z 0x0000000000000001 (NEEDED) Shared library: [libgcc_s.so.1] 2025-05-07T20:03:43.3239165Z 0x0000000000000001 (NEEDED) Shared library: [libc.so.6] 2025-05-07T20:03:43.3239694Z 0x0000000000000001 (NEEDED) Shared library: [ld-linux-x86-64.so.2] 2025-05-07T20:03:43.3240258Z 0x000000000000000e (SONAME) Library soname: [fbgemm_gpu_tbe_optimizers.so] 2025-05-07T20:03:43.3240732Z 0x000000000000000c (INIT) 0xf000 2025-05-07T20:03:43.3241131Z 0x000000000000000d (FINI) 0x31c4c 2025-05-07T20:03:43.3241472Z 0x0000000000000019 (INIT_ARRAY) 0x467fe0 2025-05-07T20:03:43.3241813Z 0x000000000000001b (INIT_ARRAYSZ) 48 (bytes) 2025-05-07T20:03:43.3242167Z 0x000000000000001a (FINI_ARRAY) 0x468010 2025-05-07T20:03:43.3242519Z 0x000000000000001c (FINI_ARRAYSZ) 8 (bytes) 2025-05-07T20:03:43.3242855Z 0x000000006ffffef5 (GNU_HASH) 0x238 2025-05-07T20:03:43.3243188Z 0x0000000000000005 (STRTAB) 0x2cc8 2025-05-07T20:03:43.3243503Z 0x0000000000000006 (SYMTAB) 0xd00 2025-05-07T20:03:43.3243851Z 0x000000000000000a (STRSZ) 38026 (bytes) 2025-05-07T20:03:43.3244203Z 0x000000000000000b (SYMENT) 24 (bytes) 
2025-05-07T20:03:43.3244554Z 0x0000000000000003 (PLTGOT) 0x4686f0 2025-05-07T20:03:43.3244901Z 0x0000000000000002 (PLTRELSZ) 4752 (bytes) 2025-05-07T20:03:43.3245252Z 0x0000000000000014 (PLTREL) RELA 2025-05-07T20:03:43.3245583Z 0x0000000000000017 (JMPREL) 0xdab0 2025-05-07T20:03:43.3245901Z 0x0000000000000007 (RELA) 0xc508 2025-05-07T20:03:43.3246250Z 0x0000000000000008 (RELASZ) 5544 (bytes) 2025-05-07T20:03:43.3246596Z 0x0000000000000009 (RELAENT) 24 (bytes) 2025-05-07T20:03:43.3246923Z 0x0000000000000018 (BIND_NOW) 2025-05-07T20:03:43.3247268Z 0x000000006ffffffb (FLAGS_1) Flags: NOW 2025-05-07T20:03:43.3247618Z 0x000000006ffffffe (VERNEED) 0xc3f8 2025-05-07T20:03:43.3247950Z 0x000000006fffffff (VERNEEDNUM) 5 2025-05-07T20:03:43.3248264Z 0x000000006ffffff0 (VERSYM) 0xc152 2025-05-07T20:03:43.3248586Z 0x000000006ffffff9 (RELACOUNT) 58 2025-05-07T20:03:43.3248908Z 0x0000000000000000 (NULL) 0x0 2025-05-07T20:03:43.3249109Z 2025-05-07T20:03:43.3249231Z ################################################################################ 2025-05-07T20:03:43.3249459Z 2025-05-07T20:03:43.3249463Z 2025-05-07T20:03:43.3249570Z ################################################################################ 2025-05-07T20:03:43.3250012Z [CHECK] BUILT LIBRARY: ./_skbuild/linux-x86_64-3.13/cmake-build/fbgemm.so 2025-05-07T20:03:43.3250440Z [CHECK] Listing out library size: 2025-05-07T20:03:43.3250825Z + du -h --block-size=1M ./_skbuild/linux-x86_64-3.13/cmake-build/fbgemm.so 2025-05-07T20:03:43.3251148Z 2025-05-07T20:03:43.3251310Z 6 ./_skbuild/linux-x86_64-3.13/cmake-build/fbgemm.so 2025-05-07T20:03:43.3251551Z 2025-05-07T20:03:43.3251879Z [CHECK] Listing out the GLIBC versions referenced by: ./_skbuild/linux-x86_64-3.13/cmake-build/fbgemm.so 2025-05-07T20:03:43.3252767Z + objdump -TC ./_skbuild/linux-x86_64-3.13/cmake-build/fbgemm.so | grep GLIBC_ | sed 's/.*GLIBC_\([.0-9]*\).*/GLIBC_\1/g' | sort -Vu | cat 2025-05-07T20:03:43.3253302Z 2025-05-07T20:03:43.3515044Z GLIBC_2.2.5 
2025-05-07T20:03:43.3515280Z GLIBC_2.3 2025-05-07T20:03:43.3515488Z GLIBC_2.14 2025-05-07T20:03:43.3516135Z 2025-05-07T20:03:43.3516286Z 2025-05-07T20:03:43.3516703Z [CHECK] Listing out the GLIBCXX versions referenced by: ./_skbuild/linux-x86_64-3.13/cmake-build/fbgemm.so 2025-05-07T20:03:43.3517635Z + objdump -TC ./_skbuild/linux-x86_64-3.13/cmake-build/fbgemm.so | grep GLIBCXX_ | sed 's/.*GLIBCXX_\([.0-9]*\).*/GLIBCXX_\1/g' | sort -Vu | cat 2025-05-07T20:03:43.3518209Z 2025-05-07T20:03:43.3785089Z GLIBCXX_3.4 2025-05-07T20:03:43.3785514Z GLIBCXX_3.4.9 2025-05-07T20:03:43.3785761Z GLIBCXX_3.4.11 2025-05-07T20:03:43.3786154Z GLIBCXX_3.4.14 2025-05-07T20:03:43.3786414Z GLIBCXX_3.4.15 2025-05-07T20:03:43.3786638Z GLIBCXX_3.4.18 2025-05-07T20:03:43.3786844Z GLIBCXX_3.4.21 2025-05-07T20:03:43.3786973Z 2025-05-07T20:03:43.3786994Z 2025-05-07T20:03:43.3803888Z + nm -gDC ./_skbuild/linux-x86_64-3.13/cmake-build/fbgemm.so > /tmp/tmp.2DgBVoGS1t.symbols.txt 2025-05-07T20:03:43.3804321Z 2025-05-07T20:03:43.4031321Z 2025-05-07T20:03:43.4056522Z [CHECK] Total Number of symbols: 4957 2025-05-07T20:03:43.4074692Z [CHECK] Number of fbgemm symbols: 3554 2025-05-07T20:03:43.4093270Z + nm -gDCu ./_skbuild/linux-x86_64-3.13/cmake-build/fbgemm.so > /tmp/tmp.Ga4UcPnf8z.usymbols.txt 2025-05-07T20:03:43.4094561Z 2025-05-07T20:03:43.4118479Z 2025-05-07T20:03:43.4143903Z [CHECK] Listing out undefined symbols (135 total): 2025-05-07T20:03:43.4166947Z U _Unwind_Resume@GCC_3.0 2025-05-07T20:03:43.4167960Z U __cxa_allocate_exception@CXXABI_1.3 2025-05-07T20:03:43.4168460Z U __cxa_atexit@GLIBC_2.2.5 2025-05-07T20:03:43.4168935Z U __cxa_begin_catch@CXXABI_1.3 2025-05-07T20:03:43.4169348Z U __cxa_end_catch@CXXABI_1.3 2025-05-07T20:03:43.4169684Z U __cxa_free_exception@CXXABI_1.3 2025-05-07T20:03:43.4170013Z U __cxa_guard_abort@CXXABI_1.3 2025-05-07T20:03:43.4170350Z U __cxa_guard_acquire@CXXABI_1.3 2025-05-07T20:03:43.4170671Z U __cxa_guard_release@CXXABI_1.3 2025-05-07T20:03:43.4171048Z U 
__cxa_init_primary_exception@CXXABI_1.3.11 2025-05-07T20:03:43.4171398Z U __cxa_rethrow@CXXABI_1.3 2025-05-07T20:03:43.4171723Z U __cxa_thread_atexit@CXXABI_1.3.7 2025-05-07T20:03:43.4172039Z U __cxa_throw@CXXABI_1.3 2025-05-07T20:03:43.4172357Z U __extendhfsf2@GCC_12.0.0 2025-05-07T20:03:43.4174585Z U __gxx_personality_v0@CXXABI_1.3 2025-05-07T20:03:43.4175069Z U __once_proxy@GLIBCXX_3.4.11 2025-05-07T20:03:43.4175380Z U __tls_get_addr@GLIBC_2.3 2025-05-07T20:03:43.4175667Z U __truncsfhf2@GCC_12.0.0 2025-05-07T20:03:43.4175962Z U abort@GLIBC_2.2.5 2025-05-07T20:03:43.4176488Z U asmjit::_abi_1_13::BaseEmitter::_emitI(unsigned int, asmjit::_abi_1_13::Operand_ const&) 2025-05-07T20:03:43.4177222Z U asmjit::_abi_1_13::BaseEmitter::_emitI(unsigned int, asmjit::_abi_1_13::Operand_ const&, asmjit::_abi_1_13::Operand_ const&) 2025-05-07T20:03:43.4178156Z U asmjit::_abi_1_13::BaseEmitter::_emitI(unsigned int, asmjit::_abi_1_13::Operand_ const&, asmjit::_abi_1_13::Operand_ const&, asmjit::_abi_1_13::Operand_ const&) 2025-05-07T20:03:43.4179269Z U asmjit::_abi_1_13::BaseEmitter::_emitI(unsigned int, asmjit::_abi_1_13::Operand_ const&, asmjit::_abi_1_13::Operand_ const&, asmjit::_abi_1_13::Operand_ const&, asmjit::_abi_1_13::Operand_ const&) 2025-05-07T20:03:43.4180359Z U asmjit::_abi_1_13::BaseEmitter::emitArgsAssignment(asmjit::_abi_1_13::FuncFrame const&, asmjit::_abi_1_13::FuncArgsAssignment const&) 2025-05-07T20:03:43.4181112Z U asmjit::_abi_1_13::BaseEmitter::emitEpilog(asmjit::_abi_1_13::FuncFrame const&) 2025-05-07T20:03:43.4181667Z U asmjit::_abi_1_13::BaseEmitter::emitProlog(asmjit::_abi_1_13::FuncFrame const&) 2025-05-07T20:03:43.4182265Z U asmjit::_abi_1_13::CodeHolder::CodeHolder(asmjit::_abi_1_13::Support::Temporary const*) 2025-05-07T20:03:43.4182898Z U asmjit::_abi_1_13::CodeHolder::init(asmjit::_abi_1_13::Environment const&, unsigned long) 2025-05-07T20:03:43.4183387Z U asmjit::_abi_1_13::CodeHolder::~CodeHolder() 2025-05-07T20:03:43.4183911Z U 
asmjit::_abi_1_13::FuncArgsAssignment::updateFuncFrame(asmjit::_abi_1_13::FuncFrame&) const 2025-05-07T20:03:43.4184643Z U asmjit::_abi_1_13::FuncDetail::init(asmjit::_abi_1_13::FuncSignature const&, asmjit::_abi_1_13::Environment const&) 2025-05-07T20:03:43.4185194Z U asmjit::_abi_1_13::FuncFrame::finalize() 2025-05-07T20:03:43.4185623Z U asmjit::_abi_1_13::FuncFrame::init(asmjit::_abi_1_13::FuncDetail const&) 2025-05-07T20:03:43.4186211Z U asmjit::_abi_1_13::JitRuntime::JitRuntime(asmjit::_abi_1_13::JitAllocator::CreateParams const*) 2025-05-07T20:03:43.4186741Z U asmjit::_abi_1_13::JitRuntime::~JitRuntime() 2025-05-07T20:03:43.4187289Z U asmjit::_abi_1_13::x86::Assembler::Assembler(asmjit::_abi_1_13::CodeHolder*) 2025-05-07T20:03:43.4187737Z U asmjit::_abi_1_13::x86::Assembler::~Assembler() 2025-05-07T20:03:43.4188082Z U bcmp@GLIBC_2.2.5 2025-05-07T20:03:43.4188350Z U ceilf@GLIBC_2.2.5 2025-05-07T20:03:43.4188642Z U cpuinfo_get_packages 2025-05-07T20:03:43.4188935Z U cpuinfo_get_packages_count 2025-05-07T20:03:43.4189245Z U cpuinfo_initialize 2025-05-07T20:03:43.4189515Z U cpuinfo_isa 2025-05-07T20:03:43.4189780Z U floor@GLIBC_2.2.5 2025-05-07T20:03:43.4190058Z U fma@GLIBC_2.2.5 2025-05-07T20:03:43.4190315Z U fmaf@GLIBC_2.2.5 2025-05-07T20:03:43.4190599Z U free@GLIBC_2.2.5 2025-05-07T20:03:43.4190861Z U fwrite@GLIBC_2.2.5 2025-05-07T20:03:43.4191145Z U getenv@GLIBC_2.2.5 2025-05-07T20:03:43.4191415Z U ldexp@GLIBC_2.2.5 2025-05-07T20:03:43.4191691Z U log2@GLIBC_2.2.5 2025-05-07T20:03:43.4191949Z U log2f@GLIBC_2.2.5 2025-05-07T20:03:43.4192225Z U lrintf@GLIBC_2.2.5 2025-05-07T20:03:43.4192503Z U memcpy@GLIBC_2.14 2025-05-07T20:03:43.4192887Z U memset@GLIBC_2.2.5 2025-05-07T20:03:43.4193394Z U nearbyint@GLIBC_2.2.5 2025-05-07T20:03:43.4193698Z U nearbyintf@GLIBC_2.2.5 2025-05-07T20:03:43.4194035Z U operator delete(void*)@GLIBCXX_3.4 2025-05-07T20:03:43.4194379Z U operator delete[](void*)@GLIBCXX_3.4 2025-05-07T20:03:43.4194750Z U operator new(unsigned 
long)@GLIBCXX_3.4 2025-05-07T20:03:43.4195141Z U operator new[](unsigned long)@GLIBCXX_3.4 2025-05-07T20:03:43.4195508Z U posix_memalign@GLIBC_2.2.5 2025-05-07T20:03:43.4195833Z U sqrtf@GLIBC_2.2.5 2025-05-07T20:03:43.4196244Z U std::_Hash_bytes(void const*, unsigned long, unsigned long)@CXXABI_1.3.5 2025-05-07T20:03:43.4196770Z U std::_Rb_tree_decrement(std::_Rb_tree_node_base*)@GLIBCXX_3.4 2025-05-07T20:03:43.4197233Z U std::_Rb_tree_increment(std::_Rb_tree_node_base*)@GLIBCXX_3.4 2025-05-07T20:03:43.4197937Z U std::_Rb_tree_insert_and_rebalance(bool, std::_Rb_tree_node_base*, std::_Rb_tree_node_base*, std::_Rb_tree_node_base&)@GLIBCXX_3.4 2025-05-07T20:03:43.4198722Z U std::__atomic_futex_unsigned_base::_M_futex_notify_all(unsigned int*)@GLIBCXX_3.4.21 2025-05-07T20:03:43.4199866Z U std::__atomic_futex_unsigned_base::_M_futex_wait_until(unsigned int*, unsigned int, bool, std::chrono::duration >, std::chrono::duration >)@GLIBCXX_3.4.21 2025-05-07T20:03:43.4201062Z U std::__cxx11::basic_string, std::allocator >::_M_create(unsigned long&, unsigned long)@GLIBCXX_3.4.21 2025-05-07T20:03:43.4202441Z U std::__cxx11::basic_string, std::allocator >::_M_replace(unsigned long, unsigned long, char const*, unsigned long)@GLIBCXX_3.4.21 2025-05-07T20:03:43.4203482Z U std::__cxx11::basic_string, std::allocator >::compare(char const*) const@GLIBCXX_3.4.21 2025-05-07T20:03:43.4204347Z U std::__detail::_Prime_rehash_policy::_M_need_rehash(unsigned long, unsigned long, unsigned long) const@GLIBCXX_3.4.18 2025-05-07T20:03:43.4205113Z U std::__detail::_Prime_rehash_policy::_M_next_bkt(unsigned long) const@GLIBCXX_3.4.18 2025-05-07T20:03:43.4205632Z U std::__exception_ptr::exception_ptr::_M_addref() 2025-05-07T20:03:43.4206061Z U std::__exception_ptr::exception_ptr::_M_release() 2025-05-07T20:03:43.4206557Z U std::__exception_ptr::exception_ptr::exception_ptr(void*)@CXXABI_1.3.11 2025-05-07T20:03:43.4207976Z U std::__future_base::_Result_base::_Result_base()@GLIBCXX_3.4.15 
2025-05-07T20:03:43.4208479Z U std::__future_base::_Result_base::~_Result_base()@GLIBCXX_3.4.15 2025-05-07T20:03:43.4208895Z U std::__once_call@GLIBCXX_3.4.11 2025-05-07T20:03:43.4209259Z U std::__once_callable@GLIBCXX_3.4.11 2025-05-07T20:03:43.4209625Z U std::__throw_bad_alloc()@GLIBCXX_3.4 2025-05-07T20:03:43.4209971Z U std::__throw_bad_array_new_length() 2025-05-07T20:03:43.4210321Z U std::__throw_bad_cast()@GLIBCXX_3.4 2025-05-07T20:03:43.4210685Z U std::__throw_bad_function_call()@GLIBCXX_3.4.14 2025-05-07T20:03:43.4211096Z U std::__throw_future_error(int)@GLIBCXX_3.4.14 2025-05-07T20:03:43.4211492Z U std::__throw_length_error(char const*)@GLIBCXX_3.4 2025-05-07T20:03:43.4211903Z U std::__throw_system_error(int)@GLIBCXX_3.4.11 2025-05-07T20:03:43.4212299Z U std::bad_alloc::~bad_alloc()@GLIBCXX_3.4 2025-05-07T20:03:43.4213137Z U std::basic_ostream >& std::__ostream_insert >(std::basic_ostream >&, char const*, long)@GLIBCXX_3.4.9 2025-05-07T20:03:43.4213999Z U std::cerr@GLIBCXX_3.4 2025-05-07T20:03:43.4214310Z U std::cout@GLIBCXX_3.4 2025-05-07T20:03:43.4214691Z U std::ctype::_M_widen_init() const@GLIBCXX_3.4.11 2025-05-07T20:03:43.4215222Z U std::future_category()@GLIBCXX_3.4.15 2025-05-07T20:03:43.4215752Z U std::future_error::~future_error()@GLIBCXX_3.4.14 2025-05-07T20:03:43.4216187Z U std::ios_base::Init::Init()@GLIBCXX_3.4 2025-05-07T20:03:43.4216542Z U std::ios_base::Init::~Init()@GLIBCXX_3.4 2025-05-07T20:03:43.4217202Z U std::logic_error::logic_error(std::__cxx11::basic_string, std::allocator > const&)@GLIBCXX_3.4.21 2025-05-07T20:03:43.4218133Z U std::logic_error::logic_error(std::logic_error const&)@GLIBCXX_3.4.21 2025-05-07T20:03:43.4218732Z U std::ostream& std::ostream::_M_insert(double)@GLIBCXX_3.4.9 2025-05-07T20:03:43.4219260Z U std::ostream& std::ostream::_M_insert(long)@GLIBCXX_3.4.9 2025-05-07T20:03:43.4219804Z U std::ostream& std::ostream::_M_insert(unsigned long)@GLIBCXX_3.4.9 2025-05-07T20:03:43.4220292Z U std::ostream::flush()@GLIBCXX_3.4 
2025-05-07T20:03:43.4220655Z U std::ostream::operator<<(int)@GLIBCXX_3.4 2025-05-07T20:03:43.4221012Z U std::ostream::put(char)@GLIBCXX_3.4 2025-05-07T20:03:43.4221479Z U std::rethrow_exception(std::__exception_ptr::exception_ptr)@CXXABI_1.3.3 2025-05-07T20:03:43.4222008Z U std::runtime_error::runtime_error(char const*)@GLIBCXX_3.4.21 2025-05-07T20:03:43.4222466Z U std::runtime_error::~runtime_error()@GLIBCXX_3.4 2025-05-07T20:03:43.4222841Z U std::terminate()@GLIBCXX_3.4 2025-05-07T20:03:43.4223152Z U stderr@GLIBC_2.2.5 2025-05-07T20:03:43.4223456Z U strcmp@GLIBC_2.2.5 2025-05-07T20:03:43.4223746Z U strlen@GLIBC_2.2.5 2025-05-07T20:03:43.4224047Z U strstr@GLIBC_2.2.5 2025-05-07T20:03:43.4224334Z U tolower@GLIBC_2.2.5 2025-05-07T20:03:43.4224641Z U toupper@GLIBC_2.2.5 2025-05-07T20:03:43.4225033Z U typeinfo for std::__future_base::_Result_base@GLIBCXX_3.4.15 2025-05-07T20:03:43.4225477Z U typeinfo for std::bad_alloc@GLIBCXX_3.4 2025-05-07T20:03:43.4225876Z U typeinfo for std::future_error@GLIBCXX_3.4.14 2025-05-07T20:03:43.4226260Z U typeinfo for std::runtime_error@GLIBCXX_3.4 2025-05-07T20:03:43.4226788Z U vtable for __cxxabiv1::__class_type_info@CXXABI_1.3 2025-05-07T20:03:43.4227215Z U vtable for __cxxabiv1::__si_class_type_info@CXXABI_1.3 2025-05-07T20:03:43.4227628Z U vtable for std::bad_alloc@GLIBCXX_3.4 2025-05-07T20:03:43.4228007Z U vtable for std::future_error@GLIBCXX_3.4.14 2025-05-07T20:03:43.4228365Z w _ITM_deregisterTMCloneTable 2025-05-07T20:03:43.4228715Z w _ITM_registerTMCloneTable 2025-05-07T20:03:43.4229032Z w __cxa_finalize@GLIBC_2.2.5 2025-05-07T20:03:43.4229352Z w __gmon_start__ 2025-05-07T20:03:43.4229630Z w __pthread_key_create 2025-05-07T20:03:43.4229957Z w pthread_mutex_lock@GLIBC_2.2.5 2025-05-07T20:03:43.4230292Z w pthread_mutex_unlock@GLIBC_2.2.5 2025-05-07T20:03:43.4230626Z w pthread_once 2025-05-07T20:03:43.4230914Z w pthread_rwlock_rdlock 2025-05-07T20:03:43.4231224Z w pthread_rwlock_unlock 2025-05-07T20:03:43.4231538Z w 
pthread_rwlock_wrlock 2025-05-07T20:03:43.4231838Z w pthread_self@GLIBC_2.2.5 2025-05-07T20:03:43.4232209Z [CHECK] Listing out external shared libraries linked: 2025-05-07T20:03:43.4232616Z + ldd ./_skbuild/linux-x86_64-3.13/cmake-build/fbgemm.so 2025-05-07T20:03:43.4232998Z 2025-05-07T20:03:43.4233179Z linux-vdso.so.1 (0x00007ffd4e9dd000) 2025-05-07T20:03:43.4233493Z libc10.so => not found 2025-05-07T20:03:43.4234018Z asmjit.so => /__w/FBGEMM/FBGEMM/fbgemm_gpu/./_skbuild/linux-x86_64-3.13/cmake-build/asmjit.so (0x00007f63d24c7000) 2025-05-07T20:03:43.4234609Z libtorch.so => not found 2025-05-07T20:03:43.4234868Z libtorch_cpu.so => not found 2025-05-07T20:03:43.4235190Z libtorch_cuda.so => not found 2025-05-07T20:03:43.4235528Z libstdc++.so.6 => /lib64/libstdc++.so.6 (0x00007f63d1b9c000) 2025-05-07T20:03:43.4235944Z libm.so.6 => /lib64/libm.so.6 (0x00007f63d23ea000) 2025-05-07T20:03:43.4236348Z libgcc_s.so.1 => /lib64/libgcc_s.so.1 (0x00007f63d23bc000) 2025-05-07T20:03:43.4236731Z libc.so.6 => /lib64/libc.so.6 (0x00007f63d1994000) 2025-05-07T20:03:43.4237109Z /lib64/ld-linux-x86-64.so.2 (0x00007f63d2543000) 2025-05-07T20:03:43.4237443Z libtorch_cpu.so => not found 2025-05-07T20:03:43.4237734Z libtorch_cuda.so => not found 2025-05-07T20:03:43.4238005Z libtorch.so => not found 2025-05-07T20:03:43.4238329Z libgomp.so.1 => /lib64/libgomp.so.1 (0x00007f63d2364000) 2025-05-07T20:03:43.4238719Z librt.so.1 => /lib64/librt.so.1 (0x00007f63d198f000) 2025-05-07T20:03:43.4239149Z libpthread.so.0 => /lib64/libpthread.so.0 (0x00007f63d198a000) 2025-05-07T20:03:43.4239436Z 2025-05-07T20:03:43.4239561Z [CHECK] Displaying ELF information: 2025-05-07T20:03:43.4239938Z + readelf -d ./_skbuild/linux-x86_64-3.13/cmake-build/fbgemm.so 2025-05-07T20:03:43.4240223Z 2025-05-07T20:03:43.4257267Z 2025-05-07T20:03:43.4258014Z Dynamic section at offset 0x54e508 contains 37 entries: 2025-05-07T20:03:43.4259203Z Tag Type Name/Value 2025-05-07T20:03:43.4260486Z 0x0000000000000001 (NEEDED) Shared 
library: [libc10.so] 2025-05-07T20:03:43.4261955Z 0x0000000000000001 (NEEDED) Shared library: [asmjit.so] 2025-05-07T20:03:43.4263416Z 0x0000000000000001 (NEEDED) Shared library: [libtorch.so] 2025-05-07T20:03:43.4264954Z 0x0000000000000001 (NEEDED) Shared library: [libtorch_cpu.so] 2025-05-07T20:03:43.4266487Z 0x0000000000000001 (NEEDED) Shared library: [libtorch_cuda.so] 2025-05-07T20:03:43.4267803Z 0x0000000000000001 (NEEDED) Shared library: [libstdc++.so.6] 2025-05-07T20:03:43.4268336Z 0x0000000000000001 (NEEDED) Shared library: [libm.so.6] 2025-05-07T20:03:43.4268848Z 0x0000000000000001 (NEEDED) Shared library: [libgcc_s.so.1] 2025-05-07T20:03:43.4269367Z 0x0000000000000001 (NEEDED) Shared library: [libc.so.6] 2025-05-07T20:03:43.4269886Z 0x0000000000000001 (NEEDED) Shared library: [ld-linux-x86-64.so.2] 2025-05-07T20:03:43.4270595Z 0x000000000000000e (SONAME) Library soname: [fbgemm.so] 2025-05-07T20:03:43.4271192Z 0x000000000000000f (RPATH) Library rpath: [$ORIGIN] 2025-05-07T20:03:43.4271607Z 0x000000000000000c (INIT) 0xfd000 2025-05-07T20:03:43.4272131Z 0x000000000000000d (FINI) 0x4c1d18 2025-05-07T20:03:43.4272474Z 0x0000000000000019 (INIT_ARRAY) 0x54b000 2025-05-07T20:03:43.4273013Z 0x000000000000001b (INIT_ARRAYSZ) 1224 (bytes) 2025-05-07T20:03:43.4273385Z 0x000000000000001a (FINI_ARRAY) 0x54b4c8 2025-05-07T20:03:43.4273748Z 0x000000000000001c (FINI_ARRAYSZ) 8 (bytes) 2025-05-07T20:03:43.4274130Z 0x000000006ffffef5 (GNU_HASH) 0x238 2025-05-07T20:03:43.4274484Z 0x0000000000000005 (STRTAB) 0x24e38 2025-05-07T20:03:43.4274820Z 0x0000000000000006 (SYMTAB) 0x7d68 2025-05-07T20:03:43.4275179Z 0x000000000000000a (STRSZ) 754916 (bytes) 2025-05-07T20:03:43.4275564Z 0x000000000000000b (SYMENT) 24 (bytes) 2025-05-07T20:03:43.4275912Z 0x0000000000000003 (PLTGOT) 0x54e798 2025-05-07T20:03:43.4276295Z 0x0000000000000002 (PLTRELSZ) 26136 (bytes) 2025-05-07T20:03:43.4276641Z 0x0000000000000014 (PLTREL) RELA 2025-05-07T20:03:43.4277039Z 0x0000000000000017 (JMPREL) 
0xf6768 2025-05-07T20:03:43.4277374Z 0x0000000000000007 (RELA) 0xdfb48 2025-05-07T20:03:43.4277736Z 0x0000000000000008 (RELASZ) 93216 (bytes) 2025-05-07T20:03:43.4278108Z 0x0000000000000009 (RELAENT) 24 (bytes) 2025-05-07T20:03:43.4278434Z 0x0000000000000018 (BIND_NOW) 2025-05-07T20:03:43.4278811Z 0x000000006ffffffb (FLAGS_1) Flags: NOW 2025-05-07T20:03:43.4279165Z 0x000000006ffffffe (VERNEED) 0xdf9d8 2025-05-07T20:03:43.4279511Z 0x000000006fffffff (VERNEEDNUM) 5 2025-05-07T20:03:43.4279836Z 0x000000006ffffff0 (VERSYM) 0xdd31c 2025-05-07T20:03:43.4280184Z 0x000000006ffffff9 (RELACOUNT) 155 2025-05-07T20:03:43.4280497Z 0x0000000000000000 (NULL) 0x0 2025-05-07T20:03:43.4280717Z 2025-05-07T20:03:43.4280834Z ################################################################################ 2025-05-07T20:03:43.4281062Z 2025-05-07T20:03:43.4281066Z 2025-05-07T20:03:43.4281203Z ################################################################################ 2025-05-07T20:03:43.4281698Z [CHECK] BUILT LIBRARY: ./_skbuild/linux-x86_64-3.13/cmake-build/fbgemm_gpu_tbe_common.so 2025-05-07T20:03:43.4282209Z [CHECK] Listing out library size: 2025-05-07T20:03:43.4282668Z + du -h --block-size=1M ./_skbuild/linux-x86_64-3.13/cmake-build/fbgemm_gpu_tbe_common.so 2025-05-07T20:03:43.4283059Z 2025-05-07T20:03:43.4283273Z 2 ./_skbuild/linux-x86_64-3.13/cmake-build/fbgemm_gpu_tbe_common.so 2025-05-07T20:03:43.4283575Z 2025-05-07T20:03:43.4283974Z [CHECK] Listing out the GLIBC versions referenced by: ./_skbuild/linux-x86_64-3.13/cmake-build/fbgemm_gpu_tbe_common.so 2025-05-07T20:03:43.4285117Z + objdump -TC ./_skbuild/linux-x86_64-3.13/cmake-build/fbgemm_gpu_tbe_common.so | grep GLIBC_ | sed 's/.*GLIBC_\([.0-9]*\).*/GLIBC_\1/g' | sort -Vu | cat 2025-05-07T20:03:43.4285824Z 2025-05-07T20:03:43.4336314Z GLIBC_2.2.5 2025-05-07T20:03:43.4336948Z GLIBC_2.14 2025-05-07T20:03:43.4337304Z 2025-05-07T20:03:43.4337341Z 2025-05-07T20:03:43.4338560Z [CHECK] Listing out the GLIBCXX versions referenced 
by: ./_skbuild/linux-x86_64-3.13/cmake-build/fbgemm_gpu_tbe_common.so 2025-05-07T20:03:43.4341599Z + objdump -TC ./_skbuild/linux-x86_64-3.13/cmake-build/fbgemm_gpu_tbe_common.so | grep GLIBCXX_ | sed 's/.*GLIBCXX_\([.0-9]*\).*/GLIBCXX_\1/g' | sort -Vu | cat 2025-05-07T20:03:43.4343422Z 2025-05-07T20:03:43.4399314Z GLIBCXX_3.4 2025-05-07T20:03:43.4399948Z GLIBCXX_3.4.9 2025-05-07T20:03:43.4400550Z GLIBCXX_3.4.14 2025-05-07T20:03:43.4401117Z GLIBCXX_3.4.20 2025-05-07T20:03:43.4401542Z GLIBCXX_3.4.21 2025-05-07T20:03:43.4401885Z 2025-05-07T20:03:43.4401890Z 2025-05-07T20:03:43.4421095Z + nm -gDC ./_skbuild/linux-x86_64-3.13/cmake-build/fbgemm_gpu_tbe_common.so > /tmp/tmp.6WlOsCdv7u.symbols.txt 2025-05-07T20:03:43.4421920Z 2025-05-07T20:03:43.4452455Z 2025-05-07T20:03:43.4476879Z [CHECK] Total Number of symbols: 540 2025-05-07T20:03:43.4493699Z [CHECK] Number of fbgemm symbols: 48 2025-05-07T20:03:43.4508266Z + nm -gDCu ./_skbuild/linux-x86_64-3.13/cmake-build/fbgemm_gpu_tbe_common.so > /tmp/tmp.uWSq1rUDGg.usymbols.txt 2025-05-07T20:03:43.4509737Z 2025-05-07T20:03:43.4527085Z 2025-05-07T20:03:43.4564580Z [CHECK] Listing out undefined symbols (183 total): 2025-05-07T20:03:43.4585094Z U VTT for std::__cxx11::basic_ostringstream, std::allocator >@GLIBCXX_3.4.21 2025-05-07T20:03:43.4585784Z U _Unwind_Resume@GCC_3.0 2025-05-07T20:03:43.4586163Z U __cudaPopCallConfiguration@libcudart.so.11.0 2025-05-07T20:03:43.4586599Z U __cudaPushCallConfiguration@libcudart.so.11.0 2025-05-07T20:03:43.4587001Z U __cudaRegisterFatBinary@libcudart.so.11.0 2025-05-07T20:03:43.4587410Z U __cudaRegisterFatBinaryEnd@libcudart.so.11.0 2025-05-07T20:03:43.4587796Z U __cudaRegisterFunction@libcudart.so.11.0 2025-05-07T20:03:43.4588171Z U __cudaRegisterVar@libcudart.so.11.0 2025-05-07T20:03:43.4588716Z U __cudaUnregisterFatBinary@libcudart.so.11.0 2025-05-07T20:03:43.4589099Z U __cxa_allocate_exception@CXXABI_1.3 2025-05-07T20:03:43.4589446Z U __cxa_atexit@GLIBC_2.2.5 2025-05-07T20:03:43.4589760Z U 
__cxa_begin_catch@CXXABI_1.3 2025-05-07T20:03:43.4590094Z U __cxa_end_catch@CXXABI_1.3 2025-05-07T20:03:43.4590476Z U __cxa_free_exception@CXXABI_1.3 2025-05-07T20:03:43.4590822Z U __cxa_guard_abort@CXXABI_1.3 2025-05-07T20:03:43.4591140Z U __cxa_guard_acquire@CXXABI_1.3 2025-05-07T20:03:43.4591485Z U __cxa_guard_release@CXXABI_1.3 2025-05-07T20:03:43.4591816Z U __cxa_rethrow@CXXABI_1.3 2025-05-07T20:03:43.4592120Z U __cxa_throw@CXXABI_1.3 2025-05-07T20:03:43.4592444Z U __gxx_personality_v0@CXXABI_1.3 2025-05-07T20:03:43.4593080Z U at::Tensor::index_put_(std::initializer_list, at::Tensor const&) 2025-05-07T20:03:43.4593689Z U at::TensorBase::__dispatch_contiguous(c10::MemoryFormat) const 2025-05-07T20:03:43.4594172Z U at::_ops::copy_::call(at::Tensor&, at::Tensor const&, bool) 2025-05-07T20:03:43.4595111Z U at::_ops::empty_memory_format::call(c10::ArrayRef, std::optional, std::optional, std::optional, std::optional, std::optional) 2025-05-07T20:03:43.4596050Z U at::_ops::is_nonzero::call(at::Tensor const&) 2025-05-07T20:03:43.4596496Z U at::_ops::reshape::call(at::Tensor const&, c10::ArrayRef) 2025-05-07T20:03:43.4597000Z U at::_ops::select_int::call(at::Tensor const&, long, c10::SymInt) 2025-05-07T20:03:43.4597674Z U at::_ops::slice_Tensor::call(at::Tensor const&, long, std::optional, std::optional, c10::SymInt) 2025-05-07T20:03:43.4598915Z U at::_ops::to_dtype_layout::call(at::Tensor const&, std::optional, std::optional, std::optional, std::optional, bool, bool, std::optional) 2025-05-07T20:03:43.4599735Z U at::_ops::zero_::call(at::Tensor&) 2025-05-07T20:03:43.4600504Z U at::_ops::zeros_like::call(at::Tensor const&, std::optional, std::optional, std::optional, std::optional, std::optional) 2025-05-07T20:03:43.4601271Z U at::cuda::getCurrentDeviceProperties() 2025-05-07T20:03:43.4601705Z U at::get_num_threads() 2025-05-07T20:03:43.4601979Z U at::get_thread_num() 2025-05-07T20:03:43.4602671Z U at::internal::set_thread_num(int) 2025-05-07T20:03:43.4603048Z U 
at::toAccumulateType(c10::ScalarType, bool) 2025-05-07T20:03:43.4603393Z U c10::BoolType::get() 2025-05-07T20:03:43.4603755Z U c10::DeviceTypeName[abi:cxx11](c10::DeviceType, bool) 2025-05-07T20:03:43.4604409Z U c10::Error::Error(c10::SourceLocation, std::__cxx11::basic_string, std::allocator >) 2025-05-07T20:03:43.4605023Z U c10::Error::what() const 2025-05-07T20:03:43.4605395Z U c10::Half* at::TensorBase::data_ptr() const 2025-05-07T20:03:43.4605841Z U c10::Half* at::TensorBase::mutable_data_ptr() const 2025-05-07T20:03:43.4606284Z U c10::IValue::reportToTensorTypeError() const 2025-05-07T20:03:43.4606636Z U c10::IntType::get() 2025-05-07T20:03:43.4607017Z U c10::MessageLogger::MessageLogger(char const*, int, int) 2025-05-07T20:03:43.4607420Z U c10::MessageLogger::~MessageLogger() 2025-05-07T20:03:43.4607925Z U c10::OptionalType::get(c10::Type::SingletonOrSharedTypePtr) 2025-05-07T20:03:43.4608432Z U c10::ParallelGuard::ParallelGuard(bool) 2025-05-07T20:03:43.4608905Z U c10::ParallelGuard::~ParallelGuard() 2025-05-07T20:03:43.4609287Z U c10::SymBool::expect_true(char const*, long) const 2025-05-07T20:03:43.4609669Z U c10::SymBool::guard_bool(char const*, long) const 2025-05-07T20:03:43.4610387Z U c10::SymInt::SymInt(c10::intrusive_ptr >) 2025-05-07T20:03:43.4611040Z U c10::SymInt::guard_int(char const*, long) const 2025-05-07T20:03:43.4611423Z U c10::SymInt::operator/(c10::SymInt const&) const 2025-05-07T20:03:43.4611807Z U c10::SymInt::promote_to_negative() 2025-05-07T20:03:43.4612219Z U c10::SymInt::sym_ne(c10::SymInt const&) const 2025-05-07T20:03:43.4612587Z U c10::SymInt::toSymNode() const 2025-05-07T20:03:43.4612910Z U c10::SymIntType::get() 2025-05-07T20:03:43.4613291Z U c10::SymbolicShapeMeta::init_is_contiguous() const 2025-05-07T20:03:43.4613654Z U c10::TensorType::get() 2025-05-07T20:03:43.4614003Z U c10::UndefinedTensorImpl::_singleton 2025-05-07T20:03:43.4614920Z U c10::Warning::Warning(std::variant, c10::SourceLocation const&, 
std::__cxx11::basic_string, std::allocator >, bool) 2025-05-07T20:03:43.4615883Z U c10::cuda::CUDACachingAllocator::allocator 2025-05-07T20:03:43.4616328Z U c10::cuda::CUDAKernelLaunchRegistry::get_singleton_ref() 2025-05-07T20:03:43.4616858Z U c10::cuda::CUDAKernelLaunchRegistry::get_uvm_assertions_ptr_for_current_device() 2025-05-07T20:03:43.4617557Z U c10::cuda::CUDAKernelLaunchRegistry::insert(char const*, char const*, unsigned int, char const*, int) 2025-05-07T20:03:43.4618119Z U c10::cuda::CUDAStream::stream() const 2025-05-07T20:03:43.4618472Z U c10::cuda::ExchangeDevice(signed char) 2025-05-07T20:03:43.4618834Z U c10::cuda::GetDevice(signed char*) 2025-05-07T20:03:43.4619178Z U c10::cuda::MaybeSetDevice(signed char) 2025-05-07T20:03:43.4619549Z U c10::cuda::SetDevice(signed char) 2025-05-07T20:03:43.4620007Z U c10::cuda::c10_cuda_check_implementation(int, char const*, char const*, int, bool) 2025-05-07T20:03:43.4620555Z U c10::cuda::device_count() 2025-05-07T20:03:43.4635376Z U c10::cuda::getCurrentCUDAStream(signed char) 2025-05-07T20:03:43.4635909Z U c10::cuda::getDefaultCUDAStream(signed char) 2025-05-07T20:03:43.4636336Z U c10::cuda::getStreamFromPool(bool, signed char) 2025-05-07T20:03:43.4636777Z U c10::cuda::getStreamFromPool(int, signed char) 2025-05-07T20:03:43.4637201Z U c10::cuda::setCurrentCUDAStream(c10::cuda::CUDAStream) 2025-05-07T20:03:43.4637599Z U c10::cuda::warn_or_error_on_sync() 2025-05-07T20:03:43.4638353Z U c10::detail::infer_schema::make_function_schema(c10::ArrayRef, c10::ArrayRef) 2025-05-07T20:03:43.4639274Z U c10::detail::torchCheckFail(char const*, char const*, unsigned int, char const*) 2025-05-07T20:03:43.4640171Z U c10::detail::torchCheckFail(char const*, char const*, unsigned int, std::__cxx11::basic_string, std::allocator > const&) 2025-05-07T20:03:43.4641149Z U c10::detail::torchInternalAssertFail(char const*, char const*, unsigned int, char const*, char const*) 2025-05-07T20:03:43.4642328Z U 
c10::detail::torchInternalAssertFail(char const*, char const*, unsigned int, char const*, std::__cxx11::basic_string, std::allocator > const&) 2025-05-07T20:03:43.4643175Z U c10::impl::GPUTrace::gpuTraceState 2025-05-07T20:03:43.4643512Z U c10::impl::GPUTrace::haveState 2025-05-07T20:03:43.4643865Z U c10::impl::device_guard_impl_registry 2025-05-07T20:03:43.4644245Z U c10::operator-(c10::SymInt const&, int) 2025-05-07T20:03:43.4644602Z U c10::operator<(c10::SymInt const&, int) 2025-05-07T20:03:43.4644995Z U c10::operator<<(std::ostream&, c10::Device const&) 2025-05-07T20:03:43.4645406Z U c10::operator<<(std::ostream&, c10::DeviceType) 2025-05-07T20:03:43.4645825Z U c10::operator<<(std::ostream&, c10::SymInt const&) 2025-05-07T20:03:43.4645958Z U c10::operator>(c10::SymInt const&, int) 2025-05-07T20:03:43.4646088Z U c10::operator>=(c10::SymInt const&, int) 2025-05-07T20:03:43.4646214Z U c10::warn(c10::Warning const&) 2025-05-07T20:03:43.4646415Z U caffe2::TypeMeta::error_unsupported_typemeta(caffe2::TypeMeta) 2025-05-07T20:03:43.4646546Z U cudaDeviceSynchronize@libcudart.so.11.0 2025-05-07T20:03:43.4646696Z U cudaEventCreateWithFlags@libcudart.so.11.0 2025-05-07T20:03:43.4646818Z U cudaEventDestroy@libcudart.so.11.0 2025-05-07T20:03:43.4646943Z U cudaEventElapsedTime@libcudart.so.11.0 2025-05-07T20:03:43.4647062Z U cudaEventQuery@libcudart.so.11.0 2025-05-07T20:03:43.4647195Z U cudaEventRecord@libcudart.so.11.0 2025-05-07T20:03:43.4647328Z U cudaEventSynchronize@libcudart.so.11.0 2025-05-07T20:03:43.4647452Z U cudaGetErrorString@libcudart.so.11.0 2025-05-07T20:03:43.4647576Z U cudaGetLastError@libcudart.so.11.0 2025-05-07T20:03:43.4647696Z U cudaLaunchKernel@libcudart.so.11.0 2025-05-07T20:03:43.4647817Z U cudaStreamQuery@libcudart.so.11.0 2025-05-07T20:03:43.4647965Z U cudaStreamSynchronize@libcudart.so.11.0 2025-05-07T20:03:43.4648098Z U cudaStreamWaitEvent@libcudart.so.11.0 2025-05-07T20:03:43.4649115Z U fbgemm::EmbeddingSpMDMKernelSignature::Type 
fbgemm::GenerateEmbeddingSpMDMWithStrides(long, bool, bool, int, bool, bool, long, long, bool, bool, bool, bool) 2025-05-07T20:03:43.4649867Z U fbgemm::EmbeddingSpMDMKernelSignature::Type fbgemm::GenerateEmbeddingSpMDMWithStrides(long, bool, bool, int, bool, bool, long, long, bool, bool, bool, bool) 2025-05-07T20:03:43.4650658Z U fbgemm::EmbeddingSpMDMKernelSignature::Type fbgemm::GenerateEmbeddingSpMDMWithStrides(long, bool, bool, int, bool, bool, long, long, bool, bool, bool, bool) 2025-05-07T20:03:43.4651406Z U fbgemm::EmbeddingSpMDMKernelSignature::Type fbgemm::GenerateEmbeddingSpMDMWithStrides(long, bool, bool, int, bool, bool, long, long, bool, bool, bool, bool) 2025-05-07T20:03:43.4652194Z U fbgemm::EmbeddingSpMDMKernelSignature::Type fbgemm::GenerateEmbeddingSpMDMWithStrides(long, bool, bool, int, bool, bool, long, long, bool, bool, bool, bool) 2025-05-07T20:03:43.4652977Z U fbgemm::EmbeddingSpMDMKernelSignature::Type fbgemm::GenerateEmbeddingSpMDMWithStrides(long, bool, bool, int, bool, bool, long, long, bool, bool, bool, bool) 2025-05-07T20:03:43.4653813Z U fbgemm::EmbeddingSpMDMKernelSignature::Type fbgemm::GenerateEmbeddingSpMDMWithStrides(long, bool, bool, int, bool, bool, long, long, bool, bool, bool, bool) 2025-05-07T20:03:43.4654609Z U fbgemm::EmbeddingSpMDMKernelSignature::Type fbgemm::GenerateEmbeddingSpMDMWithStrides(long, bool, bool, int, bool, bool, long, long, bool, bool, bool, bool) 2025-05-07T20:03:43.4655447Z U fbgemm::EmbeddingSpMDMKernelSignature::Type fbgemm::GenerateEmbeddingSpMDMWithStrides(long, bool, bool, int, bool, bool, long, long, bool, bool, bool, bool) 2025-05-07T20:03:43.4656253Z U fbgemm::EmbeddingSpMDMKernelSignature::Type fbgemm::GenerateEmbeddingSpMDMWithStrides(long, bool, bool, int, bool, bool, long, long, bool, bool, bool, bool) 2025-05-07T20:03:43.4657057Z U fbgemm::EmbeddingSpMDMKernelSignature::Type fbgemm::GenerateEmbeddingSpMDMWithStrides(long, bool, bool, int, bool, bool, long, long, bool, bool, bool, bool) 
2025-05-07T20:03:43.4657868Z U fbgemm::EmbeddingSpMDMKernelSignature::Type fbgemm::GenerateEmbeddingSpMDMWithStrides(long, bool, bool, int, bool, bool, long, long, bool, bool, bool, bool) 2025-05-07T20:03:43.4658071Z U fbgemm::fbgemmAlignedAlloc(unsigned long, unsigned long, bool) 2025-05-07T20:03:43.4658184Z U fbgemm::fbgemmAlignedFree(void*) 2025-05-07T20:03:43.4658434Z U fbgemm_gpu::config::is_feature_enabled(fbgemm_gpu::config::FeatureGateName const&) 2025-05-07T20:03:43.4658588Z U float* at::TensorBase::data_ptr() const 2025-05-07T20:03:43.4658744Z U float* at::TensorBase::mutable_data_ptr() const 2025-05-07T20:03:43.4658865Z U int* at::TensorBase::data_ptr() const 2025-05-07T20:03:43.4659013Z U int* at::TensorBase::mutable_data_ptr() const 2025-05-07T20:03:43.4659182Z U long c10::detail::maybe_wrap_dim_slow(long, long, bool) 2025-05-07T20:03:43.4659310Z U long* at::TensorBase::data_ptr() const 2025-05-07T20:03:43.4659452Z U long* at::TensorBase::mutable_data_ptr() const 2025-05-07T20:03:43.4659560Z U memcpy@GLIBC_2.14 2025-05-07T20:03:43.4659652Z U memset@GLIBC_2.2.5 2025-05-07T20:03:43.4659755Z U omp_get_max_threads@OMP_1.0 2025-05-07T20:03:43.4659927Z U omp_get_thread_num@OMP_1.0 2025-05-07T20:03:43.4660036Z U operator delete(void*)@GLIBCXX_3.4 2025-05-07T20:03:43.4660154Z U operator new(unsigned long)@GLIBCXX_3.4 2025-05-07T20:03:43.4660497Z U std::__cxx11::basic_ostringstream, std::allocator >::basic_ostringstream() 2025-05-07T20:03:43.4660868Z U std::__cxx11::basic_ostringstream, std::allocator >::~basic_ostringstream()@GLIBCXX_3.4.21 2025-05-07T20:03:43.4661258Z U std::__cxx11::basic_string, std::allocator >::_M_append(char const*, unsigned long)@GLIBCXX_3.4.21 2025-05-07T20:03:43.4661805Z U std::__cxx11::basic_string, std::allocator >::_M_assign(std::__cxx11::basic_string, std::allocator > const&) 2025-05-07T20:03:43.4662178Z U std::__cxx11::basic_string, std::allocator >::_M_construct(unsigned long, char)@GLIBCXX_3.4.21 2025-05-07T20:03:43.4662586Z U 
std::__cxx11::basic_string, std::allocator >::_M_create(unsigned long&, unsigned long)@GLIBCXX_3.4.21 2025-05-07T20:03:43.4663045Z U std::__cxx11::basic_string, std::allocator >::_M_mutate(unsigned long, unsigned long, char const*, unsigned long) 2025-05-07T20:03:43.4663544Z U std::__cxx11::basic_string, std::allocator >::_M_replace(unsigned long, unsigned long, char const*, unsigned long)@GLIBCXX_3.4.21 2025-05-07T20:03:43.4663735Z U std::__throw_bad_function_call()@GLIBCXX_3.4.14 2025-05-07T20:03:43.4663878Z U std::__throw_length_error(char const*)@GLIBCXX_3.4 2025-05-07T20:03:43.4664017Z U std::__throw_logic_error(char const*)@GLIBCXX_3.4 2025-05-07T20:03:43.4664209Z U std::__throw_out_of_range_fmt(char const*, ...)@GLIBCXX_3.4.20 2025-05-07T20:03:43.4664446Z U std::basic_ios >::clear(std::_Ios_Iostate)@GLIBCXX_3.4 2025-05-07T20:03:43.4664997Z U std::basic_ostream >& std::__ostream_insert >(std::basic_ostream >&, char const*, long)@GLIBCXX_3.4.9 2025-05-07T20:03:43.4665137Z U std::exception::~exception()@GLIBCXX_3.4 2025-05-07T20:03:43.4665254Z U std::ios_base::Init::Init()@GLIBCXX_3.4 2025-05-07T20:03:43.4665383Z U std::ios_base::Init::~Init()@GLIBCXX_3.4 2025-05-07T20:03:43.4665527Z U std::ios_base::~ios_base()@GLIBCXX_3.4 2025-05-07T20:03:43.4665636Z U std::locale::~locale()@GLIBCXX_3.4 2025-05-07T20:03:43.4665817Z U std::ostream& std::ostream::_M_insert(long)@GLIBCXX_3.4.9 2025-05-07T20:03:43.4666074Z U std::ostream& std::ostream::_M_insert(unsigned long)@GLIBCXX_3.4.9 2025-05-07T20:03:43.4666200Z U std::ostream::operator<<(int)@GLIBCXX_3.4 2025-05-07T20:03:43.4666492Z U std::pair fbgemm::radix_sort_parallel(int*, int*, int*, int*, long, long, bool) 2025-05-07T20:03:43.4667002Z U std::pair*> fbgemm::radix_sort_parallel >(int*, std::pair*, int*, std::pair*, long, long, bool) 2025-05-07T20:03:43.4667477Z U std::pair*> fbgemm::radix_sort_parallel >(int*, std::pair*, int*, std::pair*, long, long, bool) 2025-05-07T20:03:43.4667584Z U std::terminate()@GLIBCXX_3.4 
2025-05-07T20:03:43.4667877Z U strlen@GLIBC_2.2.5 2025-05-07T20:03:43.4668064Z U torch::CppFunction::~CppFunction() 2025-05-07T20:03:43.4668670Z U torch::Library::Library(torch::Library::Kind, std::__cxx11::basic_string, std::allocator >, std::optional, char const*, unsigned int) 2025-05-07T20:03:43.4669170Z U torch::Library::_def(c10::FunctionSchema&&, c10::OperatorName*, std::vector > const&, torch::_RegisterOrVerify) & 2025-05-07T20:03:43.4669616Z U torch::Library::_impl(char const*, torch::CppFunction&&, torch::_RegisterOrVerify) & 2025-05-07T20:03:43.4670010Z U torch::jit::parseSchema(std::__cxx11::basic_string, std::allocator > const&, bool) 2025-05-07T20:03:43.4670118Z U typeinfo for c10::Error 2025-05-07T20:03:43.4670256Z U typeinfo for std::exception@GLIBCXX_3.4 2025-05-07T20:03:43.4670471Z U unsigned char* at::TensorBase::data_ptr() const 2025-05-07T20:03:43.4670638Z U vtable for __cxxabiv1::__class_type_info@CXXABI_1.3 2025-05-07T20:03:43.4670809Z U vtable for __cxxabiv1::__function_type_info@CXXABI_1.3 2025-05-07T20:03:43.4670979Z U vtable for __cxxabiv1::__si_class_type_info@CXXABI_1.3 2025-05-07T20:03:43.4671121Z U vtable for c10::Error 2025-05-07T20:03:43.4671491Z U vtable for std::__cxx11::basic_stringbuf, std::allocator >@GLIBCXX_3.4.21 2025-05-07T20:03:43.4671733Z U vtable for std::basic_streambuf >@GLIBCXX_3.4 2025-05-07T20:03:43.4671868Z w _ITM_deregisterTMCloneTable 2025-05-07T20:03:43.4672008Z w _ITM_registerTMCloneTable 2025-05-07T20:03:43.4672113Z w __cxa_finalize@GLIBC_2.2.5 2025-05-07T20:03:43.4672216Z w __gmon_start__ 2025-05-07T20:03:43.4672315Z w __pthread_key_create 2025-05-07T20:03:43.4672473Z [CHECK] Listing out external shared libraries linked: 2025-05-07T20:03:43.4672810Z + ldd ./_skbuild/linux-x86_64-3.13/cmake-build/fbgemm_gpu_tbe_common.so 2025-05-07T20:03:43.4672820Z 2025-05-07T20:03:43.4672995Z linux-vdso.so.1 (0x00007fff379b7000) 2025-05-07T20:03:43.4673091Z libc10.so => not found 2025-05-07T20:03:43.4673207Z libc10_cuda.so 
=> not found 2025-05-07T20:03:43.4673589Z fbgemm.so => /__w/FBGEMM/FBGEMM/fbgemm_gpu/./_skbuild/linux-x86_64-3.13/cmake-build/fbgemm.so (0x00007f2c89400000) 2025-05-07T20:03:43.4674047Z fbgemm_gpu_config.so => /__w/FBGEMM/FBGEMM/fbgemm_gpu/./_skbuild/linux-x86_64-3.13/cmake-build/fbgemm_gpu_config.so (0x00007f2c89aa8000) 2025-05-07T20:03:43.4674159Z libtorch.so => not found 2025-05-07T20:03:43.4674261Z libtorch_cpu.so => not found 2025-05-07T20:03:43.4674365Z libtorch_cuda.so => not found 2025-05-07T20:03:43.4674496Z libcudart.so.11.0 => not found 2025-05-07T20:03:43.4674672Z libstdc++.so.6 => /lib64/libstdc++.so.6 (0x00007f2c8919c000) 2025-05-07T20:03:43.4674824Z libgomp.so.1 => /lib64/libgomp.so.1 (0x00007f2c89a50000) 2025-05-07T20:03:43.4674974Z libgcc_s.so.1 => /lib64/libgcc_s.so.1 (0x00007f2c89a22000) 2025-05-07T20:03:43.4675127Z libc.so.6 => /lib64/libc.so.6 (0x00007f2c88f94000) 2025-05-07T20:03:43.4675223Z libc10.so => not found 2025-05-07T20:03:43.4675597Z asmjit.so => /__w/FBGEMM/FBGEMM/fbgemm_gpu/./_skbuild/linux-x86_64-3.13/cmake-build/asmjit.so (0x00007f2c899a8000) 2025-05-07T20:03:43.4675720Z libtorch.so => not found 2025-05-07T20:03:43.4675821Z libtorch_cpu.so => not found 2025-05-07T20:03:43.4675924Z libtorch_cuda.so => not found 2025-05-07T20:03:43.4676052Z libm.so.6 => /lib64/libm.so.6 (0x00007f2c88eb9000) 2025-05-07T20:03:43.4676178Z /lib64/ld-linux-x86-64.so.2 (0x00007f2c89cad000) 2025-05-07T20:03:43.4676266Z libtorch.so => not found 2025-05-07T20:03:43.4676355Z libc10.so => not found 2025-05-07T20:03:43.4676454Z libtorch_cpu.so => not found 2025-05-07T20:03:43.4676545Z libtorch_cuda.so => not found 2025-05-07T20:03:43.4676807Z libpthread.so.0 => /lib64/libpthread.so.0 (0x00007f2c8999f000) 2025-05-07T20:03:43.4676918Z libtorch_cpu.so => not found 2025-05-07T20:03:43.4677016Z libtorch_cuda.so => not found 2025-05-07T20:03:43.4677112Z libtorch.so => not found 2025-05-07T20:03:43.4677248Z librt.so.1 => /lib64/librt.so.1 (0x00007f2c8999a000) 
2025-05-07T20:03:43.4677253Z 2025-05-07T20:03:43.4677367Z [CHECK] Displaying ELF information: 2025-05-07T20:03:43.4677610Z + readelf -d ./_skbuild/linux-x86_64-3.13/cmake-build/fbgemm_gpu_tbe_common.so 2025-05-07T20:03:43.4677615Z 2025-05-07T20:03:43.4677620Z 2025-05-07T20:03:43.4677787Z Dynamic section at offset 0x189ef8 contains 39 entries: 2025-05-07T20:03:43.4677926Z Tag Type Name/Value 2025-05-07T20:03:43.4678122Z 0x0000000000000001 (NEEDED) Shared library: [libc10.so] 2025-05-07T20:03:43.4678324Z 0x0000000000000001 (NEEDED) Shared library: [libc10_cuda.so] 2025-05-07T20:03:43.4678535Z 0x0000000000000001 (NEEDED) Shared library: [fbgemm.so] 2025-05-07T20:03:43.4678749Z 0x0000000000000001 (NEEDED) Shared library: [fbgemm_gpu_config.so] 2025-05-07T20:03:43.4678949Z 0x0000000000000001 (NEEDED) Shared library: [libtorch.so] 2025-05-07T20:03:43.4679165Z 0x0000000000000001 (NEEDED) Shared library: [libtorch_cpu.so] 2025-05-07T20:03:43.4679404Z 0x0000000000000001 (NEEDED) Shared library: [libtorch_cuda.so] 2025-05-07T20:03:43.4679616Z 0x0000000000000001 (NEEDED) Shared library: [libcudart.so.11.0] 2025-05-07T20:03:43.4679837Z 0x0000000000000001 (NEEDED) Shared library: [libstdc++.so.6] 2025-05-07T20:03:43.4680038Z 0x0000000000000001 (NEEDED) Shared library: [libgomp.so.1] 2025-05-07T20:03:43.4680267Z 0x0000000000000001 (NEEDED) Shared library: [libgcc_s.so.1] 2025-05-07T20:03:43.4680466Z 0x0000000000000001 (NEEDED) Shared library: [libc.so.6] 2025-05-07T20:03:43.4680712Z 0x000000000000000e (SONAME) Library soname: [fbgemm_gpu_tbe_common.so] 2025-05-07T20:03:43.4680894Z 0x000000000000000f (RPATH) Library rpath: [$ORIGIN] 2025-05-07T20:03:43.4681012Z 0x000000000000000c (INIT) 0x16000 2025-05-07T20:03:43.4681136Z 0x000000000000000d (FINI) 0x60bac 2025-05-07T20:03:43.4681252Z 0x0000000000000019 (INIT_ARRAY) 0x189258 2025-05-07T20:03:43.4681373Z 0x000000000000001b (INIT_ARRAYSZ) 72 (bytes) 2025-05-07T20:03:43.4681501Z 0x000000000000001a (FINI_ARRAY) 0x1892a0 
2025-05-07T20:03:43.4681621Z 0x000000000000001c (FINI_ARRAYSZ) 8 (bytes) 2025-05-07T20:03:43.4681734Z 0x000000006ffffef5 (GNU_HASH) 0x200 2025-05-07T20:03:43.4681845Z 0x0000000000000005 (STRTAB) 0x4598 2025-05-07T20:03:43.4681969Z 0x0000000000000006 (SYMTAB) 0x12e0 2025-05-07T20:03:43.4682099Z 0x000000000000000a (STRSZ) 47880 (bytes) 2025-05-07T20:03:43.4682217Z 0x000000000000000b (SYMENT) 24 (bytes) 2025-05-07T20:03:43.4682348Z 0x0000000000000003 (PLTGOT) 0x18a1a8 2025-05-07T20:03:43.4682481Z 0x0000000000000002 (PLTRELSZ) 9240 (bytes) 2025-05-07T20:03:43.4682585Z 0x0000000000000014 (PLTREL) RELA 2025-05-07T20:03:43.4682708Z 0x0000000000000017 (JMPREL) 0x131f0 2025-05-07T20:03:43.4682819Z 0x0000000000000007 (RELA) 0x105e0 2025-05-07T20:03:43.4682950Z 0x0000000000000008 (RELASZ) 11280 (bytes) 2025-05-07T20:03:43.4683070Z 0x0000000000000009 (RELAENT) 24 (bytes) 2025-05-07T20:03:43.4683182Z 0x0000000000000018 (BIND_NOW) 2025-05-07T20:03:43.4683304Z 0x000000006ffffffb (FLAGS_1) Flags: NOW 2025-05-07T20:03:43.4683424Z 0x000000006ffffffe (VERNEED) 0x104e0 2025-05-07T20:03:43.4683544Z 0x000000006fffffff (VERNEEDNUM) 5 2025-05-07T20:03:43.4683658Z 0x000000006ffffff0 (VERSYM) 0x100a0 2025-05-07T20:03:43.4683760Z 0x000000006ffffff9 (RELACOUNT) 245 2025-05-07T20:03:43.4683931Z 0x0000000000000000 (NULL) 0x0 2025-05-07T20:03:43.4683960Z 2025-05-07T20:03:43.4684080Z ################################################################################ 2025-05-07T20:03:43.4684084Z 2025-05-07T20:03:43.4684088Z 2025-05-07T20:03:43.4684201Z ################################################################################ 2025-05-07T20:03:43.4684484Z [CHECK] BUILT LIBRARY: ./_skbuild/linux-x86_64-3.13/cmake-build/fbgemm_gpu_tbe_utils.so 2025-05-07T20:03:43.4684590Z [CHECK] Listing out library size: 2025-05-07T20:03:43.4684849Z + du -h --block-size=1M ./_skbuild/linux-x86_64-3.13/cmake-build/fbgemm_gpu_tbe_utils.so 2025-05-07T20:03:43.4684853Z 2025-05-07T20:03:43.4692451Z 8 
./_skbuild/linux-x86_64-3.13/cmake-build/fbgemm_gpu_tbe_utils.so 2025-05-07T20:03:43.4695867Z 2025-05-07T20:03:43.4696267Z [CHECK] Listing out the GLIBC versions referenced by: ./_skbuild/linux-x86_64-3.13/cmake-build/fbgemm_gpu_tbe_utils.so 2025-05-07T20:03:43.4696769Z + objdump -TC ./_skbuild/linux-x86_64-3.13/cmake-build/fbgemm_gpu_tbe_utils.so | grep GLIBC_ | sed 's/.*GLIBC_\([.0-9]*\).*/GLIBC_\1/g' | sort -Vu | cat 2025-05-07T20:03:43.4696774Z 2025-05-07T20:03:43.4759515Z GLIBC_2.2.5 2025-05-07T20:03:43.4759605Z GLIBC_2.14 2025-05-07T20:03:43.4761523Z 2025-05-07T20:03:43.4761649Z 2025-05-07T20:03:43.4762621Z [CHECK] Listing out the GLIBCXX versions referenced by: ./_skbuild/linux-x86_64-3.13/cmake-build/fbgemm_gpu_tbe_utils.so 2025-05-07T20:03:43.4763176Z + objdump -TC ./_skbuild/linux-x86_64-3.13/cmake-build/fbgemm_gpu_tbe_utils.so | grep GLIBCXX_ | sed 's/.*GLIBCXX_\([.0-9]*\).*/GLIBCXX_\1/g' | sort -Vu | cat 2025-05-07T20:03:43.4763183Z 2025-05-07T20:03:43.4831821Z GLIBCXX_3.4 2025-05-07T20:03:43.4831954Z GLIBCXX_3.4.9 2025-05-07T20:03:43.4832218Z GLIBCXX_3.4.20 2025-05-07T20:03:43.4832684Z GLIBCXX_3.4.21 2025-05-07T20:03:43.4832703Z 2025-05-07T20:03:43.4832709Z 2025-05-07T20:03:43.4858708Z + nm -gDC ./_skbuild/linux-x86_64-3.13/cmake-build/fbgemm_gpu_tbe_utils.so > /tmp/tmp.LqtTcUztSt.symbols.txt 2025-05-07T20:03:43.4858775Z 2025-05-07T20:03:43.4886069Z 2025-05-07T20:03:43.4920128Z [CHECK] Total Number of symbols: 501 2025-05-07T20:03:43.4939399Z [CHECK] Number of fbgemm symbols: 13 2025-05-07T20:03:43.4955797Z + nm -gDCu ./_skbuild/linux-x86_64-3.13/cmake-build/fbgemm_gpu_tbe_utils.so > /tmp/tmp.JIezxtEUTr.usymbols.txt 2025-05-07T20:03:43.4955836Z 2025-05-07T20:03:43.4974995Z 2025-05-07T20:03:43.4998214Z [CHECK] Listing out undefined symbols (154 total): 2025-05-07T20:03:43.5015962Z U VTT for std::__cxx11::basic_ostringstream, std::allocator >@GLIBCXX_3.4.21 2025-05-07T20:03:43.5016465Z U _Unwind_Resume@GCC_3.0 2025-05-07T20:03:43.5016717Z U 
__cudaPopCallConfiguration@libcudart.so.11.0 2025-05-07T20:03:43.5016973Z U __cudaPushCallConfiguration@libcudart.so.11.0 2025-05-07T20:03:43.5017188Z U __cudaRegisterFatBinary@libcudart.so.11.0 2025-05-07T20:03:43.5017456Z U __cudaRegisterFatBinaryEnd@libcudart.so.11.0 2025-05-07T20:03:43.5017599Z U __cudaRegisterFunction@libcudart.so.11.0 2025-05-07T20:03:43.5017743Z U __cudaRegisterVar@libcudart.so.11.0 2025-05-07T20:03:43.5017882Z U __cudaUnregisterFatBinary@libcudart.so.11.0 2025-05-07T20:03:43.5018005Z U __cxa_allocate_exception@CXXABI_1.3 2025-05-07T20:03:43.5018127Z U __cxa_atexit@GLIBC_2.2.5 2025-05-07T20:03:43.5018238Z U __cxa_begin_catch@CXXABI_1.3 2025-05-07T20:03:43.5018340Z U __cxa_end_catch@CXXABI_1.3 2025-05-07T20:03:43.5018464Z U __cxa_guard_abort@CXXABI_1.3 2025-05-07T20:03:43.5018571Z U __cxa_guard_acquire@CXXABI_1.3 2025-05-07T20:03:43.5018680Z U __cxa_guard_release@CXXABI_1.3 2025-05-07T20:03:43.5018788Z U __cxa_rethrow@CXXABI_1.3 2025-05-07T20:03:43.5018882Z U __cxa_throw@CXXABI_1.3 2025-05-07T20:03:43.5019216Z U __gxx_personality_v0@CXXABI_1.3 2025-05-07T20:03:43.5019492Z U at::_ops::add_Tensor::call(at::Tensor const&, at::Tensor const&, c10::Scalar const&) 2025-05-07T20:03:43.5019684Z U at::_ops::copy_::call(at::Tensor&, at::Tensor const&, bool) 2025-05-07T20:03:43.5020294Z U at::_ops::empty_like::call(at::Tensor const&, std::optional, std::optional, std::optional, std::optional, std::optional) 2025-05-07T20:03:43.5020954Z U at::_ops::empty_memory_format::call(c10::ArrayRef, std::optional, std::optional, std::optional, std::optional, std::optional) 2025-05-07T20:03:43.5021112Z U at::_ops::fill__Scalar::call(at::Tensor&, c10::Scalar const&) 2025-05-07T20:03:43.5021291Z U at::_ops::mul_Scalar::call(at::Tensor const&, c10::Scalar const&) 2025-05-07T20:03:43.5021473Z U at::_ops::select_int::call(at::Tensor const&, long, c10::SymInt) 2025-05-07T20:03:43.5021694Z U at::_ops::sub__Tensor::call(at::Tensor&, at::Tensor const&, c10::Scalar const&) 
2025-05-07T20:03:43.5021802Z U at::_ops::zero_::call(at::Tensor&) 2025-05-07T20:03:43.5022348Z U at::_ops::zeros::call(c10::ArrayRef, std::optional, std::optional, std::optional, std::optional) 2025-05-07T20:03:43.5022942Z U at::_ops::zeros_like::call(at::Tensor const&, std::optional, std::optional, std::optional, std::optional, std::optional) 2025-05-07T20:03:43.5023084Z U c10::BoolType::get() 2025-05-07T20:03:43.5023265Z U c10::DeviceTypeName[abi:cxx11](c10::DeviceType, bool) 2025-05-07T20:03:43.5023407Z U c10::IValue::reportToTensorTypeError() const 2025-05-07T20:03:43.5023508Z U c10::IntType::get() 2025-05-07T20:03:43.5023691Z U c10::MessageLogger::MessageLogger(char const*, int, int) 2025-05-07T20:03:43.5023813Z U c10::MessageLogger::~MessageLogger() 2025-05-07T20:03:43.5024039Z U c10::OptionalType::get(c10::Type::SingletonOrSharedTypePtr) 2025-05-07T20:03:43.5024210Z U c10::StorageImpl::throw_data_ptr_access_error() const 2025-05-07T20:03:43.5024357Z U c10::SymBool::guard_bool(char const*, long) const 2025-05-07T20:03:43.5024783Z U c10::SymInt::SymInt(c10::intrusive_ptr >) 2025-05-07T20:03:43.5024953Z U c10::SymInt::guard_int(char const*, long) const 2025-05-07T20:03:43.5025077Z U c10::SymInt::promote_to_negative() 2025-05-07T20:03:43.5025191Z U c10::SymInt::toSymNode() const 2025-05-07T20:03:43.5025307Z U c10::SymIntType::get() 2025-05-07T20:03:43.5025468Z U c10::SymbolicShapeMeta::init_is_contiguous() const 2025-05-07T20:03:43.5025625Z U c10::TensorImpl::throw_data_ptr_access_error() const 2025-05-07T20:03:43.5025746Z U c10::TensorType::get() 2025-05-07T20:03:43.5025874Z U c10::UndefinedTensorImpl::_singleton 2025-05-07T20:03:43.5026610Z U c10::Warning::Warning(std::variant, c10::SourceLocation const&, std::__cxx11::basic_string, std::allocator >, bool) 2025-05-07T20:03:43.5026757Z U c10::cuda::CUDACachingAllocator::allocator 2025-05-07T20:03:43.5026877Z U c10::cuda::CUDAStream::stream() const 2025-05-07T20:03:43.5027001Z U c10::cuda::ExchangeDevice(signed 
char) 2025-05-07T20:03:43.5027136Z U c10::cuda::GetDevice(signed char*) 2025-05-07T20:03:43.5027312Z U c10::cuda::MaybeSetDevice(signed char) 2025-05-07T20:03:43.5027434Z U c10::cuda::SetDevice(signed char) 2025-05-07T20:03:43.5027698Z U c10::cuda::c10_cuda_check_implementation(int, char const*, char const*, int, bool) 2025-05-07T20:03:43.5027822Z U c10::cuda::current_device() 2025-05-07T20:03:43.5027924Z U c10::cuda::device_count() 2025-05-07T20:03:43.5028067Z U c10::cuda::getCurrentCUDAStream(signed char) 2025-05-07T20:03:43.5028229Z U c10::cuda::getDefaultCUDAStream(signed char) 2025-05-07T20:03:43.5028370Z U c10::cuda::getStreamFromPool(bool, signed char) 2025-05-07T20:03:43.5028513Z U c10::cuda::getStreamFromPool(int, signed char) 2025-05-07T20:03:43.5028689Z U c10::cuda::setCurrentCUDAStream(c10::cuda::CUDAStream) 2025-05-07T20:03:43.5028798Z U c10::cuda::warn_or_error_on_sync() 2025-05-07T20:03:43.5029329Z U c10::detail::infer_schema::make_function_schema(c10::ArrayRef, c10::ArrayRef) 2025-05-07T20:03:43.5029733Z U c10::detail::torchCheckFail(char const*, char const*, unsigned int, char const*) 2025-05-07T20:03:43.5030231Z U c10::detail::torchCheckFail(char const*, char const*, unsigned int, std::__cxx11::basic_string, std::allocator > const&) 2025-05-07T20:03:43.5030570Z U c10::detail::torchInternalAssertFail(char const*, char const*, unsigned int, char const*, char const*) 2025-05-07T20:03:43.5031149Z U c10::detail::torchInternalAssertFail(char const*, char const*, unsigned int, char const*, std::__cxx11::basic_string, std::allocator > const&) 2025-05-07T20:03:43.5031276Z U c10::impl::GPUTrace::gpuTraceState 2025-05-07T20:03:43.5031385Z U c10::impl::GPUTrace::haveState 2025-05-07T20:03:43.5031523Z U c10::impl::cow::is_cow_data_ptr(c10::DataPtr const&) 2025-05-07T20:03:43.5031680Z U c10::impl::cow::materialize_cow_storage(c10::StorageImpl&) 2025-05-07T20:03:43.5031804Z U c10::impl::device_guard_impl_registry 2025-05-07T20:03:43.5031941Z U 
c10::operator<<(std::ostream&, c10::Device const&) 2025-05-07T20:03:43.5032065Z U c10::operator<<(std::ostream&, c10::DeviceType) 2025-05-07T20:03:43.5032195Z U c10::throwNullDataPtrError() 2025-05-07T20:03:43.5032293Z U c10::warn(c10::Warning const&) 2025-05-07T20:03:43.5032402Z U c10::warnDeprecatedDataPtr() 2025-05-07T20:03:43.5032618Z U caffe2::TypeMeta::error_unsupported_typemeta(caffe2::TypeMeta) 2025-05-07T20:03:43.5032821Z U caffe2::TypeMeta::typeMetaDatas() 2025-05-07T20:03:43.5032969Z U cudaDeviceGetAttribute@libcudart.so.11.0 2025-05-07T20:03:43.5033118Z U cudaDeviceSynchronize@libcudart.so.11.0 2025-05-07T20:03:43.5033429Z U cudaEventCreateWithFlags@libcudart.so.11.0 2025-05-07T20:03:43.5033545Z U cudaEventDestroy@libcudart.so.11.0 2025-05-07T20:03:43.5033675Z U cudaEventElapsedTime@libcudart.so.11.0 2025-05-07T20:03:43.5033817Z U cudaEventQuery@libcudart.so.11.0 2025-05-07T20:03:43.5033932Z U cudaEventRecord@libcudart.so.11.0 2025-05-07T20:03:43.5034154Z U cudaEventSynchronize@libcudart.so.11.0 2025-05-07T20:03:43.5034309Z U cudaFuncGetAttributes@libcudart.so.11.0 2025-05-07T20:03:43.5034426Z U cudaGetDevice@libcudart.so.11.0 2025-05-07T20:03:43.5034547Z U cudaGetDeviceCount@libcudart.so.11.0 2025-05-07T20:03:43.5034706Z U cudaGetDeviceProperties@libcudart.so.11.0 2025-05-07T20:03:43.5034939Z U cudaGetErrorString@libcudart.so.11.0 2025-05-07T20:03:43.5035056Z U cudaGetLastError@libcudart.so.11.0 2025-05-07T20:03:43.5035179Z U cudaLaunchKernel@libcudart.so.11.0 2025-05-07T20:03:43.5035323Z U cudaMemsetAsync@libcudart.so.11.0 2025-05-07T20:03:43.5035624Z U cudaOccupancyMaxActiveBlocksPerMultiprocessorWithFlags@libcudart.so.11.0 2025-05-07T20:03:43.5035754Z U cudaPeekAtLastError@libcudart.so.11.0 2025-05-07T20:03:43.5035886Z U cudaSetDevice@libcudart.so.11.0 2025-05-07T20:03:43.5036005Z U cudaStreamQuery@libcudart.so.11.0 2025-05-07T20:03:43.5036135Z U cudaStreamSynchronize@libcudart.so.11.0 2025-05-07T20:03:43.5036280Z U 
cudaStreamWaitEvent@libcudart.so.11.0 2025-05-07T20:03:43.5036414Z U int* at::TensorBase::data_ptr() const 2025-05-07T20:03:43.5036559Z U int* at::TensorBase::mutable_data_ptr() const 2025-05-07T20:03:43.5036686Z U log2@GLIBC_2.2.5 2025-05-07T20:03:43.5036876Z U long c10::detail::maybe_wrap_dim_slow(long, long, bool) 2025-05-07T20:03:43.5037015Z U long* at::TensorBase::data_ptr() const 2025-05-07T20:03:43.5037169Z U long* at::TensorBase::mutable_data_ptr() const 2025-05-07T20:03:43.5037303Z U memcpy@GLIBC_2.14 2025-05-07T20:03:43.5037405Z U memset@GLIBC_2.2.5 2025-05-07T20:03:43.5037518Z U operator delete(void*)@GLIBCXX_3.4 2025-05-07T20:03:43.5037668Z U operator new(unsigned long)@GLIBCXX_3.4 2025-05-07T20:03:43.5037772Z U printf@GLIBC_2.2.5 2025-05-07T20:03:43.5037885Z U puts@GLIBC_2.2.5 2025-05-07T20:03:43.5038260Z U std::__cxx11::basic_ostringstream, std::allocator >::basic_ostringstream() 2025-05-07T20:03:43.5038666Z U std::__cxx11::basic_ostringstream, std::allocator >::~basic_ostringstream()@GLIBCXX_3.4.21 2025-05-07T20:03:43.5039080Z U std::__cxx11::basic_string, std::allocator >::_M_append(char const*, unsigned long)@GLIBCXX_3.4.21 2025-05-07T20:03:43.5039653Z U std::__cxx11::basic_string, std::allocator >::_M_assign(std::__cxx11::basic_string, std::allocator > const&) 2025-05-07T20:03:43.5040054Z U std::__cxx11::basic_string, std::allocator >::_M_construct(unsigned long, char)@GLIBCXX_3.4.21 2025-05-07T20:03:43.5040481Z U std::__cxx11::basic_string, std::allocator >::_M_create(unsigned long&, unsigned long)@GLIBCXX_3.4.21 2025-05-07T20:03:43.5040941Z U std::__cxx11::basic_string, std::allocator >::_M_mutate(unsigned long, unsigned long, char const*, unsigned long) 2025-05-07T20:03:43.5041470Z U std::__cxx11::basic_string, std::allocator >::_M_replace(unsigned long, unsigned long, char const*, unsigned long)@GLIBCXX_3.4.21 2025-05-07T20:03:43.5041635Z U std::__throw_length_error(char const*)@GLIBCXX_3.4 2025-05-07T20:03:43.5041781Z U 
std::__throw_logic_error(char const*)@GLIBCXX_3.4 2025-05-07T20:03:43.5041966Z U std::__throw_out_of_range_fmt(char const*, ...)@GLIBCXX_3.4.20 2025-05-07T20:03:43.5042226Z U std::basic_ios >::clear(std::_Ios_Iostate)@GLIBCXX_3.4 2025-05-07T20:03:43.5042815Z U std::basic_ostream >& std::__ostream_insert >(std::basic_ostream >&, char const*, long)@GLIBCXX_3.4.9 2025-05-07T20:03:43.5042952Z U std::exception::~exception()@GLIBCXX_3.4 2025-05-07T20:03:43.5043145Z U std::ios_base::Init::Init()@GLIBCXX_3.4 2025-05-07T20:03:43.5043268Z U std::ios_base::Init::~Init()@GLIBCXX_3.4 2025-05-07T20:03:43.5043387Z U std::ios_base::~ios_base()@GLIBCXX_3.4 2025-05-07T20:03:43.5043517Z U std::locale::~locale()@GLIBCXX_3.4 2025-05-07T20:03:43.5043702Z U std::ostream& std::ostream::_M_insert(long)@GLIBCXX_3.4.9 2025-05-07T20:03:43.5043946Z U std::ostream& std::ostream::_M_insert(unsigned long)@GLIBCXX_3.4.9 2025-05-07T20:03:43.5044095Z U std::ostream::operator<<(int)@GLIBCXX_3.4 2025-05-07T20:03:43.5044207Z U std::terminate()@GLIBCXX_3.4 2025-05-07T20:03:43.5044302Z U strlen@GLIBC_2.2.5 2025-05-07T20:03:43.5044437Z U torch::CppFunction::~CppFunction() 2025-05-07T20:03:43.5045039Z U torch::Library::Library(torch::Library::Kind, std::__cxx11::basic_string, std::allocator >, std::optional, char const*, unsigned int) 2025-05-07T20:03:43.5045619Z U torch::Library::_def(c10::FunctionSchema&&, c10::OperatorName*, std::vector > const&, torch::_RegisterOrVerify) & 2025-05-07T20:03:43.5045894Z U torch::Library::_impl(char const*, torch::CppFunction&&, torch::_RegisterOrVerify) & 2025-05-07T20:03:43.5046239Z U torch::jit::parseSchema(std::__cxx11::basic_string, std::allocator > const&, bool) 2025-05-07T20:03:43.5046356Z U typeinfo for std::exception@GLIBCXX_3.4 2025-05-07T20:03:43.5046542Z U vtable for __cxxabiv1::__class_type_info@CXXABI_1.3 2025-05-07T20:03:43.5046705Z U vtable for __cxxabiv1::__function_type_info@CXXABI_1.3 2025-05-07T20:03:43.5046853Z U vtable for 
__cxxabiv1::__si_class_type_info@CXXABI_1.3 2025-05-07T20:03:43.5047192Z U vtable for std::__cxx11::basic_stringbuf, std::allocator >@GLIBCXX_3.4.21 2025-05-07T20:03:43.5047415Z U vtable for std::basic_streambuf >@GLIBCXX_3.4 2025-05-07T20:03:43.5047516Z w _ITM_deregisterTMCloneTable 2025-05-07T20:03:43.5047636Z w _ITM_registerTMCloneTable 2025-05-07T20:03:43.5047740Z w __cxa_finalize@GLIBC_2.2.5 2025-05-07T20:03:43.5047832Z w __gmon_start__ 2025-05-07T20:03:43.5047936Z w __pthread_key_create 2025-05-07T20:03:43.5048082Z [CHECK] Listing out external shared libraries linked: 2025-05-07T20:03:43.5048274Z + ldd ./_skbuild/linux-x86_64-3.13/cmake-build/fbgemm_gpu_tbe_utils.so 2025-05-07T20:03:43.5048281Z 2025-05-07T20:03:43.5063079Z linux-vdso.so.1 (0x00007ffe6ab39000) 2025-05-07T20:03:43.5063419Z libtorch.so => not found 2025-05-07T20:03:43.5063736Z libc10.so => not found 2025-05-07T20:03:43.5064011Z libc10_cuda.so => not found 2025-05-07T20:03:43.5064310Z libtorch_cpu.so => not found 2025-05-07T20:03:43.5064557Z libtorch_cuda.so => not found 2025-05-07T20:03:43.5064656Z libcudart.so.11.0 => not found 2025-05-07T20:03:43.5064843Z libstdc++.so.6 => /lib64/libstdc++.so.6 (0x00007f5c4579c000) 2025-05-07T20:03:43.5064975Z libm.so.6 => /lib64/libm.so.6 (0x00007f5c456c1000) 2025-05-07T20:03:43.5065128Z libgomp.so.1 => /lib64/libgomp.so.1 (0x00007f5c462a8000) 2025-05-07T20:03:43.5065273Z libgcc_s.so.1 => /lib64/libgcc_s.so.1 (0x00007f5c45693000) 2025-05-07T20:03:43.5065410Z libc.so.6 => /lib64/libc.so.6 (0x00007f5c4548b000) 2025-05-07T20:03:43.5065548Z /lib64/ld-linux-x86-64.so.2 (0x00007f5c46304000) 2025-05-07T20:03:43.5065555Z 2025-05-07T20:03:43.5065667Z [CHECK] Displaying ELF information: 2025-05-07T20:03:43.5065934Z + readelf -d ./_skbuild/linux-x86_64-3.13/cmake-build/fbgemm_gpu_tbe_utils.so 2025-05-07T20:03:43.5065940Z 2025-05-07T20:03:43.5097095Z 2025-05-07T20:03:43.5097837Z Dynamic section at offset 0x7de050 contains 37 entries: 2025-05-07T20:03:43.5098223Z Tag Type 
Name/Value 2025-05-07T20:03:43.5098883Z 0x0000000000000001 (NEEDED) Shared library: [libtorch.so] 2025-05-07T20:03:43.5099434Z 0x0000000000000001 (NEEDED) Shared library: [libc10.so] 2025-05-07T20:03:43.5100057Z 0x0000000000000001 (NEEDED) Shared library: [libc10_cuda.so] 2025-05-07T20:03:43.5100656Z 0x0000000000000001 (NEEDED) Shared library: [libtorch_cpu.so] 2025-05-07T20:03:43.5101252Z 0x0000000000000001 (NEEDED) Shared library: [libtorch_cuda.so] 2025-05-07T20:03:43.5101879Z 0x0000000000000001 (NEEDED) Shared library: [libcudart.so.11.0] 2025-05-07T20:03:43.5102907Z 0x0000000000000001 (NEEDED) Shared library: [libstdc++.so.6] 2025-05-07T20:03:43.5103461Z 0x0000000000000001 (NEEDED) Shared library: [libm.so.6] 2025-05-07T20:03:43.5104026Z 0x0000000000000001 (NEEDED) Shared library: [libgomp.so.1] 2025-05-07T20:03:43.5104630Z 0x0000000000000001 (NEEDED) Shared library: [libgcc_s.so.1] 2025-05-07T20:03:43.5105176Z 0x0000000000000001 (NEEDED) Shared library: [libc.so.6] 2025-05-07T20:03:43.5105837Z 0x000000000000000e (SONAME) Library soname: [fbgemm_gpu_tbe_utils.so] 2025-05-07T20:03:43.5106467Z 0x000000000000000c (INIT) 0x14000 2025-05-07T20:03:43.5106589Z 0x000000000000000d (FINI) 0x5fb3c 2025-05-07T20:03:43.5106712Z 0x0000000000000019 (INIT_ARRAY) 0x7dd548 2025-05-07T20:03:43.5106855Z 0x000000000000001b (INIT_ARRAYSZ) 96 (bytes) 2025-05-07T20:03:43.5106967Z 0x000000000000001a (FINI_ARRAY) 0x7dd5a8 2025-05-07T20:03:43.5107128Z 0x000000000000001c (FINI_ARRAYSZ) 8 (bytes) 2025-05-07T20:03:43.5107242Z 0x000000006ffffef5 (GNU_HASH) 0x200 2025-05-07T20:03:43.5107360Z 0x0000000000000005 (STRTAB) 0x4240 2025-05-07T20:03:43.5107477Z 0x0000000000000006 (SYMTAB) 0x1330 2025-05-07T20:03:43.5107613Z 0x000000000000000a (STRSZ) 43494 (bytes) 2025-05-07T20:03:43.5107740Z 0x000000000000000b (SYMENT) 24 (bytes) 2025-05-07T20:03:43.5107854Z 0x0000000000000003 (PLTGOT) 0x7de2f0 2025-05-07T20:03:43.5107987Z 0x0000000000000002 (PLTRELSZ) 6432 (bytes) 2025-05-07T20:03:43.5108120Z 
0x0000000000000014 (PLTREL) RELA 2025-05-07T20:03:43.5108235Z 0x0000000000000017 (JMPREL) 0x11f88 2025-05-07T20:03:43.5108345Z 0x0000000000000007 (RELA) 0xf108 2025-05-07T20:03:43.5108485Z 0x0000000000000008 (RELASZ) 11904 (bytes) 2025-05-07T20:03:43.5108626Z 0x0000000000000009 (RELAENT) 24 (bytes) 2025-05-07T20:03:43.5108729Z 0x0000000000000018 (BIND_NOW) 2025-05-07T20:03:43.5108953Z 0x000000006ffffffb (FLAGS_1) Flags: NOW 2025-05-07T20:03:43.5109091Z 0x000000006ffffffe (VERNEED) 0xf018 2025-05-07T20:03:43.5109203Z 0x000000006fffffff (VERNEEDNUM) 5 2025-05-07T20:03:43.5109356Z 0x000000006ffffff0 (VERSYM) 0xec26 2025-05-07T20:03:43.5109466Z 0x000000006ffffff9 (RELACOUNT) 116 2025-05-07T20:03:43.5109582Z 0x0000000000000000 (NULL) 0x0 2025-05-07T20:03:43.5109587Z 2025-05-07T20:03:43.5109709Z ################################################################################ 2025-05-07T20:03:43.5109714Z 2025-05-07T20:03:43.5109718Z 2025-05-07T20:03:43.5109833Z ################################################################################ 2025-05-07T20:03:43.5110158Z [CHECK] BUILT LIBRARY: ./_skbuild/linux-x86_64-3.13/cmake-build/fbgemm_gpu_sparse_async_cumsum.so 2025-05-07T20:03:43.5110260Z [CHECK] Listing out library size: 2025-05-07T20:03:43.5110563Z + du -h --block-size=1M ./_skbuild/linux-x86_64-3.13/cmake-build/fbgemm_gpu_sparse_async_cumsum.so 2025-05-07T20:03:43.5110568Z 2025-05-07T20:03:43.5110816Z 1 ./_skbuild/linux-x86_64-3.13/cmake-build/fbgemm_gpu_sparse_async_cumsum.so 2025-05-07T20:03:43.5110899Z 2025-05-07T20:03:43.5111326Z [CHECK] Listing out the GLIBC versions referenced by: ./_skbuild/linux-x86_64-3.13/cmake-build/fbgemm_gpu_sparse_async_cumsum.so 2025-05-07T20:03:43.5111885Z + objdump -TC ./_skbuild/linux-x86_64-3.13/cmake-build/fbgemm_gpu_sparse_async_cumsum.so | grep GLIBC_ | sed 's/.*GLIBC_\([.0-9]*\).*/GLIBC_\1/g' | sort -Vu | cat 2025-05-07T20:03:43.5111892Z 2025-05-07T20:03:43.5159494Z GLIBC_2.2.5 2025-05-07T20:03:43.5160309Z GLIBC_2.14 
2025-05-07T20:03:43.5160435Z 2025-05-07T20:03:43.5160500Z 2025-05-07T20:03:43.5161076Z [CHECK] Listing out the GLIBCXX versions referenced by: ./_skbuild/linux-x86_64-3.13/cmake-build/fbgemm_gpu_sparse_async_cumsum.so 2025-05-07T20:03:43.5161682Z + objdump -TC ./_skbuild/linux-x86_64-3.13/cmake-build/fbgemm_gpu_sparse_async_cumsum.so | grep GLIBCXX_ | sed 's/.*GLIBCXX_\([.0-9]*\).*/GLIBCXX_\1/g' | sort -Vu | cat 2025-05-07T20:03:43.5161688Z 2025-05-07T20:03:43.5212657Z GLIBCXX_3.4 2025-05-07T20:03:43.5212777Z GLIBCXX_3.4.9 2025-05-07T20:03:43.5212886Z GLIBCXX_3.4.21 2025-05-07T20:03:43.5214091Z 2025-05-07T20:03:43.5214116Z 2025-05-07T20:03:43.5236477Z + nm -gDC ./_skbuild/linux-x86_64-3.13/cmake-build/fbgemm_gpu_sparse_async_cumsum.so > /tmp/tmp.IfEKW9QNyx.symbols.txt 2025-05-07T20:03:43.5236519Z 2025-05-07T20:03:43.5257053Z 2025-05-07T20:03:43.5279891Z [CHECK] Total Number of symbols: 274 2025-05-07T20:03:43.5295152Z [CHECK] Number of fbgemm symbols: 44 2025-05-07T20:03:43.5309920Z + nm -gDCu ./_skbuild/linux-x86_64-3.13/cmake-build/fbgemm_gpu_sparse_async_cumsum.so > /tmp/tmp.1sTdQl1yLX.usymbols.txt 2025-05-07T20:03:43.5309956Z 2025-05-07T20:03:43.5330425Z 2025-05-07T20:03:43.5355055Z [CHECK] Listing out undefined symbols (130 total): 2025-05-07T20:03:43.5377678Z U VTT for std::__cxx11::basic_ostringstream, std::allocator >@GLIBCXX_3.4.21 2025-05-07T20:03:43.5377817Z U _Unwind_Resume@GCC_3.0 2025-05-07T20:03:43.5378017Z U __cudaPopCallConfiguration@libcudart.so.11.0 2025-05-07T20:03:43.5378187Z U __cudaPushCallConfiguration@libcudart.so.11.0 2025-05-07T20:03:43.5378460Z U __cudaRegisterFatBinary@libcudart.so.11.0 2025-05-07T20:03:43.5378600Z U __cudaRegisterFatBinaryEnd@libcudart.so.11.0 2025-05-07T20:03:43.5378733Z U __cudaRegisterFunction@libcudart.so.11.0 2025-05-07T20:03:43.5378852Z U __cudaRegisterVar@libcudart.so.11.0 2025-05-07T20:03:43.5378992Z U __cudaUnregisterFatBinary@libcudart.so.11.0 2025-05-07T20:03:43.5379090Z U __cxa_atexit@GLIBC_2.2.5 
2025-05-07T20:03:43.5379198Z U __cxa_begin_catch@CXXABI_1.3 2025-05-07T20:03:43.5379325Z U __cxa_guard_abort@CXXABI_1.3 2025-05-07T20:03:43.5379428Z U __cxa_guard_acquire@CXXABI_1.3 2025-05-07T20:03:43.5379531Z U __cxa_guard_release@CXXABI_1.3 2025-05-07T20:03:43.5379650Z U __gxx_personality_v0@CXXABI_1.3 2025-05-07T20:03:43.5379846Z U at::_ops::contiguous::call(at::Tensor const&, c10::MemoryFormat) 2025-05-07T20:03:43.5380455Z U at::_ops::empty_like::call(at::Tensor const&, std::optional, std::optional, std::optional, std::optional, std::optional) 2025-05-07T20:03:43.5381141Z U at::_ops::empty_memory_format::call(c10::ArrayRef, std::optional, std::optional, std::optional, std::optional, std::optional) 2025-05-07T20:03:43.5381321Z U at::_ops::select_int::call(at::Tensor const&, long, c10::SymInt) 2025-05-07T20:03:43.5381439Z U at::_ops::zero_::call(at::Tensor&) 2025-05-07T20:03:43.5381937Z U at::_ops::zeros::call(c10::ArrayRef, std::optional, std::optional, std::optional, std::optional) 2025-05-07T20:03:43.5382138Z U at::get_thread_num() 2025-05-07T20:03:43.5382249Z U at::internal::set_thread_num(int) 2025-05-07T20:03:43.5382848Z U at::native::empty_like(at::Tensor const&, std::optional, std::optional, std::optional, std::optional, std::optional) 2025-05-07T20:03:43.5383128Z U at::native::resize_(at::Tensor const&, c10::ArrayRef, std::optional) 2025-05-07T20:03:43.5383308Z U c10::BFloat16* at::TensorBase::data_ptr() const 2025-05-07T20:03:43.5383485Z U c10::DeviceTypeName[abi:cxx11](c10::DeviceType, bool) 2025-05-07T20:03:43.5383639Z U c10::Half* at::TensorBase::data_ptr() const 2025-05-07T20:03:43.5383780Z U c10::IValue::reportToTensorTypeError() const 2025-05-07T20:03:43.5384035Z U c10::ParallelGuard::ParallelGuard(bool) 2025-05-07T20:03:43.5384158Z U c10::ParallelGuard::~ParallelGuard() 2025-05-07T20:03:43.5384318Z U c10::StorageImpl::throw_data_ptr_access_error() const 2025-05-07T20:03:43.5384585Z U c10::SymBool::guard_bool(char const*, long) const 
2025-05-07T20:03:43.5384697Z U c10::SymInt::promote_to_negative() 2025-05-07T20:03:43.5384881Z U c10::SymbolicShapeMeta::init_is_contiguous() const 2025-05-07T20:03:43.5385045Z U c10::TensorImpl::throw_data_ptr_access_error() const 2025-05-07T20:03:43.5385143Z U c10::TensorType::get() 2025-05-07T20:03:43.5385254Z U c10::UndefinedTensorImpl::_singleton 2025-05-07T20:03:43.5386035Z U c10::Warning::Warning(std::variant, c10::SourceLocation const&, std::__cxx11::basic_string, std::allocator >, bool) 2025-05-07T20:03:43.5386169Z U c10::cuda::CUDACachingAllocator::allocator 2025-05-07T20:03:43.5386282Z U c10::cuda::CUDAStream::stream() const 2025-05-07T20:03:43.5386413Z U c10::cuda::ExchangeDevice(signed char) 2025-05-07T20:03:43.5386518Z U c10::cuda::GetDevice(signed char*) 2025-05-07T20:03:43.5386628Z U c10::cuda::MaybeSetDevice(signed char) 2025-05-07T20:03:43.5386744Z U c10::cuda::SetDevice(signed char) 2025-05-07T20:03:43.5386983Z U c10::cuda::c10_cuda_check_implementation(int, char const*, char const*, int, bool) 2025-05-07T20:03:43.5387083Z U c10::cuda::device_count() 2025-05-07T20:03:43.5387228Z U c10::cuda::getCurrentCUDAStream(signed char) 2025-05-07T20:03:43.5387362Z U c10::cuda::getDefaultCUDAStream(signed char) 2025-05-07T20:03:43.5387504Z U c10::cuda::getStreamFromPool(bool, signed char) 2025-05-07T20:03:43.5387652Z U c10::cuda::getStreamFromPool(int, signed char) 2025-05-07T20:03:43.5387813Z U c10::cuda::setCurrentCUDAStream(c10::cuda::CUDAStream) 2025-05-07T20:03:43.5387917Z U c10::cuda::warn_or_error_on_sync() 2025-05-07T20:03:43.5388446Z U c10::detail::infer_schema::make_function_schema(c10::ArrayRef, c10::ArrayRef) 2025-05-07T20:03:43.5388685Z U c10::detail::torchCheckFail(char const*, char const*, unsigned int, char const*) 2025-05-07T20:03:43.5389180Z U c10::detail::torchCheckFail(char const*, char const*, unsigned int, std::__cxx11::basic_string, std::allocator > const&) 2025-05-07T20:03:43.5389499Z U c10::detail::torchInternalAssertFail(char 
const*, char const*, unsigned int, char const*, char const*) 2025-05-07T20:03:43.5389616Z U c10::impl::GPUTrace::gpuTraceState 2025-05-07T20:03:43.5389792Z U c10::impl::GPUTrace::haveState 2025-05-07T20:03:43.5389931Z U c10::impl::cow::is_cow_data_ptr(c10::DataPtr const&) 2025-05-07T20:03:43.5390088Z U c10::impl::cow::materialize_cow_storage(c10::StorageImpl&) 2025-05-07T20:03:43.5390224Z U c10::impl::device_guard_impl_registry 2025-05-07T20:03:43.5390360Z U c10::operator<<(std::ostream&, c10::Device const&) 2025-05-07T20:03:43.5390491Z U c10::operator<<(std::ostream&, c10::DeviceType) 2025-05-07T20:03:43.5390600Z U c10::throwNullDataPtrError() 2025-05-07T20:03:43.5390713Z U c10::warn(c10::Warning const&) 2025-05-07T20:03:43.5390820Z U c10::warnDeprecatedDataPtr() 2025-05-07T20:03:43.5391011Z U caffe2::TypeMeta::error_unsupported_typemeta(caffe2::TypeMeta) 2025-05-07T20:03:43.5391131Z U caffe2::TypeMeta::typeMetaDatas() 2025-05-07T20:03:43.5391253Z U cudaDeviceGetAttribute@libcudart.so.11.0 2025-05-07T20:03:43.5391380Z U cudaDeviceSynchronize@libcudart.so.11.0 2025-05-07T20:03:43.5391536Z U cudaEventCreateWithFlags@libcudart.so.11.0 2025-05-07T20:03:43.5391646Z U cudaEventDestroy@libcudart.so.11.0 2025-05-07T20:03:43.5391768Z U cudaEventElapsedTime@libcudart.so.11.0 2025-05-07T20:03:43.5391935Z U cudaEventQuery@libcudart.so.11.0 2025-05-07T20:03:43.5392047Z U cudaEventRecord@libcudart.so.11.0 2025-05-07T20:03:43.5392162Z U cudaEventSynchronize@libcudart.so.11.0 2025-05-07T20:03:43.5392280Z U cudaFuncGetAttributes@libcudart.so.11.0 2025-05-07T20:03:43.5392422Z U cudaGetDevice@libcudart.so.11.0 2025-05-07T20:03:43.5392535Z U cudaGetDeviceCount@libcudart.so.11.0 2025-05-07T20:03:43.5392647Z U cudaGetErrorString@libcudart.so.11.0 2025-05-07T20:03:43.5392871Z U cudaGetLastError@libcudart.so.11.0 2025-05-07T20:03:43.5392985Z U cudaLaunchKernel@libcudart.so.11.0 2025-05-07T20:03:43.5393439Z U cudaOccupancyMaxActiveBlocksPerMultiprocessorWithFlags@libcudart.so.11.0 
2025-05-07T20:03:43.5393567Z U cudaPeekAtLastError@libcudart.so.11.0 2025-05-07T20:03:43.5393691Z U cudaSetDevice@libcudart.so.11.0 2025-05-07T20:03:43.5393805Z U cudaStreamQuery@libcudart.so.11.0 2025-05-07T20:03:43.5393929Z U cudaStreamSynchronize@libcudart.so.11.0 2025-05-07T20:03:43.5394138Z U cudaStreamWaitEvent@libcudart.so.11.0 2025-05-07T20:03:43.5394281Z U float* at::TensorBase::data_ptr() const 2025-05-07T20:03:43.5394405Z U int* at::TensorBase::data_ptr() const 2025-05-07T20:03:43.5394592Z U long c10::detail::maybe_wrap_dim_slow(long, long, bool) 2025-05-07T20:03:43.5394726Z U long* at::TensorBase::data_ptr() const 2025-05-07T20:03:43.5394823Z U memcpy@GLIBC_2.14 2025-05-07T20:03:43.5394926Z U memset@GLIBC_2.2.5 2025-05-07T20:03:43.5395042Z U operator delete(void*)@GLIBCXX_3.4 2025-05-07T20:03:43.5395166Z U operator new(unsigned long)@GLIBCXX_3.4 2025-05-07T20:03:43.5395261Z U printf@GLIBC_2.2.5 2025-05-07T20:03:43.5395364Z U puts@GLIBC_2.2.5 2025-05-07T20:03:43.5395718Z U std::__cxx11::basic_ostringstream, std::allocator >::basic_ostringstream() 2025-05-07T20:03:43.5396121Z U std::__cxx11::basic_ostringstream, std::allocator >::~basic_ostringstream()@GLIBCXX_3.4.21 2025-05-07T20:03:43.5396676Z U std::__cxx11::basic_string, std::allocator >::_M_assign(std::__cxx11::basic_string, std::allocator > const&) 2025-05-07T20:03:43.5397154Z U std::__cxx11::basic_string, std::allocator >::_M_create(unsigned long&, unsigned long)@GLIBCXX_3.4.21 2025-05-07T20:03:43.5397684Z U std::__cxx11::basic_string, std::allocator >::_M_replace(unsigned long, unsigned long, char const*, unsigned long)@GLIBCXX_3.4.21 2025-05-07T20:03:43.5397830Z U std::__throw_length_error(char const*)@GLIBCXX_3.4 2025-05-07T20:03:43.5397970Z U std::__throw_logic_error(char const*)@GLIBCXX_3.4 2025-05-07T20:03:43.5398228Z U std::basic_ios >::clear(std::_Ios_Iostate)@GLIBCXX_3.4 2025-05-07T20:03:43.5398816Z U std::basic_ostream >& std::__ostream_insert >(std::basic_ostream >&, char const*, 
long)@GLIBCXX_3.4.9 2025-05-07T20:03:43.5398933Z U std::ios_base::Init::Init()@GLIBCXX_3.4 2025-05-07T20:03:43.5399075Z U std::ios_base::Init::~Init()@GLIBCXX_3.4 2025-05-07T20:03:43.5399190Z U std::ios_base::~ios_base()@GLIBCXX_3.4 2025-05-07T20:03:43.5399298Z U std::locale::~locale()@GLIBCXX_3.4 2025-05-07T20:03:43.5399493Z U std::ostream& std::ostream::_M_insert(long)@GLIBCXX_3.4.9 2025-05-07T20:03:43.5399625Z U std::terminate()@GLIBCXX_3.4 2025-05-07T20:03:43.5399722Z U strlen@GLIBC_2.2.5 2025-05-07T20:03:43.5399853Z U torch::CppFunction::~CppFunction() 2025-05-07T20:03:43.5400476Z U torch::Library::Library(torch::Library::Kind, std::__cxx11::basic_string, std::allocator >, std::optional, char const*, unsigned int) 2025-05-07T20:03:43.5400950Z U torch::Library::_def(c10::FunctionSchema&&, c10::OperatorName*, std::vector > const&, torch::_RegisterOrVerify) & 2025-05-07T20:03:43.5401224Z U torch::Library::_impl(char const*, torch::CppFunction&&, torch::_RegisterOrVerify) & 2025-05-07T20:03:43.5401590Z U torch::jit::parseSchema(std::__cxx11::basic_string, std::allocator > const&, bool) 2025-05-07T20:03:43.5401746Z U vtable for __cxxabiv1::__class_type_info@CXXABI_1.3 2025-05-07T20:03:43.5401917Z U vtable for __cxxabiv1::__function_type_info@CXXABI_1.3 2025-05-07T20:03:43.5402264Z U vtable for __cxxabiv1::__si_class_type_info@CXXABI_1.3 2025-05-07T20:03:43.5402596Z U vtable for std::__cxx11::basic_stringbuf, std::allocator >@GLIBCXX_3.4.21 2025-05-07T20:03:43.5402839Z U vtable for std::basic_streambuf >@GLIBCXX_3.4 2025-05-07T20:03:43.5402950Z w _ITM_deregisterTMCloneTable 2025-05-07T20:03:43.5403057Z w _ITM_registerTMCloneTable 2025-05-07T20:03:43.5403167Z w __cxa_finalize@GLIBC_2.2.5 2025-05-07T20:03:43.5403253Z w __gmon_start__ 2025-05-07T20:03:43.5403346Z w __pthread_key_create 2025-05-07T20:03:43.5403503Z [CHECK] Listing out external shared libraries linked: 2025-05-07T20:03:43.5403746Z + ldd 
./_skbuild/linux-x86_64-3.13/cmake-build/fbgemm_gpu_sparse_async_cumsum.so 2025-05-07T20:03:43.5403754Z 2025-05-07T20:03:43.5421880Z linux-vdso.so.1 (0x00007ffc5e1c9000) 2025-05-07T20:03:43.5422102Z libc10.so => not found 2025-05-07T20:03:43.5422226Z libc10_cuda.so => not found 2025-05-07T20:03:43.5422317Z libtorch.so => not found 2025-05-07T20:03:43.5422947Z libtorch_cpu.so => not found 2025-05-07T20:03:43.5423067Z libtorch_cuda.so => not found 2025-05-07T20:03:43.5423333Z libcudart.so.11.0 => not found 2025-05-07T20:03:43.5423541Z libstdc++.so.6 => /lib64/libstdc++.so.6 (0x00007feb75dab000) 2025-05-07T20:03:43.5423820Z libgomp.so.1 => /lib64/libgomp.so.1 (0x00007feb75d55000) 2025-05-07T20:03:43.5424323Z libgcc_s.so.1 => /lib64/libgcc_s.so.1 (0x00007feb75d27000) 2025-05-07T20:03:43.5424460Z libc.so.6 => /lib64/libc.so.6 (0x00007feb75b1f000) 2025-05-07T20:03:43.5424580Z libm.so.6 => /lib64/libm.so.6 (0x00007feb75a44000) 2025-05-07T20:03:43.5424710Z /lib64/ld-linux-x86-64.so.2 (0x00007feb7610d000) 2025-05-07T20:03:43.5424727Z 2025-05-07T20:03:43.5424849Z [CHECK] Displaying ELF information: 2025-05-07T20:03:43.5425122Z + readelf -d ./_skbuild/linux-x86_64-3.13/cmake-build/fbgemm_gpu_sparse_async_cumsum.so 2025-05-07T20:03:43.5425128Z 2025-05-07T20:03:43.5455514Z 2025-05-07T20:03:43.5456405Z Dynamic section at offset 0xc06b8 contains 37 entries: 2025-05-07T20:03:43.5456808Z Tag Type Name/Value 2025-05-07T20:03:43.5457383Z 0x0000000000000001 (NEEDED) Shared library: [libc10.so] 2025-05-07T20:03:43.5457969Z 0x0000000000000001 (NEEDED) Shared library: [libc10_cuda.so] 2025-05-07T20:03:43.5458574Z 0x0000000000000001 (NEEDED) Shared library: [libtorch.so] 2025-05-07T20:03:43.5459161Z 0x0000000000000001 (NEEDED) Shared library: [libtorch_cpu.so] 2025-05-07T20:03:43.5459751Z 0x0000000000000001 (NEEDED) Shared library: [libtorch_cuda.so] 2025-05-07T20:03:43.5460355Z 0x0000000000000001 (NEEDED) Shared library: [libcudart.so.11.0] 2025-05-07T20:03:43.5461226Z 0x0000000000000001 
(NEEDED) Shared library: [libstdc++.so.6] 2025-05-07T20:03:43.5461815Z 0x0000000000000001 (NEEDED) Shared library: [libgomp.so.1] 2025-05-07T20:03:43.5462383Z 0x0000000000000001 (NEEDED) Shared library: [libgcc_s.so.1] 2025-05-07T20:03:43.5462948Z 0x0000000000000001 (NEEDED) Shared library: [libc.so.6] 2025-05-07T20:03:43.5463842Z 0x000000000000000e (SONAME) Library soname: [fbgemm_gpu_sparse_async_cumsum.so] 2025-05-07T20:03:43.5464416Z 0x000000000000000f (RPATH) Library rpath: [$ORIGIN] 2025-05-07T20:03:43.5464765Z 0x000000000000000c (INIT) 0xa000 2025-05-07T20:03:43.5465002Z 0x000000000000000d (FINI) 0x1813c 2025-05-07T20:03:43.5465110Z 0x0000000000000019 (INIT_ARRAY) 0xc13b0 2025-05-07T20:03:43.5465240Z 0x000000000000001b (INIT_ARRAYSZ) 32 (bytes) 2025-05-07T20:03:43.5465345Z 0x000000000000001a (FINI_ARRAY) 0xc13d0 2025-05-07T20:03:43.5465456Z 0x000000000000001c (FINI_ARRAYSZ) 8 (bytes) 2025-05-07T20:03:43.5465574Z 0x000000006ffffef5 (GNU_HASH) 0x200 2025-05-07T20:03:43.5465675Z 0x0000000000000005 (STRTAB) 0x22f0 2025-05-07T20:03:43.5465775Z 0x0000000000000006 (SYMTAB) 0x928 2025-05-07T20:03:43.5465896Z 0x000000000000000a (STRSZ) 20379 (bytes) 2025-05-07T20:03:43.5466019Z 0x000000000000000b (SYMENT) 24 (bytes) 2025-05-07T20:03:43.5466124Z 0x0000000000000003 (PLTGOT) 0xc1948 2025-05-07T20:03:43.5466246Z 0x0000000000000002 (PLTRELSZ) 3936 (bytes) 2025-05-07T20:03:43.5466365Z 0x0000000000000014 (PLTREL) RELA 2025-05-07T20:03:43.5466463Z 0x0000000000000017 (JMPREL) 0x8298 2025-05-07T20:03:43.5466562Z 0x0000000000000007 (RELA) 0x7578 2025-05-07T20:03:43.5466693Z 0x0000000000000008 (RELASZ) 3360 (bytes) 2025-05-07T20:03:43.5466803Z 0x0000000000000009 (RELAENT) 24 (bytes) 2025-05-07T20:03:43.5466893Z 0x0000000000000018 (BIND_NOW) 2025-05-07T20:03:43.5467007Z 0x000000006ffffffb (FLAGS_1) Flags: NOW 2025-05-07T20:03:43.5467123Z 0x000000006ffffffe (VERNEED) 0x74b8 2025-05-07T20:03:43.5467389Z 0x000000006fffffff (VERNEEDNUM) 4 2025-05-07T20:03:43.5467497Z 
0x000000006ffffff0 (VERSYM) 0x728c 2025-05-07T20:03:43.5467611Z 0x000000006ffffff9 (RELACOUNT) 7 2025-05-07T20:03:43.5467705Z 0x0000000000000000 (NULL) 0x0 2025-05-07T20:03:43.5467711Z 2025-05-07T20:03:43.5467821Z ################################################################################ 2025-05-07T20:03:43.5467885Z 2025-05-07T20:03:43.5467889Z 2025-05-07T20:03:43.5468175Z ################################################################################ 2025-05-07T20:03:43.5468517Z [CHECK] BUILT LIBRARY: ./_skbuild/linux-x86_64-3.13/cmake-build/fbgemm_gpu_tbe_training_backward_pt2.so 2025-05-07T20:03:43.5468622Z [CHECK] Listing out library size: 2025-05-07T20:03:43.5468971Z + du -h --block-size=1M ./_skbuild/linux-x86_64-3.13/cmake-build/fbgemm_gpu_tbe_training_backward_pt2.so 2025-05-07T20:03:43.5468976Z 2025-05-07T20:03:43.5469238Z 11 ./_skbuild/linux-x86_64-3.13/cmake-build/fbgemm_gpu_tbe_training_backward_pt2.so 2025-05-07T20:03:43.5469243Z 2025-05-07T20:03:43.5469693Z [CHECK] Listing out the GLIBC versions referenced by: ./_skbuild/linux-x86_64-3.13/cmake-build/fbgemm_gpu_tbe_training_backward_pt2.so 2025-05-07T20:03:43.5470263Z + objdump -TC ./_skbuild/linux-x86_64-3.13/cmake-build/fbgemm_gpu_tbe_training_backward_pt2.so | grep GLIBC_ | sed 's/.*GLIBC_\([.0-9]*\).*/GLIBC_\1/g' | sort -Vu | cat 2025-05-07T20:03:43.5470272Z 2025-05-07T20:03:43.5931058Z GLIBC_2.2.5 2025-05-07T20:03:43.5931693Z GLIBC_2.3 2025-05-07T20:03:43.5932228Z GLIBC_2.14 2025-05-07T20:03:43.5932544Z 2025-05-07T20:03:43.5932556Z 2025-05-07T20:03:43.5934271Z [CHECK] Listing out the GLIBCXX versions referenced by: ./_skbuild/linux-x86_64-3.13/cmake-build/fbgemm_gpu_tbe_training_backward_pt2.so 2025-05-07T20:03:43.5936406Z + objdump -TC ./_skbuild/linux-x86_64-3.13/cmake-build/fbgemm_gpu_tbe_training_backward_pt2.so | grep GLIBCXX_ | sed 's/.*GLIBCXX_\([.0-9]*\).*/GLIBCXX_\1/g' | sort -Vu | cat 2025-05-07T20:03:43.5937066Z 2025-05-07T20:03:43.6391440Z GLIBCXX_3.4 
2025-05-07T20:03:43.6391821Z GLIBCXX_3.4.9 2025-05-07T20:03:43.6392671Z GLIBCXX_3.4.11 2025-05-07T20:03:43.6393352Z GLIBCXX_3.4.15 2025-05-07T20:03:43.6393582Z GLIBCXX_3.4.18 2025-05-07T20:03:43.6393878Z GLIBCXX_3.4.20 2025-05-07T20:03:43.6394082Z GLIBCXX_3.4.21 2025-05-07T20:03:43.6394220Z 2025-05-07T20:03:43.6394238Z 2025-05-07T20:03:43.6410253Z + nm -gDC ./_skbuild/linux-x86_64-3.13/cmake-build/fbgemm_gpu_tbe_training_backward_pt2.so > /tmp/tmp.81rrScJx43.symbols.txt 2025-05-07T20:03:43.6411784Z 2025-05-07T20:03:43.6820121Z 2025-05-07T20:03:43.6848542Z [CHECK] Total Number of symbols: 4395 2025-05-07T20:03:43.6875757Z [CHECK] Number of fbgemm symbols: 4 2025-05-07T20:03:43.6894262Z + nm -gDCu ./_skbuild/linux-x86_64-3.13/cmake-build/fbgemm_gpu_tbe_training_backward_pt2.so > /tmp/tmp.hIiPkxkfst.usymbols.txt 2025-05-07T20:03:43.6895940Z 2025-05-07T20:03:43.6921976Z 2025-05-07T20:03:43.6950220Z [CHECK] Listing out undefined symbols (192 total): 2025-05-07T20:03:43.6972885Z U VTT for std::__cxx11::basic_ostringstream, std::allocator >@GLIBCXX_3.4.21 2025-05-07T20:03:43.6973804Z U VTT for std::__cxx11::basic_stringstream, std::allocator >@GLIBCXX_3.4.21 2025-05-07T20:03:43.6974384Z U _Unwind_Resume@GCC_3.0 2025-05-07T20:03:43.6974745Z U __cxa_allocate_exception@CXXABI_1.3 2025-05-07T20:03:43.6975100Z U __cxa_atexit@GLIBC_2.2.5 2025-05-07T20:03:43.6975419Z U __cxa_begin_catch@CXXABI_1.3 2025-05-07T20:03:43.6975734Z U __cxa_end_catch@CXXABI_1.3 2025-05-07T20:03:43.6976034Z U __cxa_free_exception@CXXABI_1.3 2025-05-07T20:03:43.6976359Z U __cxa_guard_abort@CXXABI_1.3 2025-05-07T20:03:43.6976681Z U __cxa_guard_acquire@CXXABI_1.3 2025-05-07T20:03:43.6977003Z U __cxa_guard_release@CXXABI_1.3 2025-05-07T20:03:43.6977323Z U __cxa_rethrow@CXXABI_1.3 2025-05-07T20:03:43.6977618Z U __cxa_throw@CXXABI_1.3 2025-05-07T20:03:43.6977937Z U __gxx_personality_v0@CXXABI_1.3 2025-05-07T20:03:43.6978246Z U __tls_get_addr@GLIBC_2.3 2025-05-07T20:03:43.6978635Z U 
at::RecordFunction::RecordFunction(at::StepCallbacks&&) 2025-05-07T20:03:43.6979332Z U at::RecordFunction::currentThreadId() 2025-05-07T20:03:43.6979682Z U at::RecordFunction::end() 2025-05-07T20:03:43.6980026Z U at::RecordFunction::~RecordFunction() 2025-05-07T20:03:43.6980501Z U at::SavedTensorDefaultHooks::set_tracing(bool) 2025-05-07T20:03:43.6981037Z U at::Tensor fbgemm_gpu::reshape_vbe_offsets(at::Tensor const&, at::Tensor const&, long, int) 2025-05-07T20:03:43.6981697Z U at::Tensor fbgemm_gpu::reshape_vbe_offsets(at::Tensor const&, at::Tensor const&, long, int) 2025-05-07T20:03:43.6982402Z U at::_ops::clamp::call(at::Tensor const&, std::optional const&, std::optional const&) 2025-05-07T20:03:43.6983012Z U at::_ops::clone::call(at::Tensor const&, std::optional) 2025-05-07T20:03:43.6983926Z U at::_ops::empty_memory_format::call(c10::ArrayRef, std::optional, std::optional, std::optional, std::optional, std::optional) 2025-05-07T20:03:43.6984836Z U at::_ops::flatten_using_ints::call(at::Tensor const&, long, long) 2025-05-07T20:03:43.6985281Z U at::_ops::view_as::call(at::Tensor const&, at::Tensor const&) 2025-05-07T20:03:43.6985706Z U at::functorch::functorchTLSAccessor() 2025-05-07T20:03:43.6986079Z U at::getStepCallbacksUnlessEmpty(at::RecordScope) 2025-05-07T20:03:43.6986441Z U at::sequence_number::get_and_increment() 2025-05-07T20:03:43.6986760Z U bcmp@GLIBC_2.2.5 2025-05-07T20:03:43.6987023Z U c10::AnyType::get() 2025-05-07T20:03:43.6987356Z U c10::BoolType::get() 2025-05-07T20:03:43.6987703Z U c10::DeviceTypeName[abi:cxx11](c10::DeviceType, bool) 2025-05-07T20:03:43.6988125Z U c10::Dispatcher::findSchemaOrThrow(char const*, char const*) 2025-05-07T20:03:43.6988522Z U c10::Dispatcher::realSingleton() 2025-05-07T20:03:43.6989216Z U c10::Dispatcher::runRecordFunction(at::RecordFunction&, std::reference_wrapper, c10::DispatchKey, c10::DispatchKeySet) 2025-05-07T20:03:43.6990425Z U c10::Dispatcher::runRecordFunction(at::RecordFunction&, std::reference_wrapper, 
c10::DispatchKey, c10::DispatchKeySet, c10::ArrayRef) 2025-05-07T20:03:43.6991502Z U c10::Error::Error(c10::SourceLocation, std::__cxx11::basic_string, std::allocator >) 2025-05-07T20:03:43.6992045Z U c10::Error::what() const 2025-05-07T20:03:43.6992346Z U c10::FloatType::get() 2025-05-07T20:03:43.6992639Z U c10::GradMode::is_enabled() 2025-05-07T20:03:43.6993081Z U c10::GradMode::set_enabled(bool) 2025-05-07T20:03:43.6993653Z U c10::IValue::TagType::get(c10::IValue const&) 2025-05-07T20:03:43.6994079Z U c10::IValue::hash(c10::IValue const&) 2025-05-07T20:03:43.6994430Z U c10::IValue::isBoolList() const 2025-05-07T20:03:43.6994758Z U c10::IValue::isDoubleList() const 2025-05-07T20:03:43.6995097Z U c10::IValue::isIntList() const 2025-05-07T20:03:43.6995440Z U c10::IValue::isSymIntList() const 2025-05-07T20:03:43.6995767Z U c10::IValue::isTensorList() const 2025-05-07T20:03:43.6996144Z U c10::IValue::reportToTensorTypeError() const 2025-05-07T20:03:43.6996493Z U c10::IntType::get() 2025-05-07T20:03:43.6997198Z U c10::ListType::get(std::__cxx11::basic_string, std::allocator > const&, c10::Type::SingletonOrSharedTypePtr) 2025-05-07T20:03:43.6997974Z U c10::MessageLogger::MessageLogger(char const*, int, int) 2025-05-07T20:03:43.6998454Z U c10::MessageLogger::~MessageLogger() 2025-05-07T20:03:43.6998817Z U c10::OperatorHandle::~OperatorHandle() 2025-05-07T20:03:43.6999180Z U c10::OperatorHandle::~OperatorHandle() 2025-05-07T20:03:43.6999743Z U c10::OptionalType::get(c10::Type::SingletonOrSharedTypePtr) 2025-05-07T20:03:43.7000369Z U c10::SmallVectorBase::grow_pod(void const*, unsigned long, unsigned long) 2025-05-07T20:03:43.7000845Z U c10::StringType::get() 2025-05-07T20:03:43.7001161Z U c10::SymBool::expect_true(char const*, long) const 2025-05-07T20:03:43.7001551Z U c10::SymBool::guard_bool(char const*, long) const 2025-05-07T20:03:43.7001918Z U c10::SymFloat::guard_float(char const*, long) const 2025-05-07T20:03:43.7002772Z U c10::SymFloat::operator/(c10::SymFloat 
const&) const 2025-05-07T20:03:43.7003472Z U c10::SymInt::SymInt(c10::intrusive_ptr >) 2025-05-07T20:03:43.7004136Z U c10::SymInt::guard_int(char const*, long) const 2025-05-07T20:03:43.7004533Z U c10::SymInt::operator c10::SymFloat() const 2025-05-07T20:03:43.7005007Z U c10::SymInt::operator/(c10::SymInt const&) const 2025-05-07T20:03:43.7005391Z U c10::SymInt::promote_to_negative() 2025-05-07T20:03:43.7005751Z U c10::SymInt::sym_le(c10::SymInt const&) const 2025-05-07T20:03:43.7006098Z U c10::SymInt::toSymNode() const 2025-05-07T20:03:43.7006429Z U c10::SymIntType::get() 2025-05-07T20:03:43.7006794Z U c10::TensorImpl::requires_grad() const 2025-05-07T20:03:43.7007154Z U c10::TensorType::get() 2025-05-07T20:03:43.7007474Z U c10::UndefinedTensorImpl::_singleton 2025-05-07T20:03:43.7008148Z U c10::detail::ListImpl::ListImpl(std::vector >, c10::Type::SingletonOrSharedTypePtr) 2025-05-07T20:03:43.7009491Z U c10::detail::infer_schema::make_function_schema(c10::ArrayRef, c10::ArrayRef) 2025-05-07T20:03:43.7010359Z U c10::detail::torchCheckFail(char const*, char const*, unsigned int, char const*) 2025-05-07T20:03:43.7011219Z U c10::detail::torchCheckFail(char const*, char const*, unsigned int, std::__cxx11::basic_string, std::allocator > const&) 2025-05-07T20:03:43.7012161Z U c10::detail::torchInternalAssertFail(char const*, char const*, unsigned int, char const*, char const*) 2025-05-07T20:03:43.7013178Z U c10::detail::torchInternalAssertFail(char const*, char const*, unsigned int, char const*, std::__cxx11::basic_string, std::allocator > const&) 2025-05-07T20:03:43.7014199Z U c10::impl::OperatorEntry::assertSignatureIsCorrect(c10::impl::CppSignature const&, bool) const 2025-05-07T20:03:43.7014834Z U c10::impl::OperatorEntry::reportError(c10::DispatchKey) const 2025-05-07T20:03:43.7015243Z U c10::impl::raw_local_dispatch_key_set 2025-05-07T20:03:43.7015744Z U c10::initializeFunctionalityOffsetsAndMasks() 2025-05-07T20:03:43.7016344Z U 
c10::ivalue::ConstantString::create(std::__cxx11::basic_string, std::allocator >) 2025-05-07T20:03:43.7016934Z U c10::operator-(c10::SymInt const&, int) 2025-05-07T20:03:43.7017322Z U c10::operator<<(std::ostream&, c10::OperatorName const&) 2025-05-07T20:03:43.7017716Z U c10::operator<<(std::ostream&, c10::SymFloat const&) 2025-05-07T20:03:43.7018110Z U c10::operator<<(std::ostream&, c10::SymInt const&) 2025-05-07T20:03:43.7020334Z U caffe2::TypeMeta::error_unsupported_typemeta(caffe2::TypeMeta) 2025-05-07T20:03:43.7020758Z U caffe2::TypeMeta::typeMetaDatas() 2025-05-07T20:03:43.7021234Z U fbgemm_gpu::config::is_feature_enabled(fbgemm_gpu::config::FeatureGateName const&) 2025-05-07T20:03:43.7021853Z U fbgemm_gpu::reshape_vbe_output(at::Tensor const&, long, at::Tensor const&, at::Tensor const&) 2025-05-07T20:03:43.7022329Z U free@GLIBC_2.2.5 2025-05-07T20:03:43.7022669Z U long c10::detail::maybe_wrap_dim_slow(long, long, bool) 2025-05-07T20:03:43.7023047Z U memcmp@GLIBC_2.2.5 2025-05-07T20:03:43.7023345Z U memcpy@GLIBC_2.14 2025-05-07T20:03:43.7023784Z U memmove@GLIBC_2.2.5 2025-05-07T20:03:43.7024089Z U memset@GLIBC_2.2.5 2025-05-07T20:03:43.7024394Z U operator delete(void*)@GLIBCXX_3.4 2025-05-07T20:03:43.7024754Z U operator new(unsigned long)@GLIBCXX_3.4 2025-05-07T20:03:43.7025080Z U realloc@GLIBC_2.2.5 2025-05-07T20:03:43.7025488Z U std::_Hash_bytes(void const*, unsigned long, unsigned long)@CXXABI_1.3.5 2025-05-07T20:03:43.7026194Z U std::__cxx11::basic_ostringstream, std::allocator >::basic_ostringstream() 2025-05-07T20:03:43.7027023Z U std::__cxx11::basic_ostringstream, std::allocator >::~basic_ostringstream()@GLIBCXX_3.4.21 2025-05-07T20:03:43.7027938Z U std::__cxx11::basic_string, std::allocator >::_M_append(char const*, unsigned long)@GLIBCXX_3.4.21 2025-05-07T20:03:43.7029027Z U std::__cxx11::basic_string, std::allocator >::_M_assign(std::__cxx11::basic_string, std::allocator > const&) 2025-05-07T20:03:43.7030242Z U std::__cxx11::basic_string, 
std::allocator >::_M_construct(unsigned long, char)@GLIBCXX_3.4.21 2025-05-07T20:03:43.7031178Z U std::__cxx11::basic_string, std::allocator >::_M_create(unsigned long&, unsigned long)@GLIBCXX_3.4.21 2025-05-07T20:03:43.7032231Z U std::__cxx11::basic_string, std::allocator >::_M_replace(unsigned long, unsigned long, char const*, unsigned long)@GLIBCXX_3.4.21 2025-05-07T20:03:43.7033351Z U std::__cxx11::basic_string, std::allocator >::reserve(unsigned long)@GLIBCXX_3.4.21 2025-05-07T20:03:43.7034420Z U std::__cxx11::basic_string, std::allocator >::swap(std::__cxx11::basic_string, std::allocator >&)@GLIBCXX_3.4.21 2025-05-07T20:03:43.7035433Z U std::__cxx11::basic_stringstream, std::allocator >::basic_stringstream() 2025-05-07T20:03:43.7036271Z U std::__cxx11::basic_stringstream, std::allocator >::~basic_stringstream()@GLIBCXX_3.4.21 2025-05-07T20:03:43.7037165Z U std::__detail::_Prime_rehash_policy::_M_need_rehash(unsigned long, unsigned long, unsigned long) const@GLIBCXX_3.4.18 2025-05-07T20:03:43.7037785Z U std::__throw_bad_alloc()@GLIBCXX_3.4 2025-05-07T20:03:43.7038127Z U std::__throw_bad_array_new_length() 2025-05-07T20:03:43.7038499Z U std::__throw_length_error(char const*)@GLIBCXX_3.4 2025-05-07T20:03:43.7038897Z U std::__throw_logic_error(char const*)@GLIBCXX_3.4 2025-05-07T20:03:43.7039344Z U std::__throw_out_of_range_fmt(char const*, ...)@GLIBCXX_3.4.20 2025-05-07T20:03:43.7039885Z U std::__throw_system_error(int)@GLIBCXX_3.4.11 2025-05-07T20:03:43.7040327Z U std::bad_weak_ptr::~bad_weak_ptr()@GLIBCXX_3.4.15 2025-05-07T20:03:43.7040803Z U std::basic_ios >::clear(std::_Ios_Iostate)@GLIBCXX_3.4 2025-05-07T20:03:43.7041880Z U std::basic_ostream >& std::__ostream_insert >(std::basic_ostream >&, char const*, long)@GLIBCXX_3.4.9 2025-05-07T20:03:43.7042686Z U std::exception::~exception()@GLIBCXX_3.4 2025-05-07T20:03:43.7043051Z U std::ios_base::Init::Init()@GLIBCXX_3.4 2025-05-07T20:03:43.7043391Z U std::ios_base::Init::~Init()@GLIBCXX_3.4 
2025-05-07T20:03:43.7043976Z U std::ios_base::~ios_base()@GLIBCXX_3.4 2025-05-07T20:03:43.7044317Z U std::locale::~locale()@GLIBCXX_3.4 2025-05-07T20:03:43.7044735Z U std::ostream& std::ostream::_M_insert(long)@GLIBCXX_3.4.9 2025-05-07T20:03:43.7045278Z U std::ostream& std::ostream::_M_insert(unsigned long)@GLIBCXX_3.4.9 2025-05-07T20:03:43.7045780Z U std::ostream::operator<<(int)@GLIBCXX_3.4 2025-05-07T20:03:43.7046195Z U std::out_of_range::out_of_range(char const*)@GLIBCXX_3.4.21 2025-05-07T20:03:43.7046621Z U std::out_of_range::~out_of_range()@GLIBCXX_3.4 2025-05-07T20:03:43.7047453Z U std::runtime_error::runtime_error(std::__cxx11::basic_string, std::allocator > const&)@GLIBCXX_3.4.21 2025-05-07T20:03:43.7048138Z U std::runtime_error::~runtime_error()@GLIBCXX_3.4 2025-05-07T20:03:43.7048653Z U std::terminate()@GLIBCXX_3.4 2025-05-07T20:03:43.7048999Z U strcmp@GLIBC_2.2.5 2025-05-07T20:03:43.7049288Z U strlen@GLIBC_2.2.5 2025-05-07T20:03:43.7049604Z U torch::CppFunction::~CppFunction() 2025-05-07T20:03:43.7050414Z U torch::Library::Library(torch::Library::Kind, std::__cxx11::basic_string, std::allocator >, std::optional, char const*, unsigned int) 2025-05-07T20:03:43.7051579Z U torch::Library::_def(c10::FunctionSchema&&, c10::OperatorName*, std::vector > const&, torch::_RegisterOrVerify) & 2025-05-07T20:03:43.7052407Z U torch::Library::_impl(char const*, torch::CppFunction&&, torch::_RegisterOrVerify) & 2025-05-07T20:03:43.7052898Z U torch::autograd::AnomalyMode::_enabled 2025-05-07T20:03:43.7053433Z U torch::autograd::AutogradContext::AutogradContext(torch::dynamo::autograd::PackedArgs&) 2025-05-07T20:03:43.7054035Z U torch::autograd::AutogradContext::get_and_bump_dirty() const 2025-05-07T20:03:43.7054524Z U torch::autograd::AutogradContext::get_non_differentiable() const 2025-05-07T20:03:43.7055035Z U torch::autograd::AutogradContext::get_saved_variables() const 2025-05-07T20:03:43.7055788Z U torch::autograd::AutogradContext::save_for_backward(std::vector >) 
2025-05-07T20:03:43.7056379Z U torch::autograd::AutogradContext::save_variables() 2025-05-07T20:03:43.7056819Z U torch::autograd::ForwardADLevel::try_get_by_idx(unsigned long) 2025-05-07T20:03:43.7057273Z U torch::autograd::InputMetadata::shape_as_dim_vector() const 2025-05-07T20:03:43.7057678Z U torch::autograd::Node::assign_parent() 2025-05-07T20:03:43.7057998Z U torch::autograd::Node::metadata() 2025-05-07T20:03:43.7058340Z U torch::autograd::Node::name[abi:cxx11]() const 2025-05-07T20:03:43.7058821Z U torch::autograd::SavedVariable::SavedVariable(at::Tensor const&, bool, bool) 2025-05-07T20:03:43.7059408Z U torch::autograd::SavedVariable::unpack(std::shared_ptr) const 2025-05-07T20:03:43.7059970Z U torch::autograd::VariableInfo::VariableInfo() 2025-05-07T20:03:43.7060401Z U torch::autograd::VariableInfo::VariableInfo(at::Tensor const&, bool) 2025-05-07T20:03:43.7060922Z U torch::autograd::VariableInfo::zeros(c10::OptionalDeviceGuard&) const 2025-05-07T20:03:43.7063787Z U torch::autograd::_wrap_outputs(std::vector > const&, std::unordered_set, std::equal_to, std::allocator > const&, std::unordered_set, std::equal_to, std::allocator > const&, c10::ArrayRef >, std::shared_ptr const&, std::function > (std::vector >, std::vector >)> const&, std::unordered_set, std::equal_to, std::allocator > const&, std::function const&) 2025-05-07T20:03:43.7066567Z U torch::autograd::deleteNode(torch::autograd::Node*) 2025-05-07T20:03:43.7066967Z U torch::autograd::get_current_graph_task_exec_info() 2025-05-07T20:03:43.7067393Z U torch::autograd::impl::gradient_edge(at::Tensor const&) 2025-05-07T20:03:43.7068413Z U torch::autograd::profiler::record_function_enter_new(std::__cxx11::basic_string, std::allocator > const&, std::optional, std::allocator > > const&) 2025-05-07T20:03:43.7069448Z U torch::dynamo::autograd::getPyCompilerInterface() 2025-05-07T20:03:43.7070085Z U torch::dynamo::autograd::get_input_metadata(std::vector > const&) 2025-05-07T20:03:43.7070928Z U 
torch::jit::parseSchema(std::__cxx11::basic_string, std::allocator > const&, bool) 2025-05-07T20:03:43.7071485Z U typeinfo for c10::Error 2025-05-07T20:03:43.7071807Z U typeinfo for std::bad_weak_ptr@GLIBCXX_3.4.15 2025-05-07T20:03:43.7072176Z U typeinfo for std::exception@GLIBCXX_3.4 2025-05-07T20:03:43.7072510Z U typeinfo for std::out_of_range@GLIBCXX_3.4 2025-05-07T20:03:43.7072958Z U typeinfo for std::runtime_error@GLIBCXX_3.4 2025-05-07T20:03:43.7073495Z U typeinfo for torch::autograd::Node 2025-05-07T20:03:43.7073902Z U vtable for __cxxabiv1::__class_type_info@CXXABI_1.3 2025-05-07T20:03:43.7074343Z U vtable for __cxxabiv1::__function_type_info@CXXABI_1.3 2025-05-07T20:03:43.7074775Z U vtable for __cxxabiv1::__pointer_type_info@CXXABI_1.3 2025-05-07T20:03:43.7075224Z U vtable for __cxxabiv1::__si_class_type_info@CXXABI_1.3 2025-05-07T20:03:43.7075599Z U vtable for c10::Error 2025-05-07T20:03:43.7076166Z U vtable for std::__cxx11::basic_stringbuf, std::allocator >@GLIBCXX_3.4.21 2025-05-07T20:03:43.7076766Z U vtable for std::bad_weak_ptr@GLIBCXX_3.4.15 2025-05-07T20:03:43.7077235Z U vtable for std::basic_streambuf >@GLIBCXX_3.4 2025-05-07T20:03:43.7077712Z U vtable for torch::autograd::Node 2025-05-07T20:03:43.7078130Z w TLS init function for c10::impl::raw_local_dispatch_key_set 2025-05-07T20:03:43.7078546Z w _ITM_deregisterTMCloneTable 2025-05-07T20:03:43.7078877Z w _ITM_registerTMCloneTable 2025-05-07T20:03:43.7079179Z w __cxa_finalize@GLIBC_2.2.5 2025-05-07T20:03:43.7079492Z w __gmon_start__ 2025-05-07T20:03:43.7079854Z w __pthread_key_create 2025-05-07T20:03:43.7080181Z w pthread_mutex_lock@GLIBC_2.2.5 2025-05-07T20:03:43.7080508Z w pthread_mutex_unlock@GLIBC_2.2.5 2025-05-07T20:03:43.7080898Z [CHECK] Listing out external shared libraries linked: 2025-05-07T20:03:43.7081433Z + ldd ./_skbuild/linux-x86_64-3.13/cmake-build/fbgemm_gpu_tbe_training_backward_pt2.so 2025-05-07T20:03:43.7081806Z 2025-05-07T20:03:43.7081964Z linux-vdso.so.1 (0x00007ffcb81d3000) 
2025-05-07T20:03:43.7082279Z libc10.so => not found 2025-05-07T20:03:43.7082939Z fbgemm_gpu_tbe_common.so => /__w/FBGEMM/FBGEMM/fbgemm_gpu/./_skbuild/linux-x86_64-3.13/cmake-build/fbgemm_gpu_tbe_common.so (0x00007fdd9de0a000) 2025-05-07T20:03:43.7083623Z libtorch.so => not found 2025-05-07T20:03:43.7084251Z fbgemm_gpu_config.so => /__w/FBGEMM/FBGEMM/fbgemm_gpu/./_skbuild/linux-x86_64-3.13/cmake-build/fbgemm_gpu_config.so (0x00007fdd9ec2e000) 2025-05-07T20:03:43.7084911Z libtorch_cpu.so => not found 2025-05-07T20:03:43.7085200Z libtorch_cuda.so => not found 2025-05-07T20:03:43.7085533Z libstdc++.so.6 => /lib64/libstdc++.so.6 (0x00007fdd9dba6000) 2025-05-07T20:03:43.7085970Z libgcc_s.so.1 => /lib64/libgcc_s.so.1 (0x00007fdd9ebfe000) 2025-05-07T20:03:43.7086360Z libc.so.6 => /lib64/libc.so.6 (0x00007fdd9d99e000) 2025-05-07T20:03:43.7086748Z /lib64/ld-linux-x86-64.so.2 (0x00007fdd9ec3d000) 2025-05-07T20:03:43.7087095Z libc10.so => not found 2025-05-07T20:03:43.7087339Z libc10_cuda.so => not found 2025-05-07T20:03:43.7087886Z fbgemm.so => /__w/FBGEMM/FBGEMM/fbgemm_gpu/./_skbuild/linux-x86_64-3.13/cmake-build/fbgemm.so (0x00007fdd9d400000) 2025-05-07T20:03:43.7088439Z libtorch.so => not found 2025-05-07T20:03:43.7088704Z libtorch_cpu.so => not found 2025-05-07T20:03:43.7088995Z libtorch_cuda.so => not found 2025-05-07T20:03:43.7089277Z libcudart.so.11.0 => not found 2025-05-07T20:03:43.7089613Z libgomp.so.1 => /lib64/libgomp.so.1 (0x00007fdd9eba6000) 2025-05-07T20:03:43.7089960Z libtorch.so => not found 2025-05-07T20:03:43.7090211Z libc10.so => not found 2025-05-07T20:03:43.7090448Z libtorch_cpu.so => not found 2025-05-07T20:03:43.7090723Z libtorch_cuda.so => not found 2025-05-07T20:03:43.7091065Z libpthread.so.0 => /lib64/libpthread.so.0 (0x00007fdd9eb9f000) 2025-05-07T20:03:43.7091488Z libm.so.6 => /lib64/libm.so.6 (0x00007fdd9eac2000) 2025-05-07T20:03:43.7091814Z libc10.so => not found 2025-05-07T20:03:43.7092340Z asmjit.so => 
/__w/FBGEMM/FBGEMM/fbgemm_gpu/./_skbuild/linux-x86_64-3.13/cmake-build/asmjit.so (0x00007fdd9ea4a000) 2025-05-07T20:03:43.7093024Z libtorch.so => not found 2025-05-07T20:03:43.7093267Z libtorch_cpu.so => not found 2025-05-07T20:03:43.7093541Z libtorch_cuda.so => not found 2025-05-07T20:03:43.7093800Z libtorch_cpu.so => not found 2025-05-07T20:03:43.7094066Z libtorch_cuda.so => not found 2025-05-07T20:03:43.7094325Z libtorch.so => not found 2025-05-07T20:03:43.7094616Z librt.so.1 => /lib64/librt.so.1 (0x00007fdd9d997000) 2025-05-07T20:03:43.7095034Z 2025-05-07T20:03:43.7095140Z [CHECK] Displaying ELF information: 2025-05-07T20:03:43.7095667Z + readelf -d ./_skbuild/linux-x86_64-3.13/cmake-build/fbgemm_gpu_tbe_training_backward_pt2.so 2025-05-07T20:03:43.7096067Z 2025-05-07T20:03:43.7096071Z 2025-05-07T20:03:43.7096242Z Dynamic section at offset 0xa3d920 contains 37 entries: 2025-05-07T20:03:43.7096620Z Tag Type Name/Value 2025-05-07T20:03:43.7097044Z 0x0000000000000001 (NEEDED) Shared library: [libc10.so] 2025-05-07T20:03:43.7097572Z 0x0000000000000001 (NEEDED) Shared library: [fbgemm_gpu_tbe_common.so] 2025-05-07T20:03:43.7098124Z 0x0000000000000001 (NEEDED) Shared library: [libtorch.so] 2025-05-07T20:03:43.7098648Z 0x0000000000000001 (NEEDED) Shared library: [fbgemm_gpu_config.so] 2025-05-07T20:03:43.7099196Z 0x0000000000000001 (NEEDED) Shared library: [libtorch_cpu.so] 2025-05-07T20:03:43.7099729Z 0x0000000000000001 (NEEDED) Shared library: [libtorch_cuda.so] 2025-05-07T20:03:43.7100309Z 0x0000000000000001 (NEEDED) Shared library: [libstdc++.so.6] 2025-05-07T20:03:43.7100834Z 0x0000000000000001 (NEEDED) Shared library: [libgcc_s.so.1] 2025-05-07T20:03:43.7101329Z 0x0000000000000001 (NEEDED) Shared library: [libc.so.6] 2025-05-07T20:03:43.7101866Z 0x0000000000000001 (NEEDED) Shared library: [ld-linux-x86-64.so.2] 2025-05-07T20:03:43.7102679Z 0x000000000000000e (SONAME) Library soname: [fbgemm_gpu_tbe_training_backward_pt2.so] 2025-05-07T20:03:43.7103334Z 
0x000000000000000f (RPATH) Library rpath: [$ORIGIN] 2025-05-07T20:03:43.7103760Z 0x000000000000000c (INIT) 0x189000 2025-05-07T20:03:43.7104098Z 0x000000000000000d (FINI) 0x8a73b8 2025-05-07T20:03:43.7104447Z 0x0000000000000019 (INIT_ARRAY) 0xa32f68 2025-05-07T20:03:43.7104799Z 0x000000000000001b (INIT_ARRAYSZ) 256 (bytes) 2025-05-07T20:03:43.7105162Z 0x000000000000001a (FINI_ARRAY) 0xa33068 2025-05-07T20:03:43.7105504Z 0x000000000000001c (FINI_ARRAYSZ) 8 (bytes) 2025-05-07T20:03:43.7105855Z 0x000000006ffffef5 (GNU_HASH) 0x200 2025-05-07T20:03:43.7106194Z 0x0000000000000005 (STRTAB) 0x20fc8 2025-05-07T20:03:43.7106517Z 0x0000000000000006 (SYMTAB) 0x73a8 2025-05-07T20:03:43.7106951Z 0x000000000000000a (STRSZ) 1247927 (bytes) 2025-05-07T20:03:43.7107319Z 0x000000000000000b (SYMENT) 24 (bytes) 2025-05-07T20:03:43.7107670Z 0x0000000000000003 (PLTGOT) 0xa3ebb0 2025-05-07T20:03:43.7108026Z 0x0000000000000002 (PLTRELSZ) 42648 (bytes) 2025-05-07T20:03:43.7108384Z 0x0000000000000014 (PLTREL) RELA 2025-05-07T20:03:43.7108760Z 0x0000000000000017 (JMPREL) 0x17dc38 2025-05-07T20:03:43.7109087Z 0x0000000000000007 (RELA) 0x153de8 2025-05-07T20:03:43.7109444Z 0x0000000000000008 (RELASZ) 171600 (bytes) 2025-05-07T20:03:43.7109812Z 0x0000000000000009 (RELAENT) 24 (bytes) 2025-05-07T20:03:43.7110147Z 0x0000000000000018 (BIND_NOW) 2025-05-07T20:03:43.7110471Z 0x000000006ffffffb (FLAGS_1) Flags: NOW 2025-05-07T20:03:43.7110830Z 0x000000006ffffffe (VERNEED) 0x153cd8 2025-05-07T20:03:43.7111165Z 0x000000006fffffff (VERNEEDNUM) 4 2025-05-07T20:03:43.7111500Z 0x000000006ffffff0 (VERSYM) 0x151a80 2025-05-07T20:03:43.7111840Z 0x000000006ffffff9 (RELACOUNT) 34 2025-05-07T20:03:43.7112145Z 0x0000000000000000 (NULL) 0x0 2025-05-07T20:03:43.7112354Z 2025-05-07T20:03:43.7112480Z ################################################################################ 2025-05-07T20:03:43.7112706Z 2025-05-07T20:03:43.7112710Z 2025-05-07T20:03:43.7112958Z 
################################################################################ 2025-05-07T20:03:43.7113474Z [CHECK] BUILT LIBRARY: ./_skbuild/linux-x86_64-3.13/cmake-build/fbgemm_gpu_tbe_inference.so 2025-05-07T20:03:43.7113987Z [CHECK] Listing out library size: 2025-05-07T20:03:43.7114447Z + du -h --block-size=1M ./_skbuild/linux-x86_64-3.13/cmake-build/fbgemm_gpu_tbe_inference.so 2025-05-07T20:03:43.7114826Z 2025-05-07T20:03:43.7115065Z 211 ./_skbuild/linux-x86_64-3.13/cmake-build/fbgemm_gpu_tbe_inference.so 2025-05-07T20:03:43.7115383Z 2025-05-07T20:03:43.7115785Z [CHECK] Listing out the GLIBC versions referenced by: ./_skbuild/linux-x86_64-3.13/cmake-build/fbgemm_gpu_tbe_inference.so 2025-05-07T20:03:43.7116799Z + objdump -TC ./_skbuild/linux-x86_64-3.13/cmake-build/fbgemm_gpu_tbe_inference.so | grep GLIBC_ | sed 's/.*GLIBC_\([.0-9]*\).*/GLIBC_\1/g' | sort -Vu | cat 2025-05-07T20:03:43.7117396Z 2025-05-07T20:03:43.7491190Z GLIBC_2.2.5 2025-05-07T20:03:43.7491554Z GLIBC_2.14 2025-05-07T20:03:43.7492055Z 2025-05-07T20:03:43.7492068Z 2025-05-07T20:03:43.7492795Z [CHECK] Listing out the GLIBCXX versions referenced by: ./_skbuild/linux-x86_64-3.13/cmake-build/fbgemm_gpu_tbe_inference.so 2025-05-07T20:03:43.7494162Z + objdump -TC ./_skbuild/linux-x86_64-3.13/cmake-build/fbgemm_gpu_tbe_inference.so | grep GLIBCXX_ | sed 's/.*GLIBCXX_\([.0-9]*\).*/GLIBCXX_\1/g' | sort -Vu | cat 2025-05-07T20:03:43.7494817Z 2025-05-07T20:03:43.7880477Z GLIBCXX_3.4 2025-05-07T20:03:43.7881149Z GLIBCXX_3.4.9 2025-05-07T20:03:43.7881767Z GLIBCXX_3.4.11 2025-05-07T20:03:43.7882358Z GLIBCXX_3.4.14 2025-05-07T20:03:43.7882936Z GLIBCXX_3.4.18 2025-05-07T20:03:43.7883505Z GLIBCXX_3.4.20 2025-05-07T20:03:43.7884054Z GLIBCXX_3.4.21 2025-05-07T20:03:43.7884417Z 2025-05-07T20:03:43.7884431Z 2025-05-07T20:03:43.7903757Z + nm -gDC ./_skbuild/linux-x86_64-3.13/cmake-build/fbgemm_gpu_tbe_inference.so > /tmp/tmp.cZNyyVnlUQ.symbols.txt 2025-05-07T20:03:43.7905243Z 2025-05-07T20:03:43.8265178Z 
2025-05-07T20:03:43.8293054Z [CHECK] Total Number of symbols: 5040 2025-05-07T20:03:43.8332357Z [CHECK] Number of fbgemm symbols: 3788 2025-05-07T20:03:43.8353730Z + nm -gDCu ./_skbuild/linux-x86_64-3.13/cmake-build/fbgemm_gpu_tbe_inference.so > /tmp/tmp.mtbEoBVLHV.usymbols.txt 2025-05-07T20:03:43.8354273Z 2025-05-07T20:03:43.8394347Z 2025-05-07T20:03:43.8428863Z [CHECK] Listing out undefined symbols (253 total): 2025-05-07T20:03:43.8447143Z U VTT for std::__cxx11::basic_ostringstream, std::allocator >@GLIBCXX_3.4.21 2025-05-07T20:03:43.8448257Z U VTT for std::__cxx11::basic_stringstream, std::allocator >@GLIBCXX_3.4.21 2025-05-07T20:03:43.8448913Z U _Unwind_Resume@GCC_3.0 2025-05-07T20:03:43.8449268Z U __cudaPopCallConfiguration@libcudart.so.11.0 2025-05-07T20:03:43.8449713Z U __cudaPushCallConfiguration@libcudart.so.11.0 2025-05-07T20:03:43.8450195Z U __cudaRegisterFatBinary@libcudart.so.11.0 2025-05-07T20:03:43.8450577Z U __cudaRegisterFatBinaryEnd@libcudart.so.11.0 2025-05-07T20:03:43.8450974Z U __cudaRegisterFunction@libcudart.so.11.0 2025-05-07T20:03:43.8451341Z U __cudaRegisterVar@libcudart.so.11.0 2025-05-07T20:03:43.8451716Z U __cudaUnregisterFatBinary@libcudart.so.11.0 2025-05-07T20:03:43.8452074Z U __cxa_allocate_exception@CXXABI_1.3 2025-05-07T20:03:43.8452404Z U __cxa_atexit@GLIBC_2.2.5 2025-05-07T20:03:43.8452725Z U __cxa_begin_catch@CXXABI_1.3 2025-05-07T20:03:43.8453037Z U __cxa_end_catch@CXXABI_1.3 2025-05-07T20:03:43.8453366Z U __cxa_free_exception@CXXABI_1.3 2025-05-07T20:03:43.8453686Z U __cxa_guard_abort@CXXABI_1.3 2025-05-07T20:03:43.8454021Z U __cxa_guard_acquire@CXXABI_1.3 2025-05-07T20:03:43.8454337Z U __cxa_guard_release@CXXABI_1.3 2025-05-07T20:03:43.8454660Z U __cxa_rethrow@CXXABI_1.3 2025-05-07T20:03:43.8454964Z U __cxa_throw@CXXABI_1.3 2025-05-07T20:03:43.8455281Z U __gxx_personality_v0@CXXABI_1.3 2025-05-07T20:03:43.8455721Z U at::TensorBase::__dispatch_contiguous(c10::MemoryFormat) const 2025-05-07T20:03:43.8456588Z U 
at::_ops::arange_start::call(c10::Scalar const&, c10::Scalar const&, std::optional, std::optional, std::optional, std::optional) 2025-05-07T20:03:43.8457857Z U at::_ops::empty_like::call(at::Tensor const&, std::optional, std::optional, std::optional, std::optional, std::optional) 2025-05-07T20:03:43.8459242Z U at::_ops::empty_memory_format::call(c10::ArrayRef, std::optional, std::optional, std::optional, std::optional, std::optional) 2025-05-07T20:03:43.8460179Z U at::_ops::fill__Scalar::call(at::Tensor&, c10::Scalar const&) 2025-05-07T20:03:43.8460968Z U at::_ops::scalar_tensor::call(c10::Scalar const&, std::optional, std::optional, std::optional, std::optional) 2025-05-07T20:03:43.8461868Z U at::_ops::select_int::call(at::Tensor const&, long, c10::SymInt) 2025-05-07T20:03:43.8462578Z U at::_ops::to_dtype::call(at::Tensor const&, c10::ScalarType, bool, bool, std::optional) 2025-05-07T20:03:43.8463705Z U at::_ops::to_dtype_layout::call(at::Tensor const&, std::optional, std::optional, std::optional, std::optional, bool, bool, std::optional) 2025-05-07T20:03:43.8464867Z U at::_ops::zeros::call(c10::ArrayRef, std::optional, std::optional, std::optional, std::optional) 2025-05-07T20:03:43.8465523Z U at::detail::getCUDAHooks() 2025-05-07T20:03:43.8465845Z U at::detail::getHIPHooks() 2025-05-07T20:03:43.8466133Z U at::get_thread_num() 2025-05-07T20:03:43.8466424Z U at::globalContext() 2025-05-07T20:03:43.8466723Z U at::internal::set_thread_num(int) 2025-05-07T20:03:43.8467082Z U c10::AutogradMetaInterface::~AutogradMetaInterface() 2025-05-07T20:03:43.8467528Z U c10::BFloat16* at::TensorBase::data_ptr() const 2025-05-07T20:03:43.8468019Z U c10::BFloat16* at::TensorBase::mutable_data_ptr() const 2025-05-07T20:03:43.8468466Z U c10::ClassType::addMethod(torch::jit::Function*) 2025-05-07T20:03:43.8469067Z U c10::ClassType::getMethod(std::__cxx11::basic_string, std::allocator > const&) const 2025-05-07T20:03:43.8469695Z U c10::DeviceTypeName[abi:cxx11](c10::DeviceType, bool) 
2025-05-07T20:03:43.8470593Z U c10::DictType::get(std::__cxx11::basic_string, std::allocator > const&, c10::Type::SingletonOrSharedTypePtr, c10::Type::SingletonOrSharedTypePtr) 2025-05-07T20:03:43.8471686Z U c10::Error::Error(c10::SourceLocation, std::__cxx11::basic_string, std::allocator >) 2025-05-07T20:03:43.8472238Z U c10::Error::what() const 2025-05-07T20:03:43.8472541Z U c10::GradMode::is_enabled() 2025-05-07T20:03:43.8472965Z U c10::GradMode::set_enabled(bool) 2025-05-07T20:03:43.8473528Z U c10::Half* at::TensorBase::data_ptr() const 2025-05-07T20:03:43.8474032Z U c10::Half* at::TensorBase::mutable_data_ptr() const 2025-05-07T20:03:43.8474501Z U c10::IValue::TagType::get(c10::IValue const&) 2025-05-07T20:03:43.8474914Z U c10::IValue::is(c10::IValue const&) const 2025-05-07T20:03:43.8475264Z U c10::IValue::isTensorList() const 2025-05-07T20:03:43.8475651Z U c10::IValue::reportToTensorTypeError() const 2025-05-07T20:03:43.8476005Z U c10::IntType::get() 2025-05-07T20:03:43.8476709Z U c10::ListType::get(std::__cxx11::basic_string, std::allocator > const&, c10::Type::SingletonOrSharedTypePtr) 2025-05-07T20:03:43.8477487Z U c10::MessageLogger::MessageLogger(char const*, int, int) 2025-05-07T20:03:43.8477898Z U c10::MessageLogger::~MessageLogger() 2025-05-07T20:03:43.8478240Z U c10::NoneType::get() 2025-05-07T20:03:43.8478664Z U c10::OptionalType::get(c10::Type::SingletonOrSharedTypePtr) 2025-05-07T20:03:43.8479153Z U c10::ParallelGuard::ParallelGuard(bool) 2025-05-07T20:03:43.8479530Z U c10::ParallelGuard::~ParallelGuard() 2025-05-07T20:03:43.8479919Z U c10::StorageImpl::throw_data_ptr_access_error() const 2025-05-07T20:03:43.8480311Z U c10::StringType::get() 2025-05-07T20:03:43.8482430Z U c10::SymBool::guard_bool(char const*, long) const 2025-05-07T20:03:43.8482853Z U c10::SymFloat::guard_float(char const*, long) const 2025-05-07T20:03:43.8483548Z U c10::SymInt::SymInt(c10::intrusive_ptr >) 2025-05-07T20:03:43.8484219Z U c10::SymInt::guard_int(char const*, long) 
const 2025-05-07T20:03:43.8484604Z U c10::SymInt::promote_to_negative() 2025-05-07T20:03:43.8484986Z U c10::SymbolicShapeMeta::init_is_contiguous() const 2025-05-07T20:03:43.8485821Z U c10::TensorImpl::set_autograd_meta(std::unique_ptr >) 2025-05-07T20:03:43.8486457Z U c10::TensorType::get() 2025-05-07T20:03:43.8487390Z U c10::TupleType::TupleType(std::vector, std::allocator > >, std::optional, std::shared_ptr) 2025-05-07T20:03:43.8488356Z U c10::UndefinedTensorImpl::_singleton 2025-05-07T20:03:43.8489297Z U c10::Warning::Warning(std::variant, c10::SourceLocation const&, std::__cxx11::basic_string, std::allocator >, bool) 2025-05-07T20:03:43.8490272Z U c10::_fastEqualsForContainer(c10::IValue const&, c10::IValue const&) 2025-05-07T20:03:43.8490723Z U c10::cuda::CUDACachingAllocator::allocator 2025-05-07T20:03:43.8491095Z U c10::cuda::CUDAStream::stream() const 2025-05-07T20:03:43.8491432Z U c10::cuda::ExchangeDevice(signed char) 2025-05-07T20:03:43.8491788Z U c10::cuda::GetDevice(signed char*) 2025-05-07T20:03:43.8492112Z U c10::cuda::MaybeSetDevice(signed char) 2025-05-07T20:03:43.8492446Z U c10::cuda::SetDevice(signed char) 2025-05-07T20:03:43.8492891Z U c10::cuda::c10_cuda_check_implementation(int, char const*, char const*, int, bool) 2025-05-07T20:03:43.8493347Z U c10::cuda::device_count() 2025-05-07T20:03:43.8493675Z U c10::cuda::getCurrentCUDAStream(signed char) 2025-05-07T20:03:43.8494056Z U c10::cuda::getDefaultCUDAStream(signed char) 2025-05-07T20:03:43.8494440Z U c10::cuda::getStreamFromPool(bool, signed char) 2025-05-07T20:03:43.8494811Z U c10::cuda::getStreamFromPool(int, signed char) 2025-05-07T20:03:43.8495211Z U c10::cuda::setCurrentCUDAStream(c10::cuda::CUDAStream) 2025-05-07T20:03:43.8495573Z U c10::cuda::warn_or_error_on_sync() 2025-05-07T20:03:43.8496194Z U c10::detail::ListImpl::ListImpl(std::vector >, c10::Type::SingletonOrSharedTypePtr) 2025-05-07T20:03:43.8497386Z U c10::detail::infer_schema::make_function_schema(c10::ArrayRef, c10::ArrayRef) 
2025-05-07T20:03:43.8499039Z U c10::detail::infer_schema::make_function_schema(std::__cxx11::basic_string, std::allocator >&&, std::__cxx11::basic_string, std::allocator >&&, c10::ArrayRef, c10::ArrayRef) 2025-05-07T20:03:43.8500639Z U c10::detail::torchCheckFail(char const*, char const*, unsigned int, char const*) 2025-05-07T20:03:43.8501544Z U c10::detail::torchCheckFail(char const*, char const*, unsigned int, std::__cxx11::basic_string, std::allocator > const&) 2025-05-07T20:03:43.8502734Z U c10::detail::torchInternalAssertFail(char const*, char const*, unsigned int, char const*, char const*) 2025-05-07T20:03:43.8503948Z U c10::detail::torchInternalAssertFail(char const*, char const*, unsigned int, char const*, std::__cxx11::basic_string, std::allocator > const&) 2025-05-07T20:03:43.8504839Z U c10::getCustomClassTypeImpl(std::type_index const&) 2025-05-07T20:03:43.8505221Z U c10::get_default_dtype() 2025-05-07T20:03:43.8505728Z U c10::impl::ExcludeDispatchKeyGuard::ExcludeDispatchKeyGuard(c10::DispatchKeySet) 2025-05-07T20:03:43.8506347Z U c10::impl::ExcludeDispatchKeyGuard::~ExcludeDispatchKeyGuard() 2025-05-07T20:03:43.8506792Z U c10::impl::GPUTrace::gpuTraceState 2025-05-07T20:03:43.8507154Z U c10::impl::GPUTrace::haveState 2025-05-07T20:03:43.8507493Z U c10::impl::device_guard_impl_registry 2025-05-07T20:03:43.8508131Z U c10::ivalue::ConstantString::create(std::__cxx11::basic_string, std::allocator >) 2025-05-07T20:03:43.8508812Z U c10::ivalue::Future::extractStorages(c10::IValue const&) 2025-05-07T20:03:43.8509232Z U c10::ivalue::Object::resizeObject(unsigned long) 2025-05-07T20:03:43.8509775Z U c10::ivalue::checkCustomClassType(c10::ClassType const*, c10::Type const*) 2025-05-07T20:03:43.8510283Z U c10::operator<<(std::ostream&, c10::Device const&) 2025-05-07T20:03:43.8510686Z U c10::operator<<(std::ostream&, c10::DeviceType) 2025-05-07T20:03:43.8511118Z U c10::operator<<(std::ostream&, c10::FunctionSchema const&) 2025-05-07T20:03:43.8511544Z U 
c10::warn(c10::Warning const&) 2025-05-07T20:03:43.8512032Z U caffe2::TypeMeta::error_unsupported_typemeta(caffe2::TypeMeta) 2025-05-07T20:03:43.8512476Z U caffe2::TypeMeta::typeMetaDatas() 2025-05-07T20:03:43.8512952Z U cudaDeviceSynchronize@libcudart.so.11.0 2025-05-07T20:03:43.8513348Z U cudaEventCreateWithFlags@libcudart.so.11.0 2025-05-07T20:03:43.8513725Z U cudaEventDestroy@libcudart.so.11.0 2025-05-07T20:03:43.8514083Z U cudaEventElapsedTime@libcudart.so.11.0 2025-05-07T20:03:43.8514453Z U cudaEventQuery@libcudart.so.11.0 2025-05-07T20:03:43.8514792Z U cudaEventRecord@libcudart.so.11.0 2025-05-07T20:03:43.8515159Z U cudaEventSynchronize@libcudart.so.11.0 2025-05-07T20:03:43.8515521Z U cudaGetErrorString@libcudart.so.11.0 2025-05-07T20:03:43.8515878Z U cudaGetLastError@libcudart.so.11.0 2025-05-07T20:03:43.8516216Z U cudaLaunchKernel@libcudart.so.11.0 2025-05-07T20:03:43.8516567Z U cudaStreamQuery@libcudart.so.11.0 2025-05-07T20:03:43.8516928Z U cudaStreamSynchronize@libcudart.so.11.0 2025-05-07T20:03:43.8517290Z U cudaStreamWaitEvent@libcudart.so.11.0 2025-05-07T20:03:43.8518310Z U fbgemm::EmbeddingSpMDMKernelSignature::Type fbgemm::GenerateEmbeddingSpMDMWithStrides(long, bool, bool, int, bool, bool, long, long, bool, bool, bool, bool) 2025-05-07T20:03:43.8520035Z U fbgemm::EmbeddingSpMDMKernelSignature::Type fbgemm::GenerateEmbeddingSpMDMWithStrides(long, bool, bool, int, bool, bool, long, long, bool, bool, bool, bool) 2025-05-07T20:03:43.8521824Z U fbgemm::EmbeddingSpMDMKernelSignature::Type fbgemm::GenerateEmbeddingSpMDMWithStrides(long, bool, bool, int, bool, bool, long, long, bool, bool, bool, bool) 2025-05-07T20:03:43.8523577Z U fbgemm::EmbeddingSpMDMKernelSignature::Type fbgemm::GenerateEmbeddingSpMDMWithStrides(long, bool, bool, int, bool, bool, long, long, bool, bool, bool, bool) 2025-05-07T20:03:43.8525490Z U fbgemm::EmbeddingSpMDMKernelSignature::Type fbgemm::GenerateEmbeddingSpMDMWithStrides(long, bool, bool, int, bool, bool, long, long, bool, 
bool, bool, bool) 2025-05-07T20:03:43.8527189Z U fbgemm::EmbeddingSpMDMKernelSignature::Type fbgemm::GenerateEmbeddingSpMDMWithStrides(long, bool, bool, int, bool, bool, long, long, bool, bool, bool, bool) 2025-05-07T20:03:43.8528760Z U fbgemm::EmbeddingSpMDMKernelSignature::Type fbgemm::GenerateEmbeddingSpMDMFP8WithStrides(long, bool, bool, bool, long, long, int, int, bool) 2025-05-07T20:03:43.8530291Z U fbgemm::EmbeddingSpMDMKernelSignature::Type fbgemm::GenerateEmbeddingSpMDMNBitWithStrides(int, long, bool, bool, int, bool, bool, long, long, bool, bool, bool, int) 2025-05-07T20:03:43.8531952Z U fbgemm::EmbeddingSpMDMKernelSignature::Type fbgemm::GenerateEmbeddingSpMDMWithStrides(long, bool, bool, int, bool, bool, long, long, bool, bool, bool, bool) 2025-05-07T20:03:43.8533795Z U fbgemm::EmbeddingSpMDMKernelSignature::Type fbgemm::GenerateEmbeddingSpMDMNBitWithStrides(int, long, bool, bool, int, bool, bool, long, long, bool, bool, bool, int) 2025-05-07T20:03:43.8535545Z U fbgemm::EmbeddingSpMDMKernelSignature::Type fbgemm::GenerateEmbeddingSpMDMWithStrides(long, bool, bool, int, bool, bool, long, long, bool, bool, bool, bool) 2025-05-07T20:03:43.8537216Z U fbgemm::EmbeddingSpMDMKernelSignature::Type fbgemm::GenerateEmbeddingSpMDMFP8WithStrides(long, bool, bool, bool, long, long, int, int, bool) 2025-05-07T20:03:43.8538878Z U fbgemm::EmbeddingSpMDMKernelSignature::Type fbgemm::GenerateEmbeddingSpMDMNBitWithStrides(int, long, bool, bool, int, bool, bool, long, long, bool, bool, bool, int) 2025-05-07T20:03:43.8540862Z U fbgemm::EmbeddingSpMDMKernelSignature::Type fbgemm::GenerateEmbeddingSpMDMWithStrides(long, bool, bool, int, bool, bool, long, long, bool, bool, bool, bool) 2025-05-07T20:03:43.8542757Z U fbgemm::EmbeddingSpMDMKernelSignature::Type fbgemm::GenerateEmbeddingSpMDMFP8WithStrides(long, bool, bool, bool, long, long, int, int, bool) 2025-05-07T20:03:43.8544448Z U fbgemm::EmbeddingSpMDMKernelSignature::Type fbgemm::GenerateEmbeddingSpMDMNBitWithStrides(int, 
long, bool, bool, int, bool, bool, long, long, bool, bool, bool, int) 2025-05-07T20:03:43.8546260Z U fbgemm::EmbeddingSpMDMKernelSignature::Type fbgemm::GenerateEmbeddingSpMDMWithStrides(long, bool, bool, int, bool, bool, long, long, bool, bool, bool, bool) 2025-05-07T20:03:43.8548113Z U fbgemm::EmbeddingSpMDMKernelSignature::Type fbgemm::GenerateEmbeddingSpMDMNBitWithStrides(int, long, bool, bool, int, bool, bool, long, long, bool, bool, bool, int) 2025-05-07T20:03:43.8550038Z U fbgemm::EmbeddingSpMDMKernelSignature::Type fbgemm::GenerateEmbeddingSpMDMWithStrides(long, bool, bool, int, bool, bool, long, long, bool, bool, bool, bool) 2025-05-07T20:03:43.8551946Z U fbgemm::EmbeddingSpMDMKernelSignature::Type fbgemm::GenerateEmbeddingSpMDMFP8WithStrides(long, bool, bool, bool, long, long, int, int, bool) 2025-05-07T20:03:43.8553906Z U fbgemm::EmbeddingSpMDMKernelSignature::Type fbgemm::GenerateEmbeddingSpMDMNBitWithStrides(int, long, bool, bool, int, bool, bool, long, long, bool, bool, bool, int) 2025-05-07T20:03:43.8555856Z U fbgemm::EmbeddingSpMDMKernelSignature::Type fbgemm::GenerateEmbeddingSpMDMWithStrides(long, bool, bool, int, bool, bool, long, long, bool, bool, bool, bool) 2025-05-07T20:03:43.8557755Z U fbgemm::EmbeddingSpMDMKernelSignature::Type fbgemm::GenerateEmbeddingSpMDMWithStrides(long, bool, bool, int, bool, bool, long, long, bool, bool, bool, bool) 2025-05-07T20:03:43.8559647Z U fbgemm::EmbeddingSpMDMKernelSignature::Type fbgemm::GenerateEmbeddingSpMDMWithStrides(long, bool, bool, int, bool, bool, long, long, bool, bool, bool, bool) 2025-05-07T20:03:43.8561621Z U fbgemm::EmbeddingSpMDMKernelSignature::Type fbgemm::GenerateEmbeddingSpMDMWithStrides(long, bool, bool, int, bool, bool, long, long, bool, bool, bool, bool) 2025-05-07T20:03:43.8563521Z U fbgemm::EmbeddingSpMDMKernelSignature::Type fbgemm::GenerateEmbeddingSpMDMWithStrides(long, bool, bool, int, bool, bool, long, long, bool, bool, bool, bool) 2025-05-07T20:03:43.8565438Z U 
fbgemm::EmbeddingSpMDMKernelSignature::Type fbgemm::GenerateEmbeddingSpMDMWithStrides(long, bool, bool, int, bool, bool, long, long, bool, bool, bool, bool) 2025-05-07T20:03:43.8567403Z U fbgemm::EmbeddingSpMDMKernelSignature::Type fbgemm::GenerateEmbeddingSpMDMWithStrides(long, bool, bool, int, bool, bool, long, long, bool, bool, bool, bool) 2025-05-07T20:03:43.8568606Z U float* at::TensorBase::data_ptr() const 2025-05-07T20:03:43.8569026Z U float* at::TensorBase::mutable_data_ptr() const 2025-05-07T20:03:43.8569651Z U int* at::TensorBase::data_ptr() const 2025-05-07T20:03:43.8570011Z U int* at::TensorBase::mutable_data_ptr() const 2025-05-07T20:03:43.8570646Z U linearize_cache_indices_cuda(at::Tensor const&, at::Tensor const&, at::Tensor const&, std::optional const&, long, long) 2025-05-07T20:03:43.8571299Z U long c10::detail::maybe_wrap_dim_slow(long, long, bool) 2025-05-07T20:03:43.8571705Z U long* at::TensorBase::data_ptr() const 2025-05-07T20:03:43.8572073Z U long* at::TensorBase::mutable_data_ptr() const 2025-05-07T20:03:43.8572877Z U lru_cache_populate_byte_cuda(at::Tensor, at::Tensor, long, at::Tensor, at::Tensor, at::Tensor, at::Tensor, at::Tensor, at::Tensor, at::Tensor, long, at::Tensor, long, bool, std::optional) 2025-05-07T20:03:43.8573945Z U lxu_cache_lookup_cuda(at::Tensor, at::Tensor, long, bool, std::optional, std::optional, std::optional) 2025-05-07T20:03:43.8574583Z U memcpy@GLIBC_2.14 2025-05-07T20:03:43.8574861Z U memmove@GLIBC_2.2.5 2025-05-07T20:03:43.8575127Z U memset@GLIBC_2.2.5 2025-05-07T20:03:43.8575411Z U operator delete(void*)@GLIBCXX_3.4 2025-05-07T20:03:43.8575760Z U operator new(unsigned long)@GLIBCXX_3.4 2025-05-07T20:03:43.8576183Z U std::_Hash_bytes(void const*, unsigned long, unsigned long)@CXXABI_1.3.5 2025-05-07T20:03:43.8576830Z U std::__cxx11::basic_ostringstream, std::allocator >::basic_ostringstream() 2025-05-07T20:03:43.8577626Z U std::__cxx11::basic_ostringstream, std::allocator >::~basic_ostringstream()@GLIBCXX_3.4.21 
2025-05-07T20:03:43.8578485Z U std::__cxx11::basic_string, std::allocator >::_M_append(char const*, unsigned long)@GLIBCXX_3.4.21 2025-05-07T20:03:43.8579488Z U std::__cxx11::basic_string, std::allocator >::_M_assign(std::__cxx11::basic_string, std::allocator > const&) 2025-05-07T20:03:43.8580466Z U std::__cxx11::basic_string, std::allocator >::_M_construct(unsigned long, char)@GLIBCXX_3.4.21 2025-05-07T20:03:43.8581368Z U std::__cxx11::basic_string, std::allocator >::_M_create(unsigned long&, unsigned long)@GLIBCXX_3.4.21 2025-05-07T20:03:43.8582312Z U std::__cxx11::basic_string, std::allocator >::_M_mutate(unsigned long, unsigned long, char const*, unsigned long) 2025-05-07T20:03:43.8583379Z U std::__cxx11::basic_string, std::allocator >::_M_replace(unsigned long, unsigned long, char const*, unsigned long)@GLIBCXX_3.4.21 2025-05-07T20:03:43.8584343Z U std::__cxx11::basic_string, std::allocator >::find(char, unsigned long) const@GLIBCXX_3.4.21 2025-05-07T20:03:43.8585181Z U std::__cxx11::basic_string, std::allocator >::reserve(unsigned long)@GLIBCXX_3.4.21 2025-05-07T20:03:43.8585973Z U std::__cxx11::basic_stringstream, std::allocator >::basic_stringstream() 2025-05-07T20:03:43.8587086Z U std::__cxx11::basic_stringstream, std::allocator >::basic_stringstream(std::__cxx11::basic_string, std::allocator > const&, std::_Ios_Openmode)@GLIBCXX_3.4.21 2025-05-07T20:03:43.8588273Z U std::__cxx11::basic_stringstream, std::allocator >::~basic_stringstream()@GLIBCXX_3.4.21 2025-05-07T20:03:43.8589132Z U std::__detail::_Prime_rehash_policy::_M_need_rehash(unsigned long, unsigned long, unsigned long) const@GLIBCXX_3.4.18 2025-05-07T20:03:43.8589734Z U std::__exception_ptr::exception_ptr::_M_addref() 2025-05-07T20:03:43.8590130Z U std::__exception_ptr::exception_ptr::_M_release() 2025-05-07T20:03:43.8590474Z U std::__throw_bad_alloc()@GLIBCXX_3.4 2025-05-07T20:03:43.8590815Z U std::__throw_bad_array_new_length() 2025-05-07T20:03:43.8591193Z U 
std::__throw_bad_function_call()@GLIBCXX_3.4.14 2025-05-07T20:03:43.8591561Z U std::__throw_length_error(char const*)@GLIBCXX_3.4 2025-05-07T20:03:43.8591971Z U std::__throw_logic_error(char const*)@GLIBCXX_3.4 2025-05-07T20:03:43.8592367Z U std::__throw_out_of_range_fmt(char const*, ...)@GLIBCXX_3.4.20 2025-05-07T20:03:43.8592530Z U std::__throw_system_error(int)@GLIBCXX_3.4.11 2025-05-07T20:03:43.8592828Z U std::basic_ios >::clear(std::_Ios_Iostate)@GLIBCXX_3.4 2025-05-07T20:03:43.8593670Z U std::basic_ostream >& std::__ostream_insert >(std::basic_ostream >&, char const*, long)@GLIBCXX_3.4.9 2025-05-07T20:03:43.8593927Z U std::condition_variable::condition_variable()@GLIBCXX_3.4.11 2025-05-07T20:03:43.8594097Z U std::condition_variable::notify_all()@GLIBCXX_3.4.11 2025-05-07T20:03:43.8594294Z U std::condition_variable::~condition_variable()@GLIBCXX_3.4.11 2025-05-07T20:03:43.8594460Z U std::current_exception()@CXXABI_1.3.3 2025-05-07T20:03:43.8594603Z U std::exception::~exception()@GLIBCXX_3.4 2025-05-07T20:03:43.8594739Z U std::ios_base::Init::Init()@GLIBCXX_3.4 2025-05-07T20:03:43.8594895Z U std::ios_base::Init::~Init()@GLIBCXX_3.4 2025-05-07T20:03:43.8595027Z U std::ios_base::~ios_base()@GLIBCXX_3.4 2025-05-07T20:03:43.8595233Z U std::istream& std::istream::_M_extract(long&)@GLIBCXX_3.4.9 2025-05-07T20:03:43.8595395Z U std::locale::~locale()@GLIBCXX_3.4 2025-05-07T20:03:43.8595820Z U std::logic_error::logic_error(std::__cxx11::basic_string, std::allocator > const&)@GLIBCXX_3.4.21 2025-05-07T20:03:43.8595990Z U std::logic_error::~logic_error()@GLIBCXX_3.4 2025-05-07T20:03:43.8596212Z U std::ostream& std::ostream::_M_insert(long)@GLIBCXX_3.4.9 2025-05-07T20:03:43.8596459Z U std::ostream& std::ostream::_M_insert(unsigned long)@GLIBCXX_3.4.9 2025-05-07T20:03:43.8596591Z U std::ostream::operator<<(int)@GLIBCXX_3.4 2025-05-07T20:03:43.8596818Z U std::out_of_range::out_of_range(char const*)@GLIBCXX_3.4.21 2025-05-07T20:03:43.8596965Z U 
std::out_of_range::~out_of_range()@GLIBCXX_3.4 2025-05-07T20:03:43.8597207Z U std::rethrow_exception(std::__exception_ptr::exception_ptr)@CXXABI_1.3.3 2025-05-07T20:03:43.8597423Z U std::runtime_error::runtime_error(char const*)@GLIBCXX_3.4.21 2025-05-07T20:03:43.8597570Z U std::runtime_error::~runtime_error()@GLIBCXX_3.4 2025-05-07T20:03:43.8597690Z U std::terminate()@GLIBCXX_3.4 2025-05-07T20:03:43.8597799Z U strcmp@GLIBC_2.2.5 2025-05-07T20:03:43.8611095Z U strlen@GLIBC_2.2.5 2025-05-07T20:03:43.8611326Z U torch::CppFunction::~CppFunction() 2025-05-07T20:03:43.8611975Z U torch::Library::Library(torch::Library::Kind, std::__cxx11::basic_string, std::allocator >, std::optional, char const*, unsigned int) 2025-05-07T20:03:43.8612463Z U torch::Library::_def(c10::FunctionSchema&&, c10::OperatorName*, std::vector > const&, torch::_RegisterOrVerify) & 2025-05-07T20:03:43.8612763Z U torch::Library::_impl(char const*, torch::CppFunction&&, torch::_RegisterOrVerify) & 2025-05-07T20:03:43.8613855Z U torch::detail::class_base::class_base(std::__cxx11::basic_string, std::allocator > const&, std::__cxx11::basic_string, std::allocator > const&, std::__cxx11::basic_string, std::allocator >, std::type_info const&, std::type_info const&) 2025-05-07T20:03:43.8614344Z U torch::detail::class_base::withNewArguments(c10::FunctionSchema const&, std::initializer_list) 2025-05-07T20:03:43.8614732Z U torch::jit::parseSchema(std::__cxx11::basic_string, std::allocator > const&, bool) 2025-05-07T20:03:43.8615131Z U torch::registerCustomClassMethod(std::unique_ptr >) 2025-05-07T20:03:43.8615273Z U torch::serialize::InputArchive::InputArchive() 2025-05-07T20:03:43.8615806Z U torch::serialize::InputArchive::load_from(char const*, unsigned long, std::optional) 2025-05-07T20:03:43.8616264Z U torch::serialize::InputArchive::read(std::__cxx11::basic_string, std::allocator > const&, at::Tensor&, bool) 2025-05-07T20:03:43.8616594Z U torch::serialize::OutputArchive::OutputArchive(std::shared_ptr) 
2025-05-07T20:03:43.8616762Z U torch::serialize::OutputArchive::save_to(std::ostream&) 2025-05-07T20:03:43.8617261Z U torch::serialize::OutputArchive::write(std::__cxx11::basic_string, std::allocator > const&, at::Tensor const&, bool) 2025-05-07T20:03:43.8617392Z U typeinfo for c10::Error 2025-05-07T20:03:43.8617525Z U typeinfo for std::exception@GLIBCXX_3.4 2025-05-07T20:03:43.8617668Z U typeinfo for std::logic_error@GLIBCXX_3.4 2025-05-07T20:03:43.8617818Z U typeinfo for std::out_of_range@GLIBCXX_3.4 2025-05-07T20:03:43.8617956Z U typeinfo for std::runtime_error@GLIBCXX_3.4 2025-05-07T20:03:43.8618152Z U unsigned char* at::TensorBase::data_ptr() const 2025-05-07T20:03:43.8618431Z U unsigned char* at::TensorBase::mutable_data_ptr() const 2025-05-07T20:03:43.8618583Z U vtable for __cxxabiv1::__class_type_info@CXXABI_1.3 2025-05-07T20:03:43.8618748Z U vtable for __cxxabiv1::__function_type_info@CXXABI_1.3 2025-05-07T20:03:43.8618903Z U vtable for __cxxabiv1::__si_class_type_info@CXXABI_1.3 2025-05-07T20:03:43.8619054Z U vtable for c10::Error 2025-05-07T20:03:43.8619391Z U vtable for std::__cxx11::basic_stringbuf, std::allocator >@GLIBCXX_3.4.21 2025-05-07T20:03:43.8619621Z U vtable for std::basic_streambuf >@GLIBCXX_3.4 2025-05-07T20:03:43.8619763Z U vtable for torch::autograd::AutogradMeta 2025-05-07T20:03:43.8619874Z w _ITM_deregisterTMCloneTable 2025-05-07T20:03:43.8619976Z w _ITM_registerTMCloneTable 2025-05-07T20:03:43.8620105Z w __cxa_finalize@GLIBC_2.2.5 2025-05-07T20:03:43.8620193Z w __gmon_start__ 2025-05-07T20:03:43.8620290Z w __pthread_key_create 2025-05-07T20:03:43.8620433Z w pthread_mutex_lock@GLIBC_2.2.5 2025-05-07T20:03:43.8620549Z w pthread_mutex_unlock@GLIBC_2.2.5 2025-05-07T20:03:43.8620695Z [CHECK] Listing out external shared libraries linked: 2025-05-07T20:03:43.8620915Z + ldd ./_skbuild/linux-x86_64-3.13/cmake-build/fbgemm_gpu_tbe_inference.so 2025-05-07T20:03:43.8620944Z 2025-05-07T20:03:43.8621057Z linux-vdso.so.1 (0x00007ffd1d9ee000) 
2025-05-07T20:03:43.8621143Z libc10.so => not found 2025-05-07T20:03:43.8621236Z libc10_cuda.so => not found 2025-05-07T20:03:43.8621619Z fbgemm.so => /__w/FBGEMM/FBGEMM/fbgemm_gpu/./_skbuild/linux-x86_64-3.13/cmake-build/fbgemm.so (0x00007f788b600000) 2025-05-07T20:03:43.8622078Z fbgemm_gpu_tbe_cache.so => /__w/FBGEMM/FBGEMM/fbgemm_gpu/./_skbuild/linux-x86_64-3.13/cmake-build/fbgemm_gpu_tbe_cache.so (0x00007f788a800000) 2025-05-07T20:03:43.8622172Z libtorch.so => not found 2025-05-07T20:03:43.8622283Z libtorch_cpu.so => not found 2025-05-07T20:03:43.8622375Z libtorch_cuda.so => not found 2025-05-07T20:03:43.8622475Z libcudart.so.11.0 => not found 2025-05-07T20:03:43.8622635Z libstdc++.so.6 => /lib64/libstdc++.so.6 (0x00007f788a59c000) 2025-05-07T20:03:43.8622797Z libgcc_s.so.1 => /lib64/libgcc_s.so.1 (0x00007f78994d7000) 2025-05-07T20:03:43.8622912Z libc.so.6 => /lib64/libc.so.6 (0x00007f788a394000) 2025-05-07T20:03:43.8622996Z libc10.so => not found 2025-05-07T20:03:43.8623371Z asmjit.so => /__w/FBGEMM/FBGEMM/fbgemm_gpu/./_skbuild/linux-x86_64-3.13/cmake-build/asmjit.so (0x00007f789945d000) 2025-05-07T20:03:43.8623520Z libtorch.so => not found 2025-05-07T20:03:43.8623619Z libtorch_cpu.so => not found 2025-05-07T20:03:43.8623729Z libtorch_cuda.so => not found 2025-05-07T20:03:43.8623846Z libm.so.6 => /lib64/libm.so.6 (0x00007f7899380000) 2025-05-07T20:03:43.8623973Z /lib64/ld-linux-x86-64.so.2 (0x00007f789950b000) 2025-05-07T20:03:43.8624069Z libtorch.so => not found 2025-05-07T20:03:43.8624161Z libc10.so => not found 2025-05-07T20:03:43.8624254Z libc10_cuda.so => not found 2025-05-07T20:03:43.8624343Z libtorch_cpu.so => not found 2025-05-07T20:03:43.8624461Z libtorch_cuda.so => not found 2025-05-07T20:03:43.8624563Z libcudart.so.11.0 => not found 2025-05-07T20:03:43.8624710Z libgomp.so.1 => /lib64/libgomp.so.1 (0x00007f788bbaa000) 2025-05-07T20:03:43.8624817Z libtorch_cpu.so => not found 2025-05-07T20:03:43.8624927Z libtorch_cuda.so => not found 
2025-05-07T20:03:43.8625017Z libtorch.so => not found 2025-05-07T20:03:43.8625147Z librt.so.1 => /lib64/librt.so.1 (0x00007f7899377000) 2025-05-07T20:03:43.8625442Z libpthread.so.0 => /lib64/libpthread.so.0 (0x00007f788bba5000) 2025-05-07T20:03:43.8625447Z 2025-05-07T20:03:43.8625548Z [CHECK] Displaying ELF information: 2025-05-07T20:03:43.8625774Z + readelf -d ./_skbuild/linux-x86_64-3.13/cmake-build/fbgemm_gpu_tbe_inference.so 2025-05-07T20:03:43.8625779Z 2025-05-07T20:03:43.8625857Z 2025-05-07T20:03:43.8626015Z Dynamic section at offset 0xd2d8688 contains 38 entries: 2025-05-07T20:03:43.8626121Z Tag Type Name/Value 2025-05-07T20:03:43.8626302Z 0x0000000000000001 (NEEDED) Shared library: [libc10.so] 2025-05-07T20:03:43.8626497Z 0x0000000000000001 (NEEDED) Shared library: [libc10_cuda.so] 2025-05-07T20:03:43.8626699Z 0x0000000000000001 (NEEDED) Shared library: [fbgemm.so] 2025-05-07T20:03:43.8626907Z 0x0000000000000001 (NEEDED) Shared library: [fbgemm_gpu_tbe_cache.so] 2025-05-07T20:03:43.8627103Z 0x0000000000000001 (NEEDED) Shared library: [libtorch.so] 2025-05-07T20:03:43.8627291Z 0x0000000000000001 (NEEDED) Shared library: [libtorch_cpu.so] 2025-05-07T20:03:43.8627483Z 0x0000000000000001 (NEEDED) Shared library: [libtorch_cuda.so] 2025-05-07T20:03:43.8627693Z 0x0000000000000001 (NEEDED) Shared library: [libcudart.so.11.0] 2025-05-07T20:03:43.8627881Z 0x0000000000000001 (NEEDED) Shared library: [libstdc++.so.6] 2025-05-07T20:03:43.8628064Z 0x0000000000000001 (NEEDED) Shared library: [libgcc_s.so.1] 2025-05-07T20:03:43.8628241Z 0x0000000000000001 (NEEDED) Shared library: [libc.so.6] 2025-05-07T20:03:43.8628469Z 0x000000000000000e (SONAME) Library soname: [fbgemm_gpu_tbe_inference.so] 2025-05-07T20:03:43.8628649Z 0x000000000000000f (RPATH) Library rpath: [$ORIGIN] 2025-05-07T20:03:43.8628764Z 0x000000000000000c (INIT) 0x19c000 2025-05-07T20:03:43.8628886Z 0x000000000000000d (FINI) 0x73d58c 2025-05-07T20:03:43.8629005Z 0x0000000000000019 (INIT_ARRAY) 0xd2d69c0 
2025-05-07T20:03:43.8629123Z 0x000000000000001b (INIT_ARRAYSZ) 392 (bytes) 2025-05-07T20:03:43.8629259Z 0x000000000000001a (FINI_ARRAY) 0xd2d6b48 2025-05-07T20:03:43.8629380Z 0x000000000000001c (FINI_ARRAYSZ) 8 (bytes) 2025-05-07T20:03:43.8629490Z 0x000000006ffffef5 (GNU_HASH) 0x200 2025-05-07T20:03:43.8629601Z 0x0000000000000005 (STRTAB) 0x25568 2025-05-07T20:03:43.8629718Z 0x0000000000000006 (SYMTAB) 0x7cd0 2025-05-07T20:03:43.8629851Z 0x000000000000000a (STRSZ) 1383267 (bytes) 2025-05-07T20:03:43.8629971Z 0x000000000000000b (SYMENT) 24 (bytes) 2025-05-07T20:03:43.8630096Z 0x0000000000000003 (PLTGOT) 0xd2d8928 2025-05-07T20:03:43.8630220Z 0x0000000000000002 (PLTRELSZ) 20640 (bytes) 2025-05-07T20:03:43.8630327Z 0x0000000000000014 (PLTREL) RELA 2025-05-07T20:03:43.8630518Z 0x0000000000000017 (JMPREL) 0x196378 2025-05-07T20:03:43.8630625Z 0x0000000000000007 (RELA) 0x179950 2025-05-07T20:03:43.8630754Z 0x0000000000000008 (RELASZ) 117288 (bytes) 2025-05-07T20:03:43.8630867Z 0x0000000000000009 (RELAENT) 24 (bytes) 2025-05-07T20:03:43.8630981Z 0x0000000000000018 (BIND_NOW) 2025-05-07T20:03:43.8631104Z 0x000000006ffffffb (FLAGS_1) Flags: NOW 2025-05-07T20:03:43.8631218Z 0x000000006ffffffe (VERNEED) 0x179830 2025-05-07T20:03:43.8631326Z 0x000000006fffffff (VERNEEDNUM) 4 2025-05-07T20:03:43.8631593Z 0x000000006ffffff0 (VERSYM) 0x1770cc 2025-05-07T20:03:43.8631700Z 0x000000006ffffff9 (RELACOUNT) 447 2025-05-07T20:03:43.8631812Z 0x0000000000000000 (NULL) 0x0 2025-05-07T20:03:43.8631816Z 2025-05-07T20:03:43.8631926Z ################################################################################ 2025-05-07T20:03:43.8631930Z 2025-05-07T20:03:43.8631937Z 2025-05-07T20:03:43.8632051Z ################################################################################ 2025-05-07T20:03:43.8632359Z [CHECK] BUILT LIBRARY: ./_skbuild/linux-x86_64-3.13/cmake-build/fbgemm_gpu_tbe_training_forward.so 2025-05-07T20:03:43.8632458Z [CHECK] Listing out library size: 2025-05-07T20:03:43.8632881Z 
+ du -h --block-size=1M ./_skbuild/linux-x86_64-3.13/cmake-build/fbgemm_gpu_tbe_training_forward.so 2025-05-07T20:03:43.8632887Z 2025-05-07T20:03:43.8633296Z 188 ./_skbuild/linux-x86_64-3.13/cmake-build/fbgemm_gpu_tbe_training_forward.so 2025-05-07T20:03:43.8633300Z 2025-05-07T20:03:43.8633727Z [CHECK] Listing out the GLIBC versions referenced by: ./_skbuild/linux-x86_64-3.13/cmake-build/fbgemm_gpu_tbe_training_forward.so 2025-05-07T20:03:43.8634293Z + objdump -TC ./_skbuild/linux-x86_64-3.13/cmake-build/fbgemm_gpu_tbe_training_forward.so | grep GLIBC_ | sed 's/.*GLIBC_\([.0-9]*\).*/GLIBC_\1/g' | sort -Vu | cat 2025-05-07T20:03:43.8634298Z 2025-05-07T20:03:43.9524246Z GLIBC_2.2.5 2025-05-07T20:03:43.9524681Z GLIBC_2.3 2025-05-07T20:03:43.9525375Z GLIBC_2.14 2025-05-07T20:03:43.9525407Z 2025-05-07T20:03:43.9525413Z 2025-05-07T20:03:43.9525925Z [CHECK] Listing out the GLIBCXX versions referenced by: ./_skbuild/linux-x86_64-3.13/cmake-build/fbgemm_gpu_tbe_training_forward.so 2025-05-07T20:03:43.9526542Z + objdump -TC ./_skbuild/linux-x86_64-3.13/cmake-build/fbgemm_gpu_tbe_training_forward.so | grep GLIBCXX_ | sed 's/.*GLIBCXX_\([.0-9]*\).*/GLIBCXX_\1/g' | sort -Vu | cat 2025-05-07T20:03:43.9526548Z 2025-05-07T20:03:44.0476816Z GLIBCXX_3.4 2025-05-07T20:03:44.0477073Z GLIBCXX_3.4.9 2025-05-07T20:03:44.0477330Z GLIBCXX_3.4.20 2025-05-07T20:03:44.0477578Z GLIBCXX_3.4.21 2025-05-07T20:03:44.0477883Z 2025-05-07T20:03:44.0478006Z 2025-05-07T20:03:44.0501163Z + nm -gDC ./_skbuild/linux-x86_64-3.13/cmake-build/fbgemm_gpu_tbe_training_forward.so > /tmp/tmp.CNmMCn7b1T.symbols.txt 2025-05-07T20:03:44.0501193Z 2025-05-07T20:03:44.1427675Z 2025-05-07T20:03:44.1470849Z [CHECK] Total Number of symbols: 12561 2025-05-07T20:03:44.1525177Z [CHECK] Number of fbgemm symbols: 5267 2025-05-07T20:03:44.1542732Z + nm -gDCu ./_skbuild/linux-x86_64-3.13/cmake-build/fbgemm_gpu_tbe_training_forward.so > /tmp/tmp.cyKUZwWUVW.usymbols.txt 2025-05-07T20:03:44.1544228Z 2025-05-07T20:03:44.1591209Z 
2025-05-07T20:03:44.1618592Z [CHECK] Listing out undefined symbols (175 total): 2025-05-07T20:03:44.1633111Z U VTT for std::__cxx11::basic_ostringstream, std::allocator >@GLIBCXX_3.4.21 2025-05-07T20:03:44.1633903Z U _Unwind_Resume@GCC_3.0 2025-05-07T20:03:44.1634340Z U __cudaPopCallConfiguration@libcudart.so.11.0 2025-05-07T20:03:44.1634760Z U __cudaPushCallConfiguration@libcudart.so.11.0 2025-05-07T20:03:44.1635192Z U __cudaRegisterFatBinary@libcudart.so.11.0 2025-05-07T20:03:44.1635584Z U __cudaRegisterFatBinaryEnd@libcudart.so.11.0 2025-05-07T20:03:44.1635989Z U __cudaRegisterFunction@libcudart.so.11.0 2025-05-07T20:03:44.1636621Z U __cudaRegisterVar@libcudart.so.11.0 2025-05-07T20:03:44.1637014Z U __cudaUnregisterFatBinary@libcudart.so.11.0 2025-05-07T20:03:44.1637402Z U __cxa_allocate_exception@CXXABI_1.3 2025-05-07T20:03:44.1637737Z U __cxa_atexit@GLIBC_2.2.5 2025-05-07T20:03:44.1638068Z U __cxa_begin_catch@CXXABI_1.3 2025-05-07T20:03:44.1638389Z U __cxa_end_catch@CXXABI_1.3 2025-05-07T20:03:44.1638726Z U __cxa_free_exception@CXXABI_1.3 2025-05-07T20:03:44.1639053Z U __cxa_guard_abort@CXXABI_1.3 2025-05-07T20:03:44.1639392Z U __cxa_guard_acquire@CXXABI_1.3 2025-05-07T20:03:44.1639834Z U __cxa_guard_release@CXXABI_1.3 2025-05-07T20:03:44.1640275Z U __cxa_rethrow@CXXABI_1.3 2025-05-07T20:03:44.1640581Z U __cxa_throw@CXXABI_1.3 2025-05-07T20:03:44.1640876Z U __gxx_personality_v0@CXXABI_1.3 2025-05-07T20:03:44.1641200Z U __tls_get_addr@GLIBC_2.3 2025-05-07T20:03:44.1641563Z U at::RecordFunction::RecordFunction(at::StepCallbacks&&) 2025-05-07T20:03:44.1641975Z U at::RecordFunction::~RecordFunction() 2025-05-07T20:03:44.1642516Z U at::Tensor fbgemm_gpu::reshape_vbe_offsets(at::Tensor const&, at::Tensor const&, long, int) 2025-05-07T20:03:44.1643204Z U at::Tensor fbgemm_gpu::reshape_vbe_offsets(at::Tensor const&, at::Tensor const&, long, int) 2025-05-07T20:03:44.1643813Z U at::Tensor::index(std::initializer_list) const 2025-05-07T20:03:44.1644429Z U 
at::Tensor::index_put_(std::initializer_list, at::Tensor const&) 2025-05-07T20:03:44.1645441Z U at::_ops::empty_memory_format::call(c10::ArrayRef, std::optional, std::optional, std::optional, std::optional, std::optional) 2025-05-07T20:03:44.1646450Z U at::_ops::flatten_using_ints::call(at::Tensor const&, long, long) 2025-05-07T20:03:44.1646925Z U at::_ops::reshape::call(at::Tensor const&, c10::ArrayRef) 2025-05-07T20:03:44.1647392Z U at::_ops::select_int::call(at::Tensor const&, long, c10::SymInt) 2025-05-07T20:03:44.1647826Z U at::getStepCallbacksUnlessEmpty(at::RecordScope) 2025-05-07T20:03:44.1648256Z U c10::BFloat16* at::TensorBase::data_ptr() const 2025-05-07T20:03:44.1648737Z U c10::BFloat16* at::TensorBase::mutable_data_ptr() const 2025-05-07T20:03:44.1649122Z U c10::BoolType::get() 2025-05-07T20:03:44.1649487Z U c10::DeviceTypeName[abi:cxx11](c10::DeviceType, bool) 2025-05-07T20:03:44.1649938Z U c10::Dispatcher::findSchemaOrThrow(char const*, char const*) 2025-05-07T20:03:44.1650319Z U c10::Dispatcher::realSingleton() 2025-05-07T20:03:44.1651025Z U c10::Dispatcher::runRecordFunction(at::RecordFunction&, std::reference_wrapper, c10::DispatchKey, c10::DispatchKeySet) 2025-05-07T20:03:44.1652225Z U c10::Dispatcher::runRecordFunction(at::RecordFunction&, std::reference_wrapper, c10::DispatchKey, c10::DispatchKeySet, c10::ArrayRef) 2025-05-07T20:03:44.1653278Z U c10::Error::Error(c10::SourceLocation, std::__cxx11::basic_string, std::allocator >) 2025-05-07T20:03:44.1653840Z U c10::Error::what() const 2025-05-07T20:03:44.1654124Z U c10::FloatType::get() 2025-05-07T20:03:44.1654461Z U c10::Half* at::TensorBase::data_ptr() const 2025-05-07T20:03:44.1654874Z U c10::Half* at::TensorBase::mutable_data_ptr() const 2025-05-07T20:03:44.1655338Z U c10::IValue::reportToTensorTypeError() const 2025-05-07T20:03:44.1655692Z U c10::IntType::get() 2025-05-07T20:03:44.1656038Z U c10::MessageLogger::MessageLogger(char const*, int, int) 2025-05-07T20:03:44.1656428Z U 
c10::MessageLogger::~MessageLogger() 2025-05-07T20:03:44.1656766Z U c10::OperatorHandle::~OperatorHandle() 2025-05-07T20:03:44.1657125Z U c10::OperatorHandle::~OperatorHandle() 2025-05-07T20:03:44.1657497Z U c10::SymBool::expect_true(char const*, long) const 2025-05-07T20:03:44.1657882Z U c10::SymBool::guard_bool(char const*, long) const 2025-05-07T20:03:44.1658278Z U c10::SymFloat::guard_float(char const*, long) const 2025-05-07T20:03:44.1658908Z U c10::SymInt::SymInt(c10::intrusive_ptr >) 2025-05-07T20:03:44.1659548Z U c10::SymInt::guard_int(char const*, long) const 2025-05-07T20:03:44.1659926Z U c10::SymInt::operator+=(c10::SymInt const&) 2025-05-07T20:03:44.1660278Z U c10::SymInt::operator/(c10::SymInt const&) const 2025-05-07T20:03:44.1660633Z U c10::SymInt::promote_to_negative() 2025-05-07T20:03:44.1660998Z U c10::SymInt::sym_eq(c10::SymInt const&) const 2025-05-07T20:03:44.1661365Z U c10::SymInt::sym_ge(c10::SymInt const&) const 2025-05-07T20:03:44.1661707Z U c10::SymInt::sym_gt(c10::SymInt const&) const 2025-05-07T20:03:44.1662062Z U c10::SymInt::sym_le(c10::SymInt const&) const 2025-05-07T20:03:44.1662444Z U c10::SymInt::sym_ne(c10::SymInt const&) const 2025-05-07T20:03:44.1662770Z U c10::SymInt::toSymNode() const 2025-05-07T20:03:44.1663086Z U c10::SymIntType::get() 2025-05-07T20:03:44.1663387Z U c10::TensorType::get() 2025-05-07T20:03:44.1663715Z U c10::UndefinedTensorImpl::_singleton 2025-05-07T20:03:44.1664643Z U c10::Warning::Warning(std::variant, c10::SourceLocation const&, std::__cxx11::basic_string, std::allocator >, bool) 2025-05-07T20:03:44.1665543Z U c10::cuda::CUDACachingAllocator::allocator 2025-05-07T20:03:44.1665908Z U c10::cuda::CUDAStream::stream() const 2025-05-07T20:03:44.1666239Z U c10::cuda::ExchangeDevice(signed char) 2025-05-07T20:03:44.1666581Z U c10::cuda::GetDevice(signed char*) 2025-05-07T20:03:44.1666923Z U c10::cuda::MaybeSetDevice(signed char) 2025-05-07T20:03:44.1667259Z U c10::cuda::SetDevice(signed char) 
2025-05-07T20:03:44.1667726Z U c10::cuda::c10_cuda_check_implementation(int, char const*, char const*, int, bool) 2025-05-07T20:03:44.1668174Z U c10::cuda::device_count() 2025-05-07T20:03:44.1668528Z U c10::cuda::getCurrentCUDAStream(signed char) 2025-05-07T20:03:44.1668892Z U c10::cuda::getDefaultCUDAStream(signed char) 2025-05-07T20:03:44.1669289Z U c10::cuda::getStreamFromPool(bool, signed char) 2025-05-07T20:03:44.1669691Z U c10::cuda::getStreamFromPool(int, signed char) 2025-05-07T20:03:44.1670074Z U c10::cuda::setCurrentCUDAStream(c10::cuda::CUDAStream) 2025-05-07T20:03:44.1670459Z U c10::cuda::warn_or_error_on_sync() 2025-05-07T20:03:44.1671180Z U c10::detail::infer_schema::make_function_schema(c10::ArrayRef, c10::ArrayRef) 2025-05-07T20:03:44.1672040Z U c10::detail::torchCheckFail(char const*, char const*, unsigned int, char const*) 2025-05-07T20:03:44.1672994Z U c10::detail::torchCheckFail(char const*, char const*, unsigned int, std::__cxx11::basic_string, std::allocator > const&) 2025-05-07T20:03:44.1674251Z U c10::detail::torchInternalAssertFail(char const*, char const*, unsigned int, char const*, char const*) 2025-05-07T20:03:44.1675341Z U c10::detail::torchInternalAssertFail(char const*, char const*, unsigned int, char const*, std::__cxx11::basic_string, std::allocator > const&) 2025-05-07T20:03:44.1676195Z U c10::get_default_dtype() 2025-05-07T20:03:44.1676540Z U c10::impl::GPUTrace::gpuTraceState 2025-05-07T20:03:44.1676916Z U c10::impl::GPUTrace::haveState 2025-05-07T20:03:44.1677480Z U c10::impl::OperatorEntry::assertSignatureIsCorrect(c10::impl::CppSignature const&, bool) const 2025-05-07T20:03:44.1678146Z U c10::impl::OperatorEntry::reportError(c10::DispatchKey) const 2025-05-07T20:03:44.1678598Z U c10::impl::device_guard_impl_registry 2025-05-07T20:03:44.1678955Z U c10::impl::raw_local_dispatch_key_set 2025-05-07T20:03:44.1679479Z U c10::initializeFunctionalityOffsetsAndMasks() 2025-05-07T20:03:44.1679838Z U c10::operator%(c10::SymInt const&, 
int) 2025-05-07T20:03:44.1680217Z U c10::operator*(c10::SymInt const&, long) 2025-05-07T20:03:44.1680566Z U c10::operator-(c10::SymInt const&, int) 2025-05-07T20:03:44.1680901Z U c10::operator<(c10::SymInt const&, int) 2025-05-07T20:03:44.1681295Z U c10::operator<<(std::ostream&, c10::Device const&) 2025-05-07T20:03:44.1681676Z U c10::operator<<(std::ostream&, c10::DeviceType) 2025-05-07T20:03:44.1682167Z U c10::operator<<(std::ostream&, c10::OperatorName const&) 2025-05-07T20:03:44.1682591Z U c10::operator<<(std::ostream&, c10::SymInt const&) 2025-05-07T20:03:44.1682931Z U c10::warn(c10::Warning const&) 2025-05-07T20:03:44.1683343Z U caffe2::TypeMeta::error_unsupported_typemeta(caffe2::TypeMeta) 2025-05-07T20:03:44.1683769Z U cudaDeviceSynchronize@libcudart.so.11.0 2025-05-07T20:03:44.1684151Z U cudaEventCreateWithFlags@libcudart.so.11.0 2025-05-07T20:03:44.1684501Z U cudaEventDestroy@libcudart.so.11.0 2025-05-07T20:03:44.1684858Z U cudaEventElapsedTime@libcudart.so.11.0 2025-05-07T20:03:44.1685196Z U cudaEventQuery@libcudart.so.11.0 2025-05-07T20:03:44.1685514Z U cudaEventRecord@libcudart.so.11.0 2025-05-07T20:03:44.1685914Z U cudaEventSynchronize@libcudart.so.11.0 2025-05-07T20:03:44.1686255Z U cudaGetErrorString@libcudart.so.11.0 2025-05-07T20:03:44.1686594Z U cudaGetLastError@libcudart.so.11.0 2025-05-07T20:03:44.1686935Z U cudaLaunchKernel@libcudart.so.11.0 2025-05-07T20:03:44.1687258Z U cudaStreamQuery@libcudart.so.11.0 2025-05-07T20:03:44.1687618Z U cudaStreamSynchronize@libcudart.so.11.0 2025-05-07T20:03:44.1687960Z U cudaStreamWaitEvent@libcudart.so.11.0 2025-05-07T20:03:44.1688670Z U fbgemm_gpu::reshape_vbe_output(at::Tensor const&, long, at::Tensor const&, at::Tensor const&) 2025-05-07T20:03:44.1689209Z U float at::Tensor::item() const 2025-05-07T20:03:44.1689561Z U float* at::TensorBase::data_ptr() const 2025-05-07T20:03:44.1689988Z U float* at::TensorBase::mutable_data_ptr() const 2025-05-07T20:03:44.1690339Z U free@GLIBC_2.2.5 
2025-05-07T20:03:44.1690814Z U int at::Tensor::item() const 2025-05-07T20:03:44.1691210Z U int* at::TensorBase::data_ptr() const 2025-05-07T20:03:44.1691606Z U int* at::TensorBase::mutable_data_ptr() const 2025-05-07T20:03:44.1693552Z U long c10::detail::maybe_wrap_dim_slow(long, long, bool) 2025-05-07T20:03:44.1693983Z U long* at::TensorBase::data_ptr() const 2025-05-07T20:03:44.1694408Z U long* at::TensorBase::mutable_data_ptr() const 2025-05-07T20:03:44.1694776Z U memcpy@GLIBC_2.14 2025-05-07T20:03:44.1695093Z U memset@GLIBC_2.2.5 2025-05-07T20:03:44.1695407Z U operator delete(void*)@GLIBCXX_3.4 2025-05-07T20:03:44.1695776Z U operator new(unsigned long)@GLIBCXX_3.4 2025-05-07T20:03:44.1696373Z U std::__cxx11::basic_ostringstream, std::allocator >::basic_ostringstream() 2025-05-07T20:03:44.1697232Z U std::__cxx11::basic_ostringstream, std::allocator >::~basic_ostringstream()@GLIBCXX_3.4.21 2025-05-07T20:03:44.1698153Z U std::__cxx11::basic_string, std::allocator >::_M_append(char const*, unsigned long)@GLIBCXX_3.4.21 2025-05-07T20:03:44.1699250Z U std::__cxx11::basic_string, std::allocator >::_M_assign(std::__cxx11::basic_string, std::allocator > const&) 2025-05-07T20:03:44.1700327Z U std::__cxx11::basic_string, std::allocator >::_M_construct(unsigned long, char)@GLIBCXX_3.4.21 2025-05-07T20:03:44.1701270Z U std::__cxx11::basic_string, std::allocator >::_M_create(unsigned long&, unsigned long)@GLIBCXX_3.4.21 2025-05-07T20:03:44.1702549Z U std::__cxx11::basic_string, std::allocator >::_M_mutate(unsigned long, unsigned long, char const*, unsigned long) 2025-05-07T20:03:44.1703656Z U std::__cxx11::basic_string, std::allocator >::_M_replace(unsigned long, unsigned long, char const*, unsigned long)@GLIBCXX_3.4.21 2025-05-07T20:03:44.1704475Z U std::__throw_length_error(char const*)@GLIBCXX_3.4 2025-05-07T20:03:44.1704901Z U std::__throw_logic_error(char const*)@GLIBCXX_3.4 2025-05-07T20:03:44.1705336Z U std::__throw_out_of_range_fmt(char const*, ...)@GLIBCXX_3.4.20 
2025-05-07T20:03:44.1705888Z U std::basic_ios >::clear(std::_Ios_Iostate)@GLIBCXX_3.4 2025-05-07T20:03:44.1706831Z U std::basic_ostream >& std::__ostream_insert >(std::basic_ostream >&, char const*, long)@GLIBCXX_3.4.9 2025-05-07T20:03:44.1707678Z U std::exception::~exception()@GLIBCXX_3.4 2025-05-07T20:03:44.1708050Z U std::ios_base::Init::Init()@GLIBCXX_3.4 2025-05-07T20:03:44.1708400Z U std::ios_base::Init::~Init()@GLIBCXX_3.4 2025-05-07T20:03:44.1708764Z U std::ios_base::~ios_base()@GLIBCXX_3.4 2025-05-07T20:03:44.1709106Z U std::locale::~locale()@GLIBCXX_3.4 2025-05-07T20:03:44.1709521Z U std::ostream& std::ostream::_M_insert(long)@GLIBCXX_3.4.9 2025-05-07T20:03:44.1710087Z U std::ostream& std::ostream::_M_insert(unsigned long)@GLIBCXX_3.4.9 2025-05-07T20:03:44.1710574Z U std::ostream::operator<<(int)@GLIBCXX_3.4 2025-05-07T20:03:44.1710937Z U std::terminate()@GLIBCXX_3.4 2025-05-07T20:03:44.1711240Z U strlen@GLIBC_2.2.5 2025-05-07T20:03:44.1711561Z U torch::CppFunction::~CppFunction() 2025-05-07T20:03:44.1712406Z U torch::Library::Library(torch::Library::Kind, std::__cxx11::basic_string, std::allocator >, std::optional, char const*, unsigned int) 2025-05-07T20:03:44.1713691Z U torch::Library::_def(c10::FunctionSchema&&, c10::OperatorName*, std::vector > const&, torch::_RegisterOrVerify) & 2025-05-07T20:03:44.1714648Z U torch::Library::_impl(char const*, torch::CppFunction&&, torch::_RegisterOrVerify) & 2025-05-07T20:03:44.1715413Z U torch::jit::parseSchema(std::__cxx11::basic_string, std::allocator > const&, bool) 2025-05-07T20:03:44.1716005Z U typeinfo for c10::Error 2025-05-07T20:03:44.1716378Z U vtable for __cxxabiv1::__class_type_info@CXXABI_1.3 2025-05-07T20:03:44.1716808Z U vtable for __cxxabiv1::__function_type_info@CXXABI_1.3 2025-05-07T20:03:44.1717256Z U vtable for __cxxabiv1::__si_class_type_info@CXXABI_1.3 2025-05-07T20:03:44.1717648Z U vtable for c10::Error 2025-05-07T20:03:44.1718195Z U vtable for std::__cxx11::basic_stringbuf, std::allocator 
>@GLIBCXX_3.4.21 2025-05-07T20:03:44.1718886Z U vtable for std::basic_streambuf >@GLIBCXX_3.4 2025-05-07T20:03:44.1719410Z w TLS init function for c10::impl::raw_local_dispatch_key_set 2025-05-07T20:03:44.1719825Z w _ITM_deregisterTMCloneTable 2025-05-07T20:03:44.1720149Z w _ITM_registerTMCloneTable 2025-05-07T20:03:44.1720513Z w __cxa_finalize@GLIBC_2.2.5 2025-05-07T20:03:44.1720834Z w __gmon_start__ 2025-05-07T20:03:44.1721108Z w __pthread_key_create 2025-05-07T20:03:44.1721476Z [CHECK] Listing out external shared libraries linked: 2025-05-07T20:03:44.1721978Z + ldd ./_skbuild/linux-x86_64-3.13/cmake-build/fbgemm_gpu_tbe_training_forward.so 2025-05-07T20:03:44.1722403Z 2025-05-07T20:03:44.1722548Z linux-vdso.so.1 (0x00007fffae533000) 2025-05-07T20:03:44.1722866Z libc10.so => not found 2025-05-07T20:03:44.1723118Z libc10_cuda.so => not found 2025-05-07T20:03:44.1723805Z fbgemm_gpu_tbe_common.so => /__w/FBGEMM/FBGEMM/fbgemm_gpu/./_skbuild/linux-x86_64-3.13/cmake-build/fbgemm_gpu_tbe_common.so (0x00007fc4fd20a000) 2025-05-07T20:03:44.1724502Z libtorch.so => not found 2025-05-07T20:03:44.1724788Z libtorch_cpu.so => not found 2025-05-07T20:03:44.1725070Z libtorch_cuda.so => not found 2025-05-07T20:03:44.1725478Z libcudart.so.11.0 => not found 2025-05-07T20:03:44.1725815Z libstdc++.so.6 => /lib64/libstdc++.so.6 (0x00007fc4fcfa6000) 2025-05-07T20:03:44.1726223Z libgcc_s.so.1 => /lib64/libgcc_s.so.1 (0x00007fc5097d4000) 2025-05-07T20:03:44.1726582Z libc.so.6 => /lib64/libc.so.6 (0x00007fc4fcd9e000) 2025-05-07T20:03:44.1726937Z /lib64/ld-linux-x86-64.so.2 (0x00007fc509808000) 2025-05-07T20:03:44.1727232Z libc10.so => not found 2025-05-07T20:03:44.1727477Z libc10_cuda.so => not found 2025-05-07T20:03:44.1727971Z fbgemm.so => /__w/FBGEMM/FBGEMM/fbgemm_gpu/./_skbuild/linux-x86_64-3.13/cmake-build/fbgemm.so (0x00007fc4fc800000) 2025-05-07T20:03:44.1728837Z fbgemm_gpu_config.so => /__w/FBGEMM/FBGEMM/fbgemm_gpu/./_skbuild/linux-x86_64-3.13/cmake-build/fbgemm_gpu_config.so 
(0x00007fc5097c7000) 2025-05-07T20:03:44.1729461Z libtorch.so => not found 2025-05-07T20:03:44.1729707Z libtorch_cpu.so => not found 2025-05-07T20:03:44.1729979Z libtorch_cuda.so => not found 2025-05-07T20:03:44.1730241Z libcudart.so.11.0 => not found 2025-05-07T20:03:44.1730557Z libgomp.so.1 => /lib64/libgomp.so.1 (0x00007fc50976f000) 2025-05-07T20:03:44.1730915Z libm.so.6 => /lib64/libm.so.6 (0x00007fc4fc725000) 2025-05-07T20:03:44.1731236Z libc10.so => not found 2025-05-07T20:03:44.1731716Z asmjit.so => /__w/FBGEMM/FBGEMM/fbgemm_gpu/./_skbuild/linux-x86_64-3.13/cmake-build/asmjit.so (0x00007fc4fc6ad000) 2025-05-07T20:03:44.1732253Z libtorch.so => not found 2025-05-07T20:03:44.1732507Z libtorch_cpu.so => not found 2025-05-07T20:03:44.1732753Z libtorch_cuda.so => not found 2025-05-07T20:03:44.1733016Z libtorch.so => not found 2025-05-07T20:03:44.1733239Z libc10.so => not found 2025-05-07T20:03:44.1733526Z libtorch_cpu.so => not found 2025-05-07T20:03:44.1733773Z libtorch_cuda.so => not found 2025-05-07T20:03:44.1734107Z libpthread.so.0 => /lib64/libpthread.so.0 (0x00007fc509766000) 2025-05-07T20:03:44.1734457Z libtorch_cpu.so => not found 2025-05-07T20:03:44.1734721Z libtorch_cuda.so => not found 2025-05-07T20:03:44.1734963Z libtorch.so => not found 2025-05-07T20:03:44.1735248Z librt.so.1 => /lib64/librt.so.1 (0x00007fc50975f000) 2025-05-07T20:03:44.1735472Z 2025-05-07T20:03:44.1735588Z [CHECK] Displaying ELF information: 2025-05-07T20:03:44.1736028Z + readelf -d ./_skbuild/linux-x86_64-3.13/cmake-build/fbgemm_gpu_tbe_training_forward.so 2025-05-07T20:03:44.1736394Z 2025-05-07T20:03:44.1752461Z 2025-05-07T20:03:44.1753676Z Dynamic section at offset 0xbaf1f50 contains 38 entries: 2025-05-07T20:03:44.1754888Z Tag Type Name/Value 2025-05-07T20:03:44.1756095Z 0x0000000000000001 (NEEDED) Shared library: [libc10.so] 2025-05-07T20:03:44.1757262Z 0x0000000000000001 (NEEDED) Shared library: [libc10_cuda.so] 2025-05-07T20:03:44.1757813Z 0x0000000000000001 (NEEDED) Shared 
library: [fbgemm_gpu_tbe_common.so] 2025-05-07T20:03:44.1758376Z 0x0000000000000001 (NEEDED) Shared library: [libtorch.so] 2025-05-07T20:03:44.1758900Z 0x0000000000000001 (NEEDED) Shared library: [libtorch_cpu.so] 2025-05-07T20:03:44.1759554Z 0x0000000000000001 (NEEDED) Shared library: [libtorch_cuda.so] 2025-05-07T20:03:44.1760324Z 0x0000000000000001 (NEEDED) Shared library: [libcudart.so.11.0] 2025-05-07T20:03:44.1760802Z 0x0000000000000001 (NEEDED) Shared library: [libstdc++.so.6] 2025-05-07T20:03:44.1761284Z 0x0000000000000001 (NEEDED) Shared library: [libgcc_s.so.1] 2025-05-07T20:03:44.1761784Z 0x0000000000000001 (NEEDED) Shared library: [libc.so.6] 2025-05-07T20:03:44.1762280Z 0x0000000000000001 (NEEDED) Shared library: [ld-linux-x86-64.so.2] 2025-05-07T20:03:44.1762839Z 0x000000000000000e (SONAME) Library soname: [fbgemm_gpu_tbe_training_forward.so] 2025-05-07T20:03:44.1763349Z 0x000000000000000f (RPATH) Library rpath: [$ORIGIN] 2025-05-07T20:03:44.1763736Z 0x000000000000000c (INIT) 0x448000 2025-05-07T20:03:44.1764059Z 0x000000000000000d (FINI) 0x1fced1c 2025-05-07T20:03:44.1764407Z 0x0000000000000019 (INIT_ARRAY) 0xbaea2f0 2025-05-07T20:03:44.1764733Z 0x000000000000001b (INIT_ARRAYSZ) 752 (bytes) 2025-05-07T20:03:44.1765265Z 0x000000000000001a (FINI_ARRAY) 0xbaea5e0 2025-05-07T20:03:44.1765604Z 0x000000000000001c (FINI_ARRAYSZ) 8 (bytes) 2025-05-07T20:03:44.1766127Z 0x000000006ffffef5 (GNU_HASH) 0x200 2025-05-07T20:03:44.1766641Z 0x0000000000000005 (STRTAB) 0x5dd10 2025-05-07T20:03:44.1766966Z 0x0000000000000006 (SYMTAB) 0x14360 2025-05-07T20:03:44.1767337Z 0x000000000000000a (STRSZ) 3688571 (bytes) 2025-05-07T20:03:44.1767702Z 0x000000000000000b (SYMENT) 24 (bytes) 2025-05-07T20:03:44.1768056Z 0x0000000000000003 (PLTGOT) 0xbaf21f0 2025-05-07T20:03:44.1768416Z 0x0000000000000002 (PLTRELSZ) 14520 (bytes) 2025-05-07T20:03:44.1768779Z 0x0000000000000014 (PLTREL) RELA 2025-05-07T20:03:44.1769113Z 0x0000000000000017 (JMPREL) 0x443ae8 
2025-05-07T20:03:44.1769435Z 0x0000000000000007 (RELA) 0x3e88a0 2025-05-07T20:03:44.1769783Z 0x0000000000000008 (RELASZ) 373320 (bytes) 2025-05-07T20:03:44.1770129Z 0x0000000000000009 (RELAENT) 24 (bytes) 2025-05-07T20:03:44.1770458Z 0x0000000000000018 (BIND_NOW) 2025-05-07T20:03:44.1770772Z 0x000000006ffffffb (FLAGS_1) Flags: NOW 2025-05-07T20:03:44.1771142Z 0x000000006ffffffe (VERNEED) 0x3e87b0 2025-05-07T20:03:44.1771467Z 0x000000006fffffff (VERNEEDNUM) 5 2025-05-07T20:03:44.1771803Z 0x000000006ffffff0 (VERSYM) 0x3e258c 2025-05-07T20:03:44.1772248Z 0x000000006ffffff9 (RELACOUNT) 1838 2025-05-07T20:03:44.1772558Z 0x0000000000000000 (NULL) 0x0 2025-05-07T20:03:44.1772758Z 2025-05-07T20:03:44.1772888Z ################################################################################ 2025-05-07T20:03:44.1773113Z 2025-05-07T20:03:44.1773117Z 2025-05-07T20:03:44.1773228Z ################################################################################ 2025-05-07T20:03:44.1773834Z [CHECK] BUILT LIBRARY: ./_skbuild/linux-x86_64-3.13/cmake-build/fbgemm_gpu_tbe_training_backward_split_host.so 2025-05-07T20:03:44.1774421Z [CHECK] Listing out library size: 2025-05-07T20:03:44.1774958Z + du -h --block-size=1M ./_skbuild/linux-x86_64-3.13/cmake-build/fbgemm_gpu_tbe_training_backward_split_host.so 2025-05-07T20:03:44.1775426Z 2025-05-07T20:03:44.1775700Z 5 ./_skbuild/linux-x86_64-3.13/cmake-build/fbgemm_gpu_tbe_training_backward_split_host.so 2025-05-07T20:03:44.1776087Z 2025-05-07T20:03:44.1776559Z [CHECK] Listing out the GLIBC versions referenced by: ./_skbuild/linux-x86_64-3.13/cmake-build/fbgemm_gpu_tbe_training_backward_split_host.so 2025-05-07T20:03:44.1777728Z + objdump -TC ./_skbuild/linux-x86_64-3.13/cmake-build/fbgemm_gpu_tbe_training_backward_split_host.so | grep GLIBC_ | sed 's/.*GLIBC_\([.0-9]*\).*/GLIBC_\1/g' | sort -Vu | cat 2025-05-07T20:03:44.1778404Z 2025-05-07T20:03:44.2030383Z GLIBC_2.2.5 2025-05-07T20:03:44.2031379Z GLIBC_2.3 2025-05-07T20:03:44.2031960Z 
GLIBC_2.14 2025-05-07T20:03:44.2032280Z 2025-05-07T20:03:44.2032293Z 2025-05-07T20:03:44.2034022Z [CHECK] Listing out the GLIBCXX versions referenced by: ./_skbuild/linux-x86_64-3.13/cmake-build/fbgemm_gpu_tbe_training_backward_split_host.so 2025-05-07T20:03:44.2035960Z + objdump -TC ./_skbuild/linux-x86_64-3.13/cmake-build/fbgemm_gpu_tbe_training_backward_split_host.so | grep GLIBCXX_ | sed 's/.*GLIBCXX_\([.0-9]*\).*/GLIBCXX_\1/g' | sort -Vu | cat 2025-05-07T20:03:44.2036660Z 2025-05-07T20:03:44.2285369Z GLIBCXX_3.4 2025-05-07T20:03:44.2285986Z GLIBCXX_3.4.9 2025-05-07T20:03:44.2286642Z GLIBCXX_3.4.11 2025-05-07T20:03:44.2287206Z GLIBCXX_3.4.15 2025-05-07T20:03:44.2287771Z GLIBCXX_3.4.18 2025-05-07T20:03:44.2288331Z GLIBCXX_3.4.20 2025-05-07T20:03:44.2288877Z GLIBCXX_3.4.21 2025-05-07T20:03:44.2289219Z 2025-05-07T20:03:44.2289232Z 2025-05-07T20:03:44.2305668Z + nm -gDC ./_skbuild/linux-x86_64-3.13/cmake-build/fbgemm_gpu_tbe_training_backward_split_host.so > /tmp/tmp.BxCNVtEOhP.symbols.txt 2025-05-07T20:03:44.2306265Z 2025-05-07T20:03:44.2520272Z 2025-05-07T20:03:44.2548166Z [CHECK] Total Number of symbols: 2987 2025-05-07T20:03:44.2569150Z [CHECK] Number of fbgemm symbols: 1 2025-05-07T20:03:44.2586504Z + nm -gDCu ./_skbuild/linux-x86_64-3.13/cmake-build/fbgemm_gpu_tbe_training_backward_split_host.so > /tmp/tmp.EhrBXidcuW.usymbols.txt 2025-05-07T20:03:44.2588282Z 2025-05-07T20:03:44.2609393Z 2025-05-07T20:03:44.2634819Z [CHECK] Listing out undefined symbols (196 total): 2025-05-07T20:03:44.2651330Z U VTT for std::__cxx11::basic_ostringstream, std::allocator >@GLIBCXX_3.4.21 2025-05-07T20:03:44.2653881Z U VTT for std::__cxx11::basic_stringstream, std::allocator >@GLIBCXX_3.4.21 2025-05-07T20:03:44.2655487Z U _Unwind_Resume@GCC_3.0 2025-05-07T20:03:44.2656218Z U __cxa_allocate_exception@CXXABI_1.3 2025-05-07T20:03:44.2656654Z U __cxa_atexit@GLIBC_2.2.5 2025-05-07T20:03:44.2656949Z U __cxa_begin_catch@CXXABI_1.3 2025-05-07T20:03:44.2657231Z U 
__cxa_end_catch@CXXABI_1.3 2025-05-07T20:03:44.2657530Z U __cxa_free_exception@CXXABI_1.3 2025-05-07T20:03:44.2657823Z U __cxa_guard_abort@CXXABI_1.3 2025-05-07T20:03:44.2658119Z U __cxa_guard_acquire@CXXABI_1.3 2025-05-07T20:03:44.2658408Z U __cxa_guard_release@CXXABI_1.3 2025-05-07T20:03:44.2658697Z U __cxa_rethrow@CXXABI_1.3 2025-05-07T20:03:44.2659236Z U __cxa_throw@CXXABI_1.3 2025-05-07T20:03:44.2659515Z U __gxx_personality_v0@CXXABI_1.3 2025-05-07T20:03:44.2659809Z U __tls_get_addr@GLIBC_2.3 2025-05-07T20:03:44.2660092Z U adjust_info_B_num_bits(int, int) 2025-05-07T20:03:44.2660461Z U at::RecordFunction::RecordFunction(at::StepCallbacks&&) 2025-05-07T20:03:44.2660844Z U at::RecordFunction::currentThreadId() 2025-05-07T20:03:44.2661161Z U at::RecordFunction::end() 2025-05-07T20:03:44.2661478Z U at::RecordFunction::~RecordFunction() 2025-05-07T20:03:44.2661813Z U at::SavedTensorDefaultHooks::set_tracing(bool) 2025-05-07T20:03:44.2662754Z U at::_ops::_sparse_coo_tensor_unsafe::call(at::Tensor const&, at::Tensor const&, c10::ArrayRef, std::optional, std::optional, std::optional, std::optional, std::optional) 2025-05-07T20:03:44.2663848Z U at::_ops::clamp::call(at::Tensor const&, std::optional const&, std::optional const&) 2025-05-07T20:03:44.2664799Z U at::_ops::empty_like::call(at::Tensor const&, std::optional, std::optional, std::optional, std::optional, std::optional) 2025-05-07T20:03:44.2666118Z U at::_ops::empty_memory_format::call(c10::ArrayRef, std::optional, std::optional, std::optional, std::optional, std::optional) 2025-05-07T20:03:44.2666986Z U at::_ops::fill__Scalar::call(at::Tensor&, c10::Scalar const&) 2025-05-07T20:03:44.2667456Z U at::_ops::flatten_using_ints::call(at::Tensor const&, long, long) 2025-05-07T20:03:44.2667925Z U at::_ops::reshape::call(at::Tensor const&, c10::ArrayRef) 2025-05-07T20:03:44.2668336Z U at::_ops::unsqueeze::call(at::Tensor const&, long) 2025-05-07T20:03:44.2668734Z U at::_ops::view_as::call(at::Tensor const&, at::Tensor 
const&) 2025-05-07T20:03:44.2669106Z U at::functorch::functorchTLSAccessor() 2025-05-07T20:03:44.2669472Z U at::getStepCallbacksUnlessEmpty(at::RecordScope) 2025-05-07T20:03:44.2669851Z U at::sequence_number::get_and_increment() 2025-05-07T20:03:44.2670157Z U bcmp@GLIBC_2.2.5 2025-05-07T20:03:44.2670431Z U c10::AnyType::get() 2025-05-07T20:03:44.2670698Z U c10::BoolType::get() 2025-05-07T20:03:44.2671059Z U c10::Dispatcher::findSchemaOrThrow(char const*, char const*) 2025-05-07T20:03:44.2671450Z U c10::Dispatcher::realSingleton() 2025-05-07T20:03:44.2672140Z U c10::Dispatcher::runRecordFunction(at::RecordFunction&, std::reference_wrapper, c10::DispatchKey, c10::DispatchKeySet) 2025-05-07T20:03:44.2673699Z U c10::Dispatcher::runRecordFunction(at::RecordFunction&, std::reference_wrapper, c10::DispatchKey, c10::DispatchKeySet, c10::ArrayRef) 2025-05-07T20:03:44.2674842Z U c10::Error::Error(c10::SourceLocation, std::__cxx11::basic_string, std::allocator >) 2025-05-07T20:03:44.2675435Z U c10::Error::what() const 2025-05-07T20:03:44.2675749Z U c10::FloatType::get() 2025-05-07T20:03:44.2676050Z U c10::GradMode::is_enabled() 2025-05-07T20:03:44.2676378Z U c10::GradMode::set_enabled(bool) 2025-05-07T20:03:44.2676753Z U c10::IValue::TagType::get(c10::IValue const&) 2025-05-07T20:03:44.2677147Z U c10::IValue::hash(c10::IValue const&) 2025-05-07T20:03:44.2677488Z U c10::IValue::isBoolList() const 2025-05-07T20:03:44.2677811Z U c10::IValue::isIntList() const 2025-05-07T20:03:44.2678221Z U c10::IValue::isSymIntList() const 2025-05-07T20:03:44.2678549Z U c10::IValue::isTensorList() const 2025-05-07T20:03:44.2678915Z U c10::IValue::reportToTensorTypeError() const 2025-05-07T20:03:44.2679263Z U c10::IntType::get() 2025-05-07T20:03:44.2679965Z U c10::ListType::get(std::__cxx11::basic_string, std::allocator > const&, c10::Type::SingletonOrSharedTypePtr) 2025-05-07T20:03:44.2680738Z U c10::MessageLogger::MessageLogger(char const*, int, int) 2025-05-07T20:03:44.2681145Z U 
c10::MessageLogger::~MessageLogger() 2025-05-07T20:03:44.2681509Z U c10::OperatorHandle::~OperatorHandle() 2025-05-07T20:03:44.2681865Z U c10::OperatorHandle::~OperatorHandle() 2025-05-07T20:03:44.2682329Z U c10::OptionalType::get(c10::Type::SingletonOrSharedTypePtr) 2025-05-07T20:03:44.2682966Z U c10::SmallVectorBase::grow_pod(void const*, unsigned long, unsigned long) 2025-05-07T20:03:44.2683452Z U c10::StringType::get() 2025-05-07T20:03:44.2683814Z U c10::SymBool::expect_true(char const*, long) const 2025-05-07T20:03:44.2684210Z U c10::SymBool::guard_bool(char const*, long) const 2025-05-07T20:03:44.2684681Z U c10::SymBool::guard_size_oblivious(char const*, long) const 2025-05-07T20:03:44.2685129Z U c10::SymFloat::guard_float(char const*, long) const 2025-05-07T20:03:44.2685539Z U c10::SymFloat::operator/(c10::SymFloat const&) const 2025-05-07T20:03:44.2686331Z U c10::SymInt::SymInt(c10::intrusive_ptr >) 2025-05-07T20:03:44.2686938Z U c10::SymInt::guard_int(char const*, long) const 2025-05-07T20:03:44.2687296Z U c10::SymInt::operator c10::SymFloat() const 2025-05-07T20:03:44.2687662Z U c10::SymInt::operator*(c10::SymInt const&) const 2025-05-07T20:03:44.2688013Z U c10::SymInt::operator/(c10::SymInt const&) const 2025-05-07T20:03:44.2688357Z U c10::SymInt::promote_to_negative() 2025-05-07T20:03:44.2688680Z U c10::SymInt::sym_eq(c10::SymInt const&) const 2025-05-07T20:03:44.2689031Z U c10::SymInt::sym_gt(c10::SymInt const&) const 2025-05-07T20:03:44.2689376Z U c10::SymInt::sym_le(c10::SymInt const&) const 2025-05-07T20:03:44.2689693Z U c10::SymInt::toSymNode() const 2025-05-07T20:03:44.2689996Z U c10::SymIntType::get() 2025-05-07T20:03:44.2690300Z U c10::TensorImpl::requires_grad() const 2025-05-07T20:03:44.2690613Z U c10::TensorType::get() 2025-05-07T20:03:44.2690913Z U c10::UndefinedTensorImpl::_singleton 2025-05-07T20:03:44.2691549Z U c10::detail::ListImpl::ListImpl(std::vector >, c10::Type::SingletonOrSharedTypePtr) 2025-05-07T20:03:44.2692539Z U 
c10::detail::infer_schema::make_function_schema(c10::ArrayRef, c10::ArrayRef) 2025-05-07T20:03:44.2693372Z U c10::detail::torchCheckFail(char const*, char const*, unsigned int, char const*) 2025-05-07T20:03:44.2694408Z U c10::detail::torchCheckFail(char const*, char const*, unsigned int, std::__cxx11::basic_string, std::allocator > const&) 2025-05-07T20:03:44.2695538Z U c10::detail::torchInternalAssertFail(char const*, char const*, unsigned int, char const*, char const*) 2025-05-07T20:03:44.2696597Z U c10::detail::torchInternalAssertFail(char const*, char const*, unsigned int, char const*, std::__cxx11::basic_string, std::allocator > const&) 2025-05-07T20:03:44.2697752Z U c10::impl::OperatorEntry::assertSignatureIsCorrect(c10::impl::CppSignature const&, bool) const 2025-05-07T20:03:44.2698396Z U c10::impl::OperatorEntry::reportError(c10::DispatchKey) const 2025-05-07T20:03:44.2698818Z U c10::impl::raw_local_dispatch_key_set 2025-05-07T20:03:44.2699200Z U c10::initializeFunctionalityOffsetsAndMasks() 2025-05-07T20:03:44.2699845Z U c10::ivalue::ConstantString::create(std::__cxx11::basic_string, std::allocator >) 2025-05-07T20:03:44.2700473Z U c10::operator-(c10::SymInt const&, int) 2025-05-07T20:03:44.2700883Z U c10::operator<<(std::ostream&, c10::OperatorName const&) 2025-05-07T20:03:44.2701308Z U c10::operator<<(std::ostream&, c10::SymFloat const&) 2025-05-07T20:03:44.2701720Z U c10::operator<<(std::ostream&, c10::SymInt const&) 2025-05-07T20:03:44.2702372Z U caffe2::TypeMeta::error_unsupported_typemeta(caffe2::TypeMeta) 2025-05-07T20:03:44.2702799Z U caffe2::TypeMeta::typeMetaDatas() 2025-05-07T20:03:44.2703279Z U fbgemm_gpu::config::is_feature_enabled(fbgemm_gpu::config::FeatureGateName const&) 2025-05-07T20:03:44.2703733Z U free@GLIBC_2.2.5 2025-05-07T20:03:44.2704156Z U long c10::detail::maybe_wrap_dim_slow(long, long, bool) 2025-05-07T20:03:44.2704531Z U memcmp@GLIBC_2.2.5 2025-05-07T20:03:44.2704826Z U memcpy@GLIBC_2.14 2025-05-07T20:03:44.2705097Z U 
memmove@GLIBC_2.2.5 2025-05-07T20:03:44.2705403Z U memset@GLIBC_2.2.5 2025-05-07T20:03:44.2705750Z U operator delete(void*)@GLIBCXX_3.4 2025-05-07T20:03:44.2706087Z U operator new(unsigned long)@GLIBCXX_3.4 2025-05-07T20:03:44.2706420Z U realloc@GLIBC_2.2.5 2025-05-07T20:03:44.2706827Z U std::_Hash_bytes(void const*, unsigned long, unsigned long)@CXXABI_1.3.5 2025-05-07T20:03:44.2707505Z U std::__cxx11::basic_ostringstream, std::allocator >::basic_ostringstream() 2025-05-07T20:03:44.2708361Z U std::__cxx11::basic_ostringstream, std::allocator >::~basic_ostringstream()@GLIBCXX_3.4.21 2025-05-07T20:03:44.2709264Z U std::__cxx11::basic_string, std::allocator >::_M_append(char const*, unsigned long)@GLIBCXX_3.4.21 2025-05-07T20:03:44.2710325Z U std::__cxx11::basic_string, std::allocator >::_M_assign(std::__cxx11::basic_string, std::allocator > const&) 2025-05-07T20:03:44.2711379Z U std::__cxx11::basic_string, std::allocator >::_M_construct(unsigned long, char)@GLIBCXX_3.4.21 2025-05-07T20:03:44.2712292Z U std::__cxx11::basic_string, std::allocator >::_M_create(unsigned long&, unsigned long)@GLIBCXX_3.4.21 2025-05-07T20:03:44.2713474Z U std::__cxx11::basic_string, std::allocator >::_M_replace(unsigned long, unsigned long, char const*, unsigned long)@GLIBCXX_3.4.21 2025-05-07T20:03:44.2714478Z U std::__cxx11::basic_string, std::allocator >::reserve(unsigned long)@GLIBCXX_3.4.21 2025-05-07T20:03:44.2715508Z U std::__cxx11::basic_string, std::allocator >::swap(std::__cxx11::basic_string, std::allocator >&)@GLIBCXX_3.4.21 2025-05-07T20:03:44.2716525Z U std::__cxx11::basic_stringstream, std::allocator >::basic_stringstream() 2025-05-07T20:03:44.2717365Z U std::__cxx11::basic_stringstream, std::allocator >::~basic_stringstream()@GLIBCXX_3.4.21 2025-05-07T20:03:44.2718341Z U std::__detail::_Prime_rehash_policy::_M_need_rehash(unsigned long, unsigned long, unsigned long) const@GLIBCXX_3.4.18 2025-05-07T20:03:44.2718954Z U std::__throw_bad_alloc()@GLIBCXX_3.4 
2025-05-07T20:03:44.2719281Z U std::__throw_bad_array_new_length() 2025-05-07T20:03:44.2719650Z U std::__throw_length_error(char const*)@GLIBCXX_3.4 2025-05-07T20:03:44.2720042Z U std::__throw_logic_error(char const*)@GLIBCXX_3.4 2025-05-07T20:03:44.2720455Z U std::__throw_out_of_range_fmt(char const*, ...)@GLIBCXX_3.4.20 2025-05-07T20:03:44.2720875Z U std::__throw_system_error(int)@GLIBCXX_3.4.11 2025-05-07T20:03:44.2721254Z U std::bad_weak_ptr::~bad_weak_ptr()@GLIBCXX_3.4.15 2025-05-07T20:03:44.2721752Z U std::basic_ios >::clear(std::_Ios_Iostate)@GLIBCXX_3.4 2025-05-07T20:03:44.2722701Z U std::basic_ostream >& std::__ostream_insert >(std::basic_ostream >&, char const*, long)@GLIBCXX_3.4.9 2025-05-07T20:03:44.2723512Z U std::exception::~exception()@GLIBCXX_3.4 2025-05-07T20:03:44.2723899Z U std::ios_base::Init::Init()@GLIBCXX_3.4 2025-05-07T20:03:44.2724257Z U std::ios_base::Init::~Init()@GLIBCXX_3.4 2025-05-07T20:03:44.2724598Z U std::ios_base::~ios_base()@GLIBCXX_3.4 2025-05-07T20:03:44.2724935Z U std::locale::~locale()@GLIBCXX_3.4 2025-05-07T20:03:44.2725360Z U std::ostream& std::ostream::_M_insert(long)@GLIBCXX_3.4.9 2025-05-07T20:03:44.2725990Z U std::ostream& std::ostream::_M_insert(unsigned long)@GLIBCXX_3.4.9 2025-05-07T20:03:44.2726436Z U std::ostream::operator<<(int)@GLIBCXX_3.4 2025-05-07T20:03:44.2726815Z U std::out_of_range::out_of_range(char const*)@GLIBCXX_3.4.21 2025-05-07T20:03:44.2727209Z U std::out_of_range::~out_of_range()@GLIBCXX_3.4 2025-05-07T20:03:44.2727830Z U std::runtime_error::runtime_error(std::__cxx11::basic_string, std::allocator > const&)@GLIBCXX_3.4.21 2025-05-07T20:03:44.2728464Z U std::runtime_error::~runtime_error()@GLIBCXX_3.4 2025-05-07T20:03:44.2728792Z U std::terminate()@GLIBCXX_3.4 2025-05-07T20:03:44.2729087Z U strcmp@GLIBC_2.2.5 2025-05-07T20:03:44.2729353Z U strlen@GLIBC_2.2.5 2025-05-07T20:03:44.2729637Z U torch::CppFunction::~CppFunction() 2025-05-07T20:03:44.2730406Z U torch::Library::Library(torch::Library::Kind, 
std::__cxx11::basic_string, std::allocator >, std::optional, char const*, unsigned int) 2025-05-07T20:03:44.2731520Z U torch::Library::_def(c10::FunctionSchema&&, c10::OperatorName*, std::vector > const&, torch::_RegisterOrVerify) & 2025-05-07T20:03:44.2732283Z U torch::Library::_impl(char const*, torch::CppFunction&&, torch::_RegisterOrVerify) & 2025-05-07T20:03:44.2732936Z U torch::autograd::AnomalyMode::_enabled 2025-05-07T20:03:44.2733449Z U torch::autograd::AutogradContext::AutogradContext(torch::dynamo::autograd::PackedArgs&) 2025-05-07T20:03:44.2734043Z U torch::autograd::AutogradContext::get_and_bump_dirty() const 2025-05-07T20:03:44.2734537Z U torch::autograd::AutogradContext::get_non_differentiable() const 2025-05-07T20:03:44.2735228Z U torch::autograd::AutogradContext::get_saved_variables() const 2025-05-07T20:03:44.2735889Z U torch::autograd::AutogradContext::save_for_backward(std::vector >) 2025-05-07T20:03:44.2736568Z U torch::autograd::AutogradContext::save_variables() 2025-05-07T20:03:44.2737034Z U torch::autograd::ForwardADLevel::try_get_by_idx(unsigned long) 2025-05-07T20:03:44.2737523Z U torch::autograd::InputMetadata::shape_as_dim_vector() const 2025-05-07T20:03:44.2737941Z U torch::autograd::Node::assign_parent() 2025-05-07T20:03:44.2738288Z U torch::autograd::Node::metadata() 2025-05-07T20:03:44.2738641Z U torch::autograd::Node::name[abi:cxx11]() const 2025-05-07T20:03:44.2739139Z U torch::autograd::SavedVariable::SavedVariable(at::Tensor const&, bool, bool) 2025-05-07T20:03:44.2739786Z U torch::autograd::SavedVariable::unpack(std::shared_ptr) const 2025-05-07T20:03:44.2740317Z U torch::autograd::VariableInfo::VariableInfo() 2025-05-07T20:03:44.2740792Z U torch::autograd::VariableInfo::VariableInfo(at::Tensor const&, bool) 2025-05-07T20:03:44.2741352Z U torch::autograd::VariableInfo::zeros(c10::OptionalDeviceGuard&) const 2025-05-07T20:03:44.2744494Z U torch::autograd::_wrap_outputs(std::vector > const&, std::unordered_set, std::equal_to, 
std::allocator > const&, std::unordered_set, std::equal_to, std::allocator > const&, c10::ArrayRef >, std::shared_ptr const&, std::function > (std::vector >, std::vector >)> const&, std::unordered_set, std::equal_to, std::allocator > const&, std::function const&) 2025-05-07T20:03:44.2747467Z U torch::autograd::deleteNode(torch::autograd::Node*) 2025-05-07T20:03:44.2747882Z U torch::autograd::get_current_graph_task_exec_info() 2025-05-07T20:03:44.2748317Z U torch::autograd::impl::gradient_edge(at::Tensor const&) 2025-05-07T20:03:44.2749792Z U torch::autograd::profiler::record_function_enter_new(std::__cxx11::basic_string, std::allocator > const&, std::optional, std::allocator > > const&) 2025-05-07T20:03:44.2751054Z U torch::dynamo::autograd::getPyCompilerInterface() 2025-05-07T20:03:44.2751749Z U torch::dynamo::autograd::get_input_metadata(std::vector > const&) 2025-05-07T20:03:44.2752642Z U torch::jit::parseSchema(std::__cxx11::basic_string, std::allocator > const&, bool) 2025-05-07T20:03:44.2753339Z U typeinfo for c10::Error 2025-05-07T20:03:44.2753786Z U typeinfo for std::bad_weak_ptr@GLIBCXX_3.4.15 2025-05-07T20:03:44.2754163Z U typeinfo for std::exception@GLIBCXX_3.4 2025-05-07T20:03:44.2754550Z U typeinfo for std::out_of_range@GLIBCXX_3.4 2025-05-07T20:03:44.2754926Z U typeinfo for std::runtime_error@GLIBCXX_3.4 2025-05-07T20:03:44.2755299Z U typeinfo for torch::autograd::Node 2025-05-07T20:03:44.2755691Z U vtable for __cxxabiv1::__class_type_info@CXXABI_1.3 2025-05-07T20:03:44.2756125Z U vtable for __cxxabiv1::__function_type_info@CXXABI_1.3 2025-05-07T20:03:44.2756572Z U vtable for __cxxabiv1::__pointer_type_info@CXXABI_1.3 2025-05-07T20:03:44.2757001Z U vtable for __cxxabiv1::__si_class_type_info@CXXABI_1.3 2025-05-07T20:03:44.2757456Z U vtable for c10::Error 2025-05-07T20:03:44.2757993Z U vtable for std::__cxx11::basic_stringbuf, std::allocator >@GLIBCXX_3.4.21 2025-05-07T20:03:44.2758589Z U vtable for std::bad_weak_ptr@GLIBCXX_3.4.15 
2025-05-07T20:03:44.2759081Z U vtable for std::basic_streambuf >@GLIBCXX_3.4 2025-05-07T20:03:44.2759541Z U vtable for torch::autograd::Node 2025-05-07T20:03:44.2759957Z w TLS init function for c10::impl::raw_local_dispatch_key_set 2025-05-07T20:03:44.2760358Z w _ITM_deregisterTMCloneTable 2025-05-07T20:03:44.2760682Z w _ITM_registerTMCloneTable 2025-05-07T20:03:44.2760992Z w __cxa_finalize@GLIBC_2.2.5 2025-05-07T20:03:44.2761278Z w __gmon_start__ 2025-05-07T20:03:44.2761556Z w __pthread_key_create 2025-05-07T20:03:44.2761855Z w pthread_mutex_lock@GLIBC_2.2.5 2025-05-07T20:03:44.2762201Z w pthread_mutex_unlock@GLIBC_2.2.5 2025-05-07T20:03:44.2762566Z [CHECK] Listing out external shared libraries linked: 2025-05-07T20:03:44.2763119Z + ldd ./_skbuild/linux-x86_64-3.13/cmake-build/fbgemm_gpu_tbe_training_backward_split_host.so 2025-05-07T20:03:44.2763523Z 2025-05-07T20:03:44.2763679Z linux-vdso.so.1 (0x00007fff52884000) 2025-05-07T20:03:44.2763987Z libc10.so => not found 2025-05-07T20:03:44.2764608Z fbgemm_gpu_config.so => /__w/FBGEMM/FBGEMM/fbgemm_gpu/./_skbuild/linux-x86_64-3.13/cmake-build/fbgemm_gpu_config.so (0x00007f4b66a8e000) 2025-05-07T20:03:44.2765740Z fbgemm_gpu_tbe_utils.so => /__w/FBGEMM/FBGEMM/fbgemm_gpu/./_skbuild/linux-x86_64-3.13/cmake-build/fbgemm_gpu_tbe_utils.so (0x00007f4b65a00000) 2025-05-07T20:03:44.2766392Z libtorch.so => not found 2025-05-07T20:03:44.2766637Z libtorch_cpu.so => not found 2025-05-07T20:03:44.2766881Z libtorch_cuda.so => not found 2025-05-07T20:03:44.2767193Z libstdc++.so.6 => /lib64/libstdc++.so.6 (0x00007f4b6579c000) 2025-05-07T20:03:44.2767579Z libgcc_s.so.1 => /lib64/libgcc_s.so.1 (0x00007f4b66a5e000) 2025-05-07T20:03:44.2767937Z libc.so.6 => /lib64/libc.so.6 (0x00007f4b65594000) 2025-05-07T20:03:44.2768258Z /lib64/ld-linux-x86-64.so.2 (0x00007f4b66a9d000) 2025-05-07T20:03:44.2768574Z libtorch.so => not found 2025-05-07T20:03:44.2768803Z libc10.so => not found 2025-05-07T20:03:44.2769024Z libtorch_cpu.so => not found 
2025-05-07T20:03:44.2769276Z libtorch_cuda.so => not found 2025-05-07T20:03:44.2769565Z libgomp.so.1 => /lib64/libgomp.so.1 (0x00007f4b66a06000) 2025-05-07T20:03:44.2769973Z libpthread.so.0 => /lib64/libpthread.so.0 (0x00007f4b66a01000) 2025-05-07T20:03:44.2770321Z libtorch.so => not found 2025-05-07T20:03:44.2770553Z libc10.so => not found 2025-05-07T20:03:44.2770774Z libc10_cuda.so => not found 2025-05-07T20:03:44.2770873Z libtorch_cpu.so => not found 2025-05-07T20:03:44.2770961Z libtorch_cuda.so => not found 2025-05-07T20:03:44.2771053Z libcudart.so.11.0 => not found 2025-05-07T20:03:44.2771176Z libm.so.6 => /lib64/libm.so.6 (0x00007f4b66924000) 2025-05-07T20:03:44.2771181Z 2025-05-07T20:03:44.2771275Z [CHECK] Displaying ELF information: 2025-05-07T20:03:44.2771573Z + readelf -d ./_skbuild/linux-x86_64-3.13/cmake-build/fbgemm_gpu_tbe_training_backward_split_host.so 2025-05-07T20:03:44.2771578Z 2025-05-07T20:03:44.2771581Z 2025-05-07T20:03:44.2771740Z Dynamic section at offset 0x4b06b0 contains 37 entries: 2025-05-07T20:03:44.2771842Z Tag Type Name/Value 2025-05-07T20:03:44.2772021Z 0x0000000000000001 (NEEDED) Shared library: [libc10.so] 2025-05-07T20:03:44.2772224Z 0x0000000000000001 (NEEDED) Shared library: [fbgemm_gpu_config.so] 2025-05-07T20:03:44.2772439Z 0x0000000000000001 (NEEDED) Shared library: [fbgemm_gpu_tbe_utils.so] 2025-05-07T20:03:44.2772621Z 0x0000000000000001 (NEEDED) Shared library: [libtorch.so] 2025-05-07T20:03:44.2772806Z 0x0000000000000001 (NEEDED) Shared library: [libtorch_cpu.so] 2025-05-07T20:03:44.2773074Z 0x0000000000000001 (NEEDED) Shared library: [libtorch_cuda.so] 2025-05-07T20:03:44.2773260Z 0x0000000000000001 (NEEDED) Shared library: [libstdc++.so.6] 2025-05-07T20:03:44.2773442Z 0x0000000000000001 (NEEDED) Shared library: [libgcc_s.so.1] 2025-05-07T20:03:44.2773635Z 0x0000000000000001 (NEEDED) Shared library: [libc.so.6] 2025-05-07T20:03:44.2773831Z 0x0000000000000001 (NEEDED) Shared library: [ld-linux-x86-64.so.2] 
2025-05-07T20:03:44.2774114Z 0x000000000000000e (SONAME) Library soname: [fbgemm_gpu_tbe_training_backward_split_host.so] 2025-05-07T20:03:44.2774294Z 0x000000000000000f (RPATH) Library rpath: [$ORIGIN] 2025-05-07T20:03:44.2774399Z 0x000000000000000c (INIT) 0xd0000 2025-05-07T20:03:44.2774662Z 0x000000000000000d (FINI) 0x3f2b18 2025-05-07T20:03:44.2774769Z 0x0000000000000019 (INIT_ARRAY) 0x4a9ff8 2025-05-07T20:03:44.2774900Z 0x000000000000001b (INIT_ARRAYSZ) 304 (bytes) 2025-05-07T20:03:44.2775006Z 0x000000000000001a (FINI_ARRAY) 0x4aa128 2025-05-07T20:03:44.2775115Z 0x000000000000001c (FINI_ARRAYSZ) 8 (bytes) 2025-05-07T20:03:44.2775232Z 0x000000006ffffef5 (GNU_HASH) 0x200 2025-05-07T20:03:44.2775333Z 0x0000000000000005 (STRTAB) 0x15da8 2025-05-07T20:03:44.2775461Z 0x0000000000000006 (SYMTAB) 0x4588 2025-05-07T20:03:44.2775598Z 0x000000000000000a (STRSZ) 609567 (bytes) 2025-05-07T20:03:44.2775703Z 0x000000000000000b (SYMENT) 24 (bytes) 2025-05-07T20:03:44.2775806Z 0x0000000000000003 (PLTGOT) 0x4b1940 2025-05-07T20:03:44.2775932Z 0x0000000000000002 (PLTRELSZ) 31704 (bytes) 2025-05-07T20:03:44.2776070Z 0x0000000000000014 (PLTREL) RELA 2025-05-07T20:03:44.2776172Z 0x0000000000000017 (JMPREL) 0xc7630 2025-05-07T20:03:44.2776272Z 0x0000000000000007 (RELA) 0xac330 2025-05-07T20:03:44.2776414Z 0x0000000000000008 (RELASZ) 111360 (bytes) 2025-05-07T20:03:44.2776521Z 0x0000000000000009 (RELAENT) 24 (bytes) 2025-05-07T20:03:44.2776609Z 0x0000000000000018 (BIND_NOW) 2025-05-07T20:03:44.2776737Z 0x000000006ffffffb (FLAGS_1) Flags: NOW 2025-05-07T20:03:44.2776840Z 0x000000006ffffffe (VERNEED) 0xac220 2025-05-07T20:03:44.2776943Z 0x000000006fffffff (VERNEEDNUM) 4 2025-05-07T20:03:44.2777047Z 0x000000006ffffff0 (VERSYM) 0xaaac8 2025-05-07T20:03:44.2777157Z 0x000000006ffffff9 (RELACOUNT) 40 2025-05-07T20:03:44.2777246Z 0x0000000000000000 (NULL) 0x0 2025-05-07T20:03:44.2777250Z 2025-05-07T20:03:44.2777354Z 
################################################################################ 2025-05-07T20:03:44.2777358Z 2025-05-07T20:03:44.2777362Z 2025-05-07T20:03:44.2777476Z ################################################################################ 2025-05-07T20:03:44.2777746Z [CHECK] BUILT LIBRARY: ./_skbuild/linux-x86_64-3.13/cmake-build/fbgemm_gpu_tbe_index_select.so 2025-05-07T20:03:44.2777843Z [CHECK] Listing out library size: 2025-05-07T20:03:44.2778114Z + du -h --block-size=1M ./_skbuild/linux-x86_64-3.13/cmake-build/fbgemm_gpu_tbe_index_select.so 2025-05-07T20:03:44.2778117Z 2025-05-07T20:03:44.2778318Z 18 ./_skbuild/linux-x86_64-3.13/cmake-build/fbgemm_gpu_tbe_index_select.so 2025-05-07T20:03:44.2778322Z 2025-05-07T20:03:44.2778697Z [CHECK] Listing out the GLIBC versions referenced by: ./_skbuild/linux-x86_64-3.13/cmake-build/fbgemm_gpu_tbe_index_select.so 2025-05-07T20:03:44.2779181Z + objdump -TC ./_skbuild/linux-x86_64-3.13/cmake-build/fbgemm_gpu_tbe_index_select.so | grep GLIBC_ | sed 's/.*GLIBC_\([.0-9]*\).*/GLIBC_\1/g' | sort -Vu | cat 2025-05-07T20:03:44.2779185Z 2025-05-07T20:03:44.2867965Z GLIBC_2.2.5 2025-05-07T20:03:44.2868889Z GLIBC_2.3 2025-05-07T20:03:44.2869140Z GLIBC_2.14 2025-05-07T20:03:44.2869158Z 2025-05-07T20:03:44.2869523Z 2025-05-07T20:03:44.2870969Z [CHECK] Listing out the GLIBCXX versions referenced by: ./_skbuild/linux-x86_64-3.13/cmake-build/fbgemm_gpu_tbe_index_select.so 2025-05-07T20:03:44.2872618Z + objdump -TC ./_skbuild/linux-x86_64-3.13/cmake-build/fbgemm_gpu_tbe_index_select.so | grep GLIBCXX_ | sed 's/.*GLIBCXX_\([.0-9]*\).*/GLIBCXX_\1/g' | sort -Vu | cat 2025-05-07T20:03:44.2872634Z 2025-05-07T20:03:44.2983788Z GLIBCXX_3.4 2025-05-07T20:03:44.2984675Z GLIBCXX_3.4.9 2025-05-07T20:03:44.2984988Z GLIBCXX_3.4.11 2025-05-07T20:03:44.2985206Z GLIBCXX_3.4.15 2025-05-07T20:03:44.2985425Z GLIBCXX_3.4.18 2025-05-07T20:03:44.2985664Z GLIBCXX_3.4.20 2025-05-07T20:03:44.2985882Z GLIBCXX_3.4.21 2025-05-07T20:03:44.2985901Z 
2025-05-07T20:03:44.2985914Z 2025-05-07T20:03:44.3001797Z + nm -gDC ./_skbuild/linux-x86_64-3.13/cmake-build/fbgemm_gpu_tbe_index_select.so > /tmp/tmp.pP8NLEYg20.symbols.txt 2025-05-07T20:03:44.3001828Z 2025-05-07T20:03:44.3083414Z 2025-05-07T20:03:44.3108061Z [CHECK] Total Number of symbols: 1515 2025-05-07T20:03:44.3123198Z [CHECK] Number of fbgemm symbols: 211 2025-05-07T20:03:44.3141556Z + nm -gDCu ./_skbuild/linux-x86_64-3.13/cmake-build/fbgemm_gpu_tbe_index_select.so > /tmp/tmp.e8m6oiXDeu.usymbols.txt 2025-05-07T20:03:44.3141584Z 2025-05-07T20:03:44.3161523Z 2025-05-07T20:03:44.3188076Z [CHECK] Listing out undefined symbols (273 total): 2025-05-07T20:03:44.3202741Z U VTT for std::__cxx11::basic_ostringstream, std::allocator >@GLIBCXX_3.4.21 2025-05-07T20:03:44.3204009Z U VTT for std::__cxx11::basic_stringstream, std::allocator >@GLIBCXX_3.4.21 2025-05-07T20:03:44.3204319Z U _Unwind_Resume@GCC_3.0 2025-05-07T20:03:44.3204878Z U __cudaPopCallConfiguration@libcudart.so.11.0 2025-05-07T20:03:44.3205303Z U __cudaPushCallConfiguration@libcudart.so.11.0 2025-05-07T20:03:44.3205696Z U __cudaRegisterFatBinary@libcudart.so.11.0 2025-05-07T20:03:44.3206108Z U __cudaRegisterFatBinaryEnd@libcudart.so.11.0 2025-05-07T20:03:44.3206475Z U __cudaRegisterFunction@libcudart.so.11.0 2025-05-07T20:03:44.3206806Z U __cudaRegisterVar@libcudart.so.11.0 2025-05-07T20:03:44.3207211Z U __cudaUnregisterFatBinary@libcudart.so.11.0 2025-05-07T20:03:44.3207624Z U __cxa_allocate_exception@CXXABI_1.3 2025-05-07T20:03:44.3207729Z U __cxa_atexit@GLIBC_2.2.5 2025-05-07T20:03:44.3207853Z U __cxa_begin_catch@CXXABI_1.3 2025-05-07T20:03:44.3207953Z U __cxa_end_catch@CXXABI_1.3 2025-05-07T20:03:44.3208063Z U __cxa_free_exception@CXXABI_1.3 2025-05-07T20:03:44.3208165Z U __cxa_guard_abort@CXXABI_1.3 2025-05-07T20:03:44.3208296Z U __cxa_guard_acquire@CXXABI_1.3 2025-05-07T20:03:44.3208402Z U __cxa_guard_release@CXXABI_1.3 2025-05-07T20:03:44.3208513Z U __cxa_pure_virtual@CXXABI_1.3 
2025-05-07T20:03:44.3208637Z U __cxa_rethrow@CXXABI_1.3 2025-05-07T20:03:44.3208742Z U __cxa_thread_atexit@CXXABI_1.3.7 2025-05-07T20:03:44.3208838Z U __cxa_throw@CXXABI_1.3 2025-05-07T20:03:44.3208958Z U __gxx_personality_v0@CXXABI_1.3 2025-05-07T20:03:44.3209052Z U __tls_get_addr@GLIBC_2.3 2025-05-07T20:03:44.3209161Z U adjust_info_B_num_bits(int, int) 2025-05-07T20:03:44.3209302Z U at::Context::deterministicAlgorithms() const 2025-05-07T20:03:44.3209491Z U at::RecordFunction::RecordFunction(at::StepCallbacks&&) 2025-05-07T20:03:44.3209616Z U at::RecordFunction::currentThreadId() 2025-05-07T20:03:44.3209741Z U at::RecordFunction::~RecordFunction() 2025-05-07T20:03:44.3209896Z U at::SavedTensorDefaultHooks::set_tracing(bool) 2025-05-07T20:03:44.3210086Z U at::TensorBase::__dispatch_contiguous(c10::MemoryFormat) const 2025-05-07T20:03:44.3210298Z U at::TensorMaker::make_tensor() 2025-05-07T20:03:44.3210436Z U at::_ops::all::call(at::Tensor const&) 2025-05-07T20:03:44.3210589Z U at::_ops::concat::call(c10::ArrayRef, long) 2025-05-07T20:03:44.3210753Z U at::_ops::copy_::call(at::Tensor&, at::Tensor const&, bool) 2025-05-07T20:03:44.3211377Z U at::_ops::empty_like::call(at::Tensor const&, std::optional, std::optional, std::optional, std::optional, std::optional) 2025-05-07T20:03:44.3212038Z U at::_ops::empty_memory_format::call(c10::ArrayRef, std::optional, std::optional, std::optional, std::optional, std::optional) 2025-05-07T20:03:44.3212230Z U at::_ops::eq_Scalar::call(at::Tensor const&, c10::Scalar const&) 2025-05-07T20:03:44.3212409Z U at::_ops::eq_Tensor::call(at::Tensor const&, at::Tensor const&) 2025-05-07T20:03:44.3212596Z U at::_ops::flatten_using_ints::call(at::Tensor const&, long, long) 2025-05-07T20:03:44.3212780Z U at::_ops::gt_Scalar::call(at::Tensor const&, c10::Scalar const&) 2025-05-07T20:03:44.3213123Z U at::_ops::index_add::call(at::Tensor const&, long, at::Tensor const&, at::Tensor const&, c10::Scalar const&) 2025-05-07T20:03:44.3213332Z U 
at::_ops::index_select::call(at::Tensor const&, long, at::Tensor const&) 2025-05-07T20:03:44.3213461Z U at::_ops::max::call(at::Tensor const&) 2025-05-07T20:03:44.3213638Z U at::_ops::mul_Tensor::call(at::Tensor const&, at::Tensor const&) 2025-05-07T20:03:44.3213873Z U at::_ops::narrow::call(at::Tensor const&, long, c10::SymInt, c10::SymInt) 2025-05-07T20:03:44.3214054Z U at::_ops::select_int::call(at::Tensor const&, long, c10::SymInt) 2025-05-07T20:03:44.3214302Z U at::_ops::split_with_sizes::call(at::Tensor const&, c10::ArrayRef, long) 2025-05-07T20:03:44.3214619Z U at::_ops::to_dtype::call(at::Tensor const&, c10::ScalarType, bool, bool, std::optional) 2025-05-07T20:03:44.3215276Z U at::_ops::to_dtype_layout::call(at::Tensor const&, std::optional, std::optional, std::optional, std::optional, bool, bool, std::optional) 2025-05-07T20:03:44.3215456Z U at::_ops::view::call(at::Tensor const&, c10::ArrayRef) 2025-05-07T20:03:44.3215620Z U at::_ops::view_as::call(at::Tensor const&, at::Tensor const&) 2025-05-07T20:03:44.3216121Z U at::_ops::zeros::call(c10::ArrayRef, std::optional, std::optional, std::optional, std::optional) 2025-05-07T20:03:44.3216827Z U at::_ops::zeros_like::call(at::Tensor const&, std::optional, std::optional, std::optional, std::optional, std::optional) 2025-05-07T20:03:44.3216960Z U at::cuda::getCurrentDeviceProperties() 2025-05-07T20:03:44.3217101Z U at::functorch::functorchTLSAccessor() 2025-05-07T20:03:44.3217256Z U at::getStepCallbacksUnlessEmpty(at::RecordScope) 2025-05-07T20:03:44.3217357Z U at::globalContext() 2025-05-07T20:03:44.3217507Z U at::has_internal_overlap(at::TensorBase const&) 2025-05-07T20:03:44.3217632Z U at::sequence_number::get_and_increment() 2025-05-07T20:03:44.3217728Z U bcmp@GLIBC_2.2.5 2025-05-07T20:03:44.3217862Z U bool at::Tensor::item() const 2025-05-07T20:03:44.3217959Z U c10::AnyType::get() 2025-05-07T20:03:44.3218122Z U c10::AutogradMetaInterface::~AutogradMetaInterface() 2025-05-07T20:03:44.3219719Z U 
c10::BFloat16* at::TensorBase::mutable_data_ptr() const 2025-05-07T20:03:44.3219819Z U c10::BoolType::get() 2025-05-07T20:03:44.3219975Z U c10::DeviceTypeName[abi:cxx11](c10::DeviceType, bool) 2025-05-07T20:03:44.3220165Z U c10::Dispatcher::findSchemaOrThrow(char const*, char const*) 2025-05-07T20:03:44.3220278Z U c10::Dispatcher::realSingleton() 2025-05-07T20:03:44.3220792Z U c10::Dispatcher::runRecordFunction(at::RecordFunction&, std::reference_wrapper, c10::DispatchKey, c10::DispatchKeySet) 2025-05-07T20:03:44.3221531Z U c10::Dispatcher::runRecordFunction(at::RecordFunction&, std::reference_wrapper, c10::DispatchKey, c10::DispatchKeySet, c10::ArrayRef) 2025-05-07T20:03:44.3221885Z U c10::Error::Error(c10::SourceLocation, std::__cxx11::basic_string, std::allocator >) 2025-05-07T20:03:44.3221987Z U c10::Error::what() const 2025-05-07T20:03:44.3222097Z U c10::GradMode::is_enabled() 2025-05-07T20:03:44.3222199Z U c10::GradMode::set_enabled(bool) 2025-05-07T20:03:44.3222366Z U c10::Half* at::TensorBase::mutable_data_ptr() const 2025-05-07T20:03:44.3222615Z U c10::IValue::TagType::get(c10::IValue const&) 2025-05-07T20:03:44.3222723Z U c10::IValue::hash(c10::IValue const&) 2025-05-07T20:03:44.3222830Z U c10::IValue::isBoolList() const 2025-05-07T20:03:44.3222944Z U c10::IValue::isIntList() const 2025-05-07T20:03:44.3223078Z U c10::IValue::isSymIntList() const 2025-05-07T20:03:44.3223189Z U c10::IValue::isTensorList() const 2025-05-07T20:03:44.3223319Z U c10::IValue::reportToTensorTypeError() const 2025-05-07T20:03:44.3223423Z U c10::IntType::get() 2025-05-07T20:03:44.3223881Z U c10::ListType::get(std::__cxx11::basic_string, std::allocator > const&, c10::Type::SingletonOrSharedTypePtr) 2025-05-07T20:03:44.3224038Z U c10::MessageLogger::MessageLogger(char const*, int, int) 2025-05-07T20:03:44.3224167Z U c10::MessageLogger::~MessageLogger() 2025-05-07T20:03:44.3224285Z U c10::OperatorHandle::~OperatorHandle() 2025-05-07T20:03:44.3224398Z U 
c10::OperatorHandle::~OperatorHandle() 2025-05-07T20:03:44.3224669Z U c10::SmallVectorBase::grow_pod(void const*, unsigned long, unsigned long) 2025-05-07T20:03:44.3224816Z U c10::StorageImpl::throw_data_ptr_access_error() const 2025-05-07T20:03:44.3224916Z U c10::StringType::get() 2025-05-07T20:03:44.3225056Z U c10::SymBool::guard_bool(char const*, long) const 2025-05-07T20:03:44.3225445Z U c10::SymInt::SymInt(c10::intrusive_ptr >) 2025-05-07T20:03:44.3225574Z U c10::SymInt::guard_int(char const*, long) const 2025-05-07T20:03:44.3225688Z U c10::SymInt::promote_to_negative() 2025-05-07T20:03:44.3225793Z U c10::SymInt::toSymNode() const 2025-05-07T20:03:44.3225885Z U c10::SymIntType::get() 2025-05-07T20:03:44.3226034Z U c10::SymbolicShapeMeta::init_is_contiguous() const 2025-05-07T20:03:44.3226143Z U c10::TensorImpl::requires_grad() const 2025-05-07T20:03:44.3226563Z U c10::TensorImpl::set_autograd_meta(std::unique_ptr >) 2025-05-07T20:03:44.3226714Z U c10::TensorImpl::throw_data_ptr_access_error() const 2025-05-07T20:03:44.3226808Z U c10::TensorType::get() 2025-05-07T20:03:44.3227033Z U c10::Type::isSubtypeOfExt(c10::Type const&, std::ostream*) const 2025-05-07T20:03:44.3227135Z U c10::Type::is_module() const 2025-05-07T20:03:44.3227247Z U c10::UndefinedTensorImpl::_singleton 2025-05-07T20:03:44.3227932Z U c10::Warning::Warning(std::variant, c10::SourceLocation const&, std::__cxx11::basic_string, std::allocator >, bool) 2025-05-07T20:03:44.3228066Z U c10::cuda::CUDACachingAllocator::allocator 2025-05-07T20:03:44.3228174Z U c10::cuda::CUDAStream::stream() const 2025-05-07T20:03:44.3228291Z U c10::cuda::ExchangeDevice(signed char) 2025-05-07T20:03:44.3228419Z U c10::cuda::GetDevice(signed char*) 2025-05-07T20:03:44.3228527Z U c10::cuda::MaybeSetDevice(signed char) 2025-05-07T20:03:44.3228636Z U c10::cuda::SetDevice(signed char) 2025-05-07T20:03:44.3228888Z U c10::cuda::c10_cuda_check_implementation(int, char const*, char const*, int, bool) 
2025-05-07T20:03:44.3228982Z U c10::cuda::device_count() 2025-05-07T20:03:44.3229110Z U c10::cuda::getCurrentCUDAStream(signed char) 2025-05-07T20:03:44.3229267Z U c10::cuda::getDefaultCUDAStream(signed char) 2025-05-07T20:03:44.3229396Z U c10::cuda::getStreamFromPool(bool, signed char) 2025-05-07T20:03:44.3229526Z U c10::cuda::getStreamFromPool(int, signed char) 2025-05-07T20:03:44.3229673Z U c10::cuda::setCurrentCUDAStream(c10::cuda::CUDAStream) 2025-05-07T20:03:44.3229810Z U c10::cuda::warn_or_error_on_sync() 2025-05-07T20:03:44.3230217Z U c10::detail::ListImpl::ListImpl(std::vector >, c10::Type::SingletonOrSharedTypePtr) 2025-05-07T20:03:44.3230718Z U c10::detail::infer_schema::make_function_schema(c10::ArrayRef, c10::ArrayRef) 2025-05-07T20:03:44.3230958Z U c10::detail::torchCheckFail(char const*, char const*, unsigned int, char const*) 2025-05-07T20:03:44.3231424Z U c10::detail::torchCheckFail(char const*, char const*, unsigned int, std::__cxx11::basic_string, std::allocator > const&) 2025-05-07T20:03:44.3231759Z U c10::detail::torchInternalAssertFail(char const*, char const*, unsigned int, char const*, char const*) 2025-05-07T20:03:44.3232307Z U c10::detail::torchInternalAssertFail(char const*, char const*, unsigned int, char const*, std::__cxx11::basic_string, std::allocator > const&) 2025-05-07T20:03:44.3232563Z U c10::impl::ExcludeDispatchKeyGuard::ExcludeDispatchKeyGuard(c10::DispatchKeySet) 2025-05-07T20:03:44.3232957Z U c10::impl::ExcludeDispatchKeyGuard::ExcludeDispatchKeyGuard(c10::DispatchKeySet) 2025-05-07T20:03:44.3233332Z U c10::impl::ExcludeDispatchKeyGuard::~ExcludeDispatchKeyGuard() 2025-05-07T20:03:44.3233453Z U c10::impl::GPUTrace::gpuTraceState 2025-05-07T20:03:44.3233584Z U c10::impl::GPUTrace::haveState 2025-05-07T20:03:44.3233918Z U c10::impl::OperatorEntry::assertSignatureIsCorrect(c10::impl::CppSignature const&, bool) const 2025-05-07T20:03:44.3234107Z U c10::impl::OperatorEntry::reportError(c10::DispatchKey) const 
2025-05-07T20:03:44.3234277Z U c10::impl::cow::is_cow_data_ptr(c10::DataPtr const&) 2025-05-07T20:03:44.3234451Z U c10::impl::cow::materialize_cow_storage(c10::StorageImpl&) 2025-05-07T20:03:44.3234577Z U c10::impl::device_guard_impl_registry 2025-05-07T20:03:44.3234708Z U c10::impl::raw_local_dispatch_key_set 2025-05-07T20:03:44.3234924Z U c10::initializeFunctionalityOffsetsAndMasks() 2025-05-07T20:03:44.3235308Z U c10::ivalue::ConstantString::create(std::__cxx11::basic_string, std::allocator >) 2025-05-07T20:03:44.3235464Z U c10::operator<<(std::ostream&, c10::Device const&) 2025-05-07T20:03:44.3235610Z U c10::operator<<(std::ostream&, c10::DeviceType) 2025-05-07T20:03:44.3235775Z U c10::operator<<(std::ostream&, c10::OperatorName const&) 2025-05-07T20:03:44.3235900Z U c10::throwNullDataPtrError() 2025-05-07T20:03:44.3236026Z U c10::typeKindToString(c10::TypeKind) 2025-05-07T20:03:44.3236138Z U c10::warn(c10::Warning const&) 2025-05-07T20:03:44.3236262Z U c10::warnDeprecatedDataPtr() 2025-05-07T20:03:44.3236456Z U caffe2::TypeMeta::error_unsupported_typemeta(caffe2::TypeMeta) 2025-05-07T20:03:44.3236577Z U caffe2::TypeMeta::typeMetaDatas() 2025-05-07T20:03:44.3236721Z U cudaDeviceGetAttribute@libcudart.so.11.0 2025-05-07T20:03:44.3236849Z U cudaDeviceSynchronize@libcudart.so.11.0 2025-05-07T20:03:44.3236987Z U cudaEventCreateWithFlags@libcudart.so.11.0 2025-05-07T20:03:44.3237108Z U cudaEventDestroy@libcudart.so.11.0 2025-05-07T20:03:44.3237275Z U cudaEventElapsedTime@libcudart.so.11.0 2025-05-07T20:03:44.3237393Z U cudaEventQuery@libcudart.so.11.0 2025-05-07T20:03:44.3237510Z U cudaEventRecord@libcudart.so.11.0 2025-05-07T20:03:44.3237649Z U cudaEventSynchronize@libcudart.so.11.0 2025-05-07T20:03:44.3237799Z U cudaFuncSetAttribute@libcudart.so.11.0 2025-05-07T20:03:44.3237936Z U cudaGetDeviceProperties@libcudart.so.11.0 2025-05-07T20:03:44.3238072Z U cudaGetErrorString@libcudart.so.11.0 2025-05-07T20:03:44.3238195Z U cudaGetLastError@libcudart.so.11.0 
2025-05-07T20:03:44.3238314Z U cudaLaunchKernel@libcudart.so.11.0 2025-05-07T20:03:44.3238432Z U cudaStreamQuery@libcudart.so.11.0 2025-05-07T20:03:44.3238569Z U cudaStreamSynchronize@libcudart.so.11.0 2025-05-07T20:03:44.3238693Z U cudaStreamWaitEvent@libcudart.so.11.0 2025-05-07T20:03:44.3238895Z U fbgemm_gpu::asynchronous_complete_cumsum_cpu(at::Tensor const&) 2025-05-07T20:03:44.3239065Z U float* at::TensorBase::mutable_data_ptr() const 2025-05-07T20:03:44.3239160Z U free@GLIBC_2.2.5 2025-05-07T20:03:44.3239304Z U int* at::TensorBase::mutable_data_ptr() const 2025-05-07T20:03:44.3239410Z U log2f@GLIBC_2.2.5 2025-05-07T20:03:44.3239524Z U long at::Tensor::item() const 2025-05-07T20:03:44.3239701Z U long c10::detail::maybe_wrap_dim_slow(long, long, bool) 2025-05-07T20:03:44.3239847Z U long* at::TensorBase::data_ptr() const 2025-05-07T20:03:44.3239999Z U long* at::TensorBase::mutable_data_ptr() const 2025-05-07T20:03:44.3240094Z U memcmp@GLIBC_2.2.5 2025-05-07T20:03:44.3240188Z U memcpy@GLIBC_2.14 2025-05-07T20:03:44.3240295Z U memmove@GLIBC_2.2.5 2025-05-07T20:03:44.3240390Z U memset@GLIBC_2.2.5 2025-05-07T20:03:44.3240506Z U operator delete(void*)@GLIBCXX_3.4 2025-05-07T20:03:44.3240638Z U operator new(unsigned long)@GLIBCXX_3.4 2025-05-07T20:03:44.3240731Z U realloc@GLIBC_2.2.5 2025-05-07T20:03:44.3240948Z U std::_Hash_bytes(void const*, unsigned long, unsigned long)@CXXABI_1.3.5 2025-05-07T20:03:44.3241307Z U std::__cxx11::basic_ostringstream, std::allocator >::basic_ostringstream() 2025-05-07T20:03:44.3241754Z U std::__cxx11::basic_ostringstream, std::allocator >::~basic_ostringstream()@GLIBCXX_3.4.21 2025-05-07T20:03:44.3242158Z U std::__cxx11::basic_string, std::allocator >::_M_append(char const*, unsigned long)@GLIBCXX_3.4.21 2025-05-07T20:03:44.3242717Z U std::__cxx11::basic_string, std::allocator >::_M_assign(std::__cxx11::basic_string, std::allocator > const&) 2025-05-07T20:03:44.3243107Z U std::__cxx11::basic_string, std::allocator 
>::_M_construct(unsigned long, char)@GLIBCXX_3.4.21 2025-05-07T20:03:44.3243522Z U std::__cxx11::basic_string, std::allocator >::_M_create(unsigned long&, unsigned long)@GLIBCXX_3.4.21 2025-05-07T20:03:44.3243992Z U std::__cxx11::basic_string, std::allocator >::_M_mutate(unsigned long, unsigned long, char const*, unsigned long) 2025-05-07T20:03:44.3244521Z U std::__cxx11::basic_string, std::allocator >::_M_replace(unsigned long, unsigned long, char const*, unsigned long)@GLIBCXX_3.4.21 2025-05-07T20:03:44.3245122Z U std::__cxx11::basic_string, std::allocator >::swap(std::__cxx11::basic_string, std::allocator >&)@GLIBCXX_3.4.21 2025-05-07T20:03:44.3245461Z U std::__cxx11::basic_stringstream, std::allocator >::basic_stringstream() 2025-05-07T20:03:44.3245866Z U std::__detail::_Prime_rehash_policy::_M_need_rehash(unsigned long, unsigned long, unsigned long) const@GLIBCXX_3.4.18 2025-05-07T20:03:44.3245994Z U std::__throw_bad_alloc()@GLIBCXX_3.4 2025-05-07T20:03:44.3246114Z U std::__throw_bad_array_new_length() 2025-05-07T20:03:44.3246256Z U std::__throw_length_error(char const*)@GLIBCXX_3.4 2025-05-07T20:03:44.3246402Z U std::__throw_logic_error(char const*)@GLIBCXX_3.4 2025-05-07T20:03:44.3246575Z U std::__throw_out_of_range_fmt(char const*, ...)@GLIBCXX_3.4.20 2025-05-07T20:03:44.3246707Z U std::__throw_system_error(int)@GLIBCXX_3.4.11 2025-05-07T20:03:44.3246910Z U std::bad_weak_ptr::~bad_weak_ptr()@GLIBCXX_3.4.15 2025-05-07T20:03:44.3247154Z U std::basic_ios >::clear(std::_Ios_Iostate)@GLIBCXX_3.4 2025-05-07T20:03:44.3247754Z U std::basic_ostream >& std::__ostream_insert >(std::basic_ostream >&, char const*, long)@GLIBCXX_3.4.9 2025-05-07T20:03:44.3247887Z U std::exception::~exception()@GLIBCXX_3.4 2025-05-07T20:03:44.3248010Z U std::ios_base::Init::Init()@GLIBCXX_3.4 2025-05-07T20:03:44.3248149Z U std::ios_base::Init::~Init()@GLIBCXX_3.4 2025-05-07T20:03:44.3248263Z U std::ios_base::~ios_base()@GLIBCXX_3.4 2025-05-07T20:03:44.3248378Z U 
std::locale::~locale()@GLIBCXX_3.4 2025-05-07T20:03:44.3248577Z U std::ostream& std::ostream::_M_insert(long)@GLIBCXX_3.4.9 2025-05-07T20:03:44.3248818Z U std::ostream& std::ostream::_M_insert(unsigned long)@GLIBCXX_3.4.9 2025-05-07T20:03:44.3248944Z U std::ostream::operator<<(int)@GLIBCXX_3.4 2025-05-07T20:03:44.3249121Z U std::out_of_range::out_of_range(char const*)@GLIBCXX_3.4.21 2025-05-07T20:03:44.3249256Z U std::out_of_range::~out_of_range()@GLIBCXX_3.4 2025-05-07T20:03:44.3249437Z U std::runtime_error::runtime_error(char const*)@GLIBCXX_3.4.21 2025-05-07T20:03:44.3249881Z U std::runtime_error::runtime_error(std::__cxx11::basic_string, std::allocator > const&)@GLIBCXX_3.4.21 2025-05-07T20:03:44.3250247Z U std::runtime_error::~runtime_error()@GLIBCXX_3.4 2025-05-07T20:03:44.3250354Z U std::terminate()@GLIBCXX_3.4 2025-05-07T20:03:44.3250470Z U strcmp@GLIBC_2.2.5 2025-05-07T20:03:44.3250560Z U strlen@GLIBC_2.2.5 2025-05-07T20:03:44.3250682Z U torch::CppFunction::~CppFunction() 2025-05-07T20:03:44.3251300Z U torch::Library::Library(torch::Library::Kind, std::__cxx11::basic_string, std::allocator >, std::optional, char const*, unsigned int) 2025-05-07T20:03:44.3251769Z U torch::Library::_def(c10::FunctionSchema&&, c10::OperatorName*, std::vector > const&, torch::_RegisterOrVerify) & 2025-05-07T20:03:44.3252029Z U torch::Library::_impl(char const*, torch::CppFunction&&, torch::_RegisterOrVerify) & 2025-05-07T20:03:44.3252168Z U torch::autograd::AnomalyMode::_enabled 2025-05-07T20:03:44.3252474Z U torch::autograd::AutogradContext::AutogradContext(torch::dynamo::autograd::PackedArgs&) 2025-05-07T20:03:44.3252669Z U torch::autograd::AutogradContext::get_and_bump_dirty() const 2025-05-07T20:03:44.3252907Z U torch::autograd::AutogradContext::get_non_differentiable() const 2025-05-07T20:03:44.3253096Z U torch::autograd::AutogradContext::get_saved_variables() const 2025-05-07T20:03:44.3253451Z U torch::autograd::AutogradContext::save_for_backward(std::vector >) 
2025-05-07T20:03:44.3253636Z U torch::autograd::AutogradContext::save_variables() 2025-05-07T20:03:44.3253829Z U torch::autograd::ForwardADLevel::try_get_by_idx(unsigned long) 2025-05-07T20:03:44.3254008Z U torch::autograd::InputMetadata::shape_as_dim_vector() const 2025-05-07T20:03:44.3254144Z U torch::autograd::Node::assign_parent() 2025-05-07T20:03:44.3254259Z U torch::autograd::Node::metadata() 2025-05-07T20:03:44.3254401Z U torch::autograd::Node::name[abi:cxx11]() const 2025-05-07T20:03:44.3254660Z U torch::autograd::SavedVariable::SavedVariable(at::Tensor const&, bool, bool) 2025-05-07T20:03:44.3254937Z U torch::autograd::SavedVariable::unpack(std::shared_ptr) const 2025-05-07T20:03:44.3255080Z U torch::autograd::VariableInfo::VariableInfo() 2025-05-07T20:03:44.3255304Z U torch::autograd::VariableInfo::VariableInfo(at::Tensor const&, bool) 2025-05-07T20:03:44.3255522Z U torch::autograd::VariableInfo::zeros(c10::OptionalDeviceGuard&) const 2025-05-07T20:03:44.3258249Z U torch::autograd::_wrap_outputs(std::vector > const&, std::unordered_set, std::equal_to, std::allocator > const&, std::unordered_set, std::equal_to, std::allocator > const&, c10::ArrayRef >, std::shared_ptr const&, std::function > (std::vector >, std::vector >)> const&, std::unordered_set, std::equal_to, std::allocator > const&, std::function const&) 2025-05-07T20:03:44.3258407Z U torch::autograd::deleteNode(torch::autograd::Node*) 2025-05-07T20:03:44.3258560Z U torch::autograd::get_current_graph_task_exec_info() 2025-05-07T20:03:44.3258734Z U torch::autograd::impl::gradient_edge(at::Tensor const&) 2025-05-07T20:03:44.3258938Z U torch::dynamo::autograd::getPyCompilerInterface() 2025-05-07T20:03:44.3259353Z U torch::dynamo::autograd::get_input_metadata(std::vector > const&) 2025-05-07T20:03:44.3259737Z U torch::jit::parseSchema(std::__cxx11::basic_string, std::allocator > const&, bool) 2025-05-07T20:03:44.3260299Z U transpose_embedding_input(at::Tensor, long, at::Tensor, at::Tensor, bool, 
std::optional const&, long, long, long, bool, std::optional const&, long, long) 2025-05-07T20:03:44.3260417Z U typeinfo for c10::Error 2025-05-07T20:03:44.3260522Z U typeinfo for c10::Type 2025-05-07T20:03:44.3260661Z U typeinfo for std::bad_weak_ptr@GLIBCXX_3.4.15 2025-05-07T20:03:44.3260800Z U typeinfo for std::out_of_range@GLIBCXX_3.4 2025-05-07T20:03:44.3261055Z U typeinfo for std::runtime_error@GLIBCXX_3.4 2025-05-07T20:03:44.3261168Z U typeinfo for torch::autograd::Node 2025-05-07T20:03:44.3261307Z U vtable for __cxxabiv1::__class_type_info@CXXABI_1.3 2025-05-07T20:03:44.3261465Z U vtable for __cxxabiv1::__function_type_info@CXXABI_1.3 2025-05-07T20:03:44.3261635Z U vtable for __cxxabiv1::__pointer_type_info@CXXABI_1.3 2025-05-07T20:03:44.3261779Z U vtable for __cxxabiv1::__si_class_type_info@CXXABI_1.3 2025-05-07T20:03:44.3261940Z U vtable for __cxxabiv1::__vmi_class_type_info@CXXABI_1.3 2025-05-07T20:03:44.3262036Z U vtable for c10::Error 2025-05-07T20:03:44.3262146Z U vtable for c10::ListType 2025-05-07T20:03:44.3262468Z U vtable for std::__cxx11::basic_stringbuf, std::allocator >@GLIBCXX_3.4.21 2025-05-07T20:03:44.3262594Z U vtable for std::bad_weak_ptr@GLIBCXX_3.4.15 2025-05-07T20:03:44.3262806Z U vtable for std::basic_streambuf >@GLIBCXX_3.4 2025-05-07T20:03:44.3262938Z U vtable for torch::autograd::AutogradMeta 2025-05-07T20:03:44.3263045Z U vtable for torch::autograd::Node 2025-05-07T20:03:44.3263217Z w TLS init function for c10::impl::raw_local_dispatch_key_set 2025-05-07T20:03:44.3263332Z w _ITM_deregisterTMCloneTable 2025-05-07T20:03:44.3263433Z w _ITM_registerTMCloneTable 2025-05-07T20:03:44.3263532Z w __cxa_finalize@GLIBC_2.2.5 2025-05-07T20:03:44.3263629Z w __gmon_start__ 2025-05-07T20:03:44.3263720Z w __pthread_key_create 2025-05-07T20:03:44.3263825Z w pthread_mutex_lock@GLIBC_2.2.5 2025-05-07T20:03:44.3263928Z w pthread_mutex_unlock@GLIBC_2.2.5 2025-05-07T20:03:44.3264071Z [CHECK] Listing out external shared libraries linked: 
2025-05-07T20:03:44.3264284Z + ldd ./_skbuild/linux-x86_64-3.13/cmake-build/fbgemm_gpu_tbe_index_select.so 2025-05-07T20:03:44.3264292Z 2025-05-07T20:03:44.3264391Z linux-vdso.so.1 (0x00007ffc39597000) 2025-05-07T20:03:44.3264479Z libc10.so => not found 2025-05-07T20:03:44.3264567Z libc10_cuda.so => not found 2025-05-07T20:03:44.3265094Z fbgemm_gpu_sparse_async_cumsum.so => /__w/FBGEMM/FBGEMM/fbgemm_gpu/./_skbuild/linux-x86_64-3.13/cmake-build/fbgemm_gpu_sparse_async_cumsum.so (0x00007f3464508000) 2025-05-07T20:03:44.3265530Z fbgemm_gpu_tbe_utils.so => /__w/FBGEMM/FBGEMM/fbgemm_gpu/./_skbuild/linux-x86_64-3.13/cmake-build/fbgemm_gpu_tbe_utils.so (0x00007f3463c00000) 2025-05-07T20:03:44.3265616Z libtorch.so => not found 2025-05-07T20:03:44.3265711Z libtorch_cpu.so => not found 2025-05-07T20:03:44.3265796Z libtorch_cuda.so => not found 2025-05-07T20:03:44.3265891Z libcudart.so.11.0 => not found 2025-05-07T20:03:44.3266041Z libstdc++.so.6 => /lib64/libstdc++.so.6 (0x00007f346399c000) 2025-05-07T20:03:44.3266198Z libm.so.6 => /lib64/libm.so.6 (0x00007f34638c1000) 2025-05-07T20:03:44.3266340Z libgcc_s.so.1 => /lib64/libgcc_s.so.1 (0x00007f34659d1000) 2025-05-07T20:03:44.3266448Z libc.so.6 => /lib64/libc.so.6 (0x00007f34636b9000) 2025-05-07T20:03:44.3266562Z /lib64/ld-linux-x86-64.so.2 (0x00007f3465a05000) 2025-05-07T20:03:44.3266649Z libc10.so => not found 2025-05-07T20:03:44.3266736Z libc10_cuda.so => not found 2025-05-07T20:03:44.3266816Z libtorch.so => not found 2025-05-07T20:03:44.3266899Z libtorch_cpu.so => not found 2025-05-07T20:03:44.3267001Z libtorch_cuda.so => not found 2025-05-07T20:03:44.3267085Z libcudart.so.11.0 => not found 2025-05-07T20:03:44.3267217Z libgomp.so.1 => /lib64/libgomp.so.1 (0x00007f3465977000) 2025-05-07T20:03:44.3267313Z libtorch.so => not found 2025-05-07T20:03:44.3267394Z libc10.so => not found 2025-05-07T20:03:44.3267474Z libc10_cuda.so => not found 2025-05-07T20:03:44.3267560Z libtorch_cpu.so => not found 2025-05-07T20:03:44.3267666Z 
libtorch_cuda.so => not found 2025-05-07T20:03:44.3267753Z libcudart.so.11.0 => not found 2025-05-07T20:03:44.3267770Z 2025-05-07T20:03:44.3267865Z [CHECK] Displaying ELF information: 2025-05-07T20:03:44.3268110Z + readelf -d ./_skbuild/linux-x86_64-3.13/cmake-build/fbgemm_gpu_tbe_index_select.so 2025-05-07T20:03:44.3268115Z 2025-05-07T20:03:44.3283050Z 2025-05-07T20:03:44.3283383Z Dynamic section at offset 0x11af470 contains 40 entries: 2025-05-07T20:03:44.3283496Z Tag Type Name/Value 2025-05-07T20:03:44.3283712Z 0x0000000000000001 (NEEDED) Shared library: [libc10.so] 2025-05-07T20:03:44.3284440Z 0x0000000000000001 (NEEDED) Shared library: [libc10_cuda.so] 2025-05-07T20:03:44.3284864Z 0x0000000000000001 (NEEDED) Shared library: [fbgemm_gpu_sparse_async_cumsum.so] 2025-05-07T20:03:44.3285109Z 0x0000000000000001 (NEEDED) Shared library: [fbgemm_gpu_tbe_utils.so] 2025-05-07T20:03:44.3285315Z 0x0000000000000001 (NEEDED) Shared library: [libtorch.so] 2025-05-07T20:03:44.3285567Z 0x0000000000000001 (NEEDED) Shared library: [libtorch_cpu.so] 2025-05-07T20:03:44.3285781Z 0x0000000000000001 (NEEDED) Shared library: [libtorch_cuda.so] 2025-05-07T20:03:44.3285993Z 0x0000000000000001 (NEEDED) Shared library: [libcudart.so.11.0] 2025-05-07T20:03:44.3286205Z 0x0000000000000001 (NEEDED) Shared library: [libstdc++.so.6] 2025-05-07T20:03:44.3286402Z 0x0000000000000001 (NEEDED) Shared library: [libm.so.6] 2025-05-07T20:03:44.3286602Z 0x0000000000000001 (NEEDED) Shared library: [libgcc_s.so.1] 2025-05-07T20:03:44.3286793Z 0x0000000000000001 (NEEDED) Shared library: [libc.so.6] 2025-05-07T20:03:44.3287017Z 0x0000000000000001 (NEEDED) Shared library: [ld-linux-x86-64.so.2] 2025-05-07T20:03:44.3287259Z 0x000000000000000e (SONAME) Library soname: [fbgemm_gpu_tbe_index_select.so] 2025-05-07T20:03:44.3287445Z 0x000000000000000f (RPATH) Library rpath: [$ORIGIN] 2025-05-07T20:03:44.3287565Z 0x000000000000000c (INIT) 0x53000 2025-05-07T20:03:44.3287695Z 0x000000000000000d (FINI) 0x14c8cc 
2025-05-07T20:03:44.3287815Z 0x0000000000000019 (INIT_ARRAY) 0x11ae010 2025-05-07T20:03:44.3287940Z 0x000000000000001b (INIT_ARRAYSZ) 144 (bytes) 2025-05-07T20:03:44.3288078Z 0x000000000000001a (FINI_ARRAY) 0x11ae0a0 2025-05-07T20:03:44.3288194Z 0x000000000000001c (FINI_ARRAYSZ) 8 (bytes) 2025-05-07T20:03:44.3288306Z 0x000000006ffffef5 (GNU_HASH) 0x238 2025-05-07T20:03:44.3288429Z 0x0000000000000005 (STRTAB) 0xb768 2025-05-07T20:03:44.3288535Z 0x0000000000000006 (SYMTAB) 0x2948 2025-05-07T20:03:44.3288673Z 0x000000000000000a (STRSZ) 240496 (bytes) 2025-05-07T20:03:44.3288792Z 0x000000000000000b (SYMENT) 24 (bytes) 2025-05-07T20:03:44.3288916Z 0x0000000000000003 (PLTGOT) 0x11af730 2025-05-07T20:03:44.3289153Z 0x0000000000000002 (PLTRELSZ) 16896 (bytes) 2025-05-07T20:03:44.3289262Z 0x0000000000000014 (PLTREL) RELA 2025-05-07T20:03:44.3289383Z 0x0000000000000017 (JMPREL) 0x4e360 2025-05-07T20:03:44.3289493Z 0x0000000000000007 (RELA) 0x47010 2025-05-07T20:03:44.3289626Z 0x0000000000000008 (RELASZ) 29520 (bytes) 2025-05-07T20:03:44.3289761Z 0x0000000000000009 (RELAENT) 24 (bytes) 2025-05-07T20:03:44.3289858Z 0x0000000000000018 (BIND_NOW) 2025-05-07T20:03:44.3289979Z 0x000000006ffffffb (FLAGS_1) Flags: NOW 2025-05-07T20:03:44.3290093Z 0x000000006ffffffe (VERNEED) 0x46eb0 2025-05-07T20:03:44.3290216Z 0x000000006fffffff (VERNEEDNUM) 6 2025-05-07T20:03:44.3290329Z 0x000000006ffffff0 (VERSYM) 0x462d8 2025-05-07T20:03:44.3290436Z 0x000000006ffffff9 (RELACOUNT) 213 2025-05-07T20:03:44.3290547Z 0x0000000000000000 (NULL) 0x0 2025-05-07T20:03:44.3290558Z 2025-05-07T20:03:44.3290670Z ################################################################################ 2025-05-07T20:03:44.3290675Z 2025-05-07T20:03:44.3290679Z 2025-05-07T20:03:44.3290786Z ################################################################################ 2025-05-07T20:03:44.3291118Z [CHECK] BUILT LIBRARY: ./_skbuild/linux-x86_64-3.13/cmake-build/fbgemm_gpu_embedding_inplace_ops.so 
2025-05-07T20:03:44.3291268Z [CHECK] Listing out library size: 2025-05-07T20:03:44.3291581Z + du -h --block-size=1M ./_skbuild/linux-x86_64-3.13/cmake-build/fbgemm_gpu_embedding_inplace_ops.so 2025-05-07T20:03:44.3291586Z 2025-05-07T20:03:44.3298304Z 1 ./_skbuild/linux-x86_64-3.13/cmake-build/fbgemm_gpu_embedding_inplace_ops.so 2025-05-07T20:03:44.3298371Z 2025-05-07T20:03:44.3301163Z [CHECK] Listing out the GLIBC versions referenced by: ./_skbuild/linux-x86_64-3.13/cmake-build/fbgemm_gpu_embedding_inplace_ops.so 2025-05-07T20:03:44.3303334Z + objdump -TC ./_skbuild/linux-x86_64-3.13/cmake-build/fbgemm_gpu_embedding_inplace_ops.so | grep GLIBC_ | sed 's/.*GLIBC_\([.0-9]*\).*/GLIBC_\1/g' | sort -Vu | cat 2025-05-07T20:03:44.3303385Z 2025-05-07T20:03:44.3355538Z GLIBC_2.2.5 2025-05-07T20:03:44.3355640Z GLIBC_2.3 2025-05-07T20:03:44.3355728Z GLIBC_2.14 2025-05-07T20:03:44.3356892Z 2025-05-07T20:03:44.3356923Z 2025-05-07T20:03:44.3357644Z [CHECK] Listing out the GLIBCXX versions referenced by: ./_skbuild/linux-x86_64-3.13/cmake-build/fbgemm_gpu_embedding_inplace_ops.so 2025-05-07T20:03:44.3358232Z + objdump -TC ./_skbuild/linux-x86_64-3.13/cmake-build/fbgemm_gpu_embedding_inplace_ops.so | grep GLIBCXX_ | sed 's/.*GLIBCXX_\([.0-9]*\).*/GLIBCXX_\1/g' | sort -Vu | cat 2025-05-07T20:03:44.3358238Z 2025-05-07T20:03:44.3422066Z GLIBCXX_3.4 2025-05-07T20:03:44.3423062Z GLIBCXX_3.4.9 2025-05-07T20:03:44.3423213Z GLIBCXX_3.4.18 2025-05-07T20:03:44.3423314Z GLIBCXX_3.4.20 2025-05-07T20:03:44.3423398Z GLIBCXX_3.4.21 2025-05-07T20:03:44.3423405Z 2025-05-07T20:03:44.3423411Z 2025-05-07T20:03:44.3441990Z + nm -gDC ./_skbuild/linux-x86_64-3.13/cmake-build/fbgemm_gpu_embedding_inplace_ops.so > /tmp/tmp.tA2y7jRoDK.symbols.txt 2025-05-07T20:03:44.3442071Z 2025-05-07T20:03:44.3468744Z 2025-05-07T20:03:44.3495276Z [CHECK] Total Number of symbols: 349 2025-05-07T20:03:44.3506002Z [CHECK] Number of fbgemm symbols: 57 2025-05-07T20:03:44.3522686Z + nm -gDCu 
./_skbuild/linux-x86_64-3.13/cmake-build/fbgemm_gpu_embedding_inplace_ops.so > /tmp/tmp.7e1kZl2nQG.usymbols.txt 2025-05-07T20:03:44.3522704Z 2025-05-07T20:03:44.3541632Z 2025-05-07T20:03:44.3565090Z [CHECK] Listing out undefined symbols (123 total): 2025-05-07T20:03:44.3582186Z U VTT for std::__cxx11::basic_ostringstream, std::allocator >@GLIBCXX_3.4.21 2025-05-07T20:03:44.3582598Z U VTT for std::__cxx11::basic_stringstream, std::allocator >@GLIBCXX_3.4.21 2025-05-07T20:03:44.3582778Z U _Unwind_Resume@GCC_3.0 2025-05-07T20:03:44.3582943Z U __cudaPopCallConfiguration@libcudart.so.11.0 2025-05-07T20:03:44.3583438Z U __cudaPushCallConfiguration@libcudart.so.11.0 2025-05-07T20:03:44.3583572Z U __cudaRegisterFatBinary@libcudart.so.11.0 2025-05-07T20:03:44.3583729Z U __cudaRegisterFatBinaryEnd@libcudart.so.11.0 2025-05-07T20:03:44.3583860Z U __cudaRegisterFunction@libcudart.so.11.0 2025-05-07T20:03:44.3583982Z U __cudaRegisterVar@libcudart.so.11.0 2025-05-07T20:03:44.3584126Z U __cudaUnregisterFatBinary@libcudart.so.11.0 2025-05-07T20:03:44.3584226Z U __cxa_atexit@GLIBC_2.2.5 2025-05-07T20:03:44.3584333Z U __cxa_begin_catch@CXXABI_1.3 2025-05-07T20:03:44.3584435Z U __cxa_end_catch@CXXABI_1.3 2025-05-07T20:03:44.3584542Z U __cxa_guard_abort@CXXABI_1.3 2025-05-07T20:03:44.3584647Z U __cxa_guard_acquire@CXXABI_1.3 2025-05-07T20:03:44.3584750Z U __cxa_guard_release@CXXABI_1.3 2025-05-07T20:03:44.3584857Z U __cxa_rethrow@CXXABI_1.3 2025-05-07T20:03:44.3584965Z U __cxa_thread_atexit@CXXABI_1.3.7 2025-05-07T20:03:44.3585075Z U __gxx_personality_v0@CXXABI_1.3 2025-05-07T20:03:44.3585178Z U __tls_get_addr@GLIBC_2.3 2025-05-07T20:03:44.3585838Z U at::_ops::empty_like::call(at::Tensor const&, std::optional, std::optional, std::optional, std::optional, std::optional) 2025-05-07T20:03:44.3586547Z U at::_ops::empty_memory_format::call(c10::ArrayRef, std::optional, std::optional, std::optional, std::optional, std::optional) 2025-05-07T20:03:44.3586721Z U 
c10::DeviceTypeName[abi:cxx11](c10::DeviceType, bool) 2025-05-07T20:03:44.3586862Z U c10::IValue::reportToTensorTypeError() const 2025-05-07T20:03:44.3586965Z U c10::IntType::get() 2025-05-07T20:03:44.3587146Z U c10::MessageLogger::MessageLogger(char const*, int, int) 2025-05-07T20:03:44.3587267Z U c10::MessageLogger::~MessageLogger() 2025-05-07T20:03:44.3587489Z U c10::OptionalType::get(c10::Type::SingletonOrSharedTypePtr) 2025-05-07T20:03:44.3587911Z U c10::SymInt::SymInt(c10::intrusive_ptr >) 2025-05-07T20:03:44.3588046Z U c10::SymInt::guard_int(char const*, long) const 2025-05-07T20:03:44.3588164Z U c10::SymInt::promote_to_negative() 2025-05-07T20:03:44.3588281Z U c10::TensorType::get() 2025-05-07T20:03:44.3588402Z U c10::UndefinedTensorImpl::_singleton 2025-05-07T20:03:44.3589129Z U c10::Warning::Warning(std::variant, c10::SourceLocation const&, std::__cxx11::basic_string, std::allocator >, bool) 2025-05-07T20:03:44.3589284Z U c10::cuda::CUDACachingAllocator::allocator 2025-05-07T20:03:44.3589398Z U c10::cuda::CUDAStream::stream() const 2025-05-07T20:03:44.3589518Z U c10::cuda::ExchangeDevice(signed char) 2025-05-07T20:03:44.3589654Z U c10::cuda::GetDevice(signed char*) 2025-05-07T20:03:44.3589771Z U c10::cuda::MaybeSetDevice(signed char) 2025-05-07T20:03:44.3589882Z U c10::cuda::SetDevice(signed char) 2025-05-07T20:03:44.3590159Z U c10::cuda::c10_cuda_check_implementation(int, char const*, char const*, int, bool) 2025-05-07T20:03:44.3590260Z U c10::cuda::device_count() 2025-05-07T20:03:44.3590397Z U c10::cuda::getCurrentCUDAStream(signed char) 2025-05-07T20:03:44.3590550Z U c10::cuda::getDefaultCUDAStream(signed char) 2025-05-07T20:03:44.3590768Z U c10::cuda::getStreamFromPool(bool, signed char) 2025-05-07T20:03:44.3590905Z U c10::cuda::getStreamFromPool(int, signed char) 2025-05-07T20:03:44.3591066Z U c10::cuda::setCurrentCUDAStream(c10::cuda::CUDAStream) 2025-05-07T20:03:44.3591193Z U c10::cuda::warn_or_error_on_sync() 2025-05-07T20:03:44.3591727Z U 
c10::detail::infer_schema::make_function_schema(c10::ArrayRef, c10::ArrayRef) 2025-05-07T20:03:44.3592011Z U c10::detail::torchCheckFail(char const*, char const*, unsigned int, char const*) 2025-05-07T20:03:44.3592520Z U c10::detail::torchCheckFail(char const*, char const*, unsigned int, std::__cxx11::basic_string, std::allocator > const&) 2025-05-07T20:03:44.3593015Z U c10::detail::torchInternalAssertFail(char const*, char const*, unsigned int, char const*, char const*) 2025-05-07T20:03:44.3593641Z U c10::detail::torchInternalAssertFail(char const*, char const*, unsigned int, char const*, std::__cxx11::basic_string, std::allocator > const&) 2025-05-07T20:03:44.3593762Z U c10::impl::GPUTrace::gpuTraceState 2025-05-07T20:03:44.3593909Z U c10::impl::GPUTrace::haveState 2025-05-07T20:03:44.3594057Z U c10::impl::device_guard_impl_registry 2025-05-07T20:03:44.3594207Z U c10::operator<<(std::ostream&, c10::Device const&) 2025-05-07T20:03:44.3594347Z U c10::operator<<(std::ostream&, c10::DeviceType) 2025-05-07T20:03:44.3594471Z U c10::warn(c10::Warning const&) 2025-05-07T20:03:44.3594698Z U caffe2::TypeMeta::error_unsupported_typemeta(caffe2::TypeMeta) 2025-05-07T20:03:44.3594835Z U cudaDeviceSynchronize@libcudart.so.11.0 2025-05-07T20:03:44.3594977Z U cudaEventCreateWithFlags@libcudart.so.11.0 2025-05-07T20:03:44.3595120Z U cudaEventDestroy@libcudart.so.11.0 2025-05-07T20:03:44.3595248Z U cudaEventElapsedTime@libcudart.so.11.0 2025-05-07T20:03:44.3595365Z U cudaEventQuery@libcudart.so.11.0 2025-05-07T20:03:44.3595503Z U cudaEventRecord@libcudart.so.11.0 2025-05-07T20:03:44.3595628Z U cudaEventSynchronize@libcudart.so.11.0 2025-05-07T20:03:44.3595764Z U cudaGetDeviceProperties@libcudart.so.11.0 2025-05-07T20:03:44.3595898Z U cudaGetErrorString@libcudart.so.11.0 2025-05-07T20:03:44.3596011Z U cudaGetLastError@libcudart.so.11.0 2025-05-07T20:03:44.3596130Z U cudaLaunchKernel@libcudart.so.11.0 2025-05-07T20:03:44.3596251Z U cudaStreamQuery@libcudart.so.11.0 
2025-05-07T20:03:44.3596408Z U cudaStreamSynchronize@libcudart.so.11.0 2025-05-07T20:03:44.3596530Z U cudaStreamWaitEvent@libcudart.so.11.0 2025-05-07T20:03:44.3596683Z U int* at::TensorBase::mutable_data_ptr() const 2025-05-07T20:03:44.3596883Z U long c10::detail::maybe_wrap_dim_slow(long, long, bool) 2025-05-07T20:03:44.3597038Z U long* at::TensorBase::mutable_data_ptr() const 2025-05-07T20:03:44.3597142Z U memcpy@GLIBC_2.14 2025-05-07T20:03:44.3597259Z U memset@GLIBC_2.2.5 2025-05-07T20:03:44.3597380Z U operator delete(void*)@GLIBCXX_3.4 2025-05-07T20:03:44.3597505Z U operator new(unsigned long)@GLIBCXX_3.4 2025-05-07T20:03:44.3597880Z U std::__cxx11::basic_ostringstream, std::allocator >::basic_ostringstream() 2025-05-07T20:03:44.3598284Z U std::__cxx11::basic_ostringstream, std::allocator >::~basic_ostringstream()@GLIBCXX_3.4.21 2025-05-07T20:03:44.3598737Z U std::__cxx11::basic_string, std::allocator >::_M_append(char const*, unsigned long)@GLIBCXX_3.4.21 2025-05-07T20:03:44.3599302Z U std::__cxx11::basic_string, std::allocator >::_M_assign(std::__cxx11::basic_string, std::allocator > const&) 2025-05-07T20:03:44.3599699Z U std::__cxx11::basic_string, std::allocator >::_M_construct(unsigned long, char)@GLIBCXX_3.4.21 2025-05-07T20:03:44.3600133Z U std::__cxx11::basic_string, std::allocator >::_M_create(unsigned long&, unsigned long)@GLIBCXX_3.4.21 2025-05-07T20:03:44.3600618Z U std::__cxx11::basic_string, std::allocator >::_M_mutate(unsigned long, unsigned long, char const*, unsigned long) 2025-05-07T20:03:44.3601138Z U std::__cxx11::basic_string, std::allocator >::_M_replace(unsigned long, unsigned long, char const*, unsigned long)@GLIBCXX_3.4.21 2025-05-07T20:03:44.3601501Z U std::__cxx11::basic_stringstream, std::allocator >::basic_stringstream() 2025-05-07T20:03:44.3601911Z U std::__detail::_Prime_rehash_policy::_M_need_rehash(unsigned long, unsigned long, unsigned long) const@GLIBCXX_3.4.18 2025-05-07T20:03:44.3602249Z U 
std::__throw_bad_alloc()@GLIBCXX_3.4 2025-05-07T20:03:44.3602370Z U std::__throw_bad_array_new_length() 2025-05-07T20:03:44.3602521Z U std::__throw_length_error(char const*)@GLIBCXX_3.4 2025-05-07T20:03:44.3602706Z U std::__throw_logic_error(char const*)@GLIBCXX_3.4 2025-05-07T20:03:44.3602899Z U std::__throw_out_of_range_fmt(char const*, ...)@GLIBCXX_3.4.20 2025-05-07T20:03:44.3603151Z U std::basic_ios >::clear(std::_Ios_Iostate)@GLIBCXX_3.4 2025-05-07T20:03:44.3603757Z U std::basic_ostream >& std::__ostream_insert >(std::basic_ostream >&, char const*, long)@GLIBCXX_3.4.9 2025-05-07T20:03:44.3603881Z U std::ios_base::Init::Init()@GLIBCXX_3.4 2025-05-07T20:03:44.3604005Z U std::ios_base::Init::~Init()@GLIBCXX_3.4 2025-05-07T20:03:44.3604129Z U std::ios_base::~ios_base()@GLIBCXX_3.4 2025-05-07T20:03:44.3604248Z U std::locale::~locale()@GLIBCXX_3.4 2025-05-07T20:03:44.3604440Z U std::ostream& std::ostream::_M_insert(long)@GLIBCXX_3.4.9 2025-05-07T20:03:44.3604725Z U std::ostream& std::ostream::_M_insert(unsigned long)@GLIBCXX_3.4.9 2025-05-07T20:03:44.3604864Z U std::ostream::operator<<(int)@GLIBCXX_3.4 2025-05-07T20:03:44.3604985Z U std::terminate()@GLIBCXX_3.4 2025-05-07T20:03:44.3605097Z U strlen@GLIBC_2.2.5 2025-05-07T20:03:44.3605252Z U torch::CppFunction::~CppFunction() 2025-05-07T20:03:44.3605864Z U torch::Library::Library(torch::Library::Kind, std::__cxx11::basic_string, std::allocator >, std::optional, char const*, unsigned int) 2025-05-07T20:03:44.3606371Z U torch::Library::_def(c10::FunctionSchema&&, c10::OperatorName*, std::vector > const&, torch::_RegisterOrVerify) & 2025-05-07T20:03:44.3606646Z U torch::Library::_impl(char const*, torch::CppFunction&&, torch::_RegisterOrVerify) & 2025-05-07T20:03:44.3607022Z U torch::jit::parseSchema(std::__cxx11::basic_string, std::allocator > const&, bool) 2025-05-07T20:03:44.3607259Z U unsigned char* at::TensorBase::mutable_data_ptr() const 2025-05-07T20:03:44.3619761Z U vtable for 
__cxxabiv1::__class_type_info@CXXABI_1.3 2025-05-07T20:03:44.3620049Z U vtable for __cxxabiv1::__function_type_info@CXXABI_1.3 2025-05-07T20:03:44.3620224Z U vtable for __cxxabiv1::__si_class_type_info@CXXABI_1.3 2025-05-07T20:03:44.3620760Z U vtable for std::__cxx11::basic_stringbuf, std::allocator >@GLIBCXX_3.4.21 2025-05-07T20:03:44.3621001Z U vtable for std::basic_streambuf >@GLIBCXX_3.4 2025-05-07T20:03:44.3621134Z w _ITM_deregisterTMCloneTable 2025-05-07T20:03:44.3621252Z w _ITM_registerTMCloneTable 2025-05-07T20:03:44.3621361Z w __cxa_finalize@GLIBC_2.2.5 2025-05-07T20:03:44.3621458Z w __gmon_start__ 2025-05-07T20:03:44.3621560Z w __pthread_key_create 2025-05-07T20:03:44.3621707Z [CHECK] Listing out external shared libraries linked: 2025-05-07T20:03:44.3621965Z + ldd ./_skbuild/linux-x86_64-3.13/cmake-build/fbgemm_gpu_embedding_inplace_ops.so 2025-05-07T20:03:44.3621971Z 2025-05-07T20:03:44.3622259Z linux-vdso.so.1 (0x00007ffdc85a0000) 2025-05-07T20:03:44.3622376Z libtorch.so => not found 2025-05-07T20:03:44.3622676Z libc10.so => not found 2025-05-07T20:03:44.3622812Z libc10_cuda.so => not found 2025-05-07T20:03:44.3622909Z libtorch_cpu.so => not found 2025-05-07T20:03:44.3623147Z libtorch_cuda.so => not found 2025-05-07T20:03:44.3623259Z libcudart.so.11.0 => not found 2025-05-07T20:03:44.3623424Z libstdc++.so.6 => /lib64/libstdc++.so.6 (0x00007f2720803000) 2025-05-07T20:03:44.3623634Z libgomp.so.1 => /lib64/libgomp.so.1 (0x00007f27207ad000) 2025-05-07T20:03:44.3623783Z libgcc_s.so.1 => /lib64/libgcc_s.so.1 (0x00007f272077f000) 2025-05-07T20:03:44.3623940Z libc.so.6 => /lib64/libc.so.6 (0x00007f2720577000) 2025-05-07T20:03:44.3624071Z /lib64/ld-linux-x86-64.so.2 (0x00007f2720ac0000) 2025-05-07T20:03:44.3624191Z libm.so.6 => /lib64/libm.so.6 (0x00007f272049c000) 2025-05-07T20:03:44.3624216Z 2025-05-07T20:03:44.3624328Z [CHECK] Displaying ELF information: 2025-05-07T20:03:44.3624606Z + readelf -d 
./_skbuild/linux-x86_64-3.13/cmake-build/fbgemm_gpu_embedding_inplace_ops.so 2025-05-07T20:03:44.3624611Z 2025-05-07T20:03:44.3656084Z 2025-05-07T20:03:44.3656829Z Dynamic section at offset 0x50440 contains 37 entries: 2025-05-07T20:03:44.3657215Z Tag Type Name/Value 2025-05-07T20:03:44.3657883Z 0x0000000000000001 (NEEDED) Shared library: [libtorch.so] 2025-05-07T20:03:44.3658460Z 0x0000000000000001 (NEEDED) Shared library: [libc10.so] 2025-05-07T20:03:44.3659035Z 0x0000000000000001 (NEEDED) Shared library: [libc10_cuda.so] 2025-05-07T20:03:44.3659632Z 0x0000000000000001 (NEEDED) Shared library: [libtorch_cpu.so] 2025-05-07T20:03:44.3660225Z 0x0000000000000001 (NEEDED) Shared library: [libtorch_cuda.so] 2025-05-07T20:03:44.3660856Z 0x0000000000000001 (NEEDED) Shared library: [libcudart.so.11.0] 2025-05-07T20:03:44.3661452Z 0x0000000000000001 (NEEDED) Shared library: [libstdc++.so.6] 2025-05-07T20:03:44.3662029Z 0x0000000000000001 (NEEDED) Shared library: [libgomp.so.1] 2025-05-07T20:03:44.3662614Z 0x0000000000000001 (NEEDED) Shared library: [libgcc_s.so.1] 2025-05-07T20:03:44.3663166Z 0x0000000000000001 (NEEDED) Shared library: [libc.so.6] 2025-05-07T20:03:44.3663793Z 0x0000000000000001 (NEEDED) Shared library: [ld-linux-x86-64.so.2] 2025-05-07T20:03:44.3664574Z 0x000000000000000e (SONAME) Library soname: [fbgemm_gpu_embedding_inplace_ops.so] 2025-05-07T20:03:44.3664896Z 0x000000000000000c (INIT) 0x10000 2025-05-07T20:03:44.3665209Z 0x000000000000000d (FINI) 0x2fa7c 2025-05-07T20:03:44.3665535Z 0x0000000000000019 (INIT_ARRAY) 0x50bf8 2025-05-07T20:03:44.3665882Z 0x000000000000001b (INIT_ARRAYSZ) 40 (bytes) 2025-05-07T20:03:44.3666363Z 0x000000000000001a (FINI_ARRAY) 0x50c20 2025-05-07T20:03:44.3666473Z 0x000000000000001c (FINI_ARRAYSZ) 8 (bytes) 2025-05-07T20:03:44.3666587Z 0x000000006ffffef5 (GNU_HASH) 0x238 2025-05-07T20:03:44.3666685Z 0x0000000000000005 (STRTAB) 0x2e30 2025-05-07T20:03:44.3666782Z 0x0000000000000006 (SYMTAB) 0xd60 2025-05-07T20:03:44.3666921Z 
0x000000000000000a (STRSZ) 35916 (bytes) 2025-05-07T20:03:44.3667033Z 0x000000000000000b (SYMENT) 24 (bytes) 2025-05-07T20:03:44.3667146Z 0x0000000000000003 (PLTGOT) 0x516e0 2025-05-07T20:03:44.3667268Z 0x0000000000000002 (PLTRELSZ) 5544 (bytes) 2025-05-07T20:03:44.3667372Z 0x0000000000000014 (PLTREL) RELA 2025-05-07T20:03:44.3667480Z 0x0000000000000017 (JMPREL) 0xdc00 2025-05-07T20:03:44.3667581Z 0x0000000000000007 (RELA) 0xbe48 2025-05-07T20:03:44.3667709Z 0x0000000000000008 (RELASZ) 7608 (bytes) 2025-05-07T20:03:44.3667822Z 0x0000000000000009 (RELAENT) 24 (bytes) 2025-05-07T20:03:44.3667910Z 0x0000000000000018 (BIND_NOW) 2025-05-07T20:03:44.3668038Z 0x000000006ffffffb (FLAGS_1) Flags: NOW 2025-05-07T20:03:44.3668143Z 0x000000006ffffffe (VERNEED) 0xbd38 2025-05-07T20:03:44.3668240Z 0x000000006fffffff (VERNEEDNUM) 5 2025-05-07T20:03:44.3668385Z 0x000000006ffffff0 (VERSYM) 0xba7c 2025-05-07T20:03:44.3668501Z 0x000000006ffffff9 (RELACOUNT) 152 2025-05-07T20:03:44.3668597Z 0x0000000000000000 (NULL) 0x0 2025-05-07T20:03:44.3668662Z 2025-05-07T20:03:44.3668773Z ################################################################################ 2025-05-07T20:03:44.3668779Z 2025-05-07T20:03:44.3668794Z 2025-05-07T20:03:44.3669093Z ################################################################################ 2025-05-07T20:03:44.3669512Z [CHECK] BUILT LIBRARY: ./_skbuild/linux-x86_64-3.13/cmake-build/fbgemm_gpu_tbe_training_backward.so 2025-05-07T20:03:44.3669615Z [CHECK] Listing out library size: 2025-05-07T20:03:44.3669936Z + du -h --block-size=1M ./_skbuild/linux-x86_64-3.13/cmake-build/fbgemm_gpu_tbe_training_backward.so 2025-05-07T20:03:44.3669940Z 2025-05-07T20:03:44.3670408Z 492 ./_skbuild/linux-x86_64-3.13/cmake-build/fbgemm_gpu_tbe_training_backward.so 2025-05-07T20:03:44.3670413Z 2025-05-07T20:03:44.3671057Z [CHECK] Listing out the GLIBC versions referenced by: ./_skbuild/linux-x86_64-3.13/cmake-build/fbgemm_gpu_tbe_training_backward.so 
2025-05-07T20:03:44.3671603Z + objdump -TC ./_skbuild/linux-x86_64-3.13/cmake-build/fbgemm_gpu_tbe_training_backward.so | grep GLIBC_ | sed 's/.*GLIBC_\([.0-9]*\).*/GLIBC_\1/g' | sort -Vu | cat 2025-05-07T20:03:44.3671608Z 2025-05-07T20:03:44.5633446Z GLIBC_2.2.5 2025-05-07T20:03:44.5633698Z GLIBC_2.3 2025-05-07T20:03:44.5633913Z GLIBC_2.14 2025-05-07T20:03:44.5634068Z 2025-05-07T20:03:44.5634073Z 2025-05-07T20:03:44.5634548Z [CHECK] Listing out the GLIBCXX versions referenced by: ./_skbuild/linux-x86_64-3.13/cmake-build/fbgemm_gpu_tbe_training_backward.so 2025-05-07T20:03:44.5635698Z + objdump -TC ./_skbuild/linux-x86_64-3.13/cmake-build/fbgemm_gpu_tbe_training_backward.so | grep GLIBCXX_ | sed 's/.*GLIBCXX_\([.0-9]*\).*/GLIBCXX_\1/g' | sort -Vu | cat 2025-05-07T20:03:44.5636373Z 2025-05-07T20:03:44.7597752Z GLIBCXX_3.4 2025-05-07T20:03:44.7597998Z GLIBCXX_3.4.9 2025-05-07T20:03:44.7598262Z GLIBCXX_3.4.11 2025-05-07T20:03:44.7598482Z GLIBCXX_3.4.14 2025-05-07T20:03:44.7598726Z GLIBCXX_3.4.15 2025-05-07T20:03:44.7598939Z GLIBCXX_3.4.18 2025-05-07T20:03:44.7599128Z GLIBCXX_3.4.20 2025-05-07T20:03:44.7599350Z GLIBCXX_3.4.21 2025-05-07T20:03:44.7603783Z 2025-05-07T20:03:44.7603788Z 2025-05-07T20:03:44.7626176Z + nm -gDC ./_skbuild/linux-x86_64-3.13/cmake-build/fbgemm_gpu_tbe_training_backward.so > /tmp/tmp.pADFcgTq6p.symbols.txt 2025-05-07T20:03:44.7627781Z 2025-05-07T20:03:44.9570897Z 2025-05-07T20:03:44.9652407Z [CHECK] Total Number of symbols: 12554 2025-05-07T20:03:44.9742736Z [CHECK] Number of fbgemm symbols: 2318 2025-05-07T20:03:44.9758030Z + nm -gDCu ./_skbuild/linux-x86_64-3.13/cmake-build/fbgemm_gpu_tbe_training_backward.so > /tmp/tmp.qDvwE1CgAJ.usymbols.txt 2025-05-07T20:03:44.9759635Z 2025-05-07T20:03:44.9826779Z 2025-05-07T20:03:44.9853569Z [CHECK] Listing out undefined symbols (280 total): 2025-05-07T20:03:44.9874612Z U VTT for std::__cxx11::basic_ostringstream, std::allocator >@GLIBCXX_3.4.21 2025-05-07T20:03:44.9875512Z U VTT for 
std::__cxx11::basic_stringstream, std::allocator >@GLIBCXX_3.4.21 2025-05-07T20:03:44.9876121Z U _Unwind_Resume@GCC_3.0 2025-05-07T20:03:44.9876496Z U __cudaPopCallConfiguration@libcudart.so.11.0 2025-05-07T20:03:44.9876947Z U __cudaPushCallConfiguration@libcudart.so.11.0 2025-05-07T20:03:44.9877328Z U __cudaRegisterFatBinary@libcudart.so.11.0 2025-05-07T20:03:44.9877716Z U __cudaRegisterFatBinaryEnd@libcudart.so.11.0 2025-05-07T20:03:44.9878107Z U __cudaRegisterFunction@libcudart.so.11.0 2025-05-07T20:03:44.9878465Z U __cudaRegisterVar@libcudart.so.11.0 2025-05-07T20:03:44.9878853Z U __cudaUnregisterFatBinary@libcudart.so.11.0 2025-05-07T20:03:44.9879224Z U __cxa_allocate_exception@CXXABI_1.3 2025-05-07T20:03:44.9879765Z U __cxa_atexit@GLIBC_2.2.5 2025-05-07T20:03:44.9880086Z U __cxa_begin_catch@CXXABI_1.3 2025-05-07T20:03:44.9880417Z U __cxa_end_catch@CXXABI_1.3 2025-05-07T20:03:44.9880735Z U __cxa_free_exception@CXXABI_1.3 2025-05-07T20:03:44.9881069Z U __cxa_guard_abort@CXXABI_1.3 2025-05-07T20:03:44.9881461Z U __cxa_guard_acquire@CXXABI_1.3 2025-05-07T20:03:44.9881775Z U __cxa_guard_release@CXXABI_1.3 2025-05-07T20:03:44.9882106Z U __cxa_rethrow@CXXABI_1.3 2025-05-07T20:03:44.9882421Z U __cxa_thread_atexit@CXXABI_1.3.7 2025-05-07T20:03:44.9882750Z U __cxa_throw@CXXABI_1.3 2025-05-07T20:03:44.9883056Z U __gxx_personality_v0@CXXABI_1.3 2025-05-07T20:03:44.9883388Z U __tls_get_addr@GLIBC_2.3 2025-05-07T20:03:44.9883713Z U at::CUDAGeneratorImpl::device_type() 2025-05-07T20:03:44.9884125Z U at::CUDAGeneratorImpl::philox_cuda_state(unsigned long) 2025-05-07T20:03:44.9884565Z U at::Context::deterministicAlgorithms() const 2025-05-07T20:03:44.9884985Z U at::RecordFunction::RecordFunction(at::StepCallbacks&&) 2025-05-07T20:03:44.9885408Z U at::RecordFunction::currentThreadId() 2025-05-07T20:03:44.9885771Z U at::RecordFunction::~RecordFunction() 2025-05-07T20:03:44.9886166Z U at::SavedTensorDefaultHooks::set_tracing(bool) 2025-05-07T20:03:44.9886616Z U 
at::TensorBase::__dispatch_contiguous(c10::MemoryFormat) const 2025-05-07T20:03:44.9887249Z U at::_ops::clamp::call(at::Tensor const&, std::optional const&, std::optional const&) 2025-05-07T20:03:44.9887860Z U at::_ops::copy_::call(at::Tensor&, at::Tensor const&, bool) 2025-05-07T20:03:44.9888847Z U at::_ops::empty_like::call(at::Tensor const&, std::optional, std::optional, std::optional, std::optional, std::optional) 2025-05-07T20:03:44.9890182Z U at::_ops::empty_memory_format::call(c10::ArrayRef, std::optional, std::optional, std::optional, std::optional, std::optional) 2025-05-07T20:03:44.9891148Z U at::_ops::reshape::call(at::Tensor const&, c10::ArrayRef) 2025-05-07T20:03:44.9892203Z U at::_ops::sparse_coo_tensor_indices_size::call(at::Tensor const&, at::Tensor const&, c10::ArrayRef, std::optional, std::optional, std::optional, std::optional, std::optional) 2025-05-07T20:03:44.9893458Z U at::_ops::to_dtype::call(at::Tensor const&, c10::ScalarType, bool, bool, std::optional) 2025-05-07T20:03:44.9894020Z U at::_ops::unsqueeze::call(at::Tensor const&, long) 2025-05-07T20:03:44.9894458Z U at::_ops::view_as::call(at::Tensor const&, at::Tensor const&) 2025-05-07T20:03:44.9895226Z U at::_ops::zeros::call(c10::ArrayRef, std::optional, std::optional, std::optional, std::optional) 2025-05-07T20:03:44.9896385Z U at::_ops::zeros_like::call(at::Tensor const&, std::optional, std::optional, std::optional, std::optional, std::optional) 2025-05-07T20:03:44.9897248Z U at::cuda::detail::getDefaultCUDAGenerator(signed char) 2025-05-07T20:03:44.9897675Z U at::cuda::getCurrentDeviceProperties() 2025-05-07T20:03:44.9898031Z U at::functorch::functorchTLSAccessor() 2025-05-07T20:03:44.9898426Z U at::getStepCallbacksUnlessEmpty(at::RecordScope) 2025-05-07T20:03:44.9898787Z U at::get_thread_num() 2025-05-07T20:03:44.9899122Z U at::globalContext() 2025-05-07T20:03:44.9899440Z U at::internal::set_thread_num(int) 2025-05-07T20:03:44.9899778Z U at::sequence_number::get_and_increment() 
2025-05-07T20:03:44.9900184Z U at::tensor(c10::ArrayRef, c10::TensorOptions const&) 2025-05-07T20:03:44.9900638Z U at::toAccumulateType(c10::ScalarType, bool) 2025-05-07T20:03:44.9900987Z U bcmp@GLIBC_2.2.5 2025-05-07T20:03:44.9901274Z U c10::AnyType::get() 2025-05-07T20:03:44.9901679Z U c10::BFloat16* at::TensorBase::mutable_data_ptr() const 2025-05-07T20:03:44.9902502Z U c10::BoolType::get() 2025-05-07T20:03:44.9902869Z U c10::DeviceTypeName[abi:cxx11](c10::DeviceType, bool) 2025-05-07T20:03:44.9903366Z U c10::Dispatcher::findSchemaOrThrow(char const*, char const*) 2025-05-07T20:03:44.9903775Z U c10::Dispatcher::realSingleton() 2025-05-07T20:03:44.9904561Z U c10::Dispatcher::runRecordFunction(at::RecordFunction&, std::reference_wrapper, c10::DispatchKey, c10::DispatchKeySet) 2025-05-07T20:03:44.9905866Z U c10::Dispatcher::runRecordFunction(at::RecordFunction&, std::reference_wrapper, c10::DispatchKey, c10::DispatchKeySet, c10::ArrayRef) 2025-05-07T20:03:44.9907014Z U c10::Error::Error(c10::SourceLocation, std::__cxx11::basic_string, std::allocator >) 2025-05-07T20:03:44.9907627Z U c10::Error::what() const 2025-05-07T20:03:44.9907960Z U c10::FloatType::get() 2025-05-07T20:03:44.9908279Z U c10::GeneratorImpl::device() const 2025-05-07T20:03:44.9908615Z U c10::GradMode::is_enabled() 2025-05-07T20:03:44.9908943Z U c10::GradMode::set_enabled(bool) 2025-05-07T20:03:44.9909339Z U c10::Half* at::TensorBase::data_ptr() const 2025-05-07T20:03:44.9909773Z U c10::Half* at::TensorBase::mutable_data_ptr() const 2025-05-07T20:03:44.9910235Z U c10::IValue::TagType::get(c10::IValue const&) 2025-05-07T20:03:44.9910626Z U c10::IValue::hash(c10::IValue const&) 2025-05-07T20:03:44.9910965Z U c10::IValue::isBoolList() const 2025-05-07T20:03:44.9911306Z U c10::IValue::isIntList() const 2025-05-07T20:03:44.9911648Z U c10::IValue::isSymIntList() const 2025-05-07T20:03:44.9912097Z U c10::IValue::isTensorList() const 2025-05-07T20:03:44.9912469Z U c10::IValue::reportToTensorTypeError() 
const 2025-05-07T20:03:44.9912934Z U c10::IntType::get() 2025-05-07T20:03:44.9913322Z U c10::MessageLogger::MessageLogger(char const*, int, int) 2025-05-07T20:03:44.9913737Z U c10::MessageLogger::~MessageLogger() 2025-05-07T20:03:44.9914107Z U c10::OperatorHandle::~OperatorHandle() 2025-05-07T20:03:44.9914474Z U c10::OperatorHandle::~OperatorHandle() 2025-05-07T20:03:44.9914958Z U c10::OptionalType::get(c10::Type::SingletonOrSharedTypePtr) 2025-05-07T20:03:44.9915457Z U c10::ParallelGuard::ParallelGuard(bool) 2025-05-07T20:03:44.9915814Z U c10::ParallelGuard::~ParallelGuard() 2025-05-07T20:03:44.9916349Z U c10::SmallVectorBase::grow_pod(void const*, unsigned long, unsigned long) 2025-05-07T20:03:44.9916913Z U c10::StorageImpl::throw_data_ptr_access_error() const 2025-05-07T20:03:44.9917314Z U c10::StringType::get() 2025-05-07T20:03:44.9917671Z U c10::SymBool::guard_bool(char const*, long) const 2025-05-07T20:03:44.9918091Z U c10::SymFloat::guard_float(char const*, long) const 2025-05-07T20:03:44.9918828Z U c10::SymInt::SymInt(c10::intrusive_ptr >) 2025-05-07T20:03:44.9919502Z U c10::SymInt::guard_int(char const*, long) const 2025-05-07T20:03:44.9919897Z U c10::SymInt::promote_to_negative() 2025-05-07T20:03:44.9920274Z U c10::SymInt::toSymNode() const 2025-05-07T20:03:44.9920611Z U c10::SymIntType::get() 2025-05-07T20:03:44.9920997Z U c10::SymbolicShapeMeta::init_is_contiguous() const 2025-05-07T20:03:44.9921396Z U c10::TensorImpl::requires_grad() const 2025-05-07T20:03:44.9921812Z U c10::TensorImpl::throw_data_ptr_access_error() const 2025-05-07T20:03:44.9922200Z U c10::TensorType::get() 2025-05-07T20:03:44.9922556Z U c10::UndefinedTensorImpl::_singleton 2025-05-07T20:03:44.9923615Z U c10::Warning::Warning(std::variant, c10::SourceLocation const&, std::__cxx11::basic_string, std::allocator >, bool) 2025-05-07T20:03:44.9924624Z U c10::cuda::CUDACachingAllocator::allocator 2025-05-07T20:03:44.9924989Z U c10::cuda::CUDAStream::stream() const 
2025-05-07T20:03:44.9925487Z U c10::cuda::ExchangeDevice(signed char) 2025-05-07T20:03:44.9925824Z U c10::cuda::GetDevice(signed char*) 2025-05-07T20:03:44.9926151Z U c10::cuda::MaybeSetDevice(signed char) 2025-05-07T20:03:44.9926496Z U c10::cuda::SetDevice(signed char) 2025-05-07T20:03:44.9927124Z U c10::cuda::c10_cuda_check_implementation(int, char const*, char const*, int, bool) 2025-05-07T20:03:44.9927617Z U c10::cuda::device_count() 2025-05-07T20:03:44.9927976Z U c10::cuda::getCurrentCUDAStream(signed char) 2025-05-07T20:03:44.9928380Z U c10::cuda::getDefaultCUDAStream(signed char) 2025-05-07T20:03:44.9928968Z U c10::cuda::getStreamFromPool(bool, signed char) 2025-05-07T20:03:44.9929362Z U c10::cuda::getStreamFromPool(int, signed char) 2025-05-07T20:03:44.9929800Z U c10::cuda::setCurrentCUDAStream(c10::cuda::CUDAStream) 2025-05-07T20:03:44.9930259Z U c10::cuda::warn_or_error_on_sync() 2025-05-07T20:03:44.9930960Z U c10::detail::ListImpl::ListImpl(std::vector >, c10::Type::SingletonOrSharedTypePtr) 2025-05-07T20:03:44.9932170Z U c10::detail::infer_schema::make_function_schema(c10::ArrayRef, c10::ArrayRef) 2025-05-07T20:03:44.9933070Z U c10::detail::torchCheckFail(char const*, char const*, unsigned int, char const*) 2025-05-07T20:03:44.9933964Z U c10::detail::torchCheckFail(char const*, char const*, unsigned int, std::__cxx11::basic_string, std::allocator > const&) 2025-05-07T20:03:44.9934944Z U c10::detail::torchInternalAssertFail(char const*, char const*, unsigned int, char const*, char const*) 2025-05-07T20:03:44.9935991Z U c10::detail::torchInternalAssertFail(char const*, char const*, unsigned int, char const*, std::__cxx11::basic_string, std::allocator > const&) 2025-05-07T20:03:44.9936834Z U c10::impl::GPUTrace::gpuTraceState 2025-05-07T20:03:44.9937192Z U c10::impl::GPUTrace::haveState 2025-05-07T20:03:44.9937740Z U c10::impl::OperatorEntry::assertSignatureIsCorrect(c10::impl::CppSignature const&, bool) const 2025-05-07T20:03:44.9938400Z U 
c10::impl::OperatorEntry::reportError(c10::DispatchKey) const 2025-05-07T20:03:44.9938871Z U c10::impl::cow::is_cow_data_ptr(c10::DataPtr const&) 2025-05-07T20:03:44.9939356Z U c10::impl::cow::materialize_cow_storage(c10::StorageImpl&) 2025-05-07T20:03:44.9939789Z U c10::impl::device_guard_impl_registry 2025-05-07T20:03:44.9940135Z U c10::impl::raw_local_dispatch_key_set 2025-05-07T20:03:44.9940533Z U c10::initializeFunctionalityOffsetsAndMasks() 2025-05-07T20:03:44.9941208Z U c10::ivalue::ConstantString::create(std::__cxx11::basic_string, std::allocator >) 2025-05-07T20:03:44.9941839Z U c10::operator-(c10::SymInt const&, int) 2025-05-07T20:03:44.9942220Z U c10::operator<<(std::ostream&, c10::Device const&) 2025-05-07T20:03:44.9942633Z U c10::operator<<(std::ostream&, c10::DeviceType) 2025-05-07T20:03:44.9943064Z U c10::operator<<(std::ostream&, c10::OperatorName const&) 2025-05-07T20:03:44.9943488Z U c10::operator<<(std::ostream&, c10::SymInt const&) 2025-05-07T20:03:44.9943893Z U c10::operator<=(c10::SymInt const&, int) 2025-05-07T20:03:44.9944249Z U c10::operator>(c10::SymInt const&, int) 2025-05-07T20:03:44.9944632Z U c10::operator>=(c10::SymInt const&, int) 2025-05-07T20:03:44.9945000Z U c10::throwNullDataPtrError() 2025-05-07T20:03:44.9945326Z U c10::warn(c10::Warning const&) 2025-05-07T20:03:44.9945670Z U c10::warnDeprecatedDataPtr() 2025-05-07T20:03:44.9946195Z U caffe2::TypeMeta::error_unsupported_typemeta(caffe2::TypeMeta) 2025-05-07T20:03:44.9946734Z U caffe2::TypeMeta::typeMetaDatas() 2025-05-07T20:03:44.9947065Z U cudaDeviceGetAttribute@libcudart.so.11.0 2025-05-07T20:03:44.9947437Z U cudaDeviceSynchronize@libcudart.so.11.0 2025-05-07T20:03:44.9947805Z U cudaEventCreateWithFlags@libcudart.so.11.0 2025-05-07T20:03:44.9948150Z U cudaEventDestroy@libcudart.so.11.0 2025-05-07T20:03:44.9948500Z U cudaEventElapsedTime@libcudart.so.11.0 2025-05-07T20:03:44.9948834Z U cudaEventQuery@libcudart.so.11.0 2025-05-07T20:03:44.9949179Z U 
cudaEventRecord@libcudart.so.11.0 2025-05-07T20:03:44.9949516Z U cudaEventSynchronize@libcudart.so.11.0 2025-05-07T20:03:44.9949887Z U cudaFuncSetAttribute@libcudart.so.11.0 2025-05-07T20:03:44.9950257Z U cudaGetDeviceProperties@libcudart.so.11.0 2025-05-07T20:03:44.9950602Z U cudaGetErrorString@libcudart.so.11.0 2025-05-07T20:03:44.9951009Z U cudaGetLastError@libcudart.so.11.0 2025-05-07T20:03:44.9951340Z U cudaLaunchKernel@libcudart.so.11.0 2025-05-07T20:03:44.9951667Z U cudaStreamQuery@libcudart.so.11.0 2025-05-07T20:03:44.9951995Z U cudaStreamSynchronize@libcudart.so.11.0 2025-05-07T20:03:44.9952356Z U cudaStreamWaitEvent@libcudart.so.11.0 2025-05-07T20:03:44.9953619Z U fbgemm::EmbeddingSpMDMKernelSignature::Type fbgemm::GenerateEmbeddingSpMDMWithStrides(long, bool, bool, int, bool, bool, long, long, bool, bool, bool, bool) 2025-05-07T20:03:44.9954862Z U fbgemm::SparseAdaGradSignature::Type fbgemm::GenerateSparseAdaGrad(int, bool, int, bool) 2025-05-07T20:03:44.9955459Z U fbgemm::fbgemmAlignedFree(void*) 2025-05-07T20:03:44.9955890Z U fbgemm_gpu::asynchronous_complete_cumsum_gpu(at::Tensor const&) 2025-05-07T20:03:44.9956341Z U float at::Tensor::item() const 2025-05-07T20:03:44.9956712Z U float* at::TensorBase::data_ptr() const 2025-05-07T20:03:44.9957121Z U float* at::TensorBase::mutable_data_ptr() const 2025-05-07T20:03:44.9957503Z U free@GLIBC_2.2.5 2025-05-07T20:03:44.9957852Z U int* at::TensorBase::data_ptr() const 2025-05-07T20:03:44.9958253Z U int* at::TensorBase::mutable_data_ptr() const 2025-05-07T20:03:44.9958714Z U long c10::detail::maybe_wrap_dim_slow(long, long, bool) 2025-05-07T20:03:44.9959150Z U long* at::TensorBase::data_ptr() const 2025-05-07T20:03:44.9959592Z U long* at::TensorBase::mutable_data_ptr() const 2025-05-07T20:03:44.9959951Z U memcmp@GLIBC_2.2.5 2025-05-07T20:03:44.9960286Z U memcpy@GLIBC_2.14 2025-05-07T20:03:44.9960570Z U memmove@GLIBC_2.2.5 2025-05-07T20:03:44.9960879Z U memset@GLIBC_2.2.5 2025-05-07T20:03:44.9961186Z U operator 
delete(void*)@GLIBCXX_3.4 2025-05-07T20:03:44.9961556Z U operator new(unsigned long)@GLIBCXX_3.4 2025-05-07T20:03:44.9962148Z U radix_sort_pairs(void*, unsigned long&, int const*, int*, float const*, float*, int, int, int, CUstream_st*) 2025-05-07T20:03:44.9962926Z U radix_sort_pairs(void*, unsigned long&, int const*, int*, int const*, int*, int, int, int, CUstream_st*) 2025-05-07T20:03:44.9963690Z U radix_sort_pairs(void*, unsigned long&, int const*, int*, long const*, long*, int, int, int, CUstream_st*) 2025-05-07T20:03:44.9964464Z U radix_sort_pairs(void*, unsigned long&, long const*, long*, float const*, float*, int, int, int, CUstream_st*) 2025-05-07T20:03:44.9965261Z U radix_sort_pairs(void*, unsigned long&, long const*, long*, int const*, int*, int, int, int, CUstream_st*) 2025-05-07T20:03:44.9966126Z U radix_sort_pairs(void*, unsigned long&, long const*, long*, long const*, long*, int, int, int, CUstream_st*) 2025-05-07T20:03:44.9966638Z U realloc@GLIBC_2.2.5 2025-05-07T20:03:44.9967269Z U split_embedding_codegen_forward_cpu(at::Tensor, at::Tensor, at::Tensor, c10::SymInt, at::Tensor, at::Tensor, at::Tensor, long, at::Tensor, long) 2025-05-07T20:03:44.9968226Z U split_embedding_codegen_grad_indice_weights_cpu(at::Tensor, at::Tensor, at::Tensor, at::Tensor, at::Tensor, at::Tensor, at::Tensor) 2025-05-07T20:03:44.9968817Z U sqrt@GLIBC_2.2.5 2025-05-07T20:03:44.9969108Z U sqrtf@GLIBC_2.2.5 2025-05-07T20:03:44.9969500Z U std::_Hash_bytes(void const*, unsigned long, unsigned long)@CXXABI_1.3.5 2025-05-07T20:03:44.9970172Z U std::__cxx11::basic_ostringstream, std::allocator >::basic_ostringstream() 2025-05-07T20:03:44.9971086Z U std::__cxx11::basic_ostringstream, std::allocator >::~basic_ostringstream()@GLIBCXX_3.4.21 2025-05-07T20:03:44.9971938Z U std::__cxx11::basic_string, std::allocator >::_M_append(char const*, unsigned long)@GLIBCXX_3.4.21 2025-05-07T20:03:44.9972950Z U std::__cxx11::basic_string, std::allocator >::_M_assign(std::__cxx11::basic_string, 
std::allocator > const&) 2025-05-07T20:03:44.9973940Z U std::__cxx11::basic_string, std::allocator >::_M_construct(unsigned long, char)@GLIBCXX_3.4.21 2025-05-07T20:03:44.9974803Z U std::__cxx11::basic_string, std::allocator >::_M_create(unsigned long&, unsigned long)@GLIBCXX_3.4.21 2025-05-07T20:03:44.9975739Z U std::__cxx11::basic_string, std::allocator >::_M_mutate(unsigned long, unsigned long, char const*, unsigned long) 2025-05-07T20:03:44.9976774Z U std::__cxx11::basic_string, std::allocator >::_M_replace(unsigned long, unsigned long, char const*, unsigned long)@GLIBCXX_3.4.21 2025-05-07T20:03:44.9977911Z U std::__cxx11::basic_string, std::allocator >::swap(std::__cxx11::basic_string, std::allocator >&)@GLIBCXX_3.4.21 2025-05-07T20:03:44.9979018Z U std::__cxx11::basic_stringstream, std::allocator >::basic_stringstream() 2025-05-07T20:03:44.9979826Z U std::__detail::_Prime_rehash_policy::_M_need_rehash(unsigned long, unsigned long, unsigned long) const@GLIBCXX_3.4.18 2025-05-07T20:03:44.9980397Z U std::__throw_bad_alloc()@GLIBCXX_3.4 2025-05-07T20:03:44.9980735Z U std::__throw_bad_array_new_length() 2025-05-07T20:03:44.9981089Z U std::__throw_bad_function_call()@GLIBCXX_3.4.14 2025-05-07T20:03:44.9981455Z U std::__throw_length_error(char const*)@GLIBCXX_3.4 2025-05-07T20:03:44.9981840Z U std::__throw_logic_error(char const*)@GLIBCXX_3.4 2025-05-07T20:03:44.9982242Z U std::__throw_out_of_range_fmt(char const*, ...)@GLIBCXX_3.4.20 2025-05-07T20:03:44.9982643Z U std::__throw_system_error(int)@GLIBCXX_3.4.11 2025-05-07T20:03:44.9983022Z U std::bad_weak_ptr::~bad_weak_ptr()@GLIBCXX_3.4.15 2025-05-07T20:03:44.9983483Z U std::basic_ios >::clear(std::_Ios_Iostate)@GLIBCXX_3.4 2025-05-07T20:03:44.9984373Z U std::basic_ostream >& std::__ostream_insert >(std::basic_ostream >&, char const*, long)@GLIBCXX_3.4.9 2025-05-07T20:03:44.9985125Z U std::exception::~exception()@GLIBCXX_3.4 2025-05-07T20:03:44.9985476Z U std::ios_base::Init::Init()@GLIBCXX_3.4 
2025-05-07T20:03:44.9985808Z U std::ios_base::Init::~Init()@GLIBCXX_3.4 2025-05-07T20:03:44.9986127Z U std::ios_base::~ios_base()@GLIBCXX_3.4 2025-05-07T20:03:44.9986453Z U std::locale::~locale()@GLIBCXX_3.4 2025-05-07T20:03:44.9986830Z U std::ostream& std::ostream::_M_insert(long)@GLIBCXX_3.4.9 2025-05-07T20:03:44.9987347Z U std::ostream& std::ostream::_M_insert(unsigned long)@GLIBCXX_3.4.9 2025-05-07T20:03:44.9987806Z U std::ostream::operator<<(int)@GLIBCXX_3.4 2025-05-07T20:03:44.9988186Z U std::out_of_range::out_of_range(char const*)@GLIBCXX_3.4.21 2025-05-07T20:03:44.9988581Z U std::out_of_range::~out_of_range()@GLIBCXX_3.4 2025-05-07T20:03:44.9989210Z U std::runtime_error::runtime_error(std::__cxx11::basic_string, std::allocator > const&)@GLIBCXX_3.4.21 2025-05-07T20:03:44.9991429Z U std::runtime_error::~runtime_error()@GLIBCXX_3.4 2025-05-07T20:03:44.9991776Z U std::terminate()@GLIBCXX_3.4 2025-05-07T20:03:44.9992065Z U strcmp@GLIBC_2.2.5 2025-05-07T20:03:44.9992350Z U strlen@GLIBC_2.2.5 2025-05-07T20:03:44.9992644Z U torch::CppFunction::~CppFunction() 2025-05-07T20:03:44.9993737Z U torch::Library::Library(torch::Library::Kind, std::__cxx11::basic_string, std::allocator >, std::optional, char const*, unsigned int) 2025-05-07T20:03:44.9994934Z U torch::Library::_def(c10::FunctionSchema&&, c10::OperatorName*, std::vector > const&, torch::_RegisterOrVerify) & 2025-05-07T20:03:44.9995771Z U torch::Library::_impl(char const*, torch::CppFunction&&, torch::_RegisterOrVerify) & 2025-05-07T20:03:44.9996286Z U torch::autograd::AnomalyMode::_enabled 2025-05-07T20:03:44.9996815Z U torch::autograd::AutogradContext::AutogradContext(torch::dynamo::autograd::PackedArgs&) 2025-05-07T20:03:44.9997426Z U torch::autograd::AutogradContext::get_and_bump_dirty() const 2025-05-07T20:03:44.9997979Z U torch::autograd::AutogradContext::get_non_differentiable() const 2025-05-07T20:03:44.9998491Z U torch::autograd::AutogradContext::get_saved_variables() const 
2025-05-07T20:03:44.9999159Z U torch::autograd::AutogradContext::save_for_backward(std::vector >) 2025-05-07T20:03:44.9999826Z U torch::autograd::AutogradContext::save_variables() 2025-05-07T20:03:45.0000286Z U torch::autograd::ForwardADLevel::try_get_by_idx(unsigned long) 2025-05-07T20:03:45.0000799Z U torch::autograd::InputMetadata::shape_as_dim_vector() const 2025-05-07T20:03:45.0001229Z U torch::autograd::Node::assign_parent() 2025-05-07T20:03:45.0001589Z U torch::autograd::Node::metadata() 2025-05-07T20:03:45.0001945Z U torch::autograd::Node::name[abi:cxx11]() const 2025-05-07T20:03:45.0002645Z U torch::autograd::SavedVariable::SavedVariable(at::Tensor const&, bool, bool) 2025-05-07T20:03:45.0003311Z U torch::autograd::SavedVariable::unpack(std::shared_ptr) const 2025-05-07T20:03:45.0003843Z U torch::autograd::VariableInfo::VariableInfo() 2025-05-07T20:03:45.0004335Z U torch::autograd::VariableInfo::VariableInfo(at::Tensor const&, bool) 2025-05-07T20:03:45.0004892Z U torch::autograd::VariableInfo::zeros(c10::OptionalDeviceGuard&) const 2025-05-07T20:03:45.0008008Z U torch::autograd::_wrap_outputs(std::vector > const&, std::unordered_set, std::equal_to, std::allocator > const&, std::unordered_set, std::equal_to, std::allocator > const&, c10::ArrayRef >, std::shared_ptr const&, std::function > (std::vector >, std::vector >)> const&, std::unordered_set, std::equal_to, std::allocator > const&, std::function const&) 2025-05-07T20:03:45.0011004Z U torch::autograd::deleteNode(torch::autograd::Node*) 2025-05-07T20:03:45.0011441Z U torch::autograd::get_current_graph_task_exec_info() 2025-05-07T20:03:45.0011888Z U torch::autograd::impl::gradient_edge(at::Tensor const&) 2025-05-07T20:03:45.0012468Z U torch::dynamo::autograd::getPyCompilerInterface() 2025-05-07T20:03:45.0013168Z U torch::dynamo::autograd::get_input_metadata(std::vector > const&) 2025-05-07T20:03:45.0014073Z U torch::jit::parseSchema(std::__cxx11::basic_string, std::allocator > const&, bool) 
2025-05-07T20:03:45.0015135Z U transpose_embedding_input(at::Tensor, long, at::Tensor, at::Tensor, bool, std::optional const&, long, long, long, bool, std::optional const&, long, long) 2025-05-07T20:03:45.0015929Z U typeinfo for c10::Error 2025-05-07T20:03:45.0016280Z U typeinfo for std::bad_weak_ptr@GLIBCXX_3.4.15 2025-05-07T20:03:45.0016678Z U typeinfo for std::exception@GLIBCXX_3.4 2025-05-07T20:03:45.0017063Z U typeinfo for std::out_of_range@GLIBCXX_3.4 2025-05-07T20:03:45.0017440Z U typeinfo for std::runtime_error@GLIBCXX_3.4 2025-05-07T20:03:45.0017808Z U typeinfo for torch::autograd::Node 2025-05-07T20:03:45.0019264Z U void internal::csr2csc(internal::HyperCompressedSparseColumn&, int, at::TensorAccessor const&, at::TensorAccessor const&, at::TensorAccessor const&, long, int const*, long) 2025-05-07T20:03:45.0021735Z U void internal::csr2csc(internal::HyperCompressedSparseColumn&, int, at::TensorAccessor const&, at::TensorAccessor const&, at::TensorAccessor const&, long, int const*, long) 2025-05-07T20:03:45.0023100Z U vtable for __cxxabiv1::__class_type_info@CXXABI_1.3 2025-05-07T20:03:45.0023550Z U vtable for __cxxabiv1::__function_type_info@CXXABI_1.3 2025-05-07T20:03:45.0023989Z U vtable for __cxxabiv1::__pointer_type_info@CXXABI_1.3 2025-05-07T20:03:45.0024430Z U vtable for __cxxabiv1::__si_class_type_info@CXXABI_1.3 2025-05-07T20:03:45.0024802Z U vtable for c10::Error 2025-05-07T20:03:45.0025359Z U vtable for std::__cxx11::basic_stringbuf, std::allocator >@GLIBCXX_3.4.21 2025-05-07T20:03:45.0025955Z U vtable for std::bad_weak_ptr@GLIBCXX_3.4.15 2025-05-07T20:03:45.0026429Z U vtable for std::basic_streambuf >@GLIBCXX_3.4 2025-05-07T20:03:45.0026896Z U vtable for torch::autograd::Node 2025-05-07T20:03:45.0027297Z w TLS init function for c10::impl::raw_local_dispatch_key_set 2025-05-07T20:03:45.0027709Z w _ITM_deregisterTMCloneTable 2025-05-07T20:03:45.0028033Z w _ITM_registerTMCloneTable 2025-05-07T20:03:45.0028354Z w __cxa_finalize@GLIBC_2.2.5 
2025-05-07T20:03:45.0028653Z w __gmon_start__ 2025-05-07T20:03:45.0028933Z w __pthread_key_create 2025-05-07T20:03:45.0029231Z w pthread_mutex_lock@GLIBC_2.2.5 2025-05-07T20:03:45.0029572Z w pthread_mutex_unlock@GLIBC_2.2.5 2025-05-07T20:03:45.0029956Z [CHECK] Listing out external shared libraries linked: 2025-05-07T20:03:45.0030464Z + ldd ./_skbuild/linux-x86_64-3.13/cmake-build/fbgemm_gpu_tbe_training_backward.so 2025-05-07T20:03:45.0030839Z 2025-05-07T20:03:45.0030989Z linux-vdso.so.1 (0x00007fffb1dca000) 2025-05-07T20:03:45.0031287Z libc10.so => not found 2025-05-07T20:03:45.0031530Z libc10_cuda.so => not found 2025-05-07T20:03:45.0032201Z fbgemm_gpu_tbe_common.so => /__w/FBGEMM/FBGEMM/fbgemm_gpu/./_skbuild/linux-x86_64-3.13/cmake-build/fbgemm_gpu_tbe_common.so (0x00007f4430c0a000) 2025-05-07T20:03:45.0033450Z fbgemm_gpu_sparse_async_cumsum.so => /__w/FBGEMM/FBGEMM/fbgemm_gpu/./_skbuild/linux-x86_64-3.13/cmake-build/fbgemm_gpu_sparse_async_cumsum.so (0x00007f4430b12000) 2025-05-07T20:03:45.0034303Z libtorch.so => not found 2025-05-07T20:03:45.0034837Z fbgemm.so => /__w/FBGEMM/FBGEMM/fbgemm_gpu/./_skbuild/linux-x86_64-3.13/cmake-build/fbgemm.so (0x00007f4430400000) 2025-05-07T20:03:45.0035774Z fbgemm_gpu_tbe_utils.so => /__w/FBGEMM/FBGEMM/fbgemm_gpu/./_skbuild/linux-x86_64-3.13/cmake-build/fbgemm_gpu_tbe_utils.so (0x00007f442fa00000) 2025-05-07T20:03:45.0036460Z libtorch_cpu.so => not found 2025-05-07T20:03:45.0036729Z libtorch_cuda.so => not found 2025-05-07T20:03:45.0037008Z libcudart.so.11.0 => not found 2025-05-07T20:03:45.0037337Z libstdc++.so.6 => /lib64/libstdc++.so.6 (0x00007f442f79c000) 2025-05-07T20:03:45.0037739Z libm.so.6 => /lib64/libm.so.6 (0x00007f4430a37000) 2025-05-07T20:03:45.0038133Z libgcc_s.so.1 => /lib64/libgcc_s.so.1 (0x00007f4430a09000) 2025-05-07T20:03:45.0038508Z libc.so.6 => /lib64/libc.so.6 (0x00007f442f594000) 2025-05-07T20:03:45.0038876Z /lib64/ld-linux-x86-64.so.2 (0x00007f4450f31000) 2025-05-07T20:03:45.0039193Z libc10.so => not 
found 2025-05-07T20:03:45.0039437Z libc10_cuda.so => not found 2025-05-07T20:03:45.0040051Z fbgemm_gpu_config.so => /__w/FBGEMM/FBGEMM/fbgemm_gpu/./_skbuild/linux-x86_64-3.13/cmake-build/fbgemm_gpu_config.so (0x00007f44309fe000) 2025-05-07T20:03:45.0040706Z libtorch.so => not found 2025-05-07T20:03:45.0040987Z libtorch_cpu.so => not found 2025-05-07T20:03:45.0041274Z libtorch_cuda.so => not found 2025-05-07T20:03:45.0041552Z libcudart.so.11.0 => not found 2025-05-07T20:03:45.0041882Z libgomp.so.1 => /lib64/libgomp.so.1 (0x00007f44309a6000) 2025-05-07T20:03:45.0042240Z libc10.so => not found 2025-05-07T20:03:45.0042481Z libc10_cuda.so => not found 2025-05-07T20:03:45.0042780Z libtorch.so => not found 2025-05-07T20:03:45.0043032Z libtorch_cpu.so => not found 2025-05-07T20:03:45.0043307Z libtorch_cuda.so => not found 2025-05-07T20:03:45.0043580Z libcudart.so.11.0 => not found 2025-05-07T20:03:45.0043859Z libc10.so => not found 2025-05-07T20:03:45.0044362Z asmjit.so => /__w/FBGEMM/FBGEMM/fbgemm_gpu/./_skbuild/linux-x86_64-3.13/cmake-build/asmjit.so (0x00007f4430388000) 2025-05-07T20:03:45.0044937Z libtorch.so => not found 2025-05-07T20:03:45.0045202Z libtorch_cpu.so => not found 2025-05-07T20:03:45.0045473Z libtorch_cuda.so => not found 2025-05-07T20:03:45.0045752Z libtorch.so => not found 2025-05-07T20:03:45.0045992Z libc10.so => not found 2025-05-07T20:03:45.0046250Z libc10_cuda.so => not found 2025-05-07T20:03:45.0046512Z libtorch_cpu.so => not found 2025-05-07T20:03:45.0046866Z libtorch_cuda.so => not found 2025-05-07T20:03:45.0047134Z libcudart.so.11.0 => not found 2025-05-07T20:03:45.0047412Z libtorch.so => not found 2025-05-07T20:03:45.0047668Z libc10.so => not found 2025-05-07T20:03:45.0047914Z libtorch_cpu.so => not found 2025-05-07T20:03:45.0048192Z libtorch_cuda.so => not found 2025-05-07T20:03:45.0048538Z libpthread.so.0 => /lib64/libpthread.so.0 (0x00007f4430999000) 2025-05-07T20:03:45.0048937Z libtorch_cpu.so => not found 2025-05-07T20:03:45.0049203Z 
libtorch_cuda.so => not found 2025-05-07T20:03:45.0049477Z libtorch.so => not found 2025-05-07T20:03:45.0049763Z librt.so.1 => /lib64/librt.so.1 (0x00007f4430992000) 2025-05-07T20:03:45.0050020Z 2025-05-07T20:03:45.0050127Z [CHECK] Displaying ELF information: 2025-05-07T20:03:45.0050617Z + readelf -d ./_skbuild/linux-x86_64-3.13/cmake-build/fbgemm_gpu_tbe_training_backward.so 2025-05-07T20:03:45.0051002Z 2025-05-07T20:03:45.0051007Z 2025-05-07T20:03:45.0051171Z Dynamic section at offset 0x1eb9cd68 contains 42 entries: 2025-05-07T20:03:45.0051573Z Tag Type Name/Value 2025-05-07T20:03:45.0051992Z 0x0000000000000001 (NEEDED) Shared library: [libc10.so] 2025-05-07T20:03:45.0052509Z 0x0000000000000001 (NEEDED) Shared library: [libc10_cuda.so] 2025-05-07T20:03:45.0053045Z 0x0000000000000001 (NEEDED) Shared library: [fbgemm_gpu_tbe_common.so] 2025-05-07T20:03:45.0053720Z 0x0000000000000001 (NEEDED) Shared library: [fbgemm_gpu_sparse_async_cumsum.so] 2025-05-07T20:03:45.0054301Z 0x0000000000000001 (NEEDED) Shared library: [libtorch.so] 2025-05-07T20:03:45.0054805Z 0x0000000000000001 (NEEDED) Shared library: [fbgemm.so] 2025-05-07T20:03:45.0055336Z 0x0000000000000001 (NEEDED) Shared library: [fbgemm_gpu_tbe_utils.so] 2025-05-07T20:03:45.0055877Z 0x0000000000000001 (NEEDED) Shared library: [libtorch_cpu.so] 2025-05-07T20:03:45.0056417Z 0x0000000000000001 (NEEDED) Shared library: [libtorch_cuda.so] 2025-05-07T20:03:45.0056955Z 0x0000000000000001 (NEEDED) Shared library: [libcudart.so.11.0] 2025-05-07T20:03:45.0057481Z 0x0000000000000001 (NEEDED) Shared library: [libstdc++.so.6] 2025-05-07T20:03:45.0057992Z 0x0000000000000001 (NEEDED) Shared library: [libm.so.6] 2025-05-07T20:03:45.0058600Z 0x0000000000000001 (NEEDED) Shared library: [libgcc_s.so.1] 2025-05-07T20:03:45.0059093Z 0x0000000000000001 (NEEDED) Shared library: [libc.so.6] 2025-05-07T20:03:45.0059583Z 0x0000000000000001 (NEEDED) Shared library: [ld-linux-x86-64.so.2] 2025-05-07T20:03:45.0060161Z 0x000000000000000e 
(SONAME) Library soname: [fbgemm_gpu_tbe_training_backward.so] 2025-05-07T20:03:45.0060754Z 0x000000000000000f (RPATH) Library rpath: [$ORIGIN] 2025-05-07T20:03:45.0061144Z 0x000000000000000c (INIT) 0x5b0000 2025-05-07T20:03:45.0061487Z 0x000000000000000d (FINI) 0x2ee447c 2025-05-07T20:03:45.0061824Z 0x0000000000000019 (INIT_ARRAY) 0x1eb90820 2025-05-07T20:03:45.0062196Z 0x000000000000001b (INIT_ARRAYSZ) 1824 (bytes) 2025-05-07T20:03:45.0062572Z 0x000000000000001a (FINI_ARRAY) 0x1eb90f40 2025-05-07T20:03:45.0062918Z 0x000000000000001c (FINI_ARRAYSZ) 8 (bytes) 2025-05-07T20:03:45.0063249Z 0x000000006ffffef5 (GNU_HASH) 0x238 2025-05-07T20:03:45.0063576Z 0x0000000000000005 (STRTAB) 0x5ab08 2025-05-07T20:03:45.0063902Z 0x0000000000000006 (SYMTAB) 0x11200 2025-05-07T20:03:45.0064246Z 0x000000000000000a (STRSZ) 5105620 (bytes) 2025-05-07T20:03:45.0064602Z 0x000000000000000b (SYMENT) 24 (bytes) 2025-05-07T20:03:45.0064930Z 0x0000000000000003 (PLTGOT) 0x1eb9e048 2025-05-07T20:03:45.0065291Z 0x0000000000000002 (PLTRELSZ) 63264 (bytes) 2025-05-07T20:03:45.0065622Z 0x0000000000000014 (PLTREL) RELA 2025-05-07T20:03:45.0065950Z 0x0000000000000017 (JMPREL) 0x59f9b0 2025-05-07T20:03:45.0066285Z 0x0000000000000007 (RELA) 0x53f668 2025-05-07T20:03:45.0066615Z 0x0000000000000008 (RELASZ) 394056 (bytes) 2025-05-07T20:03:45.0066978Z 0x0000000000000009 (RELAENT) 24 (bytes) 2025-05-07T20:03:45.0067294Z 0x0000000000000018 (BIND_NOW) 2025-05-07T20:03:45.0067617Z 0x000000006ffffffb (FLAGS_1) Flags: NOW 2025-05-07T20:03:45.0067958Z 0x000000006ffffffe (VERNEED) 0x53f4f8 2025-05-07T20:03:45.0068290Z 0x000000006fffffff (VERNEEDNUM) 6 2025-05-07T20:03:45.0068602Z 0x000000006ffffff0 (VERSYM) 0x5392dc 2025-05-07T20:03:45.0068937Z 0x000000006ffffff9 (RELACOUNT) 2708 2025-05-07T20:03:45.0069248Z 0x0000000000000000 (NULL) 0x0 2025-05-07T20:03:45.0069444Z 2025-05-07T20:03:45.0069549Z ################################################################################ 2025-05-07T20:03:45.0069767Z 
2025-05-07T20:03:45.0069782Z 2025-05-07T20:03:45.0069890Z ################################################################################ 2025-05-07T20:03:45.0070425Z [CHECK] BUILT LIBRARY: ./_skbuild/linux-x86_64-3.13/cmake-build/fbgemm_gpu_tbe_training_backward_gwd.so 2025-05-07T20:03:45.0070965Z [CHECK] Listing out library size: 2025-05-07T20:03:45.0071477Z + du -h --block-size=1M ./_skbuild/linux-x86_64-3.13/cmake-build/fbgemm_gpu_tbe_training_backward_gwd.so 2025-05-07T20:03:45.0071965Z 2025-05-07T20:03:45.0072205Z 76 ./_skbuild/linux-x86_64-3.13/cmake-build/fbgemm_gpu_tbe_training_backward_gwd.so 2025-05-07T20:03:45.0072555Z 2025-05-07T20:03:45.0073095Z [CHECK] Listing out the GLIBC versions referenced by: ./_skbuild/linux-x86_64-3.13/cmake-build/fbgemm_gpu_tbe_training_backward_gwd.so 2025-05-07T20:03:45.0074463Z + objdump -TC ./_skbuild/linux-x86_64-3.13/cmake-build/fbgemm_gpu_tbe_training_backward_gwd.so | grep GLIBC_ | sed 's/.*GLIBC_\([.0-9]*\).*/GLIBC_\1/g' | sort -Vu | cat 2025-05-07T20:03:45.0075123Z 2025-05-07T20:03:45.0293966Z GLIBC_2.2.5 2025-05-07T20:03:45.0294707Z GLIBC_2.3 2025-05-07T20:03:45.0294963Z GLIBC_2.14 2025-05-07T20:03:45.0295184Z 2025-05-07T20:03:45.0295189Z 2025-05-07T20:03:45.0295686Z [CHECK] Listing out the GLIBCXX versions referenced by: ./_skbuild/linux-x86_64-3.13/cmake-build/fbgemm_gpu_tbe_training_backward_gwd.so 2025-05-07T20:03:45.0296964Z + objdump -TC ./_skbuild/linux-x86_64-3.13/cmake-build/fbgemm_gpu_tbe_training_backward_gwd.so | grep GLIBCXX_ | sed 's/.*GLIBCXX_\([.0-9]*\).*/GLIBCXX_\1/g' | sort -Vu | cat 2025-05-07T20:03:45.0297637Z 2025-05-07T20:03:45.0560762Z GLIBCXX_3.4 2025-05-07T20:03:45.0560978Z GLIBCXX_3.4.9 2025-05-07T20:03:45.0561202Z GLIBCXX_3.4.11 2025-05-07T20:03:45.0561566Z GLIBCXX_3.4.18 2025-05-07T20:03:45.0561788Z GLIBCXX_3.4.20 2025-05-07T20:03:45.0561991Z GLIBCXX_3.4.21 2025-05-07T20:03:45.0562643Z 2025-05-07T20:03:45.0563308Z 2025-05-07T20:03:45.0587523Z + nm -gDC 
./_skbuild/linux-x86_64-3.13/cmake-build/fbgemm_gpu_tbe_training_backward_gwd.so > /tmp/tmp.AuBememaNb.symbols.txt 2025-05-07T20:03:45.0589157Z 2025-05-07T20:03:45.0824966Z 2025-05-07T20:03:45.0851882Z [CHECK] Total Number of symbols: 1609 2025-05-07T20:03:45.0873294Z [CHECK] Number of fbgemm symbols: 227 2025-05-07T20:03:45.0891347Z + nm -gDCu ./_skbuild/linux-x86_64-3.13/cmake-build/fbgemm_gpu_tbe_training_backward_gwd.so > /tmp/tmp.UD3Ibe8z44.usymbols.txt 2025-05-07T20:03:45.0892049Z 2025-05-07T20:03:45.0911547Z 2025-05-07T20:03:45.0937813Z [CHECK] Listing out undefined symbols (176 total): 2025-05-07T20:03:45.0958640Z U VTT for std::__cxx11::basic_ostringstream, std::allocator >@GLIBCXX_3.4.21 2025-05-07T20:03:45.0959542Z U VTT for std::__cxx11::basic_stringstream, std::allocator >@GLIBCXX_3.4.21 2025-05-07T20:03:45.0960120Z U _Unwind_Resume@GCC_3.0 2025-05-07T20:03:45.0960489Z U __cudaPopCallConfiguration@libcudart.so.11.0 2025-05-07T20:03:45.0960895Z U __cudaPushCallConfiguration@libcudart.so.11.0 2025-05-07T20:03:45.0961289Z U __cudaRegisterFatBinary@libcudart.so.11.0 2025-05-07T20:03:45.0961684Z U __cudaRegisterFatBinaryEnd@libcudart.so.11.0 2025-05-07T20:03:45.0962067Z U __cudaRegisterFunction@libcudart.so.11.0 2025-05-07T20:03:45.0962428Z U __cudaRegisterVar@libcudart.so.11.0 2025-05-07T20:03:45.0962806Z U __cudaUnregisterFatBinary@libcudart.so.11.0 2025-05-07T20:03:45.0963159Z U __cxa_atexit@GLIBC_2.2.5 2025-05-07T20:03:45.0963481Z U __cxa_begin_catch@CXXABI_1.3 2025-05-07T20:03:45.0963811Z U __cxa_end_catch@CXXABI_1.3 2025-05-07T20:03:45.0964112Z U __cxa_rethrow@CXXABI_1.3 2025-05-07T20:03:45.0964443Z U __cxa_thread_atexit@CXXABI_1.3.7 2025-05-07T20:03:45.0964767Z U __gxx_personality_v0@CXXABI_1.3 2025-05-07T20:03:45.0965093Z U __tls_get_addr@GLIBC_2.3 2025-05-07T20:03:45.0965421Z U at::CUDAGeneratorImpl::device_type() 2025-05-07T20:03:45.0965836Z U at::CUDAGeneratorImpl::philox_cuda_state(unsigned long) 2025-05-07T20:03:45.0966274Z U 
at::Context::deterministicAlgorithms() const 2025-05-07T20:03:45.0966713Z U at::TensorBase::__dispatch_contiguous(c10::MemoryFormat) const 2025-05-07T20:03:45.0967302Z U at::_ops::copy_::call(at::Tensor&, at::Tensor const&, bool) 2025-05-07T20:03:45.0968348Z U at::_ops::empty_like::call(at::Tensor const&, std::optional, std::optional, std::optional, std::optional, std::optional) 2025-05-07T20:03:45.0969623Z U at::_ops::empty_memory_format::call(c10::ArrayRef, std::optional, std::optional, std::optional, std::optional, std::optional) 2025-05-07T20:03:45.0970549Z U at::_ops::reshape::call(at::Tensor const&, c10::ArrayRef) 2025-05-07T20:03:45.0971134Z U at::_ops::to_dtype::call(at::Tensor const&, c10::ScalarType, bool, bool, std::optional) 2025-05-07T20:03:45.0971990Z U at::_ops::zeros::call(c10::ArrayRef, std::optional, std::optional, std::optional, std::optional) 2025-05-07T20:03:45.0973088Z U at::_ops::zeros_like::call(at::Tensor const&, std::optional, std::optional, std::optional, std::optional, std::optional) 2025-05-07T20:03:45.0973896Z U at::cuda::detail::getDefaultCUDAGenerator(signed char) 2025-05-07T20:03:45.0974341Z U at::cuda::getCurrentDeviceProperties() 2025-05-07T20:03:45.0974671Z U at::globalContext() 2025-05-07T20:03:45.0975042Z U c10::BFloat16* at::TensorBase::mutable_data_ptr() const 2025-05-07T20:03:45.0975457Z U c10::BoolType::get() 2025-05-07T20:03:45.0975792Z U c10::DeviceTypeName[abi:cxx11](c10::DeviceType, bool) 2025-05-07T20:03:45.0976202Z U c10::FloatType::get() 2025-05-07T20:03:45.0976515Z U c10::GeneratorImpl::device() const 2025-05-07T20:03:45.0976881Z U c10::Half* at::TensorBase::mutable_data_ptr() const 2025-05-07T20:03:45.0977298Z U c10::IValue::reportToTensorTypeError() const 2025-05-07T20:03:45.0977625Z U c10::IntType::get() 2025-05-07T20:03:45.0977981Z U c10::MessageLogger::MessageLogger(char const*, int, int) 2025-05-07T20:03:45.0978362Z U c10::MessageLogger::~MessageLogger() 2025-05-07T20:03:45.0978731Z U 
c10::StorageImpl::throw_data_ptr_access_error() const 2025-05-07T20:03:45.0979125Z U c10::SymBool::guard_bool(char const*, long) const 2025-05-07T20:03:45.0979501Z U c10::SymFloat::guard_float(char const*, long) const 2025-05-07T20:03:45.0980160Z U c10::SymInt::SymInt(c10::intrusive_ptr >) 2025-05-07T20:03:45.0980783Z U c10::SymInt::guard_int(char const*, long) const 2025-05-07T20:03:45.0981136Z U c10::SymInt::promote_to_negative() 2025-05-07T20:03:45.0981446Z U c10::SymIntType::get() 2025-05-07T20:03:45.0981801Z U c10::SymbolicShapeMeta::init_is_contiguous() const 2025-05-07T20:03:45.0982206Z U c10::TensorImpl::throw_data_ptr_access_error() const 2025-05-07T20:03:45.0982576Z U c10::TensorType::get() 2025-05-07T20:03:45.0982904Z U c10::UndefinedTensorImpl::_singleton 2025-05-07T20:03:45.0983798Z U c10::Warning::Warning(std::variant, c10::SourceLocation const&, std::__cxx11::basic_string, std::allocator >, bool) 2025-05-07T20:03:45.0984738Z U c10::cuda::CUDACachingAllocator::allocator 2025-05-07T20:03:45.0985098Z U c10::cuda::CUDAStream::stream() const 2025-05-07T20:03:45.0985430Z U c10::cuda::ExchangeDevice(signed char) 2025-05-07T20:03:45.0985777Z U c10::cuda::GetDevice(signed char*) 2025-05-07T20:03:45.0986161Z U c10::cuda::MaybeSetDevice(signed char) 2025-05-07T20:03:45.0986501Z U c10::cuda::SetDevice(signed char) 2025-05-07T20:03:45.0986949Z U c10::cuda::c10_cuda_check_implementation(int, char const*, char const*, int, bool) 2025-05-07T20:03:45.0987407Z U c10::cuda::device_count() 2025-05-07T20:03:45.0987756Z U c10::cuda::getCurrentCUDAStream(signed char) 2025-05-07T20:03:45.0988109Z U c10::cuda::getDefaultCUDAStream(signed char) 2025-05-07T20:03:45.0988471Z U c10::cuda::getStreamFromPool(bool, signed char) 2025-05-07T20:03:45.0988840Z U c10::cuda::getStreamFromPool(int, signed char) 2025-05-07T20:03:45.0989235Z U c10::cuda::setCurrentCUDAStream(c10::cuda::CUDAStream) 2025-05-07T20:03:45.0989604Z U c10::cuda::warn_or_error_on_sync() 2025-05-07T20:03:45.0990304Z 
U c10::detail::infer_schema::make_function_schema(c10::ArrayRef, c10::ArrayRef) 2025-05-07T20:03:45.0991158Z U c10::detail::torchCheckFail(char const*, char const*, unsigned int, char const*) 2025-05-07T20:03:45.0992001Z U c10::detail::torchCheckFail(char const*, char const*, unsigned int, std::__cxx11::basic_string, std::allocator > const&) 2025-05-07T20:03:45.0993002Z U c10::detail::torchInternalAssertFail(char const*, char const*, unsigned int, char const*, char const*) 2025-05-07T20:03:45.0994275Z U c10::detail::torchInternalAssertFail(char const*, char const*, unsigned int, char const*, std::__cxx11::basic_string, std::allocator > const&) 2025-05-07T20:03:45.0995097Z U c10::impl::GPUTrace::gpuTraceState 2025-05-07T20:03:45.0995436Z U c10::impl::GPUTrace::haveState 2025-05-07T20:03:45.0995819Z U c10::impl::cow::is_cow_data_ptr(c10::DataPtr const&) 2025-05-07T20:03:45.0996247Z U c10::impl::cow::materialize_cow_storage(c10::StorageImpl&) 2025-05-07T20:03:45.0996660Z U c10::impl::device_guard_impl_registry 2025-05-07T20:03:45.0997031Z U c10::operator<<(std::ostream&, c10::Device const&) 2025-05-07T20:03:45.0997425Z U c10::operator<<(std::ostream&, c10::DeviceType) 2025-05-07T20:03:45.0997797Z U c10::throwNullDataPtrError() 2025-05-07T20:03:45.0998114Z U c10::warn(c10::Warning const&) 2025-05-07T20:03:45.0998444Z U c10::warnDeprecatedDataPtr() 2025-05-07T20:03:45.0998856Z U caffe2::TypeMeta::error_unsupported_typemeta(caffe2::TypeMeta) 2025-05-07T20:03:45.0999298Z U caffe2::TypeMeta::typeMetaDatas() 2025-05-07T20:03:45.0999656Z U cudaDeviceGetAttribute@libcudart.so.11.0 2025-05-07T20:03:45.1000044Z U cudaDeviceSynchronize@libcudart.so.11.0 2025-05-07T20:03:45.1000429Z U cudaEventCreateWithFlags@libcudart.so.11.0 2025-05-07T20:03:45.1000796Z U cudaEventDestroy@libcudart.so.11.0 2025-05-07T20:03:45.1001156Z U cudaEventElapsedTime@libcudart.so.11.0 2025-05-07T20:03:45.1001508Z U cudaEventQuery@libcudart.so.11.0 2025-05-07T20:03:45.1001868Z U 
cudaEventRecord@libcudart.so.11.0 2025-05-07T20:03:45.1002457Z U cudaEventSynchronize@libcudart.so.11.0 2025-05-07T20:03:45.1002829Z U cudaFuncSetAttribute@libcudart.so.11.0 2025-05-07T20:03:45.1003201Z U cudaGetDeviceProperties@libcudart.so.11.0 2025-05-07T20:03:45.1003597Z U cudaGetErrorString@libcudart.so.11.0 2025-05-07T20:03:45.1003971Z U cudaGetLastError@libcudart.so.11.0 2025-05-07T20:03:45.1004312Z U cudaLaunchKernel@libcudart.so.11.0 2025-05-07T20:03:45.1004780Z U cudaStreamQuery@libcudart.so.11.0 2025-05-07T20:03:45.1005139Z U cudaStreamSynchronize@libcudart.so.11.0 2025-05-07T20:03:45.1005519Z U cudaStreamWaitEvent@libcudart.so.11.0 2025-05-07T20:03:45.1008029Z U embedding_ops::split_embedding_backward_codegen_find_long_segments(at::GenericPackedTensorAccessor, at::GenericPackedTensorAccessor, at::GenericPackedTensorAccessor, at::GenericPackedTensorAccessor, at::GenericPackedTensorAccessor, at::GenericPackedTensorAccessor, at::GenericPackedTensorAccessor, int, int, bool) 2025-05-07T20:03:45.1010579Z U fbgemm_gpu::asynchronous_complete_cumsum_gpu(at::Tensor const&) 2025-05-07T20:03:45.1011015Z U float at::Tensor::item() const 2025-05-07T20:03:45.1011398Z U float* at::TensorBase::data_ptr() const 2025-05-07T20:03:45.1011805Z U float* at::TensorBase::mutable_data_ptr() const 2025-05-07T20:03:45.1012220Z U int* at::TensorBase::data_ptr() const 2025-05-07T20:03:45.1012650Z U int* at::TensorBase::mutable_data_ptr() const 2025-05-07T20:03:45.1013100Z U long c10::detail::maybe_wrap_dim_slow(long, long, bool) 2025-05-07T20:03:45.1013538Z U long* at::TensorBase::data_ptr() const 2025-05-07T20:03:45.1014033Z U long* at::TensorBase::mutable_data_ptr() const 2025-05-07T20:03:45.1014470Z U memcpy@GLIBC_2.14 2025-05-07T20:03:45.1014774Z U memset@GLIBC_2.2.5 2025-05-07T20:03:45.1015118Z U operator delete(void*)@GLIBCXX_3.4 2025-05-07T20:03:45.1015434Z U operator new(unsigned long)@GLIBCXX_3.4 2025-05-07T20:03:45.1016174Z U radix_sort_pairs(void*, unsigned long&, int 
const*, int*, float const*, float*, int, int, int, CUstream_st*) 2025-05-07T20:03:45.1016931Z U radix_sort_pairs(void*, unsigned long&, int const*, int*, int const*, int*, int, int, int, CUstream_st*) 2025-05-07T20:03:45.1017678Z U radix_sort_pairs(void*, unsigned long&, long const*, long*, float const*, float*, int, int, int, CUstream_st*) 2025-05-07T20:03:45.1018699Z U radix_sort_pairs(void*, unsigned long&, long const*, long*, int const*, int*, int, int, int, CUstream_st*) 2025-05-07T20:03:45.1019493Z U std::__cxx11::basic_ostringstream, std::allocator >::basic_ostringstream() 2025-05-07T20:03:45.1020350Z U std::__cxx11::basic_ostringstream, std::allocator >::~basic_ostringstream()@GLIBCXX_3.4.21 2025-05-07T20:03:45.1021272Z U std::__cxx11::basic_string, std::allocator >::_M_append(char const*, unsigned long)@GLIBCXX_3.4.21 2025-05-07T20:03:45.1022352Z U std::__cxx11::basic_string, std::allocator >::_M_assign(std::__cxx11::basic_string, std::allocator > const&) 2025-05-07T20:03:45.1023396Z U std::__cxx11::basic_string, std::allocator >::_M_construct(unsigned long, char)@GLIBCXX_3.4.21 2025-05-07T20:03:45.1024326Z U std::__cxx11::basic_string, std::allocator >::_M_create(unsigned long&, unsigned long)@GLIBCXX_3.4.21 2025-05-07T20:03:45.1025327Z U std::__cxx11::basic_string, std::allocator >::_M_mutate(unsigned long, unsigned long, char const*, unsigned long) 2025-05-07T20:03:45.1026415Z U std::__cxx11::basic_string, std::allocator >::_M_replace(unsigned long, unsigned long, char const*, unsigned long)@GLIBCXX_3.4.21 2025-05-07T20:03:45.1027599Z U std::__cxx11::basic_stringstream, std::allocator >::basic_stringstream() 2025-05-07T20:03:45.1028433Z U std::__detail::_Prime_rehash_policy::_M_need_rehash(unsigned long, unsigned long, unsigned long) const@GLIBCXX_3.4.18 2025-05-07T20:03:45.1029038Z U std::__throw_bad_alloc()@GLIBCXX_3.4 2025-05-07T20:03:45.1029384Z U std::__throw_bad_array_new_length() 2025-05-07T20:03:45.1029750Z U std::__throw_length_error(char 
const*)@GLIBCXX_3.4 2025-05-07T20:03:45.1030164Z U std::__throw_logic_error(char const*)@GLIBCXX_3.4 2025-05-07T20:03:45.1030597Z U std::__throw_out_of_range_fmt(char const*, ...)@GLIBCXX_3.4.20 2025-05-07T20:03:45.1031014Z U std::__throw_system_error(int)@GLIBCXX_3.4.11 2025-05-07T20:03:45.1031512Z U std::basic_ios >::clear(std::_Ios_Iostate)@GLIBCXX_3.4 2025-05-07T20:03:45.1032450Z U std::basic_ostream >& std::__ostream_insert >(std::basic_ostream >&, char const*, long)@GLIBCXX_3.4.9 2025-05-07T20:03:45.1033434Z U std::ios_base::Init::Init()@GLIBCXX_3.4 2025-05-07T20:03:45.1033885Z U std::ios_base::Init::~Init()@GLIBCXX_3.4 2025-05-07T20:03:45.1034234Z U std::ios_base::~ios_base()@GLIBCXX_3.4 2025-05-07T20:03:45.1034582Z U std::locale::~locale()@GLIBCXX_3.4 2025-05-07T20:03:45.1034988Z U std::ostream& std::ostream::_M_insert(long)@GLIBCXX_3.4.9 2025-05-07T20:03:45.1035571Z U std::ostream& std::ostream::_M_insert(unsigned long)@GLIBCXX_3.4.9 2025-05-07T20:03:45.1036071Z U std::ostream::operator<<(int)@GLIBCXX_3.4 2025-05-07T20:03:45.1036422Z U std::terminate()@GLIBCXX_3.4 2025-05-07T20:03:45.1036740Z U strlen@GLIBC_2.2.5 2025-05-07T20:03:45.1037055Z U torch::CppFunction::~CppFunction() 2025-05-07T20:03:45.1037901Z U torch::Library::Library(torch::Library::Kind, std::__cxx11::basic_string, std::allocator >, std::optional, char const*, unsigned int) 2025-05-07T20:03:45.1039094Z U torch::Library::_def(c10::FunctionSchema&&, c10::OperatorName*, std::vector > const&, torch::_RegisterOrVerify) & 2025-05-07T20:03:45.1039979Z U torch::Library::_impl(char const*, torch::CppFunction&&, torch::_RegisterOrVerify) & 2025-05-07T20:03:45.1040726Z U torch::jit::parseSchema(std::__cxx11::basic_string, std::allocator > const&, bool) 2025-05-07T20:03:45.1041786Z U transpose_embedding_input(at::Tensor, long, at::Tensor, at::Tensor, bool, std::optional const&, long, long, long, bool, std::optional const&, long, long) 2025-05-07T20:03:45.1043931Z U void 
embedding_ops::grad_mean_kernel(at::GenericPackedTensorAccessor, at::GenericPackedTensorAccessor, at::GenericPackedTensorAccessor, at::GenericPackedTensorAccessor, fbgemm_gpu::FixedDivisor) 2025-05-07T20:03:45.1046971Z U void embedding_ops::grad_mean_kernel(at::GenericPackedTensorAccessor, at::GenericPackedTensorAccessor, at::GenericPackedTensorAccessor, at::GenericPackedTensorAccessor, fbgemm_gpu::FixedDivisor) 2025-05-07T20:03:45.1049751Z U void embedding_ops::grad_mean_kernel(at::GenericPackedTensorAccessor, at::GenericPackedTensorAccessor, at::GenericPackedTensorAccessor, at::GenericPackedTensorAccessor, fbgemm_gpu::FixedDivisor) 2025-05-07T20:03:45.1052407Z U void embedding_ops::grad_mean_kernel(at::GenericPackedTensorAccessor, at::GenericPackedTensorAccessor, at::GenericPackedTensorAccessor, at::GenericPackedTensorAccessor, fbgemm_gpu::FixedDivisor) 2025-05-07T20:03:45.1055307Z U void embedding_ops::grad_mean_kernel(at::GenericPackedTensorAccessor, at::GenericPackedTensorAccessor, at::GenericPackedTensorAccessor, at::GenericPackedTensorAccessor, fbgemm_gpu::FixedDivisor) 2025-05-07T20:03:45.1058365Z U void embedding_ops::grad_mean_kernel(at::GenericPackedTensorAccessor, at::GenericPackedTensorAccessor, at::GenericPackedTensorAccessor, at::GenericPackedTensorAccessor, fbgemm_gpu::FixedDivisor) 2025-05-07T20:03:45.1061930Z U void embedding_ops::grad_mean_vbe_kernel(at::GenericPackedTensorAccessor, at::GenericPackedTensorAccessor, at::GenericPackedTensorAccessor, at::GenericPackedTensorAccessor, at::GenericPackedTensorAccessor, at::GenericPackedTensorAccessor, int, unsigned int) 2025-05-07T20:03:45.1066068Z U void embedding_ops::grad_mean_vbe_kernel(at::GenericPackedTensorAccessor, at::GenericPackedTensorAccessor, at::GenericPackedTensorAccessor, at::GenericPackedTensorAccessor, at::GenericPackedTensorAccessor, at::GenericPackedTensorAccessor, int, unsigned int) 2025-05-07T20:03:45.1070250Z U void 
embedding_ops::grad_mean_vbe_kernel(at::GenericPackedTensorAccessor, at::GenericPackedTensorAccessor, at::GenericPackedTensorAccessor, at::GenericPackedTensorAccessor, at::GenericPackedTensorAccessor, at::GenericPackedTensorAccessor, int, unsigned int) 2025-05-07T20:03:45.1074491Z U void embedding_ops::grad_mean_vbe_kernel(at::GenericPackedTensorAccessor, at::GenericPackedTensorAccessor, at::GenericPackedTensorAccessor, at::GenericPackedTensorAccessor, at::GenericPackedTensorAccessor, at::GenericPackedTensorAccessor, int, unsigned int) 2025-05-07T20:03:45.1078638Z U void embedding_ops::grad_mean_vbe_kernel(at::GenericPackedTensorAccessor, at::GenericPackedTensorAccessor, at::GenericPackedTensorAccessor, at::GenericPackedTensorAccessor, at::GenericPackedTensorAccessor, at::GenericPackedTensorAccessor, int, unsigned int) 2025-05-07T20:03:45.1082661Z U void embedding_ops::grad_mean_vbe_kernel(at::GenericPackedTensorAccessor, at::GenericPackedTensorAccessor, at::GenericPackedTensorAccessor, at::GenericPackedTensorAccessor, at::GenericPackedTensorAccessor, at::GenericPackedTensorAccessor, int, unsigned int) 2025-05-07T20:03:45.1086548Z U void embedding_ops::split_embedding_backward_count_unique_indices_kernel(at::GenericPackedTensorAccessor, at::GenericPackedTensorAccessor, at::GenericPackedTensorAccessor, at::GenericPackedTensorAccessor, at::GenericPackedTensorAccessor, int) 2025-05-07T20:03:45.1088390Z U vtable for __cxxabiv1::__class_type_info@CXXABI_1.3 2025-05-07T20:03:45.1088798Z U vtable for __cxxabiv1::__function_type_info@CXXABI_1.3 2025-05-07T20:03:45.1089197Z U vtable for __cxxabiv1::__si_class_type_info@CXXABI_1.3 2025-05-07T20:03:45.1089801Z U vtable for std::__cxx11::basic_stringbuf, std::allocator >@GLIBCXX_3.4.21 2025-05-07T20:03:45.1090443Z U vtable for std::basic_streambuf >@GLIBCXX_3.4 2025-05-07T20:03:45.1090859Z w _ITM_deregisterTMCloneTable 2025-05-07T20:03:45.1091166Z w _ITM_registerTMCloneTable 2025-05-07T20:03:45.1091451Z w 
__cxa_finalize@GLIBC_2.2.5 2025-05-07T20:03:45.1091735Z w __gmon_start__ 2025-05-07T20:03:45.1092003Z w __pthread_key_create 2025-05-07T20:03:45.1092289Z w pthread_mutex_lock@GLIBC_2.2.5 2025-05-07T20:03:45.1092601Z w pthread_mutex_unlock@GLIBC_2.2.5 2025-05-07T20:03:45.1092950Z [CHECK] Listing out external shared libraries linked: 2025-05-07T20:03:45.1093430Z + ldd ./_skbuild/linux-x86_64-3.13/cmake-build/fbgemm_gpu_tbe_training_backward_gwd.so 2025-05-07T20:03:45.1093778Z 2025-05-07T20:03:45.1093881Z linux-vdso.so.1 (0x00007ffd3c0f9000) 2025-05-07T20:03:45.1094162Z libc10.so => not found 2025-05-07T20:03:45.1094380Z libc10_cuda.so => not found 2025-05-07T20:03:45.1095085Z fbgemm_gpu_tbe_training_backward.so => /__w/FBGEMM/FBGEMM/fbgemm_gpu/./_skbuild/linux-x86_64-3.13/cmake-build/fbgemm_gpu_tbe_training_backward.so (0x00007f4de9600000) 2025-05-07T20:03:45.1095823Z libtorch.so => not found 2025-05-07T20:03:45.1096056Z libtorch_cpu.so => not found 2025-05-07T20:03:45.1096313Z libtorch_cuda.so => not found 2025-05-07T20:03:45.1096563Z libcudart.so.11.0 => not found 2025-05-07T20:03:45.1096888Z libstdc++.so.6 => /lib64/libstdc++.so.6 (0x00007f4de939c000) 2025-05-07T20:03:45.1097273Z libgcc_s.so.1 => /lib64/libgcc_s.so.1 (0x00007f4e0e641000) 2025-05-07T20:03:45.1097635Z libc.so.6 => /lib64/libc.so.6 (0x00007f4de9194000) 2025-05-07T20:03:45.1097978Z /lib64/ld-linux-x86-64.so.2 (0x00007f4e0e675000) 2025-05-07T20:03:45.1098273Z libc10.so => not found 2025-05-07T20:03:45.1098511Z libc10_cuda.so => not found 2025-05-07T20:03:45.1099117Z fbgemm_gpu_tbe_common.so => /__w/FBGEMM/FBGEMM/fbgemm_gpu/./_skbuild/linux-x86_64-3.13/cmake-build/fbgemm_gpu_tbe_common.so (0x00007f4de8f9e000) 2025-05-07T20:03:45.1100180Z fbgemm_gpu_sparse_async_cumsum.so => /__w/FBGEMM/FBGEMM/fbgemm_gpu/./_skbuild/linux-x86_64-3.13/cmake-build/fbgemm_gpu_sparse_async_cumsum.so (0x00007f4de8ea6000) 2025-05-07T20:03:45.1100940Z libtorch.so => not found 2025-05-07T20:03:45.1101433Z fbgemm.so => 
/__w/FBGEMM/FBGEMM/fbgemm_gpu/./_skbuild/linux-x86_64-3.13/cmake-build/fbgemm.so (0x00007f4de8800000) 2025-05-07T20:03:45.1102904Z fbgemm_gpu_tbe_utils.so => /__w/FBGEMM/FBGEMM/fbgemm_gpu/./_skbuild/linux-x86_64-3.13/cmake-build/fbgemm_gpu_tbe_utils.so (0x00007f4de7e00000) 2025-05-07T20:03:45.1103586Z libtorch_cpu.so => not found 2025-05-07T20:03:45.1103864Z libtorch_cuda.so => not found 2025-05-07T20:03:45.1104130Z libcudart.so.11.0 => not found 2025-05-07T20:03:45.1104452Z libm.so.6 => /lib64/libm.so.6 (0x00007f4de8dcb000) 2025-05-07T20:03:45.1104771Z libc10.so => not found 2025-05-07T20:03:45.1105023Z libc10_cuda.so => not found 2025-05-07T20:03:45.1105653Z fbgemm_gpu_config.so => /__w/FBGEMM/FBGEMM/fbgemm_gpu/./_skbuild/linux-x86_64-3.13/cmake-build/fbgemm_gpu_config.so (0x00007f4e0e630000) 2025-05-07T20:03:45.1106310Z libtorch.so => not found 2025-05-07T20:03:45.1106572Z libtorch_cpu.so => not found 2025-05-07T20:03:45.1106845Z libtorch_cuda.so => not found 2025-05-07T20:03:45.1107125Z libcudart.so.11.0 => not found 2025-05-07T20:03:45.1107450Z libgomp.so.1 => /lib64/libgomp.so.1 (0x00007f4e097aa000) 2025-05-07T20:03:45.1107882Z libc10.so => not found 2025-05-07T20:03:45.1108121Z libc10_cuda.so => not found 2025-05-07T20:03:45.1108399Z libtorch.so => not found 2025-05-07T20:03:45.1108650Z libtorch_cpu.so => not found 2025-05-07T20:03:45.1108932Z libtorch_cuda.so => not found 2025-05-07T20:03:45.1109216Z libcudart.so.11.0 => not found 2025-05-07T20:03:45.1109474Z libc10.so => not found 2025-05-07T20:03:45.1110046Z asmjit.so => /__w/FBGEMM/FBGEMM/fbgemm_gpu/./_skbuild/linux-x86_64-3.13/cmake-build/asmjit.so (0x00007f4e09732000) 2025-05-07T20:03:45.1110605Z libtorch.so => not found 2025-05-07T20:03:45.1110872Z libtorch_cpu.so => not found 2025-05-07T20:03:45.1111139Z libtorch_cuda.so => not found 2025-05-07T20:03:45.1111413Z libtorch.so => not found 2025-05-07T20:03:45.1111652Z libc10.so => not found 2025-05-07T20:03:45.1111904Z libc10_cuda.so => not found 
2025-05-07T20:03:45.1112180Z libtorch_cpu.so => not found 2025-05-07T20:03:45.1112442Z libtorch_cuda.so => not found 2025-05-07T20:03:45.1112781Z libcudart.so.11.0 => not found 2025-05-07T20:03:45.1113063Z libtorch.so => not found 2025-05-07T20:03:45.1113325Z libc10.so => not found 2025-05-07T20:03:45.1113561Z libtorch_cpu.so => not found 2025-05-07T20:03:45.1113845Z libtorch_cuda.so => not found 2025-05-07T20:03:45.1114192Z libpthread.so.0 => /lib64/libpthread.so.0 (0x00007f4e0e621000) 2025-05-07T20:03:45.1114584Z libtorch_cpu.so => not found 2025-05-07T20:03:45.1114851Z libtorch_cuda.so => not found 2025-05-07T20:03:45.1115121Z libtorch.so => not found 2025-05-07T20:03:45.1115420Z librt.so.1 => /lib64/librt.so.1 (0x00007f4e0e61a000) 2025-05-07T20:03:45.1115663Z 2025-05-07T20:03:45.1115778Z [CHECK] Displaying ELF information: 2025-05-07T20:03:45.1116279Z + readelf -d ./_skbuild/linux-x86_64-3.13/cmake-build/fbgemm_gpu_tbe_training_backward_gwd.so 2025-05-07T20:03:45.1116679Z 2025-05-07T20:03:45.1116712Z 2025-05-07T20:03:45.1116886Z Dynamic section at offset 0x4b7dd08 contains 38 entries: 2025-05-07T20:03:45.1117268Z Tag Type Name/Value 2025-05-07T20:03:45.1117709Z 0x0000000000000001 (NEEDED) Shared library: [libc10.so] 2025-05-07T20:03:45.1118219Z 0x0000000000000001 (NEEDED) Shared library: [libc10_cuda.so] 2025-05-07T20:03:45.1118815Z 0x0000000000000001 (NEEDED) Shared library: [fbgemm_gpu_tbe_training_backward.so] 2025-05-07T20:03:45.1119396Z 0x0000000000000001 (NEEDED) Shared library: [libtorch.so] 2025-05-07T20:03:45.1119926Z 0x0000000000000001 (NEEDED) Shared library: [libtorch_cpu.so] 2025-05-07T20:03:45.1120465Z 0x0000000000000001 (NEEDED) Shared library: [libtorch_cuda.so] 2025-05-07T20:03:45.1121114Z 0x0000000000000001 (NEEDED) Shared library: [libcudart.so.11.0] 2025-05-07T20:03:45.1121655Z 0x0000000000000001 (NEEDED) Shared library: [libstdc++.so.6] 2025-05-07T20:03:45.1122169Z 0x0000000000000001 (NEEDED) Shared library: [libgcc_s.so.1] 
2025-05-07T20:03:45.1122688Z 0x0000000000000001 (NEEDED) Shared library: [libc.so.6] 2025-05-07T20:03:45.1123212Z 0x0000000000000001 (NEEDED) Shared library: [ld-linux-x86-64.so.2] 2025-05-07T20:03:45.1123838Z 0x000000000000000e (SONAME) Library soname: [fbgemm_gpu_tbe_training_backward_gwd.so] 2025-05-07T20:03:45.1124433Z 0x000000000000000f (RPATH) Library rpath: [$ORIGIN] 2025-05-07T20:03:45.1124842Z 0x000000000000000c (INIT) 0xac000 2025-05-07T20:03:45.1125193Z 0x000000000000000d (FINI) 0x5df4cc 2025-05-07T20:03:45.1125635Z 0x0000000000000019 (INIT_ARRAY) 0x4b7d9f8 2025-05-07T20:03:45.1125993Z 0x000000000000001b (INIT_ARRAYSZ) 200 (bytes) 2025-05-07T20:03:45.1126323Z 0x000000000000001a (FINI_ARRAY) 0x4b7dac0 2025-05-07T20:03:45.1126663Z 0x000000000000001c (FINI_ARRAYSZ) 8 (bytes) 2025-05-07T20:03:45.1126997Z 0x000000006ffffef5 (GNU_HASH) 0x238 2025-05-07T20:03:45.1127304Z 0x0000000000000005 (STRTAB) 0xc368 2025-05-07T20:03:45.1128076Z 0x0000000000000006 (SYMTAB) 0x2c78 2025-05-07T20:03:45.1128399Z 0x000000000000000a (STRSZ) 595540 (bytes) 2025-05-07T20:03:45.1128750Z 0x000000000000000b (SYMENT) 24 (bytes) 2025-05-07T20:03:45.1129073Z 0x0000000000000003 (PLTGOT) 0x4b7efa8 2025-05-07T20:03:45.1129426Z 0x0000000000000002 (PLTRELSZ) 12672 (bytes) 2025-05-07T20:03:45.1129771Z 0x0000000000000014 (PLTREL) RELA 2025-05-07T20:03:45.1130083Z 0x0000000000000017 (JMPREL) 0xa7fe0 2025-05-07T20:03:45.1130400Z 0x0000000000000007 (RELA) 0x9e770 2025-05-07T20:03:45.1130732Z 0x0000000000000008 (RELASZ) 39024 (bytes) 2025-05-07T20:03:45.1131069Z 0x0000000000000009 (RELAENT) 24 (bytes) 2025-05-07T20:03:45.1131370Z 0x0000000000000018 (BIND_NOW) 2025-05-07T20:03:45.1131684Z 0x000000006ffffffb (FLAGS_1) Flags: NOW 2025-05-07T20:03:45.1132007Z 0x000000006ffffffe (VERNEED) 0x9e650 2025-05-07T20:03:45.1132331Z 0x000000006fffffff (VERNEEDNUM) 5 2025-05-07T20:03:45.1132628Z 0x000000006ffffff0 (VERSYM) 0x9d9bc 2025-05-07T20:03:45.1132948Z 0x000000006ffffff9 (RELACOUNT) 239 
2025-05-07T20:03:45.1133248Z 0x0000000000000000 (NULL) 0x0 2025-05-07T20:03:45.1133440Z 2025-05-07T20:03:45.1133547Z ################################################################################ 2025-05-07T20:03:45.1133760Z 2025-05-07T20:03:45.1133780Z 2025-05-07T20:03:45.1133884Z ################################################################################ 2025-05-07T20:03:45.1134402Z [CHECK] BUILT LIBRARY: ./_skbuild/linux-x86_64-3.13/cmake-build/fbgemm_gpu_tbe_training_backward_vbe.so 2025-05-07T20:03:45.1134932Z [CHECK] Listing out library size: 2025-05-07T20:03:45.1135420Z + du -h --block-size=1M ./_skbuild/linux-x86_64-3.13/cmake-build/fbgemm_gpu_tbe_training_backward_vbe.so 2025-05-07T20:03:45.1135817Z 2025-05-07T20:03:45.1136051Z 175 ./_skbuild/linux-x86_64-3.13/cmake-build/fbgemm_gpu_tbe_training_backward_vbe.so 2025-05-07T20:03:45.1136407Z 2025-05-07T20:03:45.1136821Z [CHECK] Listing out the GLIBC versions referenced by: ./_skbuild/linux-x86_64-3.13/cmake-build/fbgemm_gpu_tbe_training_backward_vbe.so 2025-05-07T20:03:45.1137836Z + objdump -TC ./_skbuild/linux-x86_64-3.13/cmake-build/fbgemm_gpu_tbe_training_backward_vbe.so | grep GLIBC_ | sed 's/.*GLIBC_\([.0-9]*\).*/GLIBC_\1/g' | sort -Vu | cat 2025-05-07T20:03:45.1138461Z 2025-05-07T20:03:45.1704470Z GLIBC_2.2.5 2025-05-07T20:03:45.1705128Z GLIBC_2.3 2025-05-07T20:03:45.1705692Z GLIBC_2.14 2025-05-07T20:03:45.1706040Z 2025-05-07T20:03:45.1706465Z 2025-05-07T20:03:45.1707878Z [CHECK] Listing out the GLIBCXX versions referenced by: ./_skbuild/linux-x86_64-3.13/cmake-build/fbgemm_gpu_tbe_training_backward_vbe.so 2025-05-07T20:03:45.1711314Z + objdump -TC ./_skbuild/linux-x86_64-3.13/cmake-build/fbgemm_gpu_tbe_training_backward_vbe.so | grep GLIBCXX_ | sed 's/.*GLIBCXX_\([.0-9]*\).*/GLIBCXX_\1/g' | sort -Vu | cat 2025-05-07T20:03:45.1713577Z 2025-05-07T20:03:45.2360757Z GLIBCXX_3.4 2025-05-07T20:03:45.2361014Z GLIBCXX_3.4.9 2025-05-07T20:03:45.2361227Z GLIBCXX_3.4.11 2025-05-07T20:03:45.2361448Z 
GLIBCXX_3.4.18 2025-05-07T20:03:45.2361650Z GLIBCXX_3.4.20 2025-05-07T20:03:45.2361864Z GLIBCXX_3.4.21 2025-05-07T20:03:45.2368709Z 2025-05-07T20:03:45.2368723Z 2025-05-07T20:03:45.2394517Z + nm -gDC ./_skbuild/linux-x86_64-3.13/cmake-build/fbgemm_gpu_tbe_training_backward_vbe.so > /tmp/tmp.EeGefPs8wB.symbols.txt 2025-05-07T20:03:45.2395067Z 2025-05-07T20:03:45.3009278Z 2025-05-07T20:03:45.3067132Z [CHECK] Total Number of symbols: 3695 2025-05-07T20:03:45.3117301Z [CHECK] Number of fbgemm symbols: 551 2025-05-07T20:03:45.3148194Z + nm -gDCu ./_skbuild/linux-x86_64-3.13/cmake-build/fbgemm_gpu_tbe_training_backward_vbe.so > /tmp/tmp.zfnQ27DT5q.usymbols.txt 2025-05-07T20:03:45.3149847Z 2025-05-07T20:03:45.3175748Z 2025-05-07T20:03:45.3201654Z [CHECK] Listing out undefined symbols (183 total): 2025-05-07T20:03:45.3218947Z U VTT for std::__cxx11::basic_ostringstream, std::allocator >@GLIBCXX_3.4.21 2025-05-07T20:03:45.3221385Z U VTT for std::__cxx11::basic_stringstream, std::allocator >@GLIBCXX_3.4.21 2025-05-07T20:03:45.3223141Z U _Unwind_Resume@GCC_3.0 2025-05-07T20:03:45.3224305Z U __cudaPopCallConfiguration@libcudart.so.11.0 2025-05-07T20:03:45.3225217Z U __cudaPushCallConfiguration@libcudart.so.11.0 2025-05-07T20:03:45.3225628Z U __cudaRegisterFatBinary@libcudart.so.11.0 2025-05-07T20:03:45.3226143Z U __cudaRegisterFatBinaryEnd@libcudart.so.11.0 2025-05-07T20:03:45.3226533Z U __cudaRegisterFunction@libcudart.so.11.0 2025-05-07T20:03:45.3226991Z U __cudaRegisterVar@libcudart.so.11.0 2025-05-07T20:03:45.3227345Z U __cudaUnregisterFatBinary@libcudart.so.11.0 2025-05-07T20:03:45.3227685Z U __cxa_atexit@GLIBC_2.2.5 2025-05-07T20:03:45.3227973Z U __cxa_begin_catch@CXXABI_1.3 2025-05-07T20:03:45.3228280Z U __cxa_end_catch@CXXABI_1.3 2025-05-07T20:03:45.3228566Z U __cxa_rethrow@CXXABI_1.3 2025-05-07T20:03:45.3228866Z U __cxa_thread_atexit@CXXABI_1.3.7 2025-05-07T20:03:45.3229170Z U __gxx_personality_v0@CXXABI_1.3 2025-05-07T20:03:45.3229478Z U __tls_get_addr@GLIBC_2.3 
2025-05-07T20:03:45.3229766Z U adjust_info_B_num_bits(int, int) 2025-05-07T20:03:45.3230101Z U at::CUDAGeneratorImpl::device_type() 2025-05-07T20:03:45.3230494Z U at::CUDAGeneratorImpl::philox_cuda_state(unsigned long) 2025-05-07T20:03:45.3230894Z U at::Context::deterministicAlgorithms() const 2025-05-07T20:03:45.3231319Z U at::TensorBase::__dispatch_contiguous(c10::MemoryFormat) const 2025-05-07T20:03:45.3231753Z U at::_ops::copy_::call(at::Tensor&, at::Tensor const&, bool) 2025-05-07T20:03:45.3232572Z U at::_ops::empty_like::call(at::Tensor const&, std::optional, std::optional, std::optional, std::optional, std::optional) 2025-05-07T20:03:45.3234275Z U at::_ops::empty_memory_format::call(c10::ArrayRef, std::optional, std::optional, std::optional, std::optional, std::optional) 2025-05-07T20:03:45.3235385Z U at::_ops::reshape::call(at::Tensor const&, c10::ArrayRef) 2025-05-07T20:03:45.3236164Z U at::_ops::to_dtype::call(at::Tensor const&, c10::ScalarType, bool, bool, std::optional) 2025-05-07T20:03:45.3237103Z U at::_ops::zeros::call(c10::ArrayRef, std::optional, std::optional, std::optional, std::optional) 2025-05-07T20:03:45.3238328Z U at::_ops::zeros_like::call(at::Tensor const&, std::optional, std::optional, std::optional, std::optional, std::optional) 2025-05-07T20:03:45.3239199Z U at::cuda::detail::getDefaultCUDAGenerator(signed char) 2025-05-07T20:03:45.3239763Z U at::cuda::getCurrentDeviceProperties() 2025-05-07T20:03:45.3240101Z U at::globalContext() 2025-05-07T20:03:45.3240511Z U c10::BFloat16* at::TensorBase::mutable_data_ptr() const 2025-05-07T20:03:45.3240937Z U c10::BoolType::get() 2025-05-07T20:03:45.3241286Z U c10::DeviceTypeName[abi:cxx11](c10::DeviceType, bool) 2025-05-07T20:03:45.3241680Z U c10::FloatType::get() 2025-05-07T20:03:45.3241993Z U c10::GeneratorImpl::device() const 2025-05-07T20:03:45.3242412Z U c10::Half* at::TensorBase::mutable_data_ptr() const 2025-05-07T20:03:45.3242877Z U c10::IValue::reportToTensorTypeError() const 
2025-05-07T20:03:45.3243252Z U c10::IntType::get() 2025-05-07T20:03:45.3243643Z U c10::MessageLogger::MessageLogger(char const*, int, int) 2025-05-07T20:03:45.3244034Z U c10::MessageLogger::~MessageLogger() 2025-05-07T20:03:45.3244468Z U c10::StorageImpl::throw_data_ptr_access_error() const 2025-05-07T20:03:45.3244873Z U c10::SymBool::expect_true(char const*, long) const 2025-05-07T20:03:45.3245281Z U c10::SymBool::guard_bool(char const*, long) const 2025-05-07T20:03:45.3245704Z U c10::SymBool::guard_size_oblivious(char const*, long) const 2025-05-07T20:03:45.3246116Z U c10::SymFloat::guard_float(char const*, long) const 2025-05-07T20:03:45.3246791Z U c10::SymInt::SymInt(c10::intrusive_ptr >) 2025-05-07T20:03:45.3247433Z U c10::SymInt::guard_int(char const*, long) const 2025-05-07T20:03:45.3247852Z U c10::SymInt::operator/(c10::SymInt const&) const 2025-05-07T20:03:45.3248234Z U c10::SymInt::promote_to_negative() 2025-05-07T20:03:45.3248582Z U c10::SymInt::sym_eq(c10::SymInt const&) const 2025-05-07T20:03:45.3248970Z U c10::SymInt::sym_gt(c10::SymInt const&) const 2025-05-07T20:03:45.3249334Z U c10::SymInt::sym_le(c10::SymInt const&) const 2025-05-07T20:03:45.3249704Z U c10::SymInt::toSymNode() const 2025-05-07T20:03:45.3250006Z U c10::SymIntType::get() 2025-05-07T20:03:45.3250363Z U c10::SymbolicShapeMeta::init_is_contiguous() const 2025-05-07T20:03:45.3250788Z U c10::TensorImpl::throw_data_ptr_access_error() const 2025-05-07T20:03:45.3251146Z U c10::TensorType::get() 2025-05-07T20:03:45.3251487Z U c10::UndefinedTensorImpl::_singleton 2025-05-07T20:03:45.3252383Z U c10::Warning::Warning(std::variant, c10::SourceLocation const&, std::__cxx11::basic_string, std::allocator >, bool) 2025-05-07T20:03:45.3253313Z U c10::cuda::CUDACachingAllocator::allocator 2025-05-07T20:03:45.3253668Z U c10::cuda::CUDAStream::stream() const 2025-05-07T20:03:45.3254008Z U c10::cuda::ExchangeDevice(signed char) 2025-05-07T20:03:45.3254366Z U c10::cuda::GetDevice(signed char*) 
2025-05-07T20:03:45.3254763Z U c10::cuda::MaybeSetDevice(signed char) 2025-05-07T20:03:45.3255105Z U c10::cuda::SetDevice(signed char) 2025-05-07T20:03:45.3255588Z U c10::cuda::c10_cuda_check_implementation(int, char const*, char const*, int, bool) 2025-05-07T20:03:45.3256031Z U c10::cuda::device_count() 2025-05-07T20:03:45.3271646Z U c10::cuda::getCurrentCUDAStream(signed char) 2025-05-07T20:03:45.3272432Z U c10::cuda::getDefaultCUDAStream(signed char) 2025-05-07T20:03:45.3273006Z U c10::cuda::getStreamFromPool(bool, signed char) 2025-05-07T20:03:45.3273490Z U c10::cuda::getStreamFromPool(int, signed char) 2025-05-07T20:03:45.3273930Z U c10::cuda::setCurrentCUDAStream(c10::cuda::CUDAStream) 2025-05-07T20:03:45.3274370Z U c10::cuda::warn_or_error_on_sync() 2025-05-07T20:03:45.3275142Z U c10::detail::infer_schema::make_function_schema(c10::ArrayRef, c10::ArrayRef) 2025-05-07T20:03:45.3276066Z U c10::detail::torchCheckFail(char const*, char const*, unsigned int, char const*) 2025-05-07T20:03:45.3277118Z U c10::detail::torchCheckFail(char const*, char const*, unsigned int, std::__cxx11::basic_string, std::allocator > const&) 2025-05-07T20:03:45.3278128Z U c10::detail::torchInternalAssertFail(char const*, char const*, unsigned int, char const*, char const*) 2025-05-07T20:03:45.3279247Z U c10::detail::torchInternalAssertFail(char const*, char const*, unsigned int, char const*, std::__cxx11::basic_string, std::allocator > const&) 2025-05-07T20:03:45.3280201Z U c10::impl::GPUTrace::gpuTraceState 2025-05-07T20:03:45.3280522Z U c10::impl::GPUTrace::haveState 2025-05-07T20:03:45.3280889Z U c10::impl::cow::is_cow_data_ptr(c10::DataPtr const&) 2025-05-07T20:03:45.3281307Z U c10::impl::cow::materialize_cow_storage(c10::StorageImpl&) 2025-05-07T20:03:45.3281693Z U c10::impl::device_guard_impl_registry 2025-05-07T20:03:45.3282058Z U c10::operator+(c10::SymInt const&, int) 2025-05-07T20:03:45.3282412Z U c10::operator-(c10::SymInt const&, int) 2025-05-07T20:03:45.3282816Z U 
c10::operator<<(std::ostream&, c10::Device const&) 2025-05-07T20:03:45.3283210Z U c10::operator<<(std::ostream&, c10::DeviceType) 2025-05-07T20:03:45.3283575Z U c10::operator<<(std::ostream&, c10::SymInt const&) 2025-05-07T20:03:45.3283931Z U c10::throwNullDataPtrError() 2025-05-07T20:03:45.3284236Z U c10::warn(c10::Warning const&) 2025-05-07T20:03:45.3284559Z U c10::warnDeprecatedDataPtr() 2025-05-07T20:03:45.3284952Z U caffe2::TypeMeta::error_unsupported_typemeta(caffe2::TypeMeta) 2025-05-07T20:03:45.3285381Z U caffe2::TypeMeta::typeMetaDatas() 2025-05-07T20:03:45.3285757Z U cudaDeviceGetAttribute@libcudart.so.11.0 2025-05-07T20:03:45.3286122Z U cudaDeviceSynchronize@libcudart.so.11.0 2025-05-07T20:03:45.3286516Z U cudaEventCreateWithFlags@libcudart.so.11.0 2025-05-07T20:03:45.3286861Z U cudaEventDestroy@libcudart.so.11.0 2025-05-07T20:03:45.3287238Z U cudaEventElapsedTime@libcudart.so.11.0 2025-05-07T20:03:45.3287584Z U cudaEventQuery@libcudart.so.11.0 2025-05-07T20:03:45.3287955Z U cudaEventRecord@libcudart.so.11.0 2025-05-07T20:03:45.3288315Z U cudaEventSynchronize@libcudart.so.11.0 2025-05-07T20:03:45.3288655Z U cudaFuncSetAttribute@libcudart.so.11.0 2025-05-07T20:03:45.3289029Z U cudaGetDeviceProperties@libcudart.so.11.0 2025-05-07T20:03:45.3289458Z U cudaGetErrorString@libcudart.so.11.0 2025-05-07T20:03:45.3289801Z U cudaGetLastError@libcudart.so.11.0 2025-05-07T20:03:45.3290115Z U cudaLaunchKernel@libcudart.so.11.0 2025-05-07T20:03:45.3290451Z U cudaStreamQuery@libcudart.so.11.0 2025-05-07T20:03:45.3290794Z U cudaStreamSynchronize@libcudart.so.11.0 2025-05-07T20:03:45.3291155Z U cudaStreamWaitEvent@libcudart.so.11.0 2025-05-07T20:03:45.3293474Z U embedding_ops::split_embedding_backward_codegen_find_long_segments(at::GenericPackedTensorAccessor, at::GenericPackedTensorAccessor, at::GenericPackedTensorAccessor, at::GenericPackedTensorAccessor, at::GenericPackedTensorAccessor, at::GenericPackedTensorAccessor, at::GenericPackedTensorAccessor, int, int, bool) 
2025-05-07T20:03:45.3295862Z U fbgemm_gpu::asynchronous_complete_cumsum_gpu(at::Tensor const&) 2025-05-07T20:03:45.3296298Z U float at::Tensor::item() const 2025-05-07T20:03:45.3296689Z U float* at::TensorBase::data_ptr() const 2025-05-07T20:03:45.3297122Z U float* at::TensorBase::mutable_data_ptr() const 2025-05-07T20:03:45.3297506Z U int* at::TensorBase::data_ptr() const 2025-05-07T20:03:45.3297901Z U int* at::TensorBase::mutable_data_ptr() const 2025-05-07T20:03:45.3298362Z U long c10::detail::maybe_wrap_dim_slow(long, long, bool) 2025-05-07T20:03:45.3298776Z U long* at::TensorBase::data_ptr() const 2025-05-07T20:03:45.3299183Z U long* at::TensorBase::mutable_data_ptr() const 2025-05-07T20:03:45.3299537Z U memcpy@GLIBC_2.14 2025-05-07T20:03:45.3299837Z U memset@GLIBC_2.2.5 2025-05-07T20:03:45.3300130Z U operator delete(void*)@GLIBCXX_3.4 2025-05-07T20:03:45.3300481Z U operator new(unsigned long)@GLIBCXX_3.4 2025-05-07T20:03:45.3301052Z U radix_sort_pairs(void*, unsigned long&, int const*, int*, float const*, float*, int, int, int, CUstream_st*) 2025-05-07T20:03:45.3301770Z U radix_sort_pairs(void*, unsigned long&, int const*, int*, int const*, int*, int, int, int, CUstream_st*) 2025-05-07T20:03:45.3303090Z U radix_sort_pairs(void*, unsigned long&, int const*, int*, long const*, long*, int, int, int, CUstream_st*) 2025-05-07T20:03:45.3303874Z U radix_sort_pairs(void*, unsigned long&, long const*, long*, float const*, float*, int, int, int, CUstream_st*) 2025-05-07T20:03:45.3304673Z U radix_sort_pairs(void*, unsigned long&, long const*, long*, int const*, int*, int, int, int, CUstream_st*) 2025-05-07T20:03:45.3305486Z U radix_sort_pairs(void*, unsigned long&, long const*, long*, long const*, long*, int, int, int, CUstream_st*) 2025-05-07T20:03:45.3306291Z U std::__cxx11::basic_ostringstream, std::allocator >::basic_ostringstream() 2025-05-07T20:03:45.3307179Z U std::__cxx11::basic_ostringstream, std::allocator >::~basic_ostringstream()@GLIBCXX_3.4.21 
2025-05-07T20:03:45.3308131Z U std::__cxx11::basic_string, std::allocator >::_M_append(char const*, unsigned long)@GLIBCXX_3.4.21 2025-05-07T20:03:45.3309197Z U std::__cxx11::basic_string, std::allocator >::_M_assign(std::__cxx11::basic_string, std::allocator > const&) 2025-05-07T20:03:45.3310292Z U std::__cxx11::basic_string, std::allocator >::_M_construct(unsigned long, char)@GLIBCXX_3.4.21 2025-05-07T20:03:45.3311369Z U std::__cxx11::basic_string, std::allocator >::_M_create(unsigned long&, unsigned long)@GLIBCXX_3.4.21 2025-05-07T20:03:45.3312373Z U std::__cxx11::basic_string, std::allocator >::_M_mutate(unsigned long, unsigned long, char const*, unsigned long) 2025-05-07T20:03:45.3313593Z U std::__cxx11::basic_string, std::allocator >::_M_replace(unsigned long, unsigned long, char const*, unsigned long)@GLIBCXX_3.4.21 2025-05-07T20:03:45.3314614Z U std::__cxx11::basic_stringstream, std::allocator >::basic_stringstream() 2025-05-07T20:03:45.3315454Z U std::__detail::_Prime_rehash_policy::_M_need_rehash(unsigned long, unsigned long, unsigned long) const@GLIBCXX_3.4.18 2025-05-07T20:03:45.3316097Z U std::__throw_bad_alloc()@GLIBCXX_3.4 2025-05-07T20:03:45.3316446Z U std::__throw_bad_array_new_length() 2025-05-07T20:03:45.3316848Z U std::__throw_length_error(char const*)@GLIBCXX_3.4 2025-05-07T20:03:45.3317245Z U std::__throw_logic_error(char const*)@GLIBCXX_3.4 2025-05-07T20:03:45.3317714Z U std::__throw_out_of_range_fmt(char const*, ...)@GLIBCXX_3.4.20 2025-05-07T20:03:45.3318141Z U std::__throw_system_error(int)@GLIBCXX_3.4.11 2025-05-07T20:03:45.3318614Z U std::basic_ios >::clear(std::_Ios_Iostate)@GLIBCXX_3.4 2025-05-07T20:03:45.3319627Z U std::basic_ostream >& std::__ostream_insert >(std::basic_ostream >&, char const*, long)@GLIBCXX_3.4.9 2025-05-07T20:03:45.3320452Z U std::ios_base::Init::Init()@GLIBCXX_3.4 2025-05-07T20:03:45.3320813Z U std::ios_base::Init::~Init()@GLIBCXX_3.4 2025-05-07T20:03:45.3321189Z U std::ios_base::~ios_base()@GLIBCXX_3.4 
2025-05-07T20:03:45.3321521Z U std::locale::~locale()@GLIBCXX_3.4 2025-05-07T20:03:45.3321933Z U std::ostream& std::ostream::_M_insert(long)@GLIBCXX_3.4.9 2025-05-07T20:03:45.3322487Z U std::ostream& std::ostream::_M_insert(unsigned long)@GLIBCXX_3.4.9 2025-05-07T20:03:45.3322982Z U std::ostream::operator<<(int)@GLIBCXX_3.4 2025-05-07T20:03:45.3323353Z U std::terminate()@GLIBCXX_3.4 2025-05-07T20:03:45.3323684Z U strlen@GLIBC_2.2.5 2025-05-07T20:03:45.3324022Z U torch::CppFunction::~CppFunction() 2025-05-07T20:03:45.3324875Z U torch::Library::Library(torch::Library::Kind, std::__cxx11::basic_string, std::allocator >, std::optional, char const*, unsigned int) 2025-05-07T20:03:45.3326058Z U torch::Library::_def(c10::FunctionSchema&&, c10::OperatorName*, std::vector > const&, torch::_RegisterOrVerify) & 2025-05-07T20:03:45.3326915Z U torch::Library::_impl(char const*, torch::CppFunction&&, torch::_RegisterOrVerify) & 2025-05-07T20:03:45.3327838Z U torch::jit::parseSchema(std::__cxx11::basic_string, std::allocator > const&, bool) 2025-05-07T20:03:45.3329134Z U transpose_embedding_input(at::Tensor, long, at::Tensor, at::Tensor, bool, std::optional const&, long, long, long, bool, std::optional const&, long, long) 2025-05-07T20:03:45.3331981Z U void embedding_ops::grad_mean_vbe_kernel(at::GenericPackedTensorAccessor, at::GenericPackedTensorAccessor, at::GenericPackedTensorAccessor, at::GenericPackedTensorAccessor, at::GenericPackedTensorAccessor, at::GenericPackedTensorAccessor, int, unsigned int) 2025-05-07T20:03:45.3336205Z U void embedding_ops::grad_mean_vbe_kernel(at::GenericPackedTensorAccessor, at::GenericPackedTensorAccessor, at::GenericPackedTensorAccessor, at::GenericPackedTensorAccessor, at::GenericPackedTensorAccessor, at::GenericPackedTensorAccessor, int, unsigned int) 2025-05-07T20:03:45.3340308Z U void embedding_ops::grad_mean_vbe_kernel(at::GenericPackedTensorAccessor, at::GenericPackedTensorAccessor, at::GenericPackedTensorAccessor, 
at::GenericPackedTensorAccessor, at::GenericPackedTensorAccessor, at::GenericPackedTensorAccessor, int, unsigned int) 2025-05-07T20:03:45.3344458Z U void embedding_ops::grad_mean_vbe_kernel(at::GenericPackedTensorAccessor, at::GenericPackedTensorAccessor, at::GenericPackedTensorAccessor, at::GenericPackedTensorAccessor, at::GenericPackedTensorAccessor, at::GenericPackedTensorAccessor, int, unsigned int) 2025-05-07T20:03:45.3348130Z U void embedding_ops::grad_mean_vbe_kernel(at::GenericPackedTensorAccessor, at::GenericPackedTensorAccessor, at::GenericPackedTensorAccessor, at::GenericPackedTensorAccessor, at::GenericPackedTensorAccessor, at::GenericPackedTensorAccessor, int, unsigned int) 2025-05-07T20:03:45.3351793Z U void embedding_ops::grad_mean_vbe_kernel(at::GenericPackedTensorAccessor, at::GenericPackedTensorAccessor, at::GenericPackedTensorAccessor, at::GenericPackedTensorAccessor, at::GenericPackedTensorAccessor, at::GenericPackedTensorAccessor, int, unsigned int) 2025-05-07T20:03:45.3355727Z U void embedding_ops::split_embedding_backward_count_unique_indices_kernel(at::GenericPackedTensorAccessor, at::GenericPackedTensorAccessor, at::GenericPackedTensorAccessor, at::GenericPackedTensorAccessor, at::GenericPackedTensorAccessor, int) 2025-05-07T20:03:45.3357757Z U vtable for __cxxabiv1::__class_type_info@CXXABI_1.3 2025-05-07T20:03:45.3358206Z U vtable for __cxxabiv1::__function_type_info@CXXABI_1.3 2025-05-07T20:03:45.3358642Z U vtable for __cxxabiv1::__si_class_type_info@CXXABI_1.3 2025-05-07T20:03:45.3359263Z U vtable for std::__cxx11::basic_stringbuf, std::allocator >@GLIBCXX_3.4.21 2025-05-07T20:03:45.3360034Z U vtable for std::basic_streambuf >@GLIBCXX_3.4 2025-05-07T20:03:45.3360475Z w _ITM_deregisterTMCloneTable 2025-05-07T20:03:45.3360810Z w _ITM_registerTMCloneTable 2025-05-07T20:03:45.3361118Z w __cxa_finalize@GLIBC_2.2.5 2025-05-07T20:03:45.3361430Z w __gmon_start__ 2025-05-07T20:03:45.3361709Z w __pthread_key_create 2025-05-07T20:03:45.3362213Z w 
pthread_mutex_lock@GLIBC_2.2.5 2025-05-07T20:03:45.3362550Z w pthread_mutex_unlock@GLIBC_2.2.5 2025-05-07T20:03:45.3362924Z [CHECK] Listing out external shared libraries linked: 2025-05-07T20:03:45.3363447Z + ldd ./_skbuild/linux-x86_64-3.13/cmake-build/fbgemm_gpu_tbe_training_backward_vbe.so 2025-05-07T20:03:45.3363841Z 2025-05-07T20:03:45.3363996Z linux-vdso.so.1 (0x00007ffc0d9f6000) 2025-05-07T20:03:45.3364321Z libc10.so => not found 2025-05-07T20:03:45.3364564Z libc10_cuda.so => not found 2025-05-07T20:03:45.3365330Z fbgemm_gpu_tbe_training_backward.so => /__w/FBGEMM/FBGEMM/fbgemm_gpu/./_skbuild/linux-x86_64-3.13/cmake-build/fbgemm_gpu_tbe_training_backward.so (0x00007f2458400000) 2025-05-07T20:03:45.3366199Z libtorch.so => not found 2025-05-07T20:03:45.3366438Z libtorch_cpu.so => not found 2025-05-07T20:03:45.3366699Z libtorch_cuda.so => not found 2025-05-07T20:03:45.3366980Z libcudart.so.11.0 => not found 2025-05-07T20:03:45.3367314Z libstdc++.so.6 => /lib64/libstdc++.so.6 (0x00007f245819c000) 2025-05-07T20:03:45.3367699Z libgomp.so.1 => /lib64/libgomp.so.1 (0x00007f2483caf000) 2025-05-07T20:03:45.3368087Z libgcc_s.so.1 => /lib64/libgcc_s.so.1 (0x00007f2483c81000) 2025-05-07T20:03:45.3368435Z libc.so.6 => /lib64/libc.so.6 (0x00007f2457f94000) 2025-05-07T20:03:45.3368792Z /lib64/ld-linux-x86-64.so.2 (0x00007f2483d0b000) 2025-05-07T20:03:45.3369105Z libc10.so => not found 2025-05-07T20:03:45.3369334Z libc10_cuda.so => not found 2025-05-07T20:03:45.3369950Z fbgemm_gpu_tbe_common.so => /__w/FBGEMM/FBGEMM/fbgemm_gpu/./_skbuild/linux-x86_64-3.13/cmake-build/fbgemm_gpu_tbe_common.so (0x00007f2457d9e000) 2025-05-07T20:03:45.3371003Z fbgemm_gpu_sparse_async_cumsum.so => /__w/FBGEMM/FBGEMM/fbgemm_gpu/./_skbuild/linux-x86_64-3.13/cmake-build/fbgemm_gpu_sparse_async_cumsum.so (0x00007f2457ca6000) 2025-05-07T20:03:45.3371709Z libtorch.so => not found 2025-05-07T20:03:45.3372202Z fbgemm.so => /__w/FBGEMM/FBGEMM/fbgemm_gpu/./_skbuild/linux-x86_64-3.13/cmake-build/fbgemm.so 
(0x00007f2457600000) 2025-05-07T20:03:45.3373084Z fbgemm_gpu_tbe_utils.so => /__w/FBGEMM/FBGEMM/fbgemm_gpu/./_skbuild/linux-x86_64-3.13/cmake-build/fbgemm_gpu_tbe_utils.so (0x00007f2456c00000) 2025-05-07T20:03:45.3373722Z libtorch_cpu.so => not found 2025-05-07T20:03:45.3373993Z libtorch_cuda.so => not found 2025-05-07T20:03:45.3374278Z libcudart.so.11.0 => not found 2025-05-07T20:03:45.3374561Z libm.so.6 => /lib64/libm.so.6 (0x00007f2457bcb000) 2025-05-07T20:03:45.3374868Z libc10.so => not found 2025-05-07T20:03:45.3375102Z libc10_cuda.so => not found 2025-05-07T20:03:45.3375691Z fbgemm_gpu_config.so => /__w/FBGEMM/FBGEMM/fbgemm_gpu/./_skbuild/linux-x86_64-3.13/cmake-build/fbgemm_gpu_config.so (0x00007f2483c70000) 2025-05-07T20:03:45.3376316Z libtorch.so => not found 2025-05-07T20:03:45.3376559Z libtorch_cpu.so => not found 2025-05-07T20:03:45.3376829Z libtorch_cuda.so => not found 2025-05-07T20:03:45.3377080Z libcudart.so.11.0 => not found 2025-05-07T20:03:45.3377332Z libc10.so => not found 2025-05-07T20:03:45.3377548Z libc10_cuda.so => not found 2025-05-07T20:03:45.3377801Z libtorch.so => not found 2025-05-07T20:03:45.3378031Z libtorch_cpu.so => not found 2025-05-07T20:03:45.3378303Z libtorch_cuda.so => not found 2025-05-07T20:03:45.3378561Z libcudart.so.11.0 => not found 2025-05-07T20:03:45.3378835Z libc10.so => not found 2025-05-07T20:03:45.3379318Z asmjit.so => /__w/FBGEMM/FBGEMM/fbgemm_gpu/./_skbuild/linux-x86_64-3.13/cmake-build/asmjit.so (0x00007f2478588000) 2025-05-07T20:03:45.3379899Z libtorch.so => not found 2025-05-07T20:03:45.3380164Z libtorch_cpu.so => not found 2025-05-07T20:03:45.3380415Z libtorch_cuda.so => not found 2025-05-07T20:03:45.3380691Z libtorch.so => not found 2025-05-07T20:03:45.3380919Z libc10.so => not found 2025-05-07T20:03:45.3381165Z libc10_cuda.so => not found 2025-05-07T20:03:45.3381406Z libtorch_cpu.so => not found 2025-05-07T20:03:45.3381674Z libtorch_cuda.so => not found 2025-05-07T20:03:45.3381929Z libcudart.so.11.0 => not 
found 2025-05-07T20:03:45.3382196Z libtorch.so => not found 2025-05-07T20:03:45.3382440Z libc10.so => not found 2025-05-07T20:03:45.3382666Z libtorch_cpu.so => not found 2025-05-07T20:03:45.3382926Z libtorch_cuda.so => not found 2025-05-07T20:03:45.3383262Z libpthread.so.0 => /lib64/libpthread.so.0 (0x00007f2483c61000) 2025-05-07T20:03:45.3383641Z libtorch_cpu.so => not found 2025-05-07T20:03:45.3383892Z libtorch_cuda.so => not found 2025-05-07T20:03:45.3384148Z libtorch.so => not found 2025-05-07T20:03:45.3384429Z librt.so.1 => /lib64/librt.so.1 (0x00007f2478581000) 2025-05-07T20:03:45.3384677Z 2025-05-07T20:03:45.3384783Z [CHECK] Displaying ELF information: 2025-05-07T20:03:45.3385257Z + readelf -d ./_skbuild/linux-x86_64-3.13/cmake-build/fbgemm_gpu_tbe_training_backward_vbe.so 2025-05-07T20:03:45.3385636Z 2025-05-07T20:03:45.3385640Z 2025-05-07T20:03:45.3385796Z Dynamic section at offset 0xaed9e48 contains 39 entries: 2025-05-07T20:03:45.3386195Z Tag Type Name/Value 2025-05-07T20:03:45.3386587Z 0x0000000000000001 (NEEDED) Shared library: [libc10.so] 2025-05-07T20:03:45.3387080Z 0x0000000000000001 (NEEDED) Shared library: [libc10_cuda.so] 2025-05-07T20:03:45.3387632Z 0x0000000000000001 (NEEDED) Shared library: [fbgemm_gpu_tbe_training_backward.so] 2025-05-07T20:03:45.3388181Z 0x0000000000000001 (NEEDED) Shared library: [libtorch.so] 2025-05-07T20:03:45.3388668Z 0x0000000000000001 (NEEDED) Shared library: [libtorch_cpu.so] 2025-05-07T20:03:45.3389150Z 0x0000000000000001 (NEEDED) Shared library: [libtorch_cuda.so] 2025-05-07T20:03:45.3389654Z 0x0000000000000001 (NEEDED) Shared library: [libcudart.so.11.0] 2025-05-07T20:03:45.3390137Z 0x0000000000000001 (NEEDED) Shared library: [libstdc++.so.6] 2025-05-07T20:03:45.3390628Z 0x0000000000000001 (NEEDED) Shared library: [libgomp.so.1] 2025-05-07T20:03:45.3391128Z 0x0000000000000001 (NEEDED) Shared library: [libgcc_s.so.1] 2025-05-07T20:03:45.3391601Z 0x0000000000000001 (NEEDED) Shared library: [libc.so.6] 
2025-05-07T20:03:45.3392101Z 0x0000000000000001 (NEEDED) Shared library: [ld-linux-x86-64.so.2] 2025-05-07T20:03:45.3392659Z 0x000000000000000e (SONAME) Library soname: [fbgemm_gpu_tbe_training_backward_vbe.so] 2025-05-07T20:03:45.3393496Z 0x000000000000000f (RPATH) Library rpath: [$ORIGIN] 2025-05-07T20:03:45.3393905Z 0x000000000000000c (INIT) 0x1ad000 2025-05-07T20:03:45.3394278Z 0x000000000000000d (FINI) 0xe4d99c 2025-05-07T20:03:45.3394621Z 0x0000000000000019 (INIT_ARRAY) 0xaed55e8 2025-05-07T20:03:45.3394985Z 0x000000000000001b (INIT_ARRAYSZ) 680 (bytes) 2025-05-07T20:03:45.3395351Z 0x000000000000001a (FINI_ARRAY) 0xaed5890 2025-05-07T20:03:45.3395697Z 0x000000000000001c (FINI_ARRAYSZ) 8 (bytes) 2025-05-07T20:03:45.3396065Z 0x000000006ffffef5 (GNU_HASH) 0x238 2025-05-07T20:03:45.3396396Z 0x0000000000000005 (STRTAB) 0x1b3a0 2025-05-07T20:03:45.3396735Z 0x0000000000000006 (SYMTAB) 0x5920 2025-05-07T20:03:45.3397092Z 0x000000000000000a (STRSZ) 1481806 (bytes) 2025-05-07T20:03:45.3397471Z 0x000000000000000b (SYMENT) 24 (bytes) 2025-05-07T20:03:45.3397839Z 0x0000000000000003 (PLTGOT) 0xaedb0f8 2025-05-07T20:03:45.3398212Z 0x0000000000000002 (PLTRELSZ) 22176 (bytes) 2025-05-07T20:03:45.3398574Z 0x0000000000000014 (PLTREL) RELA 2025-05-07T20:03:45.3398978Z 0x0000000000000017 (JMPREL) 0x1a6bf0 2025-05-07T20:03:45.3399337Z 0x0000000000000007 (RELA) 0x186df0 2025-05-07T20:03:45.3399704Z 0x0000000000000008 (RELASZ) 130560 (bytes) 2025-05-07T20:03:45.3400092Z 0x0000000000000009 (RELAENT) 24 (bytes) 2025-05-07T20:03:45.3400427Z 0x0000000000000018 (BIND_NOW) 2025-05-07T20:03:45.3400752Z 0x000000006ffffffb (FLAGS_1) Flags: NOW 2025-05-07T20:03:45.3401117Z 0x000000006ffffffe (VERNEED) 0x186cd0 2025-05-07T20:03:45.3401448Z 0x000000006fffffff (VERNEEDNUM) 5 2025-05-07T20:03:45.3401572Z 0x000000006ffffff0 (VERSYM) 0x184fee 2025-05-07T20:03:45.3401699Z 0x000000006ffffff9 (RELACOUNT) 811 2025-05-07T20:03:45.3401808Z 0x0000000000000000 (NULL) 0x0 2025-05-07T20:03:45.3401812Z 
2025-05-07T20:03:45.3401932Z ################################################################################ 2025-05-07T20:03:45.3401940Z 2025-05-07T20:03:45.3401962Z 2025-05-07T20:03:45.3402229Z ################################################################################ 2025-05-07T20:03:45.3402578Z [CHECK] BUILT LIBRARY: ./_skbuild/linux-x86_64-3.13/cmake-build/fbgemm_gpu_tbe_training_backward_dense.so 2025-05-07T20:03:45.3402684Z [CHECK] Listing out library size: 2025-05-07T20:03:45.3404446Z + du -h --block-size=1M ./_skbuild/linux-x86_64-3.13/cmake-build/fbgemm_gpu_tbe_training_backward_dense.so 2025-05-07T20:03:45.3404456Z 2025-05-07T20:03:45.3404722Z 31 ./_skbuild/linux-x86_64-3.13/cmake-build/fbgemm_gpu_tbe_training_backward_dense.so 2025-05-07T20:03:45.3404727Z 2025-05-07T20:03:45.3405236Z [CHECK] Listing out the GLIBC versions referenced by: ./_skbuild/linux-x86_64-3.13/cmake-build/fbgemm_gpu_tbe_training_backward_dense.so 2025-05-07T20:03:45.3405820Z + objdump -TC ./_skbuild/linux-x86_64-3.13/cmake-build/fbgemm_gpu_tbe_training_backward_dense.so | grep GLIBC_ | sed 's/.*GLIBC_\([.0-9]*\).*/GLIBC_\1/g' | sort -Vu | cat 2025-05-07T20:03:45.3405831Z 2025-05-07T20:03:45.3491633Z GLIBC_2.2.5 2025-05-07T20:03:45.3491942Z GLIBC_2.3 2025-05-07T20:03:45.3492074Z GLIBC_2.14 2025-05-07T20:03:45.3492117Z 2025-05-07T20:03:45.3492121Z 2025-05-07T20:03:45.3492638Z [CHECK] Listing out the GLIBCXX versions referenced by: ./_skbuild/linux-x86_64-3.13/cmake-build/fbgemm_gpu_tbe_training_backward_dense.so 2025-05-07T20:03:45.3493271Z + objdump -TC ./_skbuild/linux-x86_64-3.13/cmake-build/fbgemm_gpu_tbe_training_backward_dense.so | grep GLIBCXX_ | sed 's/.*GLIBCXX_\([.0-9]*\).*/GLIBCXX_\1/g' | sort -Vu | cat 2025-05-07T20:03:45.3493277Z 2025-05-07T20:03:45.3647649Z GLIBCXX_3.4 2025-05-07T20:03:45.3648564Z GLIBCXX_3.4.9 2025-05-07T20:03:45.3648830Z GLIBCXX_3.4.11 2025-05-07T20:03:45.3649061Z GLIBCXX_3.4.15 2025-05-07T20:03:45.3649339Z GLIBCXX_3.4.18 
2025-05-07T20:03:45.3649565Z GLIBCXX_3.4.20 2025-05-07T20:03:45.3649778Z GLIBCXX_3.4.21 2025-05-07T20:03:45.3649796Z 2025-05-07T20:03:45.3649809Z 2025-05-07T20:03:45.3670470Z + nm -gDC ./_skbuild/linux-x86_64-3.13/cmake-build/fbgemm_gpu_tbe_training_backward_dense.so > /tmp/tmp.XIEfxVOH1y.symbols.txt 2025-05-07T20:03:45.3670547Z 2025-05-07T20:03:45.3791411Z 2025-05-07T20:03:45.3814378Z [CHECK] Total Number of symbols: 1857 2025-05-07T20:03:45.3834819Z [CHECK] Number of fbgemm symbols: 100 2025-05-07T20:03:45.3852330Z + nm -gDCu ./_skbuild/linux-x86_64-3.13/cmake-build/fbgemm_gpu_tbe_training_backward_dense.so > /tmp/tmp.77aSkZ9StZ.usymbols.txt 2025-05-07T20:03:45.3852365Z 2025-05-07T20:03:45.3873501Z 2025-05-07T20:03:45.3899112Z [CHECK] Listing out undefined symbols (267 total): 2025-05-07T20:03:45.3916097Z U VTT for std::__cxx11::basic_ostringstream, std::allocator >@GLIBCXX_3.4.21 2025-05-07T20:03:45.3916478Z U VTT for std::__cxx11::basic_stringstream, std::allocator >@GLIBCXX_3.4.21 2025-05-07T20:03:45.3916679Z U _Unwind_Resume@GCC_3.0 2025-05-07T20:03:45.3917100Z U __cudaPopCallConfiguration@libcudart.so.11.0 2025-05-07T20:03:45.3917246Z U __cudaPushCallConfiguration@libcudart.so.11.0 2025-05-07T20:03:45.3917434Z U __cudaRegisterFatBinary@libcudart.so.11.0 2025-05-07T20:03:45.3917583Z U __cudaRegisterFatBinaryEnd@libcudart.so.11.0 2025-05-07T20:03:45.3917727Z U __cudaRegisterFunction@libcudart.so.11.0 2025-05-07T20:03:45.3917866Z U __cudaRegisterVar@libcudart.so.11.0 2025-05-07T20:03:45.3918006Z U __cudaUnregisterFatBinary@libcudart.so.11.0 2025-05-07T20:03:45.3918127Z U __cxa_allocate_exception@CXXABI_1.3 2025-05-07T20:03:45.3918249Z U __cxa_atexit@GLIBC_2.2.5 2025-05-07T20:03:45.3918358Z U __cxa_begin_catch@CXXABI_1.3 2025-05-07T20:03:45.3918464Z U __cxa_end_catch@CXXABI_1.3 2025-05-07T20:03:45.3918575Z U __cxa_free_exception@CXXABI_1.3 2025-05-07T20:03:45.3918693Z U __cxa_guard_abort@CXXABI_1.3 2025-05-07T20:03:45.3918807Z U __cxa_guard_acquire@CXXABI_1.3 
2025-05-07T20:03:45.3918918Z U __cxa_guard_release@CXXABI_1.3 2025-05-07T20:03:45.3919043Z U __cxa_rethrow@CXXABI_1.3 2025-05-07T20:03:45.3919157Z U __cxa_thread_atexit@CXXABI_1.3.7 2025-05-07T20:03:45.3919308Z U __cxa_throw@CXXABI_1.3 2025-05-07T20:03:45.3919436Z U __gxx_personality_v0@CXXABI_1.3 2025-05-07T20:03:45.3919537Z U __tls_get_addr@GLIBC_2.3 2025-05-07T20:03:45.3919645Z U adjust_info_B_num_bits(int, int) 2025-05-07T20:03:45.3919789Z U at::Context::deterministicAlgorithms() const 2025-05-07T20:03:45.3920025Z U at::RecordFunction::RecordFunction(at::StepCallbacks&&) 2025-05-07T20:03:45.3920155Z U at::RecordFunction::currentThreadId() 2025-05-07T20:03:45.3920263Z U at::RecordFunction::end() 2025-05-07T20:03:45.3920406Z U at::RecordFunction::~RecordFunction() 2025-05-07T20:03:45.3920552Z U at::SavedTensorDefaultHooks::set_tracing(bool) 2025-05-07T20:03:45.3920744Z U at::TensorBase::__dispatch_contiguous(c10::MemoryFormat) const 2025-05-07T20:03:45.3920928Z U at::_ops::copy_::call(at::Tensor&, at::Tensor const&, bool) 2025-05-07T20:03:45.3921648Z U at::_ops::empty_like::call(at::Tensor const&, std::optional, std::optional, std::optional, std::optional, std::optional) 2025-05-07T20:03:45.3922262Z U at::_ops::empty_memory_format::call(c10::ArrayRef, std::optional, std::optional, std::optional, std::optional, std::optional) 2025-05-07T20:03:45.3922447Z U at::_ops::view_as::call(at::Tensor const&, at::Tensor const&) 2025-05-07T20:03:45.3922912Z U at::_ops::zeros::call(c10::ArrayRef, std::optional, std::optional, std::optional, std::optional) 2025-05-07T20:03:45.3923476Z U at::_ops::zeros_like::call(at::Tensor const&, std::optional, std::optional, std::optional, std::optional, std::optional) 2025-05-07T20:03:45.3923602Z U at::cuda::getCurrentDeviceProperties() 2025-05-07T20:03:45.3923719Z U at::functorch::functorchTLSAccessor() 2025-05-07T20:03:45.3923891Z U at::getStepCallbacksUnlessEmpty(at::RecordScope) 2025-05-07T20:03:45.3923987Z U at::globalContext() 
2025-05-07T20:03:45.3924111Z U at::sequence_number::get_and_increment() 2025-05-07T20:03:45.3924214Z U bcmp@GLIBC_2.2.5 2025-05-07T20:03:45.3924464Z U c10::AnyType::get() 2025-05-07T20:03:45.3924658Z U c10::BFloat16* at::TensorBase::mutable_data_ptr() const 2025-05-07T20:03:45.3924815Z U c10::BoolType::get() 2025-05-07T20:03:45.3924986Z U c10::DeviceTypeName[abi:cxx11](c10::DeviceType, bool) 2025-05-07T20:03:45.3925159Z U c10::Dispatcher::findSchemaOrThrow(char const*, char const*) 2025-05-07T20:03:45.3925268Z U c10::Dispatcher::realSingleton() 2025-05-07T20:03:45.3925787Z U c10::Dispatcher::runRecordFunction(at::RecordFunction&, std::reference_wrapper, c10::DispatchKey, c10::DispatchKeySet) 2025-05-07T20:03:45.3926401Z U c10::Dispatcher::runRecordFunction(at::RecordFunction&, std::reference_wrapper, c10::DispatchKey, c10::DispatchKeySet, c10::ArrayRef) 2025-05-07T20:03:45.3926775Z U c10::Error::Error(c10::SourceLocation, std::__cxx11::basic_string, std::allocator >) 2025-05-07T20:03:45.3926879Z U c10::Error::what() const 2025-05-07T20:03:45.3926978Z U c10::FloatType::get() 2025-05-07T20:03:45.3927090Z U c10::GradMode::is_enabled() 2025-05-07T20:03:45.3927197Z U c10::GradMode::set_enabled(bool) 2025-05-07T20:03:45.3927373Z U c10::Half* at::TensorBase::mutable_data_ptr() const 2025-05-07T20:03:45.3927554Z U c10::IValue::TagType::get(c10::IValue const&) 2025-05-07T20:03:45.3927687Z U c10::IValue::hash(c10::IValue const&) 2025-05-07T20:03:45.3927798Z U c10::IValue::isBoolList() const 2025-05-07T20:03:45.3927903Z U c10::IValue::isIntList() const 2025-05-07T20:03:45.3928047Z U c10::IValue::isSymIntList() const 2025-05-07T20:03:45.3928158Z U c10::IValue::isTensorList() const 2025-05-07T20:03:45.3928302Z U c10::IValue::reportToTensorTypeError() const 2025-05-07T20:03:45.3928414Z U c10::IntType::get() 2025-05-07T20:03:45.3928576Z U c10::MessageLogger::MessageLogger(char const*, int, int) 2025-05-07T20:03:45.3928699Z U c10::MessageLogger::~MessageLogger() 
2025-05-07T20:03:45.3928838Z U c10::OperatorHandle::~OperatorHandle() 2025-05-07T20:03:45.3928962Z U c10::OperatorHandle::~OperatorHandle() 2025-05-07T20:03:45.3929173Z U c10::OptionalType::get(c10::Type::SingletonOrSharedTypePtr) 2025-05-07T20:03:45.3929453Z U c10::SmallVectorBase::grow_pod(void const*, unsigned long, unsigned long) 2025-05-07T20:03:45.3929611Z U c10::StorageImpl::throw_data_ptr_access_error() const 2025-05-07T20:03:45.3929719Z U c10::StringType::get() 2025-05-07T20:03:45.3929884Z U c10::SymBool::expect_true(char const*, long) const 2025-05-07T20:03:45.3930027Z U c10::SymBool::guard_bool(char const*, long) const 2025-05-07T20:03:45.3930200Z U c10::SymBool::guard_size_oblivious(char const*, long) const 2025-05-07T20:03:45.3930350Z U c10::SymFloat::guard_float(char const*, long) const 2025-05-07T20:03:45.3930508Z U c10::SymFloat::operator/(c10::SymFloat const&) const 2025-05-07T20:03:45.3930901Z U c10::SymInt::SymInt(c10::intrusive_ptr >) 2025-05-07T20:03:45.3931030Z U c10::SymInt::guard_int(char const*, long) const 2025-05-07T20:03:45.3931166Z U c10::SymInt::operator c10::SymFloat() const 2025-05-07T20:03:45.3931300Z U c10::SymInt::operator/(c10::SymInt const&) const 2025-05-07T20:03:45.3931415Z U c10::SymInt::promote_to_negative() 2025-05-07T20:03:45.3931551Z U c10::SymInt::sym_eq(c10::SymInt const&) const 2025-05-07T20:03:45.3931677Z U c10::SymInt::sym_gt(c10::SymInt const&) const 2025-05-07T20:03:45.3931861Z U c10::SymInt::sym_le(c10::SymInt const&) const 2025-05-07T20:03:45.3931963Z U c10::SymInt::toSymNode() const 2025-05-07T20:03:45.3932079Z U c10::SymIntType::get() 2025-05-07T20:03:45.3932221Z U c10::SymbolicShapeMeta::init_is_contiguous() const 2025-05-07T20:03:45.3932337Z U c10::TensorImpl::requires_grad() const 2025-05-07T20:03:45.3932495Z U c10::TensorImpl::throw_data_ptr_access_error() const 2025-05-07T20:03:45.3932589Z U c10::TensorType::get() 2025-05-07T20:03:45.3932701Z U c10::UndefinedTensorImpl::_singleton 
2025-05-07T20:03:45.3933392Z U c10::Warning::Warning(std::variant, c10::SourceLocation const&, std::__cxx11::basic_string, std::allocator >, bool) 2025-05-07T20:03:45.3933516Z U c10::cuda::CUDACachingAllocator::allocator 2025-05-07T20:03:45.3933630Z U c10::cuda::CUDAStream::stream() const 2025-05-07T20:03:45.3933768Z U c10::cuda::ExchangeDevice(signed char) 2025-05-07T20:03:45.3933877Z U c10::cuda::GetDevice(signed char*) 2025-05-07T20:03:45.3933987Z U c10::cuda::MaybeSetDevice(signed char) 2025-05-07T20:03:45.3934135Z U c10::cuda::SetDevice(signed char) 2025-05-07T20:03:45.3934379Z U c10::cuda::c10_cuda_check_implementation(int, char const*, char const*, int, bool) 2025-05-07T20:03:45.3934473Z U c10::cuda::device_count() 2025-05-07T20:03:45.3934611Z U c10::cuda::getCurrentCUDAStream(signed char) 2025-05-07T20:03:45.3934820Z U c10::cuda::getDefaultCUDAStream(signed char) 2025-05-07T20:03:45.3934952Z U c10::cuda::getStreamFromPool(bool, signed char) 2025-05-07T20:03:45.3935101Z U c10::cuda::getStreamFromPool(int, signed char) 2025-05-07T20:03:45.3935257Z U c10::cuda::setCurrentCUDAStream(c10::cuda::CUDAStream) 2025-05-07T20:03:45.3935364Z U c10::cuda::warn_or_error_on_sync() 2025-05-07T20:03:45.3935789Z U c10::detail::ListImpl::ListImpl(std::vector >, c10::Type::SingletonOrSharedTypePtr) 2025-05-07T20:03:45.3936280Z U c10::detail::infer_schema::make_function_schema(c10::ArrayRef, c10::ArrayRef) 2025-05-07T20:03:45.3936522Z U c10::detail::torchCheckFail(char const*, char const*, unsigned int, char const*) 2025-05-07T20:03:45.3937004Z U c10::detail::torchCheckFail(char const*, char const*, unsigned int, std::__cxx11::basic_string, std::allocator > const&) 2025-05-07T20:03:45.3937325Z U c10::detail::torchInternalAssertFail(char const*, char const*, unsigned int, char const*, char const*) 2025-05-07T20:03:45.3937884Z U c10::detail::torchInternalAssertFail(char const*, char const*, unsigned int, char const*, std::__cxx11::basic_string, std::allocator > const&) 
2025-05-07T20:03:45.3938008Z U c10::impl::GPUTrace::gpuTraceState 2025-05-07T20:03:45.3938116Z U c10::impl::GPUTrace::haveState 2025-05-07T20:03:45.3938421Z U c10::impl::OperatorEntry::assertSignatureIsCorrect(c10::impl::CppSignature const&, bool) const 2025-05-07T20:03:45.3938608Z U c10::impl::OperatorEntry::reportError(c10::DispatchKey) const 2025-05-07T20:03:45.3938747Z U c10::impl::cow::is_cow_data_ptr(c10::DataPtr const&) 2025-05-07T20:03:45.3938903Z U c10::impl::cow::materialize_cow_storage(c10::StorageImpl&) 2025-05-07T20:03:45.3939026Z U c10::impl::device_guard_impl_registry 2025-05-07T20:03:45.3939173Z U c10::impl::raw_local_dispatch_key_set 2025-05-07T20:03:45.3939321Z U c10::initializeFunctionalityOffsetsAndMasks() 2025-05-07T20:03:45.3939689Z U c10::ivalue::ConstantString::create(std::__cxx11::basic_string, std::allocator >) 2025-05-07T20:03:45.3939811Z U c10::operator-(c10::SymInt const&, int) 2025-05-07T20:03:45.3939947Z U c10::operator<<(std::ostream&, c10::Device const&) 2025-05-07T20:03:45.3940090Z U c10::operator<<(std::ostream&, c10::DeviceType) 2025-05-07T20:03:45.3940240Z U c10::operator<<(std::ostream&, c10::OperatorName const&) 2025-05-07T20:03:45.3940380Z U c10::operator<<(std::ostream&, c10::SymFloat const&) 2025-05-07T20:03:45.3940529Z U c10::operator<<(std::ostream&, c10::SymInt const&) 2025-05-07T20:03:45.3940633Z U c10::throwNullDataPtrError() 2025-05-07T20:03:45.3940737Z U c10::warn(c10::Warning const&) 2025-05-07T20:03:45.3940860Z U c10::warnDeprecatedDataPtr() 2025-05-07T20:03:45.3941045Z U caffe2::TypeMeta::error_unsupported_typemeta(caffe2::TypeMeta) 2025-05-07T20:03:45.3941157Z U caffe2::TypeMeta::typeMetaDatas() 2025-05-07T20:03:45.3941300Z U cudaDeviceGetAttribute@libcudart.so.11.0 2025-05-07T20:03:45.3941438Z U cudaDeviceSynchronize@libcudart.so.11.0 2025-05-07T20:03:45.3941568Z U cudaEventCreateWithFlags@libcudart.so.11.0 2025-05-07T20:03:45.3941684Z U cudaEventDestroy@libcudart.so.11.0 2025-05-07T20:03:45.3941822Z U 
cudaEventElapsedTime@libcudart.so.11.0 2025-05-07T20:03:45.3941949Z U cudaEventQuery@libcudart.so.11.0 2025-05-07T20:03:45.3942060Z U cudaEventRecord@libcudart.so.11.0 2025-05-07T20:03:45.3942194Z U cudaEventSynchronize@libcudart.so.11.0 2025-05-07T20:03:45.3942313Z U cudaFuncSetAttribute@libcudart.so.11.0 2025-05-07T20:03:45.3942435Z U cudaGetDeviceProperties@libcudart.so.11.0 2025-05-07T20:03:45.3942565Z U cudaGetErrorString@libcudart.so.11.0 2025-05-07T20:03:45.3942680Z U cudaGetLastError@libcudart.so.11.0 2025-05-07T20:03:45.3942788Z U cudaLaunchKernel@libcudart.so.11.0 2025-05-07T20:03:45.3942897Z U cudaStreamQuery@libcudart.so.11.0 2025-05-07T20:03:45.3943029Z U cudaStreamSynchronize@libcudart.so.11.0 2025-05-07T20:03:45.3943140Z U cudaStreamWaitEvent@libcudart.so.11.0 2025-05-07T20:03:45.3945237Z U embedding_ops::split_embedding_backward_codegen_find_long_segments(at::GenericPackedTensorAccessor, at::GenericPackedTensorAccessor, at::GenericPackedTensorAccessor, at::GenericPackedTensorAccessor, at::GenericPackedTensorAccessor, at::GenericPackedTensorAccessor, at::GenericPackedTensorAccessor, int, int, bool) 2025-05-07T20:03:45.3945489Z U fbgemm_gpu::config::is_feature_enabled(fbgemm_gpu::config::FeatureGateName const&) 2025-05-07T20:03:45.3945623Z U float* at::TensorBase::data_ptr() const 2025-05-07T20:03:45.3945782Z U float* at::TensorBase::mutable_data_ptr() const 2025-05-07T20:03:45.3945875Z U free@GLIBC_2.2.5 2025-05-07T20:03:45.3945991Z U int* at::TensorBase::data_ptr() const 2025-05-07T20:03:45.3946139Z U int* at::TensorBase::mutable_data_ptr() const 2025-05-07T20:03:45.3946306Z U long c10::detail::maybe_wrap_dim_slow(long, long, bool) 2025-05-07T20:03:45.3946435Z U long* at::TensorBase::data_ptr() const 2025-05-07T20:03:45.3946641Z U long* at::TensorBase::mutable_data_ptr() const 2025-05-07T20:03:45.3946735Z U memcmp@GLIBC_2.2.5 2025-05-07T20:03:45.3946825Z U memcpy@GLIBC_2.14 2025-05-07T20:03:45.3946925Z U memmove@GLIBC_2.2.5 
2025-05-07T20:03:45.3947012Z U memset@GLIBC_2.2.5 2025-05-07T20:03:45.3947122Z U operator delete(void*)@GLIBCXX_3.4 2025-05-07T20:03:45.3947235Z U operator new(unsigned long)@GLIBCXX_3.4 2025-05-07T20:03:45.3947592Z U radix_sort_pairs(void*, unsigned long&, int const*, int*, float const*, float*, int, int, int, CUstream_st*) 2025-05-07T20:03:45.3947911Z U radix_sort_pairs(void*, unsigned long&, long const*, long*, float const*, float*, int, int, int, CUstream_st*) 2025-05-07T20:03:45.3948025Z U realloc@GLIBC_2.2.5 2025-05-07T20:03:45.3948234Z U std::_Hash_bytes(void const*, unsigned long, unsigned long)@CXXABI_1.3.5 2025-05-07T20:03:45.3948569Z U std::__cxx11::basic_ostringstream, std::allocator >::basic_ostringstream() 2025-05-07T20:03:45.3948955Z U std::__cxx11::basic_ostringstream, std::allocator >::~basic_ostringstream()@GLIBCXX_3.4.21 2025-05-07T20:03:45.3949360Z U std::__cxx11::basic_string, std::allocator >::_M_append(char const*, unsigned long)@GLIBCXX_3.4.21 2025-05-07T20:03:45.3949869Z U std::__cxx11::basic_string, std::allocator >::_M_assign(std::__cxx11::basic_string, std::allocator > const&) 2025-05-07T20:03:45.3950274Z U std::__cxx11::basic_string, std::allocator >::_M_construct(unsigned long, char)@GLIBCXX_3.4.21 2025-05-07T20:03:45.3950659Z U std::__cxx11::basic_string, std::allocator >::_M_create(unsigned long&, unsigned long)@GLIBCXX_3.4.21 2025-05-07T20:03:45.3951104Z U std::__cxx11::basic_string, std::allocator >::_M_mutate(unsigned long, unsigned long, char const*, unsigned long) 2025-05-07T20:03:45.3951591Z U std::__cxx11::basic_string, std::allocator >::_M_replace(unsigned long, unsigned long, char const*, unsigned long)@GLIBCXX_3.4.21 2025-05-07T20:03:45.3951924Z U std::__cxx11::basic_string, std::allocator >::reserve(unsigned long)@GLIBCXX_3.4.21 2025-05-07T20:03:45.3952461Z U std::__cxx11::basic_string, std::allocator >::swap(std::__cxx11::basic_string, std::allocator >&)@GLIBCXX_3.4.21 2025-05-07T20:03:45.3952882Z U 
std::__cxx11::basic_stringstream, std::allocator >::basic_stringstream() 2025-05-07T20:03:45.3953447Z U std::__cxx11::basic_stringstream, std::allocator >::~basic_stringstream()@GLIBCXX_3.4.21 2025-05-07T20:03:45.3953829Z U std::__detail::_Prime_rehash_policy::_M_need_rehash(unsigned long, unsigned long, unsigned long) const@GLIBCXX_3.4.18 2025-05-07T20:03:45.3953957Z U std::__throw_bad_alloc()@GLIBCXX_3.4 2025-05-07T20:03:45.3954092Z U std::__throw_bad_array_new_length() 2025-05-07T20:03:45.3954240Z U std::__throw_length_error(char const*)@GLIBCXX_3.4 2025-05-07T20:03:45.3954385Z U std::__throw_logic_error(char const*)@GLIBCXX_3.4 2025-05-07T20:03:45.3954577Z U std::__throw_out_of_range_fmt(char const*, ...)@GLIBCXX_3.4.20 2025-05-07T20:03:45.3954711Z U std::__throw_system_error(int)@GLIBCXX_3.4.11 2025-05-07T20:03:45.3954859Z U std::bad_weak_ptr::~bad_weak_ptr()@GLIBCXX_3.4.15 2025-05-07T20:03:45.3955179Z U std::basic_ios >::clear(std::_Ios_Iostate)@GLIBCXX_3.4 2025-05-07T20:03:45.3955774Z U std::basic_ostream >& std::__ostream_insert >(std::basic_ostream >&, char const*, long)@GLIBCXX_3.4.9 2025-05-07T20:03:45.3955905Z U std::exception::~exception()@GLIBCXX_3.4 2025-05-07T20:03:45.3956044Z U std::ios_base::Init::Init()@GLIBCXX_3.4 2025-05-07T20:03:45.3956166Z U std::ios_base::Init::~Init()@GLIBCXX_3.4 2025-05-07T20:03:45.3956288Z U std::ios_base::~ios_base()@GLIBCXX_3.4 2025-05-07T20:03:45.3956418Z U std::locale::~locale()@GLIBCXX_3.4 2025-05-07T20:03:45.3956603Z U std::ostream& std::ostream::_M_insert(long)@GLIBCXX_3.4.9 2025-05-07T20:03:45.3956847Z U std::ostream& std::ostream::_M_insert(unsigned long)@GLIBCXX_3.4.9 2025-05-07T20:03:45.3956995Z U std::ostream::operator<<(int)@GLIBCXX_3.4 2025-05-07T20:03:45.3957161Z U std::out_of_range::out_of_range(char const*)@GLIBCXX_3.4.21 2025-05-07T20:03:45.3957299Z U std::out_of_range::~out_of_range()@GLIBCXX_3.4 2025-05-07T20:03:45.3957778Z U std::runtime_error::runtime_error(std::__cxx11::basic_string, std::allocator 
> const&)@GLIBCXX_3.4.21 2025-05-07T20:03:45.3957920Z U std::runtime_error::~runtime_error()@GLIBCXX_3.4 2025-05-07T20:03:45.3958029Z U std::terminate()@GLIBCXX_3.4 2025-05-07T20:03:45.3958133Z U strcmp@GLIBC_2.2.5 2025-05-07T20:03:45.3958267Z U strlen@GLIBC_2.2.5 2025-05-07T20:03:45.3958393Z U torch::CppFunction::~CppFunction() 2025-05-07T20:03:45.3958999Z U torch::Library::Library(torch::Library::Kind, std::__cxx11::basic_string, std::allocator >, std::optional, char const*, unsigned int) 2025-05-07T20:03:45.3959603Z U torch::Library::_def(c10::FunctionSchema&&, c10::OperatorName*, std::vector > const&, torch::_RegisterOrVerify) & 2025-05-07T20:03:45.3959852Z U torch::Library::_impl(char const*, torch::CppFunction&&, torch::_RegisterOrVerify) & 2025-05-07T20:03:45.3959983Z U torch::autograd::AnomalyMode::_enabled 2025-05-07T20:03:45.3960260Z U torch::autograd::AutogradContext::AutogradContext(torch::dynamo::autograd::PackedArgs&) 2025-05-07T20:03:45.3960433Z U torch::autograd::AutogradContext::get_and_bump_dirty() const 2025-05-07T20:03:45.3960641Z U torch::autograd::AutogradContext::get_non_differentiable() const 2025-05-07T20:03:45.3960813Z U torch::autograd::AutogradContext::get_saved_variables() const 2025-05-07T20:03:45.3961143Z U torch::autograd::AutogradContext::save_for_backward(std::vector >) 2025-05-07T20:03:45.3961300Z U torch::autograd::AutogradContext::save_variables() 2025-05-07T20:03:45.3961478Z U torch::autograd::ForwardADLevel::try_get_by_idx(unsigned long) 2025-05-07T20:03:45.3961648Z U torch::autograd::InputMetadata::shape_as_dim_vector() const 2025-05-07T20:03:45.3961781Z U torch::autograd::Node::assign_parent() 2025-05-07T20:03:45.3961886Z U torch::autograd::Node::metadata() 2025-05-07T20:03:45.3962191Z U torch::autograd::Node::name[abi:cxx11]() const 2025-05-07T20:03:45.3962633Z U torch::autograd::SavedVariable::SavedVariable(at::Tensor const&, bool, bool) 2025-05-07T20:03:45.3962909Z U torch::autograd::SavedVariable::unpack(std::shared_ptr) 
const 2025-05-07T20:03:45.3963103Z U torch::autograd::VariableInfo::VariableInfo() 2025-05-07T20:03:45.3963341Z U torch::autograd::VariableInfo::VariableInfo(at::Tensor const&, bool) 2025-05-07T20:03:45.3963561Z U torch::autograd::VariableInfo::zeros(c10::OptionalDeviceGuard&) const 2025-05-07T20:03:45.3966314Z U torch::autograd::_wrap_outputs(std::vector > const&, std::unordered_set, std::equal_to, std::allocator > const&, std::unordered_set, std::equal_to, std::allocator > const&, c10::ArrayRef >, std::shared_ptr const&, std::function > (std::vector >, std::vector >)> const&, std::unordered_set, std::equal_to, std::allocator > const&, std::function const&) 2025-05-07T20:03:45.3966477Z U torch::autograd::deleteNode(torch::autograd::Node*) 2025-05-07T20:03:45.3966645Z U torch::autograd::get_current_graph_task_exec_info() 2025-05-07T20:03:45.3966855Z U torch::autograd::impl::gradient_edge(at::Tensor const&) 2025-05-07T20:03:45.3967671Z U torch::autograd::profiler::record_function_enter_new(std::__cxx11::basic_string, std::allocator > const&, std::optional, std::allocator > > const&) 2025-05-07T20:03:45.3967873Z U torch::dynamo::autograd::getPyCompilerInterface() 2025-05-07T20:03:45.3968294Z U torch::dynamo::autograd::get_input_metadata(std::vector > const&) 2025-05-07T20:03:45.3968666Z U torch::jit::parseSchema(std::__cxx11::basic_string, std::allocator > const&, bool) 2025-05-07T20:03:45.3969246Z U transpose_embedding_input(at::Tensor, long, at::Tensor, at::Tensor, bool, std::optional const&, long, long, long, bool, std::optional const&, long, long) 2025-05-07T20:03:45.3969356Z U typeinfo for c10::Error 2025-05-07T20:03:45.3969497Z U typeinfo for std::bad_weak_ptr@GLIBCXX_3.4.15 2025-05-07T20:03:45.3969645Z U typeinfo for std::exception@GLIBCXX_3.4 2025-05-07T20:03:45.3969780Z U typeinfo for std::out_of_range@GLIBCXX_3.4 2025-05-07T20:03:45.3969919Z U typeinfo for std::runtime_error@GLIBCXX_3.4 2025-05-07T20:03:45.3970054Z U typeinfo for torch::autograd::Node 
2025-05-07T20:03:45.3971523Z U void embedding_ops::grad_mean_kernel(at::GenericPackedTensorAccessor, at::GenericPackedTensorAccessor, at::GenericPackedTensorAccessor, at::GenericPackedTensorAccessor, fbgemm_gpu::FixedDivisor) 2025-05-07T20:03:45.3973203Z U void embedding_ops::grad_mean_kernel(at::GenericPackedTensorAccessor, at::GenericPackedTensorAccessor, at::GenericPackedTensorAccessor, at::GenericPackedTensorAccessor, fbgemm_gpu::FixedDivisor) 2025-05-07T20:03:45.3974485Z U void embedding_ops::grad_mean_kernel(at::GenericPackedTensorAccessor, at::GenericPackedTensorAccessor, at::GenericPackedTensorAccessor, at::GenericPackedTensorAccessor, fbgemm_gpu::FixedDivisor) 2025-05-07T20:03:45.3975851Z U void embedding_ops::grad_mean_kernel(at::GenericPackedTensorAccessor, at::GenericPackedTensorAccessor, at::GenericPackedTensorAccessor, at::GenericPackedTensorAccessor, fbgemm_gpu::FixedDivisor) 2025-05-07T20:03:45.3977125Z U void embedding_ops::grad_mean_kernel(at::GenericPackedTensorAccessor, at::GenericPackedTensorAccessor, at::GenericPackedTensorAccessor, at::GenericPackedTensorAccessor, fbgemm_gpu::FixedDivisor) 2025-05-07T20:03:45.3978452Z U void embedding_ops::grad_mean_kernel(at::GenericPackedTensorAccessor, at::GenericPackedTensorAccessor, at::GenericPackedTensorAccessor, at::GenericPackedTensorAccessor, fbgemm_gpu::FixedDivisor) 2025-05-07T20:03:45.3978595Z U vtable for __cxxabiv1::__class_type_info@CXXABI_1.3 2025-05-07T20:03:45.3978757Z U vtable for __cxxabiv1::__function_type_info@CXXABI_1.3 2025-05-07T20:03:45.3978907Z U vtable for __cxxabiv1::__pointer_type_info@CXXABI_1.3 2025-05-07T20:03:45.3979085Z U vtable for __cxxabiv1::__si_class_type_info@CXXABI_1.3 2025-05-07T20:03:45.3979197Z U vtable for c10::Error 2025-05-07T20:03:45.3979508Z U vtable for std::__cxx11::basic_stringbuf, std::allocator >@GLIBCXX_3.4.21 2025-05-07T20:03:45.3979641Z U vtable for std::bad_weak_ptr@GLIBCXX_3.4.15 2025-05-07T20:03:45.3979874Z U vtable for std::basic_streambuf 
>@GLIBCXX_3.4 2025-05-07T20:03:45.3979980Z U vtable for torch::autograd::Node 2025-05-07T20:03:45.3980147Z w TLS init function for c10::impl::raw_local_dispatch_key_set 2025-05-07T20:03:45.3980275Z w _ITM_deregisterTMCloneTable 2025-05-07T20:03:45.3980377Z w _ITM_registerTMCloneTable 2025-05-07T20:03:45.3980471Z w __cxa_finalize@GLIBC_2.2.5 2025-05-07T20:03:45.3980559Z w __gmon_start__ 2025-05-07T20:03:45.3980663Z w __pthread_key_create 2025-05-07T20:03:45.3980768Z w pthread_mutex_lock@GLIBC_2.2.5 2025-05-07T20:03:45.3980873Z w pthread_mutex_unlock@GLIBC_2.2.5 2025-05-07T20:03:45.3981024Z [CHECK] Listing out external shared libraries linked: 2025-05-07T20:03:45.3981271Z + ldd ./_skbuild/linux-x86_64-3.13/cmake-build/fbgemm_gpu_tbe_training_backward_dense.so 2025-05-07T20:03:45.3981280Z 2025-05-07T20:03:45.3981405Z linux-vdso.so.1 (0x00007fff7beb0000) 2025-05-07T20:03:45.3981505Z libc10.so => not found 2025-05-07T20:03:45.3981593Z libc10_cuda.so => not found 2025-05-07T20:03:45.3982139Z fbgemm_gpu_tbe_training_backward.so => /__w/FBGEMM/FBGEMM/fbgemm_gpu/./_skbuild/linux-x86_64-3.13/cmake-build/fbgemm_gpu_tbe_training_backward.so (0x00007fbf5b000000) 2025-05-07T20:03:45.3982249Z libtorch.so => not found 2025-05-07T20:03:45.3982337Z libtorch_cpu.so => not found 2025-05-07T20:03:45.3982422Z libtorch_cuda.so => not found 2025-05-07T20:03:45.3982524Z libcudart.so.11.0 => not found 2025-05-07T20:03:45.3982675Z libstdc++.so.6 => /lib64/libstdc++.so.6 (0x00007fbf5ad9c000) 2025-05-07T20:03:45.3982814Z libgomp.so.1 => /lib64/libgomp.so.1 (0x00007fbf7d331000) 2025-05-07T20:03:45.3982953Z libgcc_s.so.1 => /lib64/libgcc_s.so.1 (0x00007fbf7d303000) 2025-05-07T20:03:45.3983141Z libc.so.6 => /lib64/libc.so.6 (0x00007fbf5ab94000) 2025-05-07T20:03:45.3983259Z /lib64/ld-linux-x86-64.so.2 (0x00007fbf7d38d000) 2025-05-07T20:03:45.3983340Z libc10.so => not found 2025-05-07T20:03:45.3983439Z libc10_cuda.so => not found 2025-05-07T20:03:45.3983886Z fbgemm_gpu_tbe_common.so => 
/__w/FBGEMM/FBGEMM/fbgemm_gpu/./_skbuild/linux-x86_64-3.13/cmake-build/fbgemm_gpu_tbe_common.so (0x00007fbf5a99e000) 2025-05-07T20:03:45.3984404Z fbgemm_gpu_sparse_async_cumsum.so => /__w/FBGEMM/FBGEMM/fbgemm_gpu/./_skbuild/linux-x86_64-3.13/cmake-build/fbgemm_gpu_sparse_async_cumsum.so (0x00007fbf5a8a6000) 2025-05-07T20:03:45.3984504Z libtorch.so => not found 2025-05-07T20:03:45.3984843Z fbgemm.so => /__w/FBGEMM/FBGEMM/fbgemm_gpu/./_skbuild/linux-x86_64-3.13/cmake-build/fbgemm.so (0x00007fbf5a200000) 2025-05-07T20:03:45.3985273Z fbgemm_gpu_tbe_utils.so => /__w/FBGEMM/FBGEMM/fbgemm_gpu/./_skbuild/linux-x86_64-3.13/cmake-build/fbgemm_gpu_tbe_utils.so (0x00007fbf59800000) 2025-05-07T20:03:45.3985389Z libtorch_cpu.so => not found 2025-05-07T20:03:45.3985475Z libtorch_cuda.so => not found 2025-05-07T20:03:45.3985561Z libcudart.so.11.0 => not found 2025-05-07T20:03:45.3985676Z libm.so.6 => /lib64/libm.so.6 (0x00007fbf7d224000) 2025-05-07T20:03:45.3985769Z libc10.so => not found 2025-05-07T20:03:45.3985849Z libc10_cuda.so => not found 2025-05-07T20:03:45.3986286Z fbgemm_gpu_config.so => /__w/FBGEMM/FBGEMM/fbgemm_gpu/./_skbuild/linux-x86_64-3.13/cmake-build/fbgemm_gpu_config.so (0x00007fbf7d217000) 2025-05-07T20:03:45.3986389Z libtorch.so => not found 2025-05-07T20:03:45.3986473Z libtorch_cpu.so => not found 2025-05-07T20:03:45.3986559Z libtorch_cuda.so => not found 2025-05-07T20:03:45.3986647Z libcudart.so.11.0 => not found 2025-05-07T20:03:45.3986766Z libc10.so => not found 2025-05-07T20:03:45.3986848Z libc10_cuda.so => not found 2025-05-07T20:03:45.3986929Z libtorch.so => not found 2025-05-07T20:03:45.3987037Z libtorch_cpu.so => not found 2025-05-07T20:03:45.3987124Z libtorch_cuda.so => not found 2025-05-07T20:03:45.3987211Z libcudart.so.11.0 => not found 2025-05-07T20:03:45.3987291Z libc10.so => not found 2025-05-07T20:03:45.3987639Z asmjit.so => /__w/FBGEMM/FBGEMM/fbgemm_gpu/./_skbuild/linux-x86_64-3.13/cmake-build/asmjit.so (0x00007fbf7b188000) 
2025-05-07T20:03:45.3987723Z libtorch.so => not found 2025-05-07T20:03:45.3987814Z libtorch_cpu.so => not found 2025-05-07T20:03:45.3987921Z libtorch_cuda.so => not found 2025-05-07T20:03:45.3987999Z libtorch.so => not found 2025-05-07T20:03:45.3988076Z libc10.so => not found 2025-05-07T20:03:45.3988176Z libc10_cuda.so => not found 2025-05-07T20:03:45.3988261Z libtorch_cpu.so => not found 2025-05-07T20:03:45.3988349Z libtorch_cuda.so => not found 2025-05-07T20:03:45.3988436Z libcudart.so.11.0 => not found 2025-05-07T20:03:45.3988535Z libtorch.so => not found 2025-05-07T20:03:45.3988614Z libc10.so => not found 2025-05-07T20:03:45.3988698Z libtorch_cpu.so => not found 2025-05-07T20:03:45.3988801Z libtorch_cuda.so => not found 2025-05-07T20:03:45.3988969Z libpthread.so.0 => /lib64/libpthread.so.0 (0x00007fbf7b183000) 2025-05-07T20:03:45.3989055Z libtorch_cpu.so => not found 2025-05-07T20:03:45.3989141Z libtorch_cuda.so => not found 2025-05-07T20:03:45.3989234Z libtorch.so => not found 2025-05-07T20:03:45.3989360Z librt.so.1 => /lib64/librt.so.1 (0x00007fbf7b17e000) 2025-05-07T20:03:45.3989365Z 2025-05-07T20:03:45.3989463Z [CHECK] Displaying ELF information: 2025-05-07T20:03:45.3989761Z + readelf -d ./_skbuild/linux-x86_64-3.13/cmake-build/fbgemm_gpu_tbe_training_backward_dense.so 2025-05-07T20:03:45.3989766Z 2025-05-07T20:03:45.3998408Z 2025-05-07T20:03:45.3998980Z Dynamic section at offset 0x1e278a8 contains 39 entries: 2025-05-07T20:03:45.3999328Z Tag Type Name/Value 2025-05-07T20:03:45.3999963Z 0x0000000000000001 (NEEDED) Shared library: [libc10.so] 2025-05-07T20:03:45.4000551Z 0x0000000000000001 (NEEDED) Shared library: [libc10_cuda.so] 2025-05-07T20:03:45.4001499Z 0x0000000000000001 (NEEDED) Shared library: [fbgemm_gpu_tbe_training_backward.so] 2025-05-07T20:03:45.4002428Z 0x0000000000000001 (NEEDED) Shared library: [libtorch.so] 2025-05-07T20:03:45.4003024Z 0x0000000000000001 (NEEDED) Shared library: [libtorch_cpu.so] 2025-05-07T20:03:45.4003621Z 
0x0000000000000001 (NEEDED) Shared library: [libtorch_cuda.so] 2025-05-07T20:03:45.4004250Z 0x0000000000000001 (NEEDED) Shared library: [libcudart.so.11.0] 2025-05-07T20:03:45.4004829Z 0x0000000000000001 (NEEDED) Shared library: [libstdc++.so.6] 2025-05-07T20:03:45.4005399Z 0x0000000000000001 (NEEDED) Shared library: [libgomp.so.1] 2025-05-07T20:03:45.4005994Z 0x0000000000000001 (NEEDED) Shared library: [libgcc_s.so.1] 2025-05-07T20:03:45.4006597Z 0x0000000000000001 (NEEDED) Shared library: [libc.so.6] 2025-05-07T20:03:45.4006808Z 0x0000000000000001 (NEEDED) Shared library: [ld-linux-x86-64.so.2] 2025-05-07T20:03:45.4007115Z 0x000000000000000e (SONAME) Library soname: [fbgemm_gpu_tbe_training_backward_dense.so] 2025-05-07T20:03:45.4007298Z 0x000000000000000f (RPATH) Library rpath: [$ORIGIN] 2025-05-07T20:03:45.4007411Z 0x000000000000000c (INIT) 0x79000 2025-05-07T20:03:45.4007530Z 0x000000000000000d (FINI) 0x25a06c 2025-05-07T20:03:45.4007737Z 0x0000000000000019 (INIT_ARRAY) 0x1e260e0 2025-05-07T20:03:45.4007861Z 0x000000000000001b (INIT_ARRAYSZ) 184 (bytes) 2025-05-07T20:03:45.4007984Z 0x000000000000001a (FINI_ARRAY) 0x1e26198 2025-05-07T20:03:45.4008114Z 0x000000000000001c (FINI_ARRAYSZ) 8 (bytes) 2025-05-07T20:03:45.4008227Z 0x000000006ffffef5 (GNU_HASH) 0x238 2025-05-07T20:03:45.4008375Z 0x0000000000000005 (STRTAB) 0xe130 2025-05-07T20:03:45.4008501Z 0x0000000000000006 (SYMTAB) 0x3300 2025-05-07T20:03:45.4008757Z 0x000000000000000a (STRSZ) 373406 (bytes) 2025-05-07T20:03:45.4008871Z 0x000000000000000b (SYMENT) 24 (bytes) 2025-05-07T20:03:45.4008981Z 0x0000000000000003 (PLTGOT) 0x1e27b58 2025-05-07T20:03:45.4009116Z 0x0000000000000002 (PLTRELSZ) 18480 (bytes) 2025-05-07T20:03:45.4009213Z 0x0000000000000014 (PLTREL) RELA 2025-05-07T20:03:45.4009317Z 0x0000000000000017 (JMPREL) 0x73f80 2025-05-07T20:03:45.4009433Z 0x0000000000000007 (RELA) 0x6a398 2025-05-07T20:03:45.4009552Z 0x0000000000000008 (RELASZ) 39912 (bytes) 2025-05-07T20:03:45.4009661Z 0x0000000000000009 
(RELAENT) 24 (bytes) 2025-05-07T20:03:45.4009754Z 0x0000000000000018 (BIND_NOW) 2025-05-07T20:03:45.4009887Z 0x000000006ffffffb (FLAGS_1) Flags: NOW 2025-05-07T20:03:45.4009997Z 0x000000006ffffffe (VERNEED) 0x6a258 2025-05-07T20:03:45.4010095Z 0x000000006fffffff (VERNEEDNUM) 5 2025-05-07T20:03:45.4010214Z 0x000000006ffffff0 (VERSYM) 0x693ce 2025-05-07T20:03:45.4010315Z 0x000000006ffffff9 (RELACOUNT) 270 2025-05-07T20:03:45.4010406Z 0x0000000000000000 (NULL) 0x0 2025-05-07T20:03:45.4010424Z 2025-05-07T20:03:45.4010544Z ################################################################################ 2025-05-07T20:03:45.4010549Z 2025-05-07T20:03:45.4010552Z 2025-05-07T20:03:45.4010653Z ################################################################################ 2025-05-07T20:03:45.4010871Z [CHECK] BUILT LIBRARY: ./_skbuild/linux-x86_64-3.13/cmake-build/fbgemm_gpu_py.so 2025-05-07T20:03:45.4010974Z [CHECK] Listing out library size: 2025-05-07T20:03:45.4011189Z + du -h --block-size=1M ./_skbuild/linux-x86_64-3.13/cmake-build/fbgemm_gpu_py.so 2025-05-07T20:03:45.4011193Z 2025-05-07T20:03:45.4012614Z 40 ./_skbuild/linux-x86_64-3.13/cmake-build/fbgemm_gpu_py.so 2025-05-07T20:03:45.4013538Z 2025-05-07T20:03:45.4014467Z [CHECK] Listing out the GLIBC versions referenced by: ./_skbuild/linux-x86_64-3.13/cmake-build/fbgemm_gpu_py.so 2025-05-07T20:03:45.4015223Z + objdump -TC ./_skbuild/linux-x86_64-3.13/cmake-build/fbgemm_gpu_py.so | grep GLIBC_ | sed 's/.*GLIBC_\([.0-9]*\).*/GLIBC_\1/g' | sort -Vu | cat 2025-05-07T20:03:45.4015228Z 2025-05-07T20:03:45.4404471Z GLIBC_2.2.5 2025-05-07T20:03:45.4404572Z GLIBC_2.3 2025-05-07T20:03:45.4404651Z GLIBC_2.14 2025-05-07T20:03:45.4406146Z 2025-05-07T20:03:45.4406156Z 2025-05-07T20:03:45.4406750Z [CHECK] Listing out the GLIBCXX versions referenced by: ./_skbuild/linux-x86_64-3.13/cmake-build/fbgemm_gpu_py.so 2025-05-07T20:03:45.4407266Z + objdump -TC ./_skbuild/linux-x86_64-3.13/cmake-build/fbgemm_gpu_py.so | grep GLIBCXX_ | sed 
's/.*GLIBCXX_\([.0-9]*\).*/GLIBCXX_\1/g' | sort -Vu | cat 2025-05-07T20:03:45.4407271Z 2025-05-07T20:03:45.4798865Z GLIBCXX_3.4 2025-05-07T20:03:45.4799837Z GLIBCXX_3.4.9 2025-05-07T20:03:45.4800132Z GLIBCXX_3.4.11 2025-05-07T20:03:45.4800361Z GLIBCXX_3.4.14 2025-05-07T20:03:45.4800583Z GLIBCXX_3.4.15 2025-05-07T20:03:45.4800812Z GLIBCXX_3.4.18 2025-05-07T20:03:45.4801045Z GLIBCXX_3.4.19 2025-05-07T20:03:45.4801261Z GLIBCXX_3.4.20 2025-05-07T20:03:45.4801482Z GLIBCXX_3.4.21 2025-05-07T20:03:45.4801517Z 2025-05-07T20:03:45.4801551Z 2025-05-07T20:03:45.4822015Z + nm -gDC ./_skbuild/linux-x86_64-3.13/cmake-build/fbgemm_gpu_py.so > /tmp/tmp.4Yt4Z9u7cM.symbols.txt 2025-05-07T20:03:45.4822043Z 2025-05-07T20:03:45.5150922Z 2025-05-07T20:03:45.5179118Z [CHECK] Total Number of symbols: 6602 2025-05-07T20:03:45.5216840Z [CHECK] Number of fbgemm symbols: 4516 2025-05-07T20:03:45.5236403Z + nm -gDCu ./_skbuild/linux-x86_64-3.13/cmake-build/fbgemm_gpu_py.so > /tmp/tmp.kDO45zidOu.usymbols.txt 2025-05-07T20:03:45.5236436Z 2025-05-07T20:03:45.5271950Z 2025-05-07T20:03:45.5297071Z [CHECK] Listing out undefined symbols (472 total): 2025-05-07T20:03:45.5312124Z U VTT for std::__cxx11::basic_ostringstream, std::allocator >@GLIBCXX_3.4.21 2025-05-07T20:03:45.5313555Z U VTT for std::__cxx11::basic_stringstream, std::allocator >@GLIBCXX_3.4.21 2025-05-07T20:03:45.5313877Z U _Unwind_Resume@GCC_3.0 2025-05-07T20:03:45.5314169Z U __assert_fail@GLIBC_2.2.5 2025-05-07T20:03:45.5314625Z U __cudaPopCallConfiguration@libcudart.so.11.0 2025-05-07T20:03:45.5315055Z U __cudaPushCallConfiguration@libcudart.so.11.0 2025-05-07T20:03:45.5315436Z U __cudaRegisterFatBinary@libcudart.so.11.0 2025-05-07T20:03:45.5315847Z U __cudaRegisterFatBinaryEnd@libcudart.so.11.0 2025-05-07T20:03:45.5316221Z U __cudaRegisterFunction@libcudart.so.11.0 2025-05-07T20:03:45.5316554Z U __cudaRegisterVar@libcudart.so.11.0 2025-05-07T20:03:45.5316967Z U __cudaUnregisterFatBinary@libcudart.so.11.0 2025-05-07T20:03:45.5317296Z 
U __cxa_allocate_exception@CXXABI_1.3 2025-05-07T20:03:45.5317580Z U __cxa_atexit@GLIBC_2.2.5 2025-05-07T20:03:45.5317884Z U __cxa_begin_catch@CXXABI_1.3 2025-05-07T20:03:45.5318187Z U __cxa_end_catch@CXXABI_1.3 2025-05-07T20:03:45.5318487Z U __cxa_free_exception@CXXABI_1.3 2025-05-07T20:03:45.5318768Z U __cxa_guard_abort@CXXABI_1.3 2025-05-07T20:03:45.5319082Z U __cxa_guard_acquire@CXXABI_1.3 2025-05-07T20:03:45.5319381Z U __cxa_guard_release@CXXABI_1.3 2025-05-07T20:03:45.5319669Z U __cxa_pure_virtual@CXXABI_1.3 2025-05-07T20:03:45.5319944Z U __cxa_rethrow@CXXABI_1.3 2025-05-07T20:03:45.5320266Z U __cxa_thread_atexit@CXXABI_1.3.7 2025-05-07T20:03:45.5320537Z U __cxa_throw@CXXABI_1.3 2025-05-07T20:03:45.5320842Z U __gxx_personality_v0@CXXABI_1.3 2025-05-07T20:03:45.5321150Z U __once_proxy@GLIBCXX_3.4.11 2025-05-07T20:03:45.5321415Z U __tls_get_addr@GLIBC_2.3 2025-05-07T20:03:45.5322166Z U at::RecordFunction::RecordFunction(at::StepCallbacks&&) 2025-05-07T20:03:45.5322547Z U at::RecordFunction::currentThreadId() 2025-05-07T20:03:45.5322892Z U at::RecordFunction::~RecordFunction() 2025-05-07T20:03:45.5323257Z U at::SavedTensorDefaultHooks::set_tracing(bool) 2025-05-07T20:03:45.5323390Z U at::SplitUntil32Bit::begin() const 2025-05-07T20:03:45.5323501Z U at::SplitUntil32Bit::end() const 2025-05-07T20:03:45.5323644Z U at::SplitUntil32Bit::iterator::operator*() const 2025-05-07T20:03:45.5323780Z U at::SplitUntil32Bit::iterator::operator++() 2025-05-07T20:03:45.5324021Z U at::Tensor::index(std::initializer_list) const 2025-05-07T20:03:45.5324214Z U at::TensorBase::__dispatch_contiguous(c10::MemoryFormat) const 2025-05-07T20:03:45.5324393Z U at::TensorIteratorBase::build(at::TensorIteratorConfig&) 2025-05-07T20:03:45.5324578Z U at::TensorIteratorBase::can_use_32bit_indexing() const 2025-05-07T20:03:45.5324715Z U at::TensorIteratorBase::data_ptr(long) const 2025-05-07T20:03:45.5324849Z U at::TensorIteratorBase::is_contiguous() const 2025-05-07T20:03:45.5324987Z U 
at::TensorIteratorBase::numel() const 2025-05-07T20:03:45.5326645Z U at::TensorIteratorBase::with_32bit_indexing() const 2025-05-07T20:03:45.5326873Z U at::TensorIteratorConfig::add_borrowed_input(at::TensorBase const&) 2025-05-07T20:03:45.5327105Z U at::TensorIteratorConfig::add_borrowed_output(at::TensorBase const&) 2025-05-07T20:03:45.5327217Z U at::TensorMaker::make_tensor() 2025-05-07T20:03:45.5327384Z U at::_ops::_is_all_true::call(at::Tensor const&) 2025-05-07T20:03:45.5327545Z U at::_ops::_unique::call(at::Tensor const&, bool, bool) 2025-05-07T20:03:45.5327792Z U at::_ops::add_Tensor::call(at::Tensor const&, at::Tensor const&, c10::Scalar const&) 2025-05-07T20:03:45.5328009Z U at::_ops::add__Tensor::call(at::Tensor&, at::Tensor const&, c10::Scalar const&) 2025-05-07T20:03:45.5328140Z U at::_ops::all::call(at::Tensor const&) 2025-05-07T20:03:45.5328611Z U at::_ops::baddbmm::call(at::Tensor const&, at::Tensor const&, at::Tensor const&, c10::Scalar const&, c10::Scalar const&) 2025-05-07T20:03:45.5328940Z U at::_ops::broadcast_to::call(at::Tensor const&, c10::ArrayRef) 2025-05-07T20:03:45.5329105Z U at::_ops::cat::call(c10::IListRef const&, long) 2025-05-07T20:03:45.5329305Z U at::_ops::cat_out::call(c10::IListRef const&, long, at::Tensor&) 2025-05-07T20:03:45.5329469Z U at::_ops::clamp_max::call(at::Tensor const&, c10::Scalar const&) 2025-05-07T20:03:45.5329677Z U at::_ops::clone::call(at::Tensor const&, std::optional) 2025-05-07T20:03:45.5329854Z U at::_ops::contiguous::call(at::Tensor const&, c10::MemoryFormat) 2025-05-07T20:03:45.5329999Z U at::_ops::copy_::call(at::Tensor&, at::Tensor const&, bool) 2025-05-07T20:03:45.5330222Z U at::_ops::cumsum::call(at::Tensor const&, long, std::optional) 2025-05-07T20:03:45.5330544Z U at::_ops::diff::call(at::Tensor const&, long, long, std::optional const&, std::optional const&) 2025-05-07T20:03:45.5330710Z U at::_ops::div_Scalar::call(at::Tensor const&, c10::Scalar const&) 2025-05-07T20:03:45.5331272Z U 
at::_ops::empty_like::call(at::Tensor const&, std::optional, std::optional, std::optional, std::optional, std::optional) 2025-05-07T20:03:45.5331884Z U at::_ops::empty_memory_format::call(c10::ArrayRef, std::optional, std::optional, std::optional, std::optional, std::optional) 2025-05-07T20:03:45.5332105Z U at::_ops::fill__Scalar::call(at::Tensor&, c10::Scalar const&) 2025-05-07T20:03:45.5332278Z U at::_ops::flatten_using_ints::call(at::Tensor const&, long, long) 2025-05-07T20:03:45.5332391Z U at::_ops::floor::call(at::Tensor const&) 2025-05-07T20:03:45.5332901Z U at::_ops::full::call(c10::ArrayRef, c10::Scalar const&, std::optional, std::optional, std::optional, std::optional) 2025-05-07T20:03:45.5333063Z U at::_ops::ge_Scalar::call(at::Tensor const&, c10::Scalar const&) 2025-05-07T20:03:45.5333348Z U at::_ops::index_put_::call(at::Tensor&, c10::List > const&, at::Tensor const&, bool) 2025-05-07T20:03:45.5333549Z U at::_ops::index_select::call(at::Tensor const&, long, at::Tensor const&) 2025-05-07T20:03:45.5333667Z U at::_ops::item::call(at::Tensor const&) 2025-05-07T20:03:45.5333823Z U at::_ops::le_Tensor::call(at::Tensor const&, at::Tensor const&) 2025-05-07T20:03:45.5333938Z U at::_ops::max::call(at::Tensor const&) 2025-05-07T20:03:45.5334120Z U at::_ops::mul_Tensor::call(at::Tensor const&, at::Tensor const&) 2025-05-07T20:03:45.5334661Z U at::_ops::ones_like::call(at::Tensor const&, std::optional, std::optional, std::optional, std::optional, std::optional) 2025-05-07T20:03:45.5334858Z U at::_ops::permute::call(at::Tensor const&, c10::ArrayRef) 2025-05-07T20:03:45.5335342Z U at::_ops::range::call(c10::Scalar const&, c10::Scalar const&, std::optional, std::optional, std::optional, std::optional) 2025-05-07T20:03:45.5335541Z U at::_ops::reshape::call(at::Tensor const&, c10::ArrayRef) 2025-05-07T20:03:45.5335818Z U at::_ops::resize_::call(at::Tensor const&, c10::ArrayRef, std::optional) 2025-05-07T20:03:45.5335979Z U at::_ops::select_int::call(at::Tensor const&, 
long, c10::SymInt) 2025-05-07T20:03:45.5336406Z U at::_ops::set__source_Storage_storage_offset::call(at::Tensor&, c10::Storage, c10::SymInt, c10::ArrayRef, c10::ArrayRef) 2025-05-07T20:03:45.5336744Z U at::_ops::slice_Tensor::call(at::Tensor const&, long, std::optional, std::optional, c10::SymInt) 2025-05-07T20:03:45.5336879Z U at::_ops::sort::call(at::Tensor const&, long, bool) 2025-05-07T20:03:45.5337106Z U at::_ops::split_sizes::call(at::Tensor const&, c10::ArrayRef, long) 2025-05-07T20:03:45.5337245Z U at::_ops::squeeze_dim::call(at::Tensor const&, long) 2025-05-07T20:03:45.5337465Z U at::_ops::sub_Tensor::call(at::Tensor const&, at::Tensor const&, c10::Scalar const&) 2025-05-07T20:03:45.5337655Z U at::_ops::sum::call(at::Tensor const&, std::optional) 2025-05-07T20:03:45.5337904Z U at::_ops::tensor_split_indices::call(at::Tensor const&, c10::ArrayRef, long) 2025-05-07T20:03:45.5338194Z U at::_ops::to_dtype::call(at::Tensor const&, c10::ScalarType, bool, bool, std::optional) 2025-05-07T20:03:45.5338803Z U at::_ops::to_dtype_layout::call(at::Tensor const&, std::optional, std::optional, std::optional, std::optional, bool, bool, std::optional) 2025-05-07T20:03:45.5338967Z U at::_ops::transpose_int::call(at::Tensor const&, long, long) 2025-05-07T20:03:45.5339211Z U at::_ops::unique_consecutive::call(at::Tensor const&, bool, bool, std::optional) 2025-05-07T20:03:45.5339445Z U at::_ops::unsqueeze::call(at::Tensor const&, long) 2025-05-07T20:03:45.5339611Z U at::_ops::view::call(at::Tensor const&, c10::ArrayRef) 2025-05-07T20:03:45.5339763Z U at::_ops::view_as::call(at::Tensor const&, at::Tensor const&) 2025-05-07T20:03:45.5339883Z U at::_ops::zero_::call(at::Tensor&) 2025-05-07T20:03:45.5340336Z U at::_ops::zeros::call(c10::ArrayRef, std::optional, std::optional, std::optional, std::optional) 2025-05-07T20:03:45.5340899Z U at::_ops::zeros_like::call(at::Tensor const&, std::optional, std::optional, std::optional, std::optional, std::optional) 
2025-05-07T20:03:45.5341159Z U at::checkScalarTypes(char const*, at::TensorArg const&, c10::ArrayRef) 2025-05-07T20:03:45.5341280Z U at::cuda::getCurrentCUDABlasHandle() 2025-05-07T20:03:45.5341414Z U at::cuda::getCurrentDeviceProperties() 2025-05-07T20:03:45.5341542Z U at::cuda::getDeviceProperties(signed char) 2025-05-07T20:03:45.5341675Z U at::cuda::get_p2p_access(signed char, signed char) 2025-05-07T20:03:45.5342034Z U at::detail::computeStorageNbytes(c10::ArrayRef, c10::ArrayRef, unsigned long, unsigned long) 2025-05-07T20:03:45.5342153Z U at::functorch::functorchTLSAccessor() 2025-05-07T20:03:45.5342302Z U at::getStepCallbacksUnlessEmpty(at::RecordScope) 2025-05-07T20:03:45.5342411Z U at::get_num_threads() 2025-05-07T20:03:45.5342521Z U at::get_thread_num() 2025-05-07T20:03:45.5342722Z U at::internal::OpaqueOptionalTensorRef::~OpaqueOptionalTensorRef() 2025-05-07T20:03:45.5342829Z U at::internal::set_thread_num(int) 2025-05-07T20:03:45.5343066Z U at::native::_rowwise_prune(at::Tensor const&, at::Tensor const&, c10::ScalarType) 2025-05-07T20:03:45.5343618Z U at::native::empty_like(at::Tensor const&, std::optional, std::optional, std::optional, std::optional, std::optional) 2025-05-07T20:03:45.5344224Z U at::native::empty_meta_symint(c10::ArrayRef, std::optional, std::optional, std::optional, std::optional, std::optional) 2025-05-07T20:03:45.5344486Z U at::native::resize_(at::Tensor const&, c10::ArrayRef, std::optional) 2025-05-07T20:03:45.5344627Z U at::print(std::ostream&, at::Tensor const&, long) 2025-05-07T20:03:45.5344753Z U at::sequence_number::get_and_increment() 2025-05-07T20:03:45.5344910Z U at::tensor(c10::ArrayRef, c10::TensorOptions const&) 2025-05-07T20:03:45.5344998Z U bcmp@GLIBC_2.2.5 2025-05-07T20:03:45.5345118Z U bool at::Tensor::item() const 2025-05-07T20:03:45.5345243Z U bool* at::TensorBase::data_ptr() const 2025-05-07T20:03:45.5345387Z U bool* at::TensorBase::mutable_data_ptr() const 2025-05-07T20:03:45.5345483Z U c10::AnyType::get() 
2025-05-07T20:03:45.5345651Z U c10::AutogradMetaInterface::~AutogradMetaInterface() 2025-05-07T20:03:45.5345814Z U c10::BFloat16* at::TensorBase::data_ptr() const 2025-05-07T20:03:45.5346007Z U c10::BFloat16* at::TensorBase::mutable_data_ptr() const 2025-05-07T20:03:45.5346109Z U c10::BoolType::get() 2025-05-07T20:03:45.5346206Z U c10::DeviceObjType::get() 2025-05-07T20:03:45.5346352Z U c10::DeviceTypeName[abi:cxx11](c10::DeviceType, bool) 2025-05-07T20:03:45.5346587Z U c10::Dispatcher::findSchemaOrThrow(char const*, char const*) 2025-05-07T20:03:45.5346692Z U c10::Dispatcher::realSingleton() 2025-05-07T20:03:45.5347182Z U c10::Dispatcher::runRecordFunction(at::RecordFunction&, std::reference_wrapper, c10::DispatchKey, c10::DispatchKeySet) 2025-05-07T20:03:45.5347797Z U c10::Dispatcher::runRecordFunction(at::RecordFunction&, std::reference_wrapper, c10::DispatchKey, c10::DispatchKeySet, c10::ArrayRef) 2025-05-07T20:03:45.5348151Z U c10::Error::Error(c10::SourceLocation, std::__cxx11::basic_string, std::allocator >) 2025-05-07T20:03:45.5348258Z U c10::Error::what() const 2025-05-07T20:03:45.5348352Z U c10::FloatType::get() 2025-05-07T20:03:45.5348449Z U c10::GradMode::is_enabled() 2025-05-07T20:03:45.5348549Z U c10::GradMode::set_enabled(bool) 2025-05-07T20:03:45.5348708Z U c10::Half* at::TensorBase::data_ptr() const 2025-05-07T20:03:45.5348870Z U c10::Half* at::TensorBase::mutable_data_ptr() const 2025-05-07T20:03:45.5349042Z U c10::IValue::TagType::get(c10::IValue const&) 2025-05-07T20:03:45.5349160Z U c10::IValue::hash(c10::IValue const&) 2025-05-07T20:03:45.5349266Z U c10::IValue::isBoolList() const 2025-05-07T20:03:45.5349368Z U c10::IValue::isIntList() const 2025-05-07T20:03:45.5349483Z U c10::IValue::isSymIntList() const 2025-05-07T20:03:45.5349615Z U c10::IValue::isTensorList() const 2025-05-07T20:03:45.5349746Z U c10::IValue::reportToTensorTypeError() const 2025-05-07T20:03:45.5349860Z U c10::InferenceMode::is_enabled() 2025-05-07T20:03:45.5349953Z U 
c10::IntType::get() 2025-05-07T20:03:45.5350413Z U c10::ListType::get(std::__cxx11::basic_string, std::allocator > const&, c10::Type::SingletonOrSharedTypePtr) 2025-05-07T20:03:45.5350588Z U c10::MessageLogger::MessageLogger(char const*, int, int) 2025-05-07T20:03:45.5350704Z U c10::MessageLogger::~MessageLogger() 2025-05-07T20:03:45.5350821Z U c10::OperatorHandle::~OperatorHandle() 2025-05-07T20:03:45.5350956Z U c10::OperatorHandle::~OperatorHandle() 2025-05-07T20:03:45.5351161Z U c10::OptionalType::get(c10::Type::SingletonOrSharedTypePtr) 2025-05-07T20:03:45.5351281Z U c10::ParallelGuard::ParallelGuard(bool) 2025-05-07T20:03:45.5351394Z U c10::ParallelGuard::~ParallelGuard() 2025-05-07T20:03:45.5351509Z U c10::ScalarTypeType::get() 2025-05-07T20:03:45.5351769Z U c10::SmallVectorBase::grow_pod(void const*, unsigned long, unsigned long) 2025-05-07T20:03:45.5352073Z U c10::SmallVectorBase::mallocForGrow(unsigned long, unsigned long, unsigned long&) 2025-05-07T20:03:45.5352238Z U c10::StorageImpl::throw_data_ptr_access_error() const 2025-05-07T20:03:45.5352334Z U c10::StringType::get() 2025-05-07T20:03:45.5352468Z U c10::SymBool::expect_true(char const*, long) const 2025-05-07T20:03:45.5352618Z U c10::SymBool::guard_bool(char const*, long) const 2025-05-07T20:03:45.5352859Z U c10::SymFloat::guard_float(char const*, long) const 2025-05-07T20:03:45.5353440Z U c10::SymInt::SymInt(c10::intrusive_ptr >) 2025-05-07T20:03:45.5353594Z U c10::SymInt::guard_int(char const*, long) const 2025-05-07T20:03:45.5353736Z U c10::SymInt::operator%(c10::SymInt const&) const 2025-05-07T20:03:45.5354023Z U c10::SymInt::operator*(c10::SymInt const&) const 2025-05-07T20:03:45.5354174Z U c10::SymInt::operator/(c10::SymInt const&) const 2025-05-07T20:03:45.5354291Z U c10::SymInt::promote_to_negative() 2025-05-07T20:03:45.5354428Z U c10::SymInt::sym_eq(c10::SymInt const&) const 2025-05-07T20:03:45.5354568Z U c10::SymInt::sym_ne(c10::SymInt const&) const 2025-05-07T20:03:45.5354676Z U 
c10::SymInt::toSymNode() const 2025-05-07T20:03:45.5354779Z U c10::SymIntType::get() 2025-05-07T20:03:45.5354945Z U c10::SymbolicShapeMeta::init_is_contiguous() const 2025-05-07T20:03:45.5355067Z U c10::TensorImpl::requires_grad() const 2025-05-07T20:03:45.5355523Z U c10::TensorImpl::set_autograd_meta(std::unique_ptr >) 2025-05-07T20:03:45.5355702Z U c10::TensorImpl::throw_data_ptr_access_error() const 2025-05-07T20:03:45.5355800Z U c10::TensorType::get() 2025-05-07T20:03:45.5356631Z U c10::TupleType::TupleType(std::vector, std::allocator > >, std::optional, std::shared_ptr) 2025-05-07T20:03:45.5356845Z U c10::Type::isSubtypeOfExt(c10::Type const&, std::ostream*) const 2025-05-07T20:03:45.5356956Z U c10::Type::is_module() const 2025-05-07T20:03:45.5357080Z U c10::UndefinedTensorImpl::_singleton 2025-05-07T20:03:45.5357865Z U c10::Warning::Warning(std::variant, c10::SourceLocation const&, std::__cxx11::basic_string, std::allocator >, bool) 2025-05-07T20:03:45.5358003Z U c10::cuda::CUDACachingAllocator::allocator 2025-05-07T20:03:45.5358175Z U c10::cuda::CUDAKernelLaunchRegistry::get_singleton_ref() 2025-05-07T20:03:45.5358463Z U c10::cuda::CUDAKernelLaunchRegistry::get_uvm_assertions_ptr_for_current_device() 2025-05-07T20:03:45.5358811Z U c10::cuda::CUDAKernelLaunchRegistry::insert(char const*, char const*, unsigned int, char const*, int) 2025-05-07T20:03:45.5358933Z U c10::cuda::CUDAStream::stream() const 2025-05-07T20:03:45.5359069Z U c10::cuda::ExchangeDevice(signed char) 2025-05-07T20:03:45.5359185Z U c10::cuda::GetDevice(signed char*) 2025-05-07T20:03:45.5359302Z U c10::cuda::MaybeSetDevice(signed char) 2025-05-07T20:03:45.5359427Z U c10::cuda::SetDevice(signed char) 2025-05-07T20:03:45.5359684Z U c10::cuda::c10_cuda_check_implementation(int, char const*, char const*, int, bool) 2025-05-07T20:03:45.5359793Z U c10::cuda::current_device() 2025-05-07T20:03:45.5359910Z U c10::cuda::device_count() 2025-05-07T20:03:45.5360049Z U c10::cuda::getCurrentCUDAStream(signed 
char) 2025-05-07T20:03:45.5360293Z U c10::cuda::getDefaultCUDAStream(signed char) 2025-05-07T20:03:45.5360450Z U c10::cuda::getStreamFromPool(bool, signed char) 2025-05-07T20:03:45.5360591Z U c10::cuda::getStreamFromPool(int, signed char) 2025-05-07T20:03:45.5360750Z U c10::cuda::setCurrentCUDAStream(c10::cuda::CUDAStream) 2025-05-07T20:03:45.5360881Z U c10::cuda::warn_or_error_on_sync() 2025-05-07T20:03:45.5361338Z U c10::detail::ListImpl::ListImpl(std::vector >, c10::Type::SingletonOrSharedTypePtr) 2025-05-07T20:03:45.5361874Z U c10::detail::infer_schema::make_function_schema(c10::ArrayRef, c10::ArrayRef) 2025-05-07T20:03:45.5362202Z U c10::detail::torchCheckFail(char const*, char const*, unsigned int, char const*) 2025-05-07T20:03:45.5362709Z U c10::detail::torchCheckFail(char const*, char const*, unsigned int, std::__cxx11::basic_string, std::allocator > const&) 2025-05-07T20:03:45.5363063Z U c10::detail::torchInternalAssertFail(char const*, char const*, unsigned int, char const*, char const*) 2025-05-07T20:03:45.5363675Z U c10::detail::torchInternalAssertFail(char const*, char const*, unsigned int, char const*, std::__cxx11::basic_string, std::allocator > const&) 2025-05-07T20:03:45.5363958Z U c10::impl::ExcludeDispatchKeyGuard::ExcludeDispatchKeyGuard(c10::DispatchKeySet) 2025-05-07T20:03:45.5364252Z U c10::impl::ExcludeDispatchKeyGuard::ExcludeDispatchKeyGuard(c10::DispatchKeySet) 2025-05-07T20:03:45.5364460Z U c10::impl::ExcludeDispatchKeyGuard::~ExcludeDispatchKeyGuard() 2025-05-07T20:03:45.5364583Z U c10::impl::GPUTrace::gpuTraceState 2025-05-07T20:03:45.5364714Z U c10::impl::GPUTrace::haveState 2025-05-07T20:03:45.5365042Z U c10::impl::OperatorEntry::assertSignatureIsCorrect(c10::impl::CppSignature const&, bool) const 2025-05-07T20:03:45.5365256Z U c10::impl::OperatorEntry::reportError(c10::DispatchKey) const 2025-05-07T20:03:45.5365407Z U c10::impl::PyObjectSlot::PyObjectSlot() 2025-05-07T20:03:45.5365539Z U c10::impl::PyObjectSlot::~PyObjectSlot() 
2025-05-07T20:03:45.5365693Z U c10::impl::cow::is_cow_data_ptr(c10::DataPtr const&) 2025-05-07T20:03:45.5365887Z U c10::impl::cow::materialize_cow_storage(c10::StorageImpl&) 2025-05-07T20:03:45.5366027Z U c10::impl::device_guard_impl_registry 2025-05-07T20:03:45.5366146Z U c10::impl::raw_local_dispatch_key_set 2025-05-07T20:03:45.5366303Z U c10::initializeFunctionalityOffsetsAndMasks() 2025-05-07T20:03:45.5366708Z U c10::ivalue::ConstantString::create(std::__cxx11::basic_string, std::allocator >) 2025-05-07T20:03:45.5366836Z U c10::operator*(c10::SymInt const&, int) 2025-05-07T20:03:45.5366956Z U c10::operator+(c10::SymInt const&, int) 2025-05-07T20:03:45.5367124Z U c10::operator+(c10::SymInt const&, unsigned long) 2025-05-07T20:03:45.5367247Z U c10::operator-(c10::SymInt const&, int) 2025-05-07T20:03:45.5367390Z U c10::operator-(c10::SymInt const&, unsigned long) 2025-05-07T20:03:45.5367534Z U c10::operator/(c10::SymInt const&, int) 2025-05-07T20:03:45.5367661Z U c10::operator<(c10::SymInt const&, int) 2025-05-07T20:03:45.5367809Z U c10::operator<<(std::ostream&, c10::Device const&) 2025-05-07T20:03:45.5367972Z U c10::operator<<(std::ostream&, c10::DeviceType) 2025-05-07T20:03:45.5368146Z U c10::operator<<(std::ostream&, c10::OperatorName const&) 2025-05-07T20:03:45.5368289Z U c10::operator<<(std::ostream&, c10::SymInt const&) 2025-05-07T20:03:45.5368429Z U c10::operator==(c10::SymInt const&, int) 2025-05-07T20:03:45.5368558Z U c10::operator>(c10::SymInt const&, int) 2025-05-07T20:03:45.5368682Z U c10::operator>=(c10::SymInt const&, int) 2025-05-07T20:03:45.5368817Z U c10::report_overflow(char const*) 2025-05-07T20:03:45.5368937Z U c10::throwNullDataPtrError() 2025-05-07T20:03:45.5369066Z U c10::typeKindToString(c10::TypeKind) 2025-05-07T20:03:45.5369174Z U c10::warn(c10::Warning const&) 2025-05-07T20:03:45.5369310Z U c10::warnDeprecatedDataPtr() 2025-05-07T20:03:45.5369513Z U caffe2::TypeMeta::error_unsupported_typemeta(caffe2::TypeMeta) 
2025-05-07T20:03:45.5369706Z U caffe2::TypeMeta::typeMetaDatas() 2025-05-07T20:03:45.5369838Z U cublasGemmStridedBatchedEx 2025-05-07T20:03:45.5369942Z U cublasSetStream_v2 2025-05-07T20:03:45.5370079Z U cudaDeviceGetAttribute@libcudart.so.11.0 2025-05-07T20:03:45.5370230Z U cudaDeviceGetByPCIBusId@libcudart.so.11.0 2025-05-07T20:03:45.5370364Z U cudaDeviceSynchronize@libcudart.so.11.0 2025-05-07T20:03:45.5370502Z U cudaEventCreateWithFlags@libcudart.so.11.0 2025-05-07T20:03:45.5370624Z U cudaEventDestroy@libcudart.so.11.0 2025-05-07T20:03:45.5370767Z U cudaEventElapsedTime@libcudart.so.11.0 2025-05-07T20:03:45.5370882Z U cudaEventQuery@libcudart.so.11.0 2025-05-07T20:03:45.5371001Z U cudaEventRecord@libcudart.so.11.0 2025-05-07T20:03:45.5371146Z U cudaEventSynchronize@libcudart.so.11.0 2025-05-07T20:03:45.5371255Z U cudaFree@libcudart.so.11.0 2025-05-07T20:03:45.5371386Z U cudaFuncGetAttributes@libcudart.so.11.0 2025-05-07T20:03:45.5371526Z U cudaFuncSetAttribute@libcudart.so.11.0 2025-05-07T20:03:45.5371638Z U cudaGetDevice@libcudart.so.11.0 2025-05-07T20:03:45.5371793Z U cudaGetDeviceCount@libcudart.so.11.0 2025-05-07T20:03:45.5371932Z U cudaGetDeviceProperties@libcudart.so.11.0 2025-05-07T20:03:45.5372067Z U cudaGetErrorString@libcudart.so.11.0 2025-05-07T20:03:45.5372185Z U cudaGetLastError@libcudart.so.11.0 2025-05-07T20:03:45.5372346Z U cudaHostGetDevicePointer@libcudart.so.11.0 2025-05-07T20:03:45.5372476Z U cudaHostRegister@libcudart.so.11.0 2025-05-07T20:03:45.5372598Z U cudaHostUnregister@libcudart.so.11.0 2025-05-07T20:03:45.5372716Z U cudaLaunchKernel@libcudart.so.11.0 2025-05-07T20:03:45.5372855Z U cudaMallocManaged@libcudart.so.11.0 2025-05-07T20:03:45.5372967Z U cudaMemAdvise@libcudart.so.11.0 2025-05-07T20:03:45.5373093Z U cudaMemPrefetchAsync@libcudart.so.11.0 2025-05-07T20:03:45.5373213Z U cudaMemcpy2DAsync@libcudart.so.11.0 2025-05-07T20:03:45.5373345Z U cudaMemsetAsync@libcudart.so.11.0 2025-05-07T20:03:45.5373646Z U 
cudaOccupancyMaxActiveBlocksPerMultiprocessorWithFlags@libcudart.so.11.0 2025-05-07T20:03:45.5373893Z U cudaPeekAtLastError@libcudart.so.11.0 2025-05-07T20:03:45.5374131Z U cudaSetDevice@libcudart.so.11.0 2025-05-07T20:03:45.5374241Z U cudaStreamQuery@libcudart.so.11.0 2025-05-07T20:03:45.5374363Z U cudaStreamSynchronize@libcudart.so.11.0 2025-05-07T20:03:45.5374493Z U cudaStreamWaitEvent@libcudart.so.11.0 2025-05-07T20:03:45.5374636Z U double* at::TensorBase::data_ptr() const 2025-05-07T20:03:45.5374794Z U double* at::TensorBase::mutable_data_ptr() const 2025-05-07T20:03:45.5374902Z U exit@GLIBC_2.2.5 2025-05-07T20:03:45.5374989Z U exp10@GLIBC_2.2.5 2025-05-07T20:03:45.5375077Z U exp2@GLIBC_2.2.5 2025-05-07T20:03:45.5375162Z U exp@GLIBC_2.2.5 2025-05-07T20:03:45.5375265Z U expf@GLIBC_2.2.5 2025-05-07T20:03:45.5375454Z U fbgemm_gpu::asynchronous_complete_cumsum_cpu(at::Tensor const&) 2025-05-07T20:03:45.5375638Z U fbgemm_gpu::asynchronous_complete_cumsum_gpu(at::Tensor const&) 2025-05-07T20:03:45.5375842Z U fbgemm_gpu::asynchronous_exclusive_cumsum_cpu(at::Tensor const&) 2025-05-07T20:03:45.5376028Z U fbgemm_gpu::asynchronous_exclusive_cumsum_gpu(at::Tensor const&) 2025-05-07T20:03:45.5376211Z U fbgemm_gpu::asynchronous_inclusive_cumsum_gpu(at::Tensor const&) 2025-05-07T20:03:45.5376416Z U float* at::TensorBase::data_ptr() const 2025-05-07T20:03:45.5376567Z U float* at::TensorBase::mutable_data_ptr() const 2025-05-07T20:03:45.5376655Z U fmod@GLIBC_2.2.5 2025-05-07T20:03:45.5376753Z U free@GLIBC_2.2.5 2025-05-07T20:03:45.5376869Z U get_info_B_num_bits_from_T(int, int) 2025-05-07T20:03:45.5376976Z U int at::Tensor::item() const 2025-05-07T20:03:45.5377130Z U int const* at::TensorBase::const_data_ptr() const 2025-05-07T20:03:45.5377262Z U int* at::TensorBase::data_ptr() const 2025-05-07T20:03:45.5377401Z U int* at::TensorBase::mutable_data_ptr() const 2025-05-07T20:03:45.5377494Z U isnan@GLIBC_2.2.5 2025-05-07T20:03:45.5377595Z U lgamma@GLIBC_2.2.5 
2025-05-07T20:03:45.5377687Z U llrint@GLIBC_2.2.5 2025-05-07T20:03:45.5377781Z U llround@GLIBC_2.2.5 2025-05-07T20:03:45.5377880Z U log10@GLIBC_2.2.5 2025-05-07T20:03:45.5377963Z U log2@GLIBC_2.2.5 2025-05-07T20:03:45.5378050Z U log@GLIBC_2.2.5 2025-05-07T20:03:45.5378138Z U logl@GLIBC_2.2.5 2025-05-07T20:03:45.5378288Z U long at::Tensor::item() const 2025-05-07T20:03:45.5378458Z U long c10::detail::maybe_wrap_dim_slow(long, long, bool) 2025-05-07T20:03:45.5378621Z U long const* at::TensorBase::const_data_ptr() const 2025-05-07T20:03:45.5378763Z U long* at::TensorBase::data_ptr() const 2025-05-07T20:03:45.5378931Z U long* at::TensorBase::mutable_data_ptr() const 2025-05-07T20:03:45.5379021Z U lrint@GLIBC_2.2.5 2025-05-07T20:03:45.5379129Z U madvise@GLIBC_2.2.5 2025-05-07T20:03:45.5379218Z U malloc@GLIBC_2.2.5 2025-05-07T20:03:45.5379310Z U memcmp@GLIBC_2.2.5 2025-05-07T20:03:45.5379397Z U memcpy@GLIBC_2.14 2025-05-07T20:03:45.5379501Z U memmove@GLIBC_2.2.5 2025-05-07T20:03:45.5379588Z U memset@GLIBC_2.2.5 2025-05-07T20:03:45.5379685Z U nextafter@GLIBC_2.2.5 2025-05-07T20:03:45.5379800Z U nvmlDeviceGetCount_v2 2025-05-07T20:03:45.5379909Z U nvmlDeviceGetHandleByIndex_v2 2025-05-07T20:03:45.5380034Z U nvmlDeviceGetNvLinkRemotePciInfo_v2 2025-05-07T20:03:45.5380150Z U nvmlDeviceGetNvLinkState 2025-05-07T20:03:45.5380253Z U nvmlDeviceGetPciInfo_v3 2025-05-07T20:03:45.5380339Z U nvmlInit_v2 2025-05-07T20:03:45.5380449Z U operator delete(void*)@GLIBCXX_3.4 2025-05-07T20:03:45.5380578Z U operator new(unsigned long)@GLIBCXX_3.4 2025-05-07T20:03:45.5380698Z U operator new[](unsigned long)@GLIBCXX_3.4 2025-05-07T20:03:45.5380789Z U pow@GLIBC_2.2.5 2025-05-07T20:03:45.5380890Z U printf@GLIBC_2.2.5 2025-05-07T20:03:45.5380979Z U puts@GLIBC_2.2.5 2025-05-07T20:03:45.5381071Z U realloc@GLIBC_2.2.5 2025-05-07T20:03:45.5381222Z U short* at::TensorBase::mutable_data_ptr() const 2025-05-07T20:03:45.5381429Z U signed char* at::TensorBase::mutable_data_ptr() const 
2025-05-07T20:03:45.5381517Z U sin@GLIBC_2.2.5 2025-05-07T20:03:45.5381719Z U std::_Hash_bytes(void const*, unsigned long, unsigned long)@CXXABI_1.3.5 2025-05-07T20:03:45.5381895Z U std::_Rb_tree_decrement(std::_Rb_tree_node_base*)@GLIBCXX_3.4 2025-05-07T20:03:45.5382077Z U std::_Rb_tree_increment(std::_Rb_tree_node_base const*)@GLIBCXX_3.4 2025-05-07T20:03:45.5382242Z U std::_Rb_tree_increment(std::_Rb_tree_node_base*)@GLIBCXX_3.4 2025-05-07T20:03:45.5382678Z U std::_Rb_tree_insert_and_rebalance(bool, std::_Rb_tree_node_base*, std::_Rb_tree_node_base*, std::_Rb_tree_node_base&)@GLIBCXX_3.4 2025-05-07T20:03:45.5383006Z U std::__cxx11::basic_ostringstream, std::allocator >::basic_ostringstream() 2025-05-07T20:03:45.5383396Z U std::__cxx11::basic_ostringstream, std::allocator >::~basic_ostringstream()@GLIBCXX_3.4.21 2025-05-07T20:03:45.5383774Z U std::__cxx11::basic_string, std::allocator >::_M_append(char const*, unsigned long)@GLIBCXX_3.4.21 2025-05-07T20:03:45.5384295Z U std::__cxx11::basic_string, std::allocator >::_M_assign(std::__cxx11::basic_string, std::allocator > const&) 2025-05-07T20:03:45.5384668Z U std::__cxx11::basic_string, std::allocator >::_M_construct(unsigned long, char)@GLIBCXX_3.4.21 2025-05-07T20:03:45.5385057Z U std::__cxx11::basic_string, std::allocator >::_M_create(unsigned long&, unsigned long)@GLIBCXX_3.4.21 2025-05-07T20:03:45.5385515Z U std::__cxx11::basic_string, std::allocator >::_M_mutate(unsigned long, unsigned long, char const*, unsigned long) 2025-05-07T20:03:45.5386018Z U std::__cxx11::basic_string, std::allocator >::_M_replace(unsigned long, unsigned long, char const*, unsigned long)@GLIBCXX_3.4.21 2025-05-07T20:03:45.5386562Z U std::__cxx11::basic_string, std::allocator >::swap(std::__cxx11::basic_string, std::allocator >&)@GLIBCXX_3.4.21 2025-05-07T20:03:45.5386889Z U std::__cxx11::basic_stringstream, std::allocator >::basic_stringstream() 2025-05-07T20:03:45.5387261Z U std::__cxx11::basic_stringstream, std::allocator 
>::~basic_stringstream()@GLIBCXX_3.4.21 2025-05-07T20:03:45.5387609Z U std::__detail::_Prime_rehash_policy::_M_need_rehash(unsigned long, unsigned long, unsigned long) const@GLIBCXX_3.4.18 2025-05-07T20:03:45.5387734Z U std::__once_call@GLIBCXX_3.4.11 2025-05-07T20:03:45.5387846Z U std::__once_callable@GLIBCXX_3.4.11 2025-05-07T20:03:45.5387953Z U std::__throw_bad_alloc()@GLIBCXX_3.4 2025-05-07T20:03:45.5388074Z U std::__throw_bad_array_new_length() 2025-05-07T20:03:45.5388178Z U std::__throw_bad_cast()@GLIBCXX_3.4 2025-05-07T20:03:45.5388312Z U std::__throw_bad_function_call()@GLIBCXX_3.4.14 2025-05-07T20:03:45.5388460Z U std::__throw_length_error(char const*)@GLIBCXX_3.4 2025-05-07T20:03:45.5388593Z U std::__throw_logic_error(char const*)@GLIBCXX_3.4 2025-05-07T20:03:45.5388727Z U std::__throw_out_of_range(char const*)@GLIBCXX_3.4 2025-05-07T20:03:45.5388905Z U std::__throw_out_of_range_fmt(char const*, ...)@GLIBCXX_3.4.20 2025-05-07T20:03:45.5389031Z U std::__throw_system_error(int)@GLIBCXX_3.4.11 2025-05-07T20:03:45.5389168Z U std::bad_weak_ptr::~bad_weak_ptr()@GLIBCXX_3.4.15 2025-05-07T20:03:45.5389375Z U std::basic_filebuf >::close()@GLIBCXX_3.4 2025-05-07T20:03:45.5389702Z U std::basic_ifstream >::basic_ifstream(char const*, std::_Ios_Openmode)@GLIBCXX_3.4 2025-05-07T20:03:45.5389933Z U std::basic_ifstream >::~basic_ifstream()@GLIBCXX_3.4 2025-05-07T20:03:45.5390171Z U std::basic_ios >::clear(std::_Ios_Iostate)@GLIBCXX_3.4 2025-05-07T20:03:45.5390545Z U std::basic_ofstream >::basic_ofstream(char const*, std::_Ios_Openmode)@GLIBCXX_3.4 2025-05-07T20:03:45.5390779Z U std::basic_ofstream >::~basic_ofstream()@GLIBCXX_3.4 2025-05-07T20:03:45.5391347Z U std::basic_ostream >& std::__ostream_insert >(std::basic_ostream >&, char const*, long)@GLIBCXX_3.4.9 2025-05-07T20:03:45.5391493Z U std::chrono::_V2::system_clock::now()@GLIBCXX_3.4.19 2025-05-07T20:03:45.5391589Z U std::cout@GLIBCXX_3.4 2025-05-07T20:03:45.5391759Z U std::ctype::_M_widen_init() 
const@GLIBCXX_3.4.11 2025-05-07T20:03:45.5391884Z U std::exception::~exception()@GLIBCXX_3.4 2025-05-07T20:03:45.5392002Z U std::ios_base::Init::Init()@GLIBCXX_3.4 2025-05-07T20:03:45.5392150Z U std::ios_base::Init::~Init()@GLIBCXX_3.4 2025-05-07T20:03:45.5392265Z U std::ios_base::~ios_base()@GLIBCXX_3.4 2025-05-07T20:03:45.5392375Z U std::locale::~locale()@GLIBCXX_3.4 2025-05-07T20:03:45.5392583Z U std::ostream& std::ostream::_M_insert(double)@GLIBCXX_3.4.9 2025-05-07T20:03:45.5393030Z U std::ostream& std::ostream::_M_insert(long)@GLIBCXX_3.4.9 2025-05-07T20:03:45.5393451Z U std::ostream& std::ostream::_M_insert(unsigned long)@GLIBCXX_3.4.9 2025-05-07T20:03:45.5393588Z U std::ostream::flush()@GLIBCXX_3.4 2025-05-07T20:03:45.5393717Z U std::ostream::operator<<(int)@GLIBCXX_3.4 2025-05-07T20:03:45.5393923Z U std::ostream::put(char)@GLIBCXX_3.4 2025-05-07T20:03:45.5394123Z U std::ostream::write(char const*, long)@GLIBCXX_3.4 2025-05-07T20:03:45.5394296Z U std::out_of_range::out_of_range(char const*)@GLIBCXX_3.4.21 2025-05-07T20:03:45.5394439Z U std::out_of_range::~out_of_range()@GLIBCXX_3.4 2025-05-07T20:03:45.5394636Z U std::runtime_error::runtime_error(char const*)@GLIBCXX_3.4.21 2025-05-07T20:03:45.5395075Z U std::runtime_error::runtime_error(std::__cxx11::basic_string, std::allocator > const&)@GLIBCXX_3.4.21 2025-05-07T20:03:45.5395224Z U std::runtime_error::~runtime_error()@GLIBCXX_3.4 2025-05-07T20:03:45.5395349Z U std::terminate()@GLIBCXX_3.4 2025-05-07T20:03:45.5395449Z U strcmp@GLIBC_2.2.5 2025-05-07T20:03:45.5395549Z U strlen@GLIBC_2.2.5 2025-05-07T20:03:45.5395646Z U sysconf@GLIBC_2.2.5 2025-05-07T20:03:45.5395796Z U torch::CppFunction::~CppFunction() 2025-05-07T20:03:45.5396410Z U torch::Library::Library(torch::Library::Kind, std::__cxx11::basic_string, std::allocator >, std::optional, char const*, unsigned int) 2025-05-07T20:03:45.5396890Z U torch::Library::_def(c10::FunctionSchema&&, c10::OperatorName*, std::vector > const&, torch::_RegisterOrVerify) & 
2025-05-07T20:03:45.5397428Z U torch::Library::_def(std::variant&&, torch::CppFunction&&, std::vector > const&) & 2025-05-07T20:03:45.5397700Z U torch::Library::_impl(char const*, torch::CppFunction&&, torch::_RegisterOrVerify) & 2025-05-07T20:03:45.5397850Z U torch::autograd::AnomalyMode::_enabled 2025-05-07T20:03:45.5398152Z U torch::autograd::AutogradContext::AutogradContext(torch::dynamo::autograd::PackedArgs&) 2025-05-07T20:03:45.5398347Z U torch::autograd::AutogradContext::get_and_bump_dirty() const 2025-05-07T20:03:45.5398574Z U torch::autograd::AutogradContext::get_non_differentiable() const 2025-05-07T20:03:45.5398816Z U torch::autograd::AutogradContext::get_saved_variables() const 2025-05-07T20:03:45.5399173Z U torch::autograd::AutogradContext::save_for_backward(std::vector >) 2025-05-07T20:03:45.5399348Z U torch::autograd::AutogradContext::save_variables() 2025-05-07T20:03:45.5399544Z U torch::autograd::ForwardADLevel::try_get_by_idx(unsigned long) 2025-05-07T20:03:45.5399730Z U torch::autograd::InputMetadata::shape_as_dim_vector() const 2025-05-07T20:03:45.5399875Z U torch::autograd::Node::assign_parent() 2025-05-07T20:03:45.5399994Z U torch::autograd::Node::metadata() 2025-05-07T20:03:45.5400140Z U torch::autograd::Node::name[abi:cxx11]() const 2025-05-07T20:03:45.5400410Z U torch::autograd::SavedVariable::SavedVariable(at::Tensor const&, bool, bool) 2025-05-07T20:03:45.5400694Z U torch::autograd::SavedVariable::unpack(std::shared_ptr) const 2025-05-07T20:03:45.5400841Z U torch::autograd::VariableInfo::VariableInfo() 2025-05-07T20:03:45.5401074Z U torch::autograd::VariableInfo::VariableInfo(at::Tensor const&, bool) 2025-05-07T20:03:45.5401301Z U torch::autograd::VariableInfo::zeros(c10::OptionalDeviceGuard&) const 2025-05-07T20:03:45.5404609Z U torch::autograd::_wrap_outputs(std::vector > const&, std::unordered_set, std::equal_to, std::allocator > const&, std::unordered_set, std::equal_to, std::allocator > const&, c10::ArrayRef >, std::shared_ptr 
const&, std::function > (std::vector >, std::vector >)> const&, std::unordered_set, std::equal_to, std::allocator > const&, std::function const&) 2025-05-07T20:03:45.5404781Z U torch::autograd::deleteNode(torch::autograd::Node*) 2025-05-07T20:03:45.5404939Z U torch::autograd::get_current_graph_task_exec_info() 2025-05-07T20:03:45.5405130Z U torch::autograd::impl::gradient_edge(at::Tensor const&) 2025-05-07T20:03:45.5405292Z U torch::dynamo::autograd::getPyCompilerInterface() 2025-05-07T20:03:45.5405715Z U torch::dynamo::autograd::get_input_metadata(std::vector > const&) 2025-05-07T20:03:45.5406104Z U torch::jit::parseSchema(std::__cxx11::basic_string, std::allocator > const&, bool) 2025-05-07T20:03:45.5406515Z U torch::jit::parseSchemaOrName(std::__cxx11::basic_string, std::allocator > const&, bool) 2025-05-07T20:03:45.5406742Z U torch::pickle_load(std::vector > const&) 2025-05-07T20:03:45.5406868Z U torch::pickle_save(c10::IValue const&) 2025-05-07T20:03:45.5407437Z U transpose_embedding_input(at::Tensor, long, at::Tensor, at::Tensor, bool, std::optional const&, long, long, long, bool, std::optional const&, long, long) 2025-05-07T20:03:45.5407570Z U typeinfo for c10::Error 2025-05-07T20:03:45.5407679Z U typeinfo for c10::Type 2025-05-07T20:03:45.5407826Z U typeinfo for std::bad_weak_ptr@GLIBCXX_3.4.15 2025-05-07T20:03:45.5407956Z U typeinfo for std::exception@GLIBCXX_3.4 2025-05-07T20:03:45.5408108Z U typeinfo for std::out_of_range@GLIBCXX_3.4 2025-05-07T20:03:45.5408303Z U typeinfo for std::runtime_error@GLIBCXX_3.4 2025-05-07T20:03:45.5408424Z U typeinfo for torch::autograd::Node 2025-05-07T20:03:45.5408639Z U unsigned char* at::TensorBase::data_ptr() const 2025-05-07T20:03:45.5408854Z U unsigned char* at::TensorBase::mutable_data_ptr() const 2025-05-07T20:03:45.5409315Z U void fbgemm::FloatOrHalfToFused8BitRowwiseQuantizedSBFloat(float const*, unsigned long, int, unsigned char*) 2025-05-07T20:03:45.5409852Z U void 
fbgemm::FloatOrHalfToFused8BitRowwiseQuantizedSBFloat(unsigned short const*, unsigned long, int, unsigned char*) 2025-05-07T20:03:45.5410311Z U void fbgemm::FloatOrHalfToFusedNBitRowwiseQuantizedSBHalf(int, float const*, unsigned long, int, unsigned char*) 2025-05-07T20:03:45.5410861Z U void fbgemm::FloatOrHalfToFusedNBitRowwiseQuantizedSBHalf(int, unsigned short const*, unsigned long, int, unsigned char*) 2025-05-07T20:03:45.5411314Z U void fbgemm::Fused8BitRowwiseQuantizedSBFloatToFloatOrHalf(unsigned char const*, unsigned long, int, float*) 2025-05-07T20:03:45.5411844Z U void fbgemm::Fused8BitRowwiseQuantizedSBFloatToFloatOrHalf(unsigned char const*, unsigned long, int, unsigned short*) 2025-05-07T20:03:45.5412366Z U void fbgemm::FusedNBitRowwiseQuantizedSBHalfToFloatOrHalf(int, unsigned char const*, unsigned long, int, float*, bool) 2025-05-07T20:03:45.5412915Z U void fbgemm::FusedNBitRowwiseQuantizedSBHalfToFloatOrHalf(int, unsigned char const*, unsigned long, int, unsigned short*, bool) 2025-05-07T20:03:45.5413461Z U void fbgemm::FusedNBitRowwiseQuantizedSBHalfToFloatOrHalfRef(int, unsigned char const*, unsigned long, int, float*, bool) 2025-05-07T20:03:45.5414072Z U void fbgemm::FusedNBitRowwiseQuantizedSBHalfToFloatOrHalfRef(int, unsigned char const*, unsigned long, int, unsigned short*, bool) 2025-05-07T20:03:45.5414672Z U void fbgemm::FusedNBitRowwiseQuantizedSBHalfToFloatOrHalfRef(int, unsigned char const*, unsigned long, int, unsigned short*, bool) 2025-05-07T20:03:45.5414841Z U vtable for __cxxabiv1::__class_type_info@CXXABI_1.3 2025-05-07T20:03:45.5415005Z U vtable for __cxxabiv1::__function_type_info@CXXABI_1.3 2025-05-07T20:03:45.5415162Z U vtable for __cxxabiv1::__pointer_type_info@CXXABI_1.3 2025-05-07T20:03:45.5415331Z U vtable for __cxxabiv1::__si_class_type_info@CXXABI_1.3 2025-05-07T20:03:45.5415495Z U vtable for __cxxabiv1::__vmi_class_type_info@CXXABI_1.3 2025-05-07T20:03:45.5415607Z U vtable for at::TensorIterator 
2025-05-07T20:03:45.5415738Z U vtable for at::TensorIteratorBase 2025-05-07T20:03:45.5415839Z U vtable for c10::Error 2025-05-07T20:03:45.5415943Z U vtable for c10::ListType 2025-05-07T20:03:45.5416281Z U vtable for std::__cxx11::basic_stringbuf, std::allocator >@GLIBCXX_3.4.21 2025-05-07T20:03:45.5416432Z U vtable for std::bad_weak_ptr@GLIBCXX_3.4.15 2025-05-07T20:03:45.5416665Z U vtable for std::basic_streambuf >@GLIBCXX_3.4 2025-05-07T20:03:45.5416796Z U vtable for torch::autograd::AutogradMeta 2025-05-07T20:03:45.5416920Z U vtable for torch::autograd::Node 2025-05-07T20:03:45.5417104Z w TLS init function for c10::impl::raw_local_dispatch_key_set 2025-05-07T20:03:45.5417217Z w _ITM_deregisterTMCloneTable 2025-05-07T20:03:45.5417337Z w _ITM_registerTMCloneTable 2025-05-07T20:03:45.5417447Z w __cxa_finalize@GLIBC_2.2.5 2025-05-07T20:03:45.5417582Z w __gmon_start__ 2025-05-07T20:03:45.5417694Z w __pthread_key_create 2025-05-07T20:03:45.5417804Z w pthread_mutex_lock@GLIBC_2.2.5 2025-05-07T20:03:45.5417917Z w pthread_mutex_unlock@GLIBC_2.2.5 2025-05-07T20:03:45.5418014Z w pthread_once 2025-05-07T20:03:45.5418185Z [CHECK] Listing out external shared libraries linked: 2025-05-07T20:03:45.5418365Z + ldd ./_skbuild/linux-x86_64-3.13/cmake-build/fbgemm_gpu_py.so 2025-05-07T20:03:45.5418372Z 2025-05-07T20:03:45.5418481Z linux-vdso.so.1 (0x00007ffe609f7000) 2025-05-07T20:03:45.5418585Z libc10.so => not found 2025-05-07T20:03:45.5418799Z libc10_cuda.so => not found 2025-05-07T20:03:45.5419165Z fbgemm.so => /__w/FBGEMM/FBGEMM/fbgemm_gpu/./_skbuild/linux-x86_64-3.13/cmake-build/fbgemm.so (0x00007f5e59c00000) 2025-05-07T20:03:45.5419276Z libnvidia-ml.so.1 => not found 2025-05-07T20:03:45.5419365Z libtorch.so => not found 2025-05-07T20:03:45.5419914Z fbgemm_gpu_sparse_async_cumsum.so => /__w/FBGEMM/FBGEMM/fbgemm_gpu/./_skbuild/linux-x86_64-3.13/cmake-build/fbgemm_gpu_sparse_async_cumsum.so (0x00007f5e5cde8000) 2025-05-07T20:03:45.5420375Z fbgemm_gpu_tbe_utils.so => 
/__w/FBGEMM/FBGEMM/fbgemm_gpu/./_skbuild/linux-x86_64-3.13/cmake-build/fbgemm_gpu_tbe_utils.so (0x00007f5e59200000) 2025-05-07T20:03:45.5420486Z libtorch_cpu.so => not found 2025-05-07T20:03:45.5420604Z libtorch_cuda.so => not found 2025-05-07T20:03:45.5420701Z libcudart.so.11.0 => not found 2025-05-07T20:03:45.5420880Z libstdc++.so.6 => /lib64/libstdc++.so.6 (0x00007f5e58f9c000) 2025-05-07T20:03:45.5421005Z libm.so.6 => /lib64/libm.so.6 (0x00007f5e59b25000) 2025-05-07T20:03:45.5421155Z libgcc_s.so.1 => /lib64/libgcc_s.so.1 (0x00007f5e5cdb8000) 2025-05-07T20:03:45.5421310Z libc.so.6 => /lib64/libc.so.6 (0x00007f5e58d94000) 2025-05-07T20:03:45.5421439Z /lib64/ld-linux-x86-64.so.2 (0x00007f5e5cee6000) 2025-05-07T20:03:45.5421527Z libc10.so => not found 2025-05-07T20:03:45.5421884Z asmjit.so => /__w/FBGEMM/FBGEMM/fbgemm_gpu/./_skbuild/linux-x86_64-3.13/cmake-build/asmjit.so (0x00007f5e5a188000) 2025-05-07T20:03:45.5421991Z libtorch.so => not found 2025-05-07T20:03:45.5422090Z libtorch_cpu.so => not found 2025-05-07T20:03:45.5422177Z libtorch_cuda.so => not found 2025-05-07T20:03:45.5422266Z libc10.so => not found 2025-05-07T20:03:45.5422351Z libc10_cuda.so => not found 2025-05-07T20:03:45.5422437Z libtorch.so => not found 2025-05-07T20:03:45.5422525Z libtorch_cpu.so => not found 2025-05-07T20:03:45.5422630Z libtorch_cuda.so => not found 2025-05-07T20:03:45.5422718Z libcudart.so.11.0 => not found 2025-05-07T20:03:45.5422863Z libgomp.so.1 => /lib64/libgomp.so.1 (0x00007f5e59acf000) 2025-05-07T20:03:45.5422964Z libtorch.so => not found 2025-05-07T20:03:45.5423047Z libc10.so => not found 2025-05-07T20:03:45.5423136Z libc10_cuda.so => not found 2025-05-07T20:03:45.5423244Z libtorch_cpu.so => not found 2025-05-07T20:03:45.5423334Z libtorch_cuda.so => not found 2025-05-07T20:03:45.5423430Z libcudart.so.11.0 => not found 2025-05-07T20:03:45.5423523Z libtorch_cpu.so => not found 2025-05-07T20:03:45.5423622Z libtorch_cuda.so => not found 2025-05-07T20:03:45.5423705Z libtorch.so 
=> not found 2025-05-07T20:03:45.5423833Z librt.so.1 => /lib64/librt.so.1 (0x00007f5e5a17d000) 2025-05-07T20:03:45.5424022Z libpthread.so.0 => /lib64/libpthread.so.0 (0x00007f5e5a178000) 2025-05-07T20:03:45.5424061Z 2025-05-07T20:03:45.5424166Z [CHECK] Displaying ELF information: 2025-05-07T20:03:45.5424364Z + readelf -d ./_skbuild/linux-x86_64-3.13/cmake-build/fbgemm_gpu_py.so 2025-05-07T20:03:45.5424370Z 2025-05-07T20:03:45.5424374Z 2025-05-07T20:03:45.5424543Z Dynamic section at offset 0x27457c0 contains 42 entries: 2025-05-07T20:03:45.5424660Z Tag Type Name/Value 2025-05-07T20:03:45.5424853Z 0x0000000000000001 (NEEDED) Shared library: [libc10.so] 2025-05-07T20:03:45.5425065Z 0x0000000000000001 (NEEDED) Shared library: [libc10_cuda.so] 2025-05-07T20:03:45.5426114Z 0x0000000000000001 (NEEDED) Shared library: [fbgemm.so] 2025-05-07T20:03:45.5426323Z 0x0000000000000001 (NEEDED) Shared library: [libnvidia-ml.so.1] 2025-05-07T20:03:45.5426529Z 0x0000000000000001 (NEEDED) Shared library: [libtorch.so] 2025-05-07T20:03:45.5426775Z 0x0000000000000001 (NEEDED) Shared library: [fbgemm_gpu_sparse_async_cumsum.so] 2025-05-07T20:03:45.5426994Z 0x0000000000000001 (NEEDED) Shared library: [fbgemm_gpu_tbe_utils.so] 2025-05-07T20:03:45.5427196Z 0x0000000000000001 (NEEDED) Shared library: [libtorch_cpu.so] 2025-05-07T20:03:45.5427402Z 0x0000000000000001 (NEEDED) Shared library: [libtorch_cuda.so] 2025-05-07T20:03:45.5427610Z 0x0000000000000001 (NEEDED) Shared library: [libcudart.so.11.0] 2025-05-07T20:03:45.5427806Z 0x0000000000000001 (NEEDED) Shared library: [libstdc++.so.6] 2025-05-07T20:03:45.5427999Z 0x0000000000000001 (NEEDED) Shared library: [libm.so.6] 2025-05-07T20:03:45.5428193Z 0x0000000000000001 (NEEDED) Shared library: [libgcc_s.so.1] 2025-05-07T20:03:45.5428378Z 0x0000000000000001 (NEEDED) Shared library: [libc.so.6] 2025-05-07T20:03:45.5428598Z 0x0000000000000001 (NEEDED) Shared library: [ld-linux-x86-64.so.2] 2025-05-07T20:03:45.5428845Z 0x000000000000000e (SONAME) 
Library soname: [fbgemm_gpu_py.so] 2025-05-07T20:03:45.5429024Z 0x000000000000000f (RPATH) Library rpath: [$ORIGIN] 2025-05-07T20:03:45.5429149Z 0x000000000000000c (INIT) 0x1b0000 2025-05-07T20:03:45.5429253Z 0x000000000000000d (FINI) 0x73d51c 2025-05-07T20:03:45.5429371Z 0x0000000000000019 (INIT_ARRAY) 0x27387d0 2025-05-07T20:03:45.5429529Z 0x000000000000001b (INIT_ARRAYSZ) 1160 (bytes) 2025-05-07T20:03:45.5429655Z 0x000000000000001a (FINI_ARRAY) 0x2738c58 2025-05-07T20:03:45.5429769Z 0x000000000000001c (FINI_ARRAYSZ) 8 (bytes) 2025-05-07T20:03:45.5429886Z 0x000000006ffffef5 (GNU_HASH) 0x238 2025-05-07T20:03:45.5430003Z 0x0000000000000005 (STRTAB) 0x2fcd8 2025-05-07T20:03:45.5430106Z 0x0000000000000006 (SYMTAB) 0x91d0 2025-05-07T20:03:45.5430242Z 0x000000000000000a (STRSZ) 1264098 (bytes) 2025-05-07T20:03:45.5430367Z 0x000000000000000b (SYMENT) 24 (bytes) 2025-05-07T20:03:45.5430482Z 0x0000000000000003 (PLTGOT) 0x2746aa0 2025-05-07T20:03:45.5430613Z 0x0000000000000002 (PLTRELSZ) 68832 (bytes) 2025-05-07T20:03:45.5430722Z 0x0000000000000014 (PLTREL) RELA 2025-05-07T20:03:45.5430838Z 0x0000000000000017 (JMPREL) 0x19e3c8 2025-05-07T20:03:45.5430945Z 0x0000000000000007 (RELA) 0x167bd0 2025-05-07T20:03:45.5431077Z 0x0000000000000008 (RELASZ) 223224 (bytes) 2025-05-07T20:03:45.5431201Z 0x0000000000000009 (RELAENT) 24 (bytes) 2025-05-07T20:03:45.5431296Z 0x0000000000000018 (BIND_NOW) 2025-05-07T20:03:45.5431420Z 0x000000006ffffffb (FLAGS_1) Flags: NOW 2025-05-07T20:03:45.5431533Z 0x000000006ffffffe (VERNEED) 0x167a50 2025-05-07T20:03:45.5431656Z 0x000000006fffffff (VERNEEDNUM) 6 2025-05-07T20:03:45.5431769Z 0x000000006ffffff0 (VERSYM) 0x1646ba 2025-05-07T20:03:45.5431874Z 0x000000006ffffff9 (RELACOUNT) 2456 2025-05-07T20:03:45.5431982Z 0x0000000000000000 (NULL) 0x0 2025-05-07T20:03:45.5431988Z 2025-05-07T20:03:45.5432099Z ################################################################################ 2025-05-07T20:03:45.5432104Z 2025-05-07T20:03:45.5432108Z 
2025-05-07T20:03:45.5432306Z [CHECK] Verifying sample subset of symbols in the built libraries ... 2025-05-07T20:03:45.5518598Z [CHECK] Symbol NOT found in ./_skbuild/linux-x86_64-3.13/cmake-build/asmjit.so: fbgemm_gpu::asynchronous_inclusive_cumsum_cpu 2025-05-07T20:03:45.5545074Z [CHECK] Symbol NOT found in ./_skbuild/linux-x86_64-3.13/cmake-build/fbgemm_gpu_config.so: fbgemm_gpu::asynchronous_inclusive_cumsum_cpu 2025-05-07T20:03:45.5596923Z [CHECK] Symbol NOT found in ./_skbuild/linux-x86_64-3.13/cmake-build/fbgemm_gpu_tbe_cache.so: fbgemm_gpu::asynchronous_inclusive_cumsum_cpu 2025-05-07T20:03:45.5633011Z [CHECK] Symbol NOT found in ./_skbuild/linux-x86_64-3.13/cmake-build/fbgemm_gpu_tbe_optimizers.so: fbgemm_gpu::asynchronous_inclusive_cumsum_cpu 2025-05-07T20:03:45.5860797Z [CHECK] Symbol NOT found in ./_skbuild/linux-x86_64-3.13/cmake-build/fbgemm.so: fbgemm_gpu::asynchronous_inclusive_cumsum_cpu 2025-05-07T20:03:45.5905254Z [CHECK] Symbol NOT found in ./_skbuild/linux-x86_64-3.13/cmake-build/fbgemm_gpu_tbe_common.so: fbgemm_gpu::asynchronous_inclusive_cumsum_cpu 2025-05-07T20:03:45.5943949Z [CHECK] Symbol NOT found in ./_skbuild/linux-x86_64-3.13/cmake-build/fbgemm_gpu_tbe_utils.so: fbgemm_gpu::asynchronous_inclusive_cumsum_cpu 2025-05-07T20:03:45.5972009Z [CHECK] Found symbol in ./_skbuild/linux-x86_64-3.13/cmake-build/fbgemm_gpu_sparse_async_cumsum.so: fbgemm_gpu::asynchronous_inclusive_cumsum_cpu 2025-05-07T20:03:45.6081358Z [CHECK] Symbol NOT found in ./_skbuild/linux-x86_64-3.13/cmake-build/asmjit.so: fbgemm_gpu::jagged_2d_to_dense 2025-05-07T20:03:45.6105587Z [CHECK] Symbol NOT found in ./_skbuild/linux-x86_64-3.13/cmake-build/fbgemm_gpu_config.so: fbgemm_gpu::jagged_2d_to_dense 2025-05-07T20:03:45.6158048Z [CHECK] Symbol NOT found in ./_skbuild/linux-x86_64-3.13/cmake-build/fbgemm_gpu_tbe_cache.so: fbgemm_gpu::jagged_2d_to_dense 2025-05-07T20:03:45.6203515Z [CHECK] Symbol NOT found in 
./_skbuild/linux-x86_64-3.13/cmake-build/fbgemm_gpu_tbe_optimizers.so: fbgemm_gpu::jagged_2d_to_dense 2025-05-07T20:03:45.6434435Z [CHECK] Symbol NOT found in ./_skbuild/linux-x86_64-3.13/cmake-build/fbgemm.so: fbgemm_gpu::jagged_2d_to_dense 2025-05-07T20:03:45.6470991Z [CHECK] Symbol NOT found in ./_skbuild/linux-x86_64-3.13/cmake-build/fbgemm_gpu_tbe_common.so: fbgemm_gpu::jagged_2d_to_dense 2025-05-07T20:03:45.6504912Z [CHECK] Symbol NOT found in ./_skbuild/linux-x86_64-3.13/cmake-build/fbgemm_gpu_tbe_utils.so: fbgemm_gpu::jagged_2d_to_dense 2025-05-07T20:03:45.6542942Z [CHECK] Symbol NOT found in ./_skbuild/linux-x86_64-3.13/cmake-build/fbgemm_gpu_sparse_async_cumsum.so: fbgemm_gpu::jagged_2d_to_dense 2025-05-07T20:03:45.6954668Z [CHECK] Symbol NOT found in ./_skbuild/linux-x86_64-3.13/cmake-build/fbgemm_gpu_tbe_training_backward_pt2.so: fbgemm_gpu::jagged_2d_to_dense 2025-05-07T20:03:45.7314037Z [CHECK] Symbol NOT found in ./_skbuild/linux-x86_64-3.13/cmake-build/fbgemm_gpu_tbe_inference.so: fbgemm_gpu::jagged_2d_to_dense 2025-05-07T20:03:45.8256465Z [CHECK] Symbol NOT found in ./_skbuild/linux-x86_64-3.13/cmake-build/fbgemm_gpu_tbe_training_forward.so: fbgemm_gpu::jagged_2d_to_dense 2025-05-07T20:03:45.8480195Z [CHECK] Symbol NOT found in ./_skbuild/linux-x86_64-3.13/cmake-build/fbgemm_gpu_tbe_training_backward_split_host.so: fbgemm_gpu::jagged_2d_to_dense 2025-05-07T20:03:45.8569992Z [CHECK] Symbol NOT found in ./_skbuild/linux-x86_64-3.13/cmake-build/fbgemm_gpu_tbe_index_select.so: fbgemm_gpu::jagged_2d_to_dense 2025-05-07T20:03:45.8603082Z [CHECK] Symbol NOT found in ./_skbuild/linux-x86_64-3.13/cmake-build/fbgemm_gpu_embedding_inplace_ops.so: fbgemm_gpu::jagged_2d_to_dense 2025-05-07T20:03:46.0467795Z [CHECK] Symbol NOT found in ./_skbuild/linux-x86_64-3.13/cmake-build/fbgemm_gpu_tbe_training_backward.so: fbgemm_gpu::jagged_2d_to_dense 2025-05-07T20:03:46.0702410Z [CHECK] Symbol NOT found in 
./_skbuild/linux-x86_64-3.13/cmake-build/fbgemm_gpu_tbe_training_backward_gwd.so: fbgemm_gpu::jagged_2d_to_dense 2025-05-07T20:03:46.1301649Z [CHECK] Symbol NOT found in ./_skbuild/linux-x86_64-3.13/cmake-build/fbgemm_gpu_tbe_training_backward_vbe.so: fbgemm_gpu::jagged_2d_to_dense 2025-05-07T20:03:46.1431024Z [CHECK] Symbol NOT found in ./_skbuild/linux-x86_64-3.13/cmake-build/fbgemm_gpu_tbe_training_backward_dense.so: fbgemm_gpu::jagged_2d_to_dense 2025-05-07T20:03:46.1760452Z [CHECK] Found symbol in ./_skbuild/linux-x86_64-3.13/cmake-build/fbgemm_gpu_py.so: fbgemm_gpu::jagged_2d_to_dense 2025-05-07T20:03:46.1762705Z ################################################################################ 2025-05-07T20:03:46.1764012Z [BUILD] Wheel Audit: dist/fbgemm_gpu_nightly-2025.5.7-cp313-cp313-manylinux_2_28_x86_64.whl 2025-05-07T20:03:46.1764422Z 2025-05-07T20:03:46.1764897Z + conda run --no-capture-output -n build_binary auditwheel show dist/fbgemm_gpu_nightly-2025.5.7-cp313-cp313-manylinux_2_28_x86_64.whl 2025-05-07T20:03:46.1765629Z 2025-05-07T20:03:54.1883067Z 2025-05-07T20:03:54.1884450Z fbgemm_gpu_nightly-2025.5.7-cp313-cp313-manylinux_2_28_x86_64.whl is 2025-05-07T20:03:54.1885035Z consistent with the following platform tag: "linux_x86_64". 
2025-05-07T20:03:54.1885333Z 2025-05-07T20:03:54.1885537Z The wheel references external versioned symbols in these 2025-05-07T20:03:54.1886148Z system-provided shared libraries: librt.so.1 with versions 2025-05-07T20:03:54.1886589Z {'GLIBC_2.2.5'}, libgcc_s.so.1 with versions {'GCC_3.0', 2025-05-07T20:03:54.1886996Z 'GCC_12.0.0'}, libstdc++.so.6 with versions {'GLIBCXX_3.4.14', 2025-05-07T20:03:54.1887597Z 'GLIBCXX_3.4.18', 'CXXABI_1.3.11', 'GLIBCXX_3.4.21', 'GLIBCXX_3.4', 2025-05-07T20:03:54.1888031Z 'CXXABI_1.3', 'GLIBCXX_3.4.9', 'GLIBCXX_3.4.15', 'GLIBCXX_3.4.19', 2025-05-07T20:03:54.1888483Z 'GLIBCXX_3.4.11', 'CXXABI_1.3.3', 'GLIBCXX_3.4.20', 'CXXABI_1.3.7', 2025-05-07T20:03:54.1888918Z 'CXXABI_1.3.5'}, libc.so.6 with versions {'GLIBC_2.14', 'GLIBC_2.17', 2025-05-07T20:03:54.1889549Z 'GLIBC_2.3.3', 'GLIBC_2.3', 'GLIBC_2.6', 'GLIBC_2.3.2', 2025-05-07T20:03:54.1889968Z 'GLIBC_2.2.5'}, libpthread.so.0 with versions {'GLIBC_2.3.4', 2025-05-07T20:03:54.1890362Z 'GLIBC_2.2.5'}, libm.so.6 with versions {'GLIBC_2.2.5'}, 2025-05-07T20:03:54.1890811Z libcudart.so.11.0 with versions {'libcudart.so.11.0'}, libgomp.so.1 2025-05-07T20:03:54.1892307Z with versions {'OMP_1.0'}, libdl.so.2 with versions {'GLIBC_2.2.5'} 2025-05-07T20:03:54.1892627Z 2025-05-07T20:03:54.1892826Z This constrains the platform tag to "manylinux_2_35_x86_64". In order 2025-05-07T20:03:54.1893346Z to achieve a more compatible tag, you would need to recompile a new 2025-05-07T20:03:54.1893794Z wheel from source on a system with earlier versions of these 2025-05-07T20:03:54.1894195Z libraries, such as a recent manylinux image. 2025-05-07T20:03:54.2733354Z 2025-05-07T20:03:54.2733463Z 2025-05-07T20:03:54.2733816Z ################################################################################ 2025-05-07T20:03:54.2734535Z [BUILD] Enumerating the built wheels ... 
2025-05-07T20:03:54.2735046Z + ls -lth dist/fbgemm_gpu_nightly-2025.5.7-cp313-cp313-manylinux_2_28_x86_64.whl 2025-05-07T20:03:54.2735413Z 2025-05-07T20:03:54.2754446Z -rw-r--r--. 1 root root 262M May 7 20:03 dist/fbgemm_gpu_nightly-2025.5.7-cp313-cp313-manylinux_2_28_x86_64.whl 2025-05-07T20:03:54.2755856Z 2025-05-07T20:03:54.2756212Z [BUILD] Enumerating the wheel SHAs ... 2025-05-07T20:03:54.2757553Z + sha1sum dist/fbgemm_gpu_nightly-2025.5.7-cp313-cp313-manylinux_2_28_x86_64.whl 2025-05-07T20:03:54.2758652Z 2025-05-07T20:03:54.7715121Z d739780d869fe373d1f5af873560d01f3916d6b0 dist/fbgemm_gpu_nightly-2025.5.7-cp313-cp313-manylinux_2_28_x86_64.whl 2025-05-07T20:03:54.7715688Z 2025-05-07T20:03:54.7715981Z + sha256sum dist/fbgemm_gpu_nightly-2025.5.7-cp313-cp313-manylinux_2_28_x86_64.whl 2025-05-07T20:03:54.7716364Z 2025-05-07T20:03:55.9207438Z 94fc520ace106455b43b8879411c0a1dec08c529c9ebdd4e2837174f6b343206 dist/fbgemm_gpu_nightly-2025.5.7-cp313-cp313-manylinux_2_28_x86_64.whl 2025-05-07T20:03:55.9209422Z 2025-05-07T20:03:55.9210162Z + md5sum dist/fbgemm_gpu_nightly-2025.5.7-cp313-cp313-manylinux_2_28_x86_64.whl 2025-05-07T20:03:55.9211258Z 2025-05-07T20:03:56.3627014Z fb03a783686277bc815a9cb2527b23db dist/fbgemm_gpu_nightly-2025.5.7-cp313-cp313-manylinux_2_28_x86_64.whl 2025-05-07T20:03:56.3627544Z 2025-05-07T20:03:56.3627719Z [BUILD] FBGEMM-GPU build + package completed 2025-05-07T20:03:56.3741911Z ##[group]Run actions/upload-artifact@v4 2025-05-07T20:03:56.3742227Z with: 2025-05-07T20:03:56.3742503Z name: fbgemm_default_x86_clang_py3.13_cu11.8.0.whl 2025-05-07T20:03:56.3743018Z path: fbgemm_gpu/dist/*.whl 2025-05-07T20:03:56.3743290Z if-no-files-found: error 2025-05-07T20:03:56.3743585Z compression-level: 6 2025-05-07T20:03:56.3743814Z overwrite: false 2025-05-07T20:03:56.3744089Z include-hidden-files: false 2025-05-07T20:03:56.3744356Z env: 2025-05-07T20:03:56.3744600Z PRELUDE: .github/scripts/setup_env.bash 2025-05-07T20:03:56.3744905Z BUILD_ENV: build_binary 
2025-05-07T20:03:56.3745147Z BUILD_TARGET: default 2025-05-07T20:03:56.3745380Z BUILD_VARIANT: cuda 2025-05-07T20:03:56.3745644Z BUILD_CUDA_VERSION: 11.8.0 2025-05-07T20:03:56.3745875Z ##[endgroup] 2025-05-07T20:03:56.3749325Z ##[command]/usr/bin/docker exec 2b02554cc61113bb96fb80bbab95670dde250cea5f4d3e11972b04e9d3bcf9fd sh -c "cat /etc/*release | grep ^ID" 2025-05-07T20:03:56.8092998Z With the provided path, there will be 1 file uploaded 2025-05-07T20:03:56.8093562Z Artifact name is valid! 2025-05-07T20:03:56.8093954Z Root directory input is valid! 2025-05-07T20:03:56.9194547Z Beginning upload of artifact content to blob storage 2025-05-07T20:03:57.6968218Z Uploaded bytes 8388608 2025-05-07T20:03:58.2574492Z Uploaded bytes 16777216 2025-05-07T20:03:58.6864475Z Uploaded bytes 25165824 2025-05-07T20:03:59.2012751Z Uploaded bytes 33554432 2025-05-07T20:03:59.6965417Z Uploaded bytes 41943040 2025-05-07T20:04:00.2279382Z Uploaded bytes 50331648 2025-05-07T20:04:00.7943022Z Uploaded bytes 58720256 2025-05-07T20:04:01.2931373Z Uploaded bytes 67108864 2025-05-07T20:04:01.8536643Z Uploaded bytes 75497472 2025-05-07T20:04:02.2555544Z Uploaded bytes 83886080 2025-05-07T20:04:02.8280692Z Uploaded bytes 92274688 2025-05-07T20:04:03.3477524Z Uploaded bytes 100663296 2025-05-07T20:04:03.9675289Z Uploaded bytes 109051904 2025-05-07T20:04:04.3711275Z Uploaded bytes 117440512 2025-05-07T20:04:04.9199194Z Uploaded bytes 125829120 2025-05-07T20:04:05.3541349Z Uploaded bytes 134217728 2025-05-07T20:04:05.8491851Z Uploaded bytes 142606336 2025-05-07T20:04:06.3522219Z Uploaded bytes 150994944 2025-05-07T20:04:06.9219706Z Uploaded bytes 159383552 2025-05-07T20:04:07.4290048Z Uploaded bytes 167772160 2025-05-07T20:04:07.9332496Z Uploaded bytes 176160768 2025-05-07T20:04:08.4782928Z Uploaded bytes 184549376 2025-05-07T20:04:09.0238067Z Uploaded bytes 192937984 2025-05-07T20:04:09.5504647Z Uploaded bytes 201326592 2025-05-07T20:04:10.0902211Z Uploaded bytes 209715200 
2025-05-07T20:04:10.6493851Z Uploaded bytes 218103808 2025-05-07T20:04:11.0801412Z Uploaded bytes 226492416 2025-05-07T20:04:11.5986635Z Uploaded bytes 234881024 2025-05-07T20:04:12.0933608Z Uploaded bytes 243269632 2025-05-07T20:04:12.6950408Z Uploaded bytes 251658240 2025-05-07T20:04:13.0621812Z Uploaded bytes 260046848 2025-05-07T20:04:13.5485914Z Uploaded bytes 268056314 2025-05-07T20:04:13.5636212Z Finished uploading artifact content to blob storage! 2025-05-07T20:04:13.5638275Z SHA256 digest of uploaded artifact zip is a74b5b1440d0fc4a949d21cc9addd3e895d0ca4b1a167ffa69d634b041e428cc 2025-05-07T20:04:13.5640060Z Finalizing artifact upload 2025-05-07T20:04:13.7146415Z Artifact fbgemm_default_x86_clang_py3.13_cu11.8.0.whl.zip successfully finalized. Artifact ID 3081409350 2025-05-07T20:04:13.7147371Z Artifact fbgemm_default_x86_clang_py3.13_cu11.8.0.whl has been successfully uploaded! Final size is 268056314 bytes. Artifact ID is 3081409350 2025-05-07T20:04:13.7154652Z Artifact download URL: https://github.com/pytorch/FBGEMM/actions/runs/14891846252/artifacts/3081409350 2025-05-07T20:04:13.7381423Z Post job cleanup. 
2025-05-07T20:04:13.7386062Z ##[command]/usr/bin/docker exec 2b02554cc61113bb96fb80bbab95670dde250cea5f4d3e11972b04e9d3bcf9fd sh -c "cat /etc/*release | grep ^ID" 2025-05-07T20:04:14.0113929Z [command]/usr/bin/git version 2025-05-07T20:04:14.0149904Z git version 2.47.1 2025-05-07T20:04:14.0180175Z Copying '/github/home/.gitconfig' to '/__w/_temp/74bf872e-0ea2-4eee-8795-0faff89cf3df/.gitconfig' 2025-05-07T20:04:14.0189690Z Temporarily overriding HOME='/__w/_temp/74bf872e-0ea2-4eee-8795-0faff89cf3df' before making global git config changes 2025-05-07T20:04:14.0190893Z Adding repository directory to the temporary git global config as a safe directory 2025-05-07T20:04:14.0193000Z [command]/usr/bin/git config --global --add safe.directory /__w/FBGEMM/FBGEMM 2025-05-07T20:04:14.0239905Z [command]/usr/bin/git config --local --name-only --get-regexp core\.sshCommand 2025-05-07T20:04:14.0266518Z [command]/usr/bin/git submodule foreach --recursive sh -c "git config --local --name-only --get-regexp 'core\.sshCommand' && git config --local --unset-all 'core.sshCommand' || :" 2025-05-07T20:04:14.0540609Z Entering 'external/asmjit' 2025-05-07T20:04:14.0590037Z Entering 'external/composable_kernel' 2025-05-07T20:04:14.0644992Z Entering 'external/cpuinfo' 2025-05-07T20:04:14.0694305Z Entering 'external/cutlass' 2025-05-07T20:04:14.0751674Z Entering 'external/googletest' 2025-05-07T20:04:14.0820991Z Entering 'external/hipify_torch' 2025-05-07T20:04:14.0885381Z Entering 'external/json' 2025-05-07T20:04:14.0956506Z [command]/usr/bin/git config --local --name-only --get-regexp http\.https\:\/\/github\.com\/\.extraheader 2025-05-07T20:04:14.0977071Z http.https://github.com/.extraheader 2025-05-07T20:04:14.0982744Z [command]/usr/bin/git config --local --unset-all http.https://github.com/.extraheader 2025-05-07T20:04:14.1011341Z [command]/usr/bin/git submodule foreach --recursive sh -c "git config --local --name-only --get-regexp 'http\.https\:\/\/github\.com\/\.extraheader' && git 
config --local --unset-all 'http.https://github.com/.extraheader' || :" 2025-05-07T20:04:14.1276954Z Entering 'external/asmjit' 2025-05-07T20:04:14.1311185Z http.https://github.com/.extraheader 2025-05-07T20:04:14.1346377Z Entering 'external/composable_kernel' 2025-05-07T20:04:14.1380717Z http.https://github.com/.extraheader 2025-05-07T20:04:14.1419207Z Entering 'external/cpuinfo' 2025-05-07T20:04:14.1464410Z http.https://github.com/.extraheader 2025-05-07T20:04:14.1504705Z Entering 'external/cutlass' 2025-05-07T20:04:14.1540983Z http.https://github.com/.extraheader 2025-05-07T20:04:14.1583891Z Entering 'external/googletest' 2025-05-07T20:04:14.1614170Z http.https://github.com/.extraheader 2025-05-07T20:04:14.1654996Z Entering 'external/hipify_torch' 2025-05-07T20:04:14.1686653Z http.https://github.com/.extraheader 2025-05-07T20:04:14.1716074Z Entering 'external/json' 2025-05-07T20:04:14.1751614Z http.https://github.com/.extraheader 2025-05-07T20:04:14.1914173Z Stop and remove container: 3b54c127a5cb47de8f35c3c3802a9fab_amazonlinux2023_18a699 2025-05-07T20:04:14.1919634Z ##[command]/usr/bin/docker rm --force 2b02554cc61113bb96fb80bbab95670dde250cea5f4d3e11972b04e9d3bcf9fd 2025-05-07T20:04:14.9095047Z 2b02554cc61113bb96fb80bbab95670dde250cea5f4d3e11972b04e9d3bcf9fd 2025-05-07T20:04:14.9126517Z Remove container network: github_network_935158e13aba4e44929564f3b9c47480 2025-05-07T20:04:14.9130856Z ##[command]/usr/bin/docker network rm github_network_935158e13aba4e44929564f3b9c47480 2025-05-07T20:04:15.9948005Z github_network_935158e13aba4e44929564f3b9c47480 2025-05-07T20:04:15.9979134Z A job completed hook has been configured by the self-hosted runner administrator 2025-05-07T20:04:15.9997250Z ##[group]Run '/home/ec2-user/runner-scripts/after_job.sh' 2025-05-07T20:04:16.0003375Z shell: /usr/bin/bash --noprofile --norc -e -o pipefail {0} 2025-05-07T20:04:16.0003801Z ##[endgroup] 2025-05-07T20:04:16.0110517Z [!ALERT!] Swap in detected! [!ALERT!] 
2025-05-07T20:04:26.0703649Z [!ALERT!] Swap out detected [!ALERT!] 2025-05-07T20:04:41.9994654Z Cleaning up orphan processes