2025-05-07T19:42:37.4837256Z Current runner version: '2.323.0' 2025-05-07T19:42:37.4843526Z Runner name: 'i-07daac3f3a185b77c' 2025-05-07T19:42:37.4844513Z Machine name: 'ip-10-0-20-46' 2025-05-07T19:42:37.4847355Z ##[group]GITHUB_TOKEN Permissions 2025-05-07T19:42:37.4849574Z Contents: read 2025-05-07T19:42:37.4850293Z Metadata: read 2025-05-07T19:42:37.4851037Z Packages: read 2025-05-07T19:42:37.4851640Z ##[endgroup] 2025-05-07T19:42:37.4854151Z Secret source: None 2025-05-07T19:42:37.4855153Z Prepare workflow directory 2025-05-07T19:42:37.5484975Z Prepare all required actions 2025-05-07T19:42:37.5522841Z Getting action download info 2025-05-07T19:42:37.7378788Z Download action repository 'actions/checkout@v4' (SHA:11bd71901bbe5b1630ceea73d27597364c9af683) 2025-05-07T19:42:37.9464668Z Download action repository 'actions/upload-artifact@v4' (SHA:ea165f8d65b6e75b540449e92b4886f43607fa02) 2025-05-07T19:42:38.3058780Z Complete job name: build_artifact (x86, linux.24xlarge, default, 3.13, 12.6.3, gcc) 2025-05-07T19:42:38.3846426Z A job started hook has been configured by the self-hosted runner administrator 2025-05-07T19:42:38.3957553Z ##[group]Run '/home/ec2-user/runner-scripts/before_job.sh' 2025-05-07T19:42:38.3966686Z shell: /usr/bin/bash --noprofile --norc -e -o pipefail {0} 2025-05-07T19:42:38.3967544Z ##[endgroup] 2025-05-07T19:42:39.5068673Z Runner Type: linux.24xlarge 2025-05-07T19:42:39.5069982Z Instance Type: c5.24xlarge 2025-05-07T19:42:39.5070965Z AMI Name: unknown 2025-05-07T19:42:39.5095407Z AMI ID: ami-071226ecf16aa7d96 2025-05-07T19:42:44.5822775Z ##[group]Checking docker version 2025-05-07T19:42:44.5836392Z ##[command]/usr/bin/docker version --format '{{.Server.APIVersion}}' 2025-05-07T19:42:44.6046453Z '1.44' 2025-05-07T19:42:44.6062681Z Docker daemon API version: '1.44' 2025-05-07T19:42:44.6063239Z ##[command]/usr/bin/docker version --format '{{.Client.APIVersion}}' 2025-05-07T19:42:44.6267163Z '1.44' 2025-05-07T19:42:44.6286383Z Docker client API version: '1.44' 2025-05-07T19:42:44.6291370Z ##[endgroup] 2025-05-07T19:42:44.6294036Z ##[group]Clean up resources from previous jobs 2025-05-07T19:42:44.6299431Z ##[command]/usr/bin/docker ps --all --quiet --no-trunc --filter "label=e1751d" 2025-05-07T19:42:44.6451543Z ##[command]/usr/bin/docker network prune --force --filter "label=e1751d" 2025-05-07T19:42:44.6584627Z ##[endgroup] 2025-05-07T19:42:44.6584974Z ##[group]Create local container network 2025-05-07T19:42:44.6594098Z ##[command]/usr/bin/docker network create --label e1751d github_network_a087cb1554ec44a393d5ce1ddb30c3db 2025-05-07T19:42:44.9462219Z 0f67270358a1449c933b54298c3eec97094d3b404dd44efa6ca75c85ae8fdf06 2025-05-07T19:42:44.9487597Z ##[endgroup] 2025-05-07T19:42:44.9520883Z ##[group]Starting job container 2025-05-07T19:42:44.9547343Z ##[command]/usr/bin/docker pull amazonlinux:2023 2025-05-07T19:42:45.0980356Z 2023: Pulling from library/amazonlinux 2025-05-07T19:42:45.1095925Z Digest: sha256:cb5b4c509d62ae388f674c139ae5e8281fc160c217d474445e912043e1941988 2025-05-07T19:42:45.1096602Z Status: Image is up to date for amazonlinux:2023 2025-05-07T19:42:45.1121576Z docker.io/library/amazonlinux:2023 2025-05-07T19:42:45.1211618Z ##[command]/usr/bin/docker create --name c54895c4960e410caa647b9f8dfd47b1_amazonlinux2023_5ed145 --label e1751d --workdir /__w/FBGEMM/FBGEMM --network github_network_a087cb1554ec44a393d5ce1ddb30c3db --user root -e "HOME=/github/home" -e GITHUB_ACTIONS=true -e CI=true -v "/var/run/docker.sock":"/var/run/docker.sock" -v "/home/ec2-user/actions-runner/_work":"/__w" -v "/home/ec2-user/actions-runner/externals":"/__e":ro -v "/home/ec2-user/actions-runner/_work/_temp":"/__w/_temp" -v "/home/ec2-user/actions-runner/_work/_actions":"/__w/_actions" -v "/home/ec2-user/actions-runner/_work/_tool":"/__w/_tool" -v "/home/ec2-user/actions-runner/_work/_temp/_github_home":"/github/home" -v "/home/ec2-user/actions-runner/_work/_temp/_github_workflow":"/github/workflow" --entrypoint "tail" amazonlinux:2023 "-f" "/dev/null" 2025-05-07T19:42:45.1662109Z 180e7cabfdf5006d0fba1b706a3ca06949f33b672d353cfee7af27b6030ff987 2025-05-07T19:42:45.1686857Z ##[command]/usr/bin/docker start 180e7cabfdf5006d0fba1b706a3ca06949f33b672d353cfee7af27b6030ff987 2025-05-07T19:42:45.6922886Z 180e7cabfdf5006d0fba1b706a3ca06949f33b672d353cfee7af27b6030ff987 2025-05-07T19:42:45.6945814Z ##[command]/usr/bin/docker ps --all --filter id=180e7cabfdf5006d0fba1b706a3ca06949f33b672d353cfee7af27b6030ff987 --filter status=running --no-trunc --format "{{.ID}} {{.Status}}" 2025-05-07T19:42:45.7107814Z 180e7cabfdf5006d0fba1b706a3ca06949f33b672d353cfee7af27b6030ff987 Up Less than a second 2025-05-07T19:42:45.7129268Z ##[command]/usr/bin/docker inspect --format "{{range .Config.Env}}{{println .}}{{end}}" 180e7cabfdf5006d0fba1b706a3ca06949f33b672d353cfee7af27b6030ff987 2025-05-07T19:42:45.7279239Z HOME=/github/home 2025-05-07T19:42:45.7279629Z GITHUB_ACTIONS=true 2025-05-07T19:42:45.7280106Z CI=true 2025-05-07T19:42:45.7280577Z PATH=/usr/local/sbin:/usr/local/bin:/usr/sbin:/usr/bin:/sbin:/bin 2025-05-07T19:42:45.7298684Z ##[endgroup] 2025-05-07T19:42:45.7308798Z ##[group]Waiting for all services to be ready 2025-05-07T19:42:45.7311055Z ##[endgroup] 2025-05-07T19:42:45.7399850Z ##[group]Run yum update -y; yum install -y binutils findutils git pciutils sudo tar wget which 2025-05-07T19:42:45.7400800Z yum update -y; yum install -y binutils findutils git pciutils sudo tar wget which 2025-05-07T19:42:45.7401734Z shell: bash --noprofile --norc -e -o pipefail {0} 2025-05-07T19:42:45.7402206Z env: 2025-05-07T19:42:45.7402511Z PRELUDE: .github/scripts/setup_env.bash 2025-05-07T19:42:45.7402912Z BUILD_ENV: build_binary 2025-05-07T19:42:45.7403372Z BUILD_TARGET: default 2025-05-07T19:42:45.7403682Z BUILD_VARIANT: cuda 2025-05-07T19:42:45.7404026Z BUILD_CUDA_VERSION: 12.6.3 2025-05-07T19:42:45.7404307Z ##[endgroup] 2025-05-07T19:42:46.3654634Z Amazon Linux 2023 repository 99 MB/s | 37 MB 00:00 2025-05-07T19:42:52.9518574Z Last metadata expiration check: 0:00:06 ago on Wed May 7 19:42:46 2025. 2025-05-07T19:42:53.5041289Z Dependencies resolved. 2025-05-07T19:42:53.5214042Z Nothing to do. 2025-05-07T19:42:53.5214985Z Complete! 2025-05-07T19:42:53.7537461Z Last metadata expiration check: 0:00:07 ago on Wed May 7 19:42:46 2025. 2025-05-07T19:42:53.8168175Z Dependencies resolved. 2025-05-07T19:42:53.8393807Z ======================================================================================== 2025-05-07T19:42:53.8394621Z Package Arch Version Repository Size 2025-05-07T19:42:53.8395647Z ======================================================================================== 2025-05-07T19:42:53.8396436Z Installing: 2025-05-07T19:42:53.8397010Z binutils x86_64 2.41-50.amzn2023.0.3 amazonlinux 5.3 M 2025-05-07T19:42:53.8397577Z findutils x86_64 1:4.8.0-2.amzn2023.0.2 amazonlinux 539 k 2025-05-07T19:42:53.8398310Z git x86_64 2.47.1-1.amzn2023.0.2 amazonlinux 54 k 2025-05-07T19:42:53.8398966Z pciutils x86_64 3.7.0-3.amzn2023.0.2 amazonlinux 93 k 2025-05-07T19:42:53.8399523Z sudo x86_64 1.9.15-1.p5.amzn2023.0.1 amazonlinux 1.3 M 2025-05-07T19:42:53.8400106Z tar x86_64 2:1.34-1.amzn2023.0.4 amazonlinux 879 k 2025-05-07T19:42:53.8400730Z wget x86_64 1.21.3-1.amzn2023.0.4 amazonlinux 779 k 2025-05-07T19:42:53.8401239Z which x86_64 2.21-26.amzn2023.0.2 amazonlinux 42 k 2025-05-07T19:42:53.8401779Z Installing dependencies: 2025-05-07T19:42:53.8402304Z cracklib x86_64 2.9.6-27.amzn2023.0.2 amazonlinux 82 k 2025-05-07T19:42:53.8402893Z cyrus-sasl-lib x86_64 2.1.27-18.amzn2023.0.3 amazonlinux 786 k 2025-05-07T19:42:53.8403579Z elfutils-debuginfod-client x86_64 0.188-3.amzn2023.0.2 amazonlinux 41 k 2025-05-07T19:42:53.8404162Z git-core x86_64 2.47.1-1.amzn2023.0.2 amazonlinux 4.7 M 2025-05-07T19:42:53.8405012Z git-core-doc noarch 2.47.1-1.amzn2023.0.2 amazonlinux 2.8 M 2025-05-07T19:42:53.8405543Z gnutls x86_64 3.8.3-6.amzn2023.0.1 amazonlinux 1.1 M 2025-05-07T19:42:53.8406135Z groff-base x86_64 1.22.4-7.amzn2023.0.2 amazonlinux 1.0 M 2025-05-07T19:42:53.8406686Z gzip x86_64 1.12-1.amzn2023.0.1 amazonlinux 160 k 2025-05-07T19:42:53.8407189Z hwdata noarch 0.384-1.amzn2023.0.3 amazonlinux 1.6 M 2025-05-07T19:42:53.8407858Z jansson x86_64 2.14-0.amzn2023 amazonlinux 46 k 2025-05-07T19:42:53.8408371Z kmod-libs x86_64 29-2.amzn2023.0.5 amazonlinux 62 k 2025-05-07T19:42:53.8408925Z less x86_64 608-2.amzn2023.0.2 amazonlinux 168 k 2025-05-07T19:42:53.8409668Z libcbor x86_64 0.7.0-3.amzn2023.0.2 amazonlinux 57 k 2025-05-07T19:42:53.8410288Z libdb x86_64 5.3.28-49.amzn2023.0.2 amazonlinux 756 k 2025-05-07T19:42:53.8527217Z libeconf x86_64 0.4.0-1.amzn2023.0.3 amazonlinux 28 k 2025-05-07T19:42:53.8528364Z libedit x86_64 3.1-38.20210714cvs.amzn2023.0.2 amazonlinux 108 k 2025-05-07T19:42:53.8529738Z libfdisk x86_64 2.37.4-1.amzn2023.0.4 amazonlinux 153 k 2025-05-07T19:42:53.8530386Z libfido2 x86_64 1.10.0-2.amzn2023.0.2 amazonlinux 95 k 2025-05-07T19:42:53.8531086Z libmetalink x86_64 0.1.3-14.amzn2023.0.2 amazonlinux 31 k 2025-05-07T19:42:53.8531829Z libpwquality x86_64 1.4.4-6.amzn2023.0.2 amazonlinux 106 k 2025-05-07T19:42:53.8532395Z libsemanage x86_64 3.4-5.amzn2023.0.2 amazonlinux 121 k 2025-05-07T19:42:53.8532947Z libutempter x86_64 1.2.1-4.amzn2023.0.2 amazonlinux 26 k 2025-05-07T19:42:53.8533503Z nano x86_64 8.3-1.amzn2023 amazonlinux 706 k 2025-05-07T19:42:53.8534021Z ncurses x86_64 6.2-4.20200222.amzn2023.0.6 amazonlinux 394 k 2025-05-07T19:42:53.8534504Z nettle x86_64 3.10.1-1.amzn2023.0.1 amazonlinux 573 k 2025-05-07T19:42:53.8535011Z openldap x86_64 2.4.57-6.amzn2023.0.7 amazonlinux 256 k 2025-05-07T19:42:53.8535500Z openssh x86_64 8.7p1-8.amzn2023.0.14 amazonlinux 454 k 2025-05-07T19:42:53.8536038Z openssh-clients x86_64 8.7p1-8.amzn2023.0.14 amazonlinux 708 k 2025-05-07T19:42:53.8536544Z pam x86_64 1.5.1-8.amzn2023.0.4 amazonlinux 542 k 2025-05-07T19:42:53.8537067Z pciutils-libs x86_64 3.7.0-3.amzn2023.0.2 amazonlinux 41 k 2025-05-07T19:42:53.8537644Z perl-AutoLoader noarch 5.74-477.amzn2023.0.6 amazonlinux 22 k 2025-05-07T19:42:53.8538163Z perl-B x86_64 1.80-477.amzn2023.0.6 amazonlinux 179 k 2025-05-07T19:42:53.8538679Z perl-Carp noarch 1.50-458.amzn2023.0.2 amazonlinux 29 k 2025-05-07T19:42:53.8539238Z perl-Class-Struct noarch 0.66-477.amzn2023.0.6 amazonlinux 22 k 2025-05-07T19:42:53.8539839Z perl-Data-Dumper x86_64 2.174-460.amzn2023.0.2 amazonlinux 55 k 2025-05-07T19:42:53.8540411Z perl-Digest noarch 1.20-1.amzn2023.0.2 amazonlinux 26 k 2025-05-07T19:42:53.8540947Z perl-Digest-MD5 x86_64 2.58-2.amzn2023.0.2 amazonlinux 36 k 2025-05-07T19:42:53.8541510Z perl-DynaLoader x86_64 1.47-477.amzn2023.0.6 amazonlinux 26 k 2025-05-07T19:42:53.8542297Z perl-Encode x86_64 4:3.15-462.amzn2023.0.2 amazonlinux 1.7 M 2025-05-07T19:42:53.8542844Z perl-Errno x86_64 1.30-477.amzn2023.0.6 amazonlinux 15 k 2025-05-07T19:42:53.8543474Z perl-Error noarch 1:0.17029-5.amzn2023.0.2 amazonlinux 41 k 2025-05-07T19:42:53.8543985Z perl-Exporter noarch 5.74-459.amzn2023.0.2 amazonlinux 31 k 2025-05-07T19:42:53.8544495Z perl-Fcntl x86_64 1.13-477.amzn2023.0.6 amazonlinux 21 k 2025-05-07T19:42:53.8545032Z perl-File-Basename noarch 2.85-477.amzn2023.0.6 amazonlinux 18 k 2025-05-07T19:42:53.8545595Z perl-File-Find noarch 1.37-477.amzn2023.0.6 amazonlinux 26 k 2025-05-07T19:42:53.8546117Z perl-File-Path noarch 2.18-2.amzn2023.0.2 amazonlinux 36 k 2025-05-07T19:42:53.8546655Z perl-File-Temp noarch 1:0.231.100-2.amzn2023.0.2 amazonlinux 60 k 2025-05-07T19:42:53.8547385Z perl-File-stat noarch 1.09-477.amzn2023.0.6 amazonlinux 17 k 2025-05-07T19:42:53.8547972Z perl-FileHandle noarch 2.03-477.amzn2023.0.6 amazonlinux 16 k 2025-05-07T19:42:53.8548512Z perl-Getopt-Long noarch 1:2.52-2.amzn2023.0.2 amazonlinux 60 k 2025-05-07T19:42:53.8549072Z perl-Getopt-Std noarch 1.12-477.amzn2023.0.6 amazonlinux 16 k 2025-05-07T19:42:53.8549599Z perl-Git noarch 2.47.1-1.amzn2023.0.2 amazonlinux 42 k 2025-05-07T19:42:53.8550146Z perl-HTTP-Tiny noarch 0.078-1.amzn2023.0.3 amazonlinux 56 k 2025-05-07T19:42:53.8550650Z perl-IO x86_64 1.43-477.amzn2023.0.6 amazonlinux 87 k 2025-05-07T19:42:53.8551145Z perl-IPC-Open3 noarch 1.21-477.amzn2023.0.6 amazonlinux 23 k 2025-05-07T19:42:53.8551695Z perl-MIME-Base64 x86_64 3.16-2.amzn2023.0.2 amazonlinux 31 k 2025-05-07T19:42:53.8552240Z perl-Net-SSLeay x86_64 1.94-1.amzn2023.0.1 amazonlinux 392 k 2025-05-07T19:42:53.8552736Z perl-POSIX x86_64 1.94-477.amzn2023.0.6 amazonlinux 97 k 2025-05-07T19:42:53.8553249Z perl-PathTools x86_64 3.78-459.amzn2023.0.2 amazonlinux 85 k 2025-05-07T19:42:53.8553782Z perl-Pod-Escapes noarch 1:1.07-458.amzn2023.0.2 amazonlinux 20 k 2025-05-07T19:42:53.8554345Z perl-Pod-Perldoc noarch 3.28.01-459.amzn2023.0.3 amazonlinux 84 k 2025-05-07T19:42:53.8554992Z perl-Pod-Simple noarch 1:3.42-2.amzn2023.0.2 amazonlinux 215 k 2025-05-07T19:42:53.8555510Z perl-Pod-Usage noarch 4:2.01-2.amzn2023.0.2 amazonlinux 41 k 2025-05-07T19:42:53.8556047Z perl-Scalar-List-Utils x86_64 4:1.56-459.amzn2023.0.2 amazonlinux 71 k 2025-05-07T19:42:53.8556591Z perl-SelectSaver noarch 1.02-477.amzn2023.0.6 amazonlinux 12 k 2025-05-07T19:42:53.8557098Z perl-Socket x86_64 4:2.032-1.amzn2023.0.2 amazonlinux 55 k 2025-05-07T19:42:53.8557568Z perl-Storable x86_64 1:3.21-458.amzn2023.0.2 amazonlinux 96 k 2025-05-07T19:42:53.8558061Z perl-Symbol noarch 1.08-477.amzn2023.0.6 amazonlinux 15 k 2025-05-07T19:42:53.8558589Z perl-Term-ANSIColor noarch 5.01-459.amzn2023.0.2 amazonlinux 48 k 2025-05-07T19:42:53.8559105Z perl-Term-Cap noarch 1.17-458.amzn2023.0.2 amazonlinux 22 k 2025-05-07T19:42:53.8559625Z perl-TermReadKey x86_64 2.38-9.amzn2023.0.2 amazonlinux 36 k 2025-05-07T19:42:53.8560151Z perl-Text-ParseWords noarch 3.30-458.amzn2023.0.2 amazonlinux 17 k 2025-05-07T19:42:53.8560719Z perl-Text-Tabs+Wrap noarch 2021.0726-1.amzn2023.0.1 amazonlinux 22 k 2025-05-07T19:42:53.8561351Z perl-Time-Local noarch 2:1.300-5.amzn2023.0.2 amazonlinux 34 k 2025-05-07T19:42:53.8561841Z perl-URI noarch 5.09-1.amzn2023.0.2 amazonlinux 108 k 2025-05-07T19:42:53.8562321Z perl-base noarch 2.27-477.amzn2023.0.6 amazonlinux 17 k 2025-05-07T19:42:53.8562803Z perl-constant noarch 1.33-459.amzn2023.0.2 amazonlinux 23 k 2025-05-07T19:42:53.8563295Z perl-if noarch 0.60.800-477.amzn2023.0.6 amazonlinux 14 k 2025-05-07T19:42:53.8563773Z perl-interpreter x86_64 4:5.32.1-477.amzn2023.0.6 amazonlinux 71 k 2025-05-07T19:42:53.8564270Z perl-lib x86_64 0.65-477.amzn2023.0.6 amazonlinux 15 k 2025-05-07T19:42:53.8564745Z perl-libnet noarch 3.13-2.amzn2023.0.2 amazonlinux 126 k 2025-05-07T19:42:53.8565205Z perl-libs x86_64 4:5.32.1-477.amzn2023.0.6 amazonlinux 2.0 M 2025-05-07T19:42:53.8566968Z perl-mro x86_64 1.23-477.amzn2023.0.6 amazonlinux 29 k 2025-05-07T19:42:53.8567478Z perl-overload noarch 1.31-477.amzn2023.0.6 amazonlinux 46 k 2025-05-07T19:42:53.8567995Z perl-overloading noarch 0.02-477.amzn2023.0.6 amazonlinux 13 k 2025-05-07T19:42:53.8568523Z perl-parent noarch 1:0.238-458.amzn2023.0.2 amazonlinux 14 k 2025-05-07T19:42:53.8569019Z perl-podlators noarch 1:4.14-458.amzn2023.0.2 amazonlinux 112 k 2025-05-07T19:42:53.8569522Z perl-subs noarch 1.03-477.amzn2023.0.6 amazonlinux 12 k 2025-05-07T19:42:53.8570000Z perl-vars noarch 1.05-477.amzn2023.0.6 amazonlinux 13 k 2025-05-07T19:42:53.8570760Z shadow-utils x86_64 2:4.9-12.amzn2023.0.4 amazonlinux 1.1 M 2025-05-07T19:42:53.8571302Z systemd-libs x86_64 252.23-3.amzn2023 amazonlinux 613 k 2025-05-07T19:42:53.8571801Z util-linux x86_64 2.37.4-1.amzn2023.0.4 amazonlinux 2.2 M 2025-05-07T19:42:53.8572329Z util-linux-core x86_64 2.37.4-1.amzn2023.0.4 amazonlinux 432 k 2025-05-07T19:42:53.8572774Z Installing weak dependencies: 2025-05-07T19:42:53.8573196Z nano-default-editor noarch 8.3-1.amzn2023 amazonlinux 10 k 2025-05-07T19:42:53.8573799Z perl-IO-Socket-IP noarch 0.41-3.amzn2023.0.2 amazonlinux 42 k 2025-05-07T19:42:53.8574364Z perl-IO-Socket-SSL noarch 2.075-1.amzn2023.0.2 amazonlinux 218 k 2025-05-07T19:42:53.8574947Z perl-Mozilla-CA noarch 20200520-4.amzn2023.0.2 amazonlinux 13 k 2025-05-07T19:42:53.8575480Z perl-NDBM_File x86_64 1.15-477.amzn2023.0.6 amazonlinux 23 k 2025-05-07T19:42:53.8576041Z sudo-python-plugin x86_64 1.9.15-1.p5.amzn2023.0.1 amazonlinux 56 k 2025-05-07T19:42:53.8576397Z 2025-05-07T19:42:53.8576512Z Transaction Summary 2025-05-07T19:42:53.8576764Z ======================================================================================== 2025-05-07T19:42:53.8577096Z Install 107 Packages 2025-05-07T19:42:53.8577251Z 2025-05-07T19:42:53.8577406Z Total download size: 38 M 2025-05-07T19:42:53.8577675Z Installed size: 151 M 2025-05-07T19:42:53.8577913Z Downloading Packages: 2025-05-07T19:42:53.9574103Z (1/107): cracklib-2.9.6-27.amzn2023.0.2.x86_64. 3.9 MB/s | 82 kB 00:00 2025-05-07T19:42:53.9717329Z (2/107): cyrus-sasl-lib-2.1.27-18.amzn2023.0.3. 22 MB/s | 786 kB 00:00 2025-05-07T19:42:53.9724040Z (3/107): elfutils-debuginfod-client-0.188-3.amz 3.5 MB/s | 41 kB 00:00 2025-05-07T19:42:53.9969535Z (4/107): binutils-2.41-50.amzn2023.0.3.x86_64.r 88 MB/s | 5.3 MB 00:00 2025-05-07T19:42:54.0016167Z (5/107): findutils-4.8.0-2.amzn2023.0.2.x86_64. 19 MB/s | 539 kB 00:00 2025-05-07T19:42:54.0025816Z (6/107): git-2.47.1-1.amzn2023.0.2.x86_64.rpm 1.9 MB/s | 54 kB 00:00 2025-05-07T19:42:54.0277183Z (7/107): gnutls-3.8.3-6.amzn2023.0.1.x86_64.rpm 45 MB/s | 1.1 MB 00:00 2025-05-07T19:42:54.0435501Z (8/107): git-core-doc-2.47.1-1.amzn2023.0.2.noa 70 MB/s | 2.8 MB 00:00 2025-05-07T19:42:54.0653008Z (9/107): git-core-2.47.1-1.amzn2023.0.2.x86_64. 70 MB/s | 4.7 MB 00:00 2025-05-07T19:42:54.0707913Z (10/107): groff-base-1.22.4-7.amzn2023.0.2.x86_ 26 MB/s | 1.0 MB 00:00 2025-05-07T19:42:54.0734083Z (11/107): gzip-1.12-1.amzn2023.0.1.x86_64.rpm 5.5 MB/s | 160 kB 00:00 2025-05-07T19:42:54.0773141Z (12/107): jansson-2.14-0.amzn2023.x86_64.rpm 7.3 MB/s | 46 kB 00:00 2025-05-07T19:42:54.0802426Z (13/107): kmod-libs-29-2.amzn2023.0.5.x86_64.rp 9.6 MB/s | 62 kB 00:00 2025-05-07T19:42:54.0889178Z (14/107): hwdata-0.384-1.amzn2023.0.3.noarch.rp 92 MB/s | 1.6 MB 00:00 2025-05-07T19:42:54.0916421Z (15/107): less-608-2.amzn2023.0.2.x86_64.rpm 13 MB/s | 168 kB 00:00 2025-05-07T19:42:54.0924651Z (16/107): libcbor-0.7.0-3.amzn2023.0.2.x86_64.r 4.9 MB/s | 57 kB 00:00 2025-05-07T19:42:54.1012445Z (17/107): libdb-5.3.28-49.amzn2023.0.2.x86_64.r 64 MB/s | 756 kB 00:00 2025-05-07T19:42:54.1028036Z (18/107): libeconf-0.4.0-1.amzn2023.0.3.x86_64. 2.8 MB/s | 28 kB 00:00 2025-05-07T19:42:54.1062873Z (19/107): libedit-3.1-38.20210714cvs.amzn2023.0 8.0 MB/s | 108 kB 00:00 2025-05-07T19:42:54.1092456Z (20/107): libfdisk-2.37.4-1.amzn2023.0.4.x86_64 19 MB/s | 153 kB 00:00 2025-05-07T19:42:54.1117468Z (21/107): libfido2-1.10.0-2.amzn2023.0.2.x86_64 11 MB/s | 95 kB 00:00 2025-05-07T19:42:54.1134226Z (22/107): libmetalink-0.1.3-14.amzn2023.0.2.x86 4.8 MB/s | 31 kB 00:00 2025-05-07T19:42:54.1157554Z (23/107): libpwquality-1.4.4-6.amzn2023.0.2.x86 17 MB/s | 106 kB 00:00 2025-05-07T19:42:54.1181694Z (24/107): libsemanage-3.4-5.amzn2023.0.2.x86_64 20 MB/s | 121 kB 00:00 2025-05-07T19:42:54.1202211Z (25/107): libutempter-1.2.1-4.amzn2023.0.2.x86_ 4.4 MB/s | 26 kB 00:00 2025-05-07T19:42:54.1268030Z (26/107): nano-8.3-1.amzn2023.x86_64.rpm 65 MB/s | 706 kB 00:00 2025-05-07T19:42:54.1291834Z (27/107): nano-default-editor-8.3-1.amzn2023.no 1.0 MB/s | 10 kB 00:00 2025-05-07T19:42:54.1331278Z (28/107): ncurses-6.2-4.20200222.amzn2023.0.6.x 32 MB/s | 394 kB 00:00 2025-05-07T19:42:54.1379590Z (29/107): nettle-3.10.1-1.amzn2023.0.1.x86_64.r 53 MB/s | 573 kB 00:00 2025-05-07T19:42:54.1428228Z (30/107): openldap-2.4.57-6.amzn2023.0.7.x86_64 31 MB/s | 256 kB 00:00 2025-05-07T19:42:54.1466295Z (31/107): openssh-8.7p1-8.amzn2023.0.14.x86_64. 37 MB/s | 454 kB 00:00 2025-05-07T19:42:54.1520343Z (32/107): openssh-clients-8.7p1-8.amzn2023.0.14 51 MB/s | 708 kB 00:00 2025-05-07T19:42:54.1566106Z (33/107): pam-1.5.1-8.amzn2023.0.4.x86_64.rpm 41 MB/s | 542 kB 00:00 2025-05-07T19:42:54.1584881Z (34/107): pciutils-3.7.0-3.amzn2023.0.2.x86_64. 8.5 MB/s | 93 kB 00:00 2025-05-07T19:42:54.1601496Z (35/107): pciutils-libs-3.7.0-3.amzn2023.0.2.x8 5.7 MB/s | 41 kB 00:00 2025-05-07T19:42:54.1627061Z (36/107): perl-AutoLoader-5.74-477.amzn2023.0.6 3.8 MB/s | 22 kB 00:00 2025-05-07T19:42:54.1648919Z (37/107): perl-Carp-1.50-458.amzn2023.0.2.noarc 6.5 MB/s | 29 kB 00:00 2025-05-07T19:42:54.1680473Z (38/107): perl-B-1.80-477.amzn2023.0.6.x86_64.r 19 MB/s | 179 kB 00:00 2025-05-07T19:42:54.1691998Z (39/107): perl-Class-Struct-0.66-477.amzn2023.0 3.4 MB/s | 22 kB 00:00 2025-05-07T19:42:54.1701502Z (40/107): perl-Data-Dumper-2.174-460.amzn2023.0 10 MB/s | 55 kB 00:00 2025-05-07T19:42:54.1723912Z (41/107): perl-Digest-1.20-1.amzn2023.0.2.noarc 6.3 MB/s | 26 kB 00:00 2025-05-07T19:42:54.1749249Z (42/107): perl-Digest-MD5-2.58-2.amzn2023.0.2.x 8.6 MB/s | 36 kB 00:00 2025-05-07T19:42:54.1765665Z (43/107): perl-DynaLoader-1.47-477.amzn2023.0.6 4.7 MB/s | 26 kB 00:00 2025-05-07T19:42:54.1900915Z (44/107): perl-Encode-3.15-462.amzn2023.0.2.x86 95 MB/s | 1.7 MB 00:00 2025-05-07T19:42:54.1917865Z (45/107): perl-Errno-1.30-477.amzn2023.0.6.x86_ 921 kB/s | 15 kB 00:00 2025-05-07T19:42:54.1926842Z (46/107): perl-Error-0.17029-5.amzn2023.0.2.noa 2.5 MB/s | 41 kB 00:00 2025-05-07T19:42:54.1961950Z (47/107): perl-Exporter-5.74-459.amzn2023.0.2.n 5.9 MB/s | 31 kB 00:00 2025-05-07T19:42:54.1997267Z (48/107): perl-Fcntl-1.13-477.amzn2023.0.6.x86_ 3.4 MB/s | 21 kB 00:00 2025-05-07T19:42:54.2014156Z (49/107): perl-File-Find-1.37-477.amzn2023.0.6. 5.1 MB/s | 26 kB 00:00 2025-05-07T19:42:54.2029352Z (50/107): perl-File-Basename-2.85-477.amzn2023. 1.9 MB/s | 18 kB 00:00 2025-05-07T19:42:54.2052297Z (51/107): perl-File-Path-2.18-2.amzn2023.0.2.no 6.6 MB/s | 36 kB 00:00 2025-05-07T19:42:54.2076295Z (52/107): perl-File-stat-1.09-477.amzn2023.0.6. 3.9 MB/s | 17 kB 00:00 2025-05-07T19:42:54.2091502Z (53/107): perl-File-Temp-0.231.100-2.amzn2023.0 8.2 MB/s | 60 kB 00:00 2025-05-07T19:42:54.2102223Z (54/107): perl-FileHandle-2.03-477.amzn2023.0.6 3.0 MB/s | 16 kB 00:00 2025-05-07T19:42:54.2133637Z (55/107): perl-Getopt-Long-2.52-2.amzn2023.0.2. 10 MB/s | 60 kB 00:00 2025-05-07T19:42:54.2148396Z (56/107): perl-Getopt-Std-1.12-477.amzn2023.0.6 2.9 MB/s | 16 kB 00:00 2025-05-07T19:42:54.2168722Z (57/107): perl-Git-2.47.1-1.amzn2023.0.2.noarch 6.8 MB/s | 42 kB 00:00 2025-05-07T19:42:54.2180686Z (58/107): perl-HTTP-Tiny-0.078-1.amzn2023.0.3.n 12 MB/s | 56 kB 00:00 2025-05-07T19:42:54.2211959Z (59/107): perl-IO-Socket-IP-0.41-3.amzn2023.0.2 10 MB/s | 42 kB 00:00 2025-05-07T19:42:54.2230403Z (60/107): perl-IO-1.43-477.amzn2023.0.6.x86_64. 11 MB/s | 87 kB 00:00 2025-05-07T19:42:54.2256050Z (61/107): perl-IO-Socket-SSL-2.075-1.amzn2023.0 29 MB/s | 218 kB 00:00 2025-05-07T19:42:54.2292447Z (62/107): perl-IPC-Open3-1.21-477.amzn2023.0.6. 3.0 MB/s | 23 kB 00:00 2025-05-07T19:42:54.2301609Z (63/107): perl-MIME-Base64-3.16-2.amzn2023.0.2. 4.4 MB/s | 31 kB 00:00 2025-05-07T19:42:54.2318392Z (64/107): perl-Mozilla-CA-20200520-4.amzn2023.0 2.2 MB/s | 13 kB 00:00 2025-05-07T19:42:54.2335639Z (65/107): perl-NDBM_File-1.15-477.amzn2023.0.6. 5.1 MB/s | 23 kB 00:00 2025-05-07T19:42:54.2381989Z (66/107): perl-Net-SSLeay-1.94-1.amzn2023.0.1.x 50 MB/s | 392 kB 00:00 2025-05-07T19:42:54.2404694Z (67/107): perl-POSIX-1.94-477.amzn2023.0.6.x86_ 11 MB/s | 97 kB 00:00 2025-05-07T19:42:54.2418938Z (68/107): perl-PathTools-3.78-459.amzn2023.0.2. 10 MB/s | 85 kB 00:00 2025-05-07T19:42:54.2432090Z (69/107): perl-Pod-Escapes-1.07-458.amzn2023.0. 4.3 MB/s | 20 kB 00:00 2025-05-07T19:42:54.2457659Z (70/107): perl-Pod-Perldoc-3.28.01-459.amzn2023 16 MB/s | 84 kB 00:00 2025-05-07T19:42:54.2491818Z (71/107): perl-Pod-Simple-3.42-2.amzn2023.0.2.n 32 MB/s | 215 kB 00:00 2025-05-07T19:42:54.2506511Z (72/107): perl-Pod-Usage-2.01-2.amzn2023.0.2.no 5.8 MB/s | 41 kB 00:00 2025-05-07T19:42:54.2522296Z (73/107): perl-Scalar-List-Utils-1.56-459.amzn2 12 MB/s | 71 kB 00:00 2025-05-07T19:42:54.2547599Z (74/107): perl-SelectSaver-1.02-477.amzn2023.0. 2.3 MB/s | 12 kB 00:00 2025-05-07T19:42:54.2573069Z (75/107): perl-Storable-3.21-458.amzn2023.0.2.x 19 MB/s | 96 kB 00:00 2025-05-07T19:42:54.2595326Z (76/107): perl-Socket-2.032-1.amzn2023.0.2.x86_ 6.5 MB/s | 55 kB 00:00 2025-05-07T19:42:54.2613445Z (77/107): perl-Symbol-1.08-477.amzn2023.0.6.noa 2.3 MB/s | 15 kB 00:00 2025-05-07T19:42:54.2630631Z (78/107): perl-Term-ANSIColor-5.01-459.amzn2023 9.0 MB/s | 48 kB 00:00 2025-05-07T19:42:54.2654384Z (79/107): perl-Term-Cap-1.17-458.amzn2023.0.2.n 4.1 MB/s | 22 kB 00:00 2025-05-07T19:42:54.2675465Z (80/107): perl-TermReadKey-2.38-9.amzn2023.0.2. 6.5 MB/s | 36 kB 00:00 2025-05-07T19:42:54.2693456Z (81/107): perl-Text-ParseWords-3.30-458.amzn202 2.8 MB/s | 17 kB 00:00 2025-05-07T19:42:54.2711769Z (82/107): perl-Text-Tabs+Wrap-2021.0726-1.amzn2 4.5 MB/s | 22 kB 00:00 2025-05-07T19:42:54.2763023Z (83/107): perl-Time-Local-1.300-5.amzn2023.0.2. 4.1 MB/s | 34 kB 00:00 2025-05-07T19:42:54.2777096Z (84/107): perl-URI-5.09-1.amzn2023.0.2.noarch.r 13 MB/s | 108 kB 00:00 2025-05-07T19:42:54.2795140Z (85/107): perl-base-2.27-477.amzn2023.0.6.noarc 2.1 MB/s | 17 kB 00:00 2025-05-07T19:42:54.2829731Z (86/107): perl-constant-1.33-459.amzn2023.0.2.n 5.0 MB/s | 23 kB 00:00 2025-05-07T19:42:54.2841210Z (87/107): perl-if-0.60.800-477.amzn2023.0.6.noa 2.4 MB/s | 14 kB 00:00 2025-05-07T19:42:54.2861348Z (88/107): perl-interpreter-5.32.1-477.amzn2023. 11 MB/s | 71 kB 00:00 2025-05-07T19:42:54.2883018Z (89/107): perl-lib-0.65-477.amzn2023.0.6.x86_64 3.1 MB/s | 15 kB 00:00 2025-05-07T19:42:54.2928185Z (90/107): perl-mro-1.23-477.amzn2023.0.6.x86_64 6.6 MB/s | 29 kB 00:00 2025-05-07T19:42:54.3065958Z (91/107): perl-libs-5.32.1-477.amzn2023.0.6.x86 104 MB/s | 2.0 MB 00:00 2025-05-07T19:42:54.3094951Z (92/107): perl-libnet-3.13-2.amzn2023.0.2.noarc 5.0 MB/s | 126 kB 00:00 2025-05-07T19:42:54.3110561Z (93/107): perl-overload-1.31-477.amzn2023.0.6.n 2.6 MB/s | 46 kB 00:00 2025-05-07T19:42:54.3129319Z (94/107): perl-overloading-0.02-477.amzn2023.0. 2.4 MB/s | 13 kB 00:00 2025-05-07T19:42:54.3172525Z (95/107): perl-parent-0.238-458.amzn2023.0.2.no 2.5 MB/s | 14 kB 00:00 2025-05-07T19:42:54.3194876Z (96/107): perl-podlators-4.14-458.amzn2023.0.2. 14 MB/s | 112 kB 00:00 2025-05-07T19:42:54.3214078Z (97/107): perl-subs-1.03-477.amzn2023.0.6.noarc 1.4 MB/s | 12 kB 00:00 2025-05-07T19:42:54.3247183Z (98/107): perl-vars-1.05-477.amzn2023.0.6.noarc 2.8 MB/s | 13 kB 00:00 2025-05-07T19:42:54.3324291Z (99/107): shadow-utils-4.9-12.amzn2023.0.4.x86_ 91 MB/s | 1.1 MB 00:00 2025-05-07T19:42:54.3407639Z (100/107): sudo-1.9.15-1.p5.amzn2023.0.1.x86_64 67 MB/s | 1.3 MB 00:00 2025-05-07T19:42:54.3414931Z (101/107): sudo-python-plugin-1.9.15-1.p5.amzn2 3.5 MB/s | 56 kB 00:00 2025-05-07T19:42:54.3470832Z (102/107): systemd-libs-252.23-3.amzn2023.x86_6 48 MB/s | 613 kB 00:00 2025-05-07T19:42:54.3625133Z (103/107): util-linux-2.37.4-1.amzn2023.0.4.x86 111 MB/s | 2.2 MB 00:00 2025-05-07T19:42:54.3684520Z (104/107): tar-1.34-1.amzn2023.0.4.x86_64.rpm 34 MB/s | 879 kB 00:00 2025-05-07T19:42:54.3719231Z (105/107): util-linux-core-2.37.4-1.amzn2023.0. 18 MB/s | 432 kB 00:00 2025-05-07T19:42:54.3776435Z (106/107): wget-1.21.3-1.amzn2023.0.4.x86_64.rp 54 MB/s | 779 kB 00:00 2025-05-07T19:42:54.3795557Z (107/107): which-2.21-26.amzn2023.0.2.x86_64.rp 6.6 MB/s | 42 kB 00:00 2025-05-07T19:42:54.3815874Z -------------------------------------------------------------------------------- 2025-05-07T19:42:54.3816797Z Total 70 MB/s | 38 MB 00:00 2025-05-07T19:42:55.4610046Z Running transaction check 2025-05-07T19:42:55.5076613Z Transaction check succeeded. 2025-05-07T19:42:55.5077472Z Running transaction test 2025-05-07T19:42:55.8780211Z Transaction test succeeded. 2025-05-07T19:42:55.8782002Z Running transaction 2025-05-07T19:42:56.9343514Z Preparing : 1/1 2025-05-07T19:42:56.9512999Z Installing : systemd-libs-252.23-3.amzn2023.x86_64 1/107 2025-05-07T19:42:56.9770760Z Installing : nettle-3.10.1-1.amzn2023.0.1.x86_64 2/107 2025-05-07T19:42:57.0000966Z Installing : gnutls-3.8.3-6.amzn2023.0.1.x86_64 3/107 2025-05-07T19:42:57.0079525Z Installing : util-linux-core-2.37.4-1.amzn2023.0.4.x86_64 4/107 2025-05-07T19:42:57.0148337Z Running scriptlet: util-linux-core-2.37.4-1.amzn2023.0.4.x86_64 4/107 2025-05-07T19:42:57.0252820Z Installing : pciutils-libs-3.7.0-3.amzn2023.0.2.x86_64 5/107 2025-05-07T19:42:57.0550171Z Installing : ncurses-6.2-4.20200222.amzn2023.0.6.x86_64 6/107 2025-05-07T19:42:57.0646256Z Installing : nano-8.3-1.amzn2023.x86_64 7/107 2025-05-07T19:42:57.0713918Z Installing : nano-default-editor-8.3-1.amzn2023.noarch 8/107 2025-05-07T19:42:57.1234068Z Installing : libsemanage-3.4-5.amzn2023.0.2.x86_64 9/107 2025-05-07T19:42:57.1332142Z Installing : shadow-utils-2:4.9-12.amzn2023.0.4.x86_64 10/107 2025-05-07T19:42:57.1782376Z Running scriptlet: libutempter-1.2.1-4.amzn2023.0.2.x86_64 11/107 2025-05-07T19:42:57.1846626Z Installing : libutempter-1.2.1-4.amzn2023.0.2.x86_64 11/107 2025-05-07T19:42:57.1917595Z Installing : libmetalink-0.1.3-14.amzn2023.0.2.x86_64 12/107 2025-05-07T19:42:57.1992833Z Installing : libfdisk-2.37.4-1.amzn2023.0.4.x86_64 13/107 2025-05-07T19:42:57.2050399Z Installing : libedit-3.1-38.20210714cvs.amzn2023.0.2.x86_64 14/107 2025-05-07T19:42:57.2199231Z Installing : libeconf-0.4.0-1.amzn2023.0.3.x86_64 15/107 2025-05-07T19:42:57.2258016Z Installing : libdb-5.3.28-49.amzn2023.0.2.x86_64 16/107 2025-05-07T19:42:57.2328888Z Installing : libcbor-0.7.0-3.amzn2023.0.2.x86_64 17/107 2025-05-07T19:42:57.2413488Z Installing : libfido2-1.10.0-2.amzn2023.0.2.x86_64 18/107 2025-05-07T19:42:57.2483804Z Installing : less-608-2.amzn2023.0.2.x86_64 19/107 2025-05-07T19:42:57.2541504Z Installing : kmod-libs-29-2.amzn2023.0.5.x86_64 20/107 2025-05-07T19:42:57.2979086Z Installing : jansson-2.14-0.amzn2023.x86_64 21/107 2025-05-07T19:42:57.3072325Z Installing : hwdata-0.384-1.amzn2023.0.3.noarch 22/107 2025-05-07T19:42:57.3230626Z Installing : gzip-1.12-1.amzn2023.0.1.x86_64 23/107 2025-05-07T19:42:57.3682611Z Installing : cracklib-2.9.6-27.amzn2023.0.2.x86_64 24/107 2025-05-07T19:42:57.3878979Z Installing : pam-1.5.1-8.amzn2023.0.4.x86_64 25/107 2025-05-07T19:42:57.4702880Z Installing : libpwquality-1.4.4-6.amzn2023.0.2.x86_64 26/107 2025-05-07T19:42:57.4705359Z Installing : util-linux-2.37.4-1.amzn2023.0.4.x86_64 27/107 2025-05-07T19:42:57.4705829Z warning: /etc/adjtime created as /etc/adjtime.rpmnew 2025-05-07T19:42:57.4706085Z 2025-05-07T19:42:57.4914483Z Running scriptlet: util-linux-2.37.4-1.amzn2023.0.4.x86_64 27/107 2025-05-07T19:42:57.5263891Z Running scriptlet: openssh-8.7p1-8.amzn2023.0.14.x86_64 28/107 2025-05-07T19:42:57.5469704Z Installing : openssh-8.7p1-8.amzn2023.0.14.x86_64 28/107 2025-05-07T19:42:57.5543317Z Installing : openssh-clients-8.7p1-8.amzn2023.0.14.x86_64 29/107 2025-05-07T19:42:57.6671282Z Running scriptlet: openssh-clients-8.7p1-8.amzn2023.0.14.x86_64 29/107 2025-05-07T19:42:57.8173095Z Installing : git-core-2.47.1-1.amzn2023.0.2.x86_64 30/107 2025-05-07T19:42:57.8315623Z Installing : git-core-doc-2.47.1-1.amzn2023.0.2.noarch 31/107 2025-05-07T19:42:57.8731312Z Running scriptlet: groff-base-1.22.4-7.amzn2023.0.2.x86_64 32/107 2025-05-07T19:42:57.8814690Z Installing : groff-base-1.22.4-7.amzn2023.0.2.x86_64 32/107 2025-05-07T19:42:57.8890983Z Running scriptlet: groff-base-1.22.4-7.amzn2023.0.2.x86_64 32/107 2025-05-07T19:42:57.8965615Z Installing : perl-Digest-1.20-1.amzn2023.0.2.noarch 33/107 2025-05-07T19:42:57.9058960Z Installing : perl-Digest-MD5-2.58-2.amzn2023.0.2.x86_64 34/107 2025-05-07T19:42:57.9116851Z Installing : perl-B-1.80-477.amzn2023.0.6.x86_64 35/107 2025-05-07T19:42:57.9164306Z Installing : perl-FileHandle-2.03-477.amzn2023.0.6.noarch 36/107 2025-05-07T19:42:57.9223203Z Installing : perl-AutoLoader-5.74-477.amzn2023.0.6.noarch 37/107 2025-05-07T19:42:57.9317933Z Installing : perl-Data-Dumper-2.174-460.amzn2023.0.2.x86_64 38/107 2025-05-07T19:42:57.9375955Z Installing : perl-libnet-3.13-2.amzn2023.0.2.noarch 39/107 2025-05-07T19:42:57.9481835Z Installing : perl-base-2.27-477.amzn2023.0.6.noarch 40/107 2025-05-07T19:42:57.9696347Z Installing : perl-URI-5.09-1.amzn2023.0.2.noarch 41/107 2025-05-07T19:42:57.9788222Z Installing : perl-Net-SSLeay-1.94-1.amzn2023.0.1.x86_64 42/107 2025-05-07T19:42:57.9837615Z Installing : perl-Text-Tabs+Wrap-2021.0726-1.amzn2023.0.1.noa 43/107 2025-05-07T19:42:57.9884578Z Installing : perl-Mozilla-CA-20200520-4.amzn2023.0.2.noarch 44/107 2025-05-07T19:42:57.9943255Z Installing : perl-if-0.60.800-477.amzn2023.0.6.noarch 45/107 2025-05-07T19:42:58.0011660Z Installing : perl-IO-Socket-IP-0.41-3.amzn2023.0.2.noarch 46/107 2025-05-07T19:42:58.0071207Z Installing : perl-Time-Local-2:1.300-5.amzn2023.0.2.noarch 47/107 2025-05-07T19:42:58.0157800Z Installing : perl-File-Path-2.18-2.amzn2023.0.2.noarch 48/107 2025-05-07T19:42:58.0218736Z Installing : perl-IO-Socket-SSL-2.075-1.amzn2023.0.2.noarch 49/107 2025-05-07T19:42:58.0268467Z Installing : perl-Pod-Escapes-1:1.07-458.amzn2023.0.2.noarch 50/107 2025-05-07T19:42:58.0327666Z Installing : perl-Class-Struct-0.66-477.amzn2023.0.6.noarch 51/107 2025-05-07T19:42:58.0389185Z Installing : perl-POSIX-1.94-477.amzn2023.0.6.x86_64 52/107 2025-05-07T19:42:58.0441056Z Installing : perl-Term-ANSIColor-5.01-459.amzn2023.0.2.noarch 53/107 2025-05-07T19:42:58.0483662Z Installing : perl-IPC-Open3-1.21-477.amzn2023.0.6.noarch 54/107 2025-05-07T19:42:58.0549443Z Installing : perl-subs-1.03-477.amzn2023.0.6.noarch 55/107 2025-05-07T19:42:58.0611903Z Installing : perl-File-Temp-1:0.231.100-2.amzn2023.0.2.noarch 56/107 2025-05-07T19:42:58.0668140Z Installing : perl-HTTP-Tiny-0.078-1.amzn2023.0.3.noarch 57/107 2025-05-07T19:42:58.0777450Z Installing : perl-Term-Cap-1.17-458.amzn2023.0.2.noarch 58/107 2025-05-07T19:42:58.0869379Z Installing : perl-Pod-Simple-1:3.42-2.amzn2023.0.2.noarch 59/107 2025-05-07T19:42:58.0925893Z Installing : perl-Socket-4:2.032-1.amzn2023.0.2.x86_64 60/107 2025-05-07T19:42:58.0973680Z Installing : perl-SelectSaver-1.02-477.amzn2023.0.6.noarch 61/107 2025-05-07T19:42:58.1015983Z Installing : perl-Symbol-1.08-477.amzn2023.0.6.noarch 62/107 2025-05-07T19:42:58.1093510Z Installing : perl-File-stat-1.09-477.amzn2023.0.6.noarch 63/107 2025-05-07T19:42:58.1194826Z Installing : perl-podlators-1:4.14-458.amzn2023.0.2.noarch 64/107 2025-05-07T19:42:58.1267396Z Installing : perl-Pod-Perldoc-3.28.01-459.amzn2023.0.3.noarch 65/107 2025-05-07T19:42:58.1327086Z Installing : perl-Fcntl-1.13-477.amzn2023.0.6.x86_64 66/107 2025-05-07T19:42:58.1390445Z Installing : perl-Text-ParseWords-3.30-458.amzn2023.0.2.noarc 67/107 2025-05-07T19:42:58.1461429Z Installing : perl-mro-1.23-477.amzn2023.0.6.x86_64 68/107 2025-05-07T19:42:58.1526268Z Installing : perl-IO-1.43-477.amzn2023.0.6.x86_64 69/107 2025-05-07T19:42:58.1582861Z Installing : perl-overloading-0.02-477.amzn2023.0.6.noarch 70/107 2025-05-07T19:42:58.1653695Z Installing : perl-Pod-Usage-4:2.01-2.amzn2023.0.2.noarch 71/107 2025-05-07T19:42:58.1706036Z Installing : perl-Errno-1.30-477.amzn2023.0.6.x86_64 72/107 2025-05-07T19:42:58.1753831Z Installing : perl-File-Basename-2.85-477.amzn2023.0.6.noarch 73/107 2025-05-07T19:42:58.1812990Z Installing : perl-Getopt-Std-1.12-477.amzn2023.0.6.noarch 74/107 2025-05-07T19:42:58.1892865Z Installing : perl-MIME-Base64-3.16-2.amzn2023.0.2.x86_64 75/107 2025-05-07T19:42:58.1970360Z Installing : perl-Scalar-List-Utils-4:1.56-459.amzn2023.0.2.x 76/107 2025-05-07T19:42:58.2037233Z Installing : perl-constant-1.33-459.amzn2023.0.2.noarch 77/107 2025-05-07T19:42:58.2098606Z Installing : perl-Storable-1:3.21-458.amzn2023.0.2.x86_64 78/107 2025-05-07T19:42:58.2155377Z Installing : perl-overload-1.31-477.amzn2023.0.6.noarch 79/107 2025-05-07T19:42:58.2203867Z Installing : perl-parent-1:0.238-458.amzn2023.0.2.noarch 80/107 2025-05-07T19:42:58.2263976Z Installing : perl-vars-1.05-477.amzn2023.0.6.noarch 81/107 2025-05-07T19:42:58.2318663Z Installing : perl-Getopt-Long-1:2.52-2.amzn2023.0.2.noarch 82/107 2025-05-07T19:42:58.2370453Z Installing : perl-DynaLoader-1.47-477.amzn2023.0.6.x86_64 83/107 2025-05-07T19:42:58.2432003Z Installing : perl-Carp-1.50-458.amzn2023.0.2.noarch 84/107 2025-05-07T19:42:58.2488142Z Installing : perl-Exporter-5.74-459.amzn2023.0.2.noarch 85/107 2025-05-07T19:42:58.2568608Z Installing : perl-NDBM_File-1.15-477.amzn2023.0.6.x86_64 86/107 2025-05-07T19:42:58.3104517Z Installing : perl-PathTools-3.78-459.amzn2023.0.2.x86_64 87/107 2025-05-07T19:42:58.4073145Z Installing : perl-Encode-4:3.15-462.amzn2023.0.2.x86_64 88/107 2025-05-07T19:42:58.4205906Z Installing : perl-libs-4:5.32.1-477.amzn2023.0.6.x86_64 89/107 2025-05-07T19:42:58.4281798Z Installing : perl-interpreter-4:5.32.1-477.amzn2023.0.6.x86_6 90/107 2025-05-07T19:42:58.4351172Z Installing : perl-Error-1:0.17029-5.amzn2023.0.2.noarch 91/107 2025-05-07T19:42:58.4413638Z Installing : perl-File-Find-1.37-477.amzn2023.0.6.noarch 92/107 2025-05-07T19:42:58.4490817Z Installing : perl-TermReadKey-2.38-9.amzn2023.0.2.x86_64 93/107 2025-05-07T19:42:58.4538635Z Installing : perl-lib-0.65-477.amzn2023.0.6.x86_64 94/107 2025-05-07T19:42:58.4605232Z Installing : perl-Git-2.47.1-1.amzn2023.0.2.noarch 95/107 2025-05-07T19:42:58.4676881Z Installing : git-2.47.1-1.amzn2023.0.2.x86_64 96/107 2025-05-07T19:42:58.4874379Z Installing : elfutils-debuginfod-client-0.188-3.amzn2023.0.2. 97/107 2025-05-07T19:42:58.5002511Z Installing : cyrus-sasl-lib-2.1.27-18.amzn2023.0.3.x86_64 98/107 2025-05-07T19:42:58.5082458Z Installing : openldap-2.4.57-6.amzn2023.0.7.x86_64 99/107 2025-05-07T19:42:58.5487717Z Installing : sudo-python-plugin-1.9.15-1.p5.amzn2023.0.1.x86_ 100/107 2025-05-07T19:42:58.6711951Z Installing : sudo-1.9.15-1.p5.amzn2023.0.1.x86_64 101/107 2025-05-07T19:42:58.6801159Z Installing : binutils-2.41-50.amzn2023.0.3.x86_64 102/107 2025-05-07T19:42:58.6912410Z Running scriptlet: binutils-2.41-50.amzn2023.0.3.x86_64 102/107 2025-05-07T19:42:58.7214343Z Installing : pciutils-3.7.0-3.amzn2023.0.2.x86_64 103/107 2025-05-07T19:42:58.7310808Z Installing : wget-1.21.3-1.amzn2023.0.4.x86_64 104/107 2025-05-07T19:42:58.7555595Z Installing : which-2.21-26.amzn2023.0.2.x86_64 105/107 2025-05-07T19:42:58.7769152Z Installing : tar-2:1.34-1.amzn2023.0.4.x86_64 106/107 2025-05-07T19:42:58.7853689Z Installing : findutils-1:4.8.0-2.amzn2023.0.2.x86_64 107/107 2025-05-07T19:42:58.7969371Z Running scriptlet: pam-1.5.1-8.amzn2023.0.4.x86_64 107/107 2025-05-07T19:42:59.5655930Z Running scriptlet: findutils-1:4.8.0-2.amzn2023.0.2.x86_64 107/107 2025-05-07T19:42:59.5659318Z Verifying : binutils-2.41-50.amzn2023.0.3.x86_64 1/107 2025-05-07T19:42:59.5662737Z Verifying : cracklib-2.9.6-27.amzn2023.0.2.x86_64 2/107 2025-05-07T19:42:59.5665130Z Verifying : cyrus-sasl-lib-2.1.27-18.amzn2023.0.3.x86_64 3/107 2025-05-07T19:42:59.5666383Z Verifying : elfutils-debuginfod-client-0.188-3.amzn2023.0.2. 4/107 2025-05-07T19:42:59.5667562Z Verifying : findutils-1:4.8.0-2.amzn2023.0.2.x86_64 5/107 2025-05-07T19:42:59.5668732Z Verifying : git-2.47.1-1.amzn2023.0.2.x86_64 6/107 2025-05-07T19:42:59.5669825Z Verifying : git-core-2.47.1-1.amzn2023.0.2.x86_64 7/107 2025-05-07T19:42:59.5670933Z Verifying : git-core-doc-2.47.1-1.amzn2023.0.2.noarch 8/107 2025-05-07T19:42:59.5672590Z Verifying : gnutls-3.8.3-6.amzn2023.0.1.x86_64 9/107 2025-05-07T19:42:59.5673674Z Verifying : groff-base-1.22.4-7.amzn2023.0.2.x86_64 10/107 2025-05-07T19:42:59.5674866Z Verifying : gzip-1.12-1.amzn2023.0.1.x86_64 11/107 2025-05-07T19:42:59.5676039Z Verifying : hwdata-0.384-1.amzn2023.0.3.noarch 12/107 2025-05-07T19:42:59.5677137Z Verifying : jansson-2.14-0.amzn2023.x86_64 13/107 2025-05-07T19:42:59.5678323Z Verifying : kmod-libs-29-2.amzn2023.0.5.x86_64 14/107 2025-05-07T19:42:59.5679373Z Verifying : less-608-2.amzn2023.0.2.x86_64 15/107 2025-05-07T19:42:59.5680492Z Verifying : libcbor-0.7.0-3.amzn2023.0.2.x86_64 16/107 2025-05-07T19:42:59.5681666Z Verifying : libdb-5.3.28-49.amzn2023.0.2.x86_64 17/107 2025-05-07T19:42:59.5682808Z Verifying : libeconf-0.4.0-1.amzn2023.0.3.x86_64 18/107 2025-05-07T19:42:59.5684012Z Verifying : libedit-3.1-38.20210714cvs.amzn2023.0.2.x86_64 19/107 2025-05-07T19:42:59.5685233Z Verifying : libfdisk-2.37.4-1.amzn2023.0.4.x86_64 20/107 2025-05-07T19:42:59.5686317Z Verifying : libfido2-1.10.0-2.amzn2023.0.2.x86_64 21/107 2025-05-07T19:42:59.5687491Z Verifying : libmetalink-0.1.3-14.amzn2023.0.2.x86_64 22/107 2025-05-07T19:42:59.5688653Z Verifying : libpwquality-1.4.4-6.amzn2023.0.2.x86_64 23/107 2025-05-07T19:42:59.5689941Z Verifying : libsemanage-3.4-5.amzn2023.0.2.x86_64 24/107 2025-05-07T19:42:59.5691247Z Verifying : libutempter-1.2.1-4.amzn2023.0.2.x86_64 25/107 2025-05-07T19:42:59.5692494Z Verifying : nano-8.3-1.amzn2023.x86_64 26/107 2025-05-07T19:42:59.5693674Z Verifying : nano-default-editor-8.3-1.amzn2023.noarch 27/107 2025-05-07T19:42:59.5694413Z Verifying : ncurses-6.2-4.20200222.amzn2023.0.6.x86_64 28/107 2025-05-07T19:42:59.5695239Z Verifying : nettle-3.10.1-1.amzn2023.0.1.x86_64 29/107 2025-05-07T19:42:59.5695827Z Verifying : openldap-2.4.57-6.amzn2023.0.7.x86_64 30/107 2025-05-07T19:42:59.5696471Z Verifying : openssh-8.7p1-8.amzn2023.0.14.x86_64 31/107 2025-05-07T19:42:59.5697192Z Verifying : openssh-clients-8.7p1-8.amzn2023.0.14.x86_64 32/107 2025-05-07T19:42:59.5697787Z Verifying : pam-1.5.1-8.amzn2023.0.4.x86_64 33/107 2025-05-07T19:42:59.5698417Z Verifying : pciutils-3.7.0-3.amzn2023.0.2.x86_64 34/107 2025-05-07T19:42:59.5699027Z Verifying : pciutils-libs-3.7.0-3.amzn2023.0.2.x86_64 35/107 2025-05-07T19:42:59.5699747Z Verifying : perl-AutoLoader-5.74-477.amzn2023.0.6.noarch 36/107 2025-05-07T19:42:59.5700629Z Verifying : perl-B-1.80-477.amzn2023.0.6.x86_64 37/107 2025-05-07T19:42:59.5701167Z Verifying : perl-Carp-1.50-458.amzn2023.0.2.noarch 38/107 2025-05-07T19:42:59.5701715Z Verifying : perl-Class-Struct-0.66-477.amzn2023.0.6.noarch 39/107 2025-05-07T19:42:59.5702289Z Verifying : perl-Data-Dumper-2.174-460.amzn2023.0.2.x86_64 40/107 2025-05-07T19:42:59.5702833Z Verifying : perl-Digest-1.20-1.amzn2023.0.2.noarch 41/107 2025-05-07T19:42:59.5703391Z Verifying : perl-Digest-MD5-2.58-2.amzn2023.0.2.x86_64 42/107 2025-05-07T19:42:59.5703958Z Verifying : perl-DynaLoader-1.47-477.amzn2023.0.6.x86_64 43/107 2025-05-07T19:42:59.5704488Z Verifying : perl-Encode-4:3.15-462.amzn2023.0.2.x86_64 44/107 2025-05-07T19:42:59.5705032Z Verifying : perl-Errno-1.30-477.amzn2023.0.6.x86_64 45/107 2025-05-07T19:42:59.5705638Z Verifying : perl-Error-1:0.17029-5.amzn2023.0.2.noarch 46/107 2025-05-07T19:42:59.5706199Z Verifying : perl-Exporter-5.74-459.amzn2023.0.2.noarch 47/107 2025-05-07T19:42:59.5706734Z Verifying : perl-Fcntl-1.13-477.amzn2023.0.6.x86_64 48/107 2025-05-07T19:42:59.5707305Z Verifying : perl-File-Basename-2.85-477.amzn2023.0.6.noarch 49/107 2025-05-07T19:42:59.5707885Z Verifying : perl-File-Find-1.37-477.amzn2023.0.6.noarch 50/107 2025-05-07T19:42:59.5708431Z Verifying : perl-File-Path-2.18-2.amzn2023.0.2.noarch 51/107 2025-05-07T19:42:59.5708989Z Verifying : perl-File-Temp-1:0.231.100-2.amzn2023.0.2.noarch 52/107 2025-05-07T19:42:59.5709526Z Verifying : perl-File-stat-1.09-477.amzn2023.0.6.noarch 53/107 2025-05-07T19:42:59.5710107Z Verifying : perl-FileHandle-2.03-477.amzn2023.0.6.noarch 54/107 2025-05-07T19:42:59.5710674Z Verifying : perl-Getopt-Long-1:2.52-2.amzn2023.0.2.noarch 55/107 2025-05-07T19:42:59.5711223Z Verifying : perl-Getopt-Std-1.12-477.amzn2023.0.6.noarch 56/107 2025-05-07T19:42:59.5711781Z Verifying : perl-Git-2.47.1-1.amzn2023.0.2.noarch 57/107 2025-05-07T19:42:59.5712318Z Verifying : perl-HTTP-Tiny-0.078-1.amzn2023.0.3.noarch 58/107 2025-05-07T19:42:59.5712856Z Verifying : perl-IO-1.43-477.amzn2023.0.6.x86_64 59/107 2025-05-07T19:42:59.5713380Z Verifying : perl-IO-Socket-IP-0.41-3.amzn2023.0.2.noarch 60/107 2025-05-07T19:42:59.5713946Z Verifying : perl-IO-Socket-SSL-2.075-1.amzn2023.0.2.noarch 61/107 2025-05-07T19:42:59.5714511Z Verifying : perl-IPC-Open3-1.21-477.amzn2023.0.6.noarch 62/107 2025-05-07T19:42:59.5715049Z Verifying : perl-MIME-Base64-3.16-2.amzn2023.0.2.x86_64 63/107 2025-05-07T19:42:59.5715607Z Verifying : perl-Mozilla-CA-20200520-4.amzn2023.0.2.noarch 64/107 2025-05-07T19:42:59.5716143Z Verifying : perl-NDBM_File-1.15-477.amzn2023.0.6.x86_64 65/107 2025-05-07T19:42:59.5716683Z Verifying : perl-Net-SSLeay-1.94-1.amzn2023.0.1.x86_64 66/107 2025-05-07T19:42:59.5717205Z Verifying : perl-POSIX-1.94-477.amzn2023.0.6.x86_64 67/107 2025-05-07T19:42:59.5717749Z Verifying : perl-PathTools-3.78-459.amzn2023.0.2.x86_64 68/107 2025-05-07T19:42:59.5718306Z Verifying : perl-Pod-Escapes-1:1.07-458.amzn2023.0.2.noarch 69/107 2025-05-07T19:42:59.5718844Z Verifying : perl-Pod-Perldoc-3.28.01-459.amzn2023.0.3.noarch 70/107 2025-05-07T19:42:59.5719403Z Verifying : perl-Pod-Simple-1:3.42-2.amzn2023.0.2.noarch 71/107 2025-05-07T19:42:59.5719923Z Verifying : perl-Pod-Usage-4:2.01-2.amzn2023.0.2.noarch 72/107 2025-05-07T19:42:59.5720468Z Verifying : perl-Scalar-List-Utils-4:1.56-459.amzn2023.0.2.x 73/107 2025-05-07T19:42:59.5721115Z Verifying : perl-SelectSaver-1.02-477.amzn2023.0.6.noarch 74/107 2025-05-07T19:42:59.5721655Z Verifying : perl-Socket-4:2.032-1.amzn2023.0.2.x86_64 75/107 2025-05-07T19:42:59.5722193Z Verifying : perl-Storable-1:3.21-458.amzn2023.0.2.x86_64 76/107 2025-05-07T19:42:59.5722726Z Verifying : perl-Symbol-1.08-477.amzn2023.0.6.noarch 77/107 2025-05-07T19:42:59.5723298Z Verifying : perl-Term-ANSIColor-5.01-459.amzn2023.0.2.noarch 78/107 2025-05-07T19:42:59.5723858Z Verifying : perl-Term-Cap-1.17-458.amzn2023.0.2.noarch 79/107 2025-05-07T19:42:59.5724449Z Verifying : perl-TermReadKey-2.38-9.amzn2023.0.2.x86_64 80/107 2025-05-07T19:42:59.5725033Z Verifying : perl-Text-ParseWords-3.30-458.amzn2023.0.2.noarc 81/107 2025-05-07T19:42:59.5725594Z Verifying : perl-Text-Tabs+Wrap-2021.0726-1.amzn2023.0.1.noa 82/107 2025-05-07T19:42:59.5726152Z Verifying : perl-Time-Local-2:1.300-5.amzn2023.0.2.noarch 83/107 2025-05-07T19:42:59.5726743Z Verifying : perl-URI-5.09-1.amzn2023.0.2.noarch 84/107 2025-05-07T19:42:59.5727282Z Verifying : perl-base-2.27-477.amzn2023.0.6.noarch 85/107 2025-05-07T19:42:59.5727839Z Verifying : perl-constant-1.33-459.amzn2023.0.2.noarch 86/107 2025-05-07T19:42:59.5728367Z Verifying : perl-if-0.60.800-477.amzn2023.0.6.noarch 87/107 2025-05-07T19:42:59.5729198Z Verifying : perl-interpreter-4:5.32.1-477.amzn2023.0.6.x86_6 88/107 2025-05-07T19:42:59.5729727Z Verifying : perl-lib-0.65-477.amzn2023.0.6.x86_64 89/107 2025-05-07T19:42:59.5730356Z Verifying : perl-libnet-3.13-2.amzn2023.0.2.noarch 90/107 2025-05-07T19:42:59.5730904Z Verifying : perl-libs-4:5.32.1-477.amzn2023.0.6.x86_64 91/107 2025-05-07T19:42:59.5731408Z Verifying : perl-mro-1.23-477.amzn2023.0.6.x86_64 92/107 2025-05-07T19:42:59.5731972Z Verifying : perl-overload-1.31-477.amzn2023.0.6.noarch 93/107 2025-05-07T19:42:59.5732528Z Verifying : perl-overloading-0.02-477.amzn2023.0.6.noarch 94/107 2025-05-07T19:42:59.5733090Z Verifying : perl-parent-1:0.238-458.amzn2023.0.2.noarch 95/107 2025-05-07T19:42:59.5733619Z Verifying : perl-podlators-1:4.14-458.amzn2023.0.2.noarch 96/107 2025-05-07T19:42:59.5734168Z Verifying : perl-subs-1.03-477.amzn2023.0.6.noarch 97/107 2025-05-07T19:42:59.5734709Z Verifying : perl-vars-1.05-477.amzn2023.0.6.noarch 98/107 2025-05-07T19:42:59.5735223Z Verifying : shadow-utils-2:4.9-12.amzn2023.0.4.x86_64 99/107 2025-05-07T19:42:59.5735742Z Verifying : sudo-1.9.15-1.p5.amzn2023.0.1.x86_64 100/107 2025-05-07T19:42:59.5736272Z Verifying : sudo-python-plugin-1.9.15-1.p5.amzn2023.0.1.x86_ 101/107 2025-05-07T19:42:59.5736845Z Verifying : systemd-libs-252.23-3.amzn2023.x86_64 102/107 2025-05-07T19:42:59.5737350Z Verifying : tar-2:1.34-1.amzn2023.0.4.x86_64 103/107 2025-05-07T19:42:59.5737863Z Verifying : util-linux-2.37.4-1.amzn2023.0.4.x86_64 104/107 2025-05-07T19:42:59.5738413Z Verifying : util-linux-core-2.37.4-1.amzn2023.0.4.x86_64 105/107 2025-05-07T19:42:59.5738926Z Verifying : wget-1.21.3-1.amzn2023.0.4.x86_64 106/107 2025-05-07T19:42:59.6659826Z Verifying : which-2.21-26.amzn2023.0.2.x86_64 107/107 2025-05-07T19:42:59.6661696Z 2025-05-07T19:42:59.6662156Z Installed: 2025-05-07T19:42:59.6663525Z binutils-2.41-50.amzn2023.0.3.x86_64 2025-05-07T19:42:59.6664396Z cracklib-2.9.6-27.amzn2023.0.2.x86_64 2025-05-07T19:42:59.6664923Z cyrus-sasl-lib-2.1.27-18.amzn2023.0.3.x86_64 2025-05-07T19:42:59.6665803Z elfutils-debuginfod-client-0.188-3.amzn2023.0.2.x86_64 2025-05-07T19:42:59.6666377Z findutils-1:4.8.0-2.amzn2023.0.2.x86_64 2025-05-07T19:42:59.6666862Z git-2.47.1-1.amzn2023.0.2.x86_64 2025-05-07T19:42:59.6667357Z git-core-2.47.1-1.amzn2023.0.2.x86_64 2025-05-07T19:42:59.6667980Z git-core-doc-2.47.1-1.amzn2023.0.2.noarch 2025-05-07T19:42:59.6668476Z gnutls-3.8.3-6.amzn2023.0.1.x86_64 2025-05-07T19:42:59.6668971Z groff-base-1.22.4-7.amzn2023.0.2.x86_64 2025-05-07T19:42:59.6669437Z gzip-1.12-1.amzn2023.0.1.x86_64 2025-05-07T19:42:59.6669918Z hwdata-0.384-1.amzn2023.0.3.noarch 2025-05-07T19:42:59.6670514Z jansson-2.14-0.amzn2023.x86_64 2025-05-07T19:42:59.6671013Z kmod-libs-29-2.amzn2023.0.5.x86_64 2025-05-07T19:42:59.6671472Z less-608-2.amzn2023.0.2.x86_64 2025-05-07T19:42:59.6671951Z libcbor-0.7.0-3.amzn2023.0.2.x86_64 2025-05-07T19:42:59.6672440Z libdb-5.3.28-49.amzn2023.0.2.x86_64 2025-05-07T19:42:59.6672909Z libeconf-0.4.0-1.amzn2023.0.3.x86_64 2025-05-07T19:42:59.6673431Z libedit-3.1-38.20210714cvs.amzn2023.0.2.x86_64 2025-05-07T19:42:59.6673928Z libfdisk-2.37.4-1.amzn2023.0.4.x86_64 2025-05-07T19:42:59.6674558Z libfido2-1.10.0-2.amzn2023.0.2.x86_64 2025-05-07T19:42:59.6675097Z libmetalink-0.1.3-14.amzn2023.0.2.x86_64 2025-05-07T19:42:59.6675653Z libpwquality-1.4.4-6.amzn2023.0.2.x86_64 2025-05-07T19:42:59.6676177Z libsemanage-3.4-5.amzn2023.0.2.x86_64 2025-05-07T19:42:59.6676727Z libutempter-1.2.1-4.amzn2023.0.2.x86_64 2025-05-07T19:42:59.6677210Z nano-8.3-1.amzn2023.x86_64 2025-05-07T19:42:59.6677751Z nano-default-editor-8.3-1.amzn2023.noarch 2025-05-07T19:42:59.6678291Z ncurses-6.2-4.20200222.amzn2023.0.6.x86_64 2025-05-07T19:42:59.6678812Z nettle-3.10.1-1.amzn2023.0.1.x86_64 2025-05-07T19:42:59.6679327Z openldap-2.4.57-6.amzn2023.0.7.x86_64 2025-05-07T19:42:59.6679827Z openssh-8.7p1-8.amzn2023.0.14.x86_64 2025-05-07T19:42:59.6680387Z openssh-clients-8.7p1-8.amzn2023.0.14.x86_64 2025-05-07T19:42:59.6680894Z pam-1.5.1-8.amzn2023.0.4.x86_64 2025-05-07T19:42:59.6681391Z pciutils-3.7.0-3.amzn2023.0.2.x86_64 2025-05-07T19:42:59.6681908Z pciutils-libs-3.7.0-3.amzn2023.0.2.x86_64 2025-05-07T19:42:59.6682440Z perl-AutoLoader-5.74-477.amzn2023.0.6.noarch 2025-05-07T19:42:59.6682969Z perl-B-1.80-477.amzn2023.0.6.x86_64 2025-05-07T19:42:59.6683460Z perl-Carp-1.50-458.amzn2023.0.2.noarch 2025-05-07T19:42:59.6684009Z perl-Class-Struct-0.66-477.amzn2023.0.6.noarch 2025-05-07T19:42:59.6684554Z perl-Data-Dumper-2.174-460.amzn2023.0.2.x86_64 2025-05-07T19:42:59.6685181Z perl-Digest-1.20-1.amzn2023.0.2.noarch 2025-05-07T19:42:59.6685715Z perl-Digest-MD5-2.58-2.amzn2023.0.2.x86_64 2025-05-07T19:42:59.6686238Z perl-DynaLoader-1.47-477.amzn2023.0.6.x86_64 2025-05-07T19:42:59.6686766Z perl-Encode-4:3.15-462.amzn2023.0.2.x86_64 2025-05-07T19:42:59.6687265Z perl-Errno-1.30-477.amzn2023.0.6.x86_64 2025-05-07T19:42:59.6687784Z perl-Error-1:0.17029-5.amzn2023.0.2.noarch 2025-05-07T19:42:59.6688314Z perl-Exporter-5.74-459.amzn2023.0.2.noarch 2025-05-07T19:42:59.6688822Z perl-Fcntl-1.13-477.amzn2023.0.6.x86_64 2025-05-07T19:42:59.6689384Z perl-File-Basename-2.85-477.amzn2023.0.6.noarch 2025-05-07T19:42:59.6691105Z perl-File-Find-1.37-477.amzn2023.0.6.noarch 2025-05-07T19:42:59.6691731Z perl-File-Path-2.18-2.amzn2023.0.2.noarch 2025-05-07T19:42:59.6692296Z perl-File-Temp-1:0.231.100-2.amzn2023.0.2.noarch 2025-05-07T19:42:59.6692890Z perl-File-stat-1.09-477.amzn2023.0.6.noarch 2025-05-07T19:42:59.6693502Z perl-FileHandle-2.03-477.amzn2023.0.6.noarch 2025-05-07T19:42:59.6694078Z perl-Getopt-Long-1:2.52-2.amzn2023.0.2.noarch 2025-05-07T19:42:59.6694670Z perl-Getopt-Std-1.12-477.amzn2023.0.6.noarch 2025-05-07T19:42:59.6695229Z perl-Git-2.47.1-1.amzn2023.0.2.noarch 2025-05-07T19:42:59.6695811Z perl-HTTP-Tiny-0.078-1.amzn2023.0.3.noarch 2025-05-07T19:42:59.6696356Z perl-IO-1.43-477.amzn2023.0.6.x86_64 2025-05-07T19:42:59.6697054Z perl-IO-Socket-IP-0.41-3.amzn2023.0.2.noarch 2025-05-07T19:42:59.6697623Z perl-IO-Socket-SSL-2.075-1.amzn2023.0.2.noarch 2025-05-07T19:42:59.6698166Z perl-IPC-Open3-1.21-477.amzn2023.0.6.noarch 2025-05-07T19:42:59.6698720Z perl-MIME-Base64-3.16-2.amzn2023.0.2.x86_64 2025-05-07T19:42:59.6699270Z perl-Mozilla-CA-20200520-4.amzn2023.0.2.noarch 2025-05-07T19:42:59.6699804Z perl-NDBM_File-1.15-477.amzn2023.0.6.x86_64 2025-05-07T19:42:59.6700323Z perl-Net-SSLeay-1.94-1.amzn2023.0.1.x86_64 2025-05-07T19:42:59.6700838Z perl-POSIX-1.94-477.amzn2023.0.6.x86_64 2025-05-07T19:42:59.6701375Z perl-PathTools-3.78-459.amzn2023.0.2.x86_64 2025-05-07T19:42:59.6701904Z perl-Pod-Escapes-1:1.07-458.amzn2023.0.2.noarch 2025-05-07T19:42:59.6702443Z perl-Pod-Perldoc-3.28.01-459.amzn2023.0.3.noarch 2025-05-07T19:42:59.6702959Z perl-Pod-Simple-1:3.42-2.amzn2023.0.2.noarch 2025-05-07T19:42:59.6703475Z perl-Pod-Usage-4:2.01-2.amzn2023.0.2.noarch 2025-05-07T19:42:59.6704000Z perl-Scalar-List-Utils-4:1.56-459.amzn2023.0.2.x86_64 2025-05-07T19:42:59.6704536Z perl-SelectSaver-1.02-477.amzn2023.0.6.noarch 2025-05-07T19:42:59.6705062Z perl-Socket-4:2.032-1.amzn2023.0.2.x86_64 2025-05-07T19:42:59.6705555Z perl-Storable-1:3.21-458.amzn2023.0.2.x86_64 2025-05-07T19:42:59.6706073Z perl-Symbol-1.08-477.amzn2023.0.6.noarch 2025-05-07T19:42:59.6706864Z perl-Term-ANSIColor-5.01-459.amzn2023.0.2.noarch 2025-05-07T19:42:59.6707403Z perl-Term-Cap-1.17-458.amzn2023.0.2.noarch 2025-05-07T19:42:59.6707976Z perl-TermReadKey-2.38-9.amzn2023.0.2.x86_64 2025-05-07T19:42:59.6708541Z perl-Text-ParseWords-3.30-458.amzn2023.0.2.noarch 2025-05-07T19:42:59.6709143Z perl-Text-Tabs+Wrap-2021.0726-1.amzn2023.0.1.noarch 2025-05-07T19:42:59.6709692Z perl-Time-Local-2:1.300-5.amzn2023.0.2.noarch 2025-05-07T19:42:59.6710228Z perl-URI-5.09-1.amzn2023.0.2.noarch 2025-05-07T19:42:59.6710760Z perl-base-2.27-477.amzn2023.0.6.noarch 2025-05-07T19:42:59.6711283Z perl-constant-1.33-459.amzn2023.0.2.noarch 2025-05-07T19:42:59.6711825Z perl-if-0.60.800-477.amzn2023.0.6.noarch 2025-05-07T19:42:59.6712489Z perl-interpreter-4:5.32.1-477.amzn2023.0.6.x86_64 2025-05-07T19:42:59.6713031Z perl-lib-0.65-477.amzn2023.0.6.x86_64 2025-05-07T19:42:59.6713545Z perl-libnet-3.13-2.amzn2023.0.2.noarch 2025-05-07T19:42:59.6714083Z perl-libs-4:5.32.1-477.amzn2023.0.6.x86_64 2025-05-07T19:42:59.6714607Z perl-mro-1.23-477.amzn2023.0.6.x86_64 2025-05-07T19:42:59.6715128Z perl-overload-1.31-477.amzn2023.0.6.noarch 2025-05-07T19:42:59.6715707Z perl-overloading-0.02-477.amzn2023.0.6.noarch 2025-05-07T19:42:59.6716245Z perl-parent-1:0.238-458.amzn2023.0.2.noarch 2025-05-07T19:42:59.6716792Z perl-podlators-1:4.14-458.amzn2023.0.2.noarch 2025-05-07T19:42:59.6717344Z perl-subs-1.03-477.amzn2023.0.6.noarch 2025-05-07T19:42:59.6717859Z perl-vars-1.05-477.amzn2023.0.6.noarch 2025-05-07T19:42:59.6718391Z shadow-utils-2:4.9-12.amzn2023.0.4.x86_64 2025-05-07T19:42:59.6718882Z sudo-1.9.15-1.p5.amzn2023.0.1.x86_64 2025-05-07T19:42:59.6719413Z sudo-python-plugin-1.9.15-1.p5.amzn2023.0.1.x86_64 2025-05-07T19:42:59.6719946Z systemd-libs-252.23-3.amzn2023.x86_64 2025-05-07T19:42:59.6720446Z tar-2:1.34-1.amzn2023.0.4.x86_64 2025-05-07T19:42:59.6720944Z util-linux-2.37.4-1.amzn2023.0.4.x86_64 2025-05-07T19:42:59.6721456Z util-linux-core-2.37.4-1.amzn2023.0.4.x86_64 2025-05-07T19:42:59.6722033Z wget-1.21.3-1.amzn2023.0.4.x86_64 2025-05-07T19:42:59.6722497Z which-2.21-26.amzn2023.0.2.x86_64 2025-05-07T19:42:59.6722818Z 2025-05-07T19:42:59.6722907Z Complete! 2025-05-07T19:42:59.7406053Z ##[group]Run actions/checkout@v4 2025-05-07T19:42:59.7406383Z with: 2025-05-07T19:42:59.7406629Z submodules: true 2025-05-07T19:42:59.7406877Z repository: pytorch/FBGEMM 2025-05-07T19:42:59.7407370Z token: *** 2025-05-07T19:42:59.7407589Z ssh-strict: true 2025-05-07T19:42:59.7407843Z ssh-user: git 2025-05-07T19:42:59.7408083Z persist-credentials: true 2025-05-07T19:42:59.7408375Z clean: true 2025-05-07T19:42:59.7408643Z sparse-checkout-cone-mode: true 2025-05-07T19:42:59.7408930Z fetch-depth: 1 2025-05-07T19:42:59.7409185Z fetch-tags: false 2025-05-07T19:42:59.7409414Z show-progress: true 2025-05-07T19:42:59.7409672Z lfs: false 2025-05-07T19:42:59.7409895Z set-safe-directory: true 2025-05-07T19:42:59.7410513Z env: 2025-05-07T19:42:59.7410920Z PRELUDE: .github/scripts/setup_env.bash 2025-05-07T19:42:59.7411282Z BUILD_ENV: build_binary 2025-05-07T19:42:59.7411588Z BUILD_TARGET: default 2025-05-07T19:42:59.7411873Z BUILD_VARIANT: cuda 2025-05-07T19:42:59.7412223Z BUILD_CUDA_VERSION: 12.6.3 2025-05-07T19:42:59.7412539Z ##[endgroup] 2025-05-07T19:42:59.7456839Z ##[command]/usr/bin/docker exec 180e7cabfdf5006d0fba1b706a3ca06949f33b672d353cfee7af27b6030ff987 sh -c "cat /etc/*release | grep ^ID" 2025-05-07T19:43:00.0371124Z Syncing repository: pytorch/FBGEMM 2025-05-07T19:43:00.0372607Z ##[group]Getting Git version info 2025-05-07T19:43:00.0372943Z Working directory is '/__w/FBGEMM/FBGEMM' 2025-05-07T19:43:00.0373499Z [command]/usr/bin/git version 2025-05-07T19:43:00.0373790Z git version 2.47.1 2025-05-07T19:43:00.0374760Z ##[endgroup] 2025-05-07T19:43:00.0378577Z Temporarily overriding HOME='/__w/_temp/b65d0979-1b5f-4c63-a794-71b716826b7d' before making global git config changes 2025-05-07T19:43:00.0379407Z Adding repository directory to the temporary git global config as a safe directory 2025-05-07T19:43:00.0380062Z [command]/usr/bin/git config --global --add safe.directory /__w/FBGEMM/FBGEMM 2025-05-07T19:43:00.0410460Z [command]/usr/bin/git config --local --get remote.origin.url 2025-05-07T19:43:00.0429171Z https://github.com/pytorch/FBGEMM 2025-05-07T19:43:00.0440420Z ##[group]Removing previously created refs, to avoid conflicts 2025-05-07T19:43:00.0443947Z [command]/usr/bin/git rev-parse --symbolic-full-name --verify --quiet HEAD 2025-05-07T19:43:00.0460763Z HEAD 2025-05-07T19:43:00.0497055Z ##[endgroup] 2025-05-07T19:43:00.0497774Z [command]/usr/bin/git submodule status 2025-05-07T19:43:00.0881741Z e5d7c0bd5d9aec44d68830187138149e6a8c4e32 external/asmjit (e5d7c0b) 2025-05-07T19:43:00.0952993Z 4a61bdd4bd4ed730e078aebc7c0fcf046ff29406 external/composable_kernel (remotes/origin/FBGEMM) 2025-05-07T19:43:00.1062643Z 6543fec09b2f04ac4a666882998b534afc9c1349 external/cpuinfo (6543fec) 2025-05-07T19:43:00.1124227Z 3ed8d2ec4ba35ef5d9d8353826209b6f868f63d3 external/cutlass (remotes/origin/FBGEMM) 2025-05-07T19:43:00.1348826Z f8d7d77c06936315286eb55f8de22cd23c188571 external/googletest (release-1.8.0-3335-gf8d7d77c) 2025-05-07T19:43:00.1432820Z 420084499c7c1e1c2d801922f40df202eac5f3a0 external/hipify_torch (remotes/origin/mmelesse-9-g4200844) 2025-05-07T19:43:00.1472073Z 9cca280a4d0ccf0c08f47a99aa71d1b0e52f8d03 external/json (v3.11.2-84-g9cca280a) 2025-05-07T19:43:00.1489899Z ##[group]Cleaning the repository 2025-05-07T19:43:00.1493312Z [command]/usr/bin/git clean -ffdx 2025-05-07T19:43:00.4511550Z Removing build_only/ 2025-05-07T19:43:00.4511938Z Removing collect_env.py 2025-05-07T19:43:00.4512230Z Removing fbgemm_gpu/_skbuild/ 2025-05-07T19:43:00.4512740Z Removing fbgemm_gpu/codegen/genscript/__pycache__/ 2025-05-07T19:43:00.4513093Z Removing fbgemm_gpu/dist/ 2025-05-07T19:43:00.4513394Z Removing fbgemm_gpu/fbgemm_gpu/docs/version.py 2025-05-07T19:43:00.4513766Z Removing fbgemm_gpu/fbgemm_gpu_nightly.egg-info/ 2025-05-07T19:43:00.4519960Z [command]/usr/bin/git reset --hard HEAD 2025-05-07T19:43:00.5573578Z HEAD is now at 1c9ad64 Merge f6528e7b1e8f5602e7dba30cd73b48ae6630981c into fd4df5f456e0cca514bacd98a39efb72990fd9f4 2025-05-07T19:43:00.5576103Z ##[endgroup] 2025-05-07T19:43:00.5577825Z ##[group]Disabling automatic garbage collection 2025-05-07T19:43:00.5583087Z [command]/usr/bin/git config --local gc.auto 0 2025-05-07T19:43:00.5613038Z ##[endgroup] 2025-05-07T19:43:00.5613590Z ##[group]Setting up auth 2025-05-07T19:43:00.5614045Z [command]/usr/bin/git config --local --name-only --get-regexp core\.sshCommand 2025-05-07T19:43:00.5638513Z [command]/usr/bin/git submodule foreach --recursive sh -c "git config --local --name-only --get-regexp 'core\.sshCommand' && git config --local --unset-all 'core.sshCommand' || :" 2025-05-07T19:43:00.5915543Z Entering 'external/asmjit' 2025-05-07T19:43:00.5964010Z Entering 'external/composable_kernel' 2025-05-07T19:43:00.6034822Z Entering 'external/cpuinfo' 2025-05-07T19:43:00.6086786Z Entering 'external/cutlass' 2025-05-07T19:43:00.6162915Z Entering 'external/googletest' 2025-05-07T19:43:00.6211614Z Entering 'external/hipify_torch' 2025-05-07T19:43:00.6257025Z Entering 'external/json' 2025-05-07T19:43:00.6334325Z [command]/usr/bin/git config --local --name-only --get-regexp http\.https\:\/\/github\.com\/\.extraheader 2025-05-07T19:43:00.6362169Z [command]/usr/bin/git submodule foreach --recursive sh -c "git config --local --name-only --get-regexp 'http\.https\:\/\/github\.com\/\.extraheader' && git config --local --unset-all 'http.https://github.com/.extraheader' || :" 2025-05-07T19:43:00.6634513Z Entering 'external/asmjit' 2025-05-07T19:43:00.6687155Z Entering 'external/composable_kernel' 2025-05-07T19:43:00.6739675Z Entering 'external/cpuinfo' 2025-05-07T19:43:00.6789669Z Entering 'external/cutlass' 2025-05-07T19:43:00.6860070Z Entering 'external/googletest' 2025-05-07T19:43:00.6922579Z Entering 'external/hipify_torch' 2025-05-07T19:43:00.6979141Z Entering 'external/json' 2025-05-07T19:43:00.7064561Z [command]/usr/bin/git config --local http.https://github.com/.extraheader AUTHORIZATION: basic *** 2025-05-07T19:43:00.7124498Z ##[endgroup] 2025-05-07T19:43:00.7124928Z ##[group]Fetching the repository 2025-05-07T19:43:00.7126009Z [command]/usr/bin/git -c protocol.version=2 fetch --no-tags --prune --no-recurse-submodules --depth=1 origin +a2f4c52051596e74bc8c16e3d2867a4ecdd271e0:refs/remotes/pull/4066/merge 2025-05-07T19:43:00.9037293Z From https://github.com/pytorch/FBGEMM 2025-05-07T19:43:00.9037939Z + 1c9ad64...a2f4c52 a2f4c52051596e74bc8c16e3d2867a4ecdd271e0 -> pull/4066/merge (forced update) 2025-05-07T19:43:00.9057436Z ##[endgroup] 2025-05-07T19:43:00.9058568Z ##[group]Determining the checkout info 2025-05-07T19:43:00.9059843Z ##[endgroup] 2025-05-07T19:43:00.9060978Z [command]/usr/bin/git sparse-checkout disable 2025-05-07T19:43:00.9559862Z [command]/usr/bin/git config --local --unset-all extensions.worktreeConfig 2025-05-07T19:43:00.9592488Z ##[group]Checking out the ref 2025-05-07T19:43:00.9592995Z [command]/usr/bin/git checkout --progress --force refs/remotes/pull/4066/merge 2025-05-07T19:43:01.0561001Z Warning: you are leaving 1 commit behind, not connected to 2025-05-07T19:43:01.0562087Z any of your branches: 2025-05-07T19:43:01.0562673Z 2025-05-07T19:43:01.0563794Z 1c9ad64 Merge f6528e7b1e8f5602e7dba30cd73b48ae6630981c into fd4df5f456e0cca514bacd98a39efb72990fd9f4 2025-05-07T19:43:01.0565184Z 2025-05-07T19:43:01.0565802Z If you want to keep it by creating a new branch, this may be a good time 2025-05-07T19:43:01.0566980Z to do so with: 2025-05-07T19:43:01.0567135Z 2025-05-07T19:43:01.0567258Z git branch 1c9ad64 2025-05-07T19:43:01.0567464Z 2025-05-07T19:43:01.0567867Z HEAD is now at a2f4c52 Merge 6060cd4b5f971680caecdcc657faccb5720d1c3e into fd4df5f456e0cca514bacd98a39efb72990fd9f4 2025-05-07T19:43:01.0569162Z ##[endgroup] 2025-05-07T19:43:01.0569633Z ##[group]Setting up auth for fetching submodules 2025-05-07T19:43:01.0570360Z [command]/usr/bin/git config --global http.https://github.com/.extraheader AUTHORIZATION: basic *** 2025-05-07T19:43:01.0613800Z [command]/usr/bin/git config --global --unset-all url.https://github.com/.insteadOf 2025-05-07T19:43:01.0636496Z [command]/usr/bin/git config --global --add url.https://github.com/.insteadOf git@github.com: 2025-05-07T19:43:01.0660315Z [command]/usr/bin/git config --global --add url.https://github.com/.insteadOf org-21003710@github.com: 2025-05-07T19:43:01.0684585Z ##[endgroup] 2025-05-07T19:43:01.0685677Z ##[group]Fetching submodules 2025-05-07T19:43:01.0686519Z [command]/usr/bin/git submodule sync 2025-05-07T19:43:01.0983059Z Synchronizing submodule url for 'external/asmjit' 2025-05-07T19:43:01.0983573Z Synchronizing submodule url for 'external/composable_kernel' 2025-05-07T19:43:01.0984048Z Synchronizing submodule url for 'external/cpuinfo' 2025-05-07T19:43:01.0984436Z Synchronizing submodule url for 'external/cutlass' 2025-05-07T19:43:01.0984855Z Synchronizing submodule url for 'external/googletest' 2025-05-07T19:43:01.0985554Z Synchronizing submodule url for 'external/hipify_torch' 2025-05-07T19:43:01.0985974Z Synchronizing submodule url for 'external/json' 2025-05-07T19:43:01.0991473Z [command]/usr/bin/git -c protocol.version=2 submodule update --init --force --depth=1 2025-05-07T19:43:01.1763004Z Submodule path 'external/asmjit': checked out 'e5d7c0bd5d9aec44d68830187138149e6a8c4e32' 2025-05-07T19:43:01.4558620Z Submodule path 'external/composable_kernel': checked out '4a61bdd4bd4ed730e078aebc7c0fcf046ff29406' 2025-05-07T19:43:01.5607094Z Submodule path 'external/cpuinfo': checked out '6543fec09b2f04ac4a666882998b534afc9c1349' 2025-05-07T19:43:02.2374063Z Submodule path 'external/cutlass': checked out '3ed8d2ec4ba35ef5d9d8353826209b6f868f63d3' 2025-05-07T19:43:02.2813064Z Submodule path 'external/googletest': checked out 'f8d7d77c06936315286eb55f8de22cd23c188571' 2025-05-07T19:43:02.2903694Z Submodule path 'external/hipify_torch': checked out '420084499c7c1e1c2d801922f40df202eac5f3a0' 2025-05-07T19:43:02.4058089Z Submodule path 'external/json': checked out '9cca280a4d0ccf0c08f47a99aa71d1b0e52f8d03' 2025-05-07T19:43:02.4067322Z [command]/usr/bin/git submodule foreach git config --local gc.auto 0 2025-05-07T19:43:02.4350015Z Entering 'external/asmjit' 2025-05-07T19:43:02.4372838Z Entering 'external/composable_kernel' 2025-05-07T19:43:02.4408761Z Entering 'external/cpuinfo' 2025-05-07T19:43:02.4443313Z Entering 'external/cutlass' 2025-05-07T19:43:02.4473332Z Entering 'external/googletest' 2025-05-07T19:43:02.4506092Z Entering 'external/hipify_torch' 2025-05-07T19:43:02.4530814Z Entering 'external/json' 2025-05-07T19:43:02.4570849Z ##[endgroup] 2025-05-07T19:43:02.4571352Z ##[group]Persisting credentials for submodules 2025-05-07T19:43:02.4572678Z [command]/usr/bin/git submodule foreach --recursive sh -c "git config --local --name-only --get-regexp 'url\.https\:\/\/github\.com\/\.insteadOf' && git config --local --unset-all 'url.https://github.com/.insteadOf' || :" 2025-05-07T19:43:02.4893275Z Entering 'external/asmjit' 2025-05-07T19:43:02.4930973Z url.https://github.com/.insteadof 2025-05-07T19:43:02.4931397Z url.https://github.com/.insteadof 2025-05-07T19:43:02.4970831Z Entering 'external/composable_kernel' 2025-05-07T19:43:02.5011974Z url.https://github.com/.insteadof 2025-05-07T19:43:02.5012341Z url.https://github.com/.insteadof 2025-05-07T19:43:02.5055913Z Entering 'external/cpuinfo' 2025-05-07T19:43:02.5092024Z url.https://github.com/.insteadof 2025-05-07T19:43:02.5092419Z url.https://github.com/.insteadof 2025-05-07T19:43:02.5129526Z Entering 'external/cutlass' 2025-05-07T19:43:02.5169763Z url.https://github.com/.insteadof 2025-05-07T19:43:02.5170369Z url.https://github.com/.insteadof 2025-05-07T19:43:02.5214691Z Entering 'external/googletest' 2025-05-07T19:43:02.5254223Z url.https://github.com/.insteadof 2025-05-07T19:43:02.5254659Z url.https://github.com/.insteadof 2025-05-07T19:43:02.5291958Z Entering 'external/hipify_torch' 2025-05-07T19:43:02.5331387Z url.https://github.com/.insteadof 2025-05-07T19:43:02.5331856Z url.https://github.com/.insteadof 2025-05-07T19:43:02.5372176Z Entering 'external/json' 2025-05-07T19:43:02.5413869Z url.https://github.com/.insteadof 2025-05-07T19:43:02.5414272Z url.https://github.com/.insteadof 2025-05-07T19:43:02.5474308Z [command]/usr/bin/git submodule foreach sh -c "git config --local 'http.https://github.com/.extraheader' 'AUTHORIZATION: basic ***' && git config --local --show-origin --name-only --get-regexp remote.origin.url" 2025-05-07T19:43:02.5780121Z Entering 'external/asmjit' 2025-05-07T19:43:02.5828370Z file:/__w/FBGEMM/FBGEMM/.git/modules/external/asmjit/config remote.origin.url 2025-05-07T19:43:02.5830311Z Entering 'external/composable_kernel' 2025-05-07T19:43:02.5884579Z file:/__w/FBGEMM/FBGEMM/.git/modules/external/composable_kernel/config remote.origin.url 2025-05-07T19:43:02.5888267Z Entering 'external/cpuinfo' 2025-05-07T19:43:02.5936516Z file:/__w/FBGEMM/FBGEMM/.git/modules/external/cpuinfo/config remote.origin.url 2025-05-07T19:43:02.5938388Z Entering 'external/cutlass' 2025-05-07T19:43:02.5984172Z file:/__w/FBGEMM/FBGEMM/.git/modules/external/cutlass/config remote.origin.url 2025-05-07T19:43:02.5985640Z Entering 'external/googletest' 2025-05-07T19:43:02.6033140Z file:/__w/FBGEMM/FBGEMM/.git/modules/external/googletest/config remote.origin.url 2025-05-07T19:43:02.6034406Z Entering 'external/hipify_torch' 2025-05-07T19:43:02.6081004Z file:/__w/FBGEMM/FBGEMM/.git/modules/external/hipify_torch/config remote.origin.url 2025-05-07T19:43:02.6081539Z Entering 'external/json' 2025-05-07T19:43:02.6134182Z file:/__w/FBGEMM/FBGEMM/.git/modules/external/json/config remote.origin.url 2025-05-07T19:43:02.6214983Z [command]/usr/bin/git submodule foreach git config --local --add 'url.https://github.com/.insteadOf' 'git@github.com:' 2025-05-07T19:43:02.6509575Z Entering 'external/asmjit' 2025-05-07T19:43:02.6531161Z Entering 'external/composable_kernel' 2025-05-07T19:43:02.6557000Z Entering 'external/cpuinfo' 2025-05-07T19:43:02.6589376Z Entering 'external/cutlass' 2025-05-07T19:43:02.6621891Z Entering 'external/googletest' 2025-05-07T19:43:02.6652965Z Entering 'external/hipify_torch' 2025-05-07T19:43:02.6682613Z Entering 'external/json' 2025-05-07T19:43:02.6723657Z [command]/usr/bin/git submodule foreach git config --local --add 'url.https://github.com/.insteadOf' 'org-21003710@github.com:' 2025-05-07T19:43:02.7002221Z Entering 'external/asmjit' 2025-05-07T19:43:02.7025615Z Entering 'external/composable_kernel' 2025-05-07T19:43:02.7063450Z Entering 'external/cpuinfo' 2025-05-07T19:43:02.7092318Z Entering 'external/cutlass' 2025-05-07T19:43:02.7123109Z Entering 'external/googletest' 2025-05-07T19:43:02.7154044Z Entering 'external/hipify_torch' 2025-05-07T19:43:02.7181450Z Entering 'external/json' 2025-05-07T19:43:02.7216693Z ##[endgroup] 2025-05-07T19:43:02.7247358Z [command]/usr/bin/git log -1 --format=%H 2025-05-07T19:43:02.7267767Z a2f4c52051596e74bc8c16e3d2867a4ecdd271e0 2025-05-07T19:43:02.7411196Z ##[group]Run . $PRELUDE; print_system_info 2025-05-07T19:43:02.7411689Z . $PRELUDE; print_system_info 2025-05-07T19:43:02.7412259Z shell: bash --noprofile --norc -e -o pipefail {0} 2025-05-07T19:43:02.7412629Z env: 2025-05-07T19:43:02.7412888Z PRELUDE: .github/scripts/setup_env.bash 2025-05-07T19:43:02.7413207Z BUILD_ENV: build_binary 2025-05-07T19:43:02.7413491Z BUILD_TARGET: default 2025-05-07T19:43:02.7413743Z BUILD_VARIANT: cuda 2025-05-07T19:43:02.7414026Z BUILD_CUDA_VERSION: 12.6.3 2025-05-07T19:43:02.7414296Z ##[endgroup] 2025-05-07T19:43:03.1753606Z ################################################################################ 2025-05-07T19:43:03.1754667Z # Print System Info 2025-05-07T19:43:03.1755319Z # 2025-05-07T19:43:03.1769019Z # [2025-05-07T19:43:03.176Z] + print_system_info 2025-05-07T19:43:03.1770401Z ################################################################################ 2025-05-07T19:43:03.1770658Z 2025-05-07T19:43:03.1770898Z ################################################################################ 2025-05-07T19:43:03.1771284Z [INFO] Printing environment variables ... 2025-05-07T19:43:03.1771632Z + printenv 2025-05-07T19:43:03.1771760Z 2025-05-07T19:43:03.1780101Z GITHUB_WORKSPACE=/__w/FBGEMM/FBGEMM 2025-05-07T19:43:03.1781128Z BUILD_VARIANT=cuda 2025-05-07T19:43:03.1781802Z HOSTNAME=180e7cabfdf5 2025-05-07T19:43:03.1783022Z GITHUB_PATH=/__w/_temp/_runner_file_commands/add_path_3cae3672-99ad-4e75-aa32-b295111e0eb2 2025-05-07T19:43:03.1784437Z GITHUB_ACTION=__run_2 2025-05-07T19:43:03.1785130Z GITHUB_RUN_NUMBER=10601 2025-05-07T19:43:03.1785838Z RUNNER_NAME=i-07daac3f3a185b77c 2025-05-07T19:43:03.1786676Z GITHUB_REPOSITORY_OWNER_ID=21003710 2025-05-07T19:43:03.1787430Z PLATFORM_NAME_LC=linux-x86_64 2025-05-07T19:43:03.1787722Z MACHINE_NAME_LC=x86_64 2025-05-07T19:43:03.1787962Z GITHUB_TRIGGERING_ACTOR=q10 2025-05-07T19:43:03.1788258Z PRELUDE=.github/scripts/setup_env.bash 2025-05-07T19:43:03.1788554Z GITHUB_REF_TYPE=branch 2025-05-07T19:43:03.1789091Z *** 2025-05-07T19:43:03.1789322Z GITHUB_REPOSITORY_ID=150154628 2025-05-07T19:43:03.1789849Z GITHUB_ACTIONS=true 2025-05-07T19:43:03.1790143Z GITHUB_SHA=a2f4c52051596e74bc8c16e3d2867a4ecdd271e0 2025-05-07T19:43:03.1790688Z GITHUB_WORKFLOW_REF=pytorch/FBGEMM/.github/workflows/fbgemm_gpu_ci_cuda.yml@refs/pull/4066/merge 2025-05-07T19:43:03.1791233Z RUNNER_ENVIRONMENT=self-hosted 2025-05-07T19:43:03.1791512Z GITHUB_REF=refs/pull/4066/merge 2025-05-07T19:43:03.1791806Z RUNNER_OS=Linux 2025-05-07T19:43:03.1792045Z GITHUB_REF_PROTECTED=false 2025-05-07T19:43:03.1792329Z HOME=/github/home 2025-05-07T19:43:03.1792583Z GITHUB_API_URL=https://api.github.com 2025-05-07T19:43:03.1792904Z RUNNER_ARCH=X64 2025-05-07T19:43:03.1793142Z RUNNER_TEMP=/__w/_temp 2025-05-07T19:43:03.1793377Z BUILD_TARGET=default 2025-05-07T19:43:03.1793810Z GITHUB_STATE=/__w/_temp/_runner_file_commands/save_state_3cae3672-99ad-4e75-aa32-b295111e0eb2 2025-05-07T19:43:03.1794430Z GITHUB_ENV=/__w/_temp/_runner_file_commands/set_env_3cae3672-99ad-4e75-aa32-b295111e0eb2 2025-05-07T19:43:03.1794964Z GITHUB_EVENT_PATH=/github/workflow/event.json 2025-05-07T19:43:03.1795465Z GITHUB_EVENT_NAME=pull_request 2025-05-07T19:43:03.1795871Z GITHUB_RUN_ID=14891846252 2025-05-07T19:43:03.1796352Z GITHUB_STEP_SUMMARY=/__w/_temp/_runner_file_commands/step_summary_3cae3672-99ad-4e75-aa32-b295111e0eb2 2025-05-07T19:43:03.1796896Z BUILD_ENV=build_binary 2025-05-07T19:43:03.1797341Z GITHUB_ACTOR=q10 2025-05-07T19:43:03.1797577Z GITHUB_RUN_ATTEMPT=1 2025-05-07T19:43:03.1797843Z KERN_NAME_LC=linux 2025-05-07T19:43:03.1798082Z BUILD_CUDA_VERSION=12.6.3 2025-05-07T19:43:03.1798427Z GITHUB_GRAPHQL_URL=https://api.github.com/graphql 2025-05-07T19:43:03.1798794Z PLATFORM_NAME=Linux-x86_64 2025-05-07T19:43:03.1799109Z GITHUB_SERVER_URL=https://github.com 2025-05-07T19:43:03.1799402Z SHLVL=1 2025-05-07T19:43:03.1799636Z GITHUB_ACTOR_ID=255046 2025-05-07T19:43:03.1799892Z RUNNER_TOOL_CACHE=/__w/_tool 2025-05-07T19:43:03.1800443Z GITHUB_WORKFLOW_SHA=6060cd4b5f971680caecdcc657faccb5720d1c3e 2025-05-07T19:43:03.1800879Z GITHUB_REF_NAME=4066/merge 2025-05-07T19:43:03.1801152Z KERN_NAME=Linux 2025-05-07T19:43:03.1801420Z GITHUB_JOB=build_artifact 2025-05-07T19:43:03.1801704Z GITHUB_REPOSITORY=pytorch/FBGEMM 2025-05-07T19:43:03.1802034Z GITHUB_RETENTION_DAYS=90 2025-05-07T19:43:03.1802300Z RUNNER_WORKSPACE=/__w/FBGEMM 2025-05-07T19:43:03.1802606Z GITHUB_ACTION_REPOSITORY= 2025-05-07T19:43:03.1802975Z PATH=/usr/local/sbin:/usr/local/bin:/usr/sbin:/usr/bin:/sbin:/bin 2025-05-07T19:43:03.1803399Z GITHUB_BASE_REF=main 2025-05-07T19:43:03.1803629Z CI=true 2025-05-07T19:43:03.1803876Z GITHUB_REPOSITORY_OWNER=pytorch 2025-05-07T19:43:03.1804202Z GITHUB_HEAD_REF=bm/genai-rocm-oss-6 2025-05-07T19:43:03.1804497Z GITHUB_ACTION_REF= 2025-05-07T19:43:03.1804777Z GITHUB_WORKFLOW=FBGEMM GPU/GenAI CUDA CI 2025-05-07T19:43:03.1805287Z GITHUB_OUTPUT=/__w/_temp/_runner_file_commands/set_output_3cae3672-99ad-4e75-aa32-b295111e0eb2 2025-05-07T19:43:03.1805806Z MACHINE_NAME=x86_64 2025-05-07T19:43:03.1806041Z _=/usr/bin/printenv 2025-05-07T19:43:03.1806209Z 2025-05-07T19:43:03.1806336Z ################################################################################ 2025-05-07T19:43:03.1806674Z [INFO] Print ldd version ... 2025-05-07T19:43:03.1806967Z + ldd --version 2025-05-07T19:43:03.1807105Z 2025-05-07T19:43:03.1807240Z ldd (GNU libc) 2.34 2025-05-07T19:43:03.1807528Z Copyright (C) 2021 Free Software Foundation, Inc. 2025-05-07T19:43:03.1808025Z This is free software; see the source for copying conditions. There is NO 2025-05-07T19:43:03.1808600Z warranty; not even for MERCHANTABILITY or FITNESS FOR A PARTICULAR PURPOSE. 2025-05-07T19:43:03.1809115Z Written by Roland McGrath and Ulrich Drepper. 2025-05-07T19:43:03.1809358Z 2025-05-07T19:43:03.1809478Z ################################################################################ 2025-05-07T19:43:03.1809845Z [INFO] Print CPU info ... 2025-05-07T19:43:03.1810263Z + nproc 2025-05-07T19:43:03.1810398Z 2025-05-07T19:43:03.1810495Z 96 2025-05-07T19:43:03.1810638Z 2025-05-07T19:43:03.1810729Z + lscpu 2025-05-07T19:43:03.1810967Z 2025-05-07T19:43:03.2072758Z Architecture: x86_64 2025-05-07T19:43:03.2073274Z CPU op-mode(s): 32-bit, 64-bit 2025-05-07T19:43:03.2073720Z Address sizes: 46 bits physical, 48 bits virtual 2025-05-07T19:43:03.2074170Z Byte Order: Little Endian 2025-05-07T19:43:03.2074518Z CPU(s): 96 2025-05-07T19:43:03.2074972Z On-line CPU(s) list: 0-95 2025-05-07T19:43:03.2075308Z Vendor ID: GenuineIntel 2025-05-07T19:43:03.2075741Z Model name: Intel(R) Xeon(R) Platinum 8275CL CPU @ 3.00GHz 2025-05-07T19:43:03.2076144Z CPU family: 6 2025-05-07T19:43:03.2076470Z Model: 85 2025-05-07T19:43:03.2076807Z Thread(s) per core: 2 2025-05-07T19:43:03.2077130Z Core(s) per socket: 24 2025-05-07T19:43:03.2077458Z Socket(s): 2 2025-05-07T19:43:03.2077765Z Stepping: 7 2025-05-07T19:43:03.2078099Z BogoMIPS: 5999.98 2025-05-07T19:43:03.2080497Z Flags: fpu vme de pse tsc msr pae mce cx8 apic sep mtrr pge mca cmov pat pse36 clflush mmx fxsr sse sse2 ss ht syscall nx pdpe1gb rdtscp lm constant_tsc arch_perfmon rep_good nopl xtopology nonstop_tsc cpuid aperfmperf tsc_known_freq pni pclmulqdq monitor ssse3 fma cx16 pcid sse4_1 sse4_2 x2apic movbe popcnt tsc_deadline_timer aes xsave avx f16c rdrand hypervisor lahf_lm abm 3dnowprefetch invpcid_single pti fsgsbase tsc_adjust bmi1 avx2 smep bmi2 erms invpcid mpx avx512f avx512dq rdseed adx smap clflushopt clwb avx512cd avx512bw avx512vl xsaveopt xsavec xgetbv1 xsaves ida arat pku ospke avx512_vnni 2025-05-07T19:43:03.2083101Z Hypervisor vendor: KVM 2025-05-07T19:43:03.2083706Z Virtualization type: full 2025-05-07T19:43:03.2084125Z L1d cache: 1.5 MiB (48 instances) 2025-05-07T19:43:03.2084543Z L1i cache: 1.5 MiB (48 instances) 2025-05-07T19:43:03.2084962Z L2 cache: 48 MiB (48 instances) 2025-05-07T19:43:03.2085348Z L3 cache: 71.5 MiB (2 instances) 2025-05-07T19:43:03.2085761Z NUMA node(s): 2 2025-05-07T19:43:03.2086115Z NUMA node0 CPU(s): 0-23,48-71 2025-05-07T19:43:03.2086466Z NUMA node1 CPU(s): 24-47,72-95 2025-05-07T19:43:03.2086981Z Vulnerability Gather data sampling: Unknown: Dependent on hypervisor status 2025-05-07T19:43:03.2087581Z Vulnerability Itlb multihit: KVM: Mitigation: VMX unsupported 2025-05-07T19:43:03.2088136Z Vulnerability L1tf: Mitigation; PTE Inversion 2025-05-07T19:43:03.2088779Z Vulnerability Mds: Vulnerable: Clear CPU buffers attempted, no microcode; SMT Host state unknown 2025-05-07T19:43:03.2089425Z Vulnerability Meltdown: Mitigation; PTI 2025-05-07T19:43:03.2090106Z Vulnerability Mmio stale data: Vulnerable: Clear CPU buffers attempted, no microcode; SMT Host state unknown 2025-05-07T19:43:03.2090908Z Vulnerability Reg file data sampling: Not affected 2025-05-07T19:43:03.2091344Z Vulnerability Retbleed: Vulnerable 2025-05-07T19:43:03.2091744Z Vulnerability Spec rstack overflow: Not affected 2025-05-07T19:43:03.2092171Z Vulnerability Spec store bypass: Vulnerable 2025-05-07T19:43:03.2092816Z Vulnerability Spectre v1: Mitigation; usercopy/swapgs barriers and __user pointer sanitization 2025-05-07T19:43:03.2093719Z Vulnerability Spectre v2: Mitigation; Retpolines; STIBP disabled; RSB filling; PBRSB-eIBRS Not affected; BHI Retpoline 2025-05-07T19:43:03.2094402Z Vulnerability Srbds: Not affected 2025-05-07T19:43:03.2094836Z Vulnerability Tsx async abort: Not affected 2025-05-07T19:43:03.2095098Z 2025-05-07T19:43:03.2095208Z + cat /proc/cpuinfo 2025-05-07T19:43:03.2095516Z 2025-05-07T19:43:03.2095881Z processor : 0 2025-05-07T19:43:03.2096168Z vendor_id : GenuineIntel 2025-05-07T19:43:03.2096439Z cpu family : 6 2025-05-07T19:43:03.2096702Z model : 85 2025-05-07T19:43:03.2097018Z model name : Intel(R) Xeon(R) Platinum 8275CL CPU @ 3.00GHz 2025-05-07T19:43:03.2097422Z stepping : 7 2025-05-07T19:43:03.2097657Z microcode : 0x5003901 2025-05-07T19:43:03.2097938Z cpu MHz : 3200.679 2025-05-07T19:43:03.2098173Z cache size : 36608 KB 2025-05-07T19:43:03.2098452Z physical id : 0 2025-05-07T19:43:03.2098695Z siblings : 48 2025-05-07T19:43:03.2098957Z core id : 0 2025-05-07T19:43:03.2099177Z cpu cores : 24 2025-05-07T19:43:03.2099453Z apicid : 0 2025-05-07T19:43:03.2099701Z initial apicid : 0 2025-05-07T19:43:03.2099937Z fpu : yes 2025-05-07T19:43:03.2100188Z fpu_exception : yes 2025-05-07T19:43:03.2100434Z cpuid level : 13 2025-05-07T19:43:03.2100730Z wp : yes 2025-05-07T19:43:03.2103035Z flags : fpu vme de pse tsc msr pae mce cx8 apic sep mtrr pge mca cmov pat pse36 clflush mmx fxsr sse sse2 ss ht syscall nx pdpe1gb rdtscp lm constant_tsc arch_perfmon rep_good nopl xtopology nonstop_tsc cpuid aperfmperf tsc_known_freq pni pclmulqdq monitor ssse3 fma cx16 pcid sse4_1 sse4_2 x2apic movbe popcnt tsc_deadline_timer aes xsave avx f16c rdrand hypervisor lahf_lm abm 3dnowprefetch invpcid_single pti fsgsbase tsc_adjust bmi1 avx2 smep bmi2 erms invpcid mpx avx512f avx512dq rdseed adx smap clflushopt clwb avx512cd avx512bw avx512vl xsaveopt xsavec xgetbv1 xsaves ida arat pku ospke avx512_vnni 2025-05-07T19:43:03.2105717Z bugs : cpu_meltdown spectre_v1 spectre_v2 spec_store_bypass l1tf mds swapgs itlb_multihit mmio_stale_data retbleed gds bhi 2025-05-07T19:43:03.2106347Z bogomips : 5999.98 2025-05-07T19:43:03.2106581Z clflush size : 64 2025-05-07T19:43:03.2106849Z cache_alignment : 64 2025-05-07T19:43:03.2107137Z address sizes : 46 bits physical, 48 bits virtual 2025-05-07T19:43:03.2107590Z power management: 2025-05-07T19:43:03.2107747Z 2025-05-07T19:43:03.2107879Z processor : 1 2025-05-07T19:43:03.2108130Z vendor_id : GenuineIntel 2025-05-07T19:43:03.2108441Z cpu family : 6 2025-05-07T19:43:03.2108679Z model : 85 2025-05-07T19:43:03.2109014Z model name : Intel(R) Xeon(R) Platinum 8275CL CPU @ 3.00GHz 2025-05-07T19:43:03.2109393Z stepping : 7 2025-05-07T19:43:03.2109656Z microcode : 0x5003901 2025-05-07T19:43:03.2109920Z cpu MHz : 3269.772 2025-05-07T19:43:03.2110179Z cache size : 36608 KB 2025-05-07T19:43:03.2110421Z physical id : 0 2025-05-07T19:43:03.2110687Z siblings : 48 2025-05-07T19:43:03.2110929Z core id : 1 2025-05-07T19:43:03.2111154Z cpu cores : 24 2025-05-07T19:43:03.2111408Z apicid : 2 2025-05-07T19:43:03.2111625Z initial apicid : 2 2025-05-07T19:43:03.2111895Z fpu : yes 2025-05-07T19:43:03.2112128Z fpu_exception : yes 2025-05-07T19:43:03.2112408Z cpuid level : 13 2025-05-07T19:43:03.2112637Z wp : yes 2025-05-07T19:43:03.2114947Z flags : fpu vme de pse tsc msr pae mce cx8 apic sep mtrr pge mca cmov pat pse36 clflush mmx fxsr sse sse2 ss ht syscall nx pdpe1gb rdtscp lm constant_tsc arch_perfmon rep_good nopl xtopology nonstop_tsc cpuid aperfmperf tsc_known_freq pni pclmulqdq monitor ssse3 fma cx16 pcid sse4_1 sse4_2 x2apic movbe popcnt tsc_deadline_timer aes xsave avx f16c rdrand hypervisor lahf_lm abm 3dnowprefetch invpcid_single pti fsgsbase tsc_adjust bmi1 avx2 smep bmi2 erms invpcid mpx avx512f avx512dq rdseed adx smap clflushopt clwb avx512cd avx512bw avx512vl xsaveopt xsavec xgetbv1 xsaves ida arat pku ospke avx512_vnni 2025-05-07T19:43:03.2117607Z bugs : cpu_meltdown spectre_v1 spectre_v2 spec_store_bypass l1tf mds swapgs itlb_multihit mmio_stale_data retbleed gds bhi 2025-05-07T19:43:03.2118205Z bogomips : 5999.98 2025-05-07T19:43:03.2118470Z clflush size : 64 2025-05-07T19:43:03.2118706Z cache_alignment : 64 2025-05-07T19:43:03.2119020Z address sizes : 46 bits physical, 48 bits virtual 2025-05-07T19:43:03.2119390Z power management: 2025-05-07T19:43:03.2119539Z 2025-05-07T19:43:03.2119633Z processor : 2 2025-05-07T19:43:03.2119887Z vendor_id : GenuineIntel 2025-05-07T19:43:03.2120214Z cpu family : 6 2025-05-07T19:43:03.2120465Z model : 85 2025-05-07T19:43:03.2120760Z model name : Intel(R) Xeon(R) Platinum 8275CL CPU @ 3.00GHz 2025-05-07T19:43:03.2121155Z stepping : 7 2025-05-07T19:43:03.2121384Z microcode : 0x5003901 2025-05-07T19:43:03.2121663Z cpu MHz : 3155.763 2025-05-07T19:43:03.2121910Z cache size : 36608 KB 2025-05-07T19:43:03.2122189Z physical id : 0 2025-05-07T19:43:03.2122444Z siblings : 48 2025-05-07T19:43:03.2122675Z core id : 2 2025-05-07T19:43:03.2122915Z cpu cores : 24 2025-05-07T19:43:03.2123136Z apicid : 4 2025-05-07T19:43:03.2123381Z initial apicid : 4 2025-05-07T19:43:03.2123615Z fpu : yes 2025-05-07T19:43:03.2123860Z fpu_exception : yes 2025-05-07T19:43:03.2124098Z cpuid level : 13 2025-05-07T19:43:03.2124350Z wp : yes 2025-05-07T19:43:03.2126635Z flags : fpu vme de pse tsc msr pae mce cx8 apic sep mtrr pge mca cmov pat pse36 clflush mmx fxsr sse sse2 ss ht syscall nx pdpe1gb rdtscp lm constant_tsc arch_perfmon rep_good nopl xtopology nonstop_tsc cpuid aperfmperf tsc_known_freq pni pclmulqdq monitor ssse3 fma cx16 pcid sse4_1 sse4_2 x2apic movbe popcnt tsc_deadline_timer aes xsave avx f16c rdrand hypervisor lahf_lm abm 3dnowprefetch invpcid_single pti fsgsbase tsc_adjust bmi1 avx2 smep bmi2 erms invpcid mpx avx512f avx512dq rdseed adx smap clflushopt clwb avx512cd avx512bw avx512vl xsaveopt xsavec xgetbv1 xsaves ida arat pku ospke avx512_vnni 2025-05-07T19:43:03.2129551Z bugs : cpu_meltdown spectre_v1 spectre_v2 spec_store_bypass l1tf mds swapgs itlb_multihit mmio_stale_data retbleed gds bhi 2025-05-07T19:43:03.2130286Z bogomips : 5999.98 2025-05-07T19:43:03.2130537Z clflush size : 64 2025-05-07T19:43:03.2130816Z cache_alignment : 64 2025-05-07T19:43:03.2131151Z address sizes : 46 bits physical, 48 bits virtual 2025-05-07T19:43:03.2131501Z power management: 2025-05-07T19:43:03.2131647Z 2025-05-07T19:43:03.2131777Z processor : 3 2025-05-07T19:43:03.2132175Z vendor_id : GenuineIntel 2025-05-07T19:43:03.2132472Z cpu family : 6 2025-05-07T19:43:03.2132705Z model : 85 2025-05-07T19:43:03.2133049Z model name : Intel(R) Xeon(R) Platinum 8275CL CPU @ 3.00GHz 2025-05-07T19:43:03.2133426Z stepping : 7 2025-05-07T19:43:03.2133694Z microcode : 0x5003901 2025-05-07T19:43:03.2133945Z cpu MHz : 3236.838 2025-05-07T19:43:03.2134216Z cache size : 36608 KB 2025-05-07T19:43:03.2134464Z physical id : 0 2025-05-07T19:43:03.2134725Z siblings : 48 2025-05-07T19:43:03.2134983Z core id : 3 2025-05-07T19:43:03.2135203Z cpu cores : 24 2025-05-07T19:43:03.2135455Z apicid : 6 2025-05-07T19:43:03.2135671Z initial apicid : 6 2025-05-07T19:43:03.2135941Z fpu : yes 2025-05-07T19:43:03.2136159Z fpu_exception : yes 2025-05-07T19:43:03.2136419Z cpuid level : 13 2025-05-07T19:43:03.2136647Z wp : yes 2025-05-07T19:43:03.2138941Z flags : fpu vme de pse tsc msr pae mce cx8 apic sep mtrr pge mca cmov pat pse36 clflush mmx fxsr sse sse2 ss ht syscall nx pdpe1gb rdtscp lm constant_tsc arch_perfmon rep_good nopl xtopology nonstop_tsc cpuid aperfmperf tsc_known_freq pni pclmulqdq monitor ssse3 fma cx16 pcid sse4_1 sse4_2 x2apic movbe popcnt tsc_deadline_timer aes xsave avx f16c rdrand hypervisor lahf_lm abm 3dnowprefetch invpcid_single pti fsgsbase tsc_adjust bmi1 avx2 smep bmi2 erms invpcid mpx avx512f avx512dq rdseed adx smap clflushopt clwb avx512cd avx512bw avx512vl xsaveopt xsavec xgetbv1 xsaves ida arat pku ospke avx512_vnni 2025-05-07T19:43:03.2141595Z bugs : cpu_meltdown spectre_v1 spectre_v2 spec_store_bypass l1tf mds swapgs itlb_multihit mmio_stale_data retbleed gds bhi 2025-05-07T19:43:03.2142195Z bogomips : 5999.98 2025-05-07T19:43:03.2142568Z clflush size : 64 2025-05-07T19:43:03.2142796Z cache_alignment : 64 2025-05-07T19:43:03.2143102Z address sizes : 46 bits physical, 48 bits virtual 2025-05-07T19:43:03.2143459Z power management: 2025-05-07T19:43:03.2143600Z 2025-05-07T19:43:03.2143693Z processor : 4 2025-05-07T19:43:03.2143953Z vendor_id : GenuineIntel 2025-05-07T19:43:03.2144213Z cpu family : 6 2025-05-07T19:43:03.2144452Z model : 85 2025-05-07T19:43:03.2144830Z model name : Intel(R) Xeon(R) Platinum 8275CL CPU @ 3.00GHz 2025-05-07T19:43:03.2145228Z stepping : 7 2025-05-07T19:43:03.2145459Z microcode : 0x5003901 2025-05-07T19:43:03.2145742Z cpu MHz : 2999.992 2025-05-07T19:43:03.2145978Z cache size : 36608 KB 2025-05-07T19:43:03.2146255Z physical id : 0 2025-05-07T19:43:03.2146514Z siblings : 48 2025-05-07T19:43:03.2146729Z core id : 4 2025-05-07T19:43:03.2146969Z cpu cores : 24 2025-05-07T19:43:03.2147187Z apicid : 8 2025-05-07T19:43:03.2147421Z initial apicid : 8 2025-05-07T19:43:03.2147651Z fpu : yes 2025-05-07T19:43:03.2147886Z fpu_exception : yes 2025-05-07T19:43:03.2148122Z cpuid level : 13 2025-05-07T19:43:03.2148364Z wp : yes 2025-05-07T19:43:03.2150577Z flags : fpu vme de pse tsc msr pae mce cx8 apic sep mtrr pge mca cmov pat pse36 clflush mmx fxsr sse sse2 ss ht syscall nx pdpe1gb rdtscp lm constant_tsc arch_perfmon rep_good nopl xtopology nonstop_tsc cpuid aperfmperf tsc_known_freq pni pclmulqdq monitor ssse3 fma cx16 pcid sse4_1 sse4_2 x2apic movbe popcnt tsc_deadline_timer aes xsave avx f16c rdrand hypervisor lahf_lm abm 3dnowprefetch invpcid_single pti fsgsbase tsc_adjust bmi1 avx2 smep bmi2 erms invpcid mpx avx512f avx512dq rdseed adx smap clflushopt clwb avx512cd avx512bw avx512vl xsaveopt xsavec xgetbv1 xsaves ida arat pku ospke avx512_vnni 2025-05-07T19:43:03.2153201Z bugs : cpu_meltdown spectre_v1 spectre_v2 spec_store_bypass l1tf mds swapgs itlb_multihit mmio_stale_data retbleed gds bhi 2025-05-07T19:43:03.2153828Z bogomips : 5999.98 2025-05-07T19:43:03.2154066Z clflush size : 64 2025-05-07T19:43:03.2154339Z cache_alignment : 64 2025-05-07T19:43:03.2154659Z address sizes : 46 bits physical, 48 bits virtual 2025-05-07T19:43:03.2154998Z power management: 2025-05-07T19:43:03.2155136Z 2025-05-07T19:43:03.2155259Z processor : 5 2025-05-07T19:43:03.2155485Z vendor_id : GenuineIntel 2025-05-07T19:43:03.2155759Z cpu family : 6 2025-05-07T19:43:03.2156037Z model : 85 2025-05-07T19:43:03.2156470Z model name : Intel(R) Xeon(R) Platinum 8275CL CPU @ 3.00GHz 2025-05-07T19:43:03.2156840Z stepping : 7 2025-05-07T19:43:03.2157094Z microcode : 0x5003901 2025-05-07T19:43:03.2157337Z cpu MHz : 2999.992 2025-05-07T19:43:03.2157596Z cache size : 36608 KB 2025-05-07T19:43:03.2157861Z physical id : 0 2025-05-07T19:43:03.2158093Z siblings : 48 2025-05-07T19:43:03.2158334Z core id : 5 2025-05-07T19:43:03.2158549Z cpu cores : 24 2025-05-07T19:43:03.2158791Z apicid : 10 2025-05-07T19:43:03.2159008Z initial apicid : 10 2025-05-07T19:43:03.2159260Z fpu : yes 2025-05-07T19:43:03.2159468Z fpu_exception : yes 2025-05-07T19:43:03.2159722Z cpuid level : 13 2025-05-07T19:43:03.2159945Z wp : yes 2025-05-07T19:43:03.2162346Z flags : fpu vme de pse tsc msr pae mce cx8 apic sep mtrr pge mca cmov pat pse36 clflush mmx fxsr sse sse2 ss ht syscall nx pdpe1gb rdtscp lm constant_tsc arch_perfmon rep_good nopl xtopology nonstop_tsc cpuid aperfmperf tsc_known_freq pni pclmulqdq monitor ssse3 fma cx16 pcid sse4_1 sse4_2 x2apic movbe popcnt tsc_deadline_timer aes xsave avx f16c rdrand hypervisor lahf_lm abm 3dnowprefetch invpcid_single pti fsgsbase tsc_adjust bmi1 avx2 smep bmi2 erms invpcid mpx avx512f avx512dq rdseed adx smap clflushopt clwb avx512cd avx512bw avx512vl xsaveopt xsavec xgetbv1 xsaves ida arat pku ospke avx512_vnni 2025-05-07T19:43:03.2165033Z bugs : cpu_meltdown spectre_v1 spectre_v2 spec_store_bypass l1tf mds swapgs itlb_multihit mmio_stale_data retbleed gds bhi 2025-05-07T19:43:03.2165653Z bogomips : 5999.98 2025-05-07T19:43:03.2165929Z clflush size : 64 2025-05-07T19:43:03.2166194Z cache_alignment : 64 2025-05-07T19:43:03.2166483Z address sizes : 46 bits physical, 48 bits virtual 2025-05-07T19:43:03.2166849Z power management: 2025-05-07T19:43:03.2166991Z 2025-05-07T19:43:03.2167085Z processor : 6 2025-05-07T19:43:03.2167356Z vendor_id : GenuineIntel 2025-05-07T19:43:03.2167624Z cpu family : 6 2025-05-07T19:43:03.2167886Z model : 85 2025-05-07T19:43:03.2168211Z model name : Intel(R) Xeon(R) Platinum 8275CL CPU @ 3.00GHz 2025-05-07T19:43:03.2168691Z stepping : 7 2025-05-07T19:43:03.2168937Z microcode : 0x5003901 2025-05-07T19:43:03.2169235Z cpu MHz : 2999.992 2025-05-07T19:43:03.2169512Z cache size : 36608 KB 2025-05-07T19:43:03.2169955Z physical id : 0 2025-05-07T19:43:03.2170560Z siblings : 48 2025-05-07T19:43:03.2170934Z core id : 6 2025-05-07T19:43:03.2171198Z cpu cores : 24 2025-05-07T19:43:03.2171518Z apicid : 12 2025-05-07T19:43:03.2171751Z initial apicid : 12 2025-05-07T19:43:03.2172028Z fpu : yes 2025-05-07T19:43:03.2172253Z fpu_exception : yes 2025-05-07T19:43:03.2172532Z cpuid level : 13 2025-05-07T19:43:03.2172769Z wp : yes 2025-05-07T19:43:03.2175095Z flags : fpu vme de pse tsc msr pae mce cx8 apic sep mtrr pge mca cmov pat pse36 clflush mmx fxsr sse sse2 ss ht syscall nx pdpe1gb rdtscp lm constant_tsc arch_perfmon rep_good nopl xtopology nonstop_tsc cpuid aperfmperf tsc_known_freq pni pclmulqdq monitor ssse3 fma cx16 pcid sse4_1 sse4_2 x2apic movbe popcnt tsc_deadline_timer aes xsave avx f16c rdrand hypervisor lahf_lm abm 3dnowprefetch invpcid_single pti fsgsbase tsc_adjust bmi1 avx2 smep bmi2 erms invpcid mpx avx512f avx512dq rdseed adx smap clflushopt clwb avx512cd avx512bw avx512vl xsaveopt xsavec xgetbv1 xsaves ida arat pku ospke avx512_vnni 2025-05-07T19:43:03.2177769Z bugs : cpu_meltdown spectre_v1 spectre_v2 spec_store_bypass l1tf mds swapgs itlb_multihit mmio_stale_data retbleed gds bhi 2025-05-07T19:43:03.2178379Z bogomips : 5999.98 2025-05-07T19:43:03.2178638Z clflush size : 64 2025-05-07T19:43:03.2178886Z cache_alignment : 64 2025-05-07T19:43:03.2179203Z address sizes : 46 bits physical, 48 bits virtual 2025-05-07T19:43:03.2179552Z power management: 2025-05-07T19:43:03.2179720Z 2025-05-07T19:43:03.2179812Z processor : 7 2025-05-07T19:43:03.2180072Z vendor_id : GenuineIntel 2025-05-07T19:43:03.2180329Z cpu family : 6 2025-05-07T19:43:03.2180570Z model : 85 2025-05-07T19:43:03.2180859Z model name : Intel(R) Xeon(R) Platinum 8275CL CPU @ 3.00GHz 2025-05-07T19:43:03.2181334Z stepping : 7 2025-05-07T19:43:03.2181559Z microcode : 0x5003901 2025-05-07T19:43:03.2181832Z cpu MHz : 3139.732 2025-05-07T19:43:03.2182063Z cache size : 36608 KB 2025-05-07T19:43:03.2182332Z physical id : 0 2025-05-07T19:43:03.2182553Z siblings : 48 2025-05-07T19:43:03.2182916Z core id : 7 2025-05-07T19:43:03.2183142Z cpu cores : 24 2025-05-07T19:43:03.2183349Z apicid : 14 2025-05-07T19:43:03.2183592Z initial apicid : 14 2025-05-07T19:43:03.2183822Z fpu : yes 2025-05-07T19:43:03.2184066Z fpu_exception : yes 2025-05-07T19:43:03.2184301Z cpuid level : 13 2025-05-07T19:43:03.2184548Z wp : yes 2025-05-07T19:43:03.2186646Z flags : fpu vme de pse tsc msr pae mce cx8 apic sep mtrr pge mca cmov pat pse36 clflush mmx fxsr sse sse2 ss ht syscall nx pdpe1gb rdtscp lm constant_tsc arch_perfmon rep_good nopl xtopology nonstop_tsc cpuid aperfmperf tsc_known_freq pni pclmulqdq monitor ssse3 fma cx16 pcid sse4_1 sse4_2 x2apic movbe popcnt tsc_deadline_timer aes xsave avx f16c rdrand hypervisor lahf_lm abm 3dnowprefetch invpcid_single pti fsgsbase tsc_adjust bmi1 avx2 smep bmi2 erms invpcid mpx avx512f avx512dq rdseed adx smap clflushopt clwb avx512cd avx512bw avx512vl xsaveopt xsavec xgetbv1 xsaves ida arat pku ospke avx512_vnni 2025-05-07T19:43:03.2189106Z bugs : cpu_meltdown spectre_v1 spectre_v2 spec_store_bypass l1tf mds swapgs itlb_multihit mmio_stale_data retbleed gds bhi 2025-05-07T19:43:03.2189692Z bogomips : 5999.98 2025-05-07T19:43:03.2189913Z clflush size : 64 2025-05-07T19:43:03.2190158Z cache_alignment : 64 2025-05-07T19:43:03.2190428Z address sizes : 46 bits physical, 48 bits virtual 2025-05-07T19:43:03.2190773Z power management: 2025-05-07T19:43:03.2190907Z 2025-05-07T19:43:03.2191019Z processor : 8 2025-05-07T19:43:03.2191245Z vendor_id : GenuineIntel 2025-05-07T19:43:03.2191517Z cpu family : 6 2025-05-07T19:43:03.2191729Z model : 85 2025-05-07T19:43:03.2192026Z model name : Intel(R) Xeon(R) Platinum 8275CL CPU @ 3.00GHz 2025-05-07T19:43:03.2192371Z stepping : 7 2025-05-07T19:43:03.2192611Z microcode : 0x5003901 2025-05-07T19:43:03.2192839Z cpu MHz : 2999.992 2025-05-07T19:43:03.2193142Z cache size : 36608 KB 2025-05-07T19:43:03.2193532Z physical id : 0 2025-05-07T19:43:03.2193766Z siblings : 48 2025-05-07T19:43:03.2193973Z core id : 8 2025-05-07T19:43:03.2194210Z cpu cores : 24 2025-05-07T19:43:03.2194444Z apicid : 16 2025-05-07T19:43:03.2194659Z initial apicid : 16 2025-05-07T19:43:03.2194900Z fpu : yes 2025-05-07T19:43:03.2195100Z fpu_exception : yes 2025-05-07T19:43:03.2195344Z cpuid level : 13 2025-05-07T19:43:03.2195557Z wp : yes 2025-05-07T19:43:03.2197677Z flags : fpu vme de pse tsc msr pae mce cx8 apic sep mtrr pge mca cmov pat pse36 clflush mmx fxsr sse sse2 ss ht syscall nx pdpe1gb rdtscp lm constant_tsc arch_perfmon rep_good nopl xtopology nonstop_tsc cpuid aperfmperf tsc_known_freq pni pclmulqdq monitor ssse3 fma cx16 pcid sse4_1 sse4_2 x2apic movbe popcnt tsc_deadline_timer aes xsave avx f16c rdrand hypervisor lahf_lm abm 3dnowprefetch invpcid_single pti fsgsbase tsc_adjust bmi1 avx2 smep bmi2 erms invpcid mpx avx512f avx512dq rdseed adx smap clflushopt clwb avx512cd avx512bw avx512vl xsaveopt xsavec xgetbv1 xsaves ida arat pku ospke avx512_vnni 2025-05-07T19:43:03.2200123Z bugs : cpu_meltdown spectre_v1 spectre_v2 spec_store_bypass l1tf mds swapgs itlb_multihit mmio_stale_data retbleed gds bhi 2025-05-07T19:43:03.2200679Z bogomips : 5999.98 2025-05-07T19:43:03.2200922Z clflush size : 64 2025-05-07T19:43:03.2201149Z cache_alignment : 64 2025-05-07T19:43:03.2201441Z address sizes : 46 bits physical, 48 bits virtual 2025-05-07T19:43:03.2201758Z power management: 2025-05-07T19:43:03.2201913Z 2025-05-07T19:43:03.2202000Z processor : 9 2025-05-07T19:43:03.2202238Z vendor_id : GenuineIntel 2025-05-07T19:43:03.2202477Z cpu family : 6 2025-05-07T19:43:03.2202704Z model : 85 2025-05-07T19:43:03.2202971Z model name : Intel(R) Xeon(R) Platinum 8275CL CPU @ 3.00GHz 2025-05-07T19:43:03.2203332Z stepping : 7 2025-05-07T19:43:03.2203537Z microcode : 0x5003901 2025-05-07T19:43:03.2203847Z cpu MHz : 3197.728 2025-05-07T19:43:03.2204069Z cache size : 36608 KB 2025-05-07T19:43:03.2204327Z physical id : 0 2025-05-07T19:43:03.2204541Z siblings : 48 2025-05-07T19:43:03.2204770Z core id : 9 2025-05-07T19:43:03.2204999Z cpu cores : 24 2025-05-07T19:43:03.2205219Z apicid : 18 2025-05-07T19:43:03.2205458Z initial apicid : 18 2025-05-07T19:43:03.2205686Z fpu : yes 2025-05-07T19:43:03.2205915Z fpu_exception : yes 2025-05-07T19:43:03.2206135Z cpuid level : 13 2025-05-07T19:43:03.2206362Z wp : yes 2025-05-07T19:43:03.2208699Z flags : fpu vme de pse tsc msr pae mce cx8 apic sep mtrr pge mca cmov pat pse36 clflush mmx fxsr sse sse2 ss ht syscall nx pdpe1gb rdtscp lm constant_tsc arch_perfmon rep_good nopl xtopology nonstop_tsc cpuid aperfmperf tsc_known_freq pni pclmulqdq monitor ssse3 fma cx16 pcid sse4_1 sse4_2 x2apic movbe popcnt tsc_deadline_timer aes xsave avx f16c rdrand hypervisor lahf_lm abm 3dnowprefetch invpcid_single pti fsgsbase tsc_adjust bmi1 avx2 smep bmi2 erms invpcid mpx avx512f avx512dq rdseed adx smap clflushopt clwb avx512cd avx512bw avx512vl xsaveopt xsavec xgetbv1 xsaves ida arat pku ospke avx512_vnni 2025-05-07T19:43:03.2211576Z bugs : cpu_meltdown spectre_v1 spectre_v2 spec_store_bypass l1tf mds swapgs itlb_multihit mmio_stale_data retbleed gds bhi 2025-05-07T19:43:03.2212171Z bogomips : 5999.98 2025-05-07T19:43:03.2212389Z clflush size : 64 2025-05-07T19:43:03.2212631Z cache_alignment : 64 2025-05-07T19:43:03.2212911Z address sizes : 46 bits physical, 48 bits virtual 2025-05-07T19:43:03.2213265Z power management: 2025-05-07T19:43:03.2213400Z 2025-05-07T19:43:03.2213506Z processor : 10 2025-05-07T19:43:03.2213728Z vendor_id : GenuineIntel 2025-05-07T19:43:03.2213989Z cpu family : 6 2025-05-07T19:43:03.2214190Z model : 85 2025-05-07T19:43:03.2214482Z model name : Intel(R) Xeon(R) Platinum 8275CL CPU @ 3.00GHz 2025-05-07T19:43:03.2214835Z stepping : 7 2025-05-07T19:43:03.2215095Z microcode : 0x5003901 2025-05-07T19:43:03.2215347Z cpu MHz : 2999.992 2025-05-07T19:43:03.2215622Z cache size : 36608 KB 2025-05-07T19:43:03.2215873Z physical id : 0 2025-05-07T19:43:03.2216217Z siblings : 48 2025-05-07T19:43:03.2216447Z core id : 10 2025-05-07T19:43:03.2216701Z cpu cores : 24 2025-05-07T19:43:03.2216961Z apicid : 20 2025-05-07T19:43:03.2217190Z initial apicid : 20 2025-05-07T19:43:03.2217460Z fpu : yes 2025-05-07T19:43:03.2217680Z fpu_exception : yes 2025-05-07T19:43:03.2217937Z cpuid level : 13 2025-05-07T19:43:03.2218163Z wp : yes 2025-05-07T19:43:03.2220450Z flags : fpu vme de pse tsc msr pae mce cx8 apic sep mtrr pge mca cmov pat pse36 clflush mmx fxsr sse sse2 ss ht syscall nx pdpe1gb rdtscp lm constant_tsc arch_perfmon rep_good nopl xtopology nonstop_tsc cpuid aperfmperf tsc_known_freq pni pclmulqdq monitor ssse3 fma cx16 pcid sse4_1 sse4_2 x2apic movbe popcnt tsc_deadline_timer aes xsave avx f16c rdrand hypervisor lahf_lm abm 3dnowprefetch invpcid_single pti fsgsbase tsc_adjust bmi1 avx2 smep bmi2 erms invpcid mpx avx512f avx512dq rdseed adx smap clflushopt clwb avx512cd avx512bw avx512vl xsaveopt xsavec xgetbv1 xsaves ida arat pku ospke avx512_vnni 2025-05-07T19:43:03.2223213Z bugs : cpu_meltdown spectre_v1 spectre_v2 spec_store_bypass l1tf mds swapgs itlb_multihit mmio_stale_data retbleed gds bhi 2025-05-07T19:43:03.2223791Z bogomips : 5999.98 2025-05-07T19:43:03.2224034Z clflush size : 64 2025-05-07T19:43:03.2224252Z cache_alignment : 64 2025-05-07T19:43:03.2224534Z address sizes : 46 bits physical, 48 bits virtual 2025-05-07T19:43:03.2224857Z power management: 2025-05-07T19:43:03.2225003Z 2025-05-07T19:43:03.2225087Z processor : 11 2025-05-07T19:43:03.2225314Z vendor_id : GenuineIntel 2025-05-07T19:43:03.2225546Z cpu family : 6 2025-05-07T19:43:03.2225756Z model : 85 2025-05-07T19:43:03.2226022Z model name : Intel(R) Xeon(R) Platinum 8275CL CPU @ 3.00GHz 2025-05-07T19:43:03.2226380Z stepping : 7 2025-05-07T19:43:03.2226580Z microcode : 0x5003901 2025-05-07T19:43:03.2226815Z cpu MHz : 2999.992 2025-05-07T19:43:03.2227026Z cache size : 36608 KB 2025-05-07T19:43:03.2227327Z physical id : 0 2025-05-07T19:43:03.2227539Z siblings : 48 2025-05-07T19:43:03.2227794Z core id : 11 2025-05-07T19:43:03.2228045Z cpu cores : 24 2025-05-07T19:43:03.2228263Z apicid : 22 2025-05-07T19:43:03.2228757Z initial apicid : 22 2025-05-07T19:43:03.2229171Z fpu : yes 2025-05-07T19:43:03.2229555Z fpu_exception : yes 2025-05-07T19:43:03.2229793Z cpuid level : 13 2025-05-07T19:43:03.2230047Z wp : yes 2025-05-07T19:43:03.2232322Z flags : fpu vme de pse tsc msr pae mce cx8 apic sep mtrr pge mca cmov pat pse36 clflush mmx fxsr sse sse2 ss ht syscall nx pdpe1gb rdtscp lm constant_tsc arch_perfmon rep_good nopl xtopology nonstop_tsc cpuid aperfmperf tsc_known_freq pni pclmulqdq monitor ssse3 fma cx16 pcid sse4_1 sse4_2 x2apic movbe popcnt tsc_deadline_timer aes xsave avx f16c rdrand hypervisor lahf_lm abm 3dnowprefetch invpcid_single pti fsgsbase tsc_adjust bmi1 avx2 smep bmi2 erms invpcid mpx avx512f avx512dq rdseed adx smap clflushopt clwb avx512cd avx512bw avx512vl xsaveopt xsavec xgetbv1 xsaves ida arat pku ospke avx512_vnni 2025-05-07T19:43:03.2234972Z bugs : cpu_meltdown spectre_v1 spectre_v2 spec_store_bypass l1tf mds swapgs itlb_multihit mmio_stale_data retbleed gds bhi 2025-05-07T19:43:03.2235594Z bogomips : 5999.98 2025-05-07T19:43:03.2235827Z clflush size : 64 2025-05-07T19:43:03.2236091Z cache_alignment : 64 2025-05-07T19:43:03.2236379Z address sizes : 46 bits physical, 48 bits virtual 2025-05-07T19:43:03.2236748Z power management: 2025-05-07T19:43:03.2236884Z 2025-05-07T19:43:03.2236987Z processor : 12 2025-05-07T19:43:03.2237204Z vendor_id : GenuineIntel 2025-05-07T19:43:03.2237460Z cpu family : 6 2025-05-07T19:43:03.2237663Z model : 85 2025-05-07T19:43:03.2237950Z model name : Intel(R) Xeon(R) Platinum 8275CL CPU @ 3.00GHz 2025-05-07T19:43:03.2238298Z stepping : 7 2025-05-07T19:43:03.2238524Z microcode : 0x5003901 2025-05-07T19:43:03.2238751Z cpu MHz : 3233.323 2025-05-07T19:43:03.2238980Z cache size : 36608 KB 2025-05-07T19:43:03.2239209Z physical id : 0 2025-05-07T19:43:03.2239435Z siblings : 48 2025-05-07T19:43:03.2239636Z core id : 12 2025-05-07T19:43:03.2239972Z cpu cores : 24 2025-05-07T19:43:03.2240197Z apicid : 24 2025-05-07T19:43:03.2240401Z initial apicid : 24 2025-05-07T19:43:03.2240633Z fpu : yes 2025-05-07T19:43:03.2240833Z fpu_exception : yes 2025-05-07T19:43:03.2241070Z cpuid level : 13 2025-05-07T19:43:03.2241283Z wp : yes 2025-05-07T19:43:03.2243608Z flags : fpu vme de pse tsc msr pae mce cx8 apic sep mtrr pge mca cmov pat pse36 clflush mmx fxsr sse sse2 ss ht syscall nx pdpe1gb rdtscp lm constant_tsc arch_perfmon rep_good nopl xtopology nonstop_tsc cpuid aperfmperf tsc_known_freq pni pclmulqdq monitor ssse3 fma cx16 pcid sse4_1 sse4_2 x2apic movbe popcnt tsc_deadline_timer aes xsave avx f16c rdrand hypervisor lahf_lm abm 3dnowprefetch invpcid_single pti fsgsbase tsc_adjust bmi1 avx2 smep bmi2 erms invpcid mpx avx512f avx512dq rdseed adx smap clflushopt clwb avx512cd avx512bw avx512vl xsaveopt xsavec xgetbv1 xsaves ida arat pku ospke avx512_vnni 2025-05-07T19:43:03.2246169Z bugs : cpu_meltdown spectre_v1 spectre_v2 spec_store_bypass l1tf mds swapgs itlb_multihit mmio_stale_data retbleed gds bhi 2025-05-07T19:43:03.2246740Z bogomips : 5999.98 2025-05-07T19:43:03.2246965Z clflush size : 64 2025-05-07T19:43:03.2247179Z cache_alignment : 64 2025-05-07T19:43:03.2247458Z address sizes : 46 bits physical, 48 bits virtual 2025-05-07T19:43:03.2247793Z power management: 2025-05-07T19:43:03.2247922Z 2025-05-07T19:43:03.2248010Z processor : 13 2025-05-07T19:43:03.2248237Z vendor_id : GenuineIntel 2025-05-07T19:43:03.2248474Z cpu family : 6 2025-05-07T19:43:03.2248685Z model : 85 2025-05-07T19:43:03.2248963Z model name : Intel(R) Xeon(R) Platinum 8275CL CPU @ 3.00GHz 2025-05-07T19:43:03.2249335Z stepping : 7 2025-05-07T19:43:03.2249537Z microcode : 0x5003901 2025-05-07T19:43:03.2249774Z cpu MHz : 2999.992 2025-05-07T19:43:03.2249990Z cache size : 36608 KB 2025-05-07T19:43:03.2250314Z physical id : 0 2025-05-07T19:43:03.2250524Z siblings : 48 2025-05-07T19:43:03.2251034Z core id : 13 2025-05-07T19:43:03.2251257Z cpu cores : 24 2025-05-07T19:43:03.2251549Z apicid : 26 2025-05-07T19:43:03.2251777Z initial apicid : 26 2025-05-07T19:43:03.2251993Z fpu : yes 2025-05-07T19:43:03.2252208Z fpu_exception : yes 2025-05-07T19:43:03.2252427Z cpuid level : 13 2025-05-07T19:43:03.2252649Z wp : yes 2025-05-07T19:43:03.2254891Z flags : fpu vme de pse tsc msr pae mce cx8 apic sep mtrr pge mca cmov pat pse36 clflush mmx fxsr sse sse2 ss ht syscall nx pdpe1gb rdtscp lm constant_tsc arch_perfmon rep_good nopl xtopology nonstop_tsc cpuid aperfmperf tsc_known_freq pni pclmulqdq monitor ssse3 fma cx16 pcid sse4_1 sse4_2 x2apic movbe popcnt tsc_deadline_timer aes xsave avx f16c rdrand hypervisor lahf_lm abm 3dnowprefetch invpcid_single pti fsgsbase tsc_adjust bmi1 avx2 smep bmi2 erms invpcid mpx avx512f avx512dq rdseed adx smap clflushopt clwb avx512cd avx512bw avx512vl xsaveopt xsavec xgetbv1 xsaves ida arat pku ospke avx512_vnni 2025-05-07T19:43:03.2257547Z bugs : cpu_meltdown spectre_v1 spectre_v2 spec_store_bypass l1tf mds swapgs itlb_multihit mmio_stale_data retbleed gds bhi 2025-05-07T19:43:03.2258137Z bogomips : 5999.98 2025-05-07T19:43:03.2258371Z clflush size : 64 2025-05-07T19:43:03.2258591Z cache_alignment : 64 2025-05-07T19:43:03.2258885Z address sizes : 46 bits physical, 48 bits virtual 2025-05-07T19:43:03.2259233Z power management: 2025-05-07T19:43:03.2259369Z 2025-05-07T19:43:03.2259456Z processor : 14 2025-05-07T19:43:03.2259700Z vendor_id : GenuineIntel 2025-05-07T19:43:03.2259943Z cpu family : 6 2025-05-07T19:43:03.2260176Z model : 85 2025-05-07T19:43:03.2260455Z model name : Intel(R) Xeon(R) Platinum 8275CL CPU @ 3.00GHz 2025-05-07T19:43:03.2260828Z stepping : 7 2025-05-07T19:43:03.2261038Z microcode : 0x5003901 2025-05-07T19:43:03.2261294Z cpu MHz : 2999.992 2025-05-07T19:43:03.2261517Z cache size : 36608 KB 2025-05-07T19:43:03.2261770Z physical id : 0 2025-05-07T19:43:03.2261987Z siblings : 48 2025-05-07T19:43:03.2262217Z core id : 14 2025-05-07T19:43:03.2262446Z cpu cores : 24 2025-05-07T19:43:03.2262656Z apicid : 28 2025-05-07T19:43:03.2262889Z initial apicid : 28 2025-05-07T19:43:03.2263163Z fpu : yes 2025-05-07T19:43:03.2263389Z fpu_exception : yes 2025-05-07T19:43:03.2263615Z cpuid level : 13 2025-05-07T19:43:03.2263850Z wp : yes 2025-05-07T19:43:03.2266094Z flags : fpu vme de pse tsc msr pae mce cx8 apic sep mtrr pge mca cmov pat pse36 clflush mmx fxsr sse sse2 ss ht syscall nx pdpe1gb rdtscp lm constant_tsc arch_perfmon rep_good nopl xtopology nonstop_tsc cpuid aperfmperf tsc_known_freq pni pclmulqdq monitor ssse3 fma cx16 pcid sse4_1 sse4_2 x2apic movbe popcnt tsc_deadline_timer aes xsave avx f16c rdrand hypervisor lahf_lm abm 3dnowprefetch invpcid_single pti fsgsbase tsc_adjust bmi1 avx2 smep bmi2 erms invpcid mpx avx512f avx512dq rdseed adx smap clflushopt clwb avx512cd avx512bw avx512vl xsaveopt xsavec xgetbv1 xsaves ida arat pku ospke avx512_vnni 2025-05-07T19:43:03.2268714Z bugs : cpu_meltdown spectre_v1 spectre_v2 spec_store_bypass l1tf mds swapgs itlb_multihit mmio_stale_data retbleed gds bhi 2025-05-07T19:43:03.2269313Z bogomips : 5999.98 2025-05-07T19:43:03.2269534Z clflush size : 64 2025-05-07T19:43:03.2269772Z cache_alignment : 64 2025-05-07T19:43:03.2270043Z address sizes : 46 bits physical, 48 bits virtual 2025-05-07T19:43:03.2270378Z power management: 2025-05-07T19:43:03.2270509Z 2025-05-07T19:43:03.2270612Z processor : 15 2025-05-07T19:43:03.2270828Z vendor_id : GenuineIntel 2025-05-07T19:43:03.2271082Z cpu family : 6 2025-05-07T19:43:03.2271285Z model : 85 2025-05-07T19:43:03.2271573Z model name : Intel(R) Xeon(R) Platinum 8275CL CPU @ 3.00GHz 2025-05-07T19:43:03.2271921Z stepping : 7 2025-05-07T19:43:03.2272142Z microcode : 0x5003901 2025-05-07T19:43:03.2272372Z cpu MHz : 3203.582 2025-05-07T19:43:03.2272629Z cache size : 36608 KB 2025-05-07T19:43:03.2272874Z physical id : 0 2025-05-07T19:43:03.2273125Z siblings : 48 2025-05-07T19:43:03.2273366Z core id : 15 2025-05-07T19:43:03.2273587Z cpu cores : 24 2025-05-07T19:43:03.2273833Z apicid : 30 2025-05-07T19:43:03.2274126Z initial apicid : 30 2025-05-07T19:43:03.2274383Z fpu : yes 2025-05-07T19:43:03.2274602Z fpu_exception : yes 2025-05-07T19:43:03.2274859Z cpuid level : 13 2025-05-07T19:43:03.2275080Z wp : yes 2025-05-07T19:43:03.2277357Z flags : fpu vme de pse tsc msr pae mce cx8 apic sep mtrr pge mca cmov pat pse36 clflush mmx fxsr sse sse2 ss ht syscall nx pdpe1gb rdtscp lm constant_tsc arch_perfmon rep_good nopl xtopology nonstop_tsc cpuid aperfmperf tsc_known_freq pni pclmulqdq monitor ssse3 fma cx16 pcid sse4_1 sse4_2 x2apic movbe popcnt tsc_deadline_timer aes xsave avx f16c rdrand hypervisor lahf_lm abm 3dnowprefetch invpcid_single pti fsgsbase tsc_adjust bmi1 avx2 smep bmi2 erms invpcid mpx avx512f avx512dq rdseed adx smap clflushopt clwb avx512cd avx512bw avx512vl xsaveopt xsavec xgetbv1 xsaves ida arat pku ospke avx512_vnni 2025-05-07T19:43:03.2280007Z bugs : cpu_meltdown spectre_v1 spectre_v2 spec_store_bypass l1tf mds swapgs itlb_multihit mmio_stale_data retbleed gds bhi 2025-05-07T19:43:03.2280608Z bogomips : 5999.98 2025-05-07T19:43:03.2280867Z clflush size : 64 2025-05-07T19:43:03.2281105Z cache_alignment : 64 2025-05-07T19:43:03.2281420Z address sizes : 46 bits physical, 48 bits virtual 2025-05-07T19:43:03.2281787Z power management: 2025-05-07T19:43:03.2281927Z 2025-05-07T19:43:03.2282022Z processor : 16 2025-05-07T19:43:03.2282281Z vendor_id : GenuineIntel 2025-05-07T19:43:03.2282534Z cpu family : 6 2025-05-07T19:43:03.2282783Z model : 85 2025-05-07T19:43:03.2283079Z model name : Intel(R) Xeon(R) Platinum 8275CL CPU @ 3.00GHz 2025-05-07T19:43:03.2283467Z stepping : 7 2025-05-07T19:43:03.2283691Z microcode : 0x5003901 2025-05-07T19:43:03.2283960Z cpu MHz : 2999.992 2025-05-07T19:43:03.2284189Z cache size : 36608 KB 2025-05-07T19:43:03.2284439Z physical id : 0 2025-05-07T19:43:03.2284655Z siblings : 48 2025-05-07T19:43:03.2285017Z core id : 16 2025-05-07T19:43:03.2285307Z cpu cores : 24 2025-05-07T19:43:03.2285527Z apicid : 32 2025-05-07T19:43:03.2285777Z initial apicid : 32 2025-05-07T19:43:03.2286008Z fpu : yes 2025-05-07T19:43:03.2286244Z fpu_exception : yes 2025-05-07T19:43:03.2286546Z cpuid level : 13 2025-05-07T19:43:03.2286795Z wp : yes 2025-05-07T19:43:03.2289047Z flags : fpu vme de pse tsc msr pae mce cx8 apic sep mtrr pge mca cmov pat pse36 clflush mmx fxsr sse sse2 ss ht syscall nx pdpe1gb rdtscp lm constant_tsc arch_perfmon rep_good nopl xtopology nonstop_tsc cpuid aperfmperf tsc_known_freq pni pclmulqdq monitor ssse3 fma cx16 pcid sse4_1 sse4_2 x2apic movbe popcnt tsc_deadline_timer aes xsave avx f16c rdrand hypervisor lahf_lm abm 3dnowprefetch invpcid_single pti fsgsbase tsc_adjust bmi1 avx2 smep bmi2 erms invpcid mpx avx512f avx512dq rdseed adx smap clflushopt clwb avx512cd avx512bw avx512vl xsaveopt xsavec xgetbv1 xsaves ida arat pku ospke avx512_vnni 2025-05-07T19:43:03.2291829Z bugs : cpu_meltdown spectre_v1 spectre_v2 spec_store_bypass l1tf mds swapgs itlb_multihit mmio_stale_data retbleed gds bhi 2025-05-07T19:43:03.2292445Z bogomips : 5999.98 2025-05-07T19:43:03.2292682Z clflush size : 64 2025-05-07T19:43:03.2292952Z cache_alignment : 64 2025-05-07T19:43:03.2293245Z address sizes : 46 bits physical, 48 bits virtual 2025-05-07T19:43:03.2293612Z power management: 2025-05-07T19:43:03.2293754Z 2025-05-07T19:43:03.2293871Z processor : 17 2025-05-07T19:43:03.2294105Z vendor_id : GenuineIntel 2025-05-07T19:43:03.2294382Z cpu family : 6 2025-05-07T19:43:03.2294604Z model : 85 2025-05-07T19:43:03.2294926Z model name : Intel(R) Xeon(R) Platinum 8275CL CPU @ 3.00GHz 2025-05-07T19:43:03.2295304Z stepping : 7 2025-05-07T19:43:03.2295562Z microcode : 0x5003901 2025-05-07T19:43:03.2295807Z cpu MHz : 2999.992 2025-05-07T19:43:03.2296065Z cache size : 36608 KB 2025-05-07T19:43:03.2296307Z physical id : 0 2025-05-07T19:43:03.2296559Z siblings : 48 2025-05-07T19:43:03.2296801Z core id : 17 2025-05-07T19:43:03.2297015Z cpu cores : 24 2025-05-07T19:43:03.2297254Z apicid : 34 2025-05-07T19:43:03.2297476Z initial apicid : 34 2025-05-07T19:43:03.2297726Z fpu : yes 2025-05-07T19:43:03.2298023Z fpu_exception : yes 2025-05-07T19:43:03.2298285Z cpuid level : 13 2025-05-07T19:43:03.2298716Z wp : yes 2025-05-07T19:43:03.2301153Z flags : fpu vme de pse tsc msr pae mce cx8 apic sep mtrr pge mca cmov pat pse36 clflush mmx fxsr sse sse2 ss ht syscall nx pdpe1gb rdtscp lm constant_tsc arch_perfmon rep_good nopl xtopology nonstop_tsc cpuid aperfmperf tsc_known_freq pni pclmulqdq monitor ssse3 fma cx16 pcid sse4_1 sse4_2 x2apic movbe popcnt tsc_deadline_timer aes xsave avx f16c rdrand hypervisor lahf_lm abm 3dnowprefetch invpcid_single pti fsgsbase tsc_adjust bmi1 avx2 smep bmi2 erms invpcid mpx avx512f avx512dq rdseed adx smap clflushopt clwb avx512cd avx512bw avx512vl xsaveopt xsavec xgetbv1 xsaves ida arat pku ospke avx512_vnni 2025-05-07T19:43:03.2303860Z bugs : cpu_meltdown spectre_v1 spectre_v2 spec_store_bypass l1tf mds swapgs itlb_multihit mmio_stale_data retbleed gds bhi 2025-05-07T19:43:03.2304463Z bogomips : 5999.98 2025-05-07T19:43:03.2304742Z clflush size : 64 2025-05-07T19:43:03.2304998Z cache_alignment : 64 2025-05-07T19:43:03.2305331Z address sizes : 46 bits physical, 48 bits virtual 2025-05-07T19:43:03.2305717Z power management: 2025-05-07T19:43:03.2305866Z 2025-05-07T19:43:03.2305963Z processor : 18 2025-05-07T19:43:03.2306231Z vendor_id : GenuineIntel 2025-05-07T19:43:03.2306496Z cpu family : 6 2025-05-07T19:43:03.2332297Z model : 85 2025-05-07T19:43:03.2332680Z model name : Intel(R) Xeon(R) Platinum 8275CL CPU @ 3.00GHz 2025-05-07T19:43:03.2333095Z stepping : 7 2025-05-07T19:43:03.2333323Z microcode : 0x5003901 2025-05-07T19:43:03.2333584Z cpu MHz : 3238.447 2025-05-07T19:43:03.2333808Z cache size : 36608 KB 2025-05-07T19:43:03.2334066Z physical id : 0 2025-05-07T19:43:03.2334313Z siblings : 48 2025-05-07T19:43:03.2334531Z core id : 18 2025-05-07T19:43:03.2334765Z cpu cores : 24 2025-05-07T19:43:03.2334981Z apicid : 36 2025-05-07T19:43:03.2335213Z initial apicid : 36 2025-05-07T19:43:03.2335436Z fpu : yes 2025-05-07T19:43:03.2335663Z fpu_exception : yes 2025-05-07T19:43:03.2335915Z cpuid level : 13 2025-05-07T19:43:03.2336148Z wp : yes 2025-05-07T19:43:03.2338650Z flags : fpu vme de pse tsc msr pae mce cx8 apic sep mtrr pge mca cmov pat pse36 clflush mmx fxsr sse sse2 ss ht syscall nx pdpe1gb rdtscp lm constant_tsc arch_perfmon rep_good nopl xtopology nonstop_tsc cpuid aperfmperf tsc_known_freq pni pclmulqdq monitor ssse3 fma cx16 pcid sse4_1 sse4_2 x2apic movbe popcnt tsc_deadline_timer aes xsave avx f16c rdrand hypervisor lahf_lm abm 3dnowprefetch invpcid_single pti fsgsbase tsc_adjust bmi1 avx2 smep bmi2 erms invpcid mpx avx512f avx512dq rdseed adx smap clflushopt clwb avx512cd avx512bw avx512vl xsaveopt xsavec xgetbv1 xsaves ida arat pku ospke avx512_vnni 2025-05-07T19:43:03.2341325Z bugs : cpu_meltdown spectre_v1 spectre_v2 spec_store_bypass l1tf mds swapgs itlb_multihit mmio_stale_data retbleed gds bhi 2025-05-07T19:43:03.2341957Z bogomips : 5999.98 2025-05-07T19:43:03.2342310Z clflush size : 64 2025-05-07T19:43:03.2342686Z cache_alignment : 64 2025-05-07T19:43:03.2342988Z address sizes : 46 bits physical, 48 bits virtual 2025-05-07T19:43:03.2343307Z power management: 2025-05-07T19:43:03.2343447Z 2025-05-07T19:43:03.2343559Z processor : 19 2025-05-07T19:43:03.2343780Z vendor_id : GenuineIntel 2025-05-07T19:43:03.2344043Z cpu family : 6 2025-05-07T19:43:03.2344245Z model : 85 2025-05-07T19:43:03.2344536Z model name : Intel(R) Xeon(R) Platinum 8275CL CPU @ 3.00GHz 2025-05-07T19:43:03.2344878Z stepping : 7 2025-05-07T19:43:03.2345098Z microcode : 0x5003901 2025-05-07T19:43:03.2345315Z cpu MHz : 2999.992 2025-05-07T19:43:03.2345548Z cache size : 36608 KB 2025-05-07T19:43:03.2345794Z physical id : 0 2025-05-07T19:43:03.2346000Z siblings : 48 2025-05-07T19:43:03.2346215Z core id : 19 2025-05-07T19:43:03.2346413Z cpu cores : 24 2025-05-07T19:43:03.2346635Z apicid : 38 2025-05-07T19:43:03.2346843Z initial apicid : 38 2025-05-07T19:43:03.2347079Z fpu : yes 2025-05-07T19:43:03.2347277Z fpu_exception : yes 2025-05-07T19:43:03.2347517Z cpuid level : 13 2025-05-07T19:43:03.2347724Z wp : yes 2025-05-07T19:43:03.2353926Z flags : fpu vme de pse tsc msr pae mce cx8 apic sep mtrr pge mca cmov pat pse36 clflush mmx fxsr sse sse2 ss ht syscall nx pdpe1gb rdtscp lm constant_tsc arch_perfmon rep_good nopl xtopology nonstop_tsc cpuid aperfmperf tsc_known_freq pni pclmulqdq monitor ssse3 fma cx16 pcid sse4_1 sse4_2 x2apic movbe popcnt tsc_deadline_timer aes xsave avx f16c rdrand hypervisor lahf_lm abm 3dnowprefetch invpcid_single pti fsgsbase tsc_adjust bmi1 avx2 smep bmi2 erms invpcid mpx avx512f avx512dq rdseed adx smap clflushopt clwb avx512cd avx512bw avx512vl xsaveopt xsavec xgetbv1 xsaves ida arat pku ospke avx512_vnni 2025-05-07T19:43:03.2356537Z bugs : cpu_meltdown spectre_v1 spectre_v2 spec_store_bypass l1tf mds swapgs itlb_multihit mmio_stale_data retbleed gds bhi 2025-05-07T19:43:03.2357097Z bogomips : 5999.98 2025-05-07T19:43:03.2357339Z clflush size : 64 2025-05-07T19:43:03.2357550Z cache_alignment : 64 2025-05-07T19:43:03.2357844Z address sizes : 46 bits physical, 48 bits virtual 2025-05-07T19:43:03.2358166Z power management: 2025-05-07T19:43:03.2358291Z 2025-05-07T19:43:03.2358371Z processor : 20 2025-05-07T19:43:03.2358576Z vendor_id : GenuineIntel 2025-05-07T19:43:03.2358787Z cpu family : 6 2025-05-07T19:43:03.2358973Z model : 85 2025-05-07T19:43:03.2359216Z model name : Intel(R) Xeon(R) Platinum 8275CL CPU @ 3.00GHz 2025-05-07T19:43:03.2359548Z stepping : 7 2025-05-07T19:43:03.2359728Z microcode : 0x5003901 2025-05-07T19:43:03.2359948Z cpu MHz : 2999.992 2025-05-07T19:43:03.2360139Z cache size : 36608 KB 2025-05-07T19:43:03.2360345Z physical id : 0 2025-05-07T19:43:03.2360533Z siblings : 48 2025-05-07T19:43:03.2360719Z core id : 20 2025-05-07T19:43:03.2360906Z cpu cores : 24 2025-05-07T19:43:03.2361080Z apicid : 40 2025-05-07T19:43:03.2361269Z initial apicid : 40 2025-05-07T19:43:03.2361449Z fpu : yes 2025-05-07T19:43:03.2361626Z fpu_exception : yes 2025-05-07T19:43:03.2361833Z cpuid level : 13 2025-05-07T19:43:03.2362036Z wp : yes 2025-05-07T19:43:03.2364120Z flags : fpu vme de pse tsc msr pae mce cx8 apic sep mtrr pge mca cmov pat pse36 clflush mmx fxsr sse sse2 ss ht syscall nx pdpe1gb rdtscp lm constant_tsc arch_perfmon rep_good nopl xtopology nonstop_tsc cpuid aperfmperf tsc_known_freq pni pclmulqdq monitor ssse3 fma cx16 pcid sse4_1 sse4_2 x2apic movbe popcnt tsc_deadline_timer aes xsave avx f16c rdrand hypervisor lahf_lm abm 3dnowprefetch invpcid_single pti fsgsbase tsc_adjust bmi1 avx2 smep bmi2 erms invpcid mpx avx512f avx512dq rdseed adx smap clflushopt clwb avx512cd avx512bw avx512vl xsaveopt xsavec xgetbv1 xsaves ida arat pku ospke avx512_vnni 2025-05-07T19:43:03.2366852Z bugs : cpu_meltdown spectre_v1 spectre_v2 spec_store_bypass l1tf mds swapgs itlb_multihit mmio_stale_data retbleed gds bhi 2025-05-07T19:43:03.2367434Z bogomips : 5999.98 2025-05-07T19:43:03.2367633Z clflush size : 64 2025-05-07T19:43:03.2367847Z cache_alignment : 64 2025-05-07T19:43:03.2368119Z address sizes : 46 bits physical, 48 bits virtual 2025-05-07T19:43:03.2368427Z power management: 2025-05-07T19:43:03.2368561Z 2025-05-07T19:43:03.2368647Z processor : 21 2025-05-07T19:43:03.2368857Z vendor_id : GenuineIntel 2025-05-07T19:43:03.2369086Z cpu family : 6 2025-05-07T19:43:03.2369284Z model : 85 2025-05-07T19:43:03.2369552Z model name : Intel(R) Xeon(R) Platinum 8275CL CPU @ 3.00GHz 2025-05-07T19:43:03.2369880Z stepping : 7 2025-05-07T19:43:03.2370089Z microcode : 0x5003901 2025-05-07T19:43:03.2370424Z cpu MHz : 3251.645 2025-05-07T19:43:03.2370817Z cache size : 36608 KB 2025-05-07T19:43:03.2371052Z physical id : 0 2025-05-07T19:43:03.2371270Z siblings : 48 2025-05-07T19:43:03.2371542Z core id : 21 2025-05-07T19:43:03.2371747Z cpu cores : 24 2025-05-07T19:43:03.2371962Z apicid : 42 2025-05-07T19:43:03.2372168Z initial apicid : 42 2025-05-07T19:43:03.2372395Z fpu : yes 2025-05-07T19:43:03.2372595Z fpu_exception : yes 2025-05-07T19:43:03.2372819Z cpuid level : 13 2025-05-07T19:43:03.2373013Z wp : yes 2025-05-07T19:43:03.2375334Z flags : fpu vme de pse tsc msr pae mce cx8 apic sep mtrr pge mca cmov pat pse36 clflush mmx fxsr sse sse2 ss ht syscall nx pdpe1gb rdtscp lm constant_tsc arch_perfmon rep_good nopl xtopology nonstop_tsc cpuid aperfmperf tsc_known_freq pni pclmulqdq monitor ssse3 fma cx16 pcid sse4_1 sse4_2 x2apic movbe popcnt tsc_deadline_timer aes xsave avx f16c rdrand hypervisor lahf_lm abm 3dnowprefetch invpcid_single pti fsgsbase tsc_adjust bmi1 avx2 smep bmi2 erms invpcid mpx avx512f avx512dq rdseed adx smap clflushopt clwb avx512cd avx512bw avx512vl xsaveopt xsavec xgetbv1 xsaves ida arat pku ospke avx512_vnni 2025-05-07T19:43:03.2377950Z bugs : cpu_meltdown spectre_v1 spectre_v2 spec_store_bypass l1tf mds swapgs itlb_multihit mmio_stale_data retbleed gds bhi 2025-05-07T19:43:03.2378530Z bogomips : 5999.98 2025-05-07T19:43:03.2378758Z clflush size : 64 2025-05-07T19:43:03.2378970Z cache_alignment : 64 2025-05-07T19:43:03.2379259Z address sizes : 46 bits physical, 48 bits virtual 2025-05-07T19:43:03.2379593Z power management: 2025-05-07T19:43:03.2379721Z 2025-05-07T19:43:03.2379806Z processor : 22 2025-05-07T19:43:03.2380029Z vendor_id : GenuineIntel 2025-05-07T19:43:03.2380273Z cpu family : 6 2025-05-07T19:43:03.2380483Z model : 85 2025-05-07T19:43:03.2380752Z model name : Intel(R) Xeon(R) Platinum 8275CL CPU @ 3.00GHz 2025-05-07T19:43:03.2381110Z stepping : 7 2025-05-07T19:43:03.2381313Z microcode : 0x5003901 2025-05-07T19:43:03.2381540Z cpu MHz : 2999.992 2025-05-07T19:43:03.2381744Z cache size : 36608 KB 2025-05-07T19:43:03.2381981Z physical id : 0 2025-05-07T19:43:03.2382197Z siblings : 48 2025-05-07T19:43:03.2382389Z core id : 22 2025-05-07T19:43:03.2382593Z cpu cores : 24 2025-05-07T19:43:03.2382788Z apicid : 44 2025-05-07T19:43:03.2383118Z initial apicid : 44 2025-05-07T19:43:03.2383316Z fpu : yes 2025-05-07T19:43:03.2383517Z fpu_exception : yes 2025-05-07T19:43:03.2383724Z cpuid level : 13 2025-05-07T19:43:03.2383928Z wp : yes 2025-05-07T19:43:03.2386106Z flags : fpu vme de pse tsc msr pae mce cx8 apic sep mtrr pge mca cmov pat pse36 clflush mmx fxsr sse sse2 ss ht syscall nx pdpe1gb rdtscp lm constant_tsc arch_perfmon rep_good nopl xtopology nonstop_tsc cpuid aperfmperf tsc_known_freq pni pclmulqdq monitor ssse3 fma cx16 pcid sse4_1 sse4_2 x2apic movbe popcnt tsc_deadline_timer aes xsave avx f16c rdrand hypervisor lahf_lm abm 3dnowprefetch invpcid_single pti fsgsbase tsc_adjust bmi1 avx2 smep bmi2 erms invpcid mpx avx512f avx512dq rdseed adx smap clflushopt clwb avx512cd avx512bw avx512vl xsaveopt xsavec xgetbv1 xsaves ida arat pku ospke avx512_vnni 2025-05-07T19:43:03.2388715Z bugs : cpu_meltdown spectre_v1 spectre_v2 spec_store_bypass l1tf mds swapgs itlb_multihit mmio_stale_data retbleed gds bhi 2025-05-07T19:43:03.2389287Z bogomips : 5999.98 2025-05-07T19:43:03.2389597Z clflush size : 64 2025-05-07T19:43:03.2389801Z cache_alignment : 64 2025-05-07T19:43:03.2390048Z address sizes : 46 bits physical, 48 bits virtual 2025-05-07T19:43:03.2390335Z power management: 2025-05-07T19:43:03.2390453Z 2025-05-07T19:43:03.2390543Z processor : 23 2025-05-07T19:43:03.2390737Z vendor_id : GenuineIntel 2025-05-07T19:43:03.2390964Z cpu family : 6 2025-05-07T19:43:03.2391149Z model : 85 2025-05-07T19:43:03.2391412Z model name : Intel(R) Xeon(R) Platinum 8275CL CPU @ 3.00GHz 2025-05-07T19:43:03.2391727Z stepping : 7 2025-05-07T19:43:03.2391926Z microcode : 0x5003901 2025-05-07T19:43:03.2392122Z cpu MHz : 2999.992 2025-05-07T19:43:03.2392330Z cache size : 36608 KB 2025-05-07T19:43:03.2392543Z physical id : 0 2025-05-07T19:43:03.2392731Z siblings : 48 2025-05-07T19:43:03.2392929Z core id : 23 2025-05-07T19:43:03.2393112Z cpu cores : 24 2025-05-07T19:43:03.2393302Z apicid : 46 2025-05-07T19:43:03.2393487Z initial apicid : 46 2025-05-07T19:43:03.2393685Z fpu : yes 2025-05-07T19:43:03.2393858Z fpu_exception : yes 2025-05-07T19:43:03.2394062Z cpuid level : 13 2025-05-07T19:43:03.2394242Z wp : yes 2025-05-07T19:43:03.2396380Z flags : fpu vme de pse tsc msr pae mce cx8 apic sep mtrr pge mca cmov pat pse36 clflush mmx fxsr sse sse2 ss ht syscall nx pdpe1gb rdtscp lm constant_tsc arch_perfmon rep_good nopl xtopology nonstop_tsc cpuid aperfmperf tsc_known_freq pni pclmulqdq monitor ssse3 fma cx16 pcid sse4_1 sse4_2 x2apic movbe popcnt tsc_deadline_timer aes xsave avx f16c rdrand hypervisor lahf_lm abm 3dnowprefetch invpcid_single pti fsgsbase tsc_adjust bmi1 avx2 smep bmi2 erms invpcid mpx avx512f avx512dq rdseed adx smap clflushopt clwb avx512cd avx512bw avx512vl xsaveopt xsavec xgetbv1 xsaves ida arat pku ospke avx512_vnni 2025-05-07T19:43:03.2398966Z bugs : cpu_meltdown spectre_v1 spectre_v2 spec_store_bypass l1tf mds swapgs itlb_multihit mmio_stale_data retbleed gds bhi 2025-05-07T19:43:03.2399494Z bogomips : 5999.98 2025-05-07T19:43:03.2399691Z clflush size : 64 2025-05-07T19:43:03.2399895Z cache_alignment : 64 2025-05-07T19:43:03.2400138Z address sizes : 46 bits physical, 48 bits virtual 2025-05-07T19:43:03.2400439Z power management: 2025-05-07T19:43:03.2400558Z 2025-05-07T19:43:03.2400641Z processor : 24 2025-05-07T19:43:03.2400838Z vendor_id : GenuineIntel 2025-05-07T19:43:03.2401060Z cpu family : 6 2025-05-07T19:43:03.2401235Z model : 85 2025-05-07T19:43:03.2401484Z model name : Intel(R) Xeon(R) Platinum 8275CL CPU @ 3.00GHz 2025-05-07T19:43:03.2401793Z stepping : 7 2025-05-07T19:43:03.2401986Z microcode : 0x5003901 2025-05-07T19:43:03.2402186Z cpu MHz : 1205.517 2025-05-07T19:43:03.2402383Z cache size : 36608 KB 2025-05-07T19:43:03.2402577Z physical id : 1 2025-05-07T19:43:03.2402769Z siblings : 48 2025-05-07T19:43:03.2402939Z core id : 0 2025-05-07T19:43:03.2403121Z cpu cores : 24 2025-05-07T19:43:03.2403293Z apicid : 64 2025-05-07T19:43:03.2403480Z initial apicid : 64 2025-05-07T19:43:03.2403667Z fpu : yes 2025-05-07T19:43:03.2403848Z fpu_exception : yes 2025-05-07T19:43:03.2404047Z cpuid level : 13 2025-05-07T19:43:03.2404226Z wp : yes 2025-05-07T19:43:03.2406312Z flags : fpu vme de pse tsc msr pae mce cx8 apic sep mtrr pge mca cmov pat pse36 clflush mmx fxsr sse sse2 ss ht syscall nx pdpe1gb rdtscp lm constant_tsc arch_perfmon rep_good nopl xtopology nonstop_tsc cpuid aperfmperf tsc_known_freq pni pclmulqdq monitor ssse3 fma cx16 pcid sse4_1 sse4_2 x2apic movbe popcnt tsc_deadline_timer aes xsave avx f16c rdrand hypervisor lahf_lm abm 3dnowprefetch invpcid_single pti fsgsbase tsc_adjust bmi1 avx2 smep bmi2 erms invpcid mpx avx512f avx512dq rdseed adx smap clflushopt clwb avx512cd avx512bw avx512vl xsaveopt xsavec xgetbv1 xsaves ida arat pku ospke avx512_vnni 2025-05-07T19:43:03.2408773Z bugs : cpu_meltdown spectre_v1 spectre_v2 spec_store_bypass l1tf mds swapgs itlb_multihit mmio_stale_data retbleed gds bhi 2025-05-07T19:43:03.2409300Z bogomips : 5999.98 2025-05-07T19:43:03.2409500Z clflush size : 64 2025-05-07T19:43:03.2409690Z cache_alignment : 64 2025-05-07T19:43:03.2409930Z address sizes : 46 bits physical, 48 bits virtual 2025-05-07T19:43:03.2410303Z power management: 2025-05-07T19:43:03.2410435Z 2025-05-07T19:43:03.2410511Z processor : 25 2025-05-07T19:43:03.2410879Z vendor_id : GenuineIntel 2025-05-07T19:43:03.2411121Z cpu family : 6 2025-05-07T19:43:03.2411319Z model : 85 2025-05-07T19:43:03.2411612Z model name : Intel(R) Xeon(R) Platinum 8275CL CPU @ 3.00GHz 2025-05-07T19:43:03.2411967Z stepping : 7 2025-05-07T19:43:03.2412157Z microcode : 0x5003901 2025-05-07T19:43:03.2412377Z cpu MHz : 1199.996 2025-05-07T19:43:03.2412576Z cache size : 36608 KB 2025-05-07T19:43:03.2412797Z physical id : 1 2025-05-07T19:43:03.2412988Z siblings : 48 2025-05-07T19:43:03.2413186Z core id : 1 2025-05-07T19:43:03.2413366Z cpu cores : 24 2025-05-07T19:43:03.2413709Z apicid : 66 2025-05-07T19:43:03.2413903Z initial apicid : 66 2025-05-07T19:43:03.2414117Z fpu : yes 2025-05-07T19:43:03.2414315Z fpu_exception : yes 2025-05-07T19:43:03.2414522Z cpuid level : 13 2025-05-07T19:43:03.2414732Z wp : yes 2025-05-07T19:43:03.2417042Z flags : fpu vme de pse tsc msr pae mce cx8 apic sep mtrr pge mca cmov pat pse36 clflush mmx fxsr sse sse2 ss ht syscall nx pdpe1gb rdtscp lm constant_tsc arch_perfmon rep_good nopl xtopology nonstop_tsc cpuid aperfmperf tsc_known_freq pni pclmulqdq monitor ssse3 fma cx16 pcid sse4_1 sse4_2 x2apic movbe popcnt tsc_deadline_timer aes xsave avx f16c rdrand hypervisor lahf_lm abm 3dnowprefetch invpcid_single pti fsgsbase tsc_adjust bmi1 avx2 smep bmi2 erms invpcid mpx avx512f avx512dq rdseed adx smap clflushopt clwb avx512cd avx512bw avx512vl xsaveopt xsavec xgetbv1 xsaves ida arat pku ospke avx512_vnni 2025-05-07T19:43:03.2419683Z bugs : cpu_meltdown spectre_v1 spectre_v2 spec_store_bypass l1tf mds swapgs itlb_multihit mmio_stale_data retbleed gds bhi 2025-05-07T19:43:03.2420279Z bogomips : 5999.98 2025-05-07T19:43:03.2420500Z clflush size : 64 2025-05-07T19:43:03.2420714Z cache_alignment : 64 2025-05-07T19:43:03.2420972Z address sizes : 46 bits physical, 48 bits virtual 2025-05-07T19:43:03.2421311Z power management: 2025-05-07T19:43:03.2421440Z 2025-05-07T19:43:03.2421525Z processor : 26 2025-05-07T19:43:03.2421750Z vendor_id : GenuineIntel 2025-05-07T19:43:03.2422000Z cpu family : 6 2025-05-07T19:43:03.2422198Z model : 85 2025-05-07T19:43:03.2422482Z model name : Intel(R) Xeon(R) Platinum 8275CL CPU @ 3.00GHz 2025-05-07T19:43:03.2422825Z stepping : 7 2025-05-07T19:43:03.2423160Z microcode : 0x5003901 2025-05-07T19:43:03.2423366Z cpu MHz : 1198.469 2025-05-07T19:43:03.2423579Z cache size : 36608 KB 2025-05-07T19:43:03.2423783Z physical id : 1 2025-05-07T19:43:03.2423978Z siblings : 48 2025-05-07T19:43:03.2424159Z core id : 2 2025-05-07T19:43:03.2424355Z cpu cores : 24 2025-05-07T19:43:03.2424530Z apicid : 68 2025-05-07T19:43:03.2424717Z initial apicid : 68 2025-05-07T19:43:03.2424913Z fpu : yes 2025-05-07T19:43:03.2425095Z fpu_exception : yes 2025-05-07T19:43:03.2425300Z cpuid level : 13 2025-05-07T19:43:03.2425486Z wp : yes 2025-05-07T19:43:03.2427566Z flags : fpu vme de pse tsc msr pae mce cx8 apic sep mtrr pge mca cmov pat pse36 clflush mmx fxsr sse sse2 ss ht syscall nx pdpe1gb rdtscp lm constant_tsc arch_perfmon rep_good nopl xtopology nonstop_tsc cpuid aperfmperf tsc_known_freq pni pclmulqdq monitor ssse3 fma cx16 pcid sse4_1 sse4_2 x2apic movbe popcnt tsc_deadline_timer aes xsave avx f16c rdrand hypervisor lahf_lm abm 3dnowprefetch invpcid_single pti fsgsbase tsc_adjust bmi1 avx2 smep bmi2 erms invpcid mpx avx512f avx512dq rdseed adx smap clflushopt clwb avx512cd avx512bw avx512vl xsaveopt xsavec xgetbv1 xsaves ida arat pku ospke avx512_vnni 2025-05-07T19:43:03.2430483Z bugs : cpu_meltdown spectre_v1 spectre_v2 spec_store_bypass l1tf mds swapgs itlb_multihit mmio_stale_data retbleed gds bhi 2025-05-07T19:43:03.2431052Z bogomips : 5999.98 2025-05-07T19:43:03.2431298Z clflush size : 64 2025-05-07T19:43:03.2431503Z cache_alignment : 64 2025-05-07T19:43:03.2431779Z address sizes : 46 bits physical, 48 bits virtual 2025-05-07T19:43:03.2432092Z power management: 2025-05-07T19:43:03.2432236Z 2025-05-07T19:43:03.2432318Z processor : 27 2025-05-07T19:43:03.2432531Z vendor_id : GenuineIntel 2025-05-07T19:43:03.2432753Z cpu family : 6 2025-05-07T19:43:03.2432961Z model : 85 2025-05-07T19:43:03.2433228Z model name : Intel(R) Xeon(R) Platinum 8275CL CPU @ 3.00GHz 2025-05-07T19:43:03.2433594Z stepping : 7 2025-05-07T19:43:03.2433811Z microcode : 0x5003901 2025-05-07T19:43:03.2434058Z cpu MHz : 2999.992 2025-05-07T19:43:03.2434275Z cache size : 36608 KB 2025-05-07T19:43:03.2434503Z physical id : 1 2025-05-07T19:43:03.2434708Z siblings : 48 2025-05-07T19:43:03.2434917Z core id : 3 2025-05-07T19:43:03.2435108Z cpu cores : 24 2025-05-07T19:43:03.2435320Z apicid : 70 2025-05-07T19:43:03.2435514Z initial apicid : 70 2025-05-07T19:43:03.2435730Z fpu : yes 2025-05-07T19:43:03.2435928Z fpu_exception : yes 2025-05-07T19:43:03.2436132Z cpuid level : 13 2025-05-07T19:43:03.2436341Z wp : yes 2025-05-07T19:43:03.2438686Z flags : fpu vme de pse tsc msr pae mce cx8 apic sep mtrr pge mca cmov pat pse36 clflush mmx fxsr sse sse2 ss ht syscall nx pdpe1gb rdtscp lm constant_tsc arch_perfmon rep_good nopl xtopology nonstop_tsc cpuid aperfmperf tsc_known_freq pni pclmulqdq monitor ssse3 fma cx16 pcid sse4_1 sse4_2 x2apic movbe popcnt tsc_deadline_timer aes xsave avx f16c rdrand hypervisor lahf_lm abm 3dnowprefetch invpcid_single pti fsgsbase tsc_adjust bmi1 avx2 smep bmi2 erms invpcid mpx avx512f avx512dq rdseed adx smap clflushopt clwb avx512cd avx512bw avx512vl xsaveopt xsavec xgetbv1 xsaves ida arat pku ospke avx512_vnni 2025-05-07T19:43:03.2441399Z bugs : cpu_meltdown spectre_v1 spectre_v2 spec_store_bypass l1tf mds swapgs itlb_multihit mmio_stale_data retbleed gds bhi 2025-05-07T19:43:03.2441947Z bogomips : 5999.98 2025-05-07T19:43:03.2442142Z clflush size : 64 2025-05-07T19:43:03.2442341Z cache_alignment : 64 2025-05-07T19:43:03.2442585Z address sizes : 46 bits physical, 48 bits virtual 2025-05-07T19:43:03.2442909Z power management: 2025-05-07T19:43:03.2443037Z 2025-05-07T19:43:03.2443113Z processor : 28 2025-05-07T19:43:03.2443324Z vendor_id : GenuineIntel 2025-05-07T19:43:03.2443550Z cpu family : 6 2025-05-07T19:43:03.2443733Z model : 85 2025-05-07T19:43:03.2443994Z model name : Intel(R) Xeon(R) Platinum 8275CL CPU @ 3.00GHz 2025-05-07T19:43:03.2444354Z stepping : 7 2025-05-07T19:43:03.2444549Z microcode : 0x5003901 2025-05-07T19:43:03.2444750Z cpu MHz : 2999.992 2025-05-07T19:43:03.2444952Z cache size : 36608 KB 2025-05-07T19:43:03.2445171Z physical id : 1 2025-05-07T19:43:03.2445415Z siblings : 48 2025-05-07T19:43:03.2445623Z core id : 4 2025-05-07T19:43:03.2445857Z cpu cores : 24 2025-05-07T19:43:03.2446067Z apicid : 72 2025-05-07T19:43:03.2446299Z initial apicid : 72 2025-05-07T19:43:03.2446540Z fpu : yes 2025-05-07T19:43:03.2446749Z fpu_exception : yes 2025-05-07T19:43:03.2446997Z cpuid level : 13 2025-05-07T19:43:03.2447204Z wp : yes 2025-05-07T19:43:03.2449326Z flags : fpu vme de pse tsc msr pae mce cx8 apic sep mtrr pge mca cmov pat pse36 clflush mmx fxsr sse sse2 ss ht syscall nx pdpe1gb rdtscp lm constant_tsc arch_perfmon rep_good nopl xtopology nonstop_tsc cpuid aperfmperf tsc_known_freq pni pclmulqdq monitor ssse3 fma cx16 pcid sse4_1 sse4_2 x2apic movbe popcnt tsc_deadline_timer aes xsave avx f16c rdrand hypervisor lahf_lm abm 3dnowprefetch invpcid_single pti fsgsbase tsc_adjust bmi1 avx2 smep bmi2 erms invpcid mpx avx512f avx512dq rdseed adx smap clflushopt clwb avx512cd avx512bw avx512vl xsaveopt xsavec xgetbv1 xsaves ida arat pku ospke avx512_vnni 2025-05-07T19:43:03.2452219Z bugs : cpu_meltdown spectre_v1 spectre_v2 spec_store_bypass l1tf mds swapgs itlb_multihit mmio_stale_data retbleed gds bhi 2025-05-07T19:43:03.2452813Z bogomips : 5999.98 2025-05-07T19:43:03.2453080Z clflush size : 64 2025-05-07T19:43:03.2453310Z cache_alignment : 64 2025-05-07T19:43:03.2453622Z address sizes : 46 bits physical, 48 bits virtual 2025-05-07T19:43:03.2453960Z power management: 2025-05-07T19:43:03.2454125Z 2025-05-07T19:43:03.2454218Z processor : 29 2025-05-07T19:43:03.2454474Z vendor_id : GenuineIntel 2025-05-07T19:43:03.2454731Z cpu family : 6 2025-05-07T19:43:03.2454971Z model : 85 2025-05-07T19:43:03.2455259Z model name : Intel(R) Xeon(R) Platinum 8275CL CPU @ 3.00GHz 2025-05-07T19:43:03.2455638Z stepping : 7 2025-05-07T19:43:03.2455860Z microcode : 0x5003901 2025-05-07T19:43:03.2456131Z cpu MHz : 1190.052 2025-05-07T19:43:03.2456365Z cache size : 36608 KB 2025-05-07T19:43:03.2456639Z physical id : 1 2025-05-07T19:43:03.2456875Z siblings : 48 2025-05-07T19:43:03.2457121Z core id : 5 2025-05-07T19:43:03.2457335Z cpu cores : 24 2025-05-07T19:43:03.2457575Z apicid : 74 2025-05-07T19:43:03.2457793Z initial apicid : 74 2025-05-07T19:43:03.2458047Z fpu : yes 2025-05-07T19:43:03.2458294Z fpu_exception : yes 2025-05-07T19:43:03.2458525Z cpuid level : 13 2025-05-07T19:43:03.2458768Z wp : yes 2025-05-07T19:43:03.2461119Z flags : fpu vme de pse tsc msr pae mce cx8 apic sep mtrr pge mca cmov pat pse36 clflush mmx fxsr sse sse2 ss ht syscall nx pdpe1gb rdtscp lm constant_tsc arch_perfmon rep_good nopl xtopology nonstop_tsc cpuid aperfmperf tsc_known_freq pni pclmulqdq monitor ssse3 fma cx16 pcid sse4_1 sse4_2 x2apic movbe popcnt tsc_deadline_timer aes xsave avx f16c rdrand hypervisor lahf_lm abm 3dnowprefetch invpcid_single pti fsgsbase tsc_adjust bmi1 avx2 smep bmi2 erms invpcid mpx avx512f avx512dq rdseed adx smap clflushopt clwb avx512cd avx512bw avx512vl xsaveopt xsavec xgetbv1 xsaves ida arat pku ospke avx512_vnni 2025-05-07T19:43:03.2463847Z bugs : cpu_meltdown spectre_v1 spectre_v2 spec_store_bypass l1tf mds swapgs itlb_multihit mmio_stale_data retbleed gds bhi 2025-05-07T19:43:03.2464441Z bogomips : 5999.98 2025-05-07T19:43:03.2464667Z clflush size : 64 2025-05-07T19:43:03.2464922Z cache_alignment : 64 2025-05-07T19:43:03.2465200Z address sizes : 46 bits physical, 48 bits virtual 2025-05-07T19:43:03.2465548Z power management: 2025-05-07T19:43:03.2465680Z 2025-05-07T19:43:03.2465766Z processor : 30 2025-05-07T19:43:03.2466005Z vendor_id : GenuineIntel 2025-05-07T19:43:03.2466267Z cpu family : 6 2025-05-07T19:43:03.2466458Z model : 85 2025-05-07T19:43:03.2466750Z model name : Intel(R) Xeon(R) Platinum 8275CL CPU @ 3.00GHz 2025-05-07T19:43:03.2467089Z stepping : 7 2025-05-07T19:43:03.2467321Z microcode : 0x5003901 2025-05-07T19:43:03.2467547Z cpu MHz : 1205.894 2025-05-07T19:43:03.2467794Z cache size : 36608 KB 2025-05-07T19:43:03.2468481Z lspci: Unable to load libkmod resources: error -2 2025-05-07T19:43:03.2468830Z physical id : 1 2025-05-07T19:43:03.2469042Z siblings : 48 2025-05-07T19:43:03.2469268Z core id : 6 2025-05-07T19:43:03.2469496Z cpu cores : 24 2025-05-07T19:43:03.2469699Z apicid : 76 2025-05-07T19:43:03.2469924Z initial apicid : 76 2025-05-07T19:43:03.2470140Z fpu : yes 2025-05-07T19:43:03.2470366Z fpu_exception : yes 2025-05-07T19:43:03.2470585Z cpuid level : 13 2025-05-07T19:43:03.2470863Z wp : yes 2025-05-07T19:43:03.2472960Z flags : fpu vme de pse tsc msr pae mce cx8 apic sep mtrr pge mca cmov pat pse36 clflush mmx fxsr sse sse2 ss ht syscall nx pdpe1gb rdtscp lm constant_tsc arch_perfmon rep_good nopl xtopology nonstop_tsc cpuid aperfmperf tsc_known_freq pni pclmulqdq monitor ssse3 fma cx16 pcid sse4_1 sse4_2 x2apic movbe popcnt tsc_deadline_timer aes xsave avx f16c rdrand hypervisor lahf_lm abm 3dnowprefetch invpcid_single pti fsgsbase tsc_adjust bmi1 avx2 smep bmi2 erms invpcid mpx avx512f avx512dq rdseed adx smap clflushopt clwb avx512cd avx512bw avx512vl xsaveopt xsavec xgetbv1 xsaves ida arat pku ospke avx512_vnni 2025-05-07T19:43:03.2475473Z bugs : cpu_meltdown spectre_v1 spectre_v2 spec_store_bypass l1tf mds swapgs itlb_multihit mmio_stale_data retbleed gds bhi 2025-05-07T19:43:03.2476045Z bogomips : 5999.98 2025-05-07T19:43:03.2476265Z clflush size : 64 2025-05-07T19:43:03.2476513Z cache_alignment : 64 2025-05-07T19:43:03.2476780Z address sizes : 46 bits physical, 48 bits virtual 2025-05-07T19:43:03.2477113Z power management: 2025-05-07T19:43:03.2477244Z 2025-05-07T19:43:03.2477356Z processor : 31 2025-05-07T19:43:03.2477579Z vendor_id : GenuineIntel 2025-05-07T19:43:03.2477847Z cpu family : 6 2025-05-07T19:43:03.2478056Z model : 85 2025-05-07T19:43:03.2478362Z model name : Intel(R) Xeon(R) Platinum 8275CL CPU @ 3.00GHz 2025-05-07T19:43:03.2478702Z stepping : 7 2025-05-07T19:43:03.2478939Z microcode : 0x5003901 2025-05-07T19:43:03.2479171Z cpu MHz : 1207.513 2025-05-07T19:43:03.2479413Z cache size : 36608 KB 2025-05-07T19:43:03.2479637Z physical id : 1 2025-05-07T19:43:03.2479876Z siblings : 48 2025-05-07T19:43:03.2480107Z core id : 7 2025-05-07T19:43:03.2480311Z cpu cores : 24 2025-05-07T19:43:03.2480541Z apicid : 78 2025-05-07T19:43:03.2480741Z initial apicid : 78 2025-05-07T19:43:03.2480962Z fpu : yes 2025-05-07T19:43:03.2481155Z fpu_exception : yes 2025-05-07T19:43:03.2481370Z cpuid level : 13 2025-05-07T19:43:03.2481557Z wp : yes 2025-05-07T19:43:03.2483686Z flags : fpu vme de pse tsc msr pae mce cx8 apic sep mtrr pge mca cmov pat pse36 clflush mmx fxsr sse sse2 ss ht syscall nx pdpe1gb rdtscp lm constant_tsc arch_perfmon rep_good nopl xtopology nonstop_tsc cpuid aperfmperf tsc_known_freq pni pclmulqdq monitor ssse3 fma cx16 pcid sse4_1 sse4_2 x2apic movbe popcnt tsc_deadline_timer aes xsave avx f16c rdrand hypervisor lahf_lm abm 3dnowprefetch invpcid_single pti fsgsbase tsc_adjust bmi1 avx2 smep bmi2 erms invpcid mpx avx512f avx512dq rdseed adx smap clflushopt clwb avx512cd avx512bw avx512vl xsaveopt xsavec xgetbv1 xsaves ida arat pku ospke avx512_vnni 2025-05-07T19:43:03.2486098Z bugs : cpu_meltdown spectre_v1 spectre_v2 spec_store_bypass l1tf mds swapgs itlb_multihit mmio_stale_data retbleed gds bhi 2025-05-07T19:43:03.2486627Z bogomips : 5999.98 2025-05-07T19:43:03.2486837Z clflush size : 64 2025-05-07T19:43:03.2487038Z cache_alignment : 64 2025-05-07T19:43:03.2487310Z address sizes : 46 bits physical, 48 bits virtual 2025-05-07T19:43:03.2487629Z power management: 2025-05-07T19:43:03.2487754Z 2025-05-07T19:43:03.2487831Z processor : 32 2025-05-07T19:43:03.2488059Z vendor_id : GenuineIntel 2025-05-07T19:43:03.2488284Z cpu family : 6 2025-05-07T19:43:03.2488493Z model : 85 2025-05-07T19:43:03.2488743Z model name : Intel(R) Xeon(R) Platinum 8275CL CPU @ 3.00GHz 2025-05-07T19:43:03.2489073Z stepping : 7 2025-05-07T19:43:03.2489265Z microcode : 0x5003901 2025-05-07T19:43:03.2489482Z cpu MHz : 1199.607 2025-05-07T19:43:03.2489687Z cache size : 36608 KB 2025-05-07T19:43:03.2489912Z physical id : 1 2025-05-07T19:43:03.2490100Z siblings : 48 2025-05-07T19:43:03.2490387Z core id : 8 2025-05-07T19:43:03.2490756Z cpu cores : 24 2025-05-07T19:43:03.2490961Z apicid : 80 2025-05-07T19:43:03.2491181Z initial apicid : 80 2025-05-07T19:43:03.2491393Z fpu : yes 2025-05-07T19:43:03.2491606Z fpu_exception : yes 2025-05-07T19:43:03.2491813Z cpuid level : 13 2025-05-07T19:43:03.2492025Z wp : yes 2025-05-07T19:43:03.2494438Z flags : fpu vme de pse tsc msr pae mce cx8 apic sep mtrr pge mca cmov pat pse36 clflush mmx fxsr sse sse2 ss ht syscall nx pdpe1gb rdtscp lm constant_tsc arch_perfmon rep_good nopl xtopology nonstop_tsc cpuid aperfmperf tsc_known_freq pni pclmulqdq monitor ssse3 fma cx16 pcid sse4_1 sse4_2 x2apic movbe popcnt tsc_deadline_timer aes xsave avx f16c rdrand hypervisor lahf_lm abm 3dnowprefetch invpcid_single pti fsgsbase tsc_adjust bmi1 avx2 smep bmi2 erms invpcid mpx avx512f avx512dq rdseed adx smap clflushopt clwb avx512cd avx512bw avx512vl xsaveopt xsavec xgetbv1 xsaves ida arat pku ospke avx512_vnni 2025-05-07T19:43:03.2497055Z bugs : cpu_meltdown spectre_v1 spectre_v2 spec_store_bypass l1tf mds swapgs itlb_multihit mmio_stale_data retbleed gds bhi 2025-05-07T19:43:03.2497700Z bogomips : 5999.98 2025-05-07T19:43:03.2497914Z clflush size : 64 2025-05-07T19:43:03.2498145Z cache_alignment : 64 2025-05-07T19:43:03.2498421Z address sizes : 46 bits physical, 48 bits virtual 2025-05-07T19:43:03.2498753Z power management: 2025-05-07T19:43:03.2498886Z 2025-05-07T19:43:03.2498990Z processor : 33 2025-05-07T19:43:03.2499199Z vendor_id : GenuineIntel 2025-05-07T19:43:03.2499436Z cpu family : 6 2025-05-07T19:43:03.2499635Z model : 85 2025-05-07T19:43:03.2499916Z model name : Intel(R) Xeon(R) Platinum 8275CL CPU @ 3.00GHz 2025-05-07T19:43:03.2500257Z stepping : 7 2025-05-07T19:43:03.2500462Z microcode : 0x5003901 2025-05-07T19:43:03.2500689Z cpu MHz : 1199.191 2025-05-07T19:43:03.2500913Z cache size : 36608 KB 2025-05-07T19:43:03.2501133Z physical id : 1 2025-05-07T19:43:03.2501351Z siblings : 48 2025-05-07T19:43:03.2501564Z core id : 9 2025-05-07T19:43:03.2501767Z cpu cores : 24 2025-05-07T19:43:03.2501985Z apicid : 82 2025-05-07T19:43:03.2502187Z initial apicid : 82 2025-05-07T19:43:03.2502411Z fpu : yes 2025-05-07T19:43:03.2502605Z fpu_exception : yes 2025-05-07T19:43:03.2502829Z cpuid level : 13 2025-05-07T19:43:03.2503139Z wp : yes 2025-05-07T19:43:03.2505236Z flags : fpu vme de pse tsc msr pae mce cx8 apic sep mtrr pge mca cmov pat pse36 clflush mmx fxsr sse sse2 ss ht syscall nx pdpe1gb rdtscp lm constant_tsc arch_perfmon rep_good nopl xtopology nonstop_tsc cpuid aperfmperf tsc_known_freq pni pclmulqdq monitor ssse3 fma cx16 pcid sse4_1 sse4_2 x2apic movbe popcnt tsc_deadline_timer aes xsave avx f16c rdrand hypervisor lahf_lm abm 3dnowprefetch invpcid_single pti fsgsbase tsc_adjust bmi1 avx2 smep bmi2 erms invpcid mpx avx512f avx512dq rdseed adx smap clflushopt clwb avx512cd avx512bw avx512vl xsaveopt xsavec xgetbv1 xsaves ida arat pku ospke avx512_vnni 2025-05-07T19:43:03.2507734Z bugs : cpu_meltdown spectre_v1 spectre_v2 spec_store_bypass l1tf mds swapgs itlb_multihit mmio_stale_data retbleed gds bhi 2025-05-07T19:43:03.2508280Z bogomips : 5999.98 2025-05-07T19:43:03.2508491Z clflush size : 64 2025-05-07T19:43:03.2508687Z cache_alignment : 64 2025-05-07T19:43:03.2508945Z address sizes : 46 bits physical, 48 bits virtual 2025-05-07T19:43:03.2509270Z power management: 2025-05-07T19:43:03.2509397Z 2025-05-07T19:43:03.2509479Z processor : 34 2025-05-07T19:43:03.2509696Z vendor_id : GenuineIntel 2025-05-07T19:43:03.2509917Z cpu family : 6 2025-05-07T19:43:03.2510118Z model : 85 2025-05-07T19:43:03.2510366Z model name : Intel(R) Xeon(R) Platinum 8275CL CPU @ 3.00GHz 2025-05-07T19:43:03.2510702Z stepping : 7 2025-05-07T19:43:03.2510895Z microcode : 0x5003901 2025-05-07T19:43:03.2511113Z cpu MHz : 2999.992 2025-05-07T19:43:03.2511310Z cache size : 36608 KB 2025-05-07T19:43:03.2511527Z physical id : 1 2025-05-07T19:43:03.2511710Z siblings : 48 2025-05-07T19:43:03.2511911Z core id : 10 2025-05-07T19:43:03.2512109Z cpu cores : 24 2025-05-07T19:43:03.2512292Z apicid : 84 2025-05-07T19:43:03.2512507Z initial apicid : 84 2025-05-07T19:43:03.2512703Z fpu : yes 2025-05-07T19:43:03.2512902Z fpu_exception : yes 2025-05-07T19:43:03.2513102Z cpuid level : 13 2025-05-07T19:43:03.2513303Z wp : yes 2025-05-07T19:43:03.2515370Z flags : fpu vme de pse tsc msr pae mce cx8 apic sep mtrr pge mca cmov pat pse36 clflush mmx fxsr sse sse2 ss ht syscall nx pdpe1gb rdtscp lm constant_tsc arch_perfmon rep_good nopl xtopology nonstop_tsc cpuid aperfmperf tsc_known_freq pni pclmulqdq monitor ssse3 fma cx16 pcid sse4_1 sse4_2 x2apic movbe popcnt tsc_deadline_timer aes xsave avx f16c rdrand hypervisor lahf_lm abm 3dnowprefetch invpcid_single pti fsgsbase tsc_adjust bmi1 avx2 smep bmi2 erms invpcid mpx avx512f avx512dq rdseed adx smap clflushopt clwb avx512cd avx512bw avx512vl xsaveopt xsavec xgetbv1 xsaves ida arat pku ospke avx512_vnni 2025-05-07T19:43:03.2517793Z bugs : cpu_meltdown spectre_v1 spectre_v2 spec_store_bypass l1tf mds swapgs itlb_multihit mmio_stale_data retbleed gds bhi 2025-05-07T19:43:03.2518386Z bogomips : 5999.98 2025-05-07T19:43:03.2518580Z clflush size : 64 2025-05-07T19:43:03.2518791Z cache_alignment : 64 2025-05-07T19:43:03.2519036Z address sizes : 46 bits physical, 48 bits virtual 2025-05-07T19:43:03.2519346Z power management: 2025-05-07T19:43:03.2519466Z 2025-05-07T19:43:03.2519564Z processor : 35 2025-05-07T19:43:03.2519759Z vendor_id : GenuineIntel 2025-05-07T19:43:03.2519999Z cpu family : 6 2025-05-07T19:43:03.2520180Z model : 85 2025-05-07T19:43:03.2520428Z model name : Intel(R) Xeon(R) Platinum 8275CL CPU @ 3.00GHz 2025-05-07T19:43:03.2520736Z stepping : 7 2025-05-07T19:43:03.2520916Z microcode : 0x5003901 2025-05-07T19:43:03.2521108Z cpu MHz : 2999.992 2025-05-07T19:43:03.2521296Z cache size : 36608 KB 2025-05-07T19:43:03.2521486Z physical id : 1 2025-05-07T19:43:03.2521673Z siblings : 48 2025-05-07T19:43:03.2521857Z core id : 11 2025-05-07T19:43:03.2522027Z cpu cores : 24 2025-05-07T19:43:03.2522218Z apicid : 86 2025-05-07T19:43:03.2522395Z initial apicid : 86 2025-05-07T19:43:03.2522594Z fpu : yes 2025-05-07T19:43:03.2522770Z fpu_exception : yes 2025-05-07T19:43:03.2522977Z cpuid level : 13 2025-05-07T19:43:03.2523171Z wp : yes 2025-05-07T19:43:03.2525255Z flags : fpu vme de pse tsc msr pae mce cx8 apic sep mtrr pge mca cmov pat pse36 clflush mmx fxsr sse sse2 ss ht syscall nx pdpe1gb rdtscp lm constant_tsc arch_perfmon rep_good nopl xtopology nonstop_tsc cpuid aperfmperf tsc_known_freq pni pclmulqdq monitor ssse3 fma cx16 pcid sse4_1 sse4_2 x2apic movbe popcnt tsc_deadline_timer aes xsave avx f16c rdrand hypervisor lahf_lm abm 3dnowprefetch invpcid_single pti fsgsbase tsc_adjust bmi1 avx2 smep bmi2 erms invpcid mpx avx512f avx512dq rdseed adx smap clflushopt clwb avx512cd avx512bw avx512vl xsaveopt xsavec xgetbv1 xsaves ida arat pku ospke avx512_vnni 2025-05-07T19:43:03.2527730Z bugs : cpu_meltdown spectre_v1 spectre_v2 spec_store_bypass l1tf mds swapgs itlb_multihit mmio_stale_data retbleed gds bhi 2025-05-07T19:43:03.2528261Z bogomips : 5999.98 2025-05-07T19:43:03.2528602Z clflush size : 64 2025-05-07T19:43:03.2528963Z cache_alignment : 64 2025-05-07T19:43:03.2529241Z address sizes : 46 bits physical, 48 bits virtual 2025-05-07T19:43:03.2529590Z power management: 2025-05-07T19:43:03.2529722Z 2025-05-07T19:43:03.2529806Z processor : 36 2025-05-07T19:43:03.2530057Z vendor_id : GenuineIntel 2025-05-07T19:43:03.2530366Z cpu family : 6 2025-05-07T19:43:03.2530594Z model : 85 2025-05-07T19:43:03.2530864Z model name : Intel(R) Xeon(R) Platinum 8275CL CPU @ 3.00GHz 2025-05-07T19:43:03.2531233Z stepping : 7 2025-05-07T19:43:03.2531429Z microcode : 0x5003901 2025-05-07T19:43:03.2531653Z cpu MHz : 1199.689 2025-05-07T19:43:03.2531858Z cache size : 36608 KB 2025-05-07T19:43:03.2532081Z physical id : 1 2025-05-07T19:43:03.2532274Z siblings : 48 2025-05-07T19:43:03.2532474Z core id : 12 2025-05-07T19:43:03.2532679Z cpu cores : 24 2025-05-07T19:43:03.2532867Z apicid : 88 2025-05-07T19:43:03.2533068Z initial apicid : 88 2025-05-07T19:43:03.2533265Z fpu : yes 2025-05-07T19:43:03.2533466Z fpu_exception : yes 2025-05-07T19:43:03.2533671Z cpuid level : 13 2025-05-07T19:43:03.2533870Z wp : yes 2025-05-07T19:43:03.2536093Z flags : fpu vme de pse tsc msr pae mce cx8 apic sep mtrr pge mca cmov pat pse36 clflush mmx fxsr sse sse2 ss ht syscall nx pdpe1gb rdtscp lm constant_tsc arch_perfmon rep_good nopl xtopology nonstop_tsc cpuid aperfmperf tsc_known_freq pni pclmulqdq monitor ssse3 fma cx16 pcid sse4_1 sse4_2 x2apic movbe popcnt tsc_deadline_timer aes xsave avx f16c rdrand hypervisor lahf_lm abm 3dnowprefetch invpcid_single pti fsgsbase tsc_adjust bmi1 avx2 smep bmi2 erms invpcid mpx avx512f avx512dq rdseed adx smap clflushopt clwb avx512cd avx512bw avx512vl xsaveopt xsavec xgetbv1 xsaves ida arat pku ospke avx512_vnni 2025-05-07T19:43:03.2538691Z bugs : cpu_meltdown spectre_v1 spectre_v2 spec_store_bypass l1tf mds swapgs itlb_multihit mmio_stale_data retbleed gds bhi 2025-05-07T19:43:03.2539278Z bogomips : 5999.98 2025-05-07T19:43:03.2539482Z clflush size : 64 2025-05-07T19:43:03.2539809Z cache_alignment : 64 2025-05-07T19:43:03.2540066Z address sizes : 46 bits physical, 48 bits virtual 2025-05-07T19:43:03.2540385Z power management: 2025-05-07T19:43:03.2540510Z 2025-05-07T19:43:03.2540604Z processor : 37 2025-05-07T19:43:03.2540809Z vendor_id : GenuineIntel 2025-05-07T19:43:03.2541049Z cpu family : 6 2025-05-07T19:43:03.2541238Z model : 85 2025-05-07T19:43:03.2541511Z model name : Intel(R) Xeon(R) Platinum 8275CL CPU @ 3.00GHz 2025-05-07T19:43:03.2541860Z stepping : 7 2025-05-07T19:43:03.2542077Z microcode : 0x5003901 2025-05-07T19:43:03.2542406Z cpu MHz : 2999.992 2025-05-07T19:43:03.2542743Z cache size : 36608 KB 2025-05-07T19:43:03.2542947Z physical id : 1 2025-05-07T19:43:03.2543154Z siblings : 48 2025-05-07T19:43:03.2543345Z core id : 13 2025-05-07T19:43:03.2543515Z cpu cores : 24 2025-05-07T19:43:03.2543703Z apicid : 90 2025-05-07T19:43:03.2543877Z initial apicid : 90 2025-05-07T19:43:03.2544077Z fpu : yes 2025-05-07T19:43:03.2544247Z fpu_exception : yes 2025-05-07T19:43:03.2544449Z cpuid level : 13 2025-05-07T19:43:03.2544627Z wp : yes 2025-05-07T19:43:03.2546704Z flags : fpu vme de pse tsc msr pae mce cx8 apic sep mtrr pge mca cmov pat pse36 clflush mmx fxsr sse sse2 ss ht syscall nx pdpe1gb rdtscp lm constant_tsc arch_perfmon rep_good nopl xtopology nonstop_tsc cpuid aperfmperf tsc_known_freq pni pclmulqdq monitor ssse3 fma cx16 pcid sse4_1 sse4_2 x2apic movbe popcnt tsc_deadline_timer aes xsave avx f16c rdrand hypervisor lahf_lm abm 3dnowprefetch invpcid_single pti fsgsbase tsc_adjust bmi1 avx2 smep bmi2 erms invpcid mpx avx512f avx512dq rdseed adx smap clflushopt clwb avx512cd avx512bw avx512vl xsaveopt xsavec xgetbv1 xsaves ida arat pku ospke avx512_vnni 2025-05-07T19:43:03.2549109Z bugs : cpu_meltdown spectre_v1 spectre_v2 spec_store_bypass l1tf mds swapgs itlb_multihit mmio_stale_data retbleed gds bhi 2025-05-07T19:43:03.2549632Z bogomips : 5999.98 2025-05-07T19:43:03.2549899Z clflush size : 64 2025-05-07T19:43:03.2550086Z cache_alignment : 64 2025-05-07T19:43:03.2550335Z address sizes : 46 bits physical, 48 bits virtual 2025-05-07T19:43:03.2550634Z power management: 2025-05-07T19:43:03.2550752Z 2025-05-07T19:43:03.2550825Z processor : 38 2025-05-07T19:43:03.2551023Z vendor_id : GenuineIntel 2025-05-07T19:43:03.2551231Z cpu family : 6 2025-05-07T19:43:03.2551411Z model : 85 2025-05-07T19:43:03.2551651Z model name : Intel(R) Xeon(R) Platinum 8275CL CPU @ 3.00GHz 2025-05-07T19:43:03.2551972Z stepping : 7 2025-05-07T19:43:03.2552150Z microcode : 0x5003901 2025-05-07T19:43:03.2552360Z cpu MHz : 2999.992 2025-05-07T19:43:03.2552548Z cache size : 36608 KB 2025-05-07T19:43:03.2552758Z physical id : 1 2025-05-07T19:43:03.2552941Z siblings : 48 2025-05-07T19:43:03.2553127Z core id : 14 2025-05-07T19:43:03.2553313Z cpu cores : 24 2025-05-07T19:43:03.2553489Z apicid : 92 2025-05-07T19:43:03.2553678Z initial apicid : 92 2025-05-07T19:43:03.2553867Z fpu : yes 2025-05-07T19:43:03.2554066Z fpu_exception : yes 2025-05-07T19:43:03.2554258Z cpuid level : 13 2025-05-07T19:43:03.2554454Z wp : yes 2025-05-07T19:43:03.2556513Z flags : fpu vme de pse tsc msr pae mce cx8 apic sep mtrr pge mca cmov pat pse36 clflush mmx fxsr sse sse2 ss ht syscall nx pdpe1gb rdtscp lm constant_tsc arch_perfmon rep_good nopl xtopology nonstop_tsc cpuid aperfmperf tsc_known_freq pni pclmulqdq monitor ssse3 fma cx16 pcid sse4_1 sse4_2 x2apic movbe popcnt tsc_deadline_timer aes xsave avx f16c rdrand hypervisor lahf_lm abm 3dnowprefetch invpcid_single pti fsgsbase tsc_adjust bmi1 avx2 smep bmi2 erms invpcid mpx avx512f avx512dq rdseed adx smap clflushopt clwb avx512cd avx512bw avx512vl xsaveopt xsavec xgetbv1 xsaves ida arat pku ospke avx512_vnni 2025-05-07T19:43:03.2558925Z bugs : cpu_meltdown spectre_v1 spectre_v2 spec_store_bypass l1tf mds swapgs itlb_multihit mmio_stale_data retbleed gds bhi 2025-05-07T19:43:03.2559455Z bogomips : 5999.98 2025-05-07T19:43:03.2559637Z clflush size : 64 2025-05-07T19:43:03.2559835Z cache_alignment : 64 2025-05-07T19:43:03.2560071Z address sizes : 46 bits physical, 48 bits virtual 2025-05-07T19:43:03.2560422Z power management: 2025-05-07T19:43:03.2560537Z 2025-05-07T19:43:03.2560617Z processor : 39 2025-05-07T19:43:03.2560801Z vendor_id : GenuineIntel 2025-05-07T19:43:03.2561011Z cpu family : 6 2025-05-07T19:43:03.2561189Z model : 85 2025-05-07T19:43:03.2561445Z model name : Intel(R) Xeon(R) Platinum 8275CL CPU @ 3.00GHz 2025-05-07T19:43:03.2561755Z stepping : 7 2025-05-07T19:43:03.2561957Z microcode : 0x5003901 2025-05-07T19:43:03.2562159Z cpu MHz : 2999.992 2025-05-07T19:43:03.2562373Z cache size : 36608 KB 2025-05-07T19:43:03.2562571Z physical id : 1 2025-05-07T19:43:03.2562764Z siblings : 48 2025-05-07T19:43:03.2562958Z core id : 15 2025-05-07T19:43:03.2563127Z cpu cores : 24 2025-05-07T19:43:03.2563322Z apicid : 94 2025-05-07T19:43:03.2563510Z initial apicid : 94 2025-05-07T19:43:03.2563710Z fpu : yes 2025-05-07T19:43:03.2563887Z fpu_exception : yes 2025-05-07T19:43:03.2564100Z cpuid level : 13 2025-05-07T19:43:03.2564288Z wp : yes 2025-05-07T19:43:03.2566374Z flags : fpu vme de pse tsc msr pae mce cx8 apic sep mtrr pge mca cmov pat pse36 clflush mmx fxsr sse sse2 ss ht syscall nx pdpe1gb rdtscp lm constant_tsc arch_perfmon rep_good nopl xtopology nonstop_tsc cpuid aperfmperf tsc_known_freq pni pclmulqdq monitor ssse3 fma cx16 pcid sse4_1 sse4_2 x2apic movbe popcnt tsc_deadline_timer aes xsave avx f16c rdrand hypervisor lahf_lm abm 3dnowprefetch invpcid_single pti fsgsbase tsc_adjust bmi1 avx2 smep bmi2 erms invpcid mpx avx512f avx512dq rdseed adx smap clflushopt clwb avx512cd avx512bw avx512vl xsaveopt xsavec xgetbv1 xsaves ida arat pku ospke avx512_vnni 2025-05-07T19:43:03.2568792Z bugs : cpu_meltdown spectre_v1 spectre_v2 spec_store_bypass l1tf mds swapgs itlb_multihit mmio_stale_data retbleed gds bhi 2025-05-07T19:43:03.2569336Z bogomips : 5999.98 2025-05-07T19:43:03.2569546Z clflush size : 64 2025-05-07T19:43:03.2569750Z cache_alignment : 64 2025-05-07T19:43:03.2570069Z address sizes : 46 bits physical, 48 bits virtual 2025-05-07T19:43:03.2570467Z power management: 2025-05-07T19:43:03.2570775Z 2025-05-07T19:43:03.2570862Z processor : 40 2025-05-07T19:43:03.2571092Z vendor_id : GenuineIntel 2025-05-07T19:43:03.2571390Z cpu family : 6 2025-05-07T19:43:03.2571605Z model : 85 2025-05-07T19:43:03.2571870Z model name : Intel(R) Xeon(R) Platinum 8275CL CPU @ 3.00GHz 2025-05-07T19:43:03.2572217Z stepping : 7 2025-05-07T19:43:03.2572418Z microcode : 0x5003901 2025-05-07T19:43:03.2572644Z cpu MHz : 2999.992 2025-05-07T19:43:03.2572853Z cache size : 36608 KB 2025-05-07T19:43:03.2573086Z physical id : 1 2025-05-07T19:43:03.2573304Z siblings : 48 2025-05-07T19:43:03.2573495Z core id : 16 2025-05-07T19:43:03.2573699Z cpu cores : 24 2025-05-07T19:43:03.2573892Z apicid : 96 2025-05-07T19:43:03.2574096Z initial apicid : 96 2025-05-07T19:43:03.2574322Z fpu : yes 2025-05-07T19:43:03.2574520Z fpu_exception : yes 2025-05-07T19:43:03.2574720Z cpuid level : 13 2025-05-07T19:43:03.2574927Z wp : yes 2025-05-07T19:43:03.2577152Z flags : fpu vme de pse tsc msr pae mce cx8 apic sep mtrr pge mca cmov pat pse36 clflush mmx fxsr sse sse2 ss ht syscall nx pdpe1gb rdtscp lm constant_tsc arch_perfmon rep_good nopl xtopology nonstop_tsc cpuid aperfmperf tsc_known_freq pni pclmulqdq monitor ssse3 fma cx16 pcid sse4_1 sse4_2 x2apic movbe popcnt tsc_deadline_timer aes xsave avx f16c rdrand hypervisor lahf_lm abm 3dnowprefetch invpcid_single pti fsgsbase tsc_adjust bmi1 avx2 smep bmi2 erms invpcid mpx avx512f avx512dq rdseed adx smap clflushopt clwb avx512cd avx512bw avx512vl xsaveopt xsavec xgetbv1 xsaves ida arat pku ospke avx512_vnni 2025-05-07T19:43:03.2579768Z bugs : cpu_meltdown spectre_v1 spectre_v2 spec_store_bypass l1tf mds swapgs itlb_multihit mmio_stale_data retbleed gds bhi 2025-05-07T19:43:03.2580357Z bogomips : 5999.98 2025-05-07T19:43:03.2580569Z clflush size : 64 2025-05-07T19:43:03.2580794Z cache_alignment : 64 2025-05-07T19:43:03.2581055Z address sizes : 46 bits physical, 48 bits virtual 2025-05-07T19:43:03.2581393Z power management: 2025-05-07T19:43:03.2581517Z 2025-05-07T19:43:03.2581697Z processor : 41 2025-05-07T19:43:03.2581905Z vendor_id : GenuineIntel 2025-05-07T19:43:03.2582142Z cpu family : 6 2025-05-07T19:43:03.2582337Z model : 85 2025-05-07T19:43:03.2582631Z model name : Intel(R) Xeon(R) Platinum 8275CL CPU @ 3.00GHz 2025-05-07T19:43:03.2583078Z stepping : 7 2025-05-07T19:43:03.2583278Z microcode : 0x5003901 2025-05-07T19:43:03.2583473Z cpu MHz : 2999.992 2025-05-07T19:43:03.2583677Z cache size : 36608 KB 2025-05-07T19:43:03.2583882Z physical id : 1 2025-05-07T19:43:03.2584074Z siblings : 48 2025-05-07T19:43:03.2584258Z core id : 17 2025-05-07T19:43:03.2584445Z cpu cores : 24 2025-05-07T19:43:03.2584645Z apicid : 98 2025-05-07T19:43:03.2584825Z initial apicid : 98 2025-05-07T19:43:03.2585038Z fpu : yes 2025-05-07T19:43:03.2585223Z fpu_exception : yes 2025-05-07T19:43:03.2585431Z cpuid level : 13 2025-05-07T19:43:03.2585626Z wp : yes 2025-05-07T19:43:03.2587709Z flags : fpu vme de pse tsc msr pae mce cx8 apic sep mtrr pge mca cmov pat pse36 clflush mmx fxsr sse sse2 ss ht syscall nx pdpe1gb rdtscp lm constant_tsc arch_perfmon rep_good nopl xtopology nonstop_tsc cpuid aperfmperf tsc_known_freq pni pclmulqdq monitor ssse3 fma cx16 pcid sse4_1 sse4_2 x2apic movbe popcnt tsc_deadline_timer aes xsave avx f16c rdrand hypervisor lahf_lm abm 3dnowprefetch invpcid_single pti fsgsbase tsc_adjust bmi1 avx2 smep bmi2 erms invpcid mpx avx512f avx512dq rdseed adx smap clflushopt clwb avx512cd avx512bw avx512vl xsaveopt xsavec xgetbv1 xsaves ida arat pku ospke avx512_vnni 2025-05-07T19:43:03.2590140Z bugs : cpu_meltdown spectre_v1 spectre_v2 spec_store_bypass l1tf mds swapgs itlb_multihit mmio_stale_data retbleed gds bhi 2025-05-07T19:43:03.2590671Z bogomips : 5999.98 2025-05-07T19:43:03.2590887Z clflush size : 64 2025-05-07T19:43:03.2591083Z cache_alignment : 64 2025-05-07T19:43:03.2591337Z address sizes : 46 bits physical, 48 bits virtual 2025-05-07T19:43:03.2591650Z power management: 2025-05-07T19:43:03.2591829Z 2025-05-07T19:43:03.2591905Z processor : 42 2025-05-07T19:43:03.2592119Z vendor_id : GenuineIntel 2025-05-07T19:43:03.2592328Z cpu family : 6 2025-05-07T19:43:03.2592527Z model : 85 2025-05-07T19:43:03.2592774Z model name : Intel(R) Xeon(R) Platinum 8275CL CPU @ 3.00GHz 2025-05-07T19:43:03.2593117Z stepping : 7 2025-05-07T19:43:03.2593307Z microcode : 0x5003901 2025-05-07T19:43:03.2593525Z cpu MHz : 2999.992 2025-05-07T19:43:03.2593722Z cache size : 36608 KB 2025-05-07T19:43:03.2593947Z physical id : 1 2025-05-07T19:43:03.2594146Z siblings : 48 2025-05-07T19:43:03.2594327Z core id : 18 2025-05-07T19:43:03.2594536Z cpu cores : 24 2025-05-07T19:43:03.2594726Z apicid : 100 2025-05-07T19:43:03.2594944Z initial apicid : 100 2025-05-07T19:43:03.2595137Z fpu : yes 2025-05-07T19:43:03.2595337Z fpu_exception : yes 2025-05-07T19:43:03.2595531Z cpuid level : 13 2025-05-07T19:43:03.2595737Z wp : yes 2025-05-07T19:43:03.2597961Z flags : fpu vme de pse tsc msr pae mce cx8 apic sep mtrr pge mca cmov pat pse36 clflush mmx fxsr sse sse2 ss ht syscall nx pdpe1gb rdtscp lm constant_tsc arch_perfmon rep_good nopl xtopology nonstop_tsc cpuid aperfmperf tsc_known_freq pni pclmulqdq monitor ssse3 fma cx16 pcid sse4_1 sse4_2 x2apic movbe popcnt tsc_deadline_timer aes xsave avx f16c rdrand hypervisor lahf_lm abm 3dnowprefetch invpcid_single pti fsgsbase tsc_adjust bmi1 avx2 smep bmi2 erms invpcid mpx avx512f avx512dq rdseed adx smap clflushopt clwb avx512cd avx512bw avx512vl xsaveopt xsavec xgetbv1 xsaves ida arat pku ospke avx512_vnni 2025-05-07T19:43:03.2600375Z bugs : cpu_meltdown spectre_v1 spectre_v2 spec_store_bypass l1tf mds swapgs itlb_multihit mmio_stale_data retbleed gds bhi 2025-05-07T19:43:03.2600920Z bogomips : 5999.98 2025-05-07T19:43:03.2601110Z clflush size : 64 2025-05-07T19:43:03.2601335Z cache_alignment : 64 2025-05-07T19:43:03.2601593Z address sizes : 46 bits physical, 48 bits virtual 2025-05-07T19:43:03.2601892Z power management: 2025-05-07T19:43:03.2602012Z 2025-05-07T19:43:03.2602100Z processor : 43 2025-05-07T19:43:03.2602311Z vendor_id : GenuineIntel 2025-05-07T19:43:03.2602609Z cpu family : 6 2025-05-07T19:43:03.2602794Z model : 85 2025-05-07T19:43:03.2603062Z model name : Intel(R) Xeon(R) Platinum 8275CL CPU @ 3.00GHz 2025-05-07T19:43:03.2603384Z stepping : 7 2025-05-07T19:43:03.2603590Z microcode : 0x5003901 2025-05-07T19:43:03.2603799Z cpu MHz : 2999.992 2025-05-07T19:43:03.2604018Z cache size : 36608 KB 2025-05-07T19:43:03.2604220Z physical id : 1 2025-05-07T19:43:03.2604428Z siblings : 48 2025-05-07T19:43:03.2604632Z core id : 19 2025-05-07T19:43:03.2604816Z cpu cores : 24 2025-05-07T19:43:03.2605016Z apicid : 102 2025-05-07T19:43:03.2605202Z initial apicid : 102 2025-05-07T19:43:03.2605422Z fpu : yes 2025-05-07T19:43:03.2605605Z fpu_exception : yes 2025-05-07T19:43:03.2605824Z cpuid level : 13 2025-05-07T19:43:03.2606016Z wp : yes 2025-05-07T19:43:03.2608116Z flags : fpu vme de pse tsc msr pae mce cx8 apic sep mtrr pge mca cmov pat pse36 clflush mmx fxsr sse sse2 ss ht syscall nx pdpe1gb rdtscp lm constant_tsc arch_perfmon rep_good nopl xtopology nonstop_tsc cpuid aperfmperf tsc_known_freq pni pclmulqdq monitor ssse3 fma cx16 pcid sse4_1 sse4_2 x2apic movbe popcnt tsc_deadline_timer aes xsave avx f16c rdrand hypervisor lahf_lm abm 3dnowprefetch invpcid_single pti fsgsbase tsc_adjust bmi1 avx2 smep bmi2 erms invpcid mpx avx512f avx512dq rdseed adx smap clflushopt clwb avx512cd avx512bw avx512vl xsaveopt xsavec xgetbv1 xsaves ida arat pku ospke avx512_vnni 2025-05-07T19:43:03.2610793Z bugs : cpu_meltdown spectre_v1 spectre_v2 spec_store_bypass l1tf mds swapgs itlb_multihit mmio_stale_data retbleed gds bhi 2025-05-07T19:43:03.2611375Z bogomips : 5999.98 2025-05-07T19:43:03.2611606Z clflush size : 64 2025-05-07T19:43:03.2611825Z cache_alignment : 64 2025-05-07T19:43:03.2612116Z address sizes : 46 bits physical, 48 bits virtual 2025-05-07T19:43:03.2612459Z power management: 2025-05-07T19:43:03.2612595Z 2025-05-07T19:43:03.2612688Z processor : 44 2025-05-07T19:43:03.2612980Z vendor_id : GenuineIntel 2025-05-07T19:43:03.2613233Z cpu family : 6 2025-05-07T19:43:03.2613447Z model : 85 2025-05-07T19:43:03.2613719Z model name : Intel(R) Xeon(R) Platinum 8275CL CPU @ 3.00GHz 2025-05-07T19:43:03.2614084Z stepping : 7 2025-05-07T19:43:03.2614285Z microcode : 0x5003901 2025-05-07T19:43:03.2614528Z cpu MHz : 2999.992 2025-05-07T19:43:03.2614736Z cache size : 36608 KB 2025-05-07T19:43:03.2614969Z physical id : 1 2025-05-07T19:43:03.2615195Z siblings : 48 2025-05-07T19:43:03.2615391Z core id : 20 2025-05-07T19:43:03.2615606Z cpu cores : 24 2025-05-07T19:43:03.2615800Z apicid : 104 2025-05-07T19:43:03.2616009Z initial apicid : 104 2025-05-07T19:43:03.2616225Z fpu : yes 2025-05-07T19:43:03.2616428Z fpu_exception : yes 2025-05-07T19:43:03.2616631Z cpuid level : 13 2025-05-07T19:43:03.2616833Z wp : yes 2025-05-07T19:43:03.2619067Z flags : fpu vme de pse tsc msr pae mce cx8 apic sep mtrr pge mca cmov pat pse36 clflush mmx fxsr sse sse2 ss ht syscall nx pdpe1gb rdtscp lm constant_tsc arch_perfmon rep_good nopl xtopology nonstop_tsc cpuid aperfmperf tsc_known_freq pni pclmulqdq monitor ssse3 fma cx16 pcid sse4_1 sse4_2 x2apic movbe popcnt tsc_deadline_timer aes xsave avx f16c rdrand hypervisor lahf_lm abm 3dnowprefetch invpcid_single pti fsgsbase tsc_adjust bmi1 avx2 smep bmi2 erms invpcid mpx avx512f avx512dq rdseed adx smap clflushopt clwb avx512cd avx512bw avx512vl xsaveopt xsavec xgetbv1 xsaves ida arat pku ospke avx512_vnni 2025-05-07T19:43:03.2621675Z bugs : cpu_meltdown spectre_v1 spectre_v2 spec_store_bypass l1tf mds swapgs itlb_multihit mmio_stale_data retbleed gds bhi 2025-05-07T19:43:03.2622274Z bogomips : 5999.98 2025-05-07T19:43:03.2622487Z clflush size : 64 2025-05-07T19:43:03.2622721Z cache_alignment : 64 2025-05-07T19:43:03.2623096Z address sizes : 46 bits physical, 48 bits virtual 2025-05-07T19:43:03.2623398Z power management: 2025-05-07T19:43:03.2623525Z 2025-05-07T19:43:03.2623619Z processor : 45 2025-05-07T19:43:03.2623823Z vendor_id : GenuineIntel 2025-05-07T19:43:03.2624058Z cpu family : 6 2025-05-07T19:43:03.2624242Z model : 85 2025-05-07T19:43:03.2624406Z model name : Intel(R) Xeon(R) Platinum 8275CL CPU @ 3.00GHz 2025-05-07T19:43:03.2624548Z stepping : 7 2025-05-07T19:43:03.2624630Z microcode : 0x5003901 2025-05-07T19:43:03.2624709Z cpu MHz : 2999.992 2025-05-07T19:43:03.2624808Z cache size : 36608 KB 2025-05-07T19:43:03.2624888Z physical id : 1 2025-05-07T19:43:03.2624965Z siblings : 48 2025-05-07T19:43:03.2625057Z core id : 21 2025-05-07T19:43:03.2625129Z cpu cores : 24 2025-05-07T19:43:03.2625208Z apicid : 106 2025-05-07T19:43:03.2625287Z initial apicid : 106 2025-05-07T19:43:03.2625382Z fpu : yes 2025-05-07T19:43:03.2625471Z fpu_exception : yes 2025-05-07T19:43:03.2625550Z cpuid level : 13 2025-05-07T19:43:03.2625628Z wp : yes 2025-05-07T19:43:03.2627614Z flags : fpu vme de pse tsc msr pae mce cx8 apic sep mtrr pge mca cmov pat pse36 clflush mmx fxsr sse sse2 ss ht syscall nx pdpe1gb rdtscp lm constant_tsc arch_perfmon rep_good nopl xtopology nonstop_tsc cpuid aperfmperf tsc_known_freq pni pclmulqdq monitor ssse3 fma cx16 pcid sse4_1 sse4_2 x2apic movbe popcnt tsc_deadline_timer aes xsave avx f16c rdrand hypervisor lahf_lm abm 3dnowprefetch invpcid_single pti fsgsbase tsc_adjust bmi1 avx2 smep bmi2 erms invpcid mpx avx512f avx512dq rdseed adx smap clflushopt clwb avx512cd avx512bw avx512vl xsaveopt xsavec xgetbv1 xsaves ida arat pku ospke avx512_vnni 2025-05-07T19:43:03.2627979Z bugs : cpu_meltdown spectre_v1 spectre_v2 spec_store_bypass l1tf mds swapgs itlb_multihit mmio_stale_data retbleed gds bhi 2025-05-07T19:43:03.2628068Z bogomips : 5999.98 2025-05-07T19:43:03.2628145Z clflush size : 64 2025-05-07T19:43:03.2628225Z cache_alignment : 64 2025-05-07T19:43:03.2628338Z address sizes : 46 bits physical, 48 bits virtual 2025-05-07T19:43:03.2628423Z power management: 2025-05-07T19:43:03.2628427Z 2025-05-07T19:43:03.2628645Z processor : 46 2025-05-07T19:43:03.2628735Z vendor_id : GenuineIntel 2025-05-07T19:43:03.2628825Z cpu family : 6 2025-05-07T19:43:03.2629069Z model : 85 2025-05-07T19:43:03.2629326Z model name : Intel(R) Xeon(R) Platinum 8275CL CPU @ 3.00GHz 2025-05-07T19:43:03.2629427Z stepping : 7 2025-05-07T19:43:03.2629540Z microcode : 0x5003901 2025-05-07T19:43:03.2629621Z cpu MHz : 2999.992 2025-05-07T19:43:03.2629710Z cache size : 36608 KB 2025-05-07T19:43:03.2629810Z physical id : 1 2025-05-07T19:43:03.2629889Z siblings : 48 2025-05-07T19:43:03.2629965Z core id : 22 2025-05-07T19:43:03.2630051Z cpu cores : 24 2025-05-07T19:43:03.2630144Z apicid : 108 2025-05-07T19:43:03.2630231Z initial apicid : 108 2025-05-07T19:43:03.2630316Z fpu : yes 2025-05-07T19:43:03.2630425Z fpu_exception : yes 2025-05-07T19:43:03.2630505Z cpuid level : 13 2025-05-07T19:43:03.2630580Z wp : yes 2025-05-07T19:43:03.2632737Z flags : fpu vme de pse tsc msr pae mce cx8 apic sep mtrr pge mca cmov pat pse36 clflush mmx fxsr sse sse2 ss ht syscall nx pdpe1gb rdtscp lm constant_tsc arch_perfmon rep_good nopl xtopology nonstop_tsc cpuid aperfmperf tsc_known_freq pni pclmulqdq monitor ssse3 fma cx16 pcid sse4_1 sse4_2 x2apic movbe popcnt tsc_deadline_timer aes xsave avx f16c rdrand hypervisor lahf_lm abm 3dnowprefetch invpcid_single pti fsgsbase tsc_adjust bmi1 avx2 smep bmi2 erms invpcid mpx avx512f avx512dq rdseed adx smap clflushopt clwb avx512cd avx512bw avx512vl xsaveopt xsavec xgetbv1 xsaves ida arat pku ospke avx512_vnni 2025-05-07T19:43:03.2633129Z bugs : cpu_meltdown spectre_v1 spectre_v2 spec_store_bypass l1tf mds swapgs itlb_multihit mmio_stale_data retbleed gds bhi 2025-05-07T19:43:03.2633213Z bogomips : 5999.98 2025-05-07T19:43:03.2633322Z clflush size : 64 2025-05-07T19:43:03.2633412Z cache_alignment : 64 2025-05-07T19:43:03.2633540Z address sizes : 46 bits physical, 48 bits virtual 2025-05-07T19:43:03.2633623Z power management: 2025-05-07T19:43:03.2633649Z 2025-05-07T19:43:03.2633730Z processor : 47 2025-05-07T19:43:03.2633825Z vendor_id : GenuineIntel 2025-05-07T19:43:03.2633903Z cpu family : 6 2025-05-07T19:43:03.2633986Z model : 85 2025-05-07T19:43:03.2634152Z model name : Intel(R) Xeon(R) Platinum 8275CL CPU @ 3.00GHz 2025-05-07T19:43:03.2634229Z stepping : 7 2025-05-07T19:43:03.2634381Z microcode : 0x5003901 2025-05-07T19:43:03.2634482Z cpu MHz : 1199.962 2025-05-07T19:43:03.2634570Z cache size : 36608 KB 2025-05-07T19:43:03.2634657Z physical id : 1 2025-05-07T19:43:03.2634763Z siblings : 48 2025-05-07T19:43:03.2634840Z core id : 23 2025-05-07T19:43:03.2634920Z cpu cores : 24 2025-05-07T19:43:03.2635003Z apicid : 110 2025-05-07T19:43:03.2635102Z initial apicid : 110 2025-05-07T19:43:03.2635181Z fpu : yes 2025-05-07T19:43:03.2635271Z fpu_exception : yes 2025-05-07T19:43:03.2635366Z cpuid level : 13 2025-05-07T19:43:03.2635445Z wp : yes 2025-05-07T19:43:03.2637577Z flags : fpu vme de pse tsc msr pae mce cx8 apic sep mtrr pge mca cmov pat pse36 clflush mmx fxsr sse sse2 ss ht syscall nx pdpe1gb rdtscp lm constant_tsc arch_perfmon rep_good nopl xtopology nonstop_tsc cpuid aperfmperf tsc_known_freq pni pclmulqdq monitor ssse3 fma cx16 pcid sse4_1 sse4_2 x2apic movbe popcnt tsc_deadline_timer aes xsave avx f16c rdrand hypervisor lahf_lm abm 3dnowprefetch invpcid_single pti fsgsbase tsc_adjust bmi1 avx2 smep bmi2 erms invpcid mpx avx512f avx512dq rdseed adx smap clflushopt clwb avx512cd avx512bw avx512vl xsaveopt xsavec xgetbv1 xsaves ida arat pku ospke avx512_vnni 2025-05-07T19:43:03.2637990Z bugs : cpu_meltdown spectre_v1 spectre_v2 spec_store_bypass l1tf mds swapgs itlb_multihit mmio_stale_data retbleed gds bhi 2025-05-07T19:43:03.2638073Z bogomips : 5999.98 2025-05-07T19:43:03.2638156Z clflush size : 64 2025-05-07T19:43:03.2638259Z cache_alignment : 64 2025-05-07T19:43:03.2638389Z address sizes : 46 bits physical, 48 bits virtual 2025-05-07T19:43:03.2638475Z power management: 2025-05-07T19:43:03.2638480Z 2025-05-07T19:43:03.2638562Z processor : 48 2025-05-07T19:43:03.2638674Z vendor_id : GenuineIntel 2025-05-07T19:43:03.2638758Z cpu family : 6 2025-05-07T19:43:03.2638833Z model : 85 2025-05-07T19:43:03.2639009Z model name : Intel(R) Xeon(R) Platinum 8275CL CPU @ 3.00GHz 2025-05-07T19:43:03.2639139Z stepping : 7 2025-05-07T19:43:03.2639231Z microcode : 0x5003901 2025-05-07T19:43:03.2639320Z cpu MHz : 2999.992 2025-05-07T19:43:03.2639426Z cache size : 36608 KB 2025-05-07T19:43:03.2639517Z physical id : 0 2025-05-07T19:43:03.2639596Z siblings : 48 2025-05-07T19:43:03.2639688Z core id : 0 2025-05-07T19:43:03.2639768Z cpu cores : 24 2025-05-07T19:43:03.2639850Z apicid : 1 2025-05-07T19:43:03.2639938Z initial apicid : 1 2025-05-07T19:43:03.2640054Z fpu : yes 2025-05-07T19:43:03.2640151Z fpu_exception : yes 2025-05-07T19:43:03.2640240Z cpuid level : 13 2025-05-07T19:43:03.2640355Z wp : yes 2025-05-07T19:43:03.2642504Z flags : fpu vme de pse tsc msr pae mce cx8 apic sep mtrr pge mca cmov pat pse36 clflush mmx fxsr sse sse2 ss ht syscall nx pdpe1gb rdtscp lm constant_tsc arch_perfmon rep_good nopl xtopology nonstop_tsc cpuid aperfmperf tsc_known_freq pni pclmulqdq monitor ssse3 fma cx16 pcid sse4_1 sse4_2 x2apic movbe popcnt tsc_deadline_timer aes xsave avx f16c rdrand hypervisor lahf_lm abm 3dnowprefetch invpcid_single pti fsgsbase tsc_adjust bmi1 avx2 smep bmi2 erms invpcid mpx avx512f avx512dq rdseed adx smap clflushopt clwb avx512cd avx512bw avx512vl xsaveopt xsavec xgetbv1 xsaves ida arat pku ospke avx512_vnni 2025-05-07T19:43:03.2642880Z bugs : cpu_meltdown spectre_v1 spectre_v2 spec_store_bypass l1tf mds swapgs itlb_multihit mmio_stale_data retbleed gds bhi 2025-05-07T19:43:03.2643000Z bogomips : 5999.98 2025-05-07T19:43:03.2643091Z clflush size : 64 2025-05-07T19:43:03.2643185Z cache_alignment : 64 2025-05-07T19:43:03.2643343Z address sizes : 46 bits physical, 48 bits virtual 2025-05-07T19:43:03.2643435Z power management: 2025-05-07T19:43:03.2643438Z 2025-05-07T19:43:03.2643528Z processor : 49 2025-05-07T19:43:03.2643626Z vendor_id : GenuineIntel 2025-05-07T19:43:03.2643740Z cpu family : 6 2025-05-07T19:43:03.2643827Z model : 85 2025-05-07T19:43:03.2643988Z model name : Intel(R) Xeon(R) Platinum 8275CL CPU @ 3.00GHz 2025-05-07T19:43:03.2644108Z stepping : 7 2025-05-07T19:43:03.2644207Z microcode : 0x5003901 2025-05-07T19:43:03.2644297Z cpu MHz : 2999.992 2025-05-07T19:43:03.2644461Z cache size : 36608 KB 2025-05-07T19:43:03.2644573Z physical id : 0 2025-05-07T19:43:03.2644663Z siblings : 48 2025-05-07T19:43:03.2644753Z core id : 1 2025-05-07T19:43:03.2644874Z cpu cores : 24 2025-05-07T19:43:03.2644964Z apicid : 3 2025-05-07T19:43:03.2645059Z initial apicid : 3 2025-05-07T19:43:03.2645146Z fpu : yes 2025-05-07T19:43:03.2645268Z fpu_exception : yes 2025-05-07T19:43:03.2645365Z cpuid level : 13 2025-05-07T19:43:03.2645453Z wp : yes 2025-05-07T19:43:03.2647467Z flags : fpu vme de pse tsc msr pae mce cx8 apic sep mtrr pge mca cmov pat pse36 clflush mmx fxsr sse sse2 ss ht syscall nx pdpe1gb rdtscp lm constant_tsc arch_perfmon rep_good nopl xtopology nonstop_tsc cpuid aperfmperf tsc_known_freq pni pclmulqdq monitor ssse3 fma cx16 pcid sse4_1 sse4_2 x2apic movbe popcnt tsc_deadline_timer aes xsave avx f16c rdrand hypervisor lahf_lm abm 3dnowprefetch invpcid_single pti fsgsbase tsc_adjust bmi1 avx2 smep bmi2 erms invpcid mpx avx512f avx512dq rdseed adx smap clflushopt clwb avx512cd avx512bw avx512vl xsaveopt xsavec xgetbv1 xsaves ida arat pku ospke avx512_vnni 2025-05-07T19:43:03.2647838Z bugs : cpu_meltdown spectre_v1 spectre_v2 spec_store_bypass l1tf mds swapgs itlb_multihit mmio_stale_data retbleed gds bhi 2025-05-07T19:43:03.2647933Z bogomips : 5999.98 2025-05-07T19:43:03.2648046Z clflush size : 64 2025-05-07T19:43:03.2648138Z cache_alignment : 64 2025-05-07T19:43:03.2648271Z address sizes : 46 bits physical, 48 bits virtual 2025-05-07T19:43:03.2648391Z power management: 2025-05-07T19:43:03.2648395Z 2025-05-07T19:43:03.2648485Z processor : 50 2025-05-07T19:43:03.2648581Z vendor_id : GenuineIntel 2025-05-07T19:43:03.2648666Z cpu family : 6 2025-05-07T19:43:03.2648774Z model : 85 2025-05-07T19:43:03.2648932Z model name : Intel(R) Xeon(R) Platinum 8275CL CPU @ 3.00GHz 2025-05-07T19:43:03.2649016Z stepping : 7 2025-05-07T19:43:03.2649138Z microcode : 0x5003901 2025-05-07T19:43:03.2649268Z cpu MHz : 2999.992 2025-05-07T19:43:03.2649359Z cache size : 36608 KB 2025-05-07T19:43:03.2649448Z physical id : 0 2025-05-07T19:43:03.2649556Z siblings : 48 2025-05-07T19:43:03.2649639Z core id : 2 2025-05-07T19:43:03.2649724Z cpu cores : 24 2025-05-07T19:43:03.2649830Z apicid : 5 2025-05-07T19:43:03.2649916Z initial apicid : 5 2025-05-07T19:43:03.2649998Z fpu : yes 2025-05-07T19:43:03.2650086Z fpu_exception : yes 2025-05-07T19:43:03.2650262Z cpuid level : 13 2025-05-07T19:43:03.2650345Z wp : yes 2025-05-07T19:43:03.2652642Z flags : fpu vme de pse tsc msr pae mce cx8 apic sep mtrr pge mca cmov pat pse36 clflush mmx fxsr sse sse2 ss ht syscall nx pdpe1gb rdtscp lm constant_tsc arch_perfmon rep_good nopl xtopology nonstop_tsc cpuid aperfmperf tsc_known_freq pni pclmulqdq monitor ssse3 fma cx16 pcid sse4_1 sse4_2 x2apic movbe popcnt tsc_deadline_timer aes xsave avx f16c rdrand hypervisor lahf_lm abm 3dnowprefetch invpcid_single pti fsgsbase tsc_adjust bmi1 avx2 smep bmi2 erms invpcid mpx avx512f avx512dq rdseed adx smap clflushopt clwb avx512cd avx512bw avx512vl xsaveopt xsavec xgetbv1 xsaves ida arat pku ospke avx512_vnni 2025-05-07T19:43:03.2653072Z bugs : cpu_meltdown spectre_v1 spectre_v2 spec_store_bypass l1tf mds swapgs itlb_multihit mmio_stale_data retbleed gds bhi 2025-05-07T19:43:03.2653166Z bogomips : 5999.98 2025-05-07T19:43:03.2653258Z clflush size : 64 2025-05-07T19:43:03.2653383Z cache_alignment : 64 2025-05-07T19:43:03.2653522Z address sizes : 46 bits physical, 48 bits virtual 2025-05-07T19:43:03.2653616Z power management: 2025-05-07T19:43:03.2653620Z 2025-05-07T19:43:03.2653743Z processor : 51 2025-05-07T19:43:03.2653845Z vendor_id : GenuineIntel 2025-05-07T19:43:03.2653937Z cpu family : 6 2025-05-07T19:43:03.2654025Z model : 85 2025-05-07T19:43:03.2654220Z model name : Intel(R) Xeon(R) Platinum 8275CL CPU @ 3.00GHz 2025-05-07T19:43:03.2654311Z stepping : 7 2025-05-07T19:43:03.2654406Z microcode : 0x5003901 2025-05-07T19:43:03.2654520Z cpu MHz : 2999.992 2025-05-07T19:43:03.2654618Z cache size : 36608 KB 2025-05-07T19:43:03.2654711Z physical id : 0 2025-05-07T19:43:03.2654856Z siblings : 48 2025-05-07T19:43:03.2654968Z core id : 3 2025-05-07T19:43:03.2655062Z cpu cores : 24 2025-05-07T19:43:03.2655151Z apicid : 7 2025-05-07T19:43:03.2655280Z initial apicid : 7 2025-05-07T19:43:03.2655372Z fpu : yes 2025-05-07T19:43:03.2655469Z fpu_exception : yes 2025-05-07T19:43:03.2655567Z cpuid level : 13 2025-05-07T19:43:03.2655685Z wp : yes 2025-05-07T19:43:03.2657823Z flags : fpu vme de pse tsc msr pae mce cx8 apic sep mtrr pge mca cmov pat pse36 clflush mmx fxsr sse sse2 ss ht syscall nx pdpe1gb rdtscp lm constant_tsc arch_perfmon rep_good nopl xtopology nonstop_tsc cpuid aperfmperf tsc_known_freq pni pclmulqdq monitor ssse3 fma cx16 pcid sse4_1 sse4_2 x2apic movbe popcnt tsc_deadline_timer aes xsave avx f16c rdrand hypervisor lahf_lm abm 3dnowprefetch invpcid_single pti fsgsbase tsc_adjust bmi1 avx2 smep bmi2 erms invpcid mpx avx512f avx512dq rdseed adx smap clflushopt clwb avx512cd avx512bw avx512vl xsaveopt xsavec xgetbv1 xsaves ida arat pku ospke avx512_vnni 2025-05-07T19:43:03.2658222Z bugs : cpu_meltdown spectre_v1 spectre_v2 spec_store_bypass l1tf mds swapgs itlb_multihit mmio_stale_data retbleed gds bhi 2025-05-07T19:43:03.2658339Z bogomips : 5999.98 2025-05-07T19:43:03.2658432Z clflush size : 64 2025-05-07T19:43:03.2658527Z cache_alignment : 64 2025-05-07T19:43:03.2658694Z address sizes : 46 bits physical, 48 bits virtual 2025-05-07T19:43:03.2658790Z power management: 2025-05-07T19:43:03.2658794Z 2025-05-07T19:43:03.2658887Z processor : 52 2025-05-07T19:43:03.2659012Z vendor_id : GenuineIntel 2025-05-07T19:43:03.2659103Z cpu family : 6 2025-05-07T19:43:03.2659191Z model : 85 2025-05-07T19:43:03.2659359Z model name : Intel(R) Xeon(R) Platinum 8275CL CPU @ 3.00GHz 2025-05-07T19:43:03.2659474Z stepping : 7 2025-05-07T19:43:03.2659570Z microcode : 0x5003901 2025-05-07T19:43:03.2659662Z cpu MHz : 2999.992 2025-05-07T19:43:03.2659780Z cache size : 36608 KB 2025-05-07T19:43:03.2659929Z physical id : 0 2025-05-07T19:43:03.2660021Z siblings : 48 2025-05-07T19:43:03.2660113Z core id : 4 2025-05-07T19:43:03.2660235Z cpu cores : 24 2025-05-07T19:43:03.2660330Z apicid : 9 2025-05-07T19:43:03.2660429Z initial apicid : 9 2025-05-07T19:43:03.2660518Z fpu : yes 2025-05-07T19:43:03.2660643Z fpu_exception : yes 2025-05-07T19:43:03.2660738Z cpuid level : 13 2025-05-07T19:43:03.2660829Z wp : yes 2025-05-07T19:43:03.2663098Z flags : fpu vme de pse tsc msr pae mce cx8 apic sep mtrr pge mca cmov pat pse36 clflush mmx fxsr sse sse2 ss ht syscall nx pdpe1gb rdtscp lm constant_tsc arch_perfmon rep_good nopl xtopology nonstop_tsc cpuid aperfmperf tsc_known_freq pni pclmulqdq monitor ssse3 fma cx16 pcid sse4_1 sse4_2 x2apic movbe popcnt tsc_deadline_timer aes xsave avx f16c rdrand hypervisor lahf_lm abm 3dnowprefetch invpcid_single pti fsgsbase tsc_adjust bmi1 avx2 smep bmi2 erms invpcid mpx avx512f avx512dq rdseed adx smap clflushopt clwb avx512cd avx512bw avx512vl xsaveopt xsavec xgetbv1 xsaves ida arat pku ospke avx512_vnni 2025-05-07T19:43:03.2663468Z bugs : cpu_meltdown spectre_v1 spectre_v2 spec_store_bypass l1tf mds swapgs itlb_multihit mmio_stale_data retbleed gds bhi 2025-05-07T19:43:03.2663561Z bogomips : 5999.98 2025-05-07T19:43:03.2663673Z clflush size : 64 2025-05-07T19:43:03.2663765Z cache_alignment : 64 2025-05-07T19:43:03.2663895Z address sizes : 46 bits physical, 48 bits virtual 2025-05-07T19:43:03.2664008Z power management: 2025-05-07T19:43:03.2664013Z 2025-05-07T19:43:03.2664098Z processor : 53 2025-05-07T19:43:03.2664192Z vendor_id : GenuineIntel 2025-05-07T19:43:03.2664303Z cpu family : 6 2025-05-07T19:43:03.2664389Z model : 85 2025-05-07T19:43:03.2664546Z model name : Intel(R) Xeon(R) Platinum 8275CL CPU @ 3.00GHz 2025-05-07T19:43:03.2664632Z stepping : 7 2025-05-07T19:43:03.2664746Z microcode : 0x5003901 2025-05-07T19:43:03.2664833Z cpu MHz : 2999.992 2025-05-07T19:43:03.2664921Z cache size : 36608 KB 2025-05-07T19:43:03.2665007Z physical id : 0 2025-05-07T19:43:03.2665112Z siblings : 48 2025-05-07T19:43:03.2665202Z core id : 5 2025-05-07T19:43:03.2665285Z cpu cores : 24 2025-05-07T19:43:03.2665460Z apicid : 11 2025-05-07T19:43:03.2665551Z initial apicid : 11 2025-05-07T19:43:03.2665635Z fpu : yes 2025-05-07T19:43:03.2665725Z fpu_exception : yes 2025-05-07T19:43:03.2665835Z cpuid level : 13 2025-05-07T19:43:03.2665917Z wp : yes 2025-05-07T19:43:03.2667901Z flags : fpu vme de pse tsc msr pae mce cx8 apic sep mtrr pge mca cmov pat pse36 clflush mmx fxsr sse sse2 ss ht syscall nx pdpe1gb rdtscp lm constant_tsc arch_perfmon rep_good nopl xtopology nonstop_tsc cpuid aperfmperf tsc_known_freq pni pclmulqdq monitor ssse3 fma cx16 pcid sse4_1 sse4_2 x2apic movbe popcnt tsc_deadline_timer aes xsave avx f16c rdrand hypervisor lahf_lm abm 3dnowprefetch invpcid_single pti fsgsbase tsc_adjust bmi1 avx2 smep bmi2 erms invpcid mpx avx512f avx512dq rdseed adx smap clflushopt clwb avx512cd avx512bw avx512vl xsaveopt xsavec xgetbv1 xsaves ida arat pku ospke avx512_vnni 2025-05-07T19:43:03.2668296Z bugs : cpu_meltdown spectre_v1 spectre_v2 spec_store_bypass l1tf mds swapgs itlb_multihit mmio_stale_data retbleed gds bhi 2025-05-07T19:43:03.2668388Z bogomips : 5999.98 2025-05-07T19:43:03.2668478Z clflush size : 64 2025-05-07T19:43:03.2668604Z cache_alignment : 64 2025-05-07T19:43:03.2668736Z address sizes : 46 bits physical, 48 bits virtual 2025-05-07T19:43:03.2668825Z power management: 2025-05-07T19:43:03.2668830Z 2025-05-07T19:43:03.2668940Z processor : 54 2025-05-07T19:43:03.2669039Z vendor_id : GenuineIntel 2025-05-07T19:43:03.2669123Z cpu family : 6 2025-05-07T19:43:03.2669204Z model : 85 2025-05-07T19:43:03.2669389Z model name : Intel(R) Xeon(R) Platinum 8275CL CPU @ 3.00GHz 2025-05-07T19:43:03.2669474Z stepping : 7 2025-05-07T19:43:03.2669564Z microcode : 0x5003901 2025-05-07T19:43:03.2669677Z cpu MHz : 2999.992 2025-05-07T19:43:03.2669766Z cache size : 36608 KB 2025-05-07T19:43:03.2669855Z physical id : 0 2025-05-07T19:43:03.2669942Z siblings : 48 2025-05-07T19:43:03.2670052Z core id : 6 2025-05-07T19:43:03.2670187Z cpu cores : 24 2025-05-07T19:43:03.2670271Z apicid : 13 2025-05-07T19:43:03.2670387Z initial apicid : 13 2025-05-07T19:43:03.2670469Z fpu : yes 2025-05-07T19:43:03.2670561Z fpu_exception : yes 2025-05-07T19:43:03.2670645Z cpuid level : 13 2025-05-07T19:43:03.2670751Z wp : yes 2025-05-07T19:43:03.2672722Z flags : fpu vme de pse tsc msr pae mce cx8 apic sep mtrr pge mca cmov pat pse36 clflush mmx fxsr sse sse2 ss ht syscall nx pdpe1gb rdtscp lm constant_tsc arch_perfmon rep_good nopl xtopology nonstop_tsc cpuid aperfmperf tsc_known_freq pni pclmulqdq monitor ssse3 fma cx16 pcid sse4_1 sse4_2 x2apic movbe popcnt tsc_deadline_timer aes xsave avx f16c rdrand hypervisor lahf_lm abm 3dnowprefetch invpcid_single pti fsgsbase tsc_adjust bmi1 avx2 smep bmi2 erms invpcid mpx avx512f avx512dq rdseed adx smap clflushopt clwb avx512cd avx512bw avx512vl xsaveopt xsavec xgetbv1 xsaves ida arat pku ospke avx512_vnni 2025-05-07T19:43:03.2673115Z bugs : cpu_meltdown spectre_v1 spectre_v2 spec_store_bypass l1tf mds swapgs itlb_multihit mmio_stale_data retbleed gds bhi 2025-05-07T19:43:03.2673209Z bogomips : 5999.98 2025-05-07T19:43:03.2673299Z clflush size : 64 2025-05-07T19:43:03.2673391Z cache_alignment : 64 2025-05-07T19:43:03.2673591Z address sizes : 46 bits physical, 48 bits virtual 2025-05-07T19:43:03.2673682Z power management: 2025-05-07T19:43:03.2673686Z 2025-05-07T19:43:03.2673772Z processor : 55 2025-05-07T19:43:03.2673897Z vendor_id : GenuineIntel 2025-05-07T19:43:03.2673990Z cpu family : 6 2025-05-07T19:43:03.2674076Z model : 85 2025-05-07T19:43:03.2674235Z model name : Intel(R) Xeon(R) Platinum 8275CL CPU @ 3.00GHz 2025-05-07T19:43:03.2674347Z stepping : 7 2025-05-07T19:43:03.2674437Z microcode : 0x5003901 2025-05-07T19:43:03.2674527Z cpu MHz : 3156.743 2025-05-07T19:43:03.2674643Z cache size : 36608 KB 2025-05-07T19:43:03.2674734Z physical id : 0 2025-05-07T19:43:03.2674827Z siblings : 48 2025-05-07T19:43:03.2674911Z core id : 7 2025-05-07T19:43:03.2675023Z cpu cores : 24 2025-05-07T19:43:03.2675115Z apicid : 15 2025-05-07T19:43:03.2675203Z initial apicid : 15 2025-05-07T19:43:03.2675363Z fpu : yes 2025-05-07T19:43:03.2675454Z fpu_exception : yes 2025-05-07T19:43:03.2675544Z cpuid level : 13 2025-05-07T19:43:03.2675629Z wp : yes 2025-05-07T19:43:03.2677640Z flags : fpu vme de pse tsc msr pae mce cx8 apic sep mtrr pge mca cmov pat pse36 clflush mmx fxsr sse sse2 ss ht syscall nx pdpe1gb rdtscp lm constant_tsc arch_perfmon rep_good nopl xtopology nonstop_tsc cpuid aperfmperf tsc_known_freq pni pclmulqdq monitor ssse3 fma cx16 pcid sse4_1 sse4_2 x2apic movbe popcnt tsc_deadline_timer aes xsave avx f16c rdrand hypervisor lahf_lm abm 3dnowprefetch invpcid_single pti fsgsbase tsc_adjust bmi1 avx2 smep bmi2 erms invpcid mpx avx512f avx512dq rdseed adx smap clflushopt clwb avx512cd avx512bw avx512vl xsaveopt xsavec xgetbv1 xsaves ida arat pku ospke avx512_vnni 2025-05-07T19:43:03.2678011Z bugs : cpu_meltdown spectre_v1 spectre_v2 spec_store_bypass l1tf mds swapgs itlb_multihit mmio_stale_data retbleed gds bhi 2025-05-07T19:43:03.2678122Z bogomips : 5999.98 2025-05-07T19:43:03.2678213Z clflush size : 64 2025-05-07T19:43:03.2678305Z cache_alignment : 64 2025-05-07T19:43:03.2678440Z address sizes : 46 bits physical, 48 bits virtual 2025-05-07T19:43:03.2678559Z power management: 2025-05-07T19:43:03.2678564Z 2025-05-07T19:43:03.2678652Z processor : 56 2025-05-07T19:43:03.2678750Z vendor_id : GenuineIntel 2025-05-07T19:43:03.2678856Z cpu family : 6 2025-05-07T19:43:03.2678940Z model : 85 2025-05-07T19:43:03.2679097Z model name : Intel(R) Xeon(R) Platinum 8275CL CPU @ 3.00GHz 2025-05-07T19:43:03.2679184Z stepping : 7 2025-05-07T19:43:03.2679304Z microcode : 0x5003901 2025-05-07T19:43:03.2679394Z cpu MHz : 2999.992 2025-05-07T19:43:03.2679485Z cache size : 36608 KB 2025-05-07T19:43:03.2679602Z physical id : 0 2025-05-07T19:43:03.2679692Z siblings : 48 2025-05-07T19:43:03.2679779Z core id : 8 2025-05-07T19:43:03.2679867Z cpu cores : 24 2025-05-07T19:43:03.2679985Z apicid : 17 2025-05-07T19:43:03.2680145Z initial apicid : 17 2025-05-07T19:43:03.2680235Z fpu : yes 2025-05-07T19:43:03.2680361Z fpu_exception : yes 2025-05-07T19:43:03.2680451Z cpuid level : 13 2025-05-07T19:43:03.2680541Z wp : yes 2025-05-07T19:43:03.2682547Z flags : fpu vme de pse tsc msr pae mce cx8 apic sep mtrr pge mca cmov pat pse36 clflush mmx fxsr sse sse2 ss ht syscall nx pdpe1gb rdtscp lm constant_tsc arch_perfmon rep_good nopl xtopology nonstop_tsc cpuid aperfmperf tsc_known_freq pni pclmulqdq monitor ssse3 fma cx16 pcid sse4_1 sse4_2 x2apic movbe popcnt tsc_deadline_timer aes xsave avx f16c rdrand hypervisor lahf_lm abm 3dnowprefetch invpcid_single pti fsgsbase tsc_adjust bmi1 avx2 smep bmi2 erms invpcid mpx avx512f avx512dq rdseed adx smap clflushopt clwb avx512cd avx512bw avx512vl xsaveopt xsavec xgetbv1 xsaves ida arat pku ospke avx512_vnni 2025-05-07T19:43:03.2682914Z bugs : cpu_meltdown spectre_v1 spectre_v2 spec_store_bypass l1tf mds swapgs itlb_multihit mmio_stale_data retbleed gds bhi 2025-05-07T19:43:03.2683012Z bogomips : 5999.98 2025-05-07T19:43:03.2683134Z clflush size : 64 2025-05-07T19:43:03.2683231Z cache_alignment : 64 2025-05-07T19:43:03.2683372Z address sizes : 46 bits physical, 48 bits virtual 2025-05-07T19:43:03.2683466Z power management: 2025-05-07T19:43:03.2683470Z 2025-05-07T19:43:03.2683592Z processor : 57 2025-05-07T19:43:03.2683690Z vendor_id : GenuineIntel 2025-05-07T19:43:03.2683779Z cpu family : 6 2025-05-07T19:43:03.2683896Z model : 85 2025-05-07T19:43:03.2684058Z model name : Intel(R) Xeon(R) Platinum 8275CL CPU @ 3.00GHz 2025-05-07T19:43:03.2684149Z stepping : 7 2025-05-07T19:43:03.2684242Z microcode : 0x5003901 2025-05-07T19:43:03.2684357Z cpu MHz : 2999.992 2025-05-07T19:43:03.2684450Z cache size : 36608 KB 2025-05-07T19:43:03.2684537Z physical id : 0 2025-05-07T19:43:03.2684656Z siblings : 48 2025-05-07T19:43:03.2684742Z core id : 9 2025-05-07T19:43:03.2684832Z cpu cores : 24 2025-05-07T19:43:03.2684923Z apicid : 19 2025-05-07T19:43:03.2685045Z initial apicid : 19 2025-05-07T19:43:03.2685135Z fpu : yes 2025-05-07T19:43:03.2685228Z fpu_exception : yes 2025-05-07T19:43:03.2685319Z cpuid level : 13 2025-05-07T19:43:03.2685475Z wp : yes 2025-05-07T19:43:03.2687454Z flags : fpu vme de pse tsc msr pae mce cx8 apic sep mtrr pge mca cmov pat pse36 clflush mmx fxsr sse sse2 ss ht syscall nx pdpe1gb rdtscp lm constant_tsc arch_perfmon rep_good nopl xtopology nonstop_tsc cpuid aperfmperf tsc_known_freq pni pclmulqdq monitor ssse3 fma cx16 pcid sse4_1 sse4_2 x2apic movbe popcnt tsc_deadline_timer aes xsave avx f16c rdrand hypervisor lahf_lm abm 3dnowprefetch invpcid_single pti fsgsbase tsc_adjust bmi1 avx2 smep bmi2 erms invpcid mpx avx512f avx512dq rdseed adx smap clflushopt clwb avx512cd avx512bw avx512vl xsaveopt xsavec xgetbv1 xsaves ida arat pku ospke avx512_vnni 2025-05-07T19:43:03.2687845Z bugs : cpu_meltdown spectre_v1 spectre_v2 spec_store_bypass l1tf mds swapgs itlb_multihit mmio_stale_data retbleed gds bhi 2025-05-07T19:43:03.2687936Z bogomips : 5999.98 2025-05-07T19:43:03.2688025Z clflush size : 64 2025-05-07T19:43:03.2688146Z cache_alignment : 64 2025-05-07T19:43:03.2688276Z address sizes : 46 bits physical, 48 bits virtual 2025-05-07T19:43:03.2688368Z power management: 2025-05-07T19:43:03.2688372Z 2025-05-07T19:43:03.2688458Z processor : 58 2025-05-07T19:43:03.2688577Z vendor_id : GenuineIntel 2025-05-07T19:43:03.2688664Z cpu family : 6 2025-05-07T19:43:03.2688747Z model : 85 2025-05-07T19:43:03.2688931Z model name : Intel(R) Xeon(R) Platinum 8275CL CPU @ 3.00GHz 2025-05-07T19:43:03.2689018Z stepping : 7 2025-05-07T19:43:03.2689112Z microcode : 0x5003901 2025-05-07T19:43:03.2689205Z cpu MHz : 3237.096 2025-05-07T19:43:03.2689324Z cache size : 36608 KB 2025-05-07T19:43:03.2689414Z physical id : 0 2025-05-07T19:43:03.2689500Z siblings : 48 2025-05-07T19:43:03.2689602Z core id : 10 2025-05-07T19:43:03.2689686Z cpu cores : 24 2025-05-07T19:43:03.2689772Z apicid : 21 2025-05-07T19:43:03.2689860Z initial apicid : 21 2025-05-07T19:43:03.2689965Z fpu : yes 2025-05-07T19:43:03.2690057Z fpu_exception : yes 2025-05-07T19:43:03.2690277Z cpuid level : 13 2025-05-07T19:43:03.2690363Z wp : yes 2025-05-07T19:43:03.2692701Z flags : fpu vme de pse tsc msr pae mce cx8 apic sep mtrr pge mca cmov pat pse36 clflush mmx fxsr sse sse2 ss ht syscall nx pdpe1gb rdtscp lm constant_tsc arch_perfmon rep_good nopl xtopology nonstop_tsc cpuid aperfmperf tsc_known_freq pni pclmulqdq monitor ssse3 fma cx16 pcid sse4_1 sse4_2 x2apic movbe popcnt tsc_deadline_timer aes xsave avx f16c rdrand hypervisor lahf_lm abm 3dnowprefetch invpcid_single pti fsgsbase tsc_adjust bmi1 avx2 smep bmi2 erms invpcid mpx avx512f avx512dq rdseed adx smap clflushopt clwb avx512cd avx512bw avx512vl xsaveopt xsavec xgetbv1 xsaves ida arat pku ospke avx512_vnni 2025-05-07T19:43:03.2693095Z bugs : cpu_meltdown spectre_v1 spectre_v2 spec_store_bypass l1tf mds swapgs itlb_multihit mmio_stale_data retbleed gds bhi 2025-05-07T19:43:03.2693217Z bogomips : 5999.98 2025-05-07T19:43:03.2693310Z clflush size : 64 2025-05-07T19:43:03.2693407Z cache_alignment : 64 2025-05-07T19:43:03.2693552Z address sizes : 46 bits physical, 48 bits virtual 2025-05-07T19:43:03.2693675Z power management: 2025-05-07T19:43:03.2693679Z 2025-05-07T19:43:03.2693771Z processor : 59 2025-05-07T19:43:03.2693873Z vendor_id : GenuineIntel 2025-05-07T19:43:03.2693990Z cpu family : 6 2025-05-07T19:43:03.2694082Z model : 85 2025-05-07T19:43:03.2694253Z model name : Intel(R) Xeon(R) Platinum 8275CL CPU @ 3.00GHz 2025-05-07T19:43:03.2710404Z stepping : 7 2025-05-07T19:43:03.2710549Z microcode : 0x5003901 2025-05-07T19:43:03.2710646Z cpu MHz : 3236.824 2025-05-07T19:43:03.2710723Z cache size : 36608 KB 2025-05-07T19:43:03.2710805Z physical id : 0 2025-05-07T19:43:03.2710890Z siblings : 48 2025-05-07T19:43:03.2710959Z core id : 11 2025-05-07T19:43:03.2711035Z cpu cores : 24 2025-05-07T19:43:03.2711114Z apicid : 23 2025-05-07T19:43:03.2711206Z initial apicid : 23 2025-05-07T19:43:03.2711280Z fpu : yes 2025-05-07T19:43:03.2711363Z fpu_exception : yes 2025-05-07T19:43:03.2711459Z cpuid level : 13 2025-05-07T19:43:03.2711542Z wp : yes 2025-05-07T19:43:03.2713543Z flags : fpu vme de pse tsc msr pae mce cx8 apic sep mtrr pge mca cmov pat pse36 clflush mmx fxsr sse sse2 ss ht syscall nx pdpe1gb rdtscp lm constant_tsc arch_perfmon rep_good nopl xtopology nonstop_tsc cpuid aperfmperf tsc_known_freq pni pclmulqdq monitor ssse3 fma cx16 pcid sse4_1 sse4_2 x2apic movbe popcnt tsc_deadline_timer aes xsave avx f16c rdrand hypervisor lahf_lm abm 3dnowprefetch invpcid_single pti fsgsbase tsc_adjust bmi1 avx2 smep bmi2 erms invpcid mpx avx512f avx512dq rdseed adx smap clflushopt clwb avx512cd avx512bw avx512vl xsaveopt xsavec xgetbv1 xsaves ida arat pku ospke avx512_vnni 2025-05-07T19:43:03.2714059Z bugs : cpu_meltdown spectre_v1 spectre_v2 spec_store_bypass l1tf mds swapgs itlb_multihit mmio_stale_data retbleed gds bhi 2025-05-07T19:43:03.2714147Z bogomips : 5999.98 2025-05-07T19:43:03.2714227Z clflush size : 64 2025-05-07T19:43:03.2714329Z cache_alignment : 64 2025-05-07T19:43:03.2714461Z address sizes : 46 bits physical, 48 bits virtual 2025-05-07T19:43:03.2714547Z power management: 2025-05-07T19:43:03.2714557Z 2025-05-07T19:43:03.2714635Z processor : 60 2025-05-07T19:43:03.2714735Z vendor_id : GenuineIntel 2025-05-07T19:43:03.2714812Z cpu family : 6 2025-05-07T19:43:03.2714886Z model : 85 2025-05-07T19:43:03.2715061Z model name : Intel(R) Xeon(R) Platinum 8275CL CPU @ 3.00GHz 2025-05-07T19:43:03.2715131Z stepping : 7 2025-05-07T19:43:03.2715207Z microcode : 0x5003901 2025-05-07T19:43:03.2715286Z cpu MHz : 3234.113 2025-05-07T19:43:03.2715378Z cache size : 36608 KB 2025-05-07T19:43:03.2715461Z physical id : 0 2025-05-07T19:43:03.2715537Z siblings : 48 2025-05-07T19:43:03.2715619Z core id : 12 2025-05-07T19:43:03.2715694Z cpu cores : 24 2025-05-07T19:43:03.2715765Z apicid : 25 2025-05-07T19:43:03.2715842Z initial apicid : 25 2025-05-07T19:43:03.2715925Z fpu : yes 2025-05-07T19:43:03.2716006Z fpu_exception : yes 2025-05-07T19:43:03.2716083Z cpuid level : 13 2025-05-07T19:43:03.2716151Z wp : yes 2025-05-07T19:43:03.2718194Z flags : fpu vme de pse tsc msr pae mce cx8 apic sep mtrr pge mca cmov pat pse36 clflush mmx fxsr sse sse2 ss ht syscall nx pdpe1gb rdtscp lm constant_tsc arch_perfmon rep_good nopl xtopology nonstop_tsc cpuid aperfmperf tsc_known_freq pni pclmulqdq monitor ssse3 fma cx16 pcid sse4_1 sse4_2 x2apic movbe popcnt tsc_deadline_timer aes xsave avx f16c rdrand hypervisor lahf_lm abm 3dnowprefetch invpcid_single pti fsgsbase tsc_adjust bmi1 avx2 smep bmi2 erms invpcid mpx avx512f avx512dq rdseed adx smap clflushopt clwb avx512cd avx512bw avx512vl xsaveopt xsavec xgetbv1 xsaves ida arat pku ospke avx512_vnni 2025-05-07T19:43:03.2718556Z bugs : cpu_meltdown spectre_v1 spectre_v2 spec_store_bypass l1tf mds swapgs itlb_multihit mmio_stale_data retbleed gds bhi 2025-05-07T19:43:03.2718654Z bogomips : 5999.98 2025-05-07T19:43:03.2718728Z clflush size : 64 2025-05-07T19:43:03.2718813Z cache_alignment : 64 2025-05-07T19:43:03.2718962Z address sizes : 46 bits physical, 48 bits virtual 2025-05-07T19:43:03.2719046Z power management: 2025-05-07T19:43:03.2719051Z 2025-05-07T19:43:03.2719133Z processor : 61 2025-05-07T19:43:03.2719226Z vendor_id : GenuineIntel 2025-05-07T19:43:03.2719326Z cpu family : 6 2025-05-07T19:43:03.2719397Z model : 85 2025-05-07T19:43:03.2719552Z model name : Intel(R) Xeon(R) Platinum 8275CL CPU @ 3.00GHz 2025-05-07T19:43:03.2719652Z stepping : 7 2025-05-07T19:43:03.2719735Z microcode : 0x5003901 2025-05-07T19:43:03.2719808Z cpu MHz : 3237.936 2025-05-07T19:43:03.2719879Z cache size : 36608 KB 2025-05-07T19:43:03.2719969Z physical id : 0 2025-05-07T19:43:03.2720043Z siblings : 48 2025-05-07T19:43:03.2720119Z core id : 13 2025-05-07T19:43:03.2720210Z cpu cores : 24 2025-05-07T19:43:03.2720282Z apicid : 27 2025-05-07T19:43:03.2720367Z initial apicid : 27 2025-05-07T19:43:03.2720441Z fpu : yes 2025-05-07T19:43:03.2720538Z fpu_exception : yes 2025-05-07T19:43:03.2720616Z cpuid level : 13 2025-05-07T19:43:03.2720689Z wp : yes 2025-05-07T19:43:03.2722680Z flags : fpu vme de pse tsc msr pae mce cx8 apic sep mtrr pge mca cmov pat pse36 clflush mmx fxsr sse sse2 ss ht syscall nx pdpe1gb rdtscp lm constant_tsc arch_perfmon rep_good nopl xtopology nonstop_tsc cpuid aperfmperf tsc_known_freq pni pclmulqdq monitor ssse3 fma cx16 pcid sse4_1 sse4_2 x2apic movbe popcnt tsc_deadline_timer aes xsave avx f16c rdrand hypervisor lahf_lm abm 3dnowprefetch invpcid_single pti fsgsbase tsc_adjust bmi1 avx2 smep bmi2 erms invpcid mpx avx512f avx512dq rdseed adx smap clflushopt clwb avx512cd avx512bw avx512vl xsaveopt xsavec xgetbv1 xsaves ida arat pku ospke avx512_vnni 2025-05-07T19:43:03.2723091Z bugs : cpu_meltdown spectre_v1 spectre_v2 spec_store_bypass l1tf mds swapgs itlb_multihit mmio_stale_data retbleed gds bhi 2025-05-07T19:43:03.2723168Z bogomips : 5999.98 2025-05-07T19:43:03.2723262Z clflush size : 64 2025-05-07T19:43:03.2723346Z cache_alignment : 64 2025-05-07T19:43:03.2723468Z address sizes : 46 bits physical, 48 bits virtual 2025-05-07T19:43:03.2723557Z power management: 2025-05-07T19:43:03.2723562Z 2025-05-07T19:43:03.2723646Z processor : 62 2025-05-07T19:43:03.2723730Z vendor_id : GenuineIntel 2025-05-07T19:43:03.2723813Z cpu family : 6 2025-05-07T19:43:03.2723903Z model : 85 2025-05-07T19:43:03.2724054Z model name : Intel(R) Xeon(R) Platinum 8275CL CPU @ 3.00GHz 2025-05-07T19:43:03.2724126Z stepping : 7 2025-05-07T19:43:03.2724211Z microcode : 0x5003901 2025-05-07T19:43:03.2724289Z cpu MHz : 3288.597 2025-05-07T19:43:03.2724362Z cache size : 36608 KB 2025-05-07T19:43:03.2724435Z physical id : 0 2025-05-07T19:43:03.2724513Z siblings : 48 2025-05-07T19:43:03.2724583Z core id : 14 2025-05-07T19:43:03.2724653Z cpu cores : 24 2025-05-07T19:43:03.2724720Z apicid : 29 2025-05-07T19:43:03.2724803Z initial apicid : 29 2025-05-07T19:43:03.2724872Z fpu : yes 2025-05-07T19:43:03.2724947Z fpu_exception : yes 2025-05-07T19:43:03.2725030Z cpuid level : 13 2025-05-07T19:43:03.2725098Z wp : yes 2025-05-07T19:43:03.2727124Z flags : fpu vme de pse tsc msr pae mce cx8 apic sep mtrr pge mca cmov pat pse36 clflush mmx fxsr sse sse2 ss ht syscall nx pdpe1gb rdtscp lm constant_tsc arch_perfmon rep_good nopl xtopology nonstop_tsc cpuid aperfmperf tsc_known_freq pni pclmulqdq monitor ssse3 fma cx16 pcid sse4_1 sse4_2 x2apic movbe popcnt tsc_deadline_timer aes xsave avx f16c rdrand hypervisor lahf_lm abm 3dnowprefetch invpcid_single pti fsgsbase tsc_adjust bmi1 avx2 smep bmi2 erms invpcid mpx avx512f avx512dq rdseed adx smap clflushopt clwb avx512cd avx512bw avx512vl xsaveopt xsavec xgetbv1 xsaves ida arat pku ospke avx512_vnni 2025-05-07T19:43:03.2727493Z bugs : cpu_meltdown spectre_v1 spectre_v2 spec_store_bypass l1tf mds swapgs itlb_multihit mmio_stale_data retbleed gds bhi 2025-05-07T19:43:03.2727564Z bogomips : 5999.98 2025-05-07T19:43:03.2727634Z clflush size : 64 2025-05-07T19:43:03.2727715Z cache_alignment : 64 2025-05-07T19:43:03.2727831Z address sizes : 46 bits physical, 48 bits virtual 2025-05-07T19:43:03.2727908Z power management: 2025-05-07T19:43:03.2727912Z 2025-05-07T19:43:03.2727992Z processor : 63 2025-05-07T19:43:03.2728074Z vendor_id : GenuineIntel 2025-05-07T19:43:03.2728144Z cpu family : 6 2025-05-07T19:43:03.2728217Z model : 85 2025-05-07T19:43:03.2728371Z model name : Intel(R) Xeon(R) Platinum 8275CL CPU @ 3.00GHz 2025-05-07T19:43:03.2728660Z stepping : 7 2025-05-07T19:43:03.2728737Z microcode : 0x5003901 2025-05-07T19:43:03.2728807Z cpu MHz : 2999.992 2025-05-07T19:43:03.2728892Z cache size : 36608 KB 2025-05-07T19:43:03.2729135Z physical id : 0 2025-05-07T19:43:03.2729212Z siblings : 48 2025-05-07T19:43:03.2729292Z core id : 15 2025-05-07T19:43:03.2729364Z cpu cores : 24 2025-05-07T19:43:03.2729439Z apicid : 31 2025-05-07T19:43:03.2729518Z initial apicid : 31 2025-05-07T19:43:03.2729599Z fpu : yes 2025-05-07T19:43:03.2729701Z fpu_exception : yes 2025-05-07T19:43:03.2729781Z cpuid level : 13 2025-05-07T19:43:03.2729858Z wp : yes 2025-05-07T19:43:03.2732069Z flags : fpu vme de pse tsc msr pae mce cx8 apic sep mtrr pge mca cmov pat pse36 clflush mmx fxsr sse sse2 ss ht syscall nx pdpe1gb rdtscp lm constant_tsc arch_perfmon rep_good nopl xtopology nonstop_tsc cpuid aperfmperf tsc_known_freq pni pclmulqdq monitor ssse3 fma cx16 pcid sse4_1 sse4_2 x2apic movbe popcnt tsc_deadline_timer aes xsave avx f16c rdrand hypervisor lahf_lm abm 3dnowprefetch invpcid_single pti fsgsbase tsc_adjust bmi1 avx2 smep bmi2 erms invpcid mpx avx512f avx512dq rdseed adx smap clflushopt clwb avx512cd avx512bw avx512vl xsaveopt xsavec xgetbv1 xsaves ida arat pku ospke avx512_vnni 2025-05-07T19:43:03.2732706Z bugs : cpu_meltdown spectre_v1 spectre_v2 spec_store_bypass l1tf mds swapgs itlb_multihit mmio_stale_data retbleed gds bhi 2025-05-07T19:43:03.2732793Z bogomips : 5999.98 2025-05-07T19:43:03.2732877Z clflush size : 64 2025-05-07T19:43:03.2732961Z cache_alignment : 64 2025-05-07T19:43:03.2733095Z address sizes : 46 bits physical, 48 bits virtual 2025-05-07T19:43:03.2733174Z power management: 2025-05-07T19:43:03.2733179Z 2025-05-07T19:43:03.2733256Z processor : 64 2025-05-07T19:43:03.2733343Z vendor_id : GenuineIntel 2025-05-07T19:43:03.2733445Z cpu family : 6 2025-05-07T19:43:03.2733520Z model : 85 2025-05-07T19:43:03.2733678Z model name : Intel(R) Xeon(R) Platinum 8275CL CPU @ 3.00GHz 2025-05-07T19:43:03.2733769Z stepping : 7 2025-05-07T19:43:03.2733850Z microcode : 0x5003901 2025-05-07T19:43:03.2733924Z cpu MHz : 3786.394 2025-05-07T19:43:03.2734002Z cache size : 36608 KB 2025-05-07T19:43:03.2734087Z physical id : 0 2025-05-07T19:43:03.2734160Z siblings : 48 2025-05-07T19:43:03.2734234Z core id : 16 2025-05-07T19:43:03.2734316Z cpu cores : 24 2025-05-07T19:43:03.2734390Z apicid : 33 2025-05-07T19:43:03.2734471Z initial apicid : 33 2025-05-07T19:43:03.2734544Z fpu : yes 2025-05-07T19:43:03.2734643Z fpu_exception : yes 2025-05-07T19:43:03.2734716Z cpuid level : 13 2025-05-07T19:43:03.2734786Z wp : yes 2025-05-07T19:43:03.2737018Z flags : fpu vme de pse tsc msr pae mce cx8 apic sep mtrr pge mca cmov pat pse36 clflush mmx fxsr sse sse2 ss ht syscall nx pdpe1gb rdtscp lm constant_tsc arch_perfmon rep_good nopl xtopology nonstop_tsc cpuid aperfmperf tsc_known_freq pni pclmulqdq monitor ssse3 fma cx16 pcid sse4_1 sse4_2 x2apic movbe popcnt tsc_deadline_timer aes xsave avx f16c rdrand hypervisor lahf_lm abm 3dnowprefetch invpcid_single pti fsgsbase tsc_adjust bmi1 avx2 smep bmi2 erms invpcid mpx avx512f avx512dq rdseed adx smap clflushopt clwb avx512cd avx512bw avx512vl xsaveopt xsavec xgetbv1 xsaves ida arat pku ospke avx512_vnni 2025-05-07T19:43:03.2737402Z bugs : cpu_meltdown spectre_v1 spectre_v2 spec_store_bypass l1tf mds swapgs itlb_multihit mmio_stale_data retbleed gds bhi 2025-05-07T19:43:03.2737481Z bogomips : 5999.98 2025-05-07T19:43:03.2737568Z clflush size : 64 2025-05-07T19:43:03.2737648Z cache_alignment : 64 2025-05-07T19:43:03.2737774Z address sizes : 46 bits physical, 48 bits virtual 2025-05-07T19:43:03.2737866Z power management: 2025-05-07T19:43:03.2737871Z 2025-05-07T19:43:03.2737955Z processor : 65 2025-05-07T19:43:03.2738040Z vendor_id : GenuineIntel 2025-05-07T19:43:03.2738115Z cpu family : 6 2025-05-07T19:43:03.2738194Z model : 85 2025-05-07T19:43:03.2738350Z model name : Intel(R) Xeon(R) Platinum 8275CL CPU @ 3.00GHz 2025-05-07T19:43:03.2738430Z stepping : 7 2025-05-07T19:43:03.2738518Z microcode : 0x5003901 2025-05-07T19:43:03.2738595Z cpu MHz : 3239.649 2025-05-07T19:43:03.2738670Z cache size : 36608 KB 2025-05-07T19:43:03.2738744Z physical id : 0 2025-05-07T19:43:03.2738826Z siblings : 48 2025-05-07T19:43:03.2738897Z core id : 17 2025-05-07T19:43:03.2738972Z cpu cores : 24 2025-05-07T19:43:03.2739058Z apicid : 35 2025-05-07T19:43:03.2739137Z initial apicid : 35 2025-05-07T19:43:03.2739209Z fpu : yes 2025-05-07T19:43:03.2739290Z fpu_exception : yes 2025-05-07T19:43:03.2739369Z cpuid level : 13 2025-05-07T19:43:03.2739441Z wp : yes 2025-05-07T19:43:03.2741569Z flags : fpu vme de pse tsc msr pae mce cx8 apic sep mtrr pge mca cmov pat pse36 clflush mmx fxsr sse sse2 ss ht syscall nx pdpe1gb rdtscp lm constant_tsc arch_perfmon rep_good nopl xtopology nonstop_tsc cpuid aperfmperf tsc_known_freq pni pclmulqdq monitor ssse3 fma cx16 pcid sse4_1 sse4_2 x2apic movbe popcnt tsc_deadline_timer aes xsave avx f16c rdrand hypervisor lahf_lm abm 3dnowprefetch invpcid_single pti fsgsbase tsc_adjust bmi1 avx2 smep bmi2 erms invpcid mpx avx512f avx512dq rdseed adx smap clflushopt clwb avx512cd avx512bw avx512vl xsaveopt xsavec xgetbv1 xsaves ida arat pku ospke avx512_vnni 2025-05-07T19:43:03.2742012Z bugs : cpu_meltdown spectre_v1 spectre_v2 spec_store_bypass l1tf mds swapgs itlb_multihit mmio_stale_data retbleed gds bhi 2025-05-07T19:43:03.2742093Z bogomips : 5999.98 2025-05-07T19:43:03.2742173Z clflush size : 64 2025-05-07T19:43:03.2742258Z cache_alignment : 64 2025-05-07T19:43:03.2742386Z address sizes : 46 bits physical, 48 bits virtual 2025-05-07T19:43:03.2742467Z power management: 2025-05-07T19:43:03.2742471Z 2025-05-07T19:43:03.2742558Z processor : 66 2025-05-07T19:43:03.2742755Z vendor_id : GenuineIntel 2025-05-07T19:43:03.2742828Z cpu family : 6 2025-05-07T19:43:03.2742899Z model : 85 2025-05-07T19:43:03.2743060Z model name : Intel(R) Xeon(R) Platinum 8275CL CPU @ 3.00GHz 2025-05-07T19:43:03.2743136Z stepping : 7 2025-05-07T19:43:03.2743215Z microcode : 0x5003901 2025-05-07T19:43:03.2743295Z cpu MHz : 3244.043 2025-05-07T19:43:03.2743370Z cache size : 36608 KB 2025-05-07T19:43:03.2743447Z physical id : 0 2025-05-07T19:43:03.2743518Z siblings : 48 2025-05-07T19:43:03.2743599Z core id : 18 2025-05-07T19:43:03.2743675Z cpu cores : 24 2025-05-07T19:43:03.2743746Z apicid : 37 2025-05-07T19:43:03.2743833Z initial apicid : 37 2025-05-07T19:43:03.2743904Z fpu : yes 2025-05-07T19:43:03.2743981Z fpu_exception : yes 2025-05-07T19:43:03.2744054Z cpuid level : 13 2025-05-07T19:43:03.2744129Z wp : yes 2025-05-07T19:43:03.2746241Z flags : fpu vme de pse tsc msr pae mce cx8 apic sep mtrr pge mca cmov pat pse36 clflush mmx fxsr sse sse2 ss ht syscall nx pdpe1gb rdtscp lm constant_tsc arch_perfmon rep_good nopl xtopology nonstop_tsc cpuid aperfmperf tsc_known_freq pni pclmulqdq monitor ssse3 fma cx16 pcid sse4_1 sse4_2 x2apic movbe popcnt tsc_deadline_timer aes xsave avx f16c rdrand hypervisor lahf_lm abm 3dnowprefetch invpcid_single pti fsgsbase tsc_adjust bmi1 avx2 smep bmi2 erms invpcid mpx avx512f avx512dq rdseed adx smap clflushopt clwb avx512cd avx512bw avx512vl xsaveopt xsavec xgetbv1 xsaves ida arat pku ospke avx512_vnni 2025-05-07T19:43:03.2746624Z bugs : cpu_meltdown spectre_v1 spectre_v2 spec_store_bypass l1tf mds swapgs itlb_multihit mmio_stale_data retbleed gds bhi 2025-05-07T19:43:03.2746699Z bogomips : 5999.98 2025-05-07T19:43:03.2746776Z clflush size : 64 2025-05-07T19:43:03.2746859Z cache_alignment : 64 2025-05-07T19:43:03.2746988Z address sizes : 46 bits physical, 48 bits virtual 2025-05-07T19:43:03.2747065Z power management: 2025-05-07T19:43:03.2747070Z 2025-05-07T19:43:03.2747146Z processor : 67 2025-05-07T19:43:03.2747238Z vendor_id : GenuineIntel 2025-05-07T19:43:03.2747310Z cpu family : 6 2025-05-07T19:43:03.2747384Z model : 85 2025-05-07T19:43:03.2747537Z model name : Intel(R) Xeon(R) Platinum 8275CL CPU @ 3.00GHz 2025-05-07T19:43:03.2747621Z stepping : 7 2025-05-07T19:43:03.2747703Z microcode : 0x5003901 2025-05-07T19:43:03.2747781Z cpu MHz : 3274.077 2025-05-07T19:43:03.2747866Z cache size : 36608 KB 2025-05-07T19:43:03.2747946Z physical id : 0 2025-05-07T19:43:03.2748019Z siblings : 48 2025-05-07T19:43:03.2748091Z core id : 19 2025-05-07T19:43:03.2748171Z cpu cores : 24 2025-05-07T19:43:03.2748243Z apicid : 39 2025-05-07T19:43:03.2748319Z initial apicid : 39 2025-05-07T19:43:03.2748397Z fpu : yes 2025-05-07T19:43:03.2748475Z fpu_exception : yes 2025-05-07T19:43:03.2748549Z cpuid level : 13 2025-05-07T19:43:03.2748618Z wp : yes 2025-05-07T19:43:03.2750708Z flags : fpu vme de pse tsc msr pae mce cx8 apic sep mtrr pge mca cmov pat pse36 clflush mmx fxsr sse sse2 ss ht syscall nx pdpe1gb rdtscp lm constant_tsc arch_perfmon rep_good nopl xtopology nonstop_tsc cpuid aperfmperf tsc_known_freq pni pclmulqdq monitor ssse3 fma cx16 pcid sse4_1 sse4_2 x2apic movbe popcnt tsc_deadline_timer aes xsave avx f16c rdrand hypervisor lahf_lm abm 3dnowprefetch invpcid_single pti fsgsbase tsc_adjust bmi1 avx2 smep bmi2 erms invpcid mpx avx512f avx512dq rdseed adx smap clflushopt clwb avx512cd avx512bw avx512vl xsaveopt xsavec xgetbv1 xsaves ida arat pku ospke avx512_vnni 2025-05-07T19:43:03.2751142Z bugs : cpu_meltdown spectre_v1 spectre_v2 spec_store_bypass l1tf mds swapgs itlb_multihit mmio_stale_data retbleed gds bhi 2025-05-07T19:43:03.2751226Z bogomips : 5999.98 2025-05-07T19:43:03.2751302Z clflush size : 64 2025-05-07T19:43:03.2751382Z cache_alignment : 64 2025-05-07T19:43:03.2751504Z address sizes : 46 bits physical, 48 bits virtual 2025-05-07T19:43:03.2751588Z power management: 2025-05-07T19:43:03.2751592Z 2025-05-07T19:43:03.2751668Z processor : 68 2025-05-07T19:43:03.2751752Z vendor_id : GenuineIntel 2025-05-07T19:43:03.2751829Z cpu family : 6 2025-05-07T19:43:03.2751901Z model : 85 2025-05-07T19:43:03.2752050Z model name : Intel(R) Xeon(R) Platinum 8275CL CPU @ 3.00GHz 2025-05-07T19:43:03.2752121Z stepping : 7 2025-05-07T19:43:03.2752208Z microcode : 0x5003901 2025-05-07T19:43:03.2752280Z cpu MHz : 3228.933 2025-05-07T19:43:03.2752360Z cache size : 36608 KB 2025-05-07T19:43:03.2752440Z physical id : 0 2025-05-07T19:43:03.2752514Z siblings : 48 2025-05-07T19:43:03.2752583Z core id : 20 2025-05-07T19:43:03.2752653Z cpu cores : 24 2025-05-07T19:43:03.2752733Z apicid : 41 2025-05-07T19:43:03.2752811Z initial apicid : 41 2025-05-07T19:43:03.2752880Z fpu : yes 2025-05-07T19:43:03.2752957Z fpu_exception : yes 2025-05-07T19:43:03.2753036Z cpuid level : 13 2025-05-07T19:43:03.2753103Z wp : yes 2025-05-07T19:43:03.2755220Z flags : fpu vme de pse tsc msr pae mce cx8 apic sep mtrr pge mca cmov pat pse36 clflush mmx fxsr sse sse2 ss ht syscall nx pdpe1gb rdtscp lm constant_tsc arch_perfmon rep_good nopl xtopology nonstop_tsc cpuid aperfmperf tsc_known_freq pni pclmulqdq monitor ssse3 fma cx16 pcid sse4_1 sse4_2 x2apic movbe popcnt tsc_deadline_timer aes xsave avx f16c rdrand hypervisor lahf_lm abm 3dnowprefetch invpcid_single pti fsgsbase tsc_adjust bmi1 avx2 smep bmi2 erms invpcid mpx avx512f avx512dq rdseed adx smap clflushopt clwb avx512cd avx512bw avx512vl xsaveopt xsavec xgetbv1 xsaves ida arat pku ospke avx512_vnni 2025-05-07T19:43:03.2755608Z bugs : cpu_meltdown spectre_v1 spectre_v2 spec_store_bypass l1tf mds swapgs itlb_multihit mmio_stale_data retbleed gds bhi 2025-05-07T19:43:03.2755685Z bogomips : 5999.98 2025-05-07T19:43:03.2755760Z clflush size : 64 2025-05-07T19:43:03.2755843Z cache_alignment : 64 2025-05-07T19:43:03.2755964Z address sizes : 46 bits physical, 48 bits virtual 2025-05-07T19:43:03.2756042Z power management: 2025-05-07T19:43:03.2756046Z 2025-05-07T19:43:03.2756130Z processor : 69 2025-05-07T19:43:03.2756212Z vendor_id : GenuineIntel 2025-05-07T19:43:03.2756286Z cpu family : 6 2025-05-07T19:43:03.2756360Z model : 85 2025-05-07T19:43:03.2756508Z model name : Intel(R) Xeon(R) Platinum 8275CL CPU @ 3.00GHz 2025-05-07T19:43:03.2756582Z stepping : 7 2025-05-07T19:43:03.2756658Z microcode : 0x5003901 2025-05-07T19:43:03.2756742Z cpu MHz : 2999.992 2025-05-07T19:43:03.2756820Z cache size : 36608 KB 2025-05-07T19:43:03.2756900Z physical id : 0 2025-05-07T19:43:03.2756971Z siblings : 48 2025-05-07T19:43:03.2757047Z core id : 21 2025-05-07T19:43:03.2757121Z cpu cores : 24 2025-05-07T19:43:03.2757193Z apicid : 43 2025-05-07T19:43:03.2757278Z initial apicid : 43 2025-05-07T19:43:03.2757348Z fpu : yes 2025-05-07T19:43:03.2757423Z fpu_exception : yes 2025-05-07T19:43:03.2757499Z cpuid level : 13 2025-05-07T19:43:03.2757571Z wp : yes 2025-05-07T19:43:03.2759637Z flags : fpu vme de pse tsc msr pae mce cx8 apic sep mtrr pge mca cmov pat pse36 clflush mmx fxsr sse sse2 ss ht syscall nx pdpe1gb rdtscp lm constant_tsc arch_perfmon rep_good nopl xtopology nonstop_tsc cpuid aperfmperf tsc_known_freq pni pclmulqdq monitor ssse3 fma cx16 pcid sse4_1 sse4_2 x2apic movbe popcnt tsc_deadline_timer aes xsave avx f16c rdrand hypervisor lahf_lm abm 3dnowprefetch invpcid_single pti fsgsbase tsc_adjust bmi1 avx2 smep bmi2 erms invpcid mpx avx512f avx512dq rdseed adx smap clflushopt clwb avx512cd avx512bw avx512vl xsaveopt xsavec xgetbv1 xsaves ida arat pku ospke avx512_vnni 2025-05-07T19:43:03.2760068Z bugs : cpu_meltdown spectre_v1 spectre_v2 spec_store_bypass l1tf mds swapgs itlb_multihit mmio_stale_data retbleed gds bhi 2025-05-07T19:43:03.2760142Z bogomips : 5999.98 2025-05-07T19:43:03.2760213Z clflush size : 64 2025-05-07T19:43:03.2760291Z cache_alignment : 64 2025-05-07T19:43:03.2760421Z address sizes : 46 bits physical, 48 bits virtual 2025-05-07T19:43:03.2760498Z power management: 2025-05-07T19:43:03.2760502Z 2025-05-07T19:43:03.2760573Z processor : 70 2025-05-07T19:43:03.2760662Z vendor_id : GenuineIntel 2025-05-07T19:43:03.2760735Z cpu family : 6 2025-05-07T19:43:03.2760802Z model : 85 2025-05-07T19:43:03.2760955Z model name : Intel(R) Xeon(R) Platinum 8275CL CPU @ 3.00GHz 2025-05-07T19:43:03.2761031Z stepping : 7 2025-05-07T19:43:03.2761109Z microcode : 0x5003901 2025-05-07T19:43:03.2761188Z cpu MHz : 3334.614 2025-05-07T19:43:03.2761271Z cache size : 36608 KB 2025-05-07T19:43:03.2761343Z physical id : 0 2025-05-07T19:43:03.2761416Z siblings : 48 2025-05-07T19:43:03.2761486Z core id : 22 2025-05-07T19:43:03.2761569Z cpu cores : 24 2025-05-07T19:43:03.2761640Z apicid : 45 2025-05-07T19:43:03.2761722Z initial apicid : 45 2025-05-07T19:43:03.2761802Z fpu : yes 2025-05-07T19:43:03.2761883Z fpu_exception : yes 2025-05-07T19:43:03.2761957Z cpuid level : 13 2025-05-07T19:43:03.2762036Z wp : yes 2025-05-07T19:43:03.2764883Z flags : fpu vme de pse tsc msr pae mce cx8 apic sep mtrr pge mca cmov pat pse36 clflush mmx fxsr sse sse2 ss ht syscall nx pdpe1gb rdtscp lm constant_tsc arch_perfmon rep_good nopl xtopology nonstop_tsc cpuid aperfmperf tsc_known_freq pni pclmulqdq monitor ssse3 fma cx16 pcid sse4_1 sse4_2 x2apic movbe popcnt tsc_deadline_timer aes xsave avx f16c rdrand hypervisor lahf_lm abm 3dnowprefetch invpcid_single pti fsgsbase tsc_adjust bmi1 avx2 smep bmi2 erms invpcid mpx avx512f avx512dq rdseed adx smap clflushopt clwb avx512cd avx512bw avx512vl xsaveopt xsavec xgetbv1 xsaves ida arat pku ospke avx512_vnni 2025-05-07T19:43:03.2765250Z bugs : cpu_meltdown spectre_v1 spectre_v2 spec_store_bypass l1tf mds swapgs itlb_multihit mmio_stale_data retbleed gds bhi 2025-05-07T19:43:03.2765341Z bogomips : 5999.98 2025-05-07T19:43:03.2765421Z clflush size : 64 2025-05-07T19:43:03.2765504Z cache_alignment : 64 2025-05-07T19:43:03.2765624Z address sizes : 46 bits physical, 48 bits virtual 2025-05-07T19:43:03.2765723Z power management: 2025-05-07T19:43:03.2765727Z 2025-05-07T19:43:03.2765809Z processor : 71 2025-05-07T19:43:03.2765893Z vendor_id : GenuineIntel 2025-05-07T19:43:03.2765978Z cpu family : 6 2025-05-07T19:43:03.2766055Z model : 85 2025-05-07T19:43:03.2766199Z model name : Intel(R) Xeon(R) Platinum 8275CL CPU @ 3.00GHz 2025-05-07T19:43:03.2766271Z stepping : 7 2025-05-07T19:43:03.2766359Z microcode : 0x5003901 2025-05-07T19:43:03.2766437Z cpu MHz : 3245.635 2025-05-07T19:43:03.2766512Z cache size : 36608 KB 2025-05-07T19:43:03.2766598Z physical id : 0 2025-05-07T19:43:03.2766675Z siblings : 48 2025-05-07T19:43:03.2766754Z core id : 23 2025-05-07T19:43:03.2766831Z cpu cores : 24 2025-05-07T19:43:03.2766923Z apicid : 47 2025-05-07T19:43:03.2767004Z initial apicid : 47 2025-05-07T19:43:03.2767076Z fpu : yes 2025-05-07T19:43:03.2767166Z fpu_exception : yes 2025-05-07T19:43:03.2767248Z cpuid level : 13 2025-05-07T19:43:03.2767323Z wp : yes 2025-05-07T19:43:03.2769305Z flags : fpu vme de pse tsc msr pae mce cx8 apic sep mtrr pge mca cmov pat pse36 clflush mmx fxsr sse sse2 ss ht syscall nx pdpe1gb rdtscp lm constant_tsc arch_perfmon rep_good nopl xtopology nonstop_tsc cpuid aperfmperf tsc_known_freq pni pclmulqdq monitor ssse3 fma cx16 pcid sse4_1 sse4_2 x2apic movbe popcnt tsc_deadline_timer aes xsave avx f16c rdrand hypervisor lahf_lm abm 3dnowprefetch invpcid_single pti fsgsbase tsc_adjust bmi1 avx2 smep bmi2 erms invpcid mpx avx512f avx512dq rdseed adx smap clflushopt clwb avx512cd avx512bw avx512vl xsaveopt xsavec xgetbv1 xsaves ida arat pku ospke avx512_vnni 2025-05-07T19:43:03.2769655Z bugs : cpu_meltdown spectre_v1 spectre_v2 spec_store_bypass l1tf mds swapgs itlb_multihit mmio_stale_data retbleed gds bhi 2025-05-07T19:43:03.2769797Z bogomips : 5999.98 2025-05-07T19:43:03.2769888Z clflush size : 64 2025-05-07T19:43:03.2769967Z cache_alignment : 64 2025-05-07T19:43:03.2770096Z address sizes : 46 bits physical, 48 bits virtual 2025-05-07T19:43:03.2770251Z power management: 2025-05-07T19:43:03.2770256Z 2025-05-07T19:43:03.2770356Z processor : 72 2025-05-07T19:43:03.2770442Z vendor_id : GenuineIntel 2025-05-07T19:43:03.2770517Z cpu family : 6 2025-05-07T19:43:03.2770774Z model : 85 2025-05-07T19:43:03.2770935Z model name : Intel(R) Xeon(R) Platinum 8275CL CPU @ 3.00GHz 2025-05-07T19:43:03.2771020Z stepping : 7 2025-05-07T19:43:03.2771104Z microcode : 0x5003901 2025-05-07T19:43:03.2771206Z cpu MHz : 1199.772 2025-05-07T19:43:03.2771288Z cache size : 36608 KB 2025-05-07T19:43:03.2771369Z physical id : 1 2025-05-07T19:43:03.2771466Z siblings : 48 2025-05-07T19:43:03.2771573Z core id : 0 2025-05-07T19:43:03.2771657Z cpu cores : 24 2025-05-07T19:43:03.2771736Z apicid : 65 2025-05-07T19:43:03.2771845Z initial apicid : 65 2025-05-07T19:43:03.2771921Z fpu : yes 2025-05-07T19:43:03.2772001Z fpu_exception : yes 2025-05-07T19:43:03.2772103Z cpuid level : 13 2025-05-07T19:43:03.2772182Z wp : yes 2025-05-07T19:43:03.2774319Z flags : fpu vme de pse tsc msr pae mce cx8 apic sep mtrr pge mca cmov pat pse36 clflush mmx fxsr sse sse2 ss ht syscall nx pdpe1gb rdtscp lm constant_tsc arch_perfmon rep_good nopl xtopology nonstop_tsc cpuid aperfmperf tsc_known_freq pni pclmulqdq monitor ssse3 fma cx16 pcid sse4_1 sse4_2 x2apic movbe popcnt tsc_deadline_timer aes xsave avx f16c rdrand hypervisor lahf_lm abm 3dnowprefetch invpcid_single pti fsgsbase tsc_adjust bmi1 avx2 smep bmi2 erms invpcid mpx avx512f avx512dq rdseed adx smap clflushopt clwb avx512cd avx512bw avx512vl xsaveopt xsavec xgetbv1 xsaves ida arat pku ospke avx512_vnni 2025-05-07T19:43:03.2774770Z bugs : cpu_meltdown spectre_v1 spectre_v2 spec_store_bypass l1tf mds swapgs itlb_multihit mmio_stale_data retbleed gds bhi 2025-05-07T19:43:03.2774853Z bogomips : 5999.98 2025-05-07T19:43:03.2774927Z clflush size : 64 2025-05-07T19:43:03.2775020Z cache_alignment : 64 2025-05-07T19:43:03.2775144Z address sizes : 46 bits physical, 48 bits virtual 2025-05-07T19:43:03.2775229Z power management: 2025-05-07T19:43:03.2775234Z 2025-05-07T19:43:03.2775311Z processor : 73 2025-05-07T19:43:03.2775404Z vendor_id : GenuineIntel 2025-05-07T19:43:03.2775478Z cpu family : 6 2025-05-07T19:43:03.2775549Z model : 85 2025-05-07T19:43:03.2775713Z model name : Intel(R) Xeon(R) Platinum 8275CL CPU @ 3.00GHz 2025-05-07T19:43:03.2775789Z stepping : 7 2025-05-07T19:43:03.2775874Z microcode : 0x5003901 2025-05-07T19:43:03.2775951Z cpu MHz : 2999.992 2025-05-07T19:43:03.2776058Z cache size : 36608 KB 2025-05-07T19:43:03.2776142Z physical id : 1 2025-05-07T19:43:03.2776225Z siblings : 48 2025-05-07T19:43:03.2776319Z core id : 1 2025-05-07T19:43:03.2776399Z cpu cores : 24 2025-05-07T19:43:03.2776489Z apicid : 67 2025-05-07T19:43:03.2776569Z initial apicid : 67 2025-05-07T19:43:03.2776675Z fpu : yes 2025-05-07T19:43:03.2776757Z fpu_exception : yes 2025-05-07T19:43:03.2776837Z cpuid level : 13 2025-05-07T19:43:03.2776910Z wp : yes 2025-05-07T19:43:03.2779060Z flags : fpu vme de pse tsc msr pae mce cx8 apic sep mtrr pge mca cmov pat pse36 clflush mmx fxsr sse sse2 ss ht syscall nx pdpe1gb rdtscp lm constant_tsc arch_perfmon rep_good nopl xtopology nonstop_tsc cpuid aperfmperf tsc_known_freq pni pclmulqdq monitor ssse3 fma cx16 pcid sse4_1 sse4_2 x2apic movbe popcnt tsc_deadline_timer aes xsave avx f16c rdrand hypervisor lahf_lm abm 3dnowprefetch invpcid_single pti fsgsbase tsc_adjust bmi1 avx2 smep bmi2 erms invpcid mpx avx512f avx512dq rdseed adx smap clflushopt clwb avx512cd avx512bw avx512vl xsaveopt xsavec xgetbv1 xsaves ida arat pku ospke avx512_vnni 2025-05-07T19:43:03.2779444Z bugs : cpu_meltdown spectre_v1 spectre_v2 spec_store_bypass l1tf mds swapgs itlb_multihit mmio_stale_data retbleed gds bhi 2025-05-07T19:43:03.2779585Z bogomips : 5999.98 2025-05-07T19:43:03.2779663Z clflush size : 64 2025-05-07T19:43:03.2779744Z cache_alignment : 64 2025-05-07T19:43:03.2779868Z address sizes : 46 bits physical, 48 bits virtual 2025-05-07T19:43:03.2779978Z power management: 2025-05-07T19:43:03.2779982Z 2025-05-07T19:43:03.2780071Z processor : 74 2025-05-07T19:43:03.2780159Z vendor_id : GenuineIntel 2025-05-07T19:43:03.2780256Z cpu family : 6 2025-05-07T19:43:03.2780337Z model : 85 2025-05-07T19:43:03.2780493Z model name : Intel(R) Xeon(R) Platinum 8275CL CPU @ 3.00GHz 2025-05-07T19:43:03.2780590Z stepping : 7 2025-05-07T19:43:03.2780674Z microcode : 0x5003901 2025-05-07T19:43:03.2780763Z cpu MHz : 2999.992 2025-05-07T19:43:03.2780846Z cache size : 36608 KB 2025-05-07T19:43:03.2780941Z physical id : 1 2025-05-07T19:43:03.2781019Z siblings : 48 2025-05-07T19:43:03.2781095Z core id : 2 2025-05-07T19:43:03.2781171Z cpu cores : 24 2025-05-07T19:43:03.2781265Z apicid : 69 2025-05-07T19:43:03.2781355Z initial apicid : 69 2025-05-07T19:43:03.2781435Z fpu : yes 2025-05-07T19:43:03.2781534Z fpu_exception : yes 2025-05-07T19:43:03.2781617Z cpuid level : 13 2025-05-07T19:43:03.2781696Z wp : yes 2025-05-07T19:43:03.2783957Z flags : fpu vme de pse tsc msr pae mce cx8 apic sep mtrr pge mca cmov pat pse36 clflush mmx fxsr sse sse2 ss ht syscall nx pdpe1gb rdtscp lm constant_tsc arch_perfmon rep_good nopl xtopology nonstop_tsc cpuid aperfmperf tsc_known_freq pni pclmulqdq monitor ssse3 fma cx16 pcid sse4_1 sse4_2 x2apic movbe popcnt tsc_deadline_timer aes xsave avx f16c rdrand hypervisor lahf_lm abm 3dnowprefetch invpcid_single pti fsgsbase tsc_adjust bmi1 avx2 smep bmi2 erms invpcid mpx avx512f avx512dq rdseed adx smap clflushopt clwb avx512cd avx512bw avx512vl xsaveopt xsavec xgetbv1 xsaves ida arat pku ospke avx512_vnni 2025-05-07T19:43:03.2784384Z bugs : cpu_meltdown spectre_v1 spectre_v2 spec_store_bypass l1tf mds swapgs itlb_multihit mmio_stale_data retbleed gds bhi 2025-05-07T19:43:03.2784471Z bogomips : 5999.98 2025-05-07T19:43:03.2784567Z clflush size : 64 2025-05-07T19:43:03.2784650Z cache_alignment : 64 2025-05-07T19:43:03.2784776Z address sizes : 46 bits physical, 48 bits virtual 2025-05-07T19:43:03.2784858Z power management: 2025-05-07T19:43:03.2784873Z 2025-05-07T19:43:03.2784957Z processor : 75 2025-05-07T19:43:03.2785052Z vendor_id : GenuineIntel 2025-05-07T19:43:03.2785128Z cpu family : 6 2025-05-07T19:43:03.2785216Z model : 85 2025-05-07T19:43:03.2785368Z model name : Intel(R) Xeon(R) Platinum 8275CL CPU @ 3.00GHz 2025-05-07T19:43:03.2785444Z stepping : 7 2025-05-07T19:43:03.2785521Z microcode : 0x5003901 2025-05-07T19:43:03.2785616Z cpu MHz : 1200.002 2025-05-07T19:43:03.2785701Z cache size : 36608 KB 2025-05-07T19:43:03.2785778Z physical id : 1 2025-05-07T19:43:03.2785870Z siblings : 48 2025-05-07T19:43:03.2785938Z core id : 3 2025-05-07T19:43:03.2786011Z cpu cores : 24 2025-05-07T19:43:03.2786085Z apicid : 71 2025-05-07T19:43:03.2786187Z initial apicid : 71 2025-05-07T19:43:03.2786265Z fpu : yes 2025-05-07T19:43:03.2786348Z fpu_exception : yes 2025-05-07T19:43:03.2786441Z cpuid level : 13 2025-05-07T19:43:03.2786514Z wp : yes 2025-05-07T19:43:03.2788596Z flags : fpu vme de pse tsc msr pae mce cx8 apic sep mtrr pge mca cmov pat pse36 clflush mmx fxsr sse sse2 ss ht syscall nx pdpe1gb rdtscp lm constant_tsc arch_perfmon rep_good nopl xtopology nonstop_tsc cpuid aperfmperf tsc_known_freq pni pclmulqdq monitor ssse3 fma cx16 pcid sse4_1 sse4_2 x2apic movbe popcnt tsc_deadline_timer aes xsave avx f16c rdrand hypervisor lahf_lm abm 3dnowprefetch invpcid_single pti fsgsbase tsc_adjust bmi1 avx2 smep bmi2 erms invpcid mpx avx512f avx512dq rdseed adx smap clflushopt clwb avx512cd avx512bw avx512vl xsaveopt xsavec xgetbv1 xsaves ida arat pku ospke avx512_vnni 2025-05-07T19:43:03.2788985Z bugs : cpu_meltdown spectre_v1 spectre_v2 spec_store_bypass l1tf mds swapgs itlb_multihit mmio_stale_data retbleed gds bhi 2025-05-07T19:43:03.2789074Z bogomips : 5999.98 2025-05-07T19:43:03.2789149Z clflush size : 64 2025-05-07T19:43:03.2789287Z cache_alignment : 64 2025-05-07T19:43:03.2789406Z address sizes : 46 bits physical, 48 bits virtual 2025-05-07T19:43:03.2789484Z power management: 2025-05-07T19:43:03.2789489Z 2025-05-07T19:43:03.2789568Z processor : 76 2025-05-07T19:43:03.2789674Z vendor_id : GenuineIntel 2025-05-07T19:43:03.2789753Z cpu family : 6 2025-05-07T19:43:03.2789825Z model : 85 2025-05-07T19:43:03.2789998Z model name : Intel(R) Xeon(R) Platinum 8275CL CPU @ 3.00GHz 2025-05-07T19:43:03.2790073Z stepping : 7 2025-05-07T19:43:03.2790156Z microcode : 0x5003901 2025-05-07T19:43:03.2790237Z cpu MHz : 1200.682 2025-05-07T19:43:03.2790336Z cache size : 36608 KB 2025-05-07T19:43:03.2790415Z physical id : 1 2025-05-07T19:43:03.2790490Z siblings : 48 2025-05-07T19:43:03.2790590Z core id : 4 2025-05-07T19:43:03.2790664Z cpu cores : 24 2025-05-07T19:43:03.2790739Z apicid : 73 2025-05-07T19:43:03.2790821Z initial apicid : 73 2025-05-07T19:43:03.2790919Z fpu : yes 2025-05-07T19:43:03.2791006Z fpu_exception : yes 2025-05-07T19:43:03.2791086Z cpuid level : 13 2025-05-07T19:43:03.2791177Z wp : yes 2025-05-07T19:43:03.2793244Z flags : fpu vme de pse tsc msr pae mce cx8 apic sep mtrr pge mca cmov pat pse36 clflush mmx fxsr sse sse2 ss ht syscall nx pdpe1gb rdtscp lm constant_tsc arch_perfmon rep_good nopl xtopology nonstop_tsc cpuid aperfmperf tsc_known_freq pni pclmulqdq monitor ssse3 fma cx16 pcid sse4_1 sse4_2 x2apic movbe popcnt tsc_deadline_timer aes xsave avx f16c rdrand hypervisor lahf_lm abm 3dnowprefetch invpcid_single pti fsgsbase tsc_adjust bmi1 avx2 smep bmi2 erms invpcid mpx avx512f avx512dq rdseed adx smap clflushopt clwb avx512cd avx512bw avx512vl xsaveopt xsavec xgetbv1 xsaves ida arat pku ospke avx512_vnni 2025-05-07T19:43:03.2793623Z bugs : cpu_meltdown spectre_v1 spectre_v2 spec_store_bypass l1tf mds swapgs itlb_multihit mmio_stale_data retbleed gds bhi 2025-05-07T19:43:03.2793721Z bogomips : 5999.98 2025-05-07T19:43:03.2793860Z clflush size : 64 2025-05-07T19:43:03.2793940Z cache_alignment : 64 2025-05-07T19:43:03.2794087Z address sizes : 46 bits physical, 48 bits virtual 2025-05-07T19:43:03.2794165Z power management: 2025-05-07T19:43:03.2794170Z 2025-05-07T19:43:03.2794253Z processor : 77 2025-05-07T19:43:03.2794339Z vendor_id : GenuineIntel 2025-05-07T19:43:03.2794432Z cpu family : 6 2025-05-07T19:43:03.2794511Z model : 85 2025-05-07T19:43:03.2794664Z model name : Intel(R) Xeon(R) Platinum 8275CL CPU @ 3.00GHz 2025-05-07T19:43:03.2794874Z stepping : 7 2025-05-07T19:43:03.2794957Z microcode : 0x5003901 2025-05-07T19:43:03.2795032Z cpu MHz : 1206.300 2025-05-07T19:43:03.2795107Z cache size : 36608 KB 2025-05-07T19:43:03.2795204Z physical id : 1 2025-05-07T19:43:03.2795277Z siblings : 48 2025-05-07T19:43:03.2795345Z core id : 5 2025-05-07T19:43:03.2795439Z cpu cores : 24 2025-05-07T19:43:03.2795507Z apicid : 75 2025-05-07T19:43:03.2795591Z initial apicid : 75 2025-05-07T19:43:03.2795660Z fpu : yes 2025-05-07T19:43:03.2795757Z fpu_exception : yes 2025-05-07T19:43:03.2795835Z cpuid level : 13 2025-05-07T19:43:03.2795909Z wp : yes 2025-05-07T19:43:03.2798033Z flags : fpu vme de pse tsc msr pae mce cx8 apic sep mtrr pge mca cmov pat pse36 clflush mmx fxsr sse sse2 ss ht syscall nx pdpe1gb rdtscp lm constant_tsc arch_perfmon rep_good nopl xtopology nonstop_tsc cpuid aperfmperf tsc_known_freq pni pclmulqdq monitor ssse3 fma cx16 pcid sse4_1 sse4_2 x2apic movbe popcnt tsc_deadline_timer aes xsave avx f16c rdrand hypervisor lahf_lm abm 3dnowprefetch invpcid_single pti fsgsbase tsc_adjust bmi1 avx2 smep bmi2 erms invpcid mpx avx512f avx512dq rdseed adx smap clflushopt clwb avx512cd avx512bw avx512vl xsaveopt xsavec xgetbv1 xsaves ida arat pku ospke avx512_vnni 2025-05-07T19:43:03.2798384Z bugs : cpu_meltdown spectre_v1 spectre_v2 spec_store_bypass l1tf mds swapgs itlb_multihit mmio_stale_data retbleed gds bhi 2025-05-07T19:43:03.2798460Z bogomips : 5999.98 2025-05-07T19:43:03.2798547Z clflush size : 64 2025-05-07T19:43:03.2798632Z cache_alignment : 64 2025-05-07T19:43:03.2798748Z address sizes : 46 bits physical, 48 bits virtual 2025-05-07T19:43:03.2798893Z power management: 2025-05-07T19:43:03.2798897Z 2025-05-07T19:43:03.2798967Z processor : 78 2025-05-07T19:43:03.2799051Z vendor_id : GenuineIntel 2025-05-07T19:43:03.2799123Z cpu family : 6 2025-05-07T19:43:03.2799216Z model : 85 2025-05-07T19:43:03.2799362Z model name : Intel(R) Xeon(R) Platinum 8275CL CPU @ 3.00GHz 2025-05-07T19:43:03.2799432Z stepping : 7 2025-05-07T19:43:03.2799521Z microcode : 0x5003901 2025-05-07T19:43:03.2799599Z cpu MHz : 1200.805 2025-05-07T19:43:03.2799677Z cache size : 36608 KB 2025-05-07T19:43:03.2799752Z physical id : 1 2025-05-07T19:43:03.2799836Z siblings : 48 2025-05-07T19:43:03.2799907Z core id : 6 2025-05-07T19:43:03.2799979Z cpu cores : 24 2025-05-07T19:43:03.2800069Z apicid : 77 2025-05-07T19:43:03.2800147Z initial apicid : 77 2025-05-07T19:43:03.2800223Z fpu : yes 2025-05-07T19:43:03.2800301Z fpu_exception : yes 2025-05-07T19:43:03.2800391Z cpuid level : 13 2025-05-07T19:43:03.2800466Z wp : yes 2025-05-07T19:43:03.2802430Z flags : fpu vme de pse tsc msr pae mce cx8 apic sep mtrr pge mca cmov pat pse36 clflush mmx fxsr sse sse2 ss ht syscall nx pdpe1gb rdtscp lm constant_tsc arch_perfmon rep_good nopl xtopology nonstop_tsc cpuid aperfmperf tsc_known_freq pni pclmulqdq monitor ssse3 fma cx16 pcid sse4_1 sse4_2 x2apic movbe popcnt tsc_deadline_timer aes xsave avx f16c rdrand hypervisor lahf_lm abm 3dnowprefetch invpcid_single pti fsgsbase tsc_adjust bmi1 avx2 smep bmi2 erms invpcid mpx avx512f avx512dq rdseed adx smap clflushopt clwb avx512cd avx512bw avx512vl xsaveopt xsavec xgetbv1 xsaves ida arat pku ospke avx512_vnni 2025-05-07T19:43:03.2802798Z bugs : cpu_meltdown spectre_v1 spectre_v2 spec_store_bypass l1tf mds swapgs itlb_multihit mmio_stale_data retbleed gds bhi 2025-05-07T19:43:03.2802876Z bogomips : 5999.98 2025-05-07T19:43:03.2802952Z clflush size : 64 2025-05-07T19:43:03.2803033Z cache_alignment : 64 2025-05-07T19:43:03.2803194Z address sizes : 46 bits physical, 48 bits virtual 2025-05-07T19:43:03.2803270Z power management: 2025-05-07T19:43:03.2803278Z 2025-05-07T19:43:03.2803358Z processor : 79 2025-05-07T19:43:03.2803439Z vendor_id : GenuineIntel 2025-05-07T19:43:03.2803514Z cpu family : 6 2025-05-07T19:43:03.2803585Z model : 85 2025-05-07T19:43:03.2803744Z model name : Intel(R) Xeon(R) Platinum 8275CL CPU @ 3.00GHz 2025-05-07T19:43:03.2803819Z stepping : 7 2025-05-07T19:43:03.2803894Z microcode : 0x5003901 2025-05-07T19:43:03.2803976Z cpu MHz : 1201.007 2025-05-07T19:43:03.2804053Z cache size : 36608 KB 2025-05-07T19:43:03.2804130Z physical id : 1 2025-05-07T19:43:03.2804203Z siblings : 48 2025-05-07T19:43:03.2804284Z core id : 7 2025-05-07T19:43:03.2804357Z cpu cores : 24 2025-05-07T19:43:03.2804428Z apicid : 79 2025-05-07T19:43:03.2804506Z initial apicid : 79 2025-05-07T19:43:03.2804585Z fpu : yes 2025-05-07T19:43:03.2804665Z fpu_exception : yes 2025-05-07T19:43:03.2804740Z cpuid level : 13 2025-05-07T19:43:03.2804827Z wp : yes 2025-05-07T19:43:03.2807057Z flags : fpu vme de pse tsc msr pae mce cx8 apic sep mtrr pge mca cmov pat pse36 clflush mmx fxsr sse sse2 ss ht syscall nx pdpe1gb rdtscp lm constant_tsc arch_perfmon rep_good nopl xtopology nonstop_tsc cpuid aperfmperf tsc_known_freq pni pclmulqdq monitor ssse3 fma cx16 pcid sse4_1 sse4_2 x2apic movbe popcnt tsc_deadline_timer aes xsave avx f16c rdrand hypervisor lahf_lm abm 3dnowprefetch invpcid_single pti fsgsbase tsc_adjust bmi1 avx2 smep bmi2 erms invpcid mpx avx512f avx512dq rdseed adx smap clflushopt clwb avx512cd avx512bw avx512vl xsaveopt xsavec xgetbv1 xsaves ida arat pku ospke avx512_vnni 2025-05-07T19:43:03.2807428Z bugs : cpu_meltdown spectre_v1 spectre_v2 spec_store_bypass l1tf mds swapgs itlb_multihit mmio_stale_data retbleed gds bhi 2025-05-07T19:43:03.2807523Z bogomips : 5999.98 2025-05-07T19:43:03.2807598Z clflush size : 64 2025-05-07T19:43:03.2807678Z cache_alignment : 64 2025-05-07T19:43:03.2807807Z address sizes : 46 bits physical, 48 bits virtual 2025-05-07T19:43:03.2807895Z power management: 2025-05-07T19:43:03.2807899Z 2025-05-07T19:43:03.2808035Z processor : 80 2025-05-07T19:43:03.2808130Z vendor_id : GenuineIntel 2025-05-07T19:43:03.2808205Z cpu family : 6 2025-05-07T19:43:03.2808279Z model : 85 2025-05-07T19:43:03.2808430Z model name : Intel(R) Xeon(R) Platinum 8275CL CPU @ 3.00GHz 2025-05-07T19:43:03.2808527Z stepping : 7 2025-05-07T19:43:03.2808609Z microcode : 0x5003901 2025-05-07T19:43:03.2808686Z cpu MHz : 2999.992 2025-05-07T19:43:03.2808784Z cache size : 36608 KB 2025-05-07T19:43:03.2808862Z physical id : 1 2025-05-07T19:43:03.2808939Z siblings : 48 2025-05-07T19:43:03.2809009Z core id : 8 2025-05-07T19:43:03.2809097Z cpu cores : 24 2025-05-07T19:43:03.2809172Z apicid : 81 2025-05-07T19:43:03.2809247Z initial apicid : 81 2025-05-07T19:43:03.2809317Z fpu : yes 2025-05-07T19:43:03.2809405Z fpu_exception : yes 2025-05-07T19:43:03.2809478Z cpuid level : 13 2025-05-07T19:43:03.2809547Z wp : yes 2025-05-07T19:43:03.2811930Z flags : fpu vme de pse tsc msr pae mce cx8 apic sep mtrr pge mca cmov pat pse36 clflush mmx fxsr sse sse2 ss ht syscall nx pdpe1gb rdtscp lm constant_tsc arch_perfmon rep_good nopl xtopology nonstop_tsc cpuid aperfmperf tsc_known_freq pni pclmulqdq monitor ssse3 fma cx16 pcid sse4_1 sse4_2 x2apic movbe popcnt tsc_deadline_timer aes xsave avx f16c rdrand hypervisor lahf_lm abm 3dnowprefetch invpcid_single pti fsgsbase tsc_adjust bmi1 avx2 smep bmi2 erms invpcid mpx avx512f avx512dq rdseed adx smap clflushopt clwb avx512cd avx512bw avx512vl xsaveopt xsavec xgetbv1 xsaves ida arat pku ospke avx512_vnni 2025-05-07T19:43:03.2812315Z bugs : cpu_meltdown spectre_v1 spectre_v2 spec_store_bypass l1tf mds swapgs itlb_multihit mmio_stale_data retbleed gds bhi 2025-05-07T19:43:03.2812389Z bogomips : 5999.98 2025-05-07T19:43:03.2812471Z clflush size : 64 2025-05-07T19:43:03.2812550Z cache_alignment : 64 2025-05-07T19:43:03.2812674Z address sizes : 46 bits physical, 48 bits virtual 2025-05-07T19:43:03.2812759Z power management: 2025-05-07T19:43:03.2812821Z 2025-05-07T19:43:03.2812898Z processor : 81 2025-05-07T19:43:03.2812985Z vendor_id : GenuineIntel 2025-05-07T19:43:03.2813063Z cpu family : 6 2025-05-07T19:43:03.2813132Z model : 85 2025-05-07T19:43:03.2813287Z model name : Intel(R) Xeon(R) Platinum 8275CL CPU @ 3.00GHz 2025-05-07T19:43:03.2813361Z stepping : 7 2025-05-07T19:43:03.2813447Z microcode : 0x5003901 2025-05-07T19:43:03.2813534Z cpu MHz : 2999.992 2025-05-07T19:43:03.2813628Z cache size : 36608 KB 2025-05-07T19:43:03.2813727Z physical id : 1 2025-05-07T19:43:03.2813848Z siblings : 48 2025-05-07T19:43:03.2813936Z core id : 9 2025-05-07T19:43:03.2814026Z cpu cores : 24 2025-05-07T19:43:03.2814142Z apicid : 83 2025-05-07T19:43:03.2814240Z initial apicid : 83 2025-05-07T19:43:03.2814327Z fpu : yes 2025-05-07T19:43:03.2814421Z fpu_exception : yes 2025-05-07T19:43:03.2814535Z cpuid level : 13 2025-05-07T19:43:03.2814627Z wp : yes 2025-05-07T19:43:03.2816784Z flags : fpu vme de pse tsc msr pae mce cx8 apic sep mtrr pge mca cmov pat pse36 clflush mmx fxsr sse sse2 ss ht syscall nx pdpe1gb rdtscp lm constant_tsc arch_perfmon rep_good nopl xtopology nonstop_tsc cpuid aperfmperf tsc_known_freq pni pclmulqdq monitor ssse3 fma cx16 pcid sse4_1 sse4_2 x2apic movbe popcnt tsc_deadline_timer aes xsave avx f16c rdrand hypervisor lahf_lm abm 3dnowprefetch invpcid_single pti fsgsbase tsc_adjust bmi1 avx2 smep bmi2 erms invpcid mpx avx512f avx512dq rdseed adx smap clflushopt clwb avx512cd avx512bw avx512vl xsaveopt xsavec xgetbv1 xsaves ida arat pku ospke avx512_vnni 2025-05-07T19:43:03.2817206Z bugs : cpu_meltdown spectre_v1 spectre_v2 spec_store_bypass l1tf mds swapgs itlb_multihit mmio_stale_data retbleed gds bhi 2025-05-07T19:43:03.2817300Z bogomips : 5999.98 2025-05-07T19:43:03.2817392Z clflush size : 64 2025-05-07T19:43:03.2817514Z cache_alignment : 64 2025-05-07T19:43:03.2817656Z address sizes : 46 bits physical, 48 bits virtual 2025-05-07T19:43:03.2817752Z power management: 2025-05-07T19:43:03.2817756Z 2025-05-07T19:43:03.2817880Z processor : 82 2025-05-07T19:43:03.2817983Z vendor_id : GenuineIntel 2025-05-07T19:43:03.2818127Z cpu family : 6 2025-05-07T19:43:03.2818218Z model : 85 2025-05-07T19:43:03.2818411Z model name : Intel(R) Xeon(R) Platinum 8275CL CPU @ 3.00GHz 2025-05-07T19:43:03.2818504Z stepping : 7 2025-05-07T19:43:03.2818598Z microcode : 0x5003901 2025-05-07T19:43:03.2818709Z cpu MHz : 1199.246 2025-05-07T19:43:03.2818803Z cache size : 36608 KB 2025-05-07T19:43:03.2818896Z physical id : 1 2025-05-07T19:43:03.2818986Z siblings : 48 2025-05-07T19:43:03.2819098Z core id : 10 2025-05-07T19:43:03.2819184Z cpu cores : 24 2025-05-07T19:43:03.2819273Z apicid : 85 2025-05-07T19:43:03.2819383Z initial apicid : 85 2025-05-07T19:43:03.2819458Z fpu : yes 2025-05-07T19:43:03.2819542Z fpu_exception : yes 2025-05-07T19:43:03.2819623Z cpuid level : 13 2025-05-07T19:43:03.2819705Z wp : yes 2025-05-07T19:43:03.2821833Z flags : fpu vme de pse tsc msr pae mce cx8 apic sep mtrr pge mca cmov pat pse36 clflush mmx fxsr sse sse2 ss ht syscall nx pdpe1gb rdtscp lm constant_tsc arch_perfmon rep_good nopl xtopology nonstop_tsc cpuid aperfmperf tsc_known_freq pni pclmulqdq monitor ssse3 fma cx16 pcid sse4_1 sse4_2 x2apic movbe popcnt tsc_deadline_timer aes xsave avx f16c rdrand hypervisor lahf_lm abm 3dnowprefetch invpcid_single pti fsgsbase tsc_adjust bmi1 avx2 smep bmi2 erms invpcid mpx avx512f avx512dq rdseed adx smap clflushopt clwb avx512cd avx512bw avx512vl xsaveopt xsavec xgetbv1 xsaves ida arat pku ospke avx512_vnni 2025-05-07T19:43:03.2822244Z bugs : cpu_meltdown spectre_v1 spectre_v2 spec_store_bypass l1tf mds swapgs itlb_multihit mmio_stale_data retbleed gds bhi 2025-05-07T19:43:03.2822327Z bogomips : 5999.98 2025-05-07T19:43:03.2822415Z clflush size : 64 2025-05-07T19:43:03.2822504Z cache_alignment : 64 2025-05-07T19:43:03.2822654Z address sizes : 46 bits physical, 48 bits virtual 2025-05-07T19:43:03.2822743Z power management: 2025-05-07T19:43:03.2822748Z 2025-05-07T19:43:03.2822942Z processor : 83 2025-05-07T19:43:03.2823085Z vendor_id : GenuineIntel 2025-05-07T19:43:03.2823163Z cpu family : 6 2025-05-07T19:43:03.2823246Z model : 85 2025-05-07T19:43:03.2823400Z model name : Intel(R) Xeon(R) Platinum 8275CL CPU @ 3.00GHz 2025-05-07T19:43:03.2823491Z stepping : 7 2025-05-07T19:43:03.2823572Z microcode : 0x5003901 2025-05-07T19:43:03.2823652Z cpu MHz : 1200.955 2025-05-07T19:43:03.2823752Z cache size : 36608 KB 2025-05-07T19:43:03.2823831Z physical id : 1 2025-05-07T19:43:03.2823903Z siblings : 48 2025-05-07T19:43:03.2823977Z core id : 11 2025-05-07T19:43:03.2824072Z cpu cores : 24 2025-05-07T19:43:03.2824149Z apicid : 87 2025-05-07T19:43:03.2824226Z initial apicid : 87 2025-05-07T19:43:03.2824323Z fpu : yes 2025-05-07T19:43:03.2824401Z fpu_exception : yes 2025-05-07T19:43:03.2824482Z cpuid level : 13 2025-05-07T19:43:03.2824556Z wp : yes 2025-05-07T19:43:03.2826551Z flags : fpu vme de pse tsc msr pae mce cx8 apic sep mtrr pge mca cmov pat pse36 clflush mmx fxsr sse sse2 ss ht syscall nx pdpe1gb rdtscp lm constant_tsc arch_perfmon rep_good nopl xtopology nonstop_tsc cpuid aperfmperf tsc_known_freq pni pclmulqdq monitor ssse3 fma cx16 pcid sse4_1 sse4_2 x2apic movbe popcnt tsc_deadline_timer aes xsave avx f16c rdrand hypervisor lahf_lm abm 3dnowprefetch invpcid_single pti fsgsbase tsc_adjust bmi1 avx2 smep bmi2 erms invpcid mpx avx512f avx512dq rdseed adx smap clflushopt clwb avx512cd avx512bw avx512vl xsaveopt xsavec xgetbv1 xsaves ida arat pku ospke avx512_vnni 2025-05-07T19:43:03.2826911Z bugs : cpu_meltdown spectre_v1 spectre_v2 spec_store_bypass l1tf mds swapgs itlb_multihit mmio_stale_data retbleed gds bhi 2025-05-07T19:43:03.2827009Z bogomips : 5999.98 2025-05-07T19:43:03.2827086Z clflush size : 64 2025-05-07T19:43:03.2827168Z cache_alignment : 64 2025-05-07T19:43:03.2827295Z address sizes : 46 bits physical, 48 bits virtual 2025-05-07T19:43:03.2827397Z power management: 2025-05-07T19:43:03.2827400Z 2025-05-07T19:43:03.2827481Z processor : 84 2025-05-07T19:43:03.2827574Z vendor_id : GenuineIntel 2025-05-07T19:43:03.2827667Z cpu family : 6 2025-05-07T19:43:03.2827738Z model : 85 2025-05-07T19:43:03.2827887Z model name : Intel(R) Xeon(R) Platinum 8275CL CPU @ 3.00GHz 2025-05-07T19:43:03.2828018Z stepping : 7 2025-05-07T19:43:03.2828111Z microcode : 0x5003901 2025-05-07T19:43:03.2828193Z cpu MHz : 2999.992 2025-05-07T19:43:03.2828273Z cache size : 36608 KB 2025-05-07T19:43:03.2828368Z physical id : 1 2025-05-07T19:43:03.2828591Z siblings : 48 2025-05-07T19:43:03.2828666Z core id : 12 2025-05-07T19:43:03.2828742Z cpu cores : 24 2025-05-07T19:43:03.2829014Z apicid : 89 2025-05-07T19:43:03.2829106Z initial apicid : 89 2025-05-07T19:43:03.2829185Z fpu : yes 2025-05-07T19:43:03.2829294Z fpu_exception : yes 2025-05-07T19:43:03.2829463Z cpuid level : 13 2025-05-07T19:43:03.2829541Z wp : yes 2025-05-07T19:43:03.2831675Z flags : fpu vme de pse tsc msr pae mce cx8 apic sep mtrr pge mca cmov pat pse36 clflush mmx fxsr sse sse2 ss ht syscall nx pdpe1gb rdtscp lm constant_tsc arch_perfmon rep_good nopl xtopology nonstop_tsc cpuid aperfmperf tsc_known_freq pni pclmulqdq monitor ssse3 fma cx16 pcid sse4_1 sse4_2 x2apic movbe popcnt tsc_deadline_timer aes xsave avx f16c rdrand hypervisor lahf_lm abm 3dnowprefetch invpcid_single pti fsgsbase tsc_adjust bmi1 avx2 smep bmi2 erms invpcid mpx avx512f avx512dq rdseed adx smap clflushopt clwb avx512cd avx512bw avx512vl xsaveopt xsavec xgetbv1 xsaves ida arat pku ospke avx512_vnni 2025-05-07T19:43:03.2832074Z bugs : cpu_meltdown spectre_v1 spectre_v2 spec_store_bypass l1tf mds swapgs itlb_multihit mmio_stale_data retbleed gds bhi 2025-05-07T19:43:03.2832157Z bogomips : 5999.98 2025-05-07T19:43:03.2832247Z clflush size : 64 2025-05-07T19:43:03.2832334Z cache_alignment : 64 2025-05-07T19:43:03.2832470Z address sizes : 46 bits physical, 48 bits virtual 2025-05-07T19:43:03.2832559Z power management: 2025-05-07T19:43:03.2832564Z 2025-05-07T19:43:03.2832658Z processor : 85 2025-05-07T19:43:03.2832742Z vendor_id : GenuineIntel 2025-05-07T19:43:03.2832819Z cpu family : 6 2025-05-07T19:43:03.2832909Z model : 85 2025-05-07T19:43:03.2833162Z model name : Intel(R) Xeon(R) Platinum 8275CL CPU @ 3.00GHz 2025-05-07T19:43:03.2833248Z stepping : 7 2025-05-07T19:43:03.2833331Z microcode : 0x5003901 2025-05-07T19:43:03.2833419Z cpu MHz : 1200.037 2025-05-07T19:43:03.2833510Z cache size : 36608 KB 2025-05-07T19:43:03.2833593Z physical id : 1 2025-05-07T19:43:03.2833684Z siblings : 48 2025-05-07T19:43:03.2833763Z core id : 13 2025-05-07T19:43:03.2833840Z cpu cores : 24 2025-05-07T19:43:03.2833920Z apicid : 91 2025-05-07T19:43:03.2834026Z initial apicid : 91 2025-05-07T19:43:03.2834105Z fpu : yes 2025-05-07T19:43:03.2834192Z fpu_exception : yes 2025-05-07T19:43:03.2834273Z cpuid level : 13 2025-05-07T19:43:03.2834361Z wp : yes 2025-05-07T19:43:03.2836499Z flags : fpu vme de pse tsc msr pae mce cx8 apic sep mtrr pge mca cmov pat pse36 clflush mmx fxsr sse sse2 ss ht syscall nx pdpe1gb rdtscp lm constant_tsc arch_perfmon rep_good nopl xtopology nonstop_tsc cpuid aperfmperf tsc_known_freq pni pclmulqdq monitor ssse3 fma cx16 pcid sse4_1 sse4_2 x2apic movbe popcnt tsc_deadline_timer aes xsave avx f16c rdrand hypervisor lahf_lm abm 3dnowprefetch invpcid_single pti fsgsbase tsc_adjust bmi1 avx2 smep bmi2 erms invpcid mpx avx512f avx512dq rdseed adx smap clflushopt clwb avx512cd avx512bw avx512vl xsaveopt xsavec xgetbv1 xsaves ida arat pku ospke avx512_vnni 2025-05-07T19:43:03.2836901Z bugs : cpu_meltdown spectre_v1 spectre_v2 spec_store_bypass l1tf mds swapgs itlb_multihit mmio_stale_data retbleed gds bhi 2025-05-07T19:43:03.2836985Z bogomips : 5999.98 2025-05-07T19:43:03.2837059Z clflush size : 64 2025-05-07T19:43:03.2837143Z cache_alignment : 64 2025-05-07T19:43:03.2837270Z address sizes : 46 bits physical, 48 bits virtual 2025-05-07T19:43:03.2837351Z power management: 2025-05-07T19:43:03.2837355Z 2025-05-07T19:43:03.2837431Z processor : 86 2025-05-07T19:43:03.2837518Z vendor_id : GenuineIntel 2025-05-07T19:43:03.2837594Z cpu family : 6 2025-05-07T19:43:03.2837664Z model : 85 2025-05-07T19:43:03.2837827Z model name : Intel(R) Xeon(R) Platinum 8275CL CPU @ 3.00GHz 2025-05-07T19:43:03.2837901Z stepping : 7 2025-05-07T19:43:03.2838100Z microcode : 0x5003901 2025-05-07T19:43:03.2838175Z cpu MHz : 1199.892 2025-05-07T19:43:03.2838258Z cache size : 36608 KB 2025-05-07T19:43:03.2838339Z physical id : 1 2025-05-07T19:43:03.2838414Z siblings : 48 2025-05-07T19:43:03.2838493Z core id : 14 2025-05-07T19:43:03.2838573Z cpu cores : 24 2025-05-07T19:43:03.2838645Z apicid : 93 2025-05-07T19:43:03.2838723Z initial apicid : 93 2025-05-07T19:43:03.2838805Z fpu : yes 2025-05-07T19:43:03.2838885Z fpu_exception : yes 2025-05-07T19:43:03.2838961Z cpuid level : 13 2025-05-07T19:43:03.2839033Z wp : yes 2025-05-07T19:43:03.2841305Z flags : fpu vme de pse tsc msr pae mce cx8 apic sep mtrr pge mca cmov pat pse36 clflush mmx fxsr sse sse2 ss ht syscall nx pdpe1gb rdtscp lm constant_tsc arch_perfmon rep_good nopl xtopology nonstop_tsc cpuid aperfmperf tsc_known_freq pni pclmulqdq monitor ssse3 fma cx16 pcid sse4_1 sse4_2 x2apic movbe popcnt tsc_deadline_timer aes xsave avx f16c rdrand hypervisor lahf_lm abm 3dnowprefetch invpcid_single pti fsgsbase tsc_adjust bmi1 avx2 smep bmi2 erms invpcid mpx avx512f avx512dq rdseed adx smap clflushopt clwb avx512cd avx512bw avx512vl xsaveopt xsavec xgetbv1 xsaves ida arat pku ospke avx512_vnni 2025-05-07T19:43:03.2841780Z bugs : cpu_meltdown spectre_v1 spectre_v2 spec_store_bypass l1tf mds swapgs itlb_multihit mmio_stale_data retbleed gds bhi 2025-05-07T19:43:03.2841861Z bogomips : 5999.98 2025-05-07T19:43:03.2841934Z clflush size : 64 2025-05-07T19:43:03.2842009Z cache_alignment : 64 2025-05-07T19:43:03.2842127Z address sizes : 46 bits physical, 48 bits virtual 2025-05-07T19:43:03.2842209Z power management: 2025-05-07T19:43:03.2842213Z 2025-05-07T19:43:03.2842289Z processor : 87 2025-05-07T19:43:03.2842368Z vendor_id : GenuineIntel 2025-05-07T19:43:03.2842446Z cpu family : 6 2025-05-07T19:43:03.2842519Z model : 85 2025-05-07T19:43:03.2842665Z model name : Intel(R) Xeon(R) Platinum 8275CL CPU @ 3.00GHz 2025-05-07T19:43:03.2842802Z stepping : 7 2025-05-07T19:43:03.2842878Z microcode : 0x5003901 2025-05-07T19:43:03.2842953Z cpu MHz : 1199.692 2025-05-07T19:43:03.2843024Z cache size : 36608 KB 2025-05-07T19:43:03.2843103Z physical id : 1 2025-05-07T19:43:03.2843170Z siblings : 48 2025-05-07T19:43:03.2843238Z core id : 15 2025-05-07T19:43:03.2843309Z cpu cores : 24 2025-05-07T19:43:03.2843383Z apicid : 95 2025-05-07T19:43:03.2843457Z initial apicid : 95 2025-05-07T19:43:03.2843522Z fpu : yes 2025-05-07T19:43:03.2843608Z fpu_exception : yes 2025-05-07T19:43:03.2843682Z cpuid level : 13 2025-05-07T19:43:03.2843748Z wp : yes 2025-05-07T19:43:03.2845720Z flags : fpu vme de pse tsc msr pae mce cx8 apic sep mtrr pge mca cmov pat pse36 clflush mmx fxsr sse sse2 ss ht syscall nx pdpe1gb rdtscp lm constant_tsc arch_perfmon rep_good nopl xtopology nonstop_tsc cpuid aperfmperf tsc_known_freq pni pclmulqdq monitor ssse3 fma cx16 pcid sse4_1 sse4_2 x2apic movbe popcnt tsc_deadline_timer aes xsave avx f16c rdrand hypervisor lahf_lm abm 3dnowprefetch invpcid_single pti fsgsbase tsc_adjust bmi1 avx2 smep bmi2 erms invpcid mpx avx512f avx512dq rdseed adx smap clflushopt clwb avx512cd avx512bw avx512vl xsaveopt xsavec xgetbv1 xsaves ida arat pku ospke avx512_vnni 2025-05-07T19:43:03.2846074Z bugs : cpu_meltdown spectre_v1 spectre_v2 spec_store_bypass l1tf mds swapgs itlb_multihit mmio_stale_data retbleed gds bhi 2025-05-07T19:43:03.2846148Z bogomips : 5999.98 2025-05-07T19:43:03.2846231Z clflush size : 64 2025-05-07T19:43:03.2846305Z cache_alignment : 64 2025-05-07T19:43:03.2846419Z address sizes : 46 bits physical, 48 bits virtual 2025-05-07T19:43:03.2846492Z power management: 2025-05-07T19:43:03.2846503Z 2025-05-07T19:43:03.2846572Z processor : 88 2025-05-07T19:43:03.2846648Z vendor_id : GenuineIntel 2025-05-07T19:43:03.2846716Z cpu family : 6 2025-05-07T19:43:03.2846794Z model : 85 2025-05-07T19:43:03.2846933Z model name : Intel(R) Xeon(R) Platinum 8275CL CPU @ 3.00GHz 2025-05-07T19:43:03.2847002Z stepping : 7 2025-05-07T19:43:03.2847080Z microcode : 0x5003901 2025-05-07T19:43:03.2847164Z cpu MHz : 2999.992 2025-05-07T19:43:03.2847289Z cache size : 36608 KB 2025-05-07T19:43:03.2847362Z physical id : 1 2025-05-07T19:43:03.2847439Z siblings : 48 2025-05-07T19:43:03.2847506Z core id : 16 2025-05-07T19:43:03.2847572Z cpu cores : 24 2025-05-07T19:43:03.2847640Z apicid : 97 2025-05-07T19:43:03.2847719Z initial apicid : 97 2025-05-07T19:43:03.2847785Z fpu : yes 2025-05-07T19:43:03.2847857Z fpu_exception : yes 2025-05-07T19:43:03.2847937Z cpuid level : 13 2025-05-07T19:43:03.2848003Z wp : yes 2025-05-07T19:43:03.2849964Z flags : fpu vme de pse tsc msr pae mce cx8 apic sep mtrr pge mca cmov pat pse36 clflush mmx fxsr sse sse2 ss ht syscall nx pdpe1gb rdtscp lm constant_tsc arch_perfmon rep_good nopl xtopology nonstop_tsc cpuid aperfmperf tsc_known_freq pni pclmulqdq monitor ssse3 fma cx16 pcid sse4_1 sse4_2 x2apic movbe popcnt tsc_deadline_timer aes xsave avx f16c rdrand hypervisor lahf_lm abm 3dnowprefetch invpcid_single pti fsgsbase tsc_adjust bmi1 avx2 smep bmi2 erms invpcid mpx avx512f avx512dq rdseed adx smap clflushopt clwb avx512cd avx512bw avx512vl xsaveopt xsavec xgetbv1 xsaves ida arat pku ospke avx512_vnni 2025-05-07T19:43:03.2850406Z bugs : cpu_meltdown spectre_v1 spectre_v2 spec_store_bypass l1tf mds swapgs itlb_multihit mmio_stale_data retbleed gds bhi 2025-05-07T19:43:03.2850479Z bogomips : 5999.98 2025-05-07T19:43:03.2850716Z clflush size : 64 2025-05-07T19:43:03.2850810Z cache_alignment : 64 2025-05-07T19:43:03.2850933Z address sizes : 46 bits physical, 48 bits virtual 2025-05-07T19:43:03.2851013Z power management: 2025-05-07T19:43:03.2851018Z 2025-05-07T19:43:03.2851096Z processor : 89 2025-05-07T19:43:03.2851190Z vendor_id : GenuineIntel 2025-05-07T19:43:03.2851264Z cpu family : 6 2025-05-07T19:43:03.2851337Z model : 85 2025-05-07T19:43:03.2851496Z model name : Intel(R) Xeon(R) Platinum 8275CL CPU @ 3.00GHz 2025-05-07T19:43:03.2851571Z stepping : 7 2025-05-07T19:43:03.2851650Z microcode : 0x5003901 2025-05-07T19:43:03.2851776Z cpu MHz : 1200.228 2025-05-07T19:43:03.2851865Z cache size : 36608 KB 2025-05-07T19:43:03.2851949Z physical id : 1 2025-05-07T19:43:03.2852020Z siblings : 48 2025-05-07T19:43:03.2852095Z core id : 17 2025-05-07T19:43:03.2852170Z cpu cores : 24 2025-05-07T19:43:03.2852242Z apicid : 99 2025-05-07T19:43:03.2852316Z initial apicid : 99 2025-05-07T19:43:03.2852399Z fpu : yes 2025-05-07T19:43:03.2852485Z fpu_exception : yes 2025-05-07T19:43:03.2852562Z cpuid level : 13 2025-05-07T19:43:03.2852640Z wp : yes 2025-05-07T19:43:03.2854791Z flags : fpu vme de pse tsc msr pae mce cx8 apic sep mtrr pge mca cmov pat pse36 clflush mmx fxsr sse sse2 ss ht syscall nx pdpe1gb rdtscp lm constant_tsc arch_perfmon rep_good nopl xtopology nonstop_tsc cpuid aperfmperf tsc_known_freq pni pclmulqdq monitor ssse3 fma cx16 pcid sse4_1 sse4_2 x2apic movbe popcnt tsc_deadline_timer aes xsave avx f16c rdrand hypervisor lahf_lm abm 3dnowprefetch invpcid_single pti fsgsbase tsc_adjust bmi1 avx2 smep bmi2 erms invpcid mpx avx512f avx512dq rdseed adx smap clflushopt clwb avx512cd avx512bw avx512vl xsaveopt xsavec xgetbv1 xsaves ida arat pku ospke avx512_vnni 2025-05-07T19:43:03.2855176Z bugs : cpu_meltdown spectre_v1 spectre_v2 spec_store_bypass l1tf mds swapgs itlb_multihit mmio_stale_data retbleed gds bhi 2025-05-07T19:43:03.2855257Z bogomips : 5999.98 2025-05-07T19:43:03.2855332Z clflush size : 64 2025-05-07T19:43:03.2855410Z cache_alignment : 64 2025-05-07T19:43:03.2855539Z address sizes : 46 bits physical, 48 bits virtual 2025-05-07T19:43:03.2855620Z power management: 2025-05-07T19:43:03.2855624Z 2025-05-07T19:43:03.2855700Z processor : 90 2025-05-07T19:43:03.2855787Z vendor_id : GenuineIntel 2025-05-07T19:43:03.2855870Z cpu family : 6 2025-05-07T19:43:03.2855942Z model : 85 2025-05-07T19:43:03.2856094Z model name : Intel(R) Xeon(R) Platinum 8275CL CPU @ 3.00GHz 2025-05-07T19:43:03.2856174Z stepping : 7 2025-05-07T19:43:03.2856260Z microcode : 0x5003901 2025-05-07T19:43:03.2856337Z cpu MHz : 1199.245 2025-05-07T19:43:03.2856420Z cache size : 36608 KB 2025-05-07T19:43:03.2856506Z physical id : 1 2025-05-07T19:43:03.2856634Z siblings : 48 2025-05-07T19:43:03.2856705Z core id : 18 2025-05-07T19:43:03.2856789Z cpu cores : 24 2025-05-07T19:43:03.2856863Z apicid : 101 2025-05-07T19:43:03.2856944Z initial apicid : 101 2025-05-07T19:43:03.2857016Z fpu : yes 2025-05-07T19:43:03.2857106Z fpu_exception : yes 2025-05-07T19:43:03.2857179Z cpuid level : 13 2025-05-07T19:43:03.2857252Z wp : yes 2025-05-07T19:43:03.2859386Z flags : fpu vme de pse tsc msr pae mce cx8 apic sep mtrr pge mca cmov pat pse36 clflush mmx fxsr sse sse2 ss ht syscall nx pdpe1gb rdtscp lm constant_tsc arch_perfmon rep_good nopl xtopology nonstop_tsc cpuid aperfmperf tsc_known_freq pni pclmulqdq monitor ssse3 fma cx16 pcid sse4_1 sse4_2 x2apic movbe popcnt tsc_deadline_timer aes xsave avx f16c rdrand hypervisor lahf_lm abm 3dnowprefetch invpcid_single pti fsgsbase tsc_adjust bmi1 avx2 smep bmi2 erms invpcid mpx avx512f avx512dq rdseed adx smap clflushopt clwb avx512cd avx512bw avx512vl xsaveopt xsavec xgetbv1 xsaves ida arat pku ospke avx512_vnni 2025-05-07T19:43:03.2859776Z bugs : cpu_meltdown spectre_v1 spectre_v2 spec_store_bypass l1tf mds swapgs itlb_multihit mmio_stale_data retbleed gds bhi 2025-05-07T19:43:03.2859855Z bogomips : 5999.98 2025-05-07T19:43:03.2859939Z clflush size : 64 2025-05-07T19:43:03.2860024Z cache_alignment : 64 2025-05-07T19:43:03.2860150Z address sizes : 46 bits physical, 48 bits virtual 2025-05-07T19:43:03.2860239Z power management: 2025-05-07T19:43:03.2860243Z 2025-05-07T19:43:03.2860323Z processor : 91 2025-05-07T19:43:03.2860408Z vendor_id : GenuineIntel 2025-05-07T19:43:03.2860483Z cpu family : 6 2025-05-07T19:43:03.2860563Z model : 85 2025-05-07T19:43:03.2860718Z model name : Intel(R) Xeon(R) Platinum 8275CL CPU @ 3.00GHz 2025-05-07T19:43:03.2860793Z stepping : 7 2025-05-07T19:43:03.2860880Z microcode : 0x5003901 2025-05-07T19:43:03.2860953Z cpu MHz : 1199.743 2025-05-07T19:43:03.2861032Z cache size : 36608 KB 2025-05-07T19:43:03.2861158Z physical id : 1 2025-05-07T19:43:03.2861242Z siblings : 48 2025-05-07T19:43:03.2861319Z core id : 19 2025-05-07T19:43:03.2861393Z cpu cores : 24 2025-05-07T19:43:03.2861476Z apicid : 103 2025-05-07T19:43:03.2861556Z initial apicid : 103 2025-05-07T19:43:03.2861629Z fpu : yes 2025-05-07T19:43:03.2861709Z fpu_exception : yes 2025-05-07T19:43:03.2861799Z cpuid level : 13 2025-05-07T19:43:03.2861869Z wp : yes 2025-05-07T19:43:03.2864035Z flags : fpu vme de pse tsc msr pae mce cx8 apic sep mtrr pge mca cmov pat pse36 clflush mmx fxsr sse sse2 ss ht syscall nx pdpe1gb rdtscp lm constant_tsc arch_perfmon rep_good nopl xtopology nonstop_tsc cpuid aperfmperf tsc_known_freq pni pclmulqdq monitor ssse3 fma cx16 pcid sse4_1 sse4_2 x2apic movbe popcnt tsc_deadline_timer aes xsave avx f16c rdrand hypervisor lahf_lm abm 3dnowprefetch invpcid_single pti fsgsbase tsc_adjust bmi1 avx2 smep bmi2 erms invpcid mpx avx512f avx512dq rdseed adx smap clflushopt clwb avx512cd avx512bw avx512vl xsaveopt xsavec xgetbv1 xsaves ida arat pku ospke avx512_vnni 2025-05-07T19:43:03.2864393Z bugs : cpu_meltdown spectre_v1 spectre_v2 spec_store_bypass l1tf mds swapgs itlb_multihit mmio_stale_data retbleed gds bhi 2025-05-07T19:43:03.2864467Z bogomips : 5999.98 2025-05-07T19:43:03.2864542Z clflush size : 64 2025-05-07T19:43:03.2864625Z cache_alignment : 64 2025-05-07T19:43:03.2864737Z address sizes : 46 bits physical, 48 bits virtual 2025-05-07T19:43:03.2864810Z power management: 2025-05-07T19:43:03.2864814Z 2025-05-07T19:43:03.2864899Z processor : 92 2025-05-07T19:43:03.2864978Z vendor_id : GenuineIntel 2025-05-07T19:43:03.2865045Z cpu family : 6 2025-05-07T19:43:03.2865111Z model : 85 2025-05-07T19:43:03.2865256Z model name : Intel(R) Xeon(R) Platinum 8275CL CPU @ 3.00GHz 2025-05-07T19:43:03.2865327Z stepping : 7 2025-05-07T19:43:03.2865399Z microcode : 0x5003901 2025-05-07T19:43:03.2865474Z cpu MHz : 1200.084 2025-05-07T19:43:03.2865548Z cache size : 36608 KB 2025-05-07T19:43:03.2865620Z physical id : 1 2025-05-07T19:43:03.2865693Z siblings : 48 2025-05-07T19:43:03.2865772Z core id : 20 2025-05-07T19:43:03.2865839Z cpu cores : 24 2025-05-07T19:43:03.2865955Z apicid : 105 2025-05-07T19:43:03.2866037Z initial apicid : 105 2025-05-07T19:43:03.2866101Z fpu : yes 2025-05-07T19:43:03.2866174Z fpu_exception : yes 2025-05-07T19:43:03.2866242Z cpuid level : 13 2025-05-07T19:43:03.2866312Z wp : yes 2025-05-07T19:43:03.2868273Z flags : fpu vme de pse tsc msr pae mce cx8 apic sep mtrr pge mca cmov pat pse36 clflush mmx fxsr sse sse2 ss ht syscall nx pdpe1gb rdtscp lm constant_tsc arch_perfmon rep_good nopl xtopology nonstop_tsc cpuid aperfmperf tsc_known_freq pni pclmulqdq monitor ssse3 fma cx16 pcid sse4_1 sse4_2 x2apic movbe popcnt tsc_deadline_timer aes xsave avx f16c rdrand hypervisor lahf_lm abm 3dnowprefetch invpcid_single pti fsgsbase tsc_adjust bmi1 avx2 smep bmi2 erms invpcid mpx avx512f avx512dq rdseed adx smap clflushopt clwb avx512cd avx512bw avx512vl xsaveopt xsavec xgetbv1 xsaves ida arat pku ospke avx512_vnni 2025-05-07T19:43:03.2868628Z bugs : cpu_meltdown spectre_v1 spectre_v2 spec_store_bypass l1tf mds swapgs itlb_multihit mmio_stale_data retbleed gds bhi 2025-05-07T19:43:03.2868710Z bogomips : 5999.98 2025-05-07T19:43:03.2868777Z clflush size : 64 2025-05-07T19:43:03.2868852Z cache_alignment : 64 2025-05-07T19:43:03.2868977Z address sizes : 46 bits physical, 48 bits virtual 2025-05-07T19:43:03.2869056Z power management: 2025-05-07T19:43:03.2869060Z 2025-05-07T19:43:03.2869129Z processor : 93 2025-05-07T19:43:03.2869214Z vendor_id : GenuineIntel 2025-05-07T19:43:03.2869284Z cpu family : 6 2025-05-07T19:43:03.2869350Z model : 85 2025-05-07T19:43:03.2869495Z model name : Intel(R) Xeon(R) Platinum 8275CL CPU @ 3.00GHz 2025-05-07T19:43:03.2869575Z stepping : 7 2025-05-07T19:43:03.2869648Z microcode : 0x5003901 2025-05-07T19:43:03.2869720Z cpu MHz : 1199.258 2025-05-07T19:43:03.2869796Z cache size : 36608 KB 2025-05-07T19:43:03.2869869Z physical id : 1 2025-05-07T19:43:03.2869941Z siblings : 48 2025-05-07T19:43:03.2870010Z core id : 21 2025-05-07T19:43:03.2870144Z cpu cores : 24 2025-05-07T19:43:03.2870211Z apicid : 107 2025-05-07T19:43:03.2870294Z initial apicid : 107 2025-05-07T19:43:03.2870363Z fpu : yes 2025-05-07T19:43:03.2870452Z fpu_exception : yes 2025-05-07T19:43:03.2870526Z cpuid level : 13 2025-05-07T19:43:03.2870593Z wp : yes 2025-05-07T19:43:03.2872558Z flags : fpu vme de pse tsc msr pae mce cx8 apic sep mtrr pge mca cmov pat pse36 clflush mmx fxsr sse sse2 ss ht syscall nx pdpe1gb rdtscp lm constant_tsc arch_perfmon rep_good nopl xtopology nonstop_tsc cpuid aperfmperf tsc_known_freq pni pclmulqdq monitor ssse3 fma cx16 pcid sse4_1 sse4_2 x2apic movbe popcnt tsc_deadline_timer aes xsave avx f16c rdrand hypervisor lahf_lm abm 3dnowprefetch invpcid_single pti fsgsbase tsc_adjust bmi1 avx2 smep bmi2 erms invpcid mpx avx512f avx512dq rdseed adx smap clflushopt clwb avx512cd avx512bw avx512vl xsaveopt xsavec xgetbv1 xsaves ida arat pku ospke avx512_vnni 2025-05-07T19:43:03.2872912Z bugs : cpu_meltdown spectre_v1 spectre_v2 spec_store_bypass l1tf mds swapgs itlb_multihit mmio_stale_data retbleed gds bhi 2025-05-07T19:43:03.2872986Z bogomips : 5999.98 2025-05-07T19:43:03.2873070Z clflush size : 64 2025-05-07T19:43:03.2873142Z cache_alignment : 64 2025-05-07T19:43:03.2873258Z address sizes : 46 bits physical, 48 bits virtual 2025-05-07T19:43:03.2873339Z power management: 2025-05-07T19:43:03.2873343Z 2025-05-07T19:43:03.2873414Z processor : 94 2025-05-07T19:43:03.2873494Z vendor_id : GenuineIntel 2025-05-07T19:43:03.2873575Z cpu family : 6 2025-05-07T19:43:03.2873646Z model : 85 2025-05-07T19:43:03.2873786Z model name : Intel(R) Xeon(R) Platinum 8275CL CPU @ 3.00GHz 2025-05-07T19:43:03.2873856Z stepping : 7 2025-05-07T19:43:03.2873935Z microcode : 0x5003901 2025-05-07T19:43:03.2874006Z cpu MHz : 1199.170 2025-05-07T19:43:03.2874080Z cache size : 36608 KB 2025-05-07T19:43:03.2874150Z physical id : 1 2025-05-07T19:43:03.2874229Z siblings : 48 2025-05-07T19:43:03.2874297Z core id : 22 2025-05-07T19:43:03.2874370Z cpu cores : 24 2025-05-07T19:43:03.2874447Z apicid : 109 2025-05-07T19:43:03.2874521Z initial apicid : 109 2025-05-07T19:43:03.2874639Z fpu : yes 2025-05-07T19:43:03.2874714Z fpu_exception : yes 2025-05-07T19:43:03.2874797Z cpuid level : 13 2025-05-07T19:43:03.2874861Z wp : yes 2025-05-07T19:43:03.2876818Z flags : fpu vme de pse tsc msr pae mce cx8 apic sep mtrr pge mca cmov pat pse36 clflush mmx fxsr sse sse2 ss ht syscall nx pdpe1gb rdtscp lm constant_tsc arch_perfmon rep_good nopl xtopology nonstop_tsc cpuid aperfmperf tsc_known_freq pni pclmulqdq monitor ssse3 fma cx16 pcid sse4_1 sse4_2 x2apic movbe popcnt tsc_deadline_timer aes xsave avx f16c rdrand hypervisor lahf_lm abm 3dnowprefetch invpcid_single pti fsgsbase tsc_adjust bmi1 avx2 smep bmi2 erms invpcid mpx avx512f avx512dq rdseed adx smap clflushopt clwb avx512cd avx512bw avx512vl xsaveopt xsavec xgetbv1 xsaves ida arat pku ospke avx512_vnni 2025-05-07T19:43:03.2877180Z bugs : cpu_meltdown spectre_v1 spectre_v2 spec_store_bypass l1tf mds swapgs itlb_multihit mmio_stale_data retbleed gds bhi 2025-05-07T19:43:03.2877253Z bogomips : 5999.98 2025-05-07T19:43:03.2877327Z clflush size : 64 2025-05-07T19:43:03.2877406Z cache_alignment : 64 2025-05-07T19:43:03.2877521Z address sizes : 46 bits physical, 48 bits virtual 2025-05-07T19:43:03.2877594Z power management: 2025-05-07T19:43:03.2877598Z 2025-05-07T19:43:03.2877678Z processor : 95 2025-05-07T19:43:03.2877762Z vendor_id : GenuineIntel 2025-05-07T19:43:03.2877831Z cpu family : 6 2025-05-07T19:43:03.2877900Z model : 85 2025-05-07T19:43:03.2878049Z model name : Intel(R) Xeon(R) Platinum 8275CL CPU @ 3.00GHz 2025-05-07T19:43:03.2878128Z stepping : 7 2025-05-07T19:43:03.2878200Z microcode : 0x5003901 2025-05-07T19:43:03.2878279Z cpu MHz : 2999.992 2025-05-07T19:43:03.2878351Z cache size : 36608 KB 2025-05-07T19:43:03.2878424Z physical id : 1 2025-05-07T19:43:03.2878492Z siblings : 48 2025-05-07T19:43:03.2878575Z core id : 23 2025-05-07T19:43:03.2878646Z cpu cores : 24 2025-05-07T19:43:03.2878715Z apicid : 111 2025-05-07T19:43:03.2878874Z initial apicid : 111 2025-05-07T19:43:03.2878946Z fpu : yes 2025-05-07T19:43:03.2879030Z fpu_exception : yes 2025-05-07T19:43:03.2879103Z cpuid level : 13 2025-05-07T19:43:03.2879178Z wp : yes 2025-05-07T19:43:03.2881138Z flags : fpu vme de pse tsc msr pae mce cx8 apic sep mtrr pge mca cmov pat pse36 clflush mmx fxsr sse sse2 ss ht syscall nx pdpe1gb rdtscp lm constant_tsc arch_perfmon rep_good nopl xtopology nonstop_tsc cpuid aperfmperf tsc_known_freq pni pclmulqdq monitor ssse3 fma cx16 pcid sse4_1 sse4_2 x2apic movbe popcnt tsc_deadline_timer aes xsave avx f16c rdrand hypervisor lahf_lm abm 3dnowprefetch invpcid_single pti fsgsbase tsc_adjust bmi1 avx2 smep bmi2 erms invpcid mpx avx512f avx512dq rdseed adx smap clflushopt clwb avx512cd avx512bw avx512vl xsaveopt xsavec xgetbv1 xsaves ida arat pku ospke avx512_vnni 2025-05-07T19:43:03.2881500Z bugs : cpu_meltdown spectre_v1 spectre_v2 spec_store_bypass l1tf mds swapgs itlb_multihit mmio_stale_data retbleed gds bhi 2025-05-07T19:43:03.2881573Z bogomips : 5999.98 2025-05-07T19:43:03.2881642Z clflush size : 64 2025-05-07T19:43:03.2881721Z cache_alignment : 64 2025-05-07T19:43:03.2881847Z address sizes : 46 bits physical, 48 bits virtual 2025-05-07T19:43:03.2881919Z power management: 2025-05-07T19:43:03.2881923Z 2025-05-07T19:43:03.2881926Z 2025-05-07T19:43:03.2882028Z ################################################################################ 2025-05-07T19:43:03.2882127Z [INFO] Print PCI info ... 2025-05-07T19:43:03.2882200Z + lspci -v 2025-05-07T19:43:03.2882204Z 2025-05-07T19:43:03.2882373Z 00:00.0 Host bridge: Intel Corporation 440FX - 82441FX PMC [Natoma] 2025-05-07T19:43:03.2882480Z Subsystem: Amazon.com, Inc. Device 1237 2025-05-07T19:43:03.2882583Z Flags: bus master, medium devsel, latency 0 2025-05-07T19:43:03.2882587Z 2025-05-07T19:43:03.2882771Z 00:01.0 ISA bridge: Intel Corporation 82371SB PIIX3 ISA [Natoma/Triton II] 2025-05-07T19:43:03.2882851Z Physical Slot: 1 2025-05-07T19:43:03.2882957Z Flags: bus master, fast devsel, latency 0 2025-05-07T19:43:03.2882962Z 2025-05-07T19:43:03.2883195Z 00:01.3 Non-VGA unclassified device: Intel Corporation 82371AB/EB/MB PIIX4 ACPI (rev 08) 2025-05-07T19:43:03.2883863Z Physical Slot: 1 2025-05-07T19:43:03.2883983Z Flags: bus master, fast devsel, latency 0, IRQ 9 2025-05-07T19:43:03.2883987Z 2025-05-07T19:43:03.2884233Z 00:03.0 VGA compatible controller: Amazon.com, Inc. Device 1111 (prog-if 00 [VGA controller]) 2025-05-07T19:43:03.2884305Z Physical Slot: 3 2025-05-07T19:43:03.2884415Z Flags: bus master, fast devsel, latency 0 2025-05-07T19:43:03.2884537Z Memory at c0000000 (32-bit, prefetchable) [size=4M] 2025-05-07T19:43:03.2884648Z Expansion ROM at 000c0000 [disabled] [size=128K] 2025-05-07T19:43:03.2884652Z 2025-05-07T19:43:03.2884952Z 00:04.0 Non-Volatile memory controller: Amazon.com, Inc. NVMe EBS Controller (prog-if 02 [NVM Express]) 2025-05-07T19:43:03.2885049Z Subsystem: Amazon.com, Inc. Device 0000 2025-05-07T19:43:03.2885118Z Physical Slot: 4 2025-05-07T19:43:03.2885243Z Flags: bus master, fast devsel, latency 0, IRQ 11 2025-05-07T19:43:03.2885389Z Memory at c0514000 (32-bit, non-prefetchable) [size=16K] 2025-05-07T19:43:03.2885476Z Capabilities: 2025-05-07T19:43:03.2885559Z Kernel driver in use: nvme 2025-05-07T19:43:03.2885563Z 2025-05-07T19:43:03.2885769Z 00:05.0 Ethernet controller: Amazon.com, Inc. Elastic Network Adapter (ENA) 2025-05-07T19:43:03.2885840Z Physical Slot: 5 2025-05-07T19:43:03.2885941Z Flags: bus master, fast devsel, latency 0 2025-05-07T19:43:03.2886089Z Memory at c0510000 (32-bit, non-prefetchable) [size=16K] 2025-05-07T19:43:03.2886211Z Memory at c0400000 (32-bit, prefetchable) [size=1M] 2025-05-07T19:43:03.2886344Z Memory at c0500000 (32-bit, non-prefetchable) [size=64K] 2025-05-07T19:43:03.2886441Z Capabilities: 2025-05-07T19:43:03.2886523Z Kernel driver in use: ena 2025-05-07T19:43:03.2886527Z 2025-05-07T19:43:03.2886531Z 2025-05-07T19:43:03.2886678Z ################################################################################ 2025-05-07T19:43:03.2886775Z [INFO] Print Linux distribution info ... 2025-05-07T19:43:03.2886857Z + uname -a 2025-05-07T19:43:03.2886861Z 2025-05-07T19:43:03.2887219Z Linux 180e7cabfdf5 6.1.130-139.222.amzn2023.x86_64 #1 SMP PREEMPT_DYNAMIC Tue Mar 11 01:10:58 UTC 2025 x86_64 x86_64 x86_64 GNU/Linux 2025-05-07T19:43:03.2887223Z 2025-05-07T19:43:03.2887293Z + uname -m 2025-05-07T19:43:03.2887297Z 2025-05-07T19:43:03.2887376Z x86_64 2025-05-07T19:43:03.2887379Z 2025-05-07T19:43:03.2887454Z + cat /proc/version 2025-05-07T19:43:03.2887458Z 2025-05-07T19:43:03.2887996Z Linux version 6.1.130-139.222.amzn2023.x86_64 (mockbuild@ip-10-0-55-76) (gcc (GCC) 11.5.0 20240719 (Red Hat 11.5.0-5), GNU ld version 2.39-6.amzn2023.0.11) #1 SMP PREEMPT_DYNAMIC Tue Mar 11 01:10:58 UTC 2025 2025-05-07T19:43:03.2888009Z 2025-05-07T19:43:03.2888087Z + cat /etc/os-release 2025-05-07T19:43:03.2888091Z 2025-05-07T19:43:03.2888165Z NAME="Amazon Linux" 2025-05-07T19:43:03.2888237Z VERSION="2023" 2025-05-07T19:43:03.2888315Z ID="amzn" 2025-05-07T19:43:03.2888384Z ID_LIKE="fedora" 2025-05-07T19:43:03.2888458Z VERSION_ID="2023" 2025-05-07T19:43:03.2888555Z PLATFORM_ID="platform:al2023" 2025-05-07T19:43:03.2888651Z PRETTY_NAME="Amazon Linux 2023.7.20250428" 2025-05-07T19:43:03.2888722Z ANSI_COLOR="0;33" 2025-05-07T19:43:03.2888832Z CPE_NAME="cpe:2.3:o:amazon:amazon_linux:2023" 2025-05-07T19:43:03.2889008Z HOME_URL="https://aws.amazon.com/linux/amazon-linux-2023/" 2025-05-07T19:43:03.2889161Z DOCUMENTATION_URL="https://docs.aws.amazon.com/linux/" 2025-05-07T19:43:03.2889303Z SUPPORT_URL="https://aws.amazon.com/premiumsupport/" 2025-05-07T19:43:03.2889487Z BUG_REPORT_URL="https://github.com/amazonlinux/amazon-linux-2023" 2025-05-07T19:43:03.2889560Z VENDOR_NAME="AWS" 2025-05-07T19:43:03.2889657Z VENDOR_URL="https://aws.amazon.com/" 2025-05-07T19:43:03.2889740Z SUPPORT_END="2029-06-30" 2025-05-07T19:43:03.2889752Z 2025-05-07T19:43:03.2927733Z ##[group]Run . $PRELUDE; print_gpu_info 2025-05-07T19:43:03.2927876Z . $PRELUDE; print_gpu_info 2025-05-07T19:43:03.2928178Z shell: bash --noprofile --norc -e -o pipefail {0} 2025-05-07T19:43:03.2928358Z env: 2025-05-07T19:43:03.2928643Z PRELUDE: .github/scripts/setup_env.bash 2025-05-07T19:43:03.2928725Z BUILD_ENV: build_binary 2025-05-07T19:43:03.2928993Z BUILD_TARGET: default 2025-05-07T19:43:03.2929073Z BUILD_VARIANT: cuda 2025-05-07T19:43:03.2929163Z BUILD_CUDA_VERSION: 12.6.3 2025-05-07T19:43:03.2929248Z ##[endgroup] 2025-05-07T19:43:03.7539069Z ################################################################################ 2025-05-07T19:43:03.7540786Z [INFO] Printing general display info ... 2025-05-07T19:43:03.7556495Z [EXEC] [ATTEMPT 0/3] + wget -q --timeout 1 pypi.org -O /dev/null 2025-05-07T19:43:03.8435761Z [CHECK] Network does not appear to be blocked. 2025-05-07T19:43:03.8446003Z /usr/bin/sudo 2025-05-07T19:43:03.8455792Z which: no apt-get in (/usr/local/sbin:/usr/local/bin:/usr/sbin:/usr/bin:/sbin:/bin) 2025-05-07T19:43:03.8466060Z /usr/bin/yum 2025-05-07T19:43:03.8466771Z [INSTALL] Updating system repositories ... 2025-05-07T19:43:03.8492288Z [EXEC] [ATTEMPT 0/3] + sudo yum update -y 2025-05-07T19:43:04.0702500Z Last metadata expiration check: 0:00:18 ago on Wed May 7 19:42:46 2025. 2025-05-07T19:43:04.1653533Z Dependencies resolved. 2025-05-07T19:43:04.1869728Z Nothing to do. 2025-05-07T19:43:04.1870460Z Complete! 2025-05-07T19:43:04.2523041Z [INSTALL] Installing system package(s): hostname lshw ... 2025-05-07T19:43:04.2552726Z [EXEC] [ATTEMPT 0/3] + sudo yum install -y hostname lshw 2025-05-07T19:43:04.4792002Z Last metadata expiration check: 0:00:18 ago on Wed May 7 19:42:46 2025. 2025-05-07T19:43:04.5314414Z Dependencies resolved. 2025-05-07T19:43:04.5481293Z ================================================================================ 2025-05-07T19:43:04.5482256Z Package Arch Version Repository Size 2025-05-07T19:43:04.5482752Z ================================================================================ 2025-05-07T19:43:04.5483076Z Installing: 2025-05-07T19:43:04.5483416Z hostname x86_64 3.23-4.amzn2023.0.3 amazonlinux 28 k 2025-05-07T19:43:04.5483886Z lshw x86_64 B.02.19.2-7.amzn2023.0.3 amazonlinux 319 k 2025-05-07T19:43:04.5484193Z 2025-05-07T19:43:04.5484289Z Transaction Summary 2025-05-07T19:43:04.5484533Z ================================================================================ 2025-05-07T19:43:04.5484852Z Install 2 Packages 2025-05-07T19:43:04.5484990Z 2025-05-07T19:43:04.5485102Z Total download size: 347 k 2025-05-07T19:43:04.5485350Z Installed size: 883 k 2025-05-07T19:43:04.5485593Z Downloading Packages: 2025-05-07T19:43:04.6560472Z (1/2): hostname-3.23-4.amzn2023.0.3.x86_64.rpm 1.4 MB/s | 28 kB 00:00 2025-05-07T19:43:04.6602410Z (2/2): lshw-B.02.19.2-7.amzn2023.0.3.x86_64.rpm 13 MB/s | 319 kB 00:00 2025-05-07T19:43:04.6608020Z -------------------------------------------------------------------------------- 2025-05-07T19:43:04.6608870Z Total 3.0 MB/s | 347 kB 00:00 2025-05-07T19:43:04.6816175Z Running transaction check 2025-05-07T19:43:04.6870700Z Transaction check succeeded. 2025-05-07T19:43:04.6871457Z Running transaction test 2025-05-07T19:43:04.7021942Z Transaction test succeeded. 2025-05-07T19:43:04.7023830Z Running transaction 2025-05-07T19:43:04.7291953Z Preparing : 1/1 2025-05-07T19:43:04.7358355Z Installing : lshw-B.02.19.2-7.amzn2023.0.3.x86_64 1/2 2025-05-07T19:43:04.7378631Z Installing : hostname-3.23-4.amzn2023.0.3.x86_64 2/2 2025-05-07T19:43:05.7816129Z Running scriptlet: hostname-3.23-4.amzn2023.0.3.x86_64 2/2 2025-05-07T19:43:05.7817130Z Verifying : hostname-3.23-4.amzn2023.0.3.x86_64 1/2 2025-05-07T19:43:05.8177579Z Verifying : lshw-B.02.19.2-7.amzn2023.0.3.x86_64 2/2 2025-05-07T19:43:05.8177935Z 2025-05-07T19:43:05.8178281Z Installed: 2025-05-07T19:43:05.8178642Z hostname-3.23-4.amzn2023.0.3.x86_64 lshw-B.02.19.2-7.amzn2023.0.3.x86_64 2025-05-07T19:43:05.8178978Z 2025-05-07T19:43:05.8179079Z Complete! 2025-05-07T19:43:05.8535047Z + hostname 2025-05-07T19:43:05.8535214Z 2025-05-07T19:43:05.8549086Z 180e7cabfdf5 2025-05-07T19:43:05.8549523Z 2025-05-07T19:43:05.8549804Z + sudo lshw -C display 2025-05-07T19:43:05.8550275Z 2025-05-07T19:43:06.0549850Z *-display UNCLAIMED 2025-05-07T19:43:06.0550732Z description: VGA compatible controller 2025-05-07T19:43:06.0551696Z product: Amazon.com, Inc. 2025-05-07T19:43:06.0552511Z vendor: Amazon.com, Inc. 2025-05-07T19:43:06.0553236Z physical id: 3 2025-05-07T19:43:06.0553915Z bus info: pci@0000:00:03.0 2025-05-07T19:43:06.0554665Z version: 00 2025-05-07T19:43:06.0555330Z width: 32 bits 2025-05-07T19:43:06.0555582Z clock: 33MHz 2025-05-07T19:43:06.0555836Z capabilities: vga_controller bus_master 2025-05-07T19:43:06.0556202Z configuration: latency=0 2025-05-07T19:43:06.0556525Z resources: memory:c0000000-c03fffff memory:c0000-dffff 2025-05-07T19:43:06.0572029Z 2025-05-07T19:43:06.0572382Z ################################################################################ 2025-05-07T19:43:06.0572777Z [INFO] Printing NVIDIA GPU info ... 2025-05-07T19:43:06.0680967Z lspci: Unable to load libkmod resources: error -2 2025-05-07T19:43:06.0702700Z which: no nvidia-smi in (/usr/local/sbin:/usr/local/bin:/usr/sbin:/usr/bin:/sbin:/bin) 2025-05-07T19:43:06.0704141Z [CHECK] nvidia-smi not found 2025-05-07T19:43:06.0705012Z ################################################################################ 2025-05-07T19:43:06.0705635Z [INFO] Printing AMD GPU info ... 2025-05-07T19:43:06.0812304Z lspci: Unable to load libkmod resources: error -2 2025-05-07T19:43:06.0833604Z which: no rocminfo in (/usr/local/sbin:/usr/local/bin:/usr/sbin:/usr/bin:/sbin:/bin) 2025-05-07T19:43:06.0834730Z [CHECK] rocminfo not found 2025-05-07T19:43:06.0839453Z which: no rocm-smi in (/usr/local/sbin:/usr/local/bin:/usr/sbin:/usr/bin:/sbin:/bin) 2025-05-07T19:43:06.0839926Z [CHECK] rocm-smi not found 2025-05-07T19:43:06.0909776Z ##[group]Run . $PRELUDE; setup_miniconda $HOME/miniconda 2025-05-07T19:43:06.0910220Z . $PRELUDE; setup_miniconda $HOME/miniconda 2025-05-07T19:43:06.0910835Z shell: bash --noprofile --norc -e -o pipefail {0} 2025-05-07T19:43:06.0911140Z env: 2025-05-07T19:43:06.0911376Z PRELUDE: .github/scripts/setup_env.bash 2025-05-07T19:43:06.0911679Z BUILD_ENV: build_binary 2025-05-07T19:43:06.0911906Z BUILD_TARGET: default 2025-05-07T19:43:06.0912140Z BUILD_VARIANT: cuda 2025-05-07T19:43:06.0912363Z BUILD_CUDA_VERSION: 12.6.3 2025-05-07T19:43:06.0912616Z ##[endgroup] 2025-05-07T19:43:06.5355336Z ################################################################################ 2025-05-07T19:43:06.5355917Z # Setup Miniconda 2025-05-07T19:43:06.5356334Z # 2025-05-07T19:43:06.5373380Z # [2025-05-07T19:43:06.536Z] + setup_miniconda /github/home/miniconda 2025-05-07T19:43:06.5374093Z ################################################################################ 2025-05-07T19:43:06.5374422Z 2025-05-07T19:43:06.5393996Z [EXEC] [ATTEMPT 0/3] + wget -q --timeout 1 pypi.org -O /dev/null 2025-05-07T19:43:06.6258979Z [CHECK] Network does not appear to be blocked. 2025-05-07T19:43:06.6259472Z + mkdir -p /github/home/miniconda 2025-05-07T19:43:06.6259680Z 2025-05-07T19:43:06.6273424Z 2025-05-07T19:43:06.6273768Z [SETUP] Downloading the Miniconda installer ... 2025-05-07T19:43:06.6293246Z [EXEC] [ATTEMPT 0/3] + wget -q https://repo.anaconda.com/miniconda/Miniconda3-latest-Linux-x86_64.sh -O miniconda.sh 2025-05-07T19:43:07.7430684Z [SETUP] Installing Miniconda ... 2025-05-07T19:43:07.7431163Z + bash miniconda.sh -b -p /github/home/miniconda -u 2025-05-07T19:43:07.7431427Z 2025-05-07T19:43:07.7575469Z PREFIX=/github/home/miniconda 2025-05-07T19:43:08.1140948Z Unpacking payload ... 2025-05-07T19:43:08.5983855Z entry_point.py:256: DeprecationWarning: Python 3.14 will, by default, filter extracted tar archives and reject files or modify their metadata. Use the filter argument to control this behavior. 2025-05-07T19:43:09.2779733Z entry_point.py:256: DeprecationWarning: Python 3.14 will, by default, filter extracted tar archives and reject files or modify their metadata. Use the filter argument to control this behavior. 2025-05-07T19:43:11.1424411Z 2025-05-07T19:43:11.1425300Z Installing base environment... 2025-05-07T19:43:11.1425953Z 2025-05-07T19:43:12.1360294Z Preparing transaction: ...working... done 2025-05-07T19:43:14.9884934Z Executing transaction: ...working... done 2025-05-07T19:43:15.5376306Z entry_point.py:256: DeprecationWarning: Python 3.14 will, by default, filter extracted tar archives and reject files or modify their metadata. Use the filter argument to control this behavior. 2025-05-07T19:43:15.6059114Z installation finished. 2025-05-07T19:43:15.6061827Z 2025-05-07T19:43:15.6062956Z + rm -f miniconda.sh 2025-05-07T19:43:15.6063538Z 2025-05-07T19:43:15.6237372Z 2025-05-07T19:43:15.6237867Z [SETUP] Reloading the bash configuration ... 2025-05-07T19:43:15.6238337Z + /github/home/miniconda/bin/conda init bash 2025-05-07T19:43:15.6238594Z 2025-05-07T19:43:15.9872336Z no change /github/home/miniconda/condabin/conda 2025-05-07T19:43:15.9872982Z no change /github/home/miniconda/bin/conda 2025-05-07T19:43:15.9873542Z no change /github/home/miniconda/bin/conda-env 2025-05-07T19:43:15.9873929Z no change /github/home/miniconda/bin/activate 2025-05-07T19:43:15.9874334Z no change /github/home/miniconda/bin/deactivate 2025-05-07T19:43:15.9874750Z no change /github/home/miniconda/etc/profile.d/conda.sh 2025-05-07T19:43:15.9875229Z no change /github/home/miniconda/etc/fish/conf.d/conda.fish 2025-05-07T19:43:15.9875691Z no change /github/home/miniconda/shell/condabin/Conda.psm1 2025-05-07T19:43:15.9876301Z no change /github/home/miniconda/shell/condabin/conda-hook.ps1 2025-05-07T19:43:15.9876888Z no change /github/home/miniconda/lib/python3.13/site-packages/xontrib/conda.xsh 2025-05-07T19:43:15.9877775Z no change /github/home/miniconda/etc/profile.d/conda.csh 2025-05-07T19:43:15.9878199Z modified /github/home/.bashrc 2025-05-07T19:43:15.9878392Z 2025-05-07T19:43:15.9878607Z ==> For changes to take effect, close and re-open your current shell. <== 2025-05-07T19:43:15.9878933Z 2025-05-07T19:43:16.0418628Z 2025-05-07T19:43:16.0419586Z + . /github/home/.bashrc 2025-05-07T19:43:16.0419856Z 2025-05-07T19:43:16.8301390Z 2025-05-07T19:43:16.8302473Z [SETUP] Installing libmamba-solver (required since Anaconda 2024.02-1) and libarchive ... 2025-05-07T19:43:16.8329028Z [EXEC] [ATTEMPT 0/3] + conda install --solver=classic -c conda-forge --override-channels -y conda-libmamba-solver libmamba libmambapy libarchive 2025-05-07T19:43:28.6466405Z Collecting package metadata (current_repodata.json): - \ | / - \ | / - \ | / - \ | / - \ | / - \ | / - \ | / - \ | / - \ | / - \ | / - \ | / - \ | / - \ | / - \ | / - \ | / - \ | / - \ | / - \ | / - \ | / - \ | / - \ | / - \ | / - \ | / - done 2025-05-07T19:43:30.1012120Z Solving environment: | / - \ | / - \ | / - done 2025-05-07T19:43:30.1914234Z 2025-05-07T19:43:30.1914751Z ## Package Plan ## 2025-05-07T19:43:30.1914997Z 2025-05-07T19:43:30.1915150Z environment location: /github/home/miniconda 2025-05-07T19:43:30.1915440Z 2025-05-07T19:43:30.1915574Z added / updated specs: 2025-05-07T19:43:30.1915895Z - conda-libmamba-solver 2025-05-07T19:43:30.1916176Z - libarchive 2025-05-07T19:43:30.1916429Z - libmamba 2025-05-07T19:43:30.1916661Z - libmambapy 2025-05-07T19:43:30.1916801Z 2025-05-07T19:43:30.1916832Z 2025-05-07T19:43:30.1916964Z The following packages will be downloaded: 2025-05-07T19:43:30.1917197Z 2025-05-07T19:43:30.1917672Z package | build 2025-05-07T19:43:30.1918070Z ---------------------------|----------------- 2025-05-07T19:43:30.1918548Z ca-certificates-2025.4.26 | hbd8a1cb_0 149 KB conda-forge 2025-05-07T19:43:30.1919179Z certifi-2025.4.26 | pyhd8ed1ab_0 154 KB conda-forge 2025-05-07T19:43:30.1919675Z conda-25.3.1 | py313h78bf25f_1 1.1 MB conda-forge 2025-05-07T19:43:30.1920177Z conda-libmamba-solver-25.4.0| pyhd8ed1ab_0 41 KB conda-forge 2025-05-07T19:43:30.1920852Z ------------------------------------------------------------ 2025-05-07T19:43:30.1941404Z Total: 1.4 MB 2025-05-07T19:43:30.1941728Z 2025-05-07T19:43:30.1941860Z The following packages will be UPDATED: 2025-05-07T19:43:30.1942107Z 2025-05-07T19:43:30.1947715Z ca-certificates pkgs/main/linux-64::ca-certificates-2~ --> conda-forge/noarch::ca-certificates-2025.4.26-hbd8a1cb_0 2025-05-07T19:43:30.1948589Z conda pkgs/main::conda-25.3.1-py313h06a4308~ --> conda-forge::conda-25.3.1-py313h78bf25f_1 2025-05-07T19:43:30.1949034Z 2025-05-07T19:43:30.1949269Z The following packages will be SUPERSEDED by a higher-priority channel: 2025-05-07T19:43:30.1949611Z 2025-05-07T19:43:30.1949977Z certifi pkgs/main/linux-64::certifi-2025.4.26~ --> conda-forge/noarch::certifi-2025.4.26-pyhd8ed1ab_0 2025-05-07T19:43:30.1950838Z conda-libmamba-so~ pkgs/main::conda-libmamba-solver-25.4~ --> conda-forge::conda-libmamba-solver-25.4.0-pyhd8ed1ab_0 2025-05-07T19:43:30.1951383Z 2025-05-07T19:43:30.1951388Z 2025-05-07T19:43:30.1951391Z 2025-05-07T19:43:30.1951544Z Downloading and Extracting Packages: ...working... 2025-05-07T19:43:30.1951976Z conda-25.3.1 | 1.1 MB | | 0% 2025-05-07T19:43:30.1952218Z 2025-05-07T19:43:30.1952549Z certifi-2025.4.26 | 154 KB | | 0%  2025-05-07T19:43:30.1952824Z 2025-05-07T19:43:30.1952833Z 2025-05-07T19:43:30.1953070Z ca-certificates-2025 | 149 KB | | 0%  2025-05-07T19:43:30.1953459Z 2025-05-07T19:43:30.1953463Z 2025-05-07T19:43:30.1953775Z 2025-05-07T19:43:30.2413500Z conda-libmamba-solve | 41 KB | | 0%  2025-05-07T19:43:30.2414396Z 2025-05-07T19:43:30.2414408Z 2025-05-07T19:43:30.2414419Z 2025-05-07T19:43:30.2502723Z conda-libmamba-solve | 41 KB | ########## | 100%  2025-05-07T19:43:30.2503717Z 2025-05-07T19:43:30.2503722Z 2025-05-07T19:43:30.2503726Z 2025-05-07T19:43:30.2741064Z conda-libmamba-solve | 41 KB | ########## | 100%  2025-05-07T19:43:30.2741427Z 2025-05-07T19:43:30.2801608Z certifi-2025.4.26 | 154 KB | ########## | 100%  2025-05-07T19:43:30.2874160Z conda-25.3.1 | 1.1 MB | ########## | 100% 2025-05-07T19:43:30.2874696Z 2025-05-07T19:43:30.3048444Z certifi-2025.4.26 | 154 KB | ########## | 100%  2025-05-07T19:43:30.3048758Z 2025-05-07T19:43:30.3048915Z 2025-05-07T19:43:30.3083504Z ca-certificates-2025 | 149 KB | # | 11%  2025-05-07T19:43:30.3084152Z 2025-05-07T19:43:30.3084156Z 2025-05-07T19:43:30.3261633Z ca-certificates-2025 | 149 KB | ########## | 100%  2025-05-07T19:43:30.3262508Z 2025-05-07T19:43:30.3262522Z 2025-05-07T19:43:30.3858682Z ca-certificates-2025 | 149 KB | ########## | 100%  2025-05-07T19:43:30.3859156Z conda-25.3.1 | 1.1 MB | ########## | 100% 2025-05-07T19:43:30.3862294Z conda-25.3.1 | 1.1 MB | ########## | 100% 2025-05-07T19:43:30.3862647Z 2025-05-07T19:43:30.3862854Z 2025-05-07T19:43:30.3863060Z  2025-05-07T19:43:30.3863291Z 2025-05-07T19:43:30.3863295Z 2025-05-07T19:43:30.3863472Z  2025-05-07T19:43:30.3863691Z 2025-05-07T19:43:30.3863695Z 2025-05-07T19:43:30.3863700Z 2025-05-07T19:43:30.3863906Z  done 2025-05-07T19:43:30.4876176Z Preparing transaction: | done 2025-05-07T19:43:30.5882447Z Verifying transaction: - done 2025-05-07T19:43:31.8910055Z Executing transaction: | / - \ | / - \ | / - \ | done 2025-05-07T19:43:33.4683812Z [SETUP] Updating Miniconda base packages ... 2025-05-07T19:43:33.4707598Z [EXEC] [ATTEMPT 0/3] + conda update -n base -c defaults --update-deps -y conda 2025-05-07T19:43:34.1691468Z Channels: 2025-05-07T19:43:34.1692138Z - defaults 2025-05-07T19:43:34.1692769Z Platform: linux-64 2025-05-07T19:43:35.2516237Z Collecting package metadata (repodata.json): - \ | / - \ done 2025-05-07T19:43:35.3833016Z Solving environment: / - Channels: 2025-05-07T19:43:35.3833962Z - defaults 2025-05-07T19:43:35.3834502Z Platform: linux-64 2025-05-07T19:43:35.6612751Z Collecting package metadata (repodata.json): | / - \ done 2025-05-07T19:43:35.8702779Z Solving environment: / - \ done 2025-05-07T19:43:35.9910282Z | done 2025-05-07T19:43:36.0556573Z 2025-05-07T19:43:36.0557041Z ## Package Plan ## 2025-05-07T19:43:36.0557254Z 2025-05-07T19:43:36.0557558Z environment location: /github/home/miniconda 2025-05-07T19:43:36.0558083Z 2025-05-07T19:43:36.0558235Z added / updated specs: 2025-05-07T19:43:36.0558526Z - conda 2025-05-07T19:43:36.0558656Z 2025-05-07T19:43:36.0558683Z 2025-05-07T19:43:36.0558838Z The following packages will be downloaded: 2025-05-07T19:43:36.0559070Z 2025-05-07T19:43:36.0559198Z package | build 2025-05-07T19:43:36.0559568Z ---------------------------|----------------- 2025-05-07T19:43:36.0559935Z pip-25.1 | pyhc872135_2 1.3 MB 2025-05-07T19:43:36.0560370Z tzdata-2025b | h04d1e81_0 116 KB 2025-05-07T19:43:36.0560799Z ------------------------------------------------------------ 2025-05-07T19:43:36.0561164Z Total: 1.4 MB 2025-05-07T19:43:36.0561403Z 2025-05-07T19:43:36.0561553Z The following packages will be UPDATED: 2025-05-07T19:43:36.0561891Z 2025-05-07T19:43:36.0562644Z pip pkgs/main/linux-64::pip-25.0-py313h06~ --> pkgs/main/noarch::pip-25.1-pyhc872135_2 2025-05-07T19:43:36.0563193Z tzdata 2025a-h04d1e81_0 --> 2025b-h04d1e81_0 2025-05-07T19:43:36.0563450Z 2025-05-07T19:43:36.0563454Z 2025-05-07T19:43:36.0563458Z 2025-05-07T19:43:36.0563638Z Downloading and Extracting Packages: ...working... 2025-05-07T19:43:36.0564011Z pip-25.1 | 1.3 MB | | 0% 2025-05-07T19:43:36.0564259Z 2025-05-07T19:43:36.0956271Z tzdata-2025b | 116 KB | | 0%  2025-05-07T19:43:36.0956579Z 2025-05-07T19:43:36.1321047Z tzdata-2025b | 116 KB | ########## | 100%  2025-05-07T19:43:36.2758904Z pip-25.1 | 1.3 MB | ########## | 100% 2025-05-07T19:43:36.2759667Z 2025-05-07T19:43:36.2760464Z tzdata-2025b | 116 KB | ########## | 100%  2025-05-07T19:43:36.2761225Z 2025-05-07T19:43:36.3009840Z tzdata-2025b | 116 KB | ########## | 100%  2025-05-07T19:43:36.3011302Z pip-25.1 | 1.3 MB | ########## | 100% 2025-05-07T19:43:36.3012312Z pip-25.1 | 1.3 MB | ########## | 100% 2025-05-07T19:43:36.3013260Z 2025-05-07T19:43:36.3013862Z 2025-05-07T19:43:36.3014410Z  done 2025-05-07T19:43:36.4017525Z Preparing transaction: - done 2025-05-07T19:43:36.5030329Z Verifying transaction: | done 2025-05-07T19:43:38.5060674Z Executing transaction: - \ | / - \ | / - \ | / - \ | / - \ | / done 2025-05-07T19:43:39.0403008Z [SETUP] Cleaning up Conda packages ... 2025-05-07T19:43:39.0403929Z + conda clean --packages --tarball -y 2025-05-07T19:43:39.0404241Z 2025-05-07T19:43:39.4769845Z Will remove 99 (117.8 MB) tarball(s). 2025-05-07T19:43:39.4771635Z Will remove 11 (16.0 MB) package(s). 2025-05-07T19:43:39.5317182Z 2025-05-07T19:43:39.5321356Z + conda clean --all -y 2025-05-07T19:43:39.5321938Z 2025-05-07T19:43:39.9727353Z There are no unused tarball(s) to remove. 2025-05-07T19:43:39.9728169Z Will remove 1 index cache(s). 2025-05-07T19:43:39.9728909Z There are no unused package(s) to remove. 2025-05-07T19:43:39.9729298Z There are no tempfile(s) to remove. 2025-05-07T19:43:39.9729626Z There are no logfile(s) to remove. 2025-05-07T19:43:40.0277511Z 2025-05-07T19:43:40.0277987Z + conda info 2025-05-07T19:43:40.0278148Z 2025-05-07T19:43:40.5873693Z 2025-05-07T19:43:40.5874360Z active environment : base 2025-05-07T19:43:40.5874773Z active env location : /github/home/miniconda 2025-05-07T19:43:40.5875136Z shell level : 1 2025-05-07T19:43:40.5875486Z user config file : /github/home/.condarc 2025-05-07T19:43:40.5875881Z populated config files : /github/home/miniconda/.condarc 2025-05-07T19:43:40.5876307Z conda version : 25.3.1 2025-05-07T19:43:40.5876630Z conda-build version : not installed 2025-05-07T19:43:40.5876954Z python version : 3.13.2.final.0 2025-05-07T19:43:40.5877301Z solver : libmamba (default) 2025-05-07T19:43:40.5877643Z virtual packages : __archspec=1=cascadelake 2025-05-07T19:43:40.5878010Z __conda=25.3.1=0 2025-05-07T19:43:40.5878416Z __glibc=2.34=0 2025-05-07T19:43:40.5878724Z __linux=6.1.130=0 2025-05-07T19:43:40.5879003Z __unix=0=0 2025-05-07T19:43:40.5879361Z base environment : /github/home/miniconda (writable) 2025-05-07T19:43:40.5879787Z conda av data dir : /github/home/miniconda/etc/conda 2025-05-07T19:43:40.5880136Z conda av metadata url : None 2025-05-07T19:43:40.5880537Z channel URLs : https://repo.anaconda.com/pkgs/main/linux-64 2025-05-07T19:43:40.5880972Z https://repo.anaconda.com/pkgs/main/noarch 2025-05-07T19:43:40.5881394Z https://repo.anaconda.com/pkgs/r/linux-64 2025-05-07T19:43:40.5882047Z https://repo.anaconda.com/pkgs/r/noarch 2025-05-07T19:43:40.5882461Z package cache : /github/home/miniconda/pkgs 2025-05-07T19:43:40.5882829Z /github/home/.conda/pkgs 2025-05-07T19:43:40.5883183Z envs directories : /github/home/miniconda/envs 2025-05-07T19:43:40.5883552Z /github/home/.conda/envs 2025-05-07T19:43:40.5883866Z platform : linux-64 2025-05-07T19:43:40.5884738Z user-agent : conda/25.3.1 requests/2.32.3 CPython/3.13.2 Linux/6.1.130-139.222.amzn2023.x86_64 amzn/2023.7.20250428 glibc/2.34 solver/libmamba conda-libmamba-solver/25.4.0 libmambapy/2.0.5 aau/0.7.0 c/. s/. e/. 2025-05-07T19:43:40.5885585Z UID:GID : 0:0 2025-05-07T19:43:40.5885875Z netrc file : None 2025-05-07T19:43:40.5886166Z offline mode : False 2025-05-07T19:43:40.5886344Z 2025-05-07T19:43:40.6475758Z 2025-05-07T19:43:40.6476426Z [SETUP] Exporting Miniconda variables ... 2025-05-07T19:43:40.6478351Z [SETUP] Saving Miniconda variables to /__w/_temp/_runner_file_commands/add_path_6a335b1d-81a6-42f8-af04-4232f2b63839 ... 2025-05-07T19:43:40.6480351Z [SETUP] Successfully set up Miniconda at /github/home/miniconda 2025-05-07T19:43:40.6619700Z ##[group]Run . $PRELUDE; create_conda_environment $BUILD_ENV 3.13 2025-05-07T19:43:40.6620249Z . $PRELUDE; create_conda_environment $BUILD_ENV 3.13 2025-05-07T19:43:40.6621038Z shell: bash --noprofile --norc -e -o pipefail {0} 2025-05-07T19:43:40.6621384Z env: 2025-05-07T19:43:40.6621605Z PRELUDE: .github/scripts/setup_env.bash 2025-05-07T19:43:40.6621926Z BUILD_ENV: build_binary 2025-05-07T19:43:40.6622185Z BUILD_TARGET: default 2025-05-07T19:43:40.6622418Z BUILD_VARIANT: cuda 2025-05-07T19:43:40.6622762Z BUILD_CUDA_VERSION: 12.6.3 2025-05-07T19:43:40.6622991Z ##[endgroup] 2025-05-07T19:43:41.1051987Z ################################################################################ 2025-05-07T19:43:41.1052422Z # Create Conda Environment 2025-05-07T19:43:41.1052670Z # 2025-05-07T19:43:41.1068270Z # [2025-05-07T19:43:41.106Z] + create_conda_environment build_binary 3.13 2025-05-07T19:43:41.1068891Z ################################################################################ 2025-05-07T19:43:41.1069129Z 2025-05-07T19:43:41.1100695Z [EXEC] [ATTEMPT 0/3] + wget -q --timeout 1 pypi.org -O /dev/null 2025-05-07T19:43:41.2012386Z [CHECK] Network does not appear to be blocked. 2025-05-07T19:43:41.2013508Z [SETUP] Listing existing Conda environments ... 2025-05-07T19:43:41.2014488Z + conda info --envs 2025-05-07T19:43:41.2014892Z 2025-05-07T19:43:41.7759199Z 2025-05-07T19:43:41.7759782Z # conda environments: 2025-05-07T19:43:41.7760544Z # 2025-05-07T19:43:41.7761172Z base /github/home/miniconda 2025-05-07T19:43:41.7761853Z 2025-05-07T19:43:41.8351714Z 2025-05-07T19:43:41.8352615Z [SETUP] Deleting the prefix directory if it exists ... 2025-05-07T19:43:43.4657550Z + rm -rf /github/home/miniconda/envs/build_binary 2025-05-07T19:43:43.4658328Z 2025-05-07T19:43:43.4673518Z 2025-05-07T19:43:43.4679303Z [SETUP] Creating new Conda environment (Python 3.13) ... 2025-05-07T19:43:43.4701946Z [EXEC] [ATTEMPT 0/3] + conda create -y -n build_binary python=3.13 2025-05-07T19:43:44.0454490Z Channels: 2025-05-07T19:43:44.0455181Z - defaults 2025-05-07T19:43:44.0455815Z Platform: linux-64 2025-05-07T19:43:45.3484863Z Collecting package metadata (repodata.json): - \ | / - \ | / done 2025-05-07T19:43:45.4492295Z Solving environment: \ done 2025-05-07T19:43:45.4776143Z 2025-05-07T19:43:45.4776443Z ## Package Plan ## 2025-05-07T19:43:45.4776619Z 2025-05-07T19:43:45.4776940Z environment location: /github/home/miniconda/envs/build_binary 2025-05-07T19:43:45.4777541Z 2025-05-07T19:43:45.4777663Z added / updated specs: 2025-05-07T19:43:45.4777968Z - python=3.13 2025-05-07T19:43:45.4778150Z 2025-05-07T19:43:45.4778155Z 2025-05-07T19:43:45.4778291Z The following packages will be downloaded: 2025-05-07T19:43:45.4778554Z 2025-05-07T19:43:45.4778685Z package | build 2025-05-07T19:43:45.4779062Z ---------------------------|----------------- 2025-05-07T19:43:45.4779477Z _libgcc_mutex-0.1 | main 3 KB 2025-05-07T19:43:45.4779942Z _openmp_mutex-5.1 | 1_gnu 21 KB 2025-05-07T19:43:45.4780394Z ca-certificates-2025.2.25 | h06a4308_0 129 KB 2025-05-07T19:43:45.4780873Z python_abi-3.13 | 0_cp313 6 KB 2025-05-07T19:43:45.4781272Z ------------------------------------------------------------ 2025-05-07T19:43:45.4781660Z Total: 159 KB 2025-05-07T19:43:45.4781890Z 2025-05-07T19:43:45.4782055Z The following NEW packages will be INSTALLED: 2025-05-07T19:43:45.4782304Z 2025-05-07T19:43:45.4782537Z _libgcc_mutex pkgs/main/linux-64::_libgcc_mutex-0.1-main 2025-05-07T19:43:45.4783038Z _openmp_mutex pkgs/main/linux-64::_openmp_mutex-5.1-1_gnu 2025-05-07T19:43:45.4783603Z bzip2 pkgs/main/linux-64::bzip2-1.0.8-h5eee18b_6 2025-05-07T19:43:45.4784130Z ca-certificates pkgs/main/linux-64::ca-certificates-2025.2.25-h06a4308_0 2025-05-07T19:43:45.4785008Z expat pkgs/main/linux-64::expat-2.7.1-h6a678d5_0 2025-05-07T19:43:45.4785505Z ld_impl_linux-64 pkgs/main/linux-64::ld_impl_linux-64-2.40-h12ee557_0 2025-05-07T19:43:45.4786029Z libffi pkgs/main/linux-64::libffi-3.4.4-h6a678d5_1 2025-05-07T19:43:45.4786477Z libgcc-ng pkgs/main/linux-64::libgcc-ng-11.2.0-h1234567_1 2025-05-07T19:43:45.4786972Z libgomp pkgs/main/linux-64::libgomp-11.2.0-h1234567_1 2025-05-07T19:43:45.4787455Z libmpdec pkgs/main/linux-64::libmpdec-4.0.0-h5eee18b_0 2025-05-07T19:43:45.4787945Z libstdcxx-ng pkgs/main/linux-64::libstdcxx-ng-11.2.0-h1234567_1 2025-05-07T19:43:45.4788627Z libuuid pkgs/main/linux-64::libuuid-1.41.5-h5eee18b_0 2025-05-07T19:43:45.4789075Z ncurses pkgs/main/linux-64::ncurses-6.4-h6a678d5_0 2025-05-07T19:43:45.4789551Z openssl pkgs/main/linux-64::openssl-3.0.16-h5eee18b_0 2025-05-07T19:43:45.4789992Z pip pkgs/main/noarch::pip-25.1-pyhc872135_2 2025-05-07T19:43:45.4790462Z python pkgs/main/linux-64::python-3.13.2-hf623796_100_cp313 2025-05-07T19:43:45.4790955Z python_abi pkgs/main/linux-64::python_abi-3.13-0_cp313 2025-05-07T19:43:45.4791402Z readline pkgs/main/linux-64::readline-8.2-h5eee18b_0 2025-05-07T19:43:45.4791920Z setuptools pkgs/main/linux-64::setuptools-78.1.1-py313h06a4308_0 2025-05-07T19:43:45.4792410Z sqlite pkgs/main/linux-64::sqlite-3.45.3-h5eee18b_0 2025-05-07T19:43:45.4792840Z tk pkgs/main/linux-64::tk-8.6.14-h39e8969_0 2025-05-07T19:43:45.4793275Z tzdata pkgs/main/noarch::tzdata-2025b-h04d1e81_0 2025-05-07T19:43:45.4793709Z wheel pkgs/main/linux-64::wheel-0.45.1-py313h06a4308_0 2025-05-07T19:43:45.4794141Z xz pkgs/main/linux-64::xz-5.6.4-h5eee18b_1 2025-05-07T19:43:45.4794526Z zlib pkgs/main/linux-64::zlib-1.2.13-h5eee18b_1 2025-05-07T19:43:45.4794808Z 2025-05-07T19:43:45.4794813Z 2025-05-07T19:43:45.4794818Z 2025-05-07T19:43:45.4794967Z Downloading and Extracting Packages: ...working... 2025-05-07T19:43:45.4795406Z ca-certificates-2025 | 129 KB | | 0% 2025-05-07T19:43:45.4795651Z 2025-05-07T19:43:45.4795983Z _openmp_mutex-5.1 | 21 KB | | 0%  2025-05-07T19:43:45.4796260Z 2025-05-07T19:43:45.4796263Z 2025-05-07T19:43:45.4796480Z python_abi-3.13 | 6 KB | | 0%  2025-05-07T19:43:45.4796736Z 2025-05-07T19:43:45.4796739Z 2025-05-07T19:43:45.4797277Z 2025-05-07T19:43:45.5213137Z _libgcc_mutex-0.1 | 3 KB | | 0%  2025-05-07T19:43:45.5214023Z 2025-05-07T19:43:45.5221952Z _openmp_mutex-5.1 | 21 KB | ########## | 100%  2025-05-07T19:43:45.5273555Z ca-certificates-2025 | 129 KB | ########## | 100% 2025-05-07T19:43:45.5274362Z 2025-05-07T19:43:45.5274375Z 2025-05-07T19:43:45.5274387Z 2025-05-07T19:43:45.5359449Z _libgcc_mutex-0.1 | 3 KB | ########## | 100%  2025-05-07T19:43:45.5360307Z 2025-05-07T19:43:45.5360320Z 2025-05-07T19:43:45.5360331Z 2025-05-07T19:43:45.5362348Z _libgcc_mutex-0.1 | 3 KB | ########## | 100%  2025-05-07T19:43:45.5435892Z ca-certificates-2025 | 129 KB | ########## | 100% 2025-05-07T19:43:45.5436703Z 2025-05-07T19:43:45.5716363Z _openmp_mutex-5.1 | 21 KB | ########## | 100%  2025-05-07T19:43:45.5717188Z 2025-05-07T19:43:45.5717201Z 2025-05-07T19:43:45.5767976Z python_abi-3.13 | 6 KB | ########## | 100%  2025-05-07T19:43:45.5768289Z 2025-05-07T19:43:45.5768361Z 2025-05-07T19:43:45.5775472Z python_abi-3.13 | 6 KB | ########## | 100%  2025-05-07T19:43:45.5775913Z 2025-05-07T19:43:45.5776149Z 2025-05-07T19:43:45.5776538Z  2025-05-07T19:43:45.5776761Z 2025-05-07T19:43:45.5776765Z 2025-05-07T19:43:45.5777291Z  2025-05-07T19:43:45.5777564Z 2025-05-07T19:43:45.5777568Z 2025-05-07T19:43:45.5777572Z 2025-05-07T19:43:45.5777772Z  done 2025-05-07T19:43:45.7888864Z Preparing transaction: / - done 2025-05-07T19:43:47.3438551Z Verifying transaction: | / - \ | / - \ | / - \ | / done 2025-05-07T19:43:49.5586417Z Executing transaction: \ | / - \ | / - \ | / - \ | / - \ | / - \ | done 2025-05-07T19:43:49.5622953Z # 2025-05-07T19:43:49.5623678Z # To activate this environment, use 2025-05-07T19:43:49.5625958Z # 2025-05-07T19:43:49.5626183Z # $ conda activate build_binary 2025-05-07T19:43:49.5626505Z # 2025-05-07T19:43:49.5627021Z # To deactivate an active environment, use 2025-05-07T19:43:49.5627363Z # 2025-05-07T19:43:49.5627576Z # $ conda deactivate 2025-05-07T19:43:49.5627778Z 2025-05-07T19:43:49.6462181Z [SETUP] Upgrading PIP to latest ... 2025-05-07T19:43:49.6487427Z [EXEC] [ATTEMPT 0/3] + conda run -n build_binary pip install --upgrade pip 2025-05-07T19:43:52.6091250Z WARNING: Running pip as the 'root' user can result in broken permissions and conflicting behaviour with the system package manager, possibly rendering your system unusable. It is recommended to use a virtual environment instead: https://pip.pypa.io/warnings/venv. Use the --root-user-action option if you know what you are doing and want to suppress this warning. 2025-05-07T19:43:52.6095854Z 2025-05-07T19:43:52.6097084Z Requirement already satisfied: pip in /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages (25.1) 2025-05-07T19:43:52.6098121Z Collecting pip 2025-05-07T19:43:52.6098426Z Downloading pip-25.1.1-py3-none-any.whl.metadata (3.6 kB) 2025-05-07T19:43:52.6098853Z Downloading pip-25.1.1-py3-none-any.whl (1.8 MB) 2025-05-07T19:43:52.6099662Z ━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━ 1.8/1.8 MB 96.6 MB/s eta 0:00:00 2025-05-07T19:43:52.6100045Z Installing collected packages: pip 2025-05-07T19:43:52.6100352Z Attempting uninstall: pip 2025-05-07T19:43:52.6100629Z Found existing installation: pip 25.1 2025-05-07T19:43:52.6100946Z Uninstalling pip-25.1: 2025-05-07T19:43:52.6101216Z Successfully uninstalled pip-25.1 2025-05-07T19:43:52.6101538Z Successfully installed pip-25.1.1 2025-05-07T19:43:52.6101722Z 2025-05-07T19:43:52.6686366Z [SETUP] Upgrading pyOpenSSL ... 2025-05-07T19:43:52.6714384Z [EXEC] [ATTEMPT 0/3] + conda install -n build_binary -c conda-forge --override-channels -y pyOpenSSL>22.1.0 2025-05-07T19:43:53.3372568Z Channels: 2025-05-07T19:43:53.3373429Z - conda-forge 2025-05-07T19:43:53.3373719Z Platform: linux-64 2025-05-07T19:44:03.0220138Z Collecting package metadata (repodata.json): - \ | / - \ | / - \ | / - \ | / - done 2025-05-07T19:44:04.9280620Z Solving environment: | / - \ | done 2025-05-07T19:44:04.9762482Z 2025-05-07T19:44:04.9763618Z ## Package Plan ## 2025-05-07T19:44:04.9764146Z 2025-05-07T19:44:04.9764930Z environment location: /github/home/miniconda/envs/build_binary 2025-05-07T19:44:04.9765891Z 2025-05-07T19:44:04.9766169Z added / updated specs: 2025-05-07T19:44:04.9766794Z - pyopenssl[version='>22.1.0'] 2025-05-07T19:44:04.9767004Z 2025-05-07T19:44:04.9767010Z 2025-05-07T19:44:04.9767174Z The following packages will be downloaded: 2025-05-07T19:44:04.9767436Z 2025-05-07T19:44:04.9767568Z package | build 2025-05-07T19:44:04.9767961Z ---------------------------|----------------- 2025-05-07T19:44:04.9768367Z cffi-1.17.1 | py313hfab6e84_0 289 KB conda-forge 2025-05-07T19:44:04.9768916Z cryptography-44.0.3 | py313h6556f6e_0 1.5 MB conda-forge 2025-05-07T19:44:04.9769397Z libgcc-15.1.0 | h767d61c_2 810 KB conda-forge 2025-05-07T19:44:04.9769883Z libgcc-ng-15.1.0 | h69a702a_2 34 KB conda-forge 2025-05-07T19:44:04.9770840Z libgomp-15.1.0 | h767d61c_2 442 KB conda-forge 2025-05-07T19:44:04.9771328Z openssl-3.5.0 | h7b32b05_1 3.0 MB conda-forge 2025-05-07T19:44:04.9771820Z pycparser-2.22 | pyh29332c3_1 108 KB conda-forge 2025-05-07T19:44:04.9772300Z pyopenssl-25.0.0 | pyhd8ed1ab_0 120 KB conda-forge 2025-05-07T19:44:04.9772848Z typing-extensions-4.13.2 | h0e9735f_0 88 KB conda-forge 2025-05-07T19:44:04.9773380Z typing_extensions-4.13.2 | pyh29332c3_0 51 KB conda-forge 2025-05-07T19:44:04.9774019Z ------------------------------------------------------------ 2025-05-07T19:44:04.9774389Z Total: 6.4 MB 2025-05-07T19:44:04.9774649Z 2025-05-07T19:44:04.9774795Z The following NEW packages will be INSTALLED: 2025-05-07T19:44:04.9775037Z 2025-05-07T19:44:04.9775294Z cffi conda-forge/linux-64::cffi-1.17.1-py313hfab6e84_0 2025-05-07T19:44:04.9775832Z cryptography conda-forge/linux-64::cryptography-44.0.3-py313h6556f6e_0 2025-05-07T19:44:04.9776400Z libgcc conda-forge/linux-64::libgcc-15.1.0-h767d61c_2 2025-05-07T19:44:04.9779961Z pycparser conda-forge/noarch::pycparser-2.22-pyh29332c3_1 2025-05-07T19:44:04.9780469Z pyopenssl conda-forge/noarch::pyopenssl-25.0.0-pyhd8ed1ab_0 2025-05-07T19:44:04.9781028Z typing-extensions conda-forge/noarch::typing-extensions-4.13.2-h0e9735f_0 2025-05-07T19:44:04.9781610Z typing_extensions conda-forge/noarch::typing_extensions-4.13.2-pyh29332c3_0 2025-05-07T19:44:04.9781987Z 2025-05-07T19:44:04.9782106Z The following packages will be UPDATED: 2025-05-07T19:44:04.9782311Z 2025-05-07T19:44:04.9782735Z ca-certificates pkgs/main/linux-64::ca-certificates-2~ --> conda-forge/noarch::ca-certificates-2025.4.26-hbd8a1cb_0 2025-05-07T19:44:04.9783521Z libgcc-ng pkgs/main::libgcc-ng-11.2.0-h1234567_1 --> conda-forge::libgcc-ng-15.1.0-h69a702a_2 2025-05-07T19:44:04.9784196Z libgomp pkgs/main::libgomp-11.2.0-h1234567_1 --> conda-forge::libgomp-15.1.0-h767d61c_2 2025-05-07T19:44:04.9785001Z openssl pkgs/main::openssl-3.0.16-h5eee18b_0 --> conda-forge::openssl-3.5.0-h7b32b05_1 2025-05-07T19:44:04.9785412Z 2025-05-07T19:44:04.9785416Z 2025-05-07T19:44:04.9785421Z 2025-05-07T19:44:04.9785574Z Downloading and Extracting Packages: ...working... 2025-05-07T19:44:04.9785991Z openssl-3.5.0 | 3.0 MB | | 0% 2025-05-07T19:44:04.9786237Z 2025-05-07T19:44:04.9786589Z cryptography-44.0.3 | 1.5 MB | | 0%  2025-05-07T19:44:04.9786885Z 2025-05-07T19:44:04.9786889Z 2025-05-07T19:44:04.9787103Z libgcc-15.1.0 | 810 KB | | 0%  2025-05-07T19:44:04.9787354Z 2025-05-07T19:44:04.9787358Z 2025-05-07T19:44:04.9787362Z 2025-05-07T19:44:04.9791949Z libgomp-15.1.0 | 442 KB | | 0%  2025-05-07T19:44:04.9792735Z 2025-05-07T19:44:04.9792747Z 2025-05-07T19:44:04.9792757Z 2025-05-07T19:44:04.9792767Z 2025-05-07T19:44:04.9808244Z cffi-1.17.1 | 289 KB | | 0%  2025-05-07T19:44:04.9808566Z 2025-05-07T19:44:04.9808570Z 2025-05-07T19:44:04.9808574Z 2025-05-07T19:44:04.9808577Z 2025-05-07T19:44:04.9808581Z 2025-05-07T19:44:04.9808862Z pyopenssl-25.0.0 | 120 KB | | 0%  2025-05-07T19:44:04.9809191Z 2025-05-07T19:44:04.9809195Z 2025-05-07T19:44:04.9809198Z 2025-05-07T19:44:04.9809202Z 2025-05-07T19:44:04.9809205Z 2025-05-07T19:44:04.9809209Z 2025-05-07T19:44:04.9809489Z pycparser-2.22 | 108 KB | | 0%  2025-05-07T19:44:04.9809804Z 2025-05-07T19:44:04.9809808Z 2025-05-07T19:44:04.9809811Z 2025-05-07T19:44:04.9809829Z 2025-05-07T19:44:04.9809833Z 2025-05-07T19:44:04.9809836Z 2025-05-07T19:44:04.9809839Z 2025-05-07T19:44:04.9810479Z typing-extensions-4. | 88 KB | | 0%  2025-05-07T19:44:04.9810814Z 2025-05-07T19:44:04.9810843Z 2025-05-07T19:44:04.9810847Z 2025-05-07T19:44:04.9810850Z 2025-05-07T19:44:04.9810854Z 2025-05-07T19:44:04.9810857Z 2025-05-07T19:44:04.9810860Z 2025-05-07T19:44:04.9810864Z 2025-05-07T19:44:04.9811161Z typing_extensions-4. | 51 KB | | 0%  2025-05-07T19:44:04.9811475Z 2025-05-07T19:44:04.9811479Z 2025-05-07T19:44:04.9811482Z 2025-05-07T19:44:04.9811514Z 2025-05-07T19:44:04.9811517Z 2025-05-07T19:44:04.9811521Z 2025-05-07T19:44:04.9811524Z 2025-05-07T19:44:04.9811527Z 2025-05-07T19:44:04.9811638Z 2025-05-07T19:44:05.0307357Z libgcc-ng-15.1.0 | 34 KB | | 0%  2025-05-07T19:44:05.0308009Z 2025-05-07T19:44:05.0308013Z 2025-05-07T19:44:05.0308046Z 2025-05-07T19:44:05.0522721Z libgomp-15.1.0 | 442 KB | ########## | 100%  2025-05-07T19:44:05.0523593Z 2025-05-07T19:44:05.0533862Z cryptography-44.0.3 | 1.5 MB | ########## | 100%  2025-05-07T19:44:05.0534678Z 2025-05-07T19:44:05.0534691Z 2025-05-07T19:44:05.0534702Z 2025-05-07T19:44:05.0764319Z libgomp-15.1.0 | 442 KB | ########## | 100%  2025-05-07T19:44:05.0808936Z openssl-3.5.0 | 3.0 MB | ##4 | 24% 2025-05-07T19:44:05.0809255Z 2025-05-07T19:44:05.0809259Z 2025-05-07T19:44:05.0809612Z libgcc-15.1.0 | 810 KB | ########## | 100%  2025-05-07T19:44:05.0809878Z 2025-05-07T19:44:05.0809882Z 2025-05-07T19:44:05.0830572Z libgcc-15.1.0 | 810 KB | ########## | 100%  2025-05-07T19:44:05.0830908Z 2025-05-07T19:44:05.0830930Z 2025-05-07T19:44:05.0830934Z 2025-05-07T19:44:05.0830938Z 2025-05-07T19:44:05.0830941Z 2025-05-07T19:44:05.0831259Z pyopenssl-25.0.0 | 120 KB | #3 | 13%  2025-05-07T19:44:05.0831565Z 2025-05-07T19:44:05.0831598Z 2025-05-07T19:44:05.0831602Z 2025-05-07T19:44:05.0831778Z 2025-05-07T19:44:05.0856146Z cffi-1.17.1 | 289 KB | 5 | 6%  2025-05-07T19:44:05.0856461Z 2025-05-07T19:44:05.0856465Z 2025-05-07T19:44:05.0856469Z 2025-05-07T19:44:05.0856473Z 2025-05-07T19:44:05.0856503Z 2025-05-07T19:44:05.0942707Z pyopenssl-25.0.0 | 120 KB | ########## | 100%  2025-05-07T19:44:05.0943618Z 2025-05-07T19:44:05.0943632Z 2025-05-07T19:44:05.0943643Z 2025-05-07T19:44:05.0943654Z 2025-05-07T19:44:05.1046937Z cffi-1.17.1 | 289 KB | ########## | 100%  2025-05-07T19:44:05.1047757Z 2025-05-07T19:44:05.1047771Z 2025-05-07T19:44:05.1047783Z 2025-05-07T19:44:05.1047793Z 2025-05-07T19:44:05.1047833Z 2025-05-07T19:44:05.1047844Z 2025-05-07T19:44:05.1080382Z pycparser-2.22 | 108 KB | #4 | 15%  2025-05-07T19:44:05.1081296Z 2025-05-07T19:44:05.1081310Z 2025-05-07T19:44:05.1081320Z 2025-05-07T19:44:05.1081331Z 2025-05-07T19:44:05.1081341Z 2025-05-07T19:44:05.1081351Z 2025-05-07T19:44:05.1204664Z pycparser-2.22 | 108 KB | ########## | 100%  2025-05-07T19:44:05.1205026Z 2025-05-07T19:44:05.1205214Z 2025-05-07T19:44:05.1205224Z 2025-05-07T19:44:05.1205228Z 2025-05-07T19:44:05.1205232Z 2025-05-07T19:44:05.1246378Z pyopenssl-25.0.0 | 120 KB | ########## | 100%  2025-05-07T19:44:05.1246747Z 2025-05-07T19:44:05.1246751Z 2025-05-07T19:44:05.1246755Z 2025-05-07T19:44:05.1246759Z 2025-05-07T19:44:05.1246763Z 2025-05-07T19:44:05.1246766Z 2025-05-07T19:44:05.1246770Z 2025-05-07T19:44:05.1253502Z typing-extensions-4. | 88 KB | #8 | 18%  2025-05-07T19:44:05.1253914Z 2025-05-07T19:44:05.1253937Z 2025-05-07T19:44:05.1253942Z 2025-05-07T19:44:05.1253945Z 2025-05-07T19:44:05.1253949Z 2025-05-07T19:44:05.1253953Z 2025-05-07T19:44:05.1253956Z 2025-05-07T19:44:05.1253960Z 2025-05-07T19:44:05.1274223Z typing_extensions-4. | 51 KB | ###1 | 31%  2025-05-07T19:44:05.1274562Z 2025-05-07T19:44:05.1274567Z 2025-05-07T19:44:05.1274570Z 2025-05-07T19:44:05.1274874Z 2025-05-07T19:44:05.1274881Z 2025-05-07T19:44:05.1274884Z 2025-05-07T19:44:05.1274888Z 2025-05-07T19:44:05.1274891Z 2025-05-07T19:44:05.1275778Z typing_extensions-4. | 51 KB | ########## | 100%  2025-05-07T19:44:05.1276154Z 2025-05-07T19:44:05.1276159Z 2025-05-07T19:44:05.1276163Z 2025-05-07T19:44:05.1276168Z 2025-05-07T19:44:05.1276172Z 2025-05-07T19:44:05.1276176Z 2025-05-07T19:44:05.1276194Z 2025-05-07T19:44:05.1445720Z typing-extensions-4. | 88 KB | ########## | 100%  2025-05-07T19:44:05.1446121Z 2025-05-07T19:44:05.1446125Z 2025-05-07T19:44:05.1446520Z 2025-05-07T19:44:05.1446524Z 2025-05-07T19:44:05.1446528Z 2025-05-07T19:44:05.1446532Z 2025-05-07T19:44:05.1446535Z 2025-05-07T19:44:05.1446539Z 2025-05-07T19:44:05.1446542Z 2025-05-07T19:44:05.1457392Z libgcc-ng-15.1.0 | 34 KB | ####7 | 47%  2025-05-07T19:44:05.1458329Z 2025-05-07T19:44:05.1458342Z 2025-05-07T19:44:05.1458386Z 2025-05-07T19:44:05.1458397Z 2025-05-07T19:44:05.1458406Z 2025-05-07T19:44:05.1458416Z 2025-05-07T19:44:05.1458426Z 2025-05-07T19:44:05.1458436Z 2025-05-07T19:44:05.1458461Z 2025-05-07T19:44:05.1528943Z libgcc-ng-15.1.0 | 34 KB | ########## | 100%  2025-05-07T19:44:05.1529400Z 2025-05-07T19:44:05.1529405Z 2025-05-07T19:44:05.1554304Z libgcc-15.1.0 | 810 KB | ########## | 100%  2025-05-07T19:44:05.1624851Z openssl-3.5.0 | 3.0 MB | ########## | 100% 2025-05-07T19:44:05.1625182Z 2025-05-07T19:44:05.1625350Z 2025-05-07T19:44:05.1625354Z 2025-05-07T19:44:05.1625582Z 2025-05-07T19:44:05.1738445Z cffi-1.17.1 | 289 KB | ########## | 100%  2025-05-07T19:44:05.1738789Z 2025-05-07T19:44:05.1738814Z 2025-05-07T19:44:05.1738818Z 2025-05-07T19:44:05.1738823Z 2025-05-07T19:44:05.1738827Z 2025-05-07T19:44:05.1738833Z 2025-05-07T19:44:05.1738836Z 2025-05-07T19:44:05.1738841Z 2025-05-07T19:44:05.1856737Z typing_extensions-4. | 51 KB | ########## | 100%  2025-05-07T19:44:05.1857079Z 2025-05-07T19:44:05.1857084Z 2025-05-07T19:44:05.1857104Z 2025-05-07T19:44:05.1857107Z 2025-05-07T19:44:05.1857111Z 2025-05-07T19:44:05.1857116Z 2025-05-07T19:44:05.1857402Z 2025-05-07T19:44:05.2173228Z typing-extensions-4. | 88 KB | ########## | 100%  2025-05-07T19:44:05.2174284Z 2025-05-07T19:44:05.2174297Z 2025-05-07T19:44:05.2174308Z 2025-05-07T19:44:05.2174320Z 2025-05-07T19:44:05.2174332Z 2025-05-07T19:44:05.2174342Z 2025-05-07T19:44:05.2174353Z 2025-05-07T19:44:05.2174363Z 2025-05-07T19:44:05.2174409Z 2025-05-07T19:44:05.2391369Z libgcc-ng-15.1.0 | 34 KB | ########## | 100%  2025-05-07T19:44:05.2392347Z 2025-05-07T19:44:05.2392360Z 2025-05-07T19:44:05.2392371Z 2025-05-07T19:44:05.2392381Z 2025-05-07T19:44:05.2392393Z 2025-05-07T19:44:05.2392404Z 2025-05-07T19:44:05.2393191Z pycparser-2.22 | 108 KB | ########## | 100%  2025-05-07T19:44:05.2394056Z 2025-05-07T19:44:05.2394067Z 2025-05-07T19:44:05.2394077Z 2025-05-07T19:44:05.2394087Z 2025-05-07T19:44:05.2394097Z 2025-05-07T19:44:05.2394107Z 2025-05-07T19:44:05.2566866Z pycparser-2.22 | 108 KB | ########## | 100%  2025-05-07T19:44:05.2567776Z 2025-05-07T19:44:05.2568501Z cryptography-44.0.3 | 1.5 MB | ########## | 100%  2025-05-07T19:44:05.2569275Z 2025-05-07T19:44:05.3017419Z cryptography-44.0.3 | 1.5 MB | ########## | 100%  2025-05-07T19:44:05.3017890Z openssl-3.5.0 | 3.0 MB | ########## | 100% 2025-05-07T19:44:05.3023366Z openssl-3.5.0 | 3.0 MB | ########## | 100% 2025-05-07T19:44:05.3024228Z 2025-05-07T19:44:05.3024738Z 2025-05-07T19:44:05.3025025Z  2025-05-07T19:44:05.3025284Z 2025-05-07T19:44:05.3025288Z 2025-05-07T19:44:05.3025745Z  2025-05-07T19:44:05.3025995Z 2025-05-07T19:44:05.3025999Z 2025-05-07T19:44:05.3026003Z 2025-05-07T19:44:05.3026224Z  2025-05-07T19:44:05.3026455Z 2025-05-07T19:44:05.3026458Z 2025-05-07T19:44:05.3026462Z 2025-05-07T19:44:05.3026465Z 2025-05-07T19:44:05.3026679Z  2025-05-07T19:44:05.3026972Z 2025-05-07T19:44:05.3026976Z 2025-05-07T19:44:05.3026981Z 2025-05-07T19:44:05.3026986Z 2025-05-07T19:44:05.3026990Z 2025-05-07T19:44:05.3027251Z  2025-05-07T19:44:05.3027794Z 2025-05-07T19:44:05.3027798Z 2025-05-07T19:44:05.3027802Z 2025-05-07T19:44:05.3027805Z 2025-05-07T19:44:05.3027809Z 2025-05-07T19:44:05.3027812Z 2025-05-07T19:44:05.3028021Z  2025-05-07T19:44:05.3028258Z 2025-05-07T19:44:05.3028261Z 2025-05-07T19:44:05.3028299Z 2025-05-07T19:44:05.3028303Z 2025-05-07T19:44:05.3028306Z 2025-05-07T19:44:05.3028310Z 2025-05-07T19:44:05.3028313Z 2025-05-07T19:44:05.3028726Z  2025-05-07T19:44:05.3028962Z 2025-05-07T19:44:05.3028966Z 2025-05-07T19:44:05.3028969Z 2025-05-07T19:44:05.3028973Z 2025-05-07T19:44:05.3028976Z 2025-05-07T19:44:05.3029008Z 2025-05-07T19:44:05.3029011Z 2025-05-07T19:44:05.3029014Z 2025-05-07T19:44:05.3029216Z  2025-05-07T19:44:05.3029457Z 2025-05-07T19:44:05.3029468Z 2025-05-07T19:44:05.3029471Z 2025-05-07T19:44:05.3029474Z 2025-05-07T19:44:05.3029478Z 2025-05-07T19:44:05.3029481Z 2025-05-07T19:44:05.3029484Z 2025-05-07T19:44:05.3029514Z 2025-05-07T19:44:05.3029518Z 2025-05-07T19:44:05.3029738Z  done 2025-05-07T19:44:05.4036523Z Preparing transaction: - done 2025-05-07T19:44:05.5045845Z Verifying transaction: | done 2025-05-07T19:44:06.9073842Z Executing transaction: - \ | / - \ | / - \ | / - \ done 2025-05-07T19:44:07.0023052Z [SETUP] Testing pyOpenSSL import ... 2025-05-07T19:44:08.6818133Z [CHECK] Python (sub-)package 'OpenSSL' found ... 2025-05-07T19:44:08.6837767Z [SETUP] Installing libxcrypt ... 2025-05-07T19:44:08.6865326Z [EXEC] [ATTEMPT 0/3] + conda install -n build_binary -c conda-forge --override-channels -y libxcrypt 2025-05-07T19:44:09.3506119Z Channels: 2025-05-07T19:44:09.3506486Z - conda-forge 2025-05-07T19:44:09.3506778Z Platform: linux-64 2025-05-07T19:44:12.4805753Z Collecting package metadata (repodata.json): - \ | / done 2025-05-07T19:44:12.9034943Z Solving environment: \ done 2025-05-07T19:44:12.9521305Z 2025-05-07T19:44:12.9521846Z ## Package Plan ## 2025-05-07T19:44:12.9522299Z 2025-05-07T19:44:12.9522910Z environment location: /github/home/miniconda/envs/build_binary 2025-05-07T19:44:12.9523876Z 2025-05-07T19:44:12.9524145Z added / updated specs: 2025-05-07T19:44:12.9524854Z - libxcrypt 2025-05-07T19:44:12.9525225Z 2025-05-07T19:44:12.9525264Z 2025-05-07T19:44:12.9525604Z The following packages will be downloaded: 2025-05-07T19:44:12.9526276Z 2025-05-07T19:44:12.9526598Z package | build 2025-05-07T19:44:12.9527608Z ---------------------------|----------------- 2025-05-07T19:44:12.9528006Z libxcrypt-4.4.36 | hd590300_1 98 KB conda-forge 2025-05-07T19:44:12.9528657Z ------------------------------------------------------------ 2025-05-07T19:44:12.9529305Z Total: 98 KB 2025-05-07T19:44:12.9529559Z 2025-05-07T19:44:12.9529698Z The following NEW packages will be INSTALLED: 2025-05-07T19:44:12.9530033Z 2025-05-07T19:44:12.9530286Z libxcrypt conda-forge/linux-64::libxcrypt-4.4.36-hd590300_1 2025-05-07T19:44:12.9530622Z 2025-05-07T19:44:12.9530927Z 2025-05-07T19:44:12.9530932Z 2025-05-07T19:44:12.9531091Z Downloading and Extracting Packages: ...working... 2025-05-07T19:44:13.0935672Z libxcrypt-4.4.36 | 98 KB | | 0% 2025-05-07T19:44:13.0964162Z libxcrypt-4.4.36 | 98 KB | #6 | 16% 2025-05-07T19:44:13.1067842Z libxcrypt-4.4.36 | 98 KB | ########## | 100% 2025-05-07T19:44:13.1068704Z libxcrypt-4.4.36 | 98 KB | ########## | 100% 2025-05-07T19:44:13.1069625Z 2025-05-07T19:44:13.1070060Z done 2025-05-07T19:44:13.2081709Z Preparing transaction: / done 2025-05-07T19:44:13.3089403Z Verifying transaction: \ done 2025-05-07T19:44:13.4098971Z Executing transaction: / done 2025-05-07T19:44:16.7092128Z [SETUP] Copying over ... 2025-05-07T19:44:16.7092939Z + cp /github/home/miniconda/envs/build_binary/include/crypt.h /github/home/miniconda/envs/build_binary/include/python3.13/crypt.h 2025-05-07T19:44:16.7093563Z 2025-05-07T19:44:16.7136564Z 2025-05-07T19:44:18.3127865Z [SETUP] Installed Python version: Python 3.13.2 2025-05-07T19:44:18.3129631Z [SETUP] Successfully created Conda environment: build_binary 2025-05-07T19:44:18.3205158Z ##[group]Run . $PRELUDE; install_cxx_compiler $BUILD_ENV gcc 2025-05-07T19:44:18.3205668Z . $PRELUDE; install_cxx_compiler $BUILD_ENV gcc 2025-05-07T19:44:18.3206301Z shell: bash --noprofile --norc -e -o pipefail {0} 2025-05-07T19:44:18.3206674Z env: 2025-05-07T19:44:18.3206930Z PRELUDE: .github/scripts/setup_env.bash 2025-05-07T19:44:18.3207293Z BUILD_ENV: build_binary 2025-05-07T19:44:18.3207554Z BUILD_TARGET: default 2025-05-07T19:44:18.3207848Z BUILD_VARIANT: cuda 2025-05-07T19:44:18.3208099Z BUILD_CUDA_VERSION: 12.6.3 2025-05-07T19:44:18.3208391Z ##[endgroup] 2025-05-07T19:44:18.7274546Z ################################################################################ 2025-05-07T19:44:18.7274964Z # Install C/C++ Compilers 2025-05-07T19:44:18.7275268Z # 2025-05-07T19:44:18.7296159Z # [2025-05-07T19:44:18.729Z] + install_cxx_compiler build_binary gcc 2025-05-07T19:44:18.7297585Z ################################################################################ 2025-05-07T19:44:18.7298261Z 2025-05-07T19:44:18.7314154Z [EXEC] [ATTEMPT 0/3] + wget -q --timeout 1 pypi.org -O /dev/null 2025-05-07T19:44:18.8129637Z [CHECK] Network does not appear to be blocked. 2025-05-07T19:44:18.8134360Z [INSTALL] Installing GLIBC (architecture = 64) ... 2025-05-07T19:44:18.8158368Z [EXEC] [ATTEMPT 0/3] + conda install -n build_binary -c conda-forge --override-channels -y sysroot_linux-64=2.17 2025-05-07T19:44:19.4809676Z Channels: 2025-05-07T19:44:19.4810604Z - conda-forge 2025-05-07T19:44:19.4811265Z Platform: linux-64 2025-05-07T19:44:22.6311880Z Collecting package metadata (repodata.json): - \ | / done 2025-05-07T19:44:23.0558121Z Solving environment: \ done 2025-05-07T19:44:23.1033051Z 2025-05-07T19:44:23.1033643Z ## Package Plan ## 2025-05-07T19:44:23.1034106Z 2025-05-07T19:44:23.1034699Z environment location: /github/home/miniconda/envs/build_binary 2025-05-07T19:44:23.1035635Z 2025-05-07T19:44:23.1035913Z added / updated specs: 2025-05-07T19:44:23.1036720Z - sysroot_linux-64=2.17 2025-05-07T19:44:23.1037216Z 2025-05-07T19:44:23.1037256Z 2025-05-07T19:44:23.1037614Z The following packages will be downloaded: 2025-05-07T19:44:23.1038292Z 2025-05-07T19:44:23.1038619Z package | build 2025-05-07T19:44:23.1039364Z ---------------------------|----------------- 2025-05-07T19:44:23.1039833Z kernel-headers_linux-64-3.10.0| he073ed8_18 921 KB conda-forge 2025-05-07T19:44:23.1040395Z sysroot_linux-64-2.17 | h0157908_18 14.5 MB conda-forge 2025-05-07T19:44:23.1040836Z ------------------------------------------------------------ 2025-05-07T19:44:23.1041320Z Total: 15.4 MB 2025-05-07T19:44:23.1041538Z 2025-05-07T19:44:23.1041668Z The following NEW packages will be INSTALLED: 2025-05-07T19:44:23.1041921Z 2025-05-07T19:44:23.1042218Z kernel-headers_li~ conda-forge/noarch::kernel-headers_linux-64-3.10.0-he073ed8_18 2025-05-07T19:44:23.1042827Z sysroot_linux-64 conda-forge/noarch::sysroot_linux-64-2.17-h0157908_18 2025-05-07T19:44:23.1043140Z 2025-05-07T19:44:23.1043143Z 2025-05-07T19:44:23.1043146Z 2025-05-07T19:44:23.1043292Z Downloading and Extracting Packages: ...working... 2025-05-07T19:44:23.1043696Z sysroot_linux-64-2.1 | 14.5 MB | | 0% 2025-05-07T19:44:23.1043931Z 2025-05-07T19:44:23.2912726Z kernel-headers_linux | 921 KB | | 0%  2025-05-07T19:44:23.2914144Z 2025-05-07T19:44:23.3024502Z kernel-headers_linux | 921 KB | 1 | 2%  2025-05-07T19:44:23.3024822Z 2025-05-07T19:44:23.3132676Z kernel-headers_linux | 921 KB | ########## | 100%  2025-05-07T19:44:23.4144249Z sysroot_linux-64-2.1 | 14.5 MB | | 0% 2025-05-07T19:44:23.4816319Z sysroot_linux-64-2.1 | 14.5 MB | ########3 | 83% 2025-05-07T19:44:23.4816820Z 2025-05-07T19:44:23.4817420Z kernel-headers_linux | 921 KB | ########## | 100%  2025-05-07T19:44:23.4817754Z 2025-05-07T19:44:23.4864236Z kernel-headers_linux | 921 KB | ########## | 100%  2025-05-07T19:44:23.9234346Z sysroot_linux-64-2.1 | 14.5 MB | ########## | 100% 2025-05-07T19:44:23.9237820Z sysroot_linux-64-2.1 | 14.5 MB | ########## | 100% 2025-05-07T19:44:23.9238746Z 2025-05-07T19:44:23.9239015Z 2025-05-07T19:44:23.9240419Z  done 2025-05-07T19:44:24.0251093Z Preparing transaction: / done 2025-05-07T19:44:24.2261892Z Verifying transaction: \ | done 2025-05-07T19:44:24.3271164Z Executing transaction: - done 2025-05-07T19:44:24.4121468Z [CHECK] LD_LIBRARY_PATH = 2025-05-07T19:44:24.4122349Z [CHECK] CONDA_PREFIX is not set. 2025-05-07T19:44:26.0389042Z [CHECK] libstdc++.so.6 found in CONDA_PREFIX PATH (symbolic link): /github/home/miniconda/envs/build_binary/lib/libstdc++.so.6 2025-05-07T19:44:26.0408937Z [INSTALL] Installing GCC (11.4.0, 64) through Conda ... 2025-05-07T19:44:26.0433079Z [EXEC] [ATTEMPT 0/3] + conda install -n build_binary -c conda-forge --override-channels -y gxx_linux-64=11.4.0 2025-05-07T19:44:26.7271477Z Channels: 2025-05-07T19:44:26.7272098Z - conda-forge 2025-05-07T19:44:26.7272750Z Platform: linux-64 2025-05-07T19:44:29.8253202Z Collecting package metadata (repodata.json): - \ | / done 2025-05-07T19:44:30.9555961Z Solving environment: \ | / done 2025-05-07T19:44:31.0053774Z 2025-05-07T19:44:31.0054727Z ## Package Plan ## 2025-05-07T19:44:31.0055026Z 2025-05-07T19:44:31.0055273Z environment location: /github/home/miniconda/envs/build_binary 2025-05-07T19:44:31.0055651Z 2025-05-07T19:44:31.0055779Z added / updated specs: 2025-05-07T19:44:31.0056085Z - gxx_linux-64=11.4.0 2025-05-07T19:44:31.0056292Z 2025-05-07T19:44:31.0056296Z 2025-05-07T19:44:31.0056438Z The following packages will be downloaded: 2025-05-07T19:44:31.0056679Z 2025-05-07T19:44:31.0056834Z package | build 2025-05-07T19:44:31.0057209Z ---------------------------|----------------- 2025-05-07T19:44:31.0057672Z binutils_impl_linux-64-2.40| ha1999f0_7 6.0 MB conda-forge 2025-05-07T19:44:31.0058197Z binutils_linux-64-2.40 | hb3c18ed_4 28 KB conda-forge 2025-05-07T19:44:31.0058730Z gcc_impl_linux-64-11.4.0 | h00c12a0_13 53.0 MB conda-forge 2025-05-07T19:44:31.0059217Z gcc_linux-64-11.4.0 | ha077dfb_4 31 KB conda-forge 2025-05-07T19:44:31.0059746Z gxx_impl_linux-64-11.4.0 | h634f3ee_13 11.2 MB conda-forge 2025-05-07T19:44:31.0060256Z gxx_linux-64-11.4.0 | h35bfe5d_4 29 KB conda-forge 2025-05-07T19:44:31.0060810Z ld_impl_linux-64-2.40 | hf3520f5_7 691 KB conda-forge 2025-05-07T19:44:31.0061341Z libgcc-devel_linux-64-11.4.0| h8f596e0_113 2.3 MB conda-forge 2025-05-07T19:44:31.0061858Z libsanitizer-11.4.0 | h5763a12_13 3.5 MB conda-forge 2025-05-07T19:44:31.0062375Z libstdcxx-15.1.0 | h8f9b012_2 3.7 MB conda-forge 2025-05-07T19:44:31.0062889Z libstdcxx-devel_linux-64-11.4.0| h8f596e0_113 11.1 MB conda-forge 2025-05-07T19:44:31.0063437Z libstdcxx-ng-15.1.0 | h4852527_2 34 KB conda-forge 2025-05-07T19:44:31.0063906Z ------------------------------------------------------------ 2025-05-07T19:44:31.0064275Z Total: 91.6 MB 2025-05-07T19:44:31.0064911Z 2025-05-07T19:44:31.0065054Z The following NEW packages will be INSTALLED: 2025-05-07T19:44:31.0065294Z 2025-05-07T19:44:31.0065615Z binutils_impl_lin~ conda-forge/linux-64::binutils_impl_linux-64-2.40-ha1999f0_7 2025-05-07T19:44:31.0066257Z binutils_linux-64 conda-forge/linux-64::binutils_linux-64-2.40-hb3c18ed_4 2025-05-07T19:44:31.0066871Z gcc_impl_linux-64 conda-forge/linux-64::gcc_impl_linux-64-11.4.0-h00c12a0_13 2025-05-07T19:44:31.0067601Z gcc_linux-64 conda-forge/linux-64::gcc_linux-64-11.4.0-ha077dfb_4 2025-05-07T19:44:31.0068182Z gxx_impl_linux-64 conda-forge/linux-64::gxx_impl_linux-64-11.4.0-h634f3ee_13 2025-05-07T19:44:31.0068734Z gxx_linux-64 conda-forge/linux-64::gxx_linux-64-11.4.0-h35bfe5d_4 2025-05-07T19:44:31.0069339Z libgcc-devel_linu~ conda-forge/noarch::libgcc-devel_linux-64-11.4.0-h8f596e0_113 2025-05-07T19:44:31.0070006Z libsanitizer conda-forge/linux-64::libsanitizer-11.4.0-h5763a12_13 2025-05-07T19:44:31.0070550Z libstdcxx conda-forge/linux-64::libstdcxx-15.1.0-h8f9b012_2 2025-05-07T19:44:31.0071171Z libstdcxx-devel_l~ conda-forge/noarch::libstdcxx-devel_linux-64-11.4.0-h8f596e0_113 2025-05-07T19:44:31.0071568Z 2025-05-07T19:44:31.0071729Z The following packages will be UPDATED: 2025-05-07T19:44:31.0071956Z 2025-05-07T19:44:31.0072296Z ld_impl_linux-64 pkgs/main::ld_impl_linux-64-2.40-h12e~ --> conda-forge::ld_impl_linux-64-2.40-hf3520f5_7 2025-05-07T19:44:31.0073118Z libstdcxx-ng pkgs/main::libstdcxx-ng-11.2.0-h12345~ --> conda-forge::libstdcxx-ng-15.1.0-h4852527_2 2025-05-07T19:44:31.0073567Z 2025-05-07T19:44:31.0073571Z 2025-05-07T19:44:31.0073576Z 2025-05-07T19:44:31.0073763Z Downloading and Extracting Packages: ...working... 2025-05-07T19:44:31.0074170Z gcc_impl_linux-64-11 | 53.0 MB | | 0% 2025-05-07T19:44:31.0074425Z 2025-05-07T19:44:31.0074904Z gxx_impl_linux-64-11 | 11.2 MB | | 0%  2025-05-07T19:44:31.0075170Z 2025-05-07T19:44:31.0075174Z 2025-05-07T19:44:31.0076970Z libstdcxx-devel_linu | 11.1 MB | | 0%  2025-05-07T19:44:31.0077290Z 2025-05-07T19:44:31.0077293Z 2025-05-07T19:44:31.0077297Z 2025-05-07T19:44:31.0085695Z binutils_impl_linux- | 6.0 MB | | 0%  2025-05-07T19:44:31.0086122Z 2025-05-07T19:44:31.0086247Z 2025-05-07T19:44:31.0086250Z 2025-05-07T19:44:31.0086512Z 2025-05-07T19:44:31.0101082Z libstdcxx-15.1.0 | 3.7 MB | | 0%  2025-05-07T19:44:31.0101374Z 2025-05-07T19:44:31.0101387Z 2025-05-07T19:44:31.0101461Z 2025-05-07T19:44:31.0101537Z 2025-05-07T19:44:31.0101784Z 2025-05-07T19:44:31.0102250Z libsanitizer-11.4.0 | 3.5 MB | | 0%  2025-05-07T19:44:31.0102557Z 2025-05-07T19:44:31.0102561Z 2025-05-07T19:44:31.0102564Z 2025-05-07T19:44:31.0102568Z 2025-05-07T19:44:31.0102571Z 2025-05-07T19:44:31.0102578Z 2025-05-07T19:44:31.0104999Z libgcc-devel_linux-6 | 2.3 MB | | 0%  2025-05-07T19:44:31.0106006Z 2025-05-07T19:44:31.0106019Z 2025-05-07T19:44:31.0106057Z 2025-05-07T19:44:31.0106070Z 2025-05-07T19:44:31.0106082Z 2025-05-07T19:44:31.0106094Z 2025-05-07T19:44:31.0106129Z 2025-05-07T19:44:31.0128924Z ld_impl_linux-64-2.4 | 691 KB | | 0%  2025-05-07T19:44:31.0129714Z 2025-05-07T19:44:31.0129741Z 2025-05-07T19:44:31.0129747Z 2025-05-07T19:44:31.0129752Z 2025-05-07T19:44:31.0129757Z 2025-05-07T19:44:31.0129761Z 2025-05-07T19:44:31.0129767Z 2025-05-07T19:44:31.0129807Z 2025-05-07T19:44:31.0130249Z libstdcxx-ng-15.1.0 | 34 KB | | 0%  2025-05-07T19:44:31.0130559Z 2025-05-07T19:44:31.0130563Z 2025-05-07T19:44:31.0130566Z 2025-05-07T19:44:31.0130594Z 2025-05-07T19:44:31.0130597Z 2025-05-07T19:44:31.0130601Z 2025-05-07T19:44:31.0130604Z 2025-05-07T19:44:31.0130608Z 2025-05-07T19:44:31.0130611Z 2025-05-07T19:44:31.0142775Z gcc_linux-64-11.4.0 | 31 KB | | 0%  2025-05-07T19:44:31.0144243Z 2025-05-07T19:44:31.0144256Z 2025-05-07T19:44:31.0144295Z 2025-05-07T19:44:31.0144305Z 2025-05-07T19:44:31.0144316Z 2025-05-07T19:44:31.0144326Z 2025-05-07T19:44:31.0144337Z 2025-05-07T19:44:31.0144347Z 2025-05-07T19:44:31.0144358Z 2025-05-07T19:44:31.0144368Z 2025-05-07T19:44:31.0145741Z gxx_linux-64-11.4.0 | 29 KB | | 0%  2025-05-07T19:44:31.0146594Z 2025-05-07T19:44:31.0146629Z 2025-05-07T19:44:31.0146669Z 2025-05-07T19:44:31.0148460Z 2025-05-07T19:44:31.0148466Z 2025-05-07T19:44:31.0148470Z 2025-05-07T19:44:31.0148473Z 2025-05-07T19:44:31.0148476Z 2025-05-07T19:44:31.0148480Z 2025-05-07T19:44:31.0148483Z 2025-05-07T19:44:31.0148486Z 2025-05-07T19:44:31.1086025Z binutils_linux-64-2. | 28 KB | | 0%  2025-05-07T19:44:31.1087025Z 2025-05-07T19:44:31.1087038Z 2025-05-07T19:44:31.1087048Z 2025-05-07T19:44:31.1428908Z binutils_impl_linux- | 6.0 MB | 9 | 10%  2025-05-07T19:44:31.1429235Z 2025-05-07T19:44:31.1429460Z 2025-05-07T19:44:31.1429475Z 2025-05-07T19:44:31.1481478Z 2025-05-07T19:44:31.2678093Z libstdcxx-15.1.0 | 3.7 MB | 1 | 2%  2025-05-07T19:44:31.2678403Z 2025-05-07T19:44:31.2678610Z 2025-05-07T19:44:31.2678623Z 2025-05-07T19:44:31.2678632Z 2025-05-07T19:44:31.2739638Z libstdcxx-15.1.0 | 3.7 MB | 3 | 4%  2025-05-07T19:44:31.2740128Z 2025-05-07T19:44:31.2740137Z 2025-05-07T19:44:31.2740186Z 2025-05-07T19:44:31.3117013Z binutils_impl_linux- | 6.0 MB | #9 | 19%  2025-05-07T19:44:31.3117536Z 2025-05-07T19:44:31.3117546Z 2025-05-07T19:44:31.3264681Z libstdcxx-devel_linu | 11.1 MB | | 0%  2025-05-07T19:44:31.3265190Z 2025-05-07T19:44:31.3369378Z gxx_impl_linux-64-11 | 11.2 MB | | 0%  2025-05-07T19:44:31.3369673Z 2025-05-07T19:44:31.3369813Z 2025-05-07T19:44:31.3369820Z 2025-05-07T19:44:31.3369857Z 2025-05-07T19:44:31.3501358Z libstdcxx-15.1.0 | 3.7 MB | ########## | 100%  2025-05-07T19:44:31.3501725Z 2025-05-07T19:44:31.3501731Z 2025-05-07T19:44:31.3501736Z 2025-05-07T19:44:31.3618726Z binutils_impl_linux- | 6.0 MB | ########## | 100%  2025-05-07T19:44:31.3701885Z gcc_impl_linux-64-11 | 53.0 MB | | 0% 2025-05-07T19:44:31.3702203Z 2025-05-07T19:44:31.3702209Z 2025-05-07T19:44:31.3702214Z 2025-05-07T19:44:31.3702218Z 2025-05-07T19:44:31.3702224Z 2025-05-07T19:44:31.3876481Z libsanitizer-11.4.0 | 3.5 MB | | 0%  2025-05-07T19:44:31.3876809Z 2025-05-07T19:44:31.3876840Z 2025-05-07T19:44:31.3876844Z 2025-05-07T19:44:31.3876978Z 2025-05-07T19:44:31.3876993Z 2025-05-07T19:44:31.3877000Z 2025-05-07T19:44:31.4118674Z libgcc-devel_linux-6 | 2.3 MB | | 1%  2025-05-07T19:44:31.4119018Z 2025-05-07T19:44:31.4119151Z 2025-05-07T19:44:31.4269701Z libstdcxx-devel_linu | 11.1 MB | ########8 | 89%  2025-05-07T19:44:31.4270039Z 2025-05-07T19:44:31.4617497Z gxx_impl_linux-64-11 | 11.2 MB | ##1 | 22%  2025-05-07T19:44:31.4617796Z 2025-05-07T19:44:31.4617801Z 2025-05-07T19:44:31.4617805Z 2025-05-07T19:44:31.4617814Z 2025-05-07T19:44:31.4617819Z 2025-05-07T19:44:31.4618214Z 2025-05-07T19:44:31.4621335Z libgcc-devel_linux-6 | 2.3 MB | ########## | 100%  2025-05-07T19:44:31.4831442Z gcc_impl_linux-64-11 | 53.0 MB | #3 | 13% 2025-05-07T19:44:31.4831729Z 2025-05-07T19:44:31.4831763Z 2025-05-07T19:44:31.4831781Z 2025-05-07T19:44:31.4831786Z 2025-05-07T19:44:31.4831791Z 2025-05-07T19:44:31.4832169Z libsanitizer-11.4.0 | 3.5 MB | ########## | 100%  2025-05-07T19:44:31.4832472Z 2025-05-07T19:44:31.4832476Z 2025-05-07T19:44:31.4832480Z 2025-05-07T19:44:31.4832484Z 2025-05-07T19:44:31.4832487Z 2025-05-07T19:44:31.5127461Z libsanitizer-11.4.0 | 3.5 MB | ########## | 100%  2025-05-07T19:44:31.5128118Z 2025-05-07T19:44:31.5128123Z 2025-05-07T19:44:31.5128126Z 2025-05-07T19:44:31.5128130Z 2025-05-07T19:44:31.5128134Z 2025-05-07T19:44:31.5128137Z 2025-05-07T19:44:31.5128141Z 2025-05-07T19:44:31.5270363Z ld_impl_linux-64-2.4 | 691 KB | 2 | 2%  2025-05-07T19:44:31.5270706Z 2025-05-07T19:44:31.5297674Z gxx_impl_linux-64-11 | 11.2 MB | #####6 | 57%  2025-05-07T19:44:31.5298491Z 2025-05-07T19:44:31.5298505Z 2025-05-07T19:44:31.5298519Z 2025-05-07T19:44:31.5298551Z 2025-05-07T19:44:31.5299031Z 2025-05-07T19:44:31.5299038Z 2025-05-07T19:44:31.5299043Z 2025-05-07T19:44:31.5334223Z ld_impl_linux-64-2.4 | 691 KB | ########## | 100%  2025-05-07T19:44:31.5334537Z 2025-05-07T19:44:31.5334541Z 2025-05-07T19:44:31.5334545Z 2025-05-07T19:44:31.5334551Z 2025-05-07T19:44:31.5334555Z 2025-05-07T19:44:31.5334574Z 2025-05-07T19:44:31.5334579Z 2025-05-07T19:44:31.5334588Z 2025-05-07T19:44:31.5365598Z libstdcxx-ng-15.1.0 | 34 KB | ####7 | 47%  2025-05-07T19:44:31.5366116Z 2025-05-07T19:44:31.5366205Z 2025-05-07T19:44:31.5366211Z 2025-05-07T19:44:31.5366277Z 2025-05-07T19:44:31.5366420Z 2025-05-07T19:44:31.5366427Z 2025-05-07T19:44:31.5366489Z 2025-05-07T19:44:31.5366496Z 2025-05-07T19:44:31.5381613Z libstdcxx-ng-15.1.0 | 34 KB | ########## | 100%  2025-05-07T19:44:31.5381966Z 2025-05-07T19:44:31.5381971Z 2025-05-07T19:44:31.5381975Z 2025-05-07T19:44:31.5382420Z 2025-05-07T19:44:31.5386905Z libstdcxx-15.1.0 | 3.7 MB | ########## | 100%  2025-05-07T19:44:31.5387238Z 2025-05-07T19:44:31.5387243Z 2025-05-07T19:44:31.5387248Z 2025-05-07T19:44:31.5387261Z 2025-05-07T19:44:31.5627145Z libstdcxx-15.1.0 | 3.7 MB | ########## | 100%  2025-05-07T19:44:31.5631906Z gcc_impl_linux-64-11 | 53.0 MB | ###1 | 32% 2025-05-07T19:44:31.5632160Z 2025-05-07T19:44:31.5632887Z 2025-05-07T19:44:31.5668586Z libstdcxx-devel_linu | 11.1 MB | ########## | 100%  2025-05-07T19:44:31.5668958Z 2025-05-07T19:44:31.5668962Z 2025-05-07T19:44:31.5668966Z 2025-05-07T19:44:31.5668969Z 2025-05-07T19:44:31.5668973Z 2025-05-07T19:44:31.5669000Z 2025-05-07T19:44:31.5669003Z 2025-05-07T19:44:31.5669007Z 2025-05-07T19:44:31.5669010Z 2025-05-07T19:44:31.5686748Z gcc_linux-64-11.4.0 | 31 KB | #####2 | 52%  2025-05-07T19:44:31.5687120Z 2025-05-07T19:44:31.5687127Z 2025-05-07T19:44:31.5687133Z 2025-05-07T19:44:31.5687138Z 2025-05-07T19:44:31.5687172Z 2025-05-07T19:44:31.5687177Z 2025-05-07T19:44:31.5687182Z 2025-05-07T19:44:31.5687187Z 2025-05-07T19:44:31.5687192Z 2025-05-07T19:44:31.5751966Z gcc_linux-64-11.4.0 | 31 KB | ########## | 100%  2025-05-07T19:44:31.5752296Z 2025-05-07T19:44:31.5752545Z 2025-05-07T19:44:31.5752557Z 2025-05-07T19:44:31.5752564Z 2025-05-07T19:44:31.5752609Z 2025-05-07T19:44:31.5752615Z 2025-05-07T19:44:31.5752621Z 2025-05-07T19:44:31.5752647Z 2025-05-07T19:44:31.5752653Z 2025-05-07T19:44:31.5752669Z 2025-05-07T19:44:31.5763155Z gxx_linux-64-11.4.0 | 29 KB | #####5 | 55%  2025-05-07T19:44:31.5763535Z 2025-05-07T19:44:31.5763539Z 2025-05-07T19:44:31.5763569Z 2025-05-07T19:44:31.5763573Z 2025-05-07T19:44:31.5763580Z 2025-05-07T19:44:31.5763585Z 2025-05-07T19:44:31.5763590Z 2025-05-07T19:44:31.5763595Z 2025-05-07T19:44:31.5763600Z 2025-05-07T19:44:31.5763993Z 2025-05-07T19:44:31.5875986Z gxx_linux-64-11.4.0 | 29 KB | ########## | 100%  2025-05-07T19:44:31.5876400Z 2025-05-07T19:44:31.5876515Z 2025-05-07T19:44:31.5876518Z 2025-05-07T19:44:31.5876561Z 2025-05-07T19:44:31.5876564Z 2025-05-07T19:44:31.5876581Z 2025-05-07T19:44:31.5876885Z libgcc-devel_linux-6 | 2.3 MB | ########## | 100%  2025-05-07T19:44:31.5877211Z 2025-05-07T19:44:31.5877214Z 2025-05-07T19:44:31.5877218Z 2025-05-07T19:44:31.5877222Z 2025-05-07T19:44:31.5877490Z 2025-05-07T19:44:31.5877494Z 2025-05-07T19:44:31.5972458Z libgcc-devel_linux-6 | 2.3 MB | ########## | 100%  2025-05-07T19:44:31.5972806Z 2025-05-07T19:44:31.5972811Z 2025-05-07T19:44:31.5972814Z 2025-05-07T19:44:31.5972818Z 2025-05-07T19:44:31.5972821Z 2025-05-07T19:44:31.5972847Z 2025-05-07T19:44:31.5972851Z 2025-05-07T19:44:31.5972855Z 2025-05-07T19:44:31.5972858Z 2025-05-07T19:44:31.5972861Z 2025-05-07T19:44:31.5972865Z 2025-05-07T19:44:31.5994895Z binutils_linux-64-2. | 28 KB | #####6 | 56%  2025-05-07T19:44:31.5995940Z 2025-05-07T19:44:31.5995954Z 2025-05-07T19:44:31.5995965Z 2025-05-07T19:44:31.5995977Z 2025-05-07T19:44:31.5995988Z 2025-05-07T19:44:31.5995999Z 2025-05-07T19:44:31.5996010Z 2025-05-07T19:44:31.5996020Z 2025-05-07T19:44:31.5996030Z 2025-05-07T19:44:31.5996040Z 2025-05-07T19:44:31.5996051Z 2025-05-07T19:44:31.6171915Z binutils_linux-64-2. | 28 KB | ########## | 100%  2025-05-07T19:44:31.6172950Z 2025-05-07T19:44:31.6172963Z 2025-05-07T19:44:31.6172974Z 2025-05-07T19:44:31.6172985Z 2025-05-07T19:44:31.6172996Z 2025-05-07T19:44:31.6173006Z 2025-05-07T19:44:31.6173016Z 2025-05-07T19:44:31.6173780Z ld_impl_linux-64-2.4 | 691 KB | ########## | 100%  2025-05-07T19:44:31.6174647Z 2025-05-07T19:44:31.6174659Z 2025-05-07T19:44:31.6174669Z 2025-05-07T19:44:31.6174680Z 2025-05-07T19:44:31.6174690Z 2025-05-07T19:44:31.6174701Z 2025-05-07T19:44:31.6174711Z 2025-05-07T19:44:31.6496117Z ld_impl_linux-64-2.4 | 691 KB | ########## | 100%  2025-05-07T19:44:31.6497091Z 2025-05-07T19:44:31.6497104Z 2025-05-07T19:44:31.6497114Z 2025-05-07T19:44:31.6497125Z 2025-05-07T19:44:31.6497135Z 2025-05-07T19:44:31.6497146Z 2025-05-07T19:44:31.6497156Z 2025-05-07T19:44:31.6497166Z 2025-05-07T19:44:31.6497945Z libstdcxx-ng-15.1.0 | 34 KB | ########## | 100%  2025-05-07T19:44:31.6498960Z 2025-05-07T19:44:31.6498973Z 2025-05-07T19:44:31.6498978Z 2025-05-07T19:44:31.6498981Z 2025-05-07T19:44:31.6498984Z 2025-05-07T19:44:31.6498988Z 2025-05-07T19:44:31.6498991Z 2025-05-07T19:44:31.6498995Z 2025-05-07T19:44:31.6659506Z libstdcxx-ng-15.1.0 | 34 KB | ########## | 100%  2025-05-07T19:44:31.6939352Z gcc_impl_linux-64-11 | 53.0 MB | ####4 | 45% 2025-05-07T19:44:31.6939627Z 2025-05-07T19:44:31.6939939Z gxx_impl_linux-64-11 | 11.2 MB | ########## | 100%  2025-05-07T19:44:31.6940195Z 2025-05-07T19:44:31.7088506Z gxx_impl_linux-64-11 | 11.2 MB | ########## | 100%  2025-05-07T19:44:31.7089339Z 2025-05-07T19:44:31.7089351Z 2025-05-07T19:44:31.7089362Z 2025-05-07T19:44:31.7089373Z 2025-05-07T19:44:31.7089383Z 2025-05-07T19:44:31.7370934Z libsanitizer-11.4.0 | 3.5 MB | ########## | 100%  2025-05-07T19:44:31.7371300Z 2025-05-07T19:44:31.7371304Z 2025-05-07T19:44:31.7371308Z 2025-05-07T19:44:31.7371311Z 2025-05-07T19:44:31.7371331Z 2025-05-07T19:44:31.7371335Z 2025-05-07T19:44:31.7371338Z 2025-05-07T19:44:31.7371342Z 2025-05-07T19:44:31.7371345Z 2025-05-07T19:44:31.7371610Z gcc_linux-64-11.4.0 | 31 KB | ########## | 100%  2025-05-07T19:44:31.7371913Z 2025-05-07T19:44:31.7371917Z 2025-05-07T19:44:31.7371920Z 2025-05-07T19:44:31.7371925Z 2025-05-07T19:44:31.7371928Z 2025-05-07T19:44:31.7371931Z 2025-05-07T19:44:31.7371935Z 2025-05-07T19:44:31.7371938Z 2025-05-07T19:44:31.7371941Z 2025-05-07T19:44:31.7645964Z gcc_linux-64-11.4.0 | 31 KB | ########## | 100%  2025-05-07T19:44:31.7646929Z 2025-05-07T19:44:31.7646942Z 2025-05-07T19:44:31.7646953Z 2025-05-07T19:44:31.7646964Z 2025-05-07T19:44:31.7646973Z 2025-05-07T19:44:31.7646983Z 2025-05-07T19:44:31.7646994Z 2025-05-07T19:44:31.7647005Z 2025-05-07T19:44:31.7647015Z 2025-05-07T19:44:31.7647025Z 2025-05-07T19:44:31.7647788Z gxx_linux-64-11.4.0 | 29 KB | ########## | 100%  2025-05-07T19:44:31.7649151Z 2025-05-07T19:44:31.7649162Z 2025-05-07T19:44:31.7649172Z 2025-05-07T19:44:31.7649183Z 2025-05-07T19:44:31.7649194Z 2025-05-07T19:44:31.7649203Z 2025-05-07T19:44:31.7649213Z 2025-05-07T19:44:31.7649223Z 2025-05-07T19:44:31.7649233Z 2025-05-07T19:44:31.7649244Z 2025-05-07T19:44:31.7660472Z gxx_linux-64-11.4.0 | 29 KB | ########## | 100%  2025-05-07T19:44:31.7768547Z gcc_impl_linux-64-11 | 53.0 MB | ######3 | 64% 2025-05-07T19:44:31.7768820Z 2025-05-07T19:44:31.7768825Z 2025-05-07T19:44:31.7769132Z 2025-05-07T19:44:31.7769472Z binutils_impl_linux- | 6.0 MB | ########## | 100%  2025-05-07T19:44:31.7769758Z 2025-05-07T19:44:31.7769762Z 2025-05-07T19:44:31.7769765Z 2025-05-07T19:44:31.7950374Z binutils_impl_linux- | 6.0 MB | ########## | 100%  2025-05-07T19:44:31.7950707Z 2025-05-07T19:44:31.7950711Z 2025-05-07T19:44:31.7950715Z 2025-05-07T19:44:31.7950718Z 2025-05-07T19:44:31.7950722Z 2025-05-07T19:44:31.7950737Z 2025-05-07T19:44:31.7950741Z 2025-05-07T19:44:31.7950744Z 2025-05-07T19:44:31.7950747Z 2025-05-07T19:44:31.7950751Z 2025-05-07T19:44:31.7950754Z 2025-05-07T19:44:31.7951068Z binutils_linux-64-2. | 28 KB | ########## | 100%  2025-05-07T19:44:31.7951366Z 2025-05-07T19:44:31.7951369Z 2025-05-07T19:44:31.7951374Z 2025-05-07T19:44:31.7951378Z 2025-05-07T19:44:31.7951383Z 2025-05-07T19:44:31.7951387Z 2025-05-07T19:44:31.7951392Z 2025-05-07T19:44:31.7951396Z 2025-05-07T19:44:31.7951400Z 2025-05-07T19:44:31.7951411Z 2025-05-07T19:44:31.7951425Z 2025-05-07T19:44:31.8661731Z binutils_linux-64-2. | 28 KB | ########## | 100%  2025-05-07T19:44:31.9752263Z gcc_impl_linux-64-11 | 53.0 MB | ########2 | 83% 2025-05-07T19:44:31.9753026Z 2025-05-07T19:44:32.0840509Z gxx_impl_linux-64-11 | 11.2 MB | ########## | 100%  2025-05-07T19:44:32.0840978Z 2025-05-07T19:44:32.0840982Z 2025-05-07T19:44:32.1012538Z libstdcxx-devel_linu | 11.1 MB | ########## | 100%  2025-05-07T19:44:32.1013122Z gcc_impl_linux-64-11 | 53.0 MB | ########## | 100% 2025-05-07T19:44:32.6300950Z gcc_impl_linux-64-11 | 53.0 MB | ########## | 100% 2025-05-07T19:44:32.6305137Z gcc_impl_linux-64-11 | 53.0 MB | ########## | 100% 2025-05-07T19:44:32.6306142Z 2025-05-07T19:44:32.6306758Z 2025-05-07T19:44:32.6307399Z  2025-05-07T19:44:32.6308010Z 2025-05-07T19:44:32.6308023Z 2025-05-07T19:44:32.6308567Z  2025-05-07T19:44:32.6309086Z 2025-05-07T19:44:32.6309092Z 2025-05-07T19:44:32.6309095Z 2025-05-07T19:44:32.6309266Z  2025-05-07T19:44:32.6309495Z 2025-05-07T19:44:32.6309499Z 2025-05-07T19:44:32.6309502Z 2025-05-07T19:44:32.6309508Z 2025-05-07T19:44:32.6309681Z  2025-05-07T19:44:32.6309917Z 2025-05-07T19:44:32.6309920Z 2025-05-07T19:44:32.6309923Z 2025-05-07T19:44:32.6309927Z 2025-05-07T19:44:32.6309947Z 2025-05-07T19:44:32.6310127Z  2025-05-07T19:44:32.6310341Z 2025-05-07T19:44:32.6310345Z 2025-05-07T19:44:32.6310348Z 2025-05-07T19:44:32.6310352Z 2025-05-07T19:44:32.6310355Z 2025-05-07T19:44:32.6310358Z 2025-05-07T19:44:32.6310602Z  2025-05-07T19:44:32.6310828Z 2025-05-07T19:44:32.6310832Z 2025-05-07T19:44:32.6310835Z 2025-05-07T19:44:32.6310838Z 2025-05-07T19:44:32.6310842Z 2025-05-07T19:44:32.6310845Z 2025-05-07T19:44:32.6310848Z 2025-05-07T19:44:32.6311045Z  2025-05-07T19:44:32.6311267Z 2025-05-07T19:44:32.6311270Z 2025-05-07T19:44:32.6311274Z 2025-05-07T19:44:32.6311277Z 2025-05-07T19:44:32.6311281Z 2025-05-07T19:44:32.6311534Z 2025-05-07T19:44:32.6311538Z 2025-05-07T19:44:32.6311541Z 2025-05-07T19:44:32.6311755Z  2025-05-07T19:44:32.6311978Z 2025-05-07T19:44:32.6311981Z 2025-05-07T19:44:32.6311985Z 2025-05-07T19:44:32.6311989Z 2025-05-07T19:44:32.6311992Z 2025-05-07T19:44:32.6311995Z 2025-05-07T19:44:32.6311999Z 2025-05-07T19:44:32.6312002Z 2025-05-07T19:44:32.6312005Z 2025-05-07T19:44:32.6312215Z  2025-05-07T19:44:32.6312670Z 2025-05-07T19:44:32.6312674Z 2025-05-07T19:44:32.6312677Z 2025-05-07T19:44:32.6312681Z 2025-05-07T19:44:32.6312684Z 2025-05-07T19:44:32.6312687Z 2025-05-07T19:44:32.6312690Z 2025-05-07T19:44:32.6312693Z 2025-05-07T19:44:32.6312696Z 2025-05-07T19:44:32.6312699Z 2025-05-07T19:44:32.6312908Z  2025-05-07T19:44:32.6313125Z 2025-05-07T19:44:32.6313133Z 2025-05-07T19:44:32.6313136Z 2025-05-07T19:44:32.6313140Z 2025-05-07T19:44:32.6313143Z 2025-05-07T19:44:32.6313146Z 2025-05-07T19:44:32.6313149Z 2025-05-07T19:44:32.6313152Z 2025-05-07T19:44:32.6313156Z 2025-05-07T19:44:32.6313159Z 2025-05-07T19:44:32.6313162Z 2025-05-07T19:44:32.6313374Z  done 2025-05-07T19:44:32.7319922Z Preparing transaction: \ done 2025-05-07T19:44:33.0332784Z Verifying transaction: / - \ done 2025-05-07T19:44:33.1348862Z Executing transaction: / done 2025-05-07T19:44:33.2257864Z [INSTALL] Setting the C/C++ compiler symlinks ... 2025-05-07T19:44:36.9801939Z + ln -sf /github/home/miniconda/envs/build_binary/bin/x86_64-conda-linux-gnu-cc /github/home/miniconda/envs/build_binary/bin/cc 2025-05-07T19:44:36.9802632Z 2025-05-07T19:44:36.9812571Z 2025-05-07T19:44:36.9830904Z + ln -sf /github/home/miniconda/envs/build_binary/bin/x86_64-conda-linux-gnu-cc /github/home/miniconda/envs/build_binary/bin/gcc 2025-05-07T19:44:36.9831549Z 2025-05-07T19:44:36.9845974Z 2025-05-07T19:44:36.9860417Z + ln -sf /github/home/miniconda/envs/build_binary/bin/x86_64-conda-linux-gnu-c++ /github/home/miniconda/envs/build_binary/bin/c++ 2025-05-07T19:44:36.9861022Z 2025-05-07T19:44:36.9876463Z 2025-05-07T19:44:36.9893264Z + ln -sf /github/home/miniconda/envs/build_binary/bin/x86_64-conda-linux-gnu-c++ /github/home/miniconda/envs/build_binary/bin/g++ 2025-05-07T19:44:36.9893945Z 2025-05-07T19:44:36.9907475Z 2025-05-07T19:44:38.7745065Z /github/home/miniconda/envs/build_binary/bin/cc 2025-05-07T19:44:38.7745852Z 2025-05-07T19:44:38.8324457Z [CHECK] Binary cc found in PATH 2025-05-07T19:44:40.6257475Z /github/home/miniconda/envs/build_binary/bin/gcc 2025-05-07T19:44:40.6257852Z 2025-05-07T19:44:40.7073232Z [CHECK] Binary gcc found in PATH 2025-05-07T19:44:42.5168313Z /github/home/miniconda/envs/build_binary/bin/c++ 2025-05-07T19:44:42.5168682Z 2025-05-07T19:44:42.5755438Z [CHECK] Binary c++ found in PATH 2025-05-07T19:44:44.3807582Z /github/home/miniconda/envs/build_binary/bin/g++ 2025-05-07T19:44:44.3807956Z 2025-05-07T19:44:44.4377897Z [CHECK] Binary g++ found in PATH 2025-05-07T19:44:44.4379167Z [INFO] Printing out all preprocessor defines in the C compiler ... 2025-05-07T19:44:44.4380436Z + conda run -n build_binary cc -dM -E - 2025-05-07T19:44:44.4380820Z 2025-05-07T19:44:46.2292066Z #define __DBL_MIN_EXP__ (-1021) 2025-05-07T19:44:46.2292517Z #define __UINT_LEAST16_MAX__ 0xffff 2025-05-07T19:44:46.2292879Z #define __ATOMIC_ACQUIRE 2 2025-05-07T19:44:46.2293217Z #define __FLT128_MAX_10_EXP__ 4932 2025-05-07T19:44:46.2293652Z #define __FLT_MIN__ 1.17549435082228750796873653722224568e-38F 2025-05-07T19:44:46.2294062Z #define __GCC_IEC_559_COMPLEX 2 2025-05-07T19:44:46.2294412Z #define __UINT_LEAST8_TYPE__ unsigned char 2025-05-07T19:44:46.2294756Z #define __SIZEOF_FLOAT80__ 16 2025-05-07T19:44:46.2295078Z #define __INTMAX_C(c) c ## L 2025-05-07T19:44:46.2295379Z #define __CHAR_BIT__ 8 2025-05-07T19:44:46.2297673Z #define __UINT8_MAX__ 0xff 2025-05-07T19:44:46.2297993Z #define __SCHAR_WIDTH__ 8 2025-05-07T19:44:46.2298279Z #define __WINT_MAX__ 0xffffffffU 2025-05-07T19:44:46.2298611Z #define __FLT32_MIN_EXP__ (-125) 2025-05-07T19:44:46.2298917Z #define __ORDER_LITTLE_ENDIAN__ 1234 2025-05-07T19:44:46.2299278Z #define __SIZE_MAX__ 0xffffffffffffffffUL 2025-05-07T19:44:46.2299613Z #define __WCHAR_MAX__ 0x7fffffff 2025-05-07T19:44:46.2299964Z #define __GCC_HAVE_SYNC_COMPARE_AND_SWAP_1 1 2025-05-07T19:44:46.2300500Z #define __GCC_HAVE_SYNC_COMPARE_AND_SWAP_2 1 2025-05-07T19:44:46.2300896Z #define __GCC_HAVE_SYNC_COMPARE_AND_SWAP_4 1 2025-05-07T19:44:46.2301374Z #define __DBL_DENORM_MIN__ ((double)4.94065645841246544176568792868221372e-324L) 2025-05-07T19:44:46.2301837Z #define __GCC_HAVE_SYNC_COMPARE_AND_SWAP_8 1 2025-05-07T19:44:46.2302219Z #define __GCC_ATOMIC_CHAR_LOCK_FREE 2 2025-05-07T19:44:46.2302531Z #define __GCC_IEC_559 2 2025-05-07T19:44:46.2302840Z #define __FLT32X_DECIMAL_DIG__ 17 2025-05-07T19:44:46.2303167Z #define __FLT_EVAL_METHOD__ 0 2025-05-07T19:44:46.2303488Z #define __FLT64_DECIMAL_DIG__ 17 2025-05-07T19:44:46.2303805Z #define __GCC_ATOMIC_CHAR32_T_LOCK_FREE 2 2025-05-07T19:44:46.2304197Z #define __UINT_FAST64_MAX__ 0xffffffffffffffffUL 2025-05-07T19:44:46.2304585Z #define __SIG_ATOMIC_TYPE__ int 2025-05-07T19:44:46.2304888Z #define __DBL_MIN_10_EXP__ (-307) 2025-05-07T19:44:46.2305228Z #define __FINITE_MATH_ONLY__ 0 2025-05-07T19:44:46.2305521Z #define __FLT32X_MAX_EXP__ 1024 2025-05-07T19:44:46.2305843Z #define __FLT32_HAS_DENORM__ 1 2025-05-07T19:44:46.2306141Z #define __UINT_FAST8_MAX__ 0xff 2025-05-07T19:44:46.2306451Z #define __FLT32_MAX_10_EXP__ 38 2025-05-07T19:44:46.2306743Z #define __DEC64_MAX_EXP__ 385 2025-05-07T19:44:46.2307052Z #define __INT8_C(c) c 2025-05-07T19:44:46.2307308Z #define __INT_LEAST8_WIDTH__ 8 2025-05-07T19:44:46.2307658Z #define __UINT_LEAST64_MAX__ 0xffffffffffffffffUL 2025-05-07T19:44:46.2308037Z #define __SHRT_MAX__ 0x7fff 2025-05-07T19:44:46.2308503Z #define __LDBL_MAX__ 1.18973149535723176502126385303097021e+4932L 2025-05-07T19:44:46.2308932Z #define __FLT64X_MAX_10_EXP__ 4932 2025-05-07T19:44:46.2309238Z #define __LDBL_IS_IEC_60559__ 2 2025-05-07T19:44:46.2309563Z #define __FLT64X_HAS_QUIET_NAN__ 1 2025-05-07T19:44:46.2309875Z #define __UINT_LEAST8_MAX__ 0xff 2025-05-07T19:44:46.2330949Z #define __GCC_ATOMIC_BOOL_LOCK_FREE 2 2025-05-07T19:44:46.2331623Z #define __FLT128_DENORM_MIN__ 6.47517511943802511092443895822764655e-4966F128 2025-05-07T19:44:46.2332136Z #define __UINTMAX_TYPE__ long unsigned int 2025-05-07T19:44:46.2332494Z #define __linux 1 2025-05-07T19:44:46.2332776Z #define __DEC32_EPSILON__ 1E-6DF 2025-05-07T19:44:46.2333089Z #define __FLT_EVAL_METHOD_TS_18661_3__ 0 2025-05-07T19:44:46.2333430Z #define __unix 1 2025-05-07T19:44:46.2333679Z #define __UINT32_MAX__ 0xffffffffU 2025-05-07T19:44:46.2334011Z #define __FLT128_MIN_EXP__ (-16381) 2025-05-07T19:44:46.2334313Z #define __WINT_MIN__ 0U 2025-05-07T19:44:46.2334610Z #define __FLT128_MIN_10_EXP__ (-4931) 2025-05-07T19:44:46.2334974Z #define __FLT32X_IS_IEC_60559__ 2 2025-05-07T19:44:46.2335285Z #define __INT_LEAST16_WIDTH__ 16 2025-05-07T19:44:46.2335606Z #define __SCHAR_MAX__ 0x7f 2025-05-07T19:44:46.2335891Z #define __FLT128_MANT_DIG__ 113 2025-05-07T19:44:46.2336248Z #define __WCHAR_MIN__ (-__WCHAR_MAX__ - 1) 2025-05-07T19:44:46.2336555Z #define __INT64_C(c) c ## L 2025-05-07T19:44:46.2336860Z #define __GCC_ATOMIC_POINTER_LOCK_FREE 2 2025-05-07T19:44:46.2337207Z #define __FLT32X_MANT_DIG__ 53 2025-05-07T19:44:46.2337513Z #define __USER_LABEL_PREFIX__ 2025-05-07T19:44:46.2337927Z #define __FLT64X_EPSILON__ 1.08420217248550443400745280086994171e-19F64x 2025-05-07T19:44:46.2338339Z #define __STDC_HOSTED__ 1 2025-05-07T19:44:46.2338646Z #define __DEC64_MIN_EXP__ (-382) 2025-05-07T19:44:46.2338933Z #define __DBL_DIG__ 15 2025-05-07T19:44:46.2339209Z #define __FLT32_DIG__ 6 2025-05-07T19:44:46.2339536Z #define __FLT_EPSILON__ 1.19209289550781250000000000000000000e-7F 2025-05-07T19:44:46.2339946Z #define __SHRT_WIDTH__ 16 2025-05-07T19:44:46.2340438Z #define __FLT32_IS_IEC_60559__ 2 2025-05-07T19:44:46.2340796Z #define __LDBL_MIN__ 3.36210314311209350626267781732175260e-4932L 2025-05-07T19:44:46.2341210Z #define __STDC_UTF_16__ 1 2025-05-07T19:44:46.2341480Z #define __DBL_IS_IEC_60559__ 2 2025-05-07T19:44:46.2341797Z #define __DEC32_MAX__ 9.999999E96DF 2025-05-07T19:44:46.2342310Z #define __FLT64X_DENORM_MIN__ 3.64519953188247460252840593361941982e-4951F64x 2025-05-07T19:44:46.2342770Z #define __FLT32X_HAS_INFINITY__ 1 2025-05-07T19:44:46.2343170Z #define __INT32_MAX__ 0x7fffffff 2025-05-07T19:44:46.2343481Z #define __unix__ 1 2025-05-07T19:44:46.2343725Z #define __INT_WIDTH__ 32 2025-05-07T19:44:46.2344021Z #define __SIZEOF_LONG__ 8 2025-05-07T19:44:46.2344307Z #define __STDC_IEC_559__ 1 2025-05-07T19:44:46.2344572Z #define __STDC_ISO_10646__ 201103L 2025-05-07T19:44:46.2344883Z #define __UINT16_C(c) c 2025-05-07T19:44:46.2345140Z #define __DECIMAL_DIG__ 21 2025-05-07T19:44:46.2345444Z #define __STDC_IEC_559_COMPLEX__ 1 2025-05-07T19:44:46.2345827Z #define __FLT64_EPSILON__ 2.22044604925031308084726333618164062e-16F64 2025-05-07T19:44:46.2346239Z #define __gnu_linux__ 1 2025-05-07T19:44:46.2346489Z #define __FLT128_IS_IEC_60559__ 2 2025-05-07T19:44:46.2346808Z #define __FLT64X_MIN_10_EXP__ (-4931) 2025-05-07T19:44:46.2347109Z #define __LDBL_HAS_QUIET_NAN__ 1 2025-05-07T19:44:46.2347417Z #define __FLT64_MANT_DIG__ 53 2025-05-07T19:44:46.2347715Z #define __FLT64X_MANT_DIG__ 64 2025-05-07T19:44:46.2347984Z #define __GNUC__ 11 2025-05-07T19:44:46.2348244Z #define __pie__ 2 2025-05-07T19:44:46.2348471Z #define __MMX__ 1 2025-05-07T19:44:46.2348733Z #define __FLT_HAS_DENORM__ 1 2025-05-07T19:44:46.2349018Z #define __SIZEOF_LONG_DOUBLE__ 16 2025-05-07T19:44:46.2349341Z #define __BIGGEST_ALIGNMENT__ 16 2025-05-07T19:44:46.2349632Z #define __FLT64_MAX_10_EXP__ 308 2025-05-07T19:44:46.2350029Z #define __DBL_MAX__ ((double)1.79769313486231570814527423731704357e+308L) 2025-05-07T19:44:46.2350456Z #define __INT_FAST32_MAX__ 0x7fffffffffffffffL 2025-05-07T19:44:46.2350836Z #define __DBL_HAS_INFINITY__ 1 2025-05-07T19:44:46.2351155Z #define __SIZEOF_FLOAT__ 4 2025-05-07T19:44:46.2351442Z #define __HAVE_SPECULATION_SAFE_VALUE 1 2025-05-07T19:44:46.2351788Z #define __DEC32_MIN_EXP__ (-94) 2025-05-07T19:44:46.2352075Z #define __INTPTR_WIDTH__ 64 2025-05-07T19:44:46.2352379Z #define __FLT64X_HAS_INFINITY__ 1 2025-05-07T19:44:46.2352683Z #define __UINT_LEAST32_MAX__ 0xffffffffU 2025-05-07T19:44:46.2353022Z #define __FLT32X_HAS_DENORM__ 1 2025-05-07T19:44:46.2353316Z #define __INT_FAST16_TYPE__ long int 2025-05-07T19:44:46.2353645Z #define __MMX_WITH_SSE__ 1 2025-05-07T19:44:46.2353943Z #define __LDBL_HAS_DENORM__ 1 2025-05-07T19:44:46.2354230Z #define __FLT128_HAS_INFINITY__ 1 2025-05-07T19:44:46.2354550Z #define __DEC32_MIN__ 1E-95DF 2025-05-07T19:44:46.2354831Z #define __DBL_MAX_EXP__ 1024 2025-05-07T19:44:46.2355143Z #define __WCHAR_WIDTH__ 32 2025-05-07T19:44:46.2355485Z #define __FLT32_MAX__ 3.40282346638528859811704183484516925e+38F32 2025-05-07T19:44:46.2355911Z #define __DEC128_EPSILON__ 1E-33DL 2025-05-07T19:44:46.2356211Z #define __SSE2_MATH__ 1 2025-05-07T19:44:46.2356505Z #define __ATOMIC_HLE_RELEASE 131072 2025-05-07T19:44:46.2356827Z #define __PTRDIFF_MAX__ 0x7fffffffffffffffL 2025-05-07T19:44:46.2357185Z #define __amd64 1 2025-05-07T19:44:46.2357459Z #define __STDC_NO_THREADS__ 1 2025-05-07T19:44:46.2357749Z #define __ATOMIC_HLE_ACQUIRE 65536 2025-05-07T19:44:46.2358108Z #define __LONG_LONG_MAX__ 0x7fffffffffffffffLL 2025-05-07T19:44:46.2358454Z #define __SIZEOF_SIZE_T__ 8 2025-05-07T19:44:46.2358765Z #define __FLT64X_MIN_EXP__ (-16381) 2025-05-07T19:44:46.2359062Z #define __SIZEOF_WINT_T__ 4 2025-05-07T19:44:46.2359379Z #define __LONG_LONG_WIDTH__ 64 2025-05-07T19:44:46.2359662Z #define __FLT32_MAX_EXP__ 128 2025-05-07T19:44:46.2359964Z #define __GXX_ABI_VERSION 1016 2025-05-07T19:44:46.2360245Z #define __FLT_MIN_EXP__ (-125) 2025-05-07T19:44:46.2360558Z #define __GCC_HAVE_DWARF2_CFI_ASM 1 2025-05-07T19:44:46.2360997Z #define __INT16_MAX__ 0x7fff 2025-05-07T19:44:46.2361266Z #define __x86_64 1 2025-05-07T19:44:46.2361553Z #define __INT_FAST64_TYPE__ long int 2025-05-07T19:44:46.2361948Z #define __FLT64_DENORM_MIN__ 4.94065645841246544176568792868221372e-324F64 2025-05-07T19:44:46.2362465Z #define __DBL_MIN__ ((double)2.22507385850720138309023271733240406e-308L) 2025-05-07T19:44:46.2362948Z #define __FLT128_EPSILON__ 1.92592994438723585305597794258492732e-34F128 2025-05-07T19:44:46.2363470Z #define __FLT64X_NORM_MAX__ 1.18973149535723176502126385303097021e+4932F64x 2025-05-07T19:44:46.2363953Z #define __SIZEOF_POINTER__ 8 2025-05-07T19:44:46.2364248Z #define __LP64__ 1 2025-05-07T19:44:46.2364517Z #define __DBL_HAS_QUIET_NAN__ 1 2025-05-07T19:44:46.2364886Z #define __FLT32X_EPSILON__ 2.22044604925031308084726333618164062e-16F32x 2025-05-07T19:44:46.2365315Z #define __DECIMAL_BID_FORMAT__ 1 2025-05-07T19:44:46.2365606Z #define __FLT64_MIN_EXP__ (-1021) 2025-05-07T19:44:46.2365925Z #define __FLT64_MIN_10_EXP__ (-307) 2025-05-07T19:44:46.2366227Z #define __FLT64X_DECIMAL_DIG__ 21 2025-05-07T19:44:46.2366546Z #define __DEC128_MIN__ 1E-6143DL 2025-05-07T19:44:46.2366833Z #define __REGISTER_PREFIX__ 2025-05-07T19:44:46.2367139Z #define __UINT16_MAX__ 0xffff 2025-05-07T19:44:46.2367422Z #define __DBL_HAS_DENORM__ 1 2025-05-07T19:44:46.2367740Z #define __LDBL_HAS_INFINITY__ 1 2025-05-07T19:44:46.2368124Z #define __FLT32_MIN__ 1.17549435082228750796873653722224568e-38F32 2025-05-07T19:44:46.2368516Z #define __UINT8_TYPE__ unsigned char 2025-05-07T19:44:46.2368841Z #define __FLT_DIG__ 6 2025-05-07T19:44:46.2369090Z #define __NO_INLINE__ 1 2025-05-07T19:44:46.2369372Z #define __DEC_EVAL_METHOD__ 2 2025-05-07T19:44:46.2369711Z #define __DEC128_MAX__ 9.999999999999999999999999999999999E6144DL 2025-05-07T19:44:46.2370207Z #define __FLT_MANT_DIG__ 24 2025-05-07T19:44:46.2370663Z #define __LDBL_DECIMAL_DIG__ 21 2025-05-07T19:44:46.2371049Z #define __VERSION__ "11.4.0" 2025-05-07T19:44:46.2371377Z #define __UINT64_C(c) c ## UL 2025-05-07T19:44:46.2371665Z #define _STDC_PREDEF_H 1 2025-05-07T19:44:46.2371973Z #define __INT_LEAST32_MAX__ 0x7fffffff 2025-05-07T19:44:46.2372293Z #define __GCC_ATOMIC_INT_LOCK_FREE 2 2025-05-07T19:44:46.2372639Z #define __FLT128_MAX_EXP__ 16384 2025-05-07T19:44:46.2372928Z #define __FLT32_MANT_DIG__ 24 2025-05-07T19:44:46.2373279Z #define __FLOAT_WORD_ORDER__ __ORDER_LITTLE_ENDIAN__ 2025-05-07T19:44:46.2373633Z #define __FLT128_HAS_DENORM__ 1 2025-05-07T19:44:46.2373945Z #define __FLT32_DECIMAL_DIG__ 9 2025-05-07T19:44:46.2374226Z #define __FLT128_DIG__ 33 2025-05-07T19:44:46.2374521Z #define __INT32_C(c) c 2025-05-07T19:44:46.2374810Z #define __DEC64_EPSILON__ 1E-15DD 2025-05-07T19:44:46.2375112Z #define __ORDER_PDP_ENDIAN__ 3412 2025-05-07T19:44:46.2375436Z #define __DEC128_MIN_EXP__ (-6142) 2025-05-07T19:44:46.2375735Z #define __INT_FAST32_TYPE__ long int 2025-05-07T19:44:46.2376098Z #define __UINT_LEAST16_TYPE__ short unsigned int 2025-05-07T19:44:46.2376429Z #define unix 1 2025-05-07T19:44:46.2376705Z #define __SIZE_TYPE__ long unsigned int 2025-05-07T19:44:46.2377050Z #define __UINT64_MAX__ 0xffffffffffffffffUL 2025-05-07T19:44:46.2377407Z #define __FLT_IS_IEC_60559__ 2 2025-05-07T19:44:46.2377756Z #define __GNUC_WIDE_EXECUTION_CHARSET_NAME "UTF-32LE" 2025-05-07T19:44:46.2378143Z #define __FLT64X_DIG__ 18 2025-05-07T19:44:46.2378449Z #define __INT8_TYPE__ signed char 2025-05-07T19:44:46.2378733Z #define __ELF__ 1 2025-05-07T19:44:46.2379018Z #define __GCC_ASM_FLAG_OUTPUTS__ 1 2025-05-07T19:44:46.2379321Z #define __UINT32_TYPE__ unsigned int 2025-05-07T19:44:46.2379660Z #define __FLT_RADIX__ 2 2025-05-07T19:44:46.2379929Z #define __INT_LEAST16_TYPE__ short int 2025-05-07T19:44:46.2380345Z #define __LDBL_EPSILON__ 1.08420217248550443400745280086994171e-19L 2025-05-07T19:44:46.2380739Z #define __UINTMAX_C(c) c ## UL 2025-05-07T19:44:46.2381040Z #define __SSE_MATH__ 1 2025-05-07T19:44:46.2381282Z #define __k8 1 2025-05-07T19:44:46.2381626Z #define __FLT32X_MIN__ 2.22507385850720138309023271733240406e-308F32x 2025-05-07T19:44:46.2382163Z #define __SIG_ATOMIC_MAX__ 0x7fffffff 2025-05-07T19:44:46.2382595Z #define __GCC_ATOMIC_WCHAR_T_LOCK_FREE 2 2025-05-07T19:44:46.2382943Z #define __SIZEOF_PTRDIFF_T__ 8 2025-05-07T19:44:46.2383210Z #define __LDBL_DIG__ 18 2025-05-07T19:44:46.2383493Z #define __FLT64_IS_IEC_60559__ 2 2025-05-07T19:44:46.2383768Z #define __x86_64__ 1 2025-05-07T19:44:46.2384049Z #define __FLT32X_MIN_EXP__ (-1021) 2025-05-07T19:44:46.2384375Z #define __DEC32_SUBNORMAL_MIN__ 0.000001E-95DF 2025-05-07T19:44:46.2384771Z #define __INT_FAST16_MAX__ 0x7fffffffffffffffL 2025-05-07T19:44:46.2385228Z #define __FLT64_DIG__ 15 2025-05-07T19:44:46.2385532Z #define __UINT_FAST32_MAX__ 0xffffffffffffffffUL 2025-05-07T19:44:46.2385935Z #define __UINT_LEAST64_TYPE__ long unsigned int 2025-05-07T19:44:46.2386277Z #define __FLT_HAS_QUIET_NAN__ 1 2025-05-07T19:44:46.2386599Z #define __FLT_MAX_10_EXP__ 38 2025-05-07T19:44:46.2386896Z #define __LONG_MAX__ 0x7fffffffffffffffL 2025-05-07T19:44:46.2387248Z #define __FLT64X_HAS_DENORM__ 1 2025-05-07T19:44:46.2387651Z #define __DEC128_SUBNORMAL_MIN__ 0.000000000000000000000000000000001E-6143DL 2025-05-07T19:44:46.2388101Z #define __FLT_HAS_INFINITY__ 1 2025-05-07T19:44:46.2388411Z #define __GNUC_EXECUTION_CHARSET_NAME "UTF-8" 2025-05-07T19:44:46.2388797Z #define __UINT_FAST16_TYPE__ long unsigned int 2025-05-07T19:44:46.2389160Z #define __DEC64_MAX__ 9.999999999999999E384DD 2025-05-07T19:44:46.2389482Z #define __INT_FAST32_WIDTH__ 64 2025-05-07T19:44:46.2389810Z #define __CHAR16_TYPE__ short unsigned int 2025-05-07T19:44:46.2390143Z #define __PRAGMA_REDEFINE_EXTNAME 1 2025-05-07T19:44:46.2390466Z #define __SIZE_WIDTH__ 64 2025-05-07T19:44:46.2390716Z #define __SEG_FS 1 2025-05-07T19:44:46.2390992Z #define __INT_LEAST16_MAX__ 0x7fff 2025-05-07T19:44:46.2391282Z #define __DEC64_MANT_DIG__ 16 2025-05-07T19:44:46.2391593Z #define __INT64_MAX__ 0x7fffffffffffffffL 2025-05-07T19:44:46.2391928Z #define __SEG_GS 1 2025-05-07T19:44:46.2392257Z #define __FLT32_DENORM_MIN__ 1.40129846432481707092372958328991613e-45F32 2025-05-07T19:44:46.2392692Z #define __SIG_ATOMIC_WIDTH__ 32 2025-05-07T19:44:46.2392981Z #define __INT_LEAST64_TYPE__ long int 2025-05-07T19:44:46.2393308Z #define __INT16_TYPE__ short int 2025-05-07T19:44:46.2393586Z #define __INT_LEAST8_TYPE__ signed char 2025-05-07T19:44:46.2393910Z #define __STDC_VERSION__ 201710L 2025-05-07T19:44:46.2394191Z #define __SIZEOF_INT__ 4 2025-05-07T19:44:46.2394486Z #define __DEC32_MAX_EXP__ 97 2025-05-07T19:44:46.2394764Z #define __INT_FAST8_MAX__ 0x7f 2025-05-07T19:44:46.2395151Z #define __FLT128_MAX__ 1.18973149535723176508575932662800702e+4932F128 2025-05-07T19:44:46.2395578Z #define __INTPTR_MAX__ 0x7fffffffffffffffL 2025-05-07T19:44:46.2395884Z #define linux 1 2025-05-07T19:44:46.2396139Z #define __FLT64_HAS_QUIET_NAN__ 1 2025-05-07T19:44:46.2396437Z #define __FLT32_MIN_10_EXP__ (-37) 2025-05-07T19:44:46.2396748Z #define __FLT32X_DIG__ 15 2025-05-07T19:44:46.2397011Z #define __PTRDIFF_WIDTH__ 64 2025-05-07T19:44:46.2397310Z #define __LDBL_MANT_DIG__ 64 2025-05-07T19:44:46.2397594Z #define __FLT64_HAS_INFINITY__ 1 2025-05-07T19:44:46.2397980Z #define __FLT64X_MAX__ 1.18973149535723176502126385303097021e+4932F64x 2025-05-07T19:44:46.2398439Z #define __SIG_ATOMIC_MIN__ (-__SIG_ATOMIC_MAX__ - 1) 2025-05-07T19:44:46.2398787Z #define __code_model_small__ 1 2025-05-07T19:44:46.2399084Z #define __GCC_ATOMIC_LONG_LOCK_FREE 2 2025-05-07T19:44:46.2399381Z #define __DEC32_MANT_DIG__ 7 2025-05-07T19:44:46.2399669Z #define __k8__ 1 2025-05-07T19:44:46.2399912Z #define __INTPTR_TYPE__ long int 2025-05-07T19:44:46.2400251Z #define __UINT16_TYPE__ short unsigned int 2025-05-07T19:44:46.2400569Z #define __WCHAR_TYPE__ int 2025-05-07T19:44:46.2400860Z #define __pic__ 2 2025-05-07T19:44:46.2401129Z #define __UINTPTR_MAX__ 0xffffffffffffffffUL 2025-05-07T19:44:46.2401494Z #define __INT_FAST64_WIDTH__ 64 2025-05-07T19:44:46.2401839Z #define __INT_FAST64_MAX__ 0x7fffffffffffffffL 2025-05-07T19:44:46.2402190Z #define __GCC_ATOMIC_TEST_AND_SET_TRUEVAL 1 2025-05-07T19:44:46.2402693Z #define __FLT_NORM_MAX__ 3.40282346638528859811704183484516925e+38F 2025-05-07T19:44:46.2403089Z #define __FLT32_HAS_INFINITY__ 1 2025-05-07T19:44:46.2403411Z #define __FLT64X_MAX_EXP__ 16384 2025-05-07T19:44:46.2403727Z #define __UINT_FAST64_TYPE__ long unsigned int 2025-05-07T19:44:46.2404099Z #define __INT_MAX__ 0x7fffffff 2025-05-07T19:44:46.2404376Z #define __linux__ 1 2025-05-07T19:44:46.2404647Z #define __INT64_TYPE__ long int 2025-05-07T19:44:46.2404932Z #define __FLT_MAX_EXP__ 128 2025-05-07T19:44:46.2405241Z #define __ORDER_BIG_ENDIAN__ 4321 2025-05-07T19:44:46.2405634Z #define __DBL_MANT_DIG__ 53 2025-05-07T19:44:46.2405910Z #define __SIZEOF_FLOAT128__ 16 2025-05-07T19:44:46.2406253Z #define __INT_LEAST64_MAX__ 0x7fffffffffffffffL 2025-05-07T19:44:46.2406607Z #define __GCC_ATOMIC_CHAR16_T_LOCK_FREE 2 2025-05-07T19:44:46.2406963Z #define __DEC64_MIN__ 1E-383DD 2025-05-07T19:44:46.2407252Z #define __WINT_TYPE__ unsigned int 2025-05-07T19:44:46.2407600Z #define __UINT_LEAST32_TYPE__ unsigned int 2025-05-07T19:44:46.2407921Z #define __SIZEOF_SHORT__ 2 2025-05-07T19:44:46.2408301Z #define __FLT32_NORM_MAX__ 3.40282346638528859811704183484516925e+38F32 2025-05-07T19:44:46.2408709Z #define __SSE__ 1 2025-05-07T19:44:46.2408949Z #define __LDBL_MIN_EXP__ (-16381) 2025-05-07T19:44:46.2409332Z #define __FLT64_MAX__ 1.79769313486231570814527423731704357e+308F64 2025-05-07T19:44:46.2409699Z #define __amd64__ 1 2025-05-07T19:44:46.2410032Z #define __WINT_WIDTH__ 32 2025-05-07T19:44:46.2410470Z #define __INT_LEAST8_MAX__ 0x7f 2025-05-07T19:44:46.2410806Z #define __INT_LEAST64_WIDTH__ 64 2025-05-07T19:44:46.2411194Z #define __LDBL_MAX_EXP__ 16384 2025-05-07T19:44:46.2411510Z #define __FLT32X_MAX_10_EXP__ 308 2025-05-07T19:44:46.2411811Z #define __SIZEOF_INT128__ 16 2025-05-07T19:44:46.2412127Z #define __FLT64X_IS_IEC_60559__ 2 2025-05-07T19:44:46.2412446Z #define __LDBL_MAX_10_EXP__ 4932 2025-05-07T19:44:46.2412737Z #define __ATOMIC_RELAXED 0 2025-05-07T19:44:46.2413140Z #define __DBL_EPSILON__ ((double)2.22044604925031308084726333618164062e-16L) 2025-05-07T19:44:46.2413645Z #define __FLT128_MIN__ 3.36210314311209350626267781732175260e-4932F128 2025-05-07T19:44:46.2414061Z #define _LP64 1 2025-05-07T19:44:46.2414296Z #define __UINT8_C(c) c 2025-05-07T19:44:46.2414583Z #define __FLT64_MAX_EXP__ 1024 2025-05-07T19:44:46.2414870Z #define __INT_LEAST32_TYPE__ int 2025-05-07T19:44:46.2415187Z #define __SIZEOF_WCHAR_T__ 4 2025-05-07T19:44:46.2415511Z #define __UINT64_TYPE__ long unsigned int 2025-05-07T19:44:46.2415840Z #define __GNUC_PATCHLEVEL__ 0 2025-05-07T19:44:46.2416255Z #define __FLT128_NORM_MAX__ 1.18973149535723176508575932662800702e+4932F128 2025-05-07T19:44:46.2416757Z #define __FLT64_NORM_MAX__ 1.79769313486231570814527423731704357e+308F64 2025-05-07T19:44:46.2417203Z #define __FLT128_HAS_QUIET_NAN__ 1 2025-05-07T19:44:46.2417522Z #define __INTMAX_MAX__ 0x7fffffffffffffffL 2025-05-07T19:44:46.2417902Z #define __INT_FAST8_TYPE__ signed char 2025-05-07T19:44:46.2418300Z #define __FLT64X_MIN__ 3.36210314311209350626267781732175260e-4932F64x 2025-05-07T19:44:46.2418737Z #define __GNUC_STDC_INLINE__ 1 2025-05-07T19:44:46.2419062Z #define __FLT64_HAS_DENORM__ 1 2025-05-07T19:44:46.2419431Z #define __FLT32_EPSILON__ 1.19209289550781250000000000000000000e-7F32 2025-05-07T19:44:46.2419869Z #define __DBL_DECIMAL_DIG__ 17 2025-05-07T19:44:46.2420158Z #define __STDC_UTF_32__ 1 2025-05-07T19:44:46.2420463Z #define __INT_FAST8_WIDTH__ 8 2025-05-07T19:44:46.2420736Z #define __FXSR__ 1 2025-05-07T19:44:46.2421093Z #define __FLT32X_MAX__ 1.79769313486231570814527423731704357e+308F32x 2025-05-07T19:44:46.2421602Z #define __DBL_NORM_MAX__ ((double)1.79769313486231570814527423731704357e+308L) 2025-05-07T19:44:46.2422081Z #define __BYTE_ORDER__ __ORDER_LITTLE_ENDIAN__ 2025-05-07T19:44:46.2422547Z #define __INTMAX_WIDTH__ 64 2025-05-07T19:44:46.2422818Z #define __UINT32_C(c) c ## U 2025-05-07T19:44:46.2423182Z #define __FLT_DENORM_MIN__ 1.40129846432481707092372958328991613e-45F 2025-05-07T19:44:46.2423549Z #define __INT8_MAX__ 0x7f 2025-05-07T19:44:46.2423948Z #define __LONG_WIDTH__ 64 2025-05-07T19:44:46.2424198Z #define __PIC__ 2 2025-05-07T19:44:46.2424482Z #define __UINT_FAST32_TYPE__ long unsigned int 2025-05-07T19:44:46.2424887Z #define __FLT32X_NORM_MAX__ 1.79769313486231570814527423731704357e+308F32x 2025-05-07T19:44:46.2425315Z #define __CHAR32_TYPE__ unsigned int 2025-05-07T19:44:46.2425661Z #define __FLT_MAX__ 3.40282346638528859811704183484516925e+38F 2025-05-07T19:44:46.2426033Z #define __SSE2__ 1 2025-05-07T19:44:46.2426291Z #define __INT32_TYPE__ int 2025-05-07T19:44:46.2426609Z #define __SIZEOF_DOUBLE__ 8 2025-05-07T19:44:46.2426902Z #define __FLT_MIN_10_EXP__ (-37) 2025-05-07T19:44:46.2427242Z #define __FLT64_MIN__ 2.22507385850720138309023271733240406e-308F64 2025-05-07T19:44:46.2427632Z #define __INT_LEAST32_WIDTH__ 32 2025-05-07T19:44:46.2427908Z #define __INTMAX_TYPE__ long int 2025-05-07T19:44:46.2428210Z #define __DEC128_MAX_EXP__ 6145 2025-05-07T19:44:46.2428646Z #define __FLT32X_HAS_QUIET_NAN__ 1 2025-05-07T19:44:46.2429141Z #define __ATOMIC_CONSUME 1 2025-05-07T19:44:46.2429440Z #define __GNUC_MINOR__ 4 2025-05-07T19:44:46.2429735Z #define __INT_FAST16_WIDTH__ 64 2025-05-07T19:44:46.2430078Z #define __UINTMAX_MAX__ 0xffffffffffffffffUL 2025-05-07T19:44:46.2430403Z #define __PIE__ 2 2025-05-07T19:44:46.2430794Z #define __FLT32X_DENORM_MIN__ 4.94065645841246544176568792868221372e-324F32x 2025-05-07T19:44:46.2431221Z #define __DBL_MAX_10_EXP__ 308 2025-05-07T19:44:46.2431619Z #define __LDBL_DENORM_MIN__ 3.64519953188247460252840593361941982e-4951L 2025-05-07T19:44:46.2432025Z #define __INT16_C(c) c 2025-05-07T19:44:46.2432298Z #define __STDC__ 1 2025-05-07T19:44:46.2432550Z #define __PTRDIFF_TYPE__ long int 2025-05-07T19:44:46.2432872Z #define __ATOMIC_SEQ_CST 5 2025-05-07T19:44:46.2433175Z #define __FLT32X_MIN_10_EXP__ (-307) 2025-05-07T19:44:46.2433500Z #define __UINTPTR_TYPE__ long unsigned int 2025-05-07T19:44:46.2433901Z #define __DEC64_SUBNORMAL_MIN__ 0.000000000000001E-383DD 2025-05-07T19:44:46.2434266Z #define __DEC128_MANT_DIG__ 34 2025-05-07T19:44:46.2434587Z #define __LDBL_MIN_10_EXP__ (-4931) 2025-05-07T19:44:46.2434893Z #define __SIZEOF_LONG_LONG__ 8 2025-05-07T19:44:46.2435213Z #define __FLT128_DECIMAL_DIG__ 36 2025-05-07T19:44:46.2435524Z #define __GCC_ATOMIC_LLONG_LOCK_FREE 2 2025-05-07T19:44:46.2435870Z #define __FLT32_HAS_QUIET_NAN__ 1 2025-05-07T19:44:46.2436197Z #define __FLT_DECIMAL_DIG__ 9 2025-05-07T19:44:46.2436524Z #define __UINT_FAST16_MAX__ 0xffffffffffffffffUL 2025-05-07T19:44:46.2436995Z #define __LDBL_NORM_MAX__ 1.18973149535723176502126385303097021e+4932L 2025-05-07T19:44:46.2437412Z #define __GCC_ATOMIC_SHORT_LOCK_FREE 2 2025-05-07T19:44:46.2437770Z #define __UINT_FAST8_TYPE__ unsigned char 2025-05-07T19:44:46.2438091Z #define __ATOMIC_ACQ_REL 4 2025-05-07T19:44:46.2438391Z #define __ATOMIC_RELEASE 3 2025-05-07T19:44:46.2438569Z 2025-05-07T19:44:46.2892905Z 2025-05-07T19:44:46.2893650Z [INFO] Printing out all preprocessor defines in the C++ compiler ... 2025-05-07T19:44:46.2894178Z + conda run -n build_binary c++ -dM -E -x c++ - 2025-05-07T19:44:46.2894493Z 2025-05-07T19:44:48.1213323Z #define __DBL_MIN_EXP__ (-1021) 2025-05-07T19:44:48.1213732Z #define __cpp_attributes 200809L 2025-05-07T19:44:48.1214143Z #define __cpp_nontype_template_parameter_auto 201606L 2025-05-07T19:44:48.1214544Z #define __UINT_LEAST16_MAX__ 0xffff 2025-05-07T19:44:48.1214892Z #define __ATOMIC_ACQUIRE 2 2025-05-07T19:44:48.1215179Z #define __FLT128_MAX_10_EXP__ 4932 2025-05-07T19:44:48.1215560Z #define __FLT_MIN__ 1.17549435082228750796873653722224568e-38F 2025-05-07T19:44:48.1215983Z #define __GCC_IEC_559_COMPLEX 2 2025-05-07T19:44:48.1216311Z #define __cpp_aggregate_nsdmi 201304L 2025-05-07T19:44:48.1216677Z #define __UINT_LEAST8_TYPE__ unsigned char 2025-05-07T19:44:48.1217013Z #define __SIZEOF_FLOAT80__ 16 2025-05-07T19:44:48.1217334Z #define __INTMAX_C(c) c ## L 2025-05-07T19:44:48.1217606Z #define __CHAR_BIT__ 8 2025-05-07T19:44:48.1217893Z #define __UINT8_MAX__ 0xff 2025-05-07T19:44:48.1218161Z #define __SCHAR_WIDTH__ 8 2025-05-07T19:44:48.1218832Z #define __WINT_MAX__ 0xffffffffU 2025-05-07T19:44:48.1219129Z #define __FLT32_MIN_EXP__ (-125) 2025-05-07T19:44:48.1219460Z #define __cpp_static_assert 201411L 2025-05-07T19:44:48.1219774Z #define __ORDER_LITTLE_ENDIAN__ 1234 2025-05-07T19:44:48.1220127Z #define __SIZE_MAX__ 0xffffffffffffffffUL 2025-05-07T19:44:48.1220487Z #define __WCHAR_MAX__ 0x7fffffff 2025-05-07T19:44:48.1220838Z #define __GCC_HAVE_SYNC_COMPARE_AND_SWAP_1 1 2025-05-07T19:44:48.1221324Z #define __GCC_HAVE_SYNC_COMPARE_AND_SWAP_2 1 2025-05-07T19:44:48.1221876Z #define __GCC_HAVE_SYNC_COMPARE_AND_SWAP_4 1 2025-05-07T19:44:48.1222363Z #define __DBL_DENORM_MIN__ double(4.94065645841246544176568792868221372e-324L) 2025-05-07T19:44:48.1222826Z #define __GCC_HAVE_SYNC_COMPARE_AND_SWAP_8 1 2025-05-07T19:44:48.1223211Z #define __GCC_ATOMIC_CHAR_LOCK_FREE 2 2025-05-07T19:44:48.1223533Z #define __GCC_IEC_559 2 2025-05-07T19:44:48.1223846Z #define __FLT32X_DECIMAL_DIG__ 17 2025-05-07T19:44:48.1224158Z #define __FLT_EVAL_METHOD__ 0 2025-05-07T19:44:48.1224509Z #define __cpp_binary_literals 201304L 2025-05-07T19:44:48.1224836Z #define __FLT64_DECIMAL_DIG__ 17 2025-05-07T19:44:48.1225203Z #define __cpp_noexcept_function_type 201510L 2025-05-07T19:44:48.1225593Z #define __GCC_ATOMIC_CHAR32_T_LOCK_FREE 2 2025-05-07T19:44:48.1225943Z #define __cpp_variadic_templates 200704L 2025-05-07T19:44:48.1226353Z #define __UINT_FAST64_MAX__ 0xffffffffffffffffUL 2025-05-07T19:44:48.1226716Z #define __SIG_ATOMIC_TYPE__ int 2025-05-07T19:44:48.1227042Z #define __DBL_MIN_10_EXP__ (-307) 2025-05-07T19:44:48.1227363Z #define __FINITE_MATH_ONLY__ 0 2025-05-07T19:44:48.1227703Z #define __cpp_variable_templates 201304L 2025-05-07T19:44:48.1228028Z #define __FLT32X_MAX_EXP__ 1024 2025-05-07T19:44:48.1228343Z #define __FLT32_HAS_DENORM__ 1 2025-05-07T19:44:48.1229185Z #define __UINT_FAST8_MAX__ 0xff 2025-05-07T19:44:48.1229616Z #define __cpp_rvalue_reference 200610L 2025-05-07T19:44:48.1230047Z #define __cpp_nested_namespace_definitions 201411L 2025-05-07T19:44:48.1230418Z #define __DEC64_MAX_EXP__ 385 2025-05-07T19:44:48.1230721Z #define __INT8_C(c) c 2025-05-07T19:44:48.1230982Z #define __INT_LEAST8_WIDTH__ 8 2025-05-07T19:44:48.1231307Z #define __cpp_variadic_using 201611L 2025-05-07T19:44:48.1231653Z #define __UINT_LEAST64_MAX__ 0xffffffffffffffffUL 2025-05-07T19:44:48.1232031Z #define __INT_LEAST8_MAX__ 0x7f 2025-05-07T19:44:48.1232334Z #define __cpp_capture_star_this 201603L 2025-05-07T19:44:48.1232679Z #define __SHRT_MAX__ 0x7fff 2025-05-07T19:44:48.1233064Z #define __LDBL_MAX__ 1.18973149535723176502126385303097021e+4932L 2025-05-07T19:44:48.1233452Z #define __FLT64X_MAX_10_EXP__ 4932 2025-05-07T19:44:48.1233791Z #define __cpp_if_constexpr 201606L 2025-05-07T19:44:48.1234092Z #define __LDBL_IS_IEC_60559__ 2 2025-05-07T19:44:48.1234414Z #define __FLT64X_HAS_QUIET_NAN__ 1 2025-05-07T19:44:48.1234717Z #define __UINT_LEAST8_MAX__ 0xff 2025-05-07T19:44:48.1235054Z #define __GCC_ATOMIC_BOOL_LOCK_FREE 2 2025-05-07T19:44:48.1235482Z #define __FLT128_DENORM_MIN__ 6.47517511943802511092443895822764655e-4966F128 2025-05-07T19:44:48.1235971Z #define __UINTMAX_TYPE__ long unsigned int 2025-05-07T19:44:48.1236332Z #define __linux 1 2025-05-07T19:44:48.1236587Z #define __DEC32_EPSILON__ 1E-6DF 2025-05-07T19:44:48.1236922Z #define __FLT_EVAL_METHOD_TS_18661_3__ 0 2025-05-07T19:44:48.1237230Z #define __unix 1 2025-05-07T19:44:48.1237516Z #define __UINT32_MAX__ 0xffffffffU 2025-05-07T19:44:48.1237826Z #define __GXX_EXPERIMENTAL_CXX0X__ 1 2025-05-07T19:44:48.1238171Z #define __FLT128_MIN_EXP__ (-16381) 2025-05-07T19:44:48.1238475Z #define __WINT_MIN__ 0U 2025-05-07T19:44:48.1238767Z #define __FLT128_MIN_10_EXP__ (-4931) 2025-05-07T19:44:48.1239076Z #define __FLT32X_IS_IEC_60559__ 2 2025-05-07T19:44:48.1239405Z #define __INT_LEAST16_WIDTH__ 16 2025-05-07T19:44:48.1239727Z #define __SCHAR_MAX__ 0x7f 2025-05-07T19:44:48.1240002Z #define __FLT128_MANT_DIG__ 113 2025-05-07T19:44:48.1240339Z #define __WCHAR_MIN__ (-__WCHAR_MAX__ - 1) 2025-05-07T19:44:48.1240665Z #define __INT64_C(c) c ## L 2025-05-07T19:44:48.1241137Z #define __GCC_ATOMIC_POINTER_LOCK_FREE 2 2025-05-07T19:44:48.1241576Z #define __FLT32X_MANT_DIG__ 53 2025-05-07T19:44:48.1242010Z #define __GCC_ATOMIC_CHAR16_T_LOCK_FREE 2 2025-05-07T19:44:48.1242327Z #define __cpp_aligned_new 201606L 2025-05-07T19:44:48.1242645Z #define __USER_LABEL_PREFIX__ 2025-05-07T19:44:48.1242923Z #define __FLT32_MAX_10_EXP__ 38 2025-05-07T19:44:48.1243322Z #define __FLT64X_EPSILON__ 1.08420217248550443400745280086994171e-19F64x 2025-05-07T19:44:48.1243733Z #define __STDC_HOSTED__ 1 2025-05-07T19:44:48.1244091Z #define __DEC64_MIN_EXP__ (-382) 2025-05-07T19:44:48.1244403Z #define __cpp_decltype_auto 201304L 2025-05-07T19:44:48.1244695Z #define __DBL_DIG__ 15 2025-05-07T19:44:48.1244963Z #define __FLT32_DIG__ 6 2025-05-07T19:44:48.1245273Z #define __FLT_EPSILON__ 1.19209289550781250000000000000000000e-7F 2025-05-07T19:44:48.1245660Z #define __GXX_WEAK__ 1 2025-05-07T19:44:48.1245905Z #define __SHRT_WIDTH__ 16 2025-05-07T19:44:48.1246188Z #define __FLT32_IS_IEC_60559__ 2 2025-05-07T19:44:48.1246522Z #define __LDBL_MIN__ 3.36210314311209350626267781732175260e-4932L 2025-05-07T19:44:48.1246904Z #define __DBL_IS_IEC_60559__ 2 2025-05-07T19:44:48.1247203Z #define __DEC32_MAX__ 9.999999E96DF 2025-05-07T19:44:48.1247503Z #define __cpp_threadsafe_static_init 200806L 2025-05-07T19:44:48.1247863Z #define __cpp_enumerator_attributes 201411L 2025-05-07T19:44:48.1248273Z #define __FLT64X_DENORM_MIN__ 3.64519953188247460252840593361941982e-4951F64x 2025-05-07T19:44:48.1248703Z #define __FLT32X_HAS_INFINITY__ 1 2025-05-07T19:44:48.1248987Z #define __INT32_MAX__ 0x7fffffff 2025-05-07T19:44:48.1249274Z #define __unix__ 1 2025-05-07T19:44:48.1249502Z #define __INT_WIDTH__ 32 2025-05-07T19:44:48.1249778Z #define __SIZEOF_LONG__ 8 2025-05-07T19:44:48.1250128Z #define __STDC_IEC_559__ 1 2025-05-07T19:44:48.1250555Z #define __STDC_ISO_10646__ 201103L 2025-05-07T19:44:48.1250876Z #define __UINT16_C(c) c 2025-05-07T19:44:48.1251213Z #define __DECIMAL_DIG__ 21 2025-05-07T19:44:48.1251527Z #define __STDC_IEC_559_COMPLEX__ 1 2025-05-07T19:44:48.1251917Z #define __FLT64_EPSILON__ 2.22044604925031308084726333618164062e-16F64 2025-05-07T19:44:48.1252358Z #define __gnu_linux__ 1 2025-05-07T19:44:48.1252622Z #define __INT16_MAX__ 0x7fff 2025-05-07T19:44:48.1252945Z #define __FLT64_MIN_EXP__ (-1021) 2025-05-07T19:44:48.1253250Z #define __FLT64X_MIN_10_EXP__ (-4931) 2025-05-07T19:44:48.1253590Z #define __LDBL_HAS_QUIET_NAN__ 1 2025-05-07T19:44:48.1253909Z #define __FLT64_MANT_DIG__ 53 2025-05-07T19:44:48.1254205Z #define __FLT64X_MANT_DIG__ 64 2025-05-07T19:44:48.1254510Z #define __GNUC__ 11 2025-05-07T19:44:48.1254746Z #define __GXX_RTTI 1 2025-05-07T19:44:48.1255018Z #define __pie__ 2 2025-05-07T19:44:48.1255257Z #define __MMX__ 1 2025-05-07T19:44:48.1255532Z #define __FLT_HAS_DENORM__ 1 2025-05-07T19:44:48.1255822Z #define __SIZEOF_LONG_DOUBLE__ 16 2025-05-07T19:44:48.1256170Z #define __BIGGEST_ALIGNMENT__ 16 2025-05-07T19:44:48.1256570Z #define __STDC_UTF_16__ 1 2025-05-07T19:44:48.1256873Z #define __FLT64_MAX_10_EXP__ 308 2025-05-07T19:44:48.1257216Z #define __cpp_delegating_constructors 200604L 2025-05-07T19:44:48.1257543Z #define __FLT32_HAS_INFINITY__ 1 2025-05-07T19:44:48.1257932Z #define __DBL_MAX__ double(1.79769313486231570814527423731704357e+308L) 2025-05-07T19:44:48.1258324Z #define __cpp_raw_strings 200710L 2025-05-07T19:44:48.1258677Z #define __INT_FAST32_MAX__ 0x7fffffffffffffffL 2025-05-07T19:44:48.1259013Z #define __DBL_HAS_INFINITY__ 1 2025-05-07T19:44:48.1259328Z #define __SIZEOF_FLOAT__ 4 2025-05-07T19:44:48.1259610Z #define __HAVE_SPECULATION_SAFE_VALUE 1 2025-05-07T19:44:48.1259964Z #define __cpp_fold_expressions 201603L 2025-05-07T19:44:48.1260275Z #define __DEC32_MIN_EXP__ (-94) 2025-05-07T19:44:48.1260583Z #define __INTPTR_WIDTH__ 64 2025-05-07T19:44:48.1260869Z #define __FLT64X_HAS_INFINITY__ 1 2025-05-07T19:44:48.1261160Z #define __UINT_LEAST32_MAX__ 0xffffffffU 2025-05-07T19:44:48.1261493Z #define __FLT32X_HAS_DENORM__ 1 2025-05-07T19:44:48.1261769Z #define __INT_FAST16_TYPE__ long int 2025-05-07T19:44:48.1262173Z #define __MMX_WITH_SSE__ 1 2025-05-07T19:44:48.1262433Z #define __LDBL_HAS_DENORM__ 1 2025-05-07T19:44:48.1262734Z #define __cplusplus 201703L 2025-05-07T19:44:48.1263009Z #define __cpp_ref_qualifiers 200710L 2025-05-07T19:44:48.1263323Z #define __DEC32_MIN__ 1E-95DF 2025-05-07T19:44:48.1263582Z #define __DEPRECATED 1 2025-05-07T19:44:48.1263876Z #define __cpp_rvalue_references 200610L 2025-05-07T19:44:48.1264199Z #define __DBL_MAX_EXP__ 1024 2025-05-07T19:44:48.1264462Z #define __WCHAR_WIDTH__ 32 2025-05-07T19:44:48.1264870Z #define __FLT32_MAX__ 3.40282346638528859811704183484516925e+38F32 2025-05-07T19:44:48.1265237Z #define __DEC128_EPSILON__ 1E-33DL 2025-05-07T19:44:48.1265538Z #define __SSE2_MATH__ 1 2025-05-07T19:44:48.1265788Z #define __ATOMIC_HLE_RELEASE 131072 2025-05-07T19:44:48.1266123Z #define __PTRDIFF_MAX__ 0x7fffffffffffffffL 2025-05-07T19:44:48.1266423Z #define __amd64 1 2025-05-07T19:44:48.1266687Z #define __STDC_NO_THREADS__ 1 2025-05-07T19:44:48.1266998Z #define __ATOMIC_HLE_ACQUIRE 65536 2025-05-07T19:44:48.1267274Z #define __GNUG__ 11 2025-05-07T19:44:48.1267565Z #define __LONG_LONG_MAX__ 0x7fffffffffffffffLL 2025-05-07T19:44:48.1267887Z #define __SIZEOF_SIZE_T__ 8 2025-05-07T19:44:48.1268182Z #define __cpp_nsdmi 200809L 2025-05-07T19:44:48.1268454Z #define __FLT64X_MIN_EXP__ (-16381) 2025-05-07T19:44:48.1268764Z #define __SIZEOF_WINT_T__ 4 2025-05-07T19:44:48.1269023Z #define __LONG_LONG_WIDTH__ 64 2025-05-07T19:44:48.1269328Z #define __cpp_initializer_lists 200806L 2025-05-07T19:44:48.1269637Z #define __FLT32_MAX_EXP__ 128 2025-05-07T19:44:48.1269933Z #define __cpp_hex_float 201603L 2025-05-07T19:44:48.1270233Z #define __GXX_ABI_VERSION 1016 2025-05-07T19:44:48.1270504Z #define __FLT128_HAS_INFINITY__ 1 2025-05-07T19:44:48.1270822Z #define __FLT_MIN_EXP__ (-125) 2025-05-07T19:44:48.1271098Z #define __GCC_HAVE_DWARF2_CFI_ASM 1 2025-05-07T19:44:48.1271399Z #define __x86_64 1 2025-05-07T19:44:48.1271641Z #define __cpp_lambdas 200907L 2025-05-07T19:44:48.1271948Z #define __INT_FAST64_TYPE__ long int 2025-05-07T19:44:48.1272323Z #define __FLT64_DENORM_MIN__ 4.94065645841246544176568792868221372e-324F64 2025-05-07T19:44:48.1272745Z #define __cpp_template_auto 201606L 2025-05-07T19:44:48.1273108Z #define __DBL_MIN__ double(2.22507385850720138309023271733240406e-308L) 2025-05-07T19:44:48.1273594Z #define __FLT128_EPSILON__ 1.92592994438723585305597794258492732e-34F128 2025-05-07T19:44:48.1274100Z #define __FLT64X_NORM_MAX__ 1.18973149535723176502126385303097021e+4932F64x 2025-05-07T19:44:48.1274497Z #define __SIZEOF_POINTER__ 8 2025-05-07T19:44:48.1274787Z #define __LP64__ 1 2025-05-07T19:44:48.1275019Z #define __DBL_HAS_QUIET_NAN__ 1 2025-05-07T19:44:48.1275396Z #define __FLT32X_EPSILON__ 2.22044604925031308084726333618164062e-16F32x 2025-05-07T19:44:48.1275786Z #define __DECIMAL_BID_FORMAT__ 1 2025-05-07T19:44:48.1276097Z #define __FLT64_MIN_10_EXP__ (-307) 2025-05-07T19:44:48.1276383Z #define __FLT64X_DECIMAL_DIG__ 21 2025-05-07T19:44:48.1276701Z #define __DEC128_MIN__ 1E-6143DL 2025-05-07T19:44:48.1277000Z #define __REGISTER_PREFIX__ 2025-05-07T19:44:48.1277264Z #define __UINT16_MAX__ 0xffff 2025-05-07T19:44:48.1277557Z #define __LDBL_HAS_INFINITY__ 1 2025-05-07T19:44:48.1277885Z #define __FLT32_MIN__ 1.17549435082228750796873653722224568e-38F32 2025-05-07T19:44:48.1278280Z #define __UINT8_TYPE__ unsigned char 2025-05-07T19:44:48.1278565Z #define __FLT_DIG__ 6 2025-05-07T19:44:48.1278824Z #define __NO_INLINE__ 1 2025-05-07T19:44:48.1279069Z #define __DEC_EVAL_METHOD__ 2 2025-05-07T19:44:48.1279431Z #define __DEC128_MAX__ 9.999999999999999999999999999999999E6144DL 2025-05-07T19:44:48.1279809Z #define __FLT_MANT_DIG__ 24 2025-05-07T19:44:48.1280068Z #define __LDBL_DECIMAL_DIG__ 21 2025-05-07T19:44:48.1280358Z #define __VERSION__ "11.4.0" 2025-05-07T19:44:48.1280619Z #define __UINT64_C(c) c ## UL 2025-05-07T19:44:48.1280925Z #define __cpp_unicode_characters 201411L 2025-05-07T19:44:48.1281235Z #define _STDC_PREDEF_H 1 2025-05-07T19:44:48.1281613Z #define __INT_LEAST32_MAX__ 0x7fffffff 2025-05-07T19:44:48.1281909Z #define __GCC_ATOMIC_INT_LOCK_FREE 2 2025-05-07T19:44:48.1282226Z #define __FLT128_MAX_EXP__ 16384 2025-05-07T19:44:48.1282497Z #define __FLT32_MANT_DIG__ 24 2025-05-07T19:44:48.1282838Z #define __FLOAT_WORD_ORDER__ __ORDER_LITTLE_ENDIAN__ 2025-05-07T19:44:48.1283220Z #define __cpp_aggregate_bases 201603L 2025-05-07T19:44:48.1283514Z #define __FLT128_HAS_DENORM__ 1 2025-05-07T19:44:48.1283822Z #define __FLT32_DECIMAL_DIG__ 9 2025-05-07T19:44:48.1284096Z #define __FLT128_DIG__ 33 2025-05-07T19:44:48.1284493Z #define __INT32_C(c) c 2025-05-07T19:44:48.1284748Z #define __DEC64_EPSILON__ 1E-15DD 2025-05-07T19:44:48.1285065Z #define __ORDER_PDP_ENDIAN__ 3412 2025-05-07T19:44:48.1285352Z #define __DEC128_MIN_EXP__ (-6142) 2025-05-07T19:44:48.1285668Z #define __INT_FAST32_TYPE__ long int 2025-05-07T19:44:48.1285990Z #define __UINT_LEAST16_TYPE__ short unsigned int 2025-05-07T19:44:48.1286345Z #define unix 1 2025-05-07T19:44:48.1286609Z #define __DBL_HAS_DENORM__ 1 2025-05-07T19:44:48.1286881Z #define __cpp_rtti 199711L 2025-05-07T19:44:48.1287192Z #define __SIZE_TYPE__ long unsigned int 2025-05-07T19:44:48.1287516Z #define __UINT64_MAX__ 0xffffffffffffffffUL 2025-05-07T19:44:48.1287867Z #define __FLT_IS_IEC_60559__ 2 2025-05-07T19:44:48.1288183Z #define __GNUC_WIDE_EXECUTION_CHARSET_NAME "UTF-32LE" 2025-05-07T19:44:48.1288561Z #define __FLT64X_DIG__ 18 2025-05-07T19:44:48.1288828Z #define __INT8_TYPE__ signed char 2025-05-07T19:44:48.1289162Z #define __cpp_digit_separators 201309L 2025-05-07T19:44:48.1289458Z #define __ELF__ 1 2025-05-07T19:44:48.1289734Z #define __GCC_ASM_FLAG_OUTPUTS__ 1 2025-05-07T19:44:48.1290128Z #define __UINT32_TYPE__ unsigned int 2025-05-07T19:44:48.1290595Z #define __FLT_RADIX__ 2 2025-05-07T19:44:48.1290969Z #define __INT_LEAST16_TYPE__ short int 2025-05-07T19:44:48.1291367Z #define __LDBL_EPSILON__ 1.08420217248550443400745280086994171e-19L 2025-05-07T19:44:48.1291812Z #define __UINTMAX_C(c) c ## UL 2025-05-07T19:44:48.1292123Z #define __GLIBCXX_BITSIZE_INT_N_0 128 2025-05-07T19:44:48.1292464Z #define __k8 1 2025-05-07T19:44:48.1292794Z #define __FLT32X_MIN__ 2.22507385850720138309023271733240406e-308F32x 2025-05-07T19:44:48.1293231Z #define __SIG_ATOMIC_MAX__ 0x7fffffff 2025-05-07T19:44:48.1293585Z #define __GCC_ATOMIC_WCHAR_T_LOCK_FREE 2 2025-05-07T19:44:48.1293908Z #define __SIZEOF_PTRDIFF_T__ 8 2025-05-07T19:44:48.1294225Z #define __LDBL_DIG__ 18 2025-05-07T19:44:48.1294493Z #define __FLT64_IS_IEC_60559__ 2 2025-05-07T19:44:48.1294801Z #define __x86_64__ 1 2025-05-07T19:44:48.1295063Z #define __FLT32X_MIN_EXP__ (-1021) 2025-05-07T19:44:48.1295416Z #define __DEC32_SUBNORMAL_MIN__ 0.000001E-95DF 2025-05-07T19:44:48.1295781Z #define __INT_FAST16_MAX__ 0x7fffffffffffffffL 2025-05-07T19:44:48.1296140Z #define __FLT64_DIG__ 15 2025-05-07T19:44:48.1296441Z #define __UINT_FAST32_MAX__ 0xffffffffffffffffUL 2025-05-07T19:44:48.1296847Z #define __UINT_LEAST64_TYPE__ long unsigned int 2025-05-07T19:44:48.1297215Z #define __FLT_HAS_QUIET_NAN__ 1 2025-05-07T19:44:48.1297513Z #define __FLT_MAX_10_EXP__ 38 2025-05-07T19:44:48.1297847Z #define __LONG_MAX__ 0x7fffffffffffffffL 2025-05-07T19:44:48.1298169Z #define __FLT64X_HAS_DENORM__ 1 2025-05-07T19:44:48.1298590Z #define __DEC128_SUBNORMAL_MIN__ 0.000000000000000000000000000000001E-6143DL 2025-05-07T19:44:48.1299027Z #define __FLT_HAS_INFINITY__ 1 2025-05-07T19:44:48.1299372Z #define __GNUC_EXECUTION_CHARSET_NAME "UTF-8" 2025-05-07T19:44:48.1299726Z #define __cpp_unicode_literals 200710L 2025-05-07T19:44:48.1300101Z #define __UINT_FAST16_TYPE__ long unsigned int 2025-05-07T19:44:48.1300491Z #define __DEC64_MAX__ 9.999999999999999E384DD 2025-05-07T19:44:48.1300821Z #define __INT_FAST32_WIDTH__ 64 2025-05-07T19:44:48.1301166Z #define __CHAR16_TYPE__ short unsigned int 2025-05-07T19:44:48.1301505Z #define __PRAGMA_REDEFINE_EXTNAME 1 2025-05-07T19:44:48.1301840Z #define __SIZE_WIDTH__ 64 2025-05-07T19:44:48.1302105Z #define __SEG_FS 1 2025-05-07T19:44:48.1302383Z #define __INT_LEAST16_MAX__ 0x7fff 2025-05-07T19:44:48.1302862Z #define __DEC64_MANT_DIG__ 16 2025-05-07T19:44:48.1303180Z #define __INT64_MAX__ 0x7fffffffffffffffL 2025-05-07T19:44:48.1303480Z #define __SEG_GS 1 2025-05-07T19:44:48.1303845Z #define __FLT32_DENORM_MIN__ 1.40129846432481707092372958328991613e-45F32 2025-05-07T19:44:48.1304278Z #define __SIG_ATOMIC_WIDTH__ 32 2025-05-07T19:44:48.1304568Z #define __INT_LEAST64_TYPE__ long int 2025-05-07T19:44:48.1304905Z #define __INT16_TYPE__ short int 2025-05-07T19:44:48.1305195Z #define __INT_LEAST8_TYPE__ signed char 2025-05-07T19:44:48.1305599Z #define __cpp_structured_bindings 201606L 2025-05-07T19:44:48.1305903Z #define __SIZEOF_INT__ 4 2025-05-07T19:44:48.1306190Z #define __DEC32_MAX_EXP__ 97 2025-05-07T19:44:48.1306455Z #define __INT_FAST8_MAX__ 0x7f 2025-05-07T19:44:48.1306826Z #define __FLT128_MAX__ 1.18973149535723176508575932662800702e+4932F128 2025-05-07T19:44:48.1307254Z #define __INTPTR_MAX__ 0x7fffffffffffffffL 2025-05-07T19:44:48.1307581Z #define __cpp_sized_deallocation 201309L 2025-05-07T19:44:48.1307948Z #define __cpp_guaranteed_copy_elision 201606L 2025-05-07T19:44:48.1308255Z #define linux 1 2025-05-07T19:44:48.1308520Z #define __FLT64_HAS_QUIET_NAN__ 1 2025-05-07T19:44:48.1308802Z #define __FLT32_MIN_10_EXP__ (-37) 2025-05-07T19:44:48.1309105Z #define __EXCEPTIONS 1 2025-05-07T19:44:48.1309353Z #define __PTRDIFF_WIDTH__ 64 2025-05-07T19:44:48.1309641Z #define __LDBL_MANT_DIG__ 64 2025-05-07T19:44:48.1309912Z #define __cpp_range_based_for 201603L 2025-05-07T19:44:48.1310235Z #define __FLT64_HAS_INFINITY__ 1 2025-05-07T19:44:48.1310615Z #define __FLT64X_MAX__ 1.18973149535723176502126385303097021e+4932F64x 2025-05-07T19:44:48.1311015Z #define __STDCPP_DEFAULT_NEW_ALIGNMENT__ 16 2025-05-07T19:44:48.1311394Z #define __SIG_ATOMIC_MIN__ (-__SIG_ATOMIC_MAX__ - 1) 2025-05-07T19:44:48.1311733Z #define __code_model_small__ 1 2025-05-07T19:44:48.1312039Z #define __GCC_ATOMIC_LONG_LOCK_FREE 2 2025-05-07T19:44:48.1312350Z #define __cpp_nontype_template_args 201411L 2025-05-07T19:44:48.1312685Z #define __DEC32_MANT_DIG__ 7 2025-05-07T19:44:48.1312967Z #define __cpp_return_type_deduction 201304L 2025-05-07T19:44:48.1313291Z #define __k8__ 1 2025-05-07T19:44:48.1313529Z #define __INTPTR_TYPE__ long int 2025-05-07T19:44:48.1313849Z #define __UINT16_TYPE__ short unsigned int 2025-05-07T19:44:48.1314178Z #define __WCHAR_TYPE__ int 2025-05-07T19:44:48.1314428Z #define __pic__ 2 2025-05-07T19:44:48.1314706Z #define __UINTPTR_MAX__ 0xffffffffffffffffUL 2025-05-07T19:44:48.1315020Z #define __INT_FAST64_WIDTH__ 64 2025-05-07T19:44:48.1315325Z #define __cpp_decltype 200707L 2025-05-07T19:44:48.1315623Z #define __INT_FAST64_MAX__ 0x7fffffffffffffffL 2025-05-07T19:44:48.1315991Z #define __GCC_ATOMIC_TEST_AND_SET_TRUEVAL 1 2025-05-07T19:44:48.1316362Z #define __FLT_NORM_MAX__ 3.40282346638528859811704183484516925e+38F 2025-05-07T19:44:48.1316761Z #define __FLT64X_MAX_EXP__ 16384 2025-05-07T19:44:48.1317102Z #define __UINT_FAST64_TYPE__ long unsigned int 2025-05-07T19:44:48.1317434Z #define __cpp_inline_variables 201606L 2025-05-07T19:44:48.1317776Z #define __INT_MAX__ 0x7fffffff 2025-05-07T19:44:48.1318041Z #define __linux__ 1 2025-05-07T19:44:48.1318297Z #define __INT64_TYPE__ long int 2025-05-07T19:44:48.1318538Z #define __FLT_MAX_EXP__ 128 2025-05-07T19:44:48.1318788Z #define __ORDER_BIG_ENDIAN__ 4321 2025-05-07T19:44:48.1319040Z #define __DBL_MANT_DIG__ 53 2025-05-07T19:44:48.1319315Z #define __cpp_inheriting_constructors 201511L 2025-05-07T19:44:48.1319613Z #define __SIZEOF_FLOAT128__ 16 2025-05-07T19:44:48.1319915Z #define __INT_LEAST64_MAX__ 0x7fffffffffffffffL 2025-05-07T19:44:48.1320228Z #define __DEC64_MIN__ 1E-383DD 2025-05-07T19:44:48.1320474Z #define __WINT_TYPE__ unsigned int 2025-05-07T19:44:48.1320765Z #define __UINT_LEAST32_TYPE__ unsigned int 2025-05-07T19:44:48.1321043Z #define __SIZEOF_SHORT__ 2 2025-05-07T19:44:48.1321366Z #define __FLT32_NORM_MAX__ 3.40282346638528859811704183484516925e+38F32 2025-05-07T19:44:48.1321706Z #define __SSE__ 1 2025-05-07T19:44:48.1321942Z #define __LDBL_MIN_EXP__ (-16381) 2025-05-07T19:44:48.1322379Z #define __FLT64_MAX__ 1.79769313486231570814527423731704357e+308F64 2025-05-07T19:44:48.1322785Z #define __amd64__ 1 2025-05-07T19:44:48.1323060Z #define __WINT_WIDTH__ 32 2025-05-07T19:44:48.1323330Z #define __INT_LEAST64_WIDTH__ 64 2025-05-07T19:44:48.1323649Z #define __LDBL_MAX_EXP__ 16384 2025-05-07T19:44:48.1323931Z #define __FLT32X_MAX_10_EXP__ 308 2025-05-07T19:44:48.1324256Z #define __SIZEOF_INT128__ 16 2025-05-07T19:44:48.1324532Z #define __FLT64X_IS_IEC_60559__ 2 2025-05-07T19:44:48.1324895Z #define __LDBL_MAX_10_EXP__ 4932 2025-05-07T19:44:48.1325174Z #define __ATOMIC_RELAXED 0 2025-05-07T19:44:48.1325562Z #define __DBL_EPSILON__ double(2.22044604925031308084726333618164062e-16L) 2025-05-07T19:44:48.1326031Z #define __FLT128_MIN__ 3.36210314311209350626267781732175260e-4932F128 2025-05-07T19:44:48.1326430Z #define _LP64 1 2025-05-07T19:44:48.1326686Z #define __UINT8_C(c) c 2025-05-07T19:44:48.1326931Z #define __FLT64_MAX_EXP__ 1024 2025-05-07T19:44:48.1327204Z #define __INT_LEAST32_TYPE__ int 2025-05-07T19:44:48.1327457Z #define __SIZEOF_WCHAR_T__ 4 2025-05-07T19:44:48.1327719Z #define __GNUC_PATCHLEVEL__ 0 2025-05-07T19:44:48.1328055Z #define __FLT128_NORM_MAX__ 1.18973149535723176508575932662800702e+4932F128 2025-05-07T19:44:48.1328645Z #define __FLT64_NORM_MAX__ 1.79769313486231570814527423731704357e+308F64 2025-05-07T19:44:48.1329179Z #define __FLT128_HAS_QUIET_NAN__ 1 2025-05-07T19:44:48.1329584Z #define __INTMAX_MAX__ 0x7fffffffffffffffL 2025-05-07T19:44:48.1330026Z #define __INT_FAST8_TYPE__ signed char 2025-05-07T19:44:48.1330345Z #define __cpp_namespace_attributes 201411L 2025-05-07T19:44:48.1330743Z #define __FLT64X_MIN__ 3.36210314311209350626267781732175260e-4932F64x 2025-05-07T19:44:48.1331119Z #define __STDCPP_THREADS__ 1 2025-05-07T19:44:48.1331396Z #define __GNUC_STDC_INLINE__ 1 2025-05-07T19:44:48.1331657Z #define __FLT64_HAS_DENORM__ 1 2025-05-07T19:44:48.1332010Z #define __FLT32_EPSILON__ 1.19209289550781250000000000000000000e-7F32 2025-05-07T19:44:48.1332389Z #define __DBL_DECIMAL_DIG__ 17 2025-05-07T19:44:48.1332676Z #define __STDC_UTF_32__ 1 2025-05-07T19:44:48.1332980Z #define __INT_FAST8_WIDTH__ 8 2025-05-07T19:44:48.1333261Z #define __FXSR__ 1 2025-05-07T19:44:48.1333624Z #define __FLT32X_MAX__ 1.79769313486231570814527423731704357e+308F32x 2025-05-07T19:44:48.1334127Z #define __DBL_NORM_MAX__ double(1.79769313486231570814527423731704357e+308L) 2025-05-07T19:44:48.1334610Z #define __BYTE_ORDER__ __ORDER_LITTLE_ENDIAN__ 2025-05-07T19:44:48.1334948Z #define __INTMAX_WIDTH__ 64 2025-05-07T19:44:48.1335271Z #define __cpp_runtime_arrays 198712L 2025-05-07T19:44:48.1335595Z #define __UINT64_TYPE__ long unsigned int 2025-05-07T19:44:48.1335947Z #define __UINT32_C(c) c ## U 2025-05-07T19:44:48.1336243Z #define __cpp_alias_templates 200704L 2025-05-07T19:44:48.1336663Z #define __FLT_DENORM_MIN__ 1.40129846432481707092372958328991613e-45F 2025-05-07T19:44:48.1337085Z #define __FLT128_IS_IEC_60559__ 2 2025-05-07T19:44:48.1337385Z #define __INT8_MAX__ 0x7f 2025-05-07T19:44:48.1337695Z #define __LONG_WIDTH__ 64 2025-05-07T19:44:48.1337953Z #define __PIC__ 2 2025-05-07T19:44:48.1338221Z #define __UINT_FAST32_TYPE__ long unsigned int 2025-05-07T19:44:48.1338629Z #define __FLT32X_NORM_MAX__ 1.79769313486231570814527423731704357e+308F32x 2025-05-07T19:44:48.1339038Z #define __CHAR32_TYPE__ unsigned int 2025-05-07T19:44:48.1339373Z #define __FLT_MAX__ 3.40282346638528859811704183484516925e+38F 2025-05-07T19:44:48.1339737Z #define __cpp_constexpr 201603L 2025-05-07T19:44:48.1340010Z #define __SSE2__ 1 2025-05-07T19:44:48.1340245Z #define __cpp_deduction_guides 201703L 2025-05-07T19:44:48.1340548Z #define __INT32_TYPE__ int 2025-05-07T19:44:48.1340797Z #define __SIZEOF_DOUBLE__ 8 2025-05-07T19:44:48.1341071Z #define __cpp_exceptions 199711L 2025-05-07T19:44:48.1341343Z #define __FLT_MIN_10_EXP__ (-37) 2025-05-07T19:44:48.1341690Z #define __FLT64_MIN__ 2.22507385850720138309023271733240406e-308F64 2025-05-07T19:44:48.1342151Z #define __INT_LEAST32_WIDTH__ 32 2025-05-07T19:44:48.1342550Z #define __INTMAX_TYPE__ long int 2025-05-07T19:44:48.1342801Z #define __DEC128_MAX_EXP__ 6145 2025-05-07T19:44:48.1343065Z #define __FLT32X_HAS_QUIET_NAN__ 1 2025-05-07T19:44:48.1343340Z #define __ATOMIC_CONSUME 1 2025-05-07T19:44:48.1343572Z #define __GNUC_MINOR__ 4 2025-05-07T19:44:48.1343825Z #define __GLIBCXX_TYPE_INT_N_0 __int128 2025-05-07T19:44:48.1344096Z #define __INT_FAST16_WIDTH__ 64 2025-05-07T19:44:48.1344382Z #define __UINTMAX_MAX__ 0xffffffffffffffffUL 2025-05-07T19:44:48.1344663Z #define __PIE__ 2 2025-05-07T19:44:48.1347009Z #define __FLT32X_DENORM_MIN__ 4.94065645841246544176568792868221372e-324F32x 2025-05-07T19:44:48.1347440Z #define __cpp_template_template_args 201611L 2025-05-07T19:44:48.1347746Z #define __DBL_MAX_10_EXP__ 308 2025-05-07T19:44:48.1348084Z #define __LDBL_DENORM_MIN__ 3.64519953188247460252840593361941982e-4951L 2025-05-07T19:44:48.1348437Z #define __INT16_C(c) c 2025-05-07T19:44:48.1348660Z #define __STDC__ 1 2025-05-07T19:44:48.1348862Z #define __FLT32X_DIG__ 15 2025-05-07T19:44:48.1349118Z #define __PTRDIFF_TYPE__ long int 2025-05-07T19:44:48.1349379Z #define __ATOMIC_SEQ_CST 5 2025-05-07T19:44:48.1349653Z #define __FLT32X_MIN_10_EXP__ (-307) 2025-05-07T19:44:48.1349929Z #define __UINTPTR_TYPE__ long unsigned int 2025-05-07T19:44:48.1350267Z #define __DEC64_SUBNORMAL_MIN__ 0.000000000000001E-383DD 2025-05-07T19:44:48.1350581Z #define __DEC128_MANT_DIG__ 34 2025-05-07T19:44:48.1350844Z #define __LDBL_MIN_10_EXP__ (-4931) 2025-05-07T19:44:48.1351129Z #define __cpp_generic_lambdas 201304L 2025-05-07T19:44:48.1351399Z #define __SSE_MATH__ 1 2025-05-07T19:44:48.1351637Z #define __SIZEOF_LONG_LONG__ 8 2025-05-07T19:44:48.1351902Z #define __cpp_user_defined_literals 200809L 2025-05-07T19:44:48.1352202Z #define __FLT128_DECIMAL_DIG__ 36 2025-05-07T19:44:48.1352464Z #define __GCC_ATOMIC_LLONG_LOCK_FREE 2 2025-05-07T19:44:48.1352745Z #define __FLT32_HAS_QUIET_NAN__ 1 2025-05-07T19:44:48.1352994Z #define __FLT_DECIMAL_DIG__ 9 2025-05-07T19:44:48.1353285Z #define __UINT_FAST16_MAX__ 0xffffffffffffffffUL 2025-05-07T19:44:48.1353680Z #define __LDBL_NORM_MAX__ 1.18973149535723176502126385303097021e+4932L 2025-05-07T19:44:48.1354035Z #define __GCC_ATOMIC_SHORT_LOCK_FREE 2 2025-05-07T19:44:48.1354349Z #define __UINT_FAST8_TYPE__ unsigned char 2025-05-07T19:44:48.1354625Z #define _GNU_SOURCE 1 2025-05-07T19:44:48.1354867Z #define __cpp_init_captures 201304L 2025-05-07T19:44:48.1355127Z #define __ATOMIC_ACQ_REL 4 2025-05-07T19:44:48.1355377Z #define __ATOMIC_RELEASE 3 2025-05-07T19:44:48.1355527Z 2025-05-07T19:44:48.1989595Z 2025-05-07T19:44:48.1990123Z + conda run -n build_binary c++ --version 2025-05-07T19:44:48.1990411Z 2025-05-07T19:44:49.9932804Z c++ (conda-forge gcc 11.4.0-13) 11.4.0 2025-05-07T19:44:49.9933224Z Copyright (C) 2021 Free Software Foundation, Inc. 2025-05-07T19:44:49.9933704Z This is free software; see the source for copying conditions. There is NO 2025-05-07T19:44:49.9934262Z warranty; not even for MERCHANTABILITY or FITNESS FOR A PARTICULAR PURPOSE. 2025-05-07T19:44:49.9934665Z 2025-05-07T19:44:49.9934671Z 2025-05-07T19:44:50.0680463Z 2025-05-07T19:44:50.0681203Z [INFO] Printing the default version of the C standard used by the compiler ... 2025-05-07T19:44:50.0681856Z + conda run -n build_binary cc -dM -E - < /dev/null | grep __STDC_VERSION__ 2025-05-07T19:44:50.0682183Z 2025-05-07T19:44:51.9677867Z #define __STDC_VERSION__ 201710L 2025-05-07T19:44:51.9678161Z 2025-05-07T19:44:51.9678429Z [INFO] Printing the default version of the C++ standard used by the compiler ... 2025-05-07T19:44:51.9679077Z + conda run -n build_binary c++ -dM -E -x c++ - < /dev/null | grep __cplusplus 2025-05-07T19:44:51.9679404Z 2025-05-07T19:44:53.8330952Z #define __cplusplus 201703L 2025-05-07T19:44:53.8331239Z 2025-05-07T19:44:53.8331397Z [INSTALL] Successfully installed C/C++ compilers 2025-05-07T19:44:53.8406787Z ##[group]Run . $PRELUDE; install_build_tools $BUILD_ENV 2025-05-07T19:44:53.8407294Z . $PRELUDE; install_build_tools $BUILD_ENV 2025-05-07T19:44:53.8408072Z shell: bash --noprofile --norc -e -o pipefail {0} 2025-05-07T19:44:53.8408440Z env: 2025-05-07T19:44:53.8408719Z PRELUDE: .github/scripts/setup_env.bash 2025-05-07T19:44:53.8409044Z BUILD_ENV: build_binary 2025-05-07T19:44:53.8409339Z BUILD_TARGET: default 2025-05-07T19:44:53.8409595Z BUILD_VARIANT: cuda 2025-05-07T19:44:53.8409973Z BUILD_CUDA_VERSION: 12.6.3 2025-05-07T19:44:53.8410270Z ##[endgroup] 2025-05-07T19:44:54.2832000Z ################################################################################ 2025-05-07T19:44:54.2832401Z # Install Build Tools 2025-05-07T19:44:54.2832699Z # 2025-05-07T19:44:54.2849270Z # [2025-05-07T19:44:54.284Z] + install_build_tools build_binary 2025-05-07T19:44:54.2850276Z ################################################################################ 2025-05-07T19:44:54.2850785Z 2025-05-07T19:44:54.2861304Z [EXEC] [ATTEMPT 0/3] + wget -q --timeout 1 pypi.org -O /dev/null 2025-05-07T19:44:54.3723338Z [CHECK] Network does not appear to be blocked. 2025-05-07T19:44:54.3729923Z [INSTALL] Installing build tools ... 2025-05-07T19:44:54.3752683Z [EXEC] [ATTEMPT 0/3] + conda install -n build_binary -c conda-forge --override-channels -y auditwheel bazel cmake>=3.30 hypothesis jinja2 make ncurses ninja openblas patchelf rhash scikit-build wheel pyyaml 2025-05-07T19:44:55.0792979Z Channels: 2025-05-07T19:44:55.0793293Z - conda-forge 2025-05-07T19:44:55.0793552Z Platform: linux-64 2025-05-07T19:44:58.2188482Z Collecting package metadata (repodata.json): - \ | / done 2025-05-07T19:45:01.4892612Z Solving environment: \ | / - done 2025-05-07T19:45:01.5419142Z 2025-05-07T19:45:01.5419564Z ## Package Plan ## 2025-05-07T19:45:01.5419750Z 2025-05-07T19:45:01.5419975Z environment location: /github/home/miniconda/envs/build_binary 2025-05-07T19:45:01.5420298Z 2025-05-07T19:45:01.5420454Z added / updated specs: 2025-05-07T19:45:01.5420724Z - auditwheel 2025-05-07T19:45:01.5420934Z - bazel 2025-05-07T19:45:01.5421161Z - cmake[version='>=3.30'] 2025-05-07T19:45:01.5421438Z - hypothesis 2025-05-07T19:45:01.5421665Z - jinja2 2025-05-07T19:45:01.5421874Z - make 2025-05-07T19:45:01.5422061Z - ncurses 2025-05-07T19:45:01.5422269Z - ninja 2025-05-07T19:45:01.5422458Z - openblas 2025-05-07T19:45:01.5422675Z - patchelf 2025-05-07T19:45:01.5422872Z - pyyaml 2025-05-07T19:45:01.5423079Z - rhash 2025-05-07T19:45:01.5423269Z - scikit-build 2025-05-07T19:45:01.5423493Z - wheel 2025-05-07T19:45:01.5423604Z 2025-05-07T19:45:01.5423608Z 2025-05-07T19:45:01.5423729Z The following packages will be downloaded: 2025-05-07T19:45:01.5423964Z 2025-05-07T19:45:01.5424079Z package | build 2025-05-07T19:45:01.5424418Z ---------------------------|----------------- 2025-05-07T19:45:01.5424808Z alsa-lib-1.2.14 | hb9d3cd8_0 553 KB conda-forge 2025-05-07T19:45:01.5425259Z attrs-25.3.0 | pyh71513ae_0 56 KB conda-forge 2025-05-07T19:45:01.5425705Z auditwheel-6.2.0 | pyha804496_1 40 KB conda-forge 2025-05-07T19:45:01.5426145Z bazel-7.5.0 | h96810dc_2 47.4 MB conda-forge 2025-05-07T19:45:01.5426551Z c-ares-1.34.5 | hb9d3cd8_0 202 KB conda-forge 2025-05-07T19:45:01.5427006Z cairo-1.18.4 | h3394656_0 955 KB conda-forge 2025-05-07T19:45:01.5427470Z click-8.1.8 | pyh707e725_0 83 KB conda-forge 2025-05-07T19:45:01.5427902Z cmake-4.0.2 | h74e3db0_0 19.4 MB conda-forge 2025-05-07T19:45:01.5428641Z distro-1.9.0 | pyhd8ed1ab_1 41 KB conda-forge 2025-05-07T19:45:01.5429617Z exceptiongroup-1.2.2 | pyhd8ed1ab_1 20 KB conda-forge 2025-05-07T19:45:01.5430103Z expat-2.7.0 | h5888daf_0 137 KB conda-forge 2025-05-07T19:45:01.5430656Z font-ttf-dejavu-sans-mono-2.37| hab24e00_0 388 KB conda-forge 2025-05-07T19:45:01.5431417Z font-ttf-inconsolata-3.000 | h77eed37_0 94 KB conda-forge 2025-05-07T19:45:01.5431995Z font-ttf-source-code-pro-2.038| h77eed37_0 684 KB conda-forge 2025-05-07T19:45:01.5432579Z font-ttf-ubuntu-0.83 | h77eed37_3 1.5 MB conda-forge 2025-05-07T19:45:01.5433052Z fontconfig-2.15.0 | h7e30c49_1 259 KB conda-forge 2025-05-07T19:45:01.5433566Z fonts-conda-ecosystem-1 | 0 4 KB conda-forge 2025-05-07T19:45:01.5434117Z fonts-conda-forge-1 | 0 4 KB conda-forge 2025-05-07T19:45:01.5434599Z freetype-2.13.3 | ha770c72_1 168 KB conda-forge 2025-05-07T19:45:01.5435066Z giflib-5.2.2 | hd590300_0 75 KB conda-forge 2025-05-07T19:45:01.5435518Z graphite2-1.3.13 | h59595ed_1003 95 KB conda-forge 2025-05-07T19:45:01.5436013Z harfbuzz-11.0.0 | h76408a6_0 1.6 MB conda-forge 2025-05-07T19:45:01.5436488Z hypothesis-6.131.14 | pyha770c72_0 348 KB conda-forge 2025-05-07T19:45:01.5436970Z icu-75.1 | he02047a_0 11.6 MB conda-forge 2025-05-07T19:45:01.5437403Z ijar-7.5.0 | h5888daf_0 114 KB conda-forge 2025-05-07T19:45:01.5437836Z jinja2-3.1.6 | pyhd8ed1ab_0 110 KB conda-forge 2025-05-07T19:45:01.5438319Z keyutils-1.6.1 | h166bdaf_0 115 KB conda-forge 2025-05-07T19:45:01.5438758Z krb5-1.21.3 | h659f571_0 1.3 MB conda-forge 2025-05-07T19:45:01.5439203Z lcms2-2.17 | h717163a_0 242 KB conda-forge 2025-05-07T19:45:01.5439625Z lerc-4.0.0 | h0aef613_1 258 KB conda-forge 2025-05-07T19:45:01.5440138Z libabseil-20250127.1 | cxx17_hbbce691_0 1.3 MB conda-forge 2025-05-07T19:45:01.5440727Z libcups-2.3.3 | h4637d8d_4 4.3 MB conda-forge 2025-05-07T19:45:01.5441150Z libcurl-8.13.0 | h332b0f4_0 428 KB conda-forge 2025-05-07T19:45:01.5441605Z libdeflate-1.23 | h86f0d12_0 71 KB conda-forge 2025-05-07T19:45:01.5442090Z libedit-3.1.20250104 | pl5321h7949ede_0 132 KB conda-forge 2025-05-07T19:45:01.5442585Z libev-4.33 | hd590300_2 110 KB conda-forge 2025-05-07T19:45:01.5443053Z libexpat-2.7.0 | h5888daf_0 73 KB conda-forge 2025-05-07T19:45:01.5443606Z libfreetype-2.13.3 | ha770c72_1 8 KB conda-forge 2025-05-07T19:45:01.5444098Z libfreetype6-2.13.3 | h48d6fc4_1 371 KB conda-forge 2025-05-07T19:45:01.5444552Z libgfortran-15.1.0 | h69a702a_2 34 KB conda-forge 2025-05-07T19:45:01.5445038Z libgfortran5-15.1.0 | hcea5267_2 1.5 MB conda-forge 2025-05-07T19:45:01.5445481Z libglib-2.84.0 | h2ff4ddf_0 3.8 MB conda-forge 2025-05-07T19:45:01.5445921Z libgrpc-1.71.0 | h8e591d7_1 7.6 MB conda-forge 2025-05-07T19:45:01.5446367Z libiconv-1.18 | h4ce23a2_1 696 KB conda-forge 2025-05-07T19:45:01.5446806Z libjpeg-turbo-3.1.0 | hb9d3cd8_0 614 KB conda-forge 2025-05-07T19:45:01.5447278Z liblzma-5.8.1 | hb9d3cd8_1 110 KB conda-forge 2025-05-07T19:45:01.5447722Z liblzma-devel-5.8.1 | hb9d3cd8_1 431 KB conda-forge 2025-05-07T19:45:01.5448201Z libnghttp2-1.64.0 | h161d5f1_0 632 KB conda-forge 2025-05-07T19:45:01.5448806Z libopenblas-0.3.29 |pthreads_h94d23a6_0 5.6 MB conda-forge 2025-05-07T19:45:01.5449265Z libpng-1.6.47 | h943b412_0 282 KB conda-forge 2025-05-07T19:45:01.5449803Z libprotobuf-5.29.3 | h501fc15_1 3.2 MB conda-forge 2025-05-07T19:45:01.5450513Z libre2-11-2024.07.02 | hba17884_3 205 KB conda-forge 2025-05-07T19:45:01.5451020Z libsqlite-3.49.2 | hee588c1_0 895 KB conda-forge 2025-05-07T19:45:01.5451480Z libssh2-1.11.1 | hcf80075_0 298 KB conda-forge 2025-05-07T19:45:01.5451964Z libtiff-4.7.0 | hd9ff511_4 419 KB conda-forge 2025-05-07T19:45:01.5452446Z libuuid-2.38.1 | h0b41bf4_0 33 KB conda-forge 2025-05-07T19:45:01.5452883Z libuv-1.50.0 | hb9d3cd8_0 870 KB conda-forge 2025-05-07T19:45:01.5453379Z libwebp-base-1.5.0 | h851e524_0 420 KB conda-forge 2025-05-07T19:45:01.5453844Z libxcb-1.17.0 | h8a09558_0 387 KB conda-forge 2025-05-07T19:45:01.5454320Z libzlib-1.3.1 | hb9d3cd8_2 60 KB conda-forge 2025-05-07T19:45:01.5454761Z make-4.4.1 | hb9d3cd8_2 501 KB conda-forge 2025-05-07T19:45:01.5455253Z markupsafe-3.0.2 | py313h8060acc_1 24 KB conda-forge 2025-05-07T19:45:01.5455747Z ncurses-6.5 | h2d0b736_3 871 KB conda-forge 2025-05-07T19:45:01.5456183Z ninja-1.12.1 | hff21bea_1 158 KB conda-forge 2025-05-07T19:45:01.5456682Z openblas-0.3.29 |pthreads_h6ec200e_0 5.8 MB conda-forge 2025-05-07T19:45:01.5457170Z openjdk-23.0.2 | h53dfc1b_2 181.4 MB conda-forge 2025-05-07T19:45:01.5457670Z packaging-25.0 | pyh29332c3_1 61 KB conda-forge 2025-05-07T19:45:01.5458172Z patchelf-0.18.0 | h3f2d84a_2 133 KB conda-forge 2025-05-07T19:45:01.5458618Z pcre2-10.44 | hc749103_2 934 KB conda-forge 2025-05-07T19:45:01.5459086Z pixman-0.46.0 | h29eaf8c_0 389 KB conda-forge 2025-05-07T19:45:01.5459576Z pthread-stubs-0.4 | hb9d3cd8_1002 8 KB conda-forge 2025-05-07T19:45:01.5460072Z pyelftools-0.32 | pyh707e725_1 146 KB conda-forge 2025-05-07T19:45:01.5460581Z python-3.13.2 |hf636f53_101_cp313 31.7 MB conda-forge 2025-05-07T19:45:01.5461082Z pyyaml-6.0.2 | py313h8060acc_2 201 KB conda-forge 2025-05-07T19:45:01.5461530Z re2-2024.07.02 | h9925aae_3 26 KB conda-forge 2025-05-07T19:45:01.5461991Z rhash-1.4.5 | hb9d3cd8_0 183 KB conda-forge 2025-05-07T19:45:01.5462461Z scikit-build-0.18.1 | pyhae55e72_2 114 KB conda-forge 2025-05-07T19:45:01.5463065Z singlejar-7.5.0 | h0e684df_1 122 KB conda-forge 2025-05-07T19:45:01.5463542Z sortedcontainers-2.4.0 | pyhd8ed1ab_1 28 KB conda-forge 2025-05-07T19:45:01.5464051Z sqlite-3.49.2 | h9eae976_0 840 KB conda-forge 2025-05-07T19:45:01.5464495Z tk-8.6.13 |noxft_h4845f30_101 3.2 MB conda-forge 2025-05-07T19:45:01.5464903Z tomli-2.2.1 | pyhd8ed1ab_1 19 KB conda-forge 2025-05-07T19:45:01.5465362Z wheel-0.45.1 | pyhd8ed1ab_1 61 KB conda-forge 2025-05-07T19:45:01.5465790Z xorg-libice-1.1.2 | hb9d3cd8_0 57 KB conda-forge 2025-05-07T19:45:01.5466269Z xorg-libsm-1.2.6 | he73a12e_0 27 KB conda-forge 2025-05-07T19:45:01.5466746Z xorg-libx11-1.8.12 | h4f16b4b_0 816 KB conda-forge 2025-05-07T19:45:01.5468320Z xorg-libxau-1.0.12 | hb9d3cd8_0 14 KB conda-forge 2025-05-07T19:45:01.5468824Z xorg-libxdmcp-1.1.5 | hb9d3cd8_0 19 KB conda-forge 2025-05-07T19:45:01.5469364Z xorg-libxext-1.3.6 | hb9d3cd8_0 49 KB conda-forge 2025-05-07T19:45:01.5469867Z xorg-libxfixes-6.0.1 | hb9d3cd8_0 19 KB conda-forge 2025-05-07T19:45:01.5470328Z xorg-libxi-1.8.2 | hb9d3cd8_0 46 KB conda-forge 2025-05-07T19:45:01.5470815Z xorg-libxrandr-1.5.4 | hb9d3cd8_0 29 KB conda-forge 2025-05-07T19:45:01.5471331Z xorg-libxrender-0.9.12 | hb9d3cd8_0 32 KB conda-forge 2025-05-07T19:45:01.5471792Z xorg-libxt-1.3.1 | hb9d3cd8_0 371 KB conda-forge 2025-05-07T19:45:01.5472259Z xorg-libxtst-1.2.5 | hb9d3cd8_3 32 KB conda-forge 2025-05-07T19:45:01.5472681Z xz-5.8.1 | hbcc6ac9_1 23 KB conda-forge 2025-05-07T19:45:01.5473125Z xz-gpl-tools-5.8.1 | hbcc6ac9_1 33 KB conda-forge 2025-05-07T19:45:01.5473555Z xz-tools-5.8.1 | hb9d3cd8_1 94 KB conda-forge 2025-05-07T19:45:01.5473989Z yaml-0.2.5 | h7f98852_2 87 KB conda-forge 2025-05-07T19:45:01.5474397Z zlib-1.3.1 | hb9d3cd8_2 90 KB conda-forge 2025-05-07T19:45:01.5474768Z zstd-1.5.7 | hb8e6e7a_2 554 KB conda-forge 2025-05-07T19:45:01.5475181Z ------------------------------------------------------------ 2025-05-07T19:45:01.5475530Z Total: 351.6 MB 2025-05-07T19:45:01.5475778Z 2025-05-07T19:45:01.5475931Z The following NEW packages will be INSTALLED: 2025-05-07T19:45:01.5476159Z 2025-05-07T19:45:01.5476389Z alsa-lib conda-forge/linux-64::alsa-lib-1.2.14-hb9d3cd8_0 2025-05-07T19:45:01.5476858Z attrs conda-forge/noarch::attrs-25.3.0-pyh71513ae_0 2025-05-07T19:45:01.5477339Z auditwheel conda-forge/noarch::auditwheel-6.2.0-pyha804496_1 2025-05-07T19:45:01.5477794Z bazel conda-forge/linux-64::bazel-7.5.0-h96810dc_2 2025-05-07T19:45:01.5478247Z c-ares conda-forge/linux-64::c-ares-1.34.5-hb9d3cd8_0 2025-05-07T19:45:01.5478668Z cairo conda-forge/linux-64::cairo-1.18.4-h3394656_0 2025-05-07T19:45:01.5479117Z click conda-forge/noarch::click-8.1.8-pyh707e725_0 2025-05-07T19:45:01.5479538Z cmake conda-forge/linux-64::cmake-4.0.2-h74e3db0_0 2025-05-07T19:45:01.5480000Z distro conda-forge/noarch::distro-1.9.0-pyhd8ed1ab_1 2025-05-07T19:45:01.5480532Z exceptiongroup conda-forge/noarch::exceptiongroup-1.2.2-pyhd8ed1ab_1 2025-05-07T19:45:01.5481132Z font-ttf-dejavu-s~ conda-forge/noarch::font-ttf-dejavu-sans-mono-2.37-hab24e00_0 2025-05-07T19:45:01.5481794Z font-ttf-inconsol~ conda-forge/noarch::font-ttf-inconsolata-3.000-h77eed37_0 2025-05-07T19:45:01.5482406Z font-ttf-source-c~ conda-forge/noarch::font-ttf-source-code-pro-2.038-h77eed37_0 2025-05-07T19:45:01.5483019Z font-ttf-ubuntu conda-forge/noarch::font-ttf-ubuntu-0.83-h77eed37_3 2025-05-07T19:45:01.5483800Z fontconfig conda-forge/linux-64::fontconfig-2.15.0-h7e30c49_1 2025-05-07T19:45:01.5484342Z fonts-conda-ecosy~ conda-forge/noarch::fonts-conda-ecosystem-1-0 2025-05-07T19:45:01.5484917Z fonts-conda-forge conda-forge/noarch::fonts-conda-forge-1-0 2025-05-07T19:45:01.5485422Z freetype conda-forge/linux-64::freetype-2.13.3-ha770c72_1 2025-05-07T19:45:01.5485930Z giflib conda-forge/linux-64::giflib-5.2.2-hd590300_0 2025-05-07T19:45:01.5486455Z graphite2 conda-forge/linux-64::graphite2-1.3.13-h59595ed_1003 2025-05-07T19:45:01.5486953Z harfbuzz conda-forge/linux-64::harfbuzz-11.0.0-h76408a6_0 2025-05-07T19:45:01.5487598Z hypothesis conda-forge/noarch::hypothesis-6.131.14-pyha770c72_0 2025-05-07T19:45:01.5488075Z icu conda-forge/linux-64::icu-75.1-he02047a_0 2025-05-07T19:45:01.5488519Z ijar conda-forge/linux-64::ijar-7.5.0-h5888daf_0 2025-05-07T19:45:01.5489048Z jinja2 conda-forge/noarch::jinja2-3.1.6-pyhd8ed1ab_0 2025-05-07T19:45:01.5489512Z keyutils conda-forge/linux-64::keyutils-1.6.1-h166bdaf_0 2025-05-07T19:45:01.5490061Z krb5 conda-forge/linux-64::krb5-1.21.3-h659f571_0 2025-05-07T19:45:01.5490654Z lcms2 conda-forge/linux-64::lcms2-2.17-h717163a_0 2025-05-07T19:45:01.5491120Z lerc conda-forge/linux-64::lerc-4.0.0-h0aef613_1 2025-05-07T19:45:01.5491662Z libabseil conda-forge/linux-64::libabseil-20250127.1-cxx17_hbbce691_0 2025-05-07T19:45:01.5492195Z libcups conda-forge/linux-64::libcups-2.3.3-h4637d8d_4 2025-05-07T19:45:01.5492700Z libcurl conda-forge/linux-64::libcurl-8.13.0-h332b0f4_0 2025-05-07T19:45:01.5493200Z libdeflate conda-forge/linux-64::libdeflate-1.23-h86f0d12_0 2025-05-07T19:45:01.5493764Z libedit conda-forge/linux-64::libedit-3.1.20250104-pl5321h7949ede_0 2025-05-07T19:45:01.5494287Z libev conda-forge/linux-64::libev-4.33-hd590300_2 2025-05-07T19:45:01.5494748Z libexpat conda-forge/linux-64::libexpat-2.7.0-h5888daf_0 2025-05-07T19:45:01.5495291Z libfreetype conda-forge/linux-64::libfreetype-2.13.3-ha770c72_1 2025-05-07T19:45:01.5495842Z libfreetype6 conda-forge/linux-64::libfreetype6-2.13.3-h48d6fc4_1 2025-05-07T19:45:01.5496419Z libgfortran conda-forge/linux-64::libgfortran-15.1.0-h69a702a_2 2025-05-07T19:45:01.5496986Z libgfortran5 conda-forge/linux-64::libgfortran5-15.1.0-hcea5267_2 2025-05-07T19:45:01.5497498Z libglib conda-forge/linux-64::libglib-2.84.0-h2ff4ddf_0 2025-05-07T19:45:01.5497998Z libgrpc conda-forge/linux-64::libgrpc-1.71.0-h8e591d7_1 2025-05-07T19:45:01.5498484Z libiconv conda-forge/linux-64::libiconv-1.18-h4ce23a2_1 2025-05-07T19:45:01.5499033Z libjpeg-turbo conda-forge/linux-64::libjpeg-turbo-3.1.0-hb9d3cd8_0 2025-05-07T19:45:01.5499556Z liblzma conda-forge/linux-64::liblzma-5.8.1-hb9d3cd8_1 2025-05-07T19:45:01.5500096Z liblzma-devel conda-forge/linux-64::liblzma-devel-5.8.1-hb9d3cd8_1 2025-05-07T19:45:01.5500666Z libnghttp2 conda-forge/linux-64::libnghttp2-1.64.0-h161d5f1_0 2025-05-07T19:45:01.5501234Z libopenblas conda-forge/linux-64::libopenblas-0.3.29-pthreads_h94d23a6_0 2025-05-07T19:45:01.5501800Z libpng conda-forge/linux-64::libpng-1.6.47-h943b412_0 2025-05-07T19:45:01.5502298Z libprotobuf conda-forge/linux-64::libprotobuf-5.29.3-h501fc15_1 2025-05-07T19:45:01.5502959Z libre2-11 conda-forge/linux-64::libre2-11-2024.07.02-hba17884_3 2025-05-07T19:45:01.5503481Z libsqlite conda-forge/linux-64::libsqlite-3.49.2-hee588c1_0 2025-05-07T19:45:01.5503962Z libssh2 conda-forge/linux-64::libssh2-1.11.1-hcf80075_0 2025-05-07T19:45:01.5504456Z libtiff conda-forge/linux-64::libtiff-4.7.0-hd9ff511_4 2025-05-07T19:45:01.5504907Z libuv conda-forge/linux-64::libuv-1.50.0-hb9d3cd8_0 2025-05-07T19:45:01.5505420Z libwebp-base conda-forge/linux-64::libwebp-base-1.5.0-h851e524_0 2025-05-07T19:45:01.5505948Z libxcb conda-forge/linux-64::libxcb-1.17.0-h8a09558_0 2025-05-07T19:45:01.5506407Z libzlib conda-forge/linux-64::libzlib-1.3.1-hb9d3cd8_2 2025-05-07T19:45:01.5506889Z make conda-forge/linux-64::make-4.4.1-hb9d3cd8_2 2025-05-07T19:45:01.5507376Z markupsafe conda-forge/linux-64::markupsafe-3.0.2-py313h8060acc_1 2025-05-07T19:45:01.5507997Z ninja conda-forge/linux-64::ninja-1.12.1-hff21bea_1 2025-05-07T19:45:01.5508506Z openblas conda-forge/linux-64::openblas-0.3.29-pthreads_h6ec200e_0 2025-05-07T19:45:01.5509082Z openjdk conda-forge/linux-64::openjdk-23.0.2-h53dfc1b_2 2025-05-07T19:45:01.5509580Z packaging conda-forge/noarch::packaging-25.0-pyh29332c3_1 2025-05-07T19:45:01.5510049Z patchelf conda-forge/linux-64::patchelf-0.18.0-h3f2d84a_2 2025-05-07T19:45:01.5510587Z pcre2 conda-forge/linux-64::pcre2-10.44-hc749103_2 2025-05-07T19:45:01.5511039Z pixman conda-forge/linux-64::pixman-0.46.0-h29eaf8c_0 2025-05-07T19:45:01.5511689Z pthread-stubs conda-forge/linux-64::pthread-stubs-0.4-hb9d3cd8_1002 2025-05-07T19:45:01.5512258Z pyelftools conda-forge/noarch::pyelftools-0.32-pyh707e725_1 2025-05-07T19:45:01.5512759Z pyyaml conda-forge/linux-64::pyyaml-6.0.2-py313h8060acc_2 2025-05-07T19:45:01.5513250Z re2 conda-forge/linux-64::re2-2024.07.02-h9925aae_3 2025-05-07T19:45:01.5513707Z rhash conda-forge/linux-64::rhash-1.4.5-hb9d3cd8_0 2025-05-07T19:45:01.5514371Z scikit-build conda-forge/noarch::scikit-build-0.18.1-pyhae55e72_2 2025-05-07T19:45:01.5515049Z singlejar conda-forge/linux-64::singlejar-7.5.0-h0e684df_1 2025-05-07T19:45:01.5515676Z sortedcontainers conda-forge/noarch::sortedcontainers-2.4.0-pyhd8ed1ab_1 2025-05-07T19:45:01.5516346Z tomli conda-forge/noarch::tomli-2.2.1-pyhd8ed1ab_1 2025-05-07T19:45:01.5516835Z xorg-libice conda-forge/linux-64::xorg-libice-1.1.2-hb9d3cd8_0 2025-05-07T19:45:01.5517378Z xorg-libsm conda-forge/linux-64::xorg-libsm-1.2.6-he73a12e_0 2025-05-07T19:45:01.5517921Z xorg-libx11 conda-forge/linux-64::xorg-libx11-1.8.12-h4f16b4b_0 2025-05-07T19:45:01.5518446Z xorg-libxau conda-forge/linux-64::xorg-libxau-1.0.12-hb9d3cd8_0 2025-05-07T19:45:01.5519053Z xorg-libxdmcp conda-forge/linux-64::xorg-libxdmcp-1.1.5-hb9d3cd8_0 2025-05-07T19:45:01.5519876Z xorg-libxext conda-forge/linux-64::xorg-libxext-1.3.6-hb9d3cd8_0 2025-05-07T19:45:01.5520484Z xorg-libxfixes conda-forge/linux-64::xorg-libxfixes-6.0.1-hb9d3cd8_0 2025-05-07T19:45:01.5521053Z xorg-libxi conda-forge/linux-64::xorg-libxi-1.8.2-hb9d3cd8_0 2025-05-07T19:45:01.5521592Z xorg-libxrandr conda-forge/linux-64::xorg-libxrandr-1.5.4-hb9d3cd8_0 2025-05-07T19:45:01.5522214Z xorg-libxrender conda-forge/linux-64::xorg-libxrender-0.9.12-hb9d3cd8_0 2025-05-07T19:45:01.5522770Z xorg-libxt conda-forge/linux-64::xorg-libxt-1.3.1-hb9d3cd8_0 2025-05-07T19:45:01.5523326Z xorg-libxtst conda-forge/linux-64::xorg-libxtst-1.2.5-hb9d3cd8_3 2025-05-07T19:45:01.5523892Z xz-gpl-tools conda-forge/linux-64::xz-gpl-tools-5.8.1-hbcc6ac9_1 2025-05-07T19:45:01.5524397Z xz-tools conda-forge/linux-64::xz-tools-5.8.1-hb9d3cd8_1 2025-05-07T19:45:01.5524879Z yaml conda-forge/linux-64::yaml-0.2.5-h7f98852_2 2025-05-07T19:45:01.5525302Z zstd conda-forge/linux-64::zstd-1.5.7-hb8e6e7a_2 2025-05-07T19:45:01.5525602Z 2025-05-07T19:45:01.5525730Z The following packages will be UPDATED: 2025-05-07T19:45:01.5525957Z 2025-05-07T19:45:01.5526284Z libuuid pkgs/main::libuuid-1.41.5-h5eee18b_0 --> conda-forge::libuuid-2.38.1-h0b41bf4_0 2025-05-07T19:45:01.5526973Z ncurses pkgs/main::ncurses-6.4-h6a678d5_0 --> conda-forge::ncurses-6.5-h2d0b736_3 2025-05-07T19:45:01.5527691Z python pkgs/main::python-3.13.2-hf623796_100~ --> conda-forge::python-3.13.2-hf636f53_101_cp313 2025-05-07T19:45:01.5528409Z sqlite pkgs/main::sqlite-3.45.3-h5eee18b_0 --> conda-forge::sqlite-3.49.2-h9eae976_0 2025-05-07T19:45:01.5529422Z wheel pkgs/main/linux-64::wheel-0.45.1-py31~ --> conda-forge/noarch::wheel-0.45.1-pyhd8ed1ab_1 2025-05-07T19:45:01.5530165Z xz pkgs/main::xz-5.6.4-h5eee18b_1 --> conda-forge::xz-5.8.1-hbcc6ac9_1 2025-05-07T19:45:01.5530918Z zlib pkgs/main::zlib-1.2.13-h5eee18b_1 --> conda-forge::zlib-1.3.1-hb9d3cd8_2 2025-05-07T19:45:01.5531312Z 2025-05-07T19:45:01.5531718Z The following packages will be SUPERSEDED by a higher-priority channel: 2025-05-07T19:45:01.5532070Z 2025-05-07T19:45:01.5532358Z expat pkgs/main::expat-2.7.1-h6a678d5_0 --> conda-forge::expat-2.7.0-h5888daf_0 2025-05-07T19:45:01.5533082Z tk pkgs/main::tk-8.6.14-h39e8969_0 --> conda-forge::tk-8.6.13-noxft_h4845f30_101 2025-05-07T19:45:01.5533468Z 2025-05-07T19:45:01.5533496Z 2025-05-07T19:45:01.5533499Z 2025-05-07T19:45:01.5533660Z Downloading and Extracting Packages: ...working... 2025-05-07T19:45:01.5534102Z openjdk-23.0.2 | 181.4 MB | | 0% 2025-05-07T19:45:01.5534356Z 2025-05-07T19:45:01.5534802Z bazel-7.5.0 | 47.4 MB | | 0%  2025-05-07T19:45:01.5535084Z 2025-05-07T19:45:01.5535087Z 2025-05-07T19:45:01.5541960Z python-3.13.2 | 31.7 MB | | 0%  2025-05-07T19:45:01.5542224Z 2025-05-07T19:45:01.5542228Z 2025-05-07T19:45:01.5542231Z 2025-05-07T19:45:01.5561412Z cmake-4.0.2 | 19.4 MB | | 0%  2025-05-07T19:45:01.5561762Z 2025-05-07T19:45:01.5561829Z 2025-05-07T19:45:01.5561834Z 2025-05-07T19:45:01.5561848Z 2025-05-07T19:45:01.5568714Z icu-75.1 | 11.6 MB | | 0%  2025-05-07T19:45:01.5568991Z 2025-05-07T19:45:01.5568995Z 2025-05-07T19:45:01.5568998Z 2025-05-07T19:45:01.5569002Z 2025-05-07T19:45:01.5569005Z 2025-05-07T19:45:01.5572007Z libgrpc-1.71.0 | 7.6 MB | | 0%  2025-05-07T19:45:01.5572325Z 2025-05-07T19:45:01.5572334Z 2025-05-07T19:45:01.5572337Z 2025-05-07T19:45:01.5572341Z 2025-05-07T19:45:01.5572345Z 2025-05-07T19:45:01.5572348Z 2025-05-07T19:45:01.5573110Z openblas-0.3.29 | 5.8 MB | | 0%  2025-05-07T19:45:01.5573435Z 2025-05-07T19:45:01.5573439Z 2025-05-07T19:45:01.5573442Z 2025-05-07T19:45:01.5573445Z 2025-05-07T19:45:01.5573449Z 2025-05-07T19:45:01.5573452Z 2025-05-07T19:45:01.5573456Z 2025-05-07T19:45:01.5574243Z libopenblas-0.3.29 | 5.6 MB | | 0%  2025-05-07T19:45:01.5574583Z 2025-05-07T19:45:01.5574587Z 2025-05-07T19:45:01.5574590Z 2025-05-07T19:45:01.5574600Z 2025-05-07T19:45:01.5574603Z 2025-05-07T19:45:01.5574606Z 2025-05-07T19:45:01.5574610Z 2025-05-07T19:45:01.5574613Z 2025-05-07T19:45:01.5575339Z libcups-2.3.3 | 4.3 MB | | 0%  2025-05-07T19:45:01.5575658Z 2025-05-07T19:45:01.5575661Z 2025-05-07T19:45:01.5575665Z 2025-05-07T19:45:01.5575668Z 2025-05-07T19:45:01.5575672Z 2025-05-07T19:45:01.5575675Z 2025-05-07T19:45:01.5575678Z 2025-05-07T19:45:01.5575682Z 2025-05-07T19:45:01.5575685Z 2025-05-07T19:45:01.5576438Z libglib-2.84.0 | 3.8 MB | | 0%  2025-05-07T19:45:01.5576751Z 2025-05-07T19:45:01.5576760Z 2025-05-07T19:45:01.5576764Z 2025-05-07T19:45:01.5576767Z 2025-05-07T19:45:01.5576771Z 2025-05-07T19:45:01.5576774Z 2025-05-07T19:45:01.5576777Z 2025-05-07T19:45:01.5576787Z 2025-05-07T19:45:01.5576790Z 2025-05-07T19:45:01.5576793Z 2025-05-07T19:45:01.5578625Z libprotobuf-5.29.3 | 3.2 MB | | 0%  2025-05-07T19:45:01.5578978Z 2025-05-07T19:45:01.5578989Z 2025-05-07T19:45:01.5578992Z 2025-05-07T19:45:01.5578996Z 2025-05-07T19:45:01.5578999Z 2025-05-07T19:45:01.5579002Z 2025-05-07T19:45:01.5579006Z 2025-05-07T19:45:01.5579009Z 2025-05-07T19:45:01.5579012Z 2025-05-07T19:45:01.5579016Z 2025-05-07T19:45:01.5579019Z 2025-05-07T19:45:01.5582827Z tk-8.6.13 | 3.2 MB | | 0%  2025-05-07T19:45:01.5583127Z 2025-05-07T19:45:01.5583131Z 2025-05-07T19:45:01.5583134Z 2025-05-07T19:45:01.5583138Z 2025-05-07T19:45:01.5583142Z 2025-05-07T19:45:01.5583145Z 2025-05-07T19:45:01.5583148Z 2025-05-07T19:45:01.5583152Z 2025-05-07T19:45:01.5583155Z 2025-05-07T19:45:01.5583159Z 2025-05-07T19:45:01.5583162Z 2025-05-07T19:45:01.5583165Z 2025-05-07T19:45:01.5583585Z harfbuzz-11.0.0 | 1.6 MB | | 0%  2025-05-07T19:45:01.5583902Z 2025-05-07T19:45:01.5583905Z 2025-05-07T19:45:01.5583909Z 2025-05-07T19:45:01.5584004Z 2025-05-07T19:45:01.5584008Z 2025-05-07T19:45:01.5584013Z 2025-05-07T19:45:01.5584017Z 2025-05-07T19:45:01.5584022Z 2025-05-07T19:45:01.5584026Z 2025-05-07T19:45:01.5584071Z 2025-05-07T19:45:01.5584075Z 2025-05-07T19:45:01.5584080Z 2025-05-07T19:45:01.5584084Z 2025-05-07T19:45:01.5584467Z font-ttf-ubuntu-0.83 | 1.5 MB | | 0%  2025-05-07T19:45:01.5584940Z 2025-05-07T19:45:01.5584947Z 2025-05-07T19:45:01.5584992Z 2025-05-07T19:45:01.5584998Z 2025-05-07T19:45:01.5585004Z 2025-05-07T19:45:01.5585010Z 2025-05-07T19:45:01.5585016Z 2025-05-07T19:45:01.5585022Z 2025-05-07T19:45:01.5585028Z 2025-05-07T19:45:01.5585034Z 2025-05-07T19:45:01.5585040Z 2025-05-07T19:45:01.5585045Z 2025-05-07T19:45:01.5585051Z 2025-05-07T19:45:01.5585057Z 2025-05-07T19:45:01.5585391Z libgfortran5-15.1.0 | 1.5 MB | | 0%  2025-05-07T19:45:01.5585763Z 2025-05-07T19:45:01.5585771Z 2025-05-07T19:45:01.5585785Z 2025-05-07T19:45:01.5585790Z 2025-05-07T19:45:01.5585796Z 2025-05-07T19:45:01.5585802Z 2025-05-07T19:45:01.5585808Z 2025-05-07T19:45:01.5585815Z 2025-05-07T19:45:01.5585821Z 2025-05-07T19:45:01.5585827Z 2025-05-07T19:45:01.5585832Z 2025-05-07T19:45:01.5585837Z 2025-05-07T19:45:01.5585843Z 2025-05-07T19:45:01.5585848Z 2025-05-07T19:45:01.5585853Z 2025-05-07T19:45:01.5586192Z krb5-1.21.3 | 1.3 MB | | 0%  2025-05-07T19:45:01.5586505Z 2025-05-07T19:45:01.5586509Z 2025-05-07T19:45:01.5586512Z 2025-05-07T19:45:01.5586532Z 2025-05-07T19:45:01.5586535Z 2025-05-07T19:45:01.5586539Z 2025-05-07T19:45:01.5586542Z 2025-05-07T19:45:01.5586545Z 2025-05-07T19:45:01.5586549Z 2025-05-07T19:45:01.5586552Z 2025-05-07T19:45:01.5586560Z 2025-05-07T19:45:01.5586563Z 2025-05-07T19:45:01.5586593Z 2025-05-07T19:45:01.5586596Z 2025-05-07T19:45:01.5586600Z 2025-05-07T19:45:01.5586603Z 2025-05-07T19:45:01.5586936Z libabseil-20250127.1 | 1.3 MB | | 0%  2025-05-07T19:45:01.5587287Z 2025-05-07T19:45:01.5587290Z 2025-05-07T19:45:01.5587294Z 2025-05-07T19:45:01.5587297Z 2025-05-07T19:45:01.5587301Z 2025-05-07T19:45:01.5587304Z 2025-05-07T19:45:01.5587335Z 2025-05-07T19:45:01.5587339Z 2025-05-07T19:45:01.5587342Z 2025-05-07T19:45:01.5587345Z 2025-05-07T19:45:01.5587349Z 2025-05-07T19:45:01.5587352Z 2025-05-07T19:45:01.5587355Z 2025-05-07T19:45:01.5587359Z 2025-05-07T19:45:01.5587362Z 2025-05-07T19:45:01.5587366Z 2025-05-07T19:45:01.5587369Z 2025-05-07T19:45:01.5587679Z cairo-1.18.4 | 955 KB | | 0%  2025-05-07T19:45:01.5588050Z 2025-05-07T19:45:01.5588053Z 2025-05-07T19:45:01.5588056Z 2025-05-07T19:45:01.5588065Z 2025-05-07T19:45:01.5588069Z 2025-05-07T19:45:01.5588072Z 2025-05-07T19:45:01.5588076Z 2025-05-07T19:45:01.5588079Z 2025-05-07T19:45:01.5588082Z 2025-05-07T19:45:01.5588089Z 2025-05-07T19:45:01.5588092Z 2025-05-07T19:45:01.5588096Z 2025-05-07T19:45:01.5588099Z 2025-05-07T19:45:01.5588102Z 2025-05-07T19:45:01.5588106Z 2025-05-07T19:45:01.5588109Z 2025-05-07T19:45:01.5588112Z 2025-05-07T19:45:01.5588116Z 2025-05-07T19:45:01.5588446Z pcre2-10.44 | 934 KB | | 0%  2025-05-07T19:45:01.5588751Z 2025-05-07T19:45:01.5588754Z 2025-05-07T19:45:01.5588758Z 2025-05-07T19:45:01.5588761Z 2025-05-07T19:45:01.5588764Z 2025-05-07T19:45:01.5588768Z 2025-05-07T19:45:01.5588771Z 2025-05-07T19:45:01.5588774Z 2025-05-07T19:45:01.5588778Z 2025-05-07T19:45:01.5588781Z 2025-05-07T19:45:01.5588785Z 2025-05-07T19:45:01.5588788Z 2025-05-07T19:45:01.5588818Z 2025-05-07T19:45:01.5588822Z 2025-05-07T19:45:01.5588919Z 2025-05-07T19:45:01.5588923Z 2025-05-07T19:45:01.5588926Z 2025-05-07T19:45:01.5588930Z 2025-05-07T19:45:01.5588933Z 2025-05-07T19:45:01.9677507Z ... (more hidden) ... 2025-05-07T19:45:01.9678105Z 2025-05-07T19:45:01.9678111Z 2025-05-07T19:45:01.9678115Z 2025-05-07T19:45:01.9678120Z 2025-05-07T19:45:01.9678415Z icu-75.1 | 11.6 MB | | 0%  2025-05-07T19:45:01.9678696Z 2025-05-07T19:45:01.9678699Z 2025-05-07T19:45:01.9752947Z python-3.13.2 | 31.7 MB | | 0%  2025-05-07T19:45:02.0234549Z openjdk-23.0.2 | 181.4 MB | | 0% 2025-05-07T19:45:02.0234865Z 2025-05-07T19:45:02.0234871Z 2025-05-07T19:45:02.0234878Z 2025-05-07T19:45:02.0472369Z cmake-4.0.2 | 19.4 MB | | 0%  2025-05-07T19:45:02.0472662Z 2025-05-07T19:45:02.0680806Z bazel-7.5.0 | 47.4 MB | | 0%  2025-05-07T19:45:02.0681080Z 2025-05-07T19:45:02.0681128Z 2025-05-07T19:45:02.0761675Z python-3.13.2 | 31.7 MB | ##7 | 28%  2025-05-07T19:45:02.1001557Z openjdk-23.0.2 | 181.4 MB | 5 | 5% 2025-05-07T19:45:02.1002394Z 2025-05-07T19:45:02.1002407Z 2025-05-07T19:45:02.1002418Z 2025-05-07T19:45:02.1002428Z 2025-05-07T19:45:02.1003188Z icu-75.1 | 11.6 MB | ########## | 100%  2025-05-07T19:45:02.1003919Z 2025-05-07T19:45:02.1003930Z 2025-05-07T19:45:02.1003940Z 2025-05-07T19:45:02.1003951Z 2025-05-07T19:45:02.1234524Z icu-75.1 | 11.6 MB | ########## | 100%  2025-05-07T19:45:02.1234799Z 2025-05-07T19:45:02.1234980Z 2025-05-07T19:45:02.1234985Z 2025-05-07T19:45:02.1449983Z cmake-4.0.2 | 19.4 MB | ##1 | 21%  2025-05-07T19:45:02.1450379Z 2025-05-07T19:45:02.1450386Z 2025-05-07T19:45:02.1450393Z 2025-05-07T19:45:02.1450399Z 2025-05-07T19:45:02.1450405Z 2025-05-07T19:45:02.1534332Z libgrpc-1.71.0 | 7.6 MB | | 0%  2025-05-07T19:45:02.1534657Z 2025-05-07T19:45:02.1681677Z bazel-7.5.0 | 47.4 MB | 3 | 4%  2025-05-07T19:45:02.1682546Z 2025-05-07T19:45:02.1682573Z 2025-05-07T19:45:02.1763203Z python-3.13.2 | 31.7 MB | #####5 | 55%  2025-05-07T19:45:02.2237237Z openjdk-23.0.2 | 181.4 MB | # | 10% 2025-05-07T19:45:02.2238030Z 2025-05-07T19:45:02.2238044Z 2025-05-07T19:45:02.2238054Z 2025-05-07T19:45:02.2634428Z cmake-4.0.2 | 19.4 MB | ##### | 51%  2025-05-07T19:45:02.2635250Z 2025-05-07T19:45:02.2635263Z 2025-05-07T19:45:02.2635274Z 2025-05-07T19:45:02.2635284Z 2025-05-07T19:45:02.2635294Z 2025-05-07T19:45:02.2636017Z libgrpc-1.71.0 | 7.6 MB | ########## | 100%  2025-05-07T19:45:02.2636809Z 2025-05-07T19:45:02.2636819Z 2025-05-07T19:45:02.2636829Z 2025-05-07T19:45:02.2636839Z 2025-05-07T19:45:02.2636850Z 2025-05-07T19:45:02.2694370Z libgrpc-1.71.0 | 7.6 MB | ########## | 100%  2025-05-07T19:45:02.2695247Z 2025-05-07T19:45:02.2695545Z 2025-05-07T19:45:02.2762368Z python-3.13.2 | 31.7 MB | ########3 | 84%  2025-05-07T19:45:02.3062635Z openjdk-23.0.2 | 181.4 MB | #4 | 14% 2025-05-07T19:45:02.3063496Z 2025-05-07T19:45:02.3063512Z 2025-05-07T19:45:02.3063519Z 2025-05-07T19:45:02.3063526Z 2025-05-07T19:45:02.3063533Z 2025-05-07T19:45:02.3063644Z 2025-05-07T19:45:02.3281118Z openblas-0.3.29 | 5.8 MB | | 0%  2025-05-07T19:45:02.3281462Z 2025-05-07T19:45:02.3281468Z 2025-05-07T19:45:02.3281475Z 2025-05-07T19:45:02.3619491Z cmake-4.0.2 | 19.4 MB | ########7 | 87%  2025-05-07T19:45:02.3619807Z 2025-05-07T19:45:02.3858893Z bazel-7.5.0 | 47.4 MB | 5 | 6%  2025-05-07T19:45:02.4067079Z openjdk-23.0.2 | 181.4 MB | #8 | 18% 2025-05-07T19:45:02.4067882Z 2025-05-07T19:45:02.4067896Z 2025-05-07T19:45:02.4067907Z 2025-05-07T19:45:02.4068453Z 2025-05-07T19:45:02.4068471Z 2025-05-07T19:45:02.4068482Z 2025-05-07T19:45:02.4650009Z openblas-0.3.29 | 5.8 MB | ######## | 81%  2025-05-07T19:45:02.4650632Z 2025-05-07T19:45:02.4828108Z bazel-7.5.0 | 47.4 MB | # | 11%  2025-05-07T19:45:02.4828775Z 2025-05-07T19:45:02.4828809Z 2025-05-07T19:45:02.4828815Z 2025-05-07T19:45:02.4828823Z 2025-05-07T19:45:02.4828830Z 2025-05-07T19:45:02.4828837Z 2025-05-07T19:45:02.4861084Z openblas-0.3.29 | 5.8 MB | ########## | 100%  2025-05-07T19:45:02.5272983Z openjdk-23.0.2 | 181.4 MB | ##3 | 23% 2025-05-07T19:45:02.5273297Z 2025-05-07T19:45:02.5273494Z 2025-05-07T19:45:02.5273504Z 2025-05-07T19:45:02.5273510Z 2025-05-07T19:45:02.5273524Z 2025-05-07T19:45:02.5273529Z 2025-05-07T19:45:02.5273535Z 2025-05-07T19:45:02.5651928Z libopenblas-0.3.29 | 5.6 MB | | 0%  2025-05-07T19:45:02.5652270Z 2025-05-07T19:45:02.6459189Z bazel-7.5.0 | 47.4 MB | ##2 | 23%  2025-05-07T19:45:02.6613362Z openjdk-23.0.2 | 181.4 MB | ##7 | 28% 2025-05-07T19:45:02.6613667Z 2025-05-07T19:45:02.6613702Z 2025-05-07T19:45:02.6613706Z 2025-05-07T19:45:02.6613709Z 2025-05-07T19:45:02.6648765Z icu-75.1 | 11.6 MB | ########## | 100%  2025-05-07T19:45:02.6649060Z 2025-05-07T19:45:02.6649064Z 2025-05-07T19:45:02.6649068Z 2025-05-07T19:45:02.6651810Z cmake-4.0.2 | 19.4 MB | ########## | 100%  2025-05-07T19:45:02.6652084Z 2025-05-07T19:45:02.7012105Z bazel-7.5.0 | 47.4 MB | ###6 | 36%  2025-05-07T19:45:02.7012416Z 2025-05-07T19:45:02.7012421Z 2025-05-07T19:45:02.7012424Z 2025-05-07T19:45:02.7012428Z 2025-05-07T19:45:02.7012432Z 2025-05-07T19:45:02.7012437Z 2025-05-07T19:45:02.7012442Z 2025-05-07T19:45:02.7012717Z libopenblas-0.3.29 | 5.6 MB | ########## | 100%  2025-05-07T19:45:02.7013046Z 2025-05-07T19:45:02.7013079Z 2025-05-07T19:45:02.7013096Z 2025-05-07T19:45:02.7013100Z 2025-05-07T19:45:02.7013103Z 2025-05-07T19:45:02.7013107Z 2025-05-07T19:45:02.7013110Z 2025-05-07T19:45:02.7272757Z libopenblas-0.3.29 | 5.6 MB | ########## | 100%  2025-05-07T19:45:02.7273132Z 2025-05-07T19:45:02.7273137Z 2025-05-07T19:45:02.7273140Z 2025-05-07T19:45:02.7273144Z 2025-05-07T19:45:02.7273147Z 2025-05-07T19:45:02.7273151Z 2025-05-07T19:45:02.7273154Z 2025-05-07T19:45:02.7273157Z 2025-05-07T19:45:02.7373047Z libcups-2.3.3 | 4.3 MB | | 0%  2025-05-07T19:45:02.7373366Z 2025-05-07T19:45:02.7373383Z 2025-05-07T19:45:02.7373387Z 2025-05-07T19:45:02.7373390Z 2025-05-07T19:45:02.7373394Z 2025-05-07T19:45:02.7373397Z 2025-05-07T19:45:02.7373401Z 2025-05-07T19:45:02.7373407Z 2025-05-07T19:45:02.7373413Z 2025-05-07T19:45:02.7465661Z libglib-2.84.0 | 3.8 MB | | 0%  2025-05-07T19:45:02.7653214Z openjdk-23.0.2 | 181.4 MB | ###1 | 32% 2025-05-07T19:45:02.7653506Z 2025-05-07T19:45:02.7931992Z bazel-7.5.0 | 47.4 MB | ####7 | 47%  2025-05-07T19:45:02.7932278Z 2025-05-07T19:45:02.7932923Z 2025-05-07T19:45:02.8349582Z python-3.13.2 | 31.7 MB | ########## | 100%  2025-05-07T19:45:02.8350386Z 2025-05-07T19:45:02.8350399Z 2025-05-07T19:45:02.8350409Z 2025-05-07T19:45:02.8350420Z 2025-05-07T19:45:02.8350430Z 2025-05-07T19:45:02.8350440Z 2025-05-07T19:45:02.8350451Z 2025-05-07T19:45:02.8350461Z 2025-05-07T19:45:02.8350471Z 2025-05-07T19:45:02.8350481Z 2025-05-07T19:45:02.8634229Z libprotobuf-5.29.3 | 3.2 MB | | 0%  2025-05-07T19:45:02.8685317Z openjdk-23.0.2 | 181.4 MB | ###5 | 36% 2025-05-07T19:45:02.8685663Z 2025-05-07T19:45:02.8860212Z bazel-7.5.0 | 47.4 MB | #####6 | 57%  2025-05-07T19:45:02.8860537Z 2025-05-07T19:45:02.8860544Z 2025-05-07T19:45:02.8860572Z 2025-05-07T19:45:02.8860875Z 2025-05-07T19:45:02.8860881Z 2025-05-07T19:45:02.8860886Z 2025-05-07T19:45:02.8860893Z 2025-05-07T19:45:02.8860899Z 2025-05-07T19:45:02.8860918Z 2025-05-07T19:45:02.8861943Z libglib-2.84.0 | 3.8 MB | ########## | 100%  2025-05-07T19:45:02.8862235Z 2025-05-07T19:45:02.8862277Z 2025-05-07T19:45:02.8862281Z 2025-05-07T19:45:02.8862284Z 2025-05-07T19:45:02.8862288Z 2025-05-07T19:45:02.8862291Z 2025-05-07T19:45:02.8862294Z 2025-05-07T19:45:02.8862297Z 2025-05-07T19:45:02.8862301Z 2025-05-07T19:45:02.8969695Z libglib-2.84.0 | 3.8 MB | ########## | 100%  2025-05-07T19:45:02.8970115Z 2025-05-07T19:45:02.8970139Z 2025-05-07T19:45:02.8970146Z 2025-05-07T19:45:02.8970153Z 2025-05-07T19:45:02.8970161Z 2025-05-07T19:45:02.8970167Z 2025-05-07T19:45:02.8970174Z 2025-05-07T19:45:02.8970181Z 2025-05-07T19:45:02.8970447Z libcups-2.3.3 | 4.3 MB | ########## | 100%  2025-05-07T19:45:02.8970761Z 2025-05-07T19:45:02.8970766Z 2025-05-07T19:45:02.8970845Z 2025-05-07T19:45:02.8970854Z 2025-05-07T19:45:02.8970858Z 2025-05-07T19:45:02.8970862Z 2025-05-07T19:45:02.8970881Z 2025-05-07T19:45:02.8970885Z 2025-05-07T19:45:02.9246949Z libcups-2.3.3 | 4.3 MB | ########## | 100%  2025-05-07T19:45:02.9247261Z 2025-05-07T19:45:02.9247557Z 2025-05-07T19:45:02.9247573Z 2025-05-07T19:45:02.9247579Z 2025-05-07T19:45:02.9247585Z 2025-05-07T19:45:02.9247589Z 2025-05-07T19:45:02.9247594Z 2025-05-07T19:45:02.9247598Z 2025-05-07T19:45:02.9247601Z 2025-05-07T19:45:02.9247605Z 2025-05-07T19:45:02.9247608Z 2025-05-07T19:45:02.9514566Z tk-8.6.13 | 3.2 MB | | 0%  2025-05-07T19:45:02.9514944Z 2025-05-07T19:45:02.9514950Z 2025-05-07T19:45:02.9514955Z 2025-05-07T19:45:02.9514960Z 2025-05-07T19:45:02.9514966Z 2025-05-07T19:45:02.9514972Z 2025-05-07T19:45:02.9514977Z 2025-05-07T19:45:02.9514982Z 2025-05-07T19:45:02.9515015Z 2025-05-07T19:45:02.9515019Z 2025-05-07T19:45:02.9515023Z 2025-05-07T19:45:02.9515026Z 2025-05-07T19:45:02.9620682Z harfbuzz-11.0.0 | 1.6 MB | | 1%  2025-05-07T19:45:02.9621071Z 2025-05-07T19:45:02.9621077Z 2025-05-07T19:45:02.9621080Z 2025-05-07T19:45:02.9621084Z 2025-05-07T19:45:02.9621089Z 2025-05-07T19:45:02.9621092Z 2025-05-07T19:45:02.9621096Z 2025-05-07T19:45:02.9621100Z 2025-05-07T19:45:02.9621103Z 2025-05-07T19:45:02.9621106Z 2025-05-07T19:45:02.9621470Z libprotobuf-5.29.3 | 3.2 MB | ########## | 100%  2025-05-07T19:45:02.9621776Z 2025-05-07T19:45:02.9621781Z 2025-05-07T19:45:02.9621793Z 2025-05-07T19:45:02.9621797Z 2025-05-07T19:45:02.9621800Z 2025-05-07T19:45:02.9621804Z 2025-05-07T19:45:02.9621807Z 2025-05-07T19:45:02.9621810Z 2025-05-07T19:45:02.9621814Z 2025-05-07T19:45:02.9623393Z 2025-05-07T19:45:02.9633999Z libprotobuf-5.29.3 | 3.2 MB | ########## | 100%  2025-05-07T19:45:02.9723845Z openjdk-23.0.2 | 181.4 MB | ###9 | 39% 2025-05-07T19:45:02.9724579Z 2025-05-07T19:45:03.0077119Z bazel-7.5.0 | 47.4 MB | ######6 | 66%  2025-05-07T19:45:03.0077427Z 2025-05-07T19:45:03.0077571Z 2025-05-07T19:45:03.0077581Z 2025-05-07T19:45:03.0077609Z 2025-05-07T19:45:03.0077615Z 2025-05-07T19:45:03.0077621Z 2025-05-07T19:45:03.0077628Z 2025-05-07T19:45:03.0077634Z 2025-05-07T19:45:03.0077640Z 2025-05-07T19:45:03.0077645Z 2025-05-07T19:45:03.0077649Z 2025-05-07T19:45:03.0077767Z 2025-05-07T19:45:03.0077779Z 2025-05-07T19:45:03.0152631Z font-ttf-ubuntu-0.83 | 1.5 MB | 1 | 1%  2025-05-07T19:45:03.0153017Z 2025-05-07T19:45:03.0153025Z 2025-05-07T19:45:03.0153031Z 2025-05-07T19:45:03.0153035Z 2025-05-07T19:45:03.0153063Z 2025-05-07T19:45:03.0153070Z 2025-05-07T19:45:03.0153076Z 2025-05-07T19:45:03.0153081Z 2025-05-07T19:45:03.0153086Z 2025-05-07T19:45:03.0153329Z 2025-05-07T19:45:03.0153336Z 2025-05-07T19:45:03.0153340Z 2025-05-07T19:45:03.0471598Z harfbuzz-11.0.0 | 1.6 MB | ########## | 100%  2025-05-07T19:45:03.0472189Z 2025-05-07T19:45:03.0472196Z 2025-05-07T19:45:03.0472200Z 2025-05-07T19:45:03.0472203Z 2025-05-07T19:45:03.0472207Z 2025-05-07T19:45:03.0472210Z 2025-05-07T19:45:03.0472214Z 2025-05-07T19:45:03.0472217Z 2025-05-07T19:45:03.0472221Z 2025-05-07T19:45:03.0472224Z 2025-05-07T19:45:03.0472227Z 2025-05-07T19:45:03.0472489Z tk-8.6.13 | 3.2 MB | ########## | 100%  2025-05-07T19:45:03.0472754Z 2025-05-07T19:45:03.0472758Z 2025-05-07T19:45:03.0472761Z 2025-05-07T19:45:03.0472764Z 2025-05-07T19:45:03.0472768Z 2025-05-07T19:45:03.0472771Z 2025-05-07T19:45:03.0472775Z 2025-05-07T19:45:03.0472779Z 2025-05-07T19:45:03.0472784Z 2025-05-07T19:45:03.0472788Z 2025-05-07T19:45:03.0472792Z 2025-05-07T19:45:03.0575404Z tk-8.6.13 | 3.2 MB | ########## | 100%  2025-05-07T19:45:03.0575693Z 2025-05-07T19:45:03.0575698Z 2025-05-07T19:45:03.0575702Z 2025-05-07T19:45:03.0575713Z 2025-05-07T19:45:03.0575717Z 2025-05-07T19:45:03.0575720Z 2025-05-07T19:45:03.0575733Z 2025-05-07T19:45:03.0575736Z 2025-05-07T19:45:03.0575753Z 2025-05-07T19:45:03.0575756Z 2025-05-07T19:45:03.0575760Z 2025-05-07T19:45:03.0575763Z 2025-05-07T19:45:03.0575767Z 2025-05-07T19:45:03.0659602Z font-ttf-ubuntu-0.83 | 1.5 MB | ########## | 100%  2025-05-07T19:45:03.0660591Z 2025-05-07T19:45:03.0660604Z 2025-05-07T19:45:03.0660615Z 2025-05-07T19:45:03.0660648Z 2025-05-07T19:45:03.0660658Z 2025-05-07T19:45:03.0660669Z 2025-05-07T19:45:03.0660679Z 2025-05-07T19:45:03.0660690Z 2025-05-07T19:45:03.0660700Z 2025-05-07T19:45:03.0660710Z 2025-05-07T19:45:03.0660720Z 2025-05-07T19:45:03.0660730Z 2025-05-07T19:45:03.0660740Z 2025-05-07T19:45:03.0660761Z 2025-05-07T19:45:03.0697815Z libgfortran5-15.1.0 | 1.5 MB | 1 | 1%  2025-05-07T19:45:03.0724238Z openjdk-23.0.2 | 181.4 MB | ####2 | 43% 2025-05-07T19:45:03.0724595Z 2025-05-07T19:45:03.0885858Z bazel-7.5.0 | 47.4 MB | #######7 | 77%  2025-05-07T19:45:03.0886682Z 2025-05-07T19:45:03.0886695Z 2025-05-07T19:45:03.0886706Z 2025-05-07T19:45:03.0886717Z 2025-05-07T19:45:03.0886727Z 2025-05-07T19:45:03.0886737Z 2025-05-07T19:45:03.0886747Z 2025-05-07T19:45:03.0886757Z 2025-05-07T19:45:03.0886768Z 2025-05-07T19:45:03.0886778Z 2025-05-07T19:45:03.0886787Z 2025-05-07T19:45:03.0886797Z 2025-05-07T19:45:03.0886807Z 2025-05-07T19:45:03.0886817Z 2025-05-07T19:45:03.0886827Z 2025-05-07T19:45:03.0890856Z krb5-1.21.3 | 1.3 MB | 1 | 1%  2025-05-07T19:45:03.0891748Z 2025-05-07T19:45:03.0891759Z 2025-05-07T19:45:03.0891769Z 2025-05-07T19:45:03.0891780Z 2025-05-07T19:45:03.0891819Z 2025-05-07T19:45:03.0891830Z 2025-05-07T19:45:03.0891840Z 2025-05-07T19:45:03.0891850Z 2025-05-07T19:45:03.0891860Z 2025-05-07T19:45:03.0891869Z 2025-05-07T19:45:03.0891892Z 2025-05-07T19:45:03.0891903Z 2025-05-07T19:45:03.0891913Z 2025-05-07T19:45:03.0891923Z 2025-05-07T19:45:03.0891933Z 2025-05-07T19:45:03.0893513Z 2025-05-07T19:45:03.0931489Z libabseil-20250127.1 | 1.3 MB | 1 | 1%  2025-05-07T19:45:03.0932537Z 2025-05-07T19:45:03.0932550Z 2025-05-07T19:45:03.0932561Z 2025-05-07T19:45:03.0932571Z 2025-05-07T19:45:03.0932581Z 2025-05-07T19:45:03.1225043Z libgrpc-1.71.0 | 7.6 MB | ########## | 100%  2025-05-07T19:45:03.1225362Z 2025-05-07T19:45:03.1225471Z 2025-05-07T19:45:03.1225475Z 2025-05-07T19:45:03.1225564Z 2025-05-07T19:45:03.1225573Z 2025-05-07T19:45:03.1225577Z 2025-05-07T19:45:03.1225582Z 2025-05-07T19:45:03.1225587Z 2025-05-07T19:45:03.1225591Z 2025-05-07T19:45:03.1225595Z 2025-05-07T19:45:03.1225826Z 2025-05-07T19:45:03.1225833Z 2025-05-07T19:45:03.1225837Z 2025-05-07T19:45:03.1225840Z 2025-05-07T19:45:03.1356077Z libgfortran5-15.1.0 | 1.5 MB | ########## | 100%  2025-05-07T19:45:03.1356660Z 2025-05-07T19:45:03.1356665Z 2025-05-07T19:45:03.1356669Z 2025-05-07T19:45:03.1356672Z 2025-05-07T19:45:03.1356675Z 2025-05-07T19:45:03.1356679Z 2025-05-07T19:45:03.1356682Z 2025-05-07T19:45:03.1356686Z 2025-05-07T19:45:03.1356689Z 2025-05-07T19:45:03.1356693Z 2025-05-07T19:45:03.1356696Z 2025-05-07T19:45:03.1356700Z 2025-05-07T19:45:03.1356703Z 2025-05-07T19:45:03.1356706Z 2025-05-07T19:45:03.1356710Z 2025-05-07T19:45:03.1356713Z 2025-05-07T19:45:03.1371948Z libabseil-20250127.1 | 1.3 MB | ########## | 100%  2025-05-07T19:45:03.1372296Z 2025-05-07T19:45:03.1372300Z 2025-05-07T19:45:03.1372303Z 2025-05-07T19:45:03.1372307Z 2025-05-07T19:45:03.1372311Z 2025-05-07T19:45:03.1372323Z 2025-05-07T19:45:03.1372336Z 2025-05-07T19:45:03.1372339Z 2025-05-07T19:45:03.1372356Z 2025-05-07T19:45:03.1372360Z 2025-05-07T19:45:03.1372363Z 2025-05-07T19:45:03.1372366Z 2025-05-07T19:45:03.1372375Z 2025-05-07T19:45:03.1372379Z 2025-05-07T19:45:03.1372382Z 2025-05-07T19:45:03.1698479Z krb5-1.21.3 | 1.3 MB | ########## | 100%  2025-05-07T19:45:03.1718788Z openjdk-23.0.2 | 181.4 MB | ####7 | 48% 2025-05-07T19:45:03.1719139Z 2025-05-07T19:45:03.1719447Z 2025-05-07T19:45:03.1719455Z 2025-05-07T19:45:03.1719460Z 2025-05-07T19:45:03.1719464Z 2025-05-07T19:45:03.1719469Z 2025-05-07T19:45:03.1719473Z 2025-05-07T19:45:03.1719477Z 2025-05-07T19:45:03.1719482Z 2025-05-07T19:45:03.1719486Z 2025-05-07T19:45:03.1719490Z 2025-05-07T19:45:03.1719521Z 2025-05-07T19:45:03.1719526Z 2025-05-07T19:45:03.1719530Z 2025-05-07T19:45:03.1719535Z 2025-05-07T19:45:03.1719539Z 2025-05-07T19:45:03.1719544Z 2025-05-07T19:45:03.1719548Z 2025-05-07T19:45:03.1725043Z pcre2-10.44 | 934 KB | 1 | 2%  2025-05-07T19:45:03.1725385Z 2025-05-07T19:45:03.1725396Z 2025-05-07T19:45:03.1725400Z 2025-05-07T19:45:03.1725403Z 2025-05-07T19:45:03.1725407Z 2025-05-07T19:45:03.1725410Z 2025-05-07T19:45:03.1725413Z 2025-05-07T19:45:03.1725417Z 2025-05-07T19:45:03.1725420Z 2025-05-07T19:45:03.1725423Z 2025-05-07T19:45:03.1725427Z 2025-05-07T19:45:03.1725437Z 2025-05-07T19:45:03.1725441Z 2025-05-07T19:45:03.1725445Z 2025-05-07T19:45:03.1725448Z 2025-05-07T19:45:03.1725451Z 2025-05-07T19:45:03.1725775Z 2025-05-07T19:45:03.1749843Z cairo-1.18.4 | 955 KB | 1 | 2%  2025-05-07T19:45:03.1750760Z 2025-05-07T19:45:03.1750774Z 2025-05-07T19:45:03.1750785Z 2025-05-07T19:45:03.1750795Z 2025-05-07T19:45:03.1750806Z 2025-05-07T19:45:03.1750817Z 2025-05-07T19:45:03.1750827Z 2025-05-07T19:45:03.1750862Z 2025-05-07T19:45:03.1750900Z 2025-05-07T19:45:03.1750911Z 2025-05-07T19:45:03.1750922Z 2025-05-07T19:45:03.1750950Z 2025-05-07T19:45:03.1750961Z 2025-05-07T19:45:03.1750971Z 2025-05-07T19:45:03.1750993Z 2025-05-07T19:45:03.1751004Z 2025-05-07T19:45:03.1751013Z 2025-05-07T19:45:03.1751023Z 2025-05-07T19:45:03.1751033Z 2025-05-07T19:45:03.1826593Z ... (more hidden) ... 2025-05-07T19:45:03.1826912Z 2025-05-07T19:45:03.1951823Z bazel-7.5.0 | 47.4 MB | ########7 | 88%  2025-05-07T19:45:03.1952159Z 2025-05-07T19:45:03.1952190Z 2025-05-07T19:45:03.1952195Z 2025-05-07T19:45:03.1952201Z 2025-05-07T19:45:03.1952206Z 2025-05-07T19:45:03.1952212Z 2025-05-07T19:45:03.1952218Z 2025-05-07T19:45:03.1952223Z 2025-05-07T19:45:03.1952229Z 2025-05-07T19:45:03.1952235Z 2025-05-07T19:45:03.1952239Z 2025-05-07T19:45:03.1952244Z 2025-05-07T19:45:03.1952247Z 2025-05-07T19:45:03.1952250Z 2025-05-07T19:45:03.1952255Z 2025-05-07T19:45:03.1952531Z 2025-05-07T19:45:03.1952538Z 2025-05-07T19:45:03.1952542Z 2025-05-07T19:45:03.1957531Z pcre2-10.44 | 934 KB | ########## | 100%  2025-05-07T19:45:03.1958030Z 2025-05-07T19:45:03.1958035Z 2025-05-07T19:45:03.1958039Z 2025-05-07T19:45:03.1958042Z 2025-05-07T19:45:03.1958045Z 2025-05-07T19:45:03.1958048Z 2025-05-07T19:45:03.1958052Z 2025-05-07T19:45:03.1958055Z 2025-05-07T19:45:03.1958058Z 2025-05-07T19:45:03.1958087Z 2025-05-07T19:45:03.1958090Z 2025-05-07T19:45:03.1958093Z 2025-05-07T19:45:03.1958097Z 2025-05-07T19:45:03.1958100Z 2025-05-07T19:45:03.1958103Z 2025-05-07T19:45:03.1958106Z 2025-05-07T19:45:03.1958110Z 2025-05-07T19:45:03.1958113Z 2025-05-07T19:45:03.1961458Z 2025-05-07T19:45:03.2153150Z ... (more hidden) ... 2025-05-07T19:45:03.2153479Z 2025-05-07T19:45:03.2153627Z 2025-05-07T19:45:03.2153638Z 2025-05-07T19:45:03.2153648Z 2025-05-07T19:45:03.2153681Z 2025-05-07T19:45:03.2153687Z 2025-05-07T19:45:03.2153693Z 2025-05-07T19:45:03.2153698Z 2025-05-07T19:45:03.2153703Z 2025-05-07T19:45:03.2153707Z 2025-05-07T19:45:03.2153729Z 2025-05-07T19:45:03.2153734Z 2025-05-07T19:45:03.2153738Z 2025-05-07T19:45:03.2153743Z 2025-05-07T19:45:03.2153797Z 2025-05-07T19:45:03.2153801Z 2025-05-07T19:45:03.2153805Z 2025-05-07T19:45:03.2699630Z cairo-1.18.4 | 955 KB | ########## | 100%  2025-05-07T19:45:03.2826594Z openjdk-23.0.2 | 181.4 MB | #####2 | 52% 2025-05-07T19:45:03.2827047Z 2025-05-07T19:45:03.3246475Z bazel-7.5.0 | 47.4 MB | #########8 | 99%  2025-05-07T19:45:03.3246873Z 2025-05-07T19:45:03.3246879Z 2025-05-07T19:45:03.3246885Z 2025-05-07T19:45:03.3246920Z 2025-05-07T19:45:03.3246927Z 2025-05-07T19:45:03.3246932Z 2025-05-07T19:45:03.3701141Z openblas-0.3.29 | 5.8 MB | ########## | 100%  2025-05-07T19:45:03.4699981Z openjdk-23.0.2 | 181.4 MB | #####6 | 57% 2025-05-07T19:45:03.5259903Z openjdk-23.0.2 | 181.4 MB | ######1 | 61% 2025-05-07T19:45:03.5260231Z 2025-05-07T19:45:03.5260272Z 2025-05-07T19:45:03.5260277Z 2025-05-07T19:45:03.5260282Z 2025-05-07T19:45:03.5260287Z 2025-05-07T19:45:03.5260291Z 2025-05-07T19:45:03.5260295Z 2025-05-07T19:45:03.5701989Z libopenblas-0.3.29 | 5.6 MB | ########## | 100%  2025-05-07T19:45:03.6702457Z openjdk-23.0.2 | 181.4 MB | ######5 | 66% 2025-05-07T19:45:03.7706198Z openjdk-23.0.2 | 181.4 MB | ####### | 70% 2025-05-07T19:45:03.8707247Z openjdk-23.0.2 | 181.4 MB | #######5 | 75% 2025-05-07T19:45:03.9627834Z openjdk-23.0.2 | 181.4 MB | #######9 | 80% 2025-05-07T19:45:03.9628166Z 2025-05-07T19:45:03.9628671Z 2025-05-07T19:45:03.9628685Z 2025-05-07T19:45:03.9628692Z 2025-05-07T19:45:03.9628699Z 2025-05-07T19:45:03.9628704Z 2025-05-07T19:45:03.9628712Z 2025-05-07T19:45:03.9628719Z 2025-05-07T19:45:03.9628763Z 2025-05-07T19:45:04.0041495Z libglib-2.84.0 | 3.8 MB | ########## | 100%  2025-05-07T19:45:04.0252733Z openjdk-23.0.2 | 181.4 MB | ########4 | 84% 2025-05-07T19:45:04.0253042Z 2025-05-07T19:45:04.1043646Z bazel-7.5.0 | 47.4 MB | ########## | 100%  2025-05-07T19:45:04.1527888Z openjdk-23.0.2 | 181.4 MB | ########9 | 89% 2025-05-07T19:45:04.1528320Z 2025-05-07T19:45:04.1528783Z 2025-05-07T19:45:04.1529166Z 2025-05-07T19:45:04.1529196Z 2025-05-07T19:45:04.1529212Z 2025-05-07T19:45:04.1529226Z 2025-05-07T19:45:04.1529301Z 2025-05-07T19:45:04.1529317Z 2025-05-07T19:45:04.2045531Z libcups-2.3.3 | 4.3 MB | ########## | 100%  2025-05-07T19:45:04.3045261Z openjdk-23.0.2 | 181.4 MB | #########4 | 94% 2025-05-07T19:45:04.5491751Z openjdk-23.0.2 | 181.4 MB | #########9 | 100% 2025-05-07T19:45:04.5492061Z 2025-05-07T19:45:04.5492069Z 2025-05-07T19:45:04.5492075Z 2025-05-07T19:45:04.5492404Z 2025-05-07T19:45:04.5492411Z 2025-05-07T19:45:04.5492415Z 2025-05-07T19:45:04.5492420Z 2025-05-07T19:45:04.5492425Z 2025-05-07T19:45:04.5492428Z 2025-05-07T19:45:04.5492587Z 2025-05-07T19:45:04.6722088Z libprotobuf-5.29.3 | 3.2 MB | ########## | 100%  2025-05-07T19:45:04.6722477Z 2025-05-07T19:45:04.6722481Z 2025-05-07T19:45:04.6722485Z 2025-05-07T19:45:04.6722488Z 2025-05-07T19:45:04.6722491Z 2025-05-07T19:45:04.6722495Z 2025-05-07T19:45:04.6722498Z 2025-05-07T19:45:04.6722502Z 2025-05-07T19:45:04.6722506Z 2025-05-07T19:45:04.6722509Z 2025-05-07T19:45:04.6722513Z 2025-05-07T19:45:04.6722516Z 2025-05-07T19:45:04.6722822Z harfbuzz-11.0.0 | 1.6 MB | ########## | 100%  2025-05-07T19:45:04.6723350Z 2025-05-07T19:45:04.6723353Z 2025-05-07T19:45:04.6723356Z 2025-05-07T19:45:04.6723360Z 2025-05-07T19:45:04.6723363Z 2025-05-07T19:45:04.6723366Z 2025-05-07T19:45:04.6723370Z 2025-05-07T19:45:04.6723388Z 2025-05-07T19:45:04.6723391Z 2025-05-07T19:45:04.6723394Z 2025-05-07T19:45:04.6723397Z 2025-05-07T19:45:04.6723401Z 2025-05-07T19:45:05.0088940Z harfbuzz-11.0.0 | 1.6 MB | ########## | 100%  2025-05-07T19:45:05.0090106Z 2025-05-07T19:45:05.0090119Z 2025-05-07T19:45:05.0090130Z 2025-05-07T19:45:05.0090140Z 2025-05-07T19:45:05.0090150Z 2025-05-07T19:45:05.0090160Z 2025-05-07T19:45:05.0090170Z 2025-05-07T19:45:05.0090202Z 2025-05-07T19:45:05.0090213Z 2025-05-07T19:45:05.0090223Z 2025-05-07T19:45:05.0090234Z 2025-05-07T19:45:05.0523453Z tk-8.6.13 | 3.2 MB | ########## | 100%  2025-05-07T19:45:05.0524103Z 2025-05-07T19:45:05.0524118Z 2025-05-07T19:45:05.0524124Z 2025-05-07T19:45:05.0524130Z 2025-05-07T19:45:05.0524202Z 2025-05-07T19:45:05.0524209Z 2025-05-07T19:45:05.0524215Z 2025-05-07T19:45:05.0524221Z 2025-05-07T19:45:05.0524227Z 2025-05-07T19:45:05.0524234Z 2025-05-07T19:45:05.0524240Z 2025-05-07T19:45:05.0524280Z 2025-05-07T19:45:05.0524284Z 2025-05-07T19:45:05.0525197Z font-ttf-ubuntu-0.83 | 1.5 MB | ########## | 100%  2025-05-07T19:45:05.0525605Z 2025-05-07T19:45:05.0525634Z 2025-05-07T19:45:05.0525638Z 2025-05-07T19:45:05.0525641Z 2025-05-07T19:45:05.0525644Z 2025-05-07T19:45:05.0525648Z 2025-05-07T19:45:05.0525651Z 2025-05-07T19:45:05.0525654Z 2025-05-07T19:45:05.0525675Z 2025-05-07T19:45:05.0525679Z 2025-05-07T19:45:05.0525682Z 2025-05-07T19:45:05.0525686Z 2025-05-07T19:45:05.0525689Z 2025-05-07T19:45:05.1333083Z font-ttf-ubuntu-0.83 | 1.5 MB | ########## | 100%  2025-05-07T19:45:05.1333507Z 2025-05-07T19:45:05.1333514Z 2025-05-07T19:45:05.1333519Z 2025-05-07T19:45:05.1333523Z 2025-05-07T19:45:05.1333532Z 2025-05-07T19:45:05.1333538Z 2025-05-07T19:45:05.1333544Z 2025-05-07T19:45:05.1333550Z 2025-05-07T19:45:05.1333554Z 2025-05-07T19:45:05.1333560Z 2025-05-07T19:45:05.1333565Z 2025-05-07T19:45:05.1333607Z 2025-05-07T19:45:05.1333611Z 2025-05-07T19:45:05.1333614Z 2025-05-07T19:45:05.1334000Z libgfortran5-15.1.0 | 1.5 MB | ########## | 100%  2025-05-07T19:45:05.1334353Z 2025-05-07T19:45:05.1334356Z 2025-05-07T19:45:05.1334360Z 2025-05-07T19:45:05.1334363Z 2025-05-07T19:45:05.1334366Z 2025-05-07T19:45:05.1334370Z 2025-05-07T19:45:05.1334373Z 2025-05-07T19:45:05.1334376Z 2025-05-07T19:45:05.1334380Z 2025-05-07T19:45:05.1334383Z 2025-05-07T19:45:05.1334386Z 2025-05-07T19:45:05.1334389Z 2025-05-07T19:45:05.1334392Z 2025-05-07T19:45:05.1334438Z 2025-05-07T19:45:05.5572714Z libgfortran5-15.1.0 | 1.5 MB | ########## | 100%  2025-05-07T19:45:05.5573131Z 2025-05-07T19:45:05.5573139Z 2025-05-07T19:45:05.5573144Z 2025-05-07T19:45:05.5573147Z 2025-05-07T19:45:05.5573156Z 2025-05-07T19:45:05.5573160Z 2025-05-07T19:45:05.5573164Z 2025-05-07T19:45:05.5573170Z 2025-05-07T19:45:05.5573485Z 2025-05-07T19:45:05.5573491Z 2025-05-07T19:45:05.5573534Z 2025-05-07T19:45:05.5573539Z 2025-05-07T19:45:05.5573547Z 2025-05-07T19:45:05.5573552Z 2025-05-07T19:45:05.5573557Z 2025-05-07T19:45:05.5573746Z 2025-05-07T19:45:05.5574155Z libabseil-20250127.1 | 1.3 MB | ########## | 100%  2025-05-07T19:45:05.5574519Z 2025-05-07T19:45:05.5574523Z 2025-05-07T19:45:05.5574552Z 2025-05-07T19:45:05.5574555Z 2025-05-07T19:45:05.5574559Z 2025-05-07T19:45:05.5574562Z 2025-05-07T19:45:05.5574566Z 2025-05-07T19:45:05.5574569Z 2025-05-07T19:45:05.5574573Z 2025-05-07T19:45:05.5574576Z 2025-05-07T19:45:05.5574580Z 2025-05-07T19:45:05.5574583Z 2025-05-07T19:45:05.5574586Z 2025-05-07T19:45:05.5574590Z 2025-05-07T19:45:05.5574593Z 2025-05-07T19:45:05.5574596Z 2025-05-07T19:45:05.5925088Z libabseil-20250127.1 | 1.3 MB | ########## | 100%  2025-05-07T19:45:05.5925515Z 2025-05-07T19:45:05.5925827Z 2025-05-07T19:45:05.7062866Z python-3.13.2 | 31.7 MB | ########## | 100%  2025-05-07T19:45:05.7063715Z 2025-05-07T19:45:05.7063728Z 2025-05-07T19:45:05.7063741Z 2025-05-07T19:45:05.7063787Z 2025-05-07T19:45:05.7063797Z 2025-05-07T19:45:05.7063808Z 2025-05-07T19:45:05.7063818Z 2025-05-07T19:45:05.7063828Z 2025-05-07T19:45:05.7063838Z 2025-05-07T19:45:05.7063848Z 2025-05-07T19:45:05.7063859Z 2025-05-07T19:45:05.7063869Z 2025-05-07T19:45:05.7063879Z 2025-05-07T19:45:05.7063912Z 2025-05-07T19:45:05.7063923Z 2025-05-07T19:45:05.7064714Z krb5-1.21.3 | 1.3 MB | ########## | 100%  2025-05-07T19:45:05.7065556Z 2025-05-07T19:45:05.7065567Z 2025-05-07T19:45:05.7065577Z 2025-05-07T19:45:05.7065587Z 2025-05-07T19:45:05.7065597Z 2025-05-07T19:45:05.7065608Z 2025-05-07T19:45:05.7065618Z 2025-05-07T19:45:05.7065628Z 2025-05-07T19:45:05.7065658Z 2025-05-07T19:45:05.7065668Z 2025-05-07T19:45:05.7065677Z 2025-05-07T19:45:05.7065687Z 2025-05-07T19:45:05.7065711Z 2025-05-07T19:45:05.7065722Z 2025-05-07T19:45:05.7065732Z 2025-05-07T19:45:05.7593451Z krb5-1.21.3 | 1.3 MB | ########## | 100%  2025-05-07T19:45:05.7594452Z 2025-05-07T19:45:05.7594465Z 2025-05-07T19:45:05.7594476Z 2025-05-07T19:45:05.7594487Z 2025-05-07T19:45:05.7594497Z 2025-05-07T19:45:05.7594508Z 2025-05-07T19:45:05.7594518Z 2025-05-07T19:45:05.7594528Z 2025-05-07T19:45:05.7594538Z 2025-05-07T19:45:05.7594548Z 2025-05-07T19:45:05.7594558Z 2025-05-07T19:45:05.7594568Z 2025-05-07T19:45:05.7594578Z 2025-05-07T19:45:05.7594588Z 2025-05-07T19:45:05.7594598Z 2025-05-07T19:45:05.7594608Z 2025-05-07T19:45:05.7594618Z 2025-05-07T19:45:05.7594628Z 2025-05-07T19:45:05.7594638Z 2025-05-07T19:45:05.7595400Z ... (more hidden) ... 2025-05-07T19:45:05.7596244Z 2025-05-07T19:45:05.7596255Z 2025-05-07T19:45:05.7596265Z 2025-05-07T19:45:05.7596275Z 2025-05-07T19:45:05.7596303Z 2025-05-07T19:45:05.7596314Z 2025-05-07T19:45:05.7596324Z 2025-05-07T19:45:05.7596334Z 2025-05-07T19:45:05.7596344Z 2025-05-07T19:45:05.7596354Z 2025-05-07T19:45:05.7596376Z 2025-05-07T19:45:05.7596387Z 2025-05-07T19:45:05.7596397Z 2025-05-07T19:45:05.7596433Z 2025-05-07T19:45:05.7596443Z 2025-05-07T19:45:05.7596453Z 2025-05-07T19:45:05.7596463Z 2025-05-07T19:45:05.7596473Z 2025-05-07T19:45:05.7596482Z 2025-05-07T19:45:05.7613098Z ... (more hidden) ... 2025-05-07T19:45:05.7613465Z 2025-05-07T19:45:05.7613470Z 2025-05-07T19:45:05.7613474Z 2025-05-07T19:45:05.7613477Z 2025-05-07T19:45:05.7613481Z 2025-05-07T19:45:05.7613484Z 2025-05-07T19:45:05.7613488Z 2025-05-07T19:45:05.7613492Z 2025-05-07T19:45:05.7613495Z 2025-05-07T19:45:05.7613499Z 2025-05-07T19:45:05.7613502Z 2025-05-07T19:45:05.7613506Z 2025-05-07T19:45:05.7613509Z 2025-05-07T19:45:05.7613512Z 2025-05-07T19:45:05.7613516Z 2025-05-07T19:45:05.7613823Z 2025-05-07T19:45:05.7613830Z 2025-05-07T19:45:05.7613833Z 2025-05-07T19:45:05.7616289Z pcre2-10.44 | 934 KB | ########## | 100%  2025-05-07T19:45:05.7616900Z 2025-05-07T19:45:05.7616904Z 2025-05-07T19:45:05.7616907Z 2025-05-07T19:45:05.7616911Z 2025-05-07T19:45:05.7616914Z 2025-05-07T19:45:05.7616918Z 2025-05-07T19:45:05.7616921Z 2025-05-07T19:45:05.7616924Z 2025-05-07T19:45:05.7616927Z 2025-05-07T19:45:05.7616931Z 2025-05-07T19:45:05.7616934Z 2025-05-07T19:45:05.7616937Z 2025-05-07T19:45:05.7616960Z 2025-05-07T19:45:05.7616964Z 2025-05-07T19:45:05.7616967Z 2025-05-07T19:45:05.7616970Z 2025-05-07T19:45:05.7616974Z 2025-05-07T19:45:05.7616977Z 2025-05-07T19:45:05.8093065Z pcre2-10.44 | 934 KB | ########## | 100%  2025-05-07T19:45:05.8093419Z 2025-05-07T19:45:05.8093423Z 2025-05-07T19:45:05.8093448Z 2025-05-07T19:45:05.8377319Z cmake-4.0.2 | 19.4 MB | ########## | 100%  2025-05-07T19:45:05.8378146Z 2025-05-07T19:45:05.8378158Z 2025-05-07T19:45:05.8378169Z 2025-05-07T19:45:05.8378179Z 2025-05-07T19:45:05.8378189Z 2025-05-07T19:45:05.8378214Z 2025-05-07T19:45:05.8378250Z 2025-05-07T19:45:05.8378260Z 2025-05-07T19:45:05.8378270Z 2025-05-07T19:45:05.8378280Z 2025-05-07T19:45:05.8378290Z 2025-05-07T19:45:05.8378300Z 2025-05-07T19:45:05.8378310Z 2025-05-07T19:45:05.8378320Z 2025-05-07T19:45:05.8378330Z 2025-05-07T19:45:05.8378340Z 2025-05-07T19:45:05.8378350Z 2025-05-07T19:45:05.8379095Z cairo-1.18.4 | 955 KB | ########## | 100%  2025-05-07T19:45:05.8379428Z 2025-05-07T19:45:05.8379431Z 2025-05-07T19:45:05.8379435Z 2025-05-07T19:45:05.8379438Z 2025-05-07T19:45:05.8379442Z 2025-05-07T19:45:05.8379445Z 2025-05-07T19:45:05.8379449Z 2025-05-07T19:45:05.8379452Z 2025-05-07T19:45:05.8379455Z 2025-05-07T19:45:05.8379459Z 2025-05-07T19:45:05.8379462Z 2025-05-07T19:45:05.8379470Z 2025-05-07T19:45:05.8379474Z 2025-05-07T19:45:05.8379478Z 2025-05-07T19:45:05.8379481Z 2025-05-07T19:45:05.8379484Z 2025-05-07T19:45:05.8379494Z 2025-05-07T19:45:05.9758415Z cairo-1.18.4 | 955 KB | ########## | 100%  2025-05-07T19:45:07.6378406Z openjdk-23.0.2 | 181.4 MB | ########## | 100% 2025-05-07T19:45:07.6379196Z 2025-05-07T19:45:08.2115468Z bazel-7.5.0 | 47.4 MB | ########## | 100%  2025-05-07T19:45:08.2119153Z openjdk-23.0.2 | 181.4 MB | ########## | 100% 2025-05-07T19:45:08.2119943Z 2025-05-07T19:45:08.2119956Z 2025-05-07T19:45:08.2119966Z 2025-05-07T19:45:08.2119977Z 2025-05-07T19:45:08.2119988Z 2025-05-07T19:45:08.2119999Z 2025-05-07T19:45:08.2120009Z 2025-05-07T19:45:08.2120019Z 2025-05-07T19:45:08.2120030Z 2025-05-07T19:45:08.2120041Z 2025-05-07T19:45:08.2120051Z 2025-05-07T19:45:08.2120061Z 2025-05-07T19:45:08.2120071Z 2025-05-07T19:45:08.2120083Z 2025-05-07T19:45:08.2120125Z 2025-05-07T19:45:08.2120136Z 2025-05-07T19:45:08.2120146Z 2025-05-07T19:45:08.2120156Z 2025-05-07T19:45:08.2120166Z 2025-05-07T19:45:08.2120425Z 2025-05-07T19:45:08.2121378Z  2025-05-07T19:45:08.2122328Z 2025-05-07T19:45:08.2122915Z 2025-05-07T19:45:08.2123224Z  2025-05-07T19:45:08.2123455Z 2025-05-07T19:45:08.2123459Z 2025-05-07T19:45:08.2123659Z  2025-05-07T19:45:08.2123874Z 2025-05-07T19:45:08.2123897Z 2025-05-07T19:45:08.2123900Z 2025-05-07T19:45:08.2124079Z  2025-05-07T19:45:08.2124299Z 2025-05-07T19:45:08.2124302Z 2025-05-07T19:45:08.2124306Z 2025-05-07T19:45:08.2124309Z 2025-05-07T19:45:08.2124499Z  2025-05-07T19:45:08.2125004Z 2025-05-07T19:45:08.2125010Z 2025-05-07T19:45:08.2125013Z 2025-05-07T19:45:08.2125017Z 2025-05-07T19:45:08.2125020Z 2025-05-07T19:45:08.2125216Z  2025-05-07T19:45:08.2125565Z 2025-05-07T19:45:08.2125569Z 2025-05-07T19:45:08.2125572Z 2025-05-07T19:45:08.2125576Z 2025-05-07T19:45:08.2125579Z 2025-05-07T19:45:08.2125582Z 2025-05-07T19:45:08.2125772Z  2025-05-07T19:45:08.2126021Z 2025-05-07T19:45:08.2126024Z 2025-05-07T19:45:08.2126028Z 2025-05-07T19:45:08.2126031Z 2025-05-07T19:45:08.2126034Z 2025-05-07T19:45:08.2126038Z 2025-05-07T19:45:08.2126041Z 2025-05-07T19:45:08.2126226Z  2025-05-07T19:45:08.2126453Z 2025-05-07T19:45:08.2126458Z 2025-05-07T19:45:08.2126461Z 2025-05-07T19:45:08.2126482Z 2025-05-07T19:45:08.2126486Z 2025-05-07T19:45:08.2126489Z 2025-05-07T19:45:08.2126497Z 2025-05-07T19:45:08.2126501Z 2025-05-07T19:45:08.2126688Z  2025-05-07T19:45:08.2126931Z 2025-05-07T19:45:08.2126939Z 2025-05-07T19:45:08.2126942Z 2025-05-07T19:45:08.2126946Z 2025-05-07T19:45:08.2126950Z 2025-05-07T19:45:08.2126971Z 2025-05-07T19:45:08.2126975Z 2025-05-07T19:45:08.2126978Z 2025-05-07T19:45:08.2126981Z 2025-05-07T19:45:08.2127172Z  2025-05-07T19:45:08.2127401Z 2025-05-07T19:45:08.2127404Z 2025-05-07T19:45:08.2127408Z 2025-05-07T19:45:08.2127412Z 2025-05-07T19:45:08.2127494Z 2025-05-07T19:45:08.2127497Z 2025-05-07T19:45:08.2127501Z 2025-05-07T19:45:08.2127505Z 2025-05-07T19:45:08.2127508Z 2025-05-07T19:45:08.2127511Z 2025-05-07T19:45:08.2127712Z  2025-05-07T19:45:08.2127962Z 2025-05-07T19:45:08.2127966Z 2025-05-07T19:45:08.2127973Z 2025-05-07T19:45:08.2127977Z 2025-05-07T19:45:08.2127980Z 2025-05-07T19:45:08.2127984Z 2025-05-07T19:45:08.2127987Z 2025-05-07T19:45:08.2127990Z 2025-05-07T19:45:08.2127997Z 2025-05-07T19:45:08.2128001Z 2025-05-07T19:45:08.2128004Z 2025-05-07T19:45:08.2128219Z  2025-05-07T19:45:08.2128656Z 2025-05-07T19:45:08.2128660Z 2025-05-07T19:45:08.2128664Z 2025-05-07T19:45:08.2128667Z 2025-05-07T19:45:08.2128670Z 2025-05-07T19:45:08.2128674Z 2025-05-07T19:45:08.2128677Z 2025-05-07T19:45:08.2128680Z 2025-05-07T19:45:08.2128683Z 2025-05-07T19:45:08.2128687Z 2025-05-07T19:45:08.2128690Z 2025-05-07T19:45:08.2128694Z 2025-05-07T19:45:08.2128927Z  2025-05-07T19:45:08.2129173Z 2025-05-07T19:45:08.2129177Z 2025-05-07T19:45:08.2129232Z 2025-05-07T19:45:08.2129236Z 2025-05-07T19:45:08.2129239Z 2025-05-07T19:45:08.2129247Z 2025-05-07T19:45:08.2129251Z 2025-05-07T19:45:08.2129254Z 2025-05-07T19:45:08.2129258Z 2025-05-07T19:45:08.2129261Z 2025-05-07T19:45:08.2129265Z 2025-05-07T19:45:08.2129268Z 2025-05-07T19:45:08.2129276Z 2025-05-07T19:45:08.2129519Z  2025-05-07T19:45:08.2129768Z 2025-05-07T19:45:08.2129772Z 2025-05-07T19:45:08.2129775Z 2025-05-07T19:45:08.2129779Z 2025-05-07T19:45:08.2129782Z 2025-05-07T19:45:08.2129786Z 2025-05-07T19:45:08.2129790Z 2025-05-07T19:45:08.2129793Z 2025-05-07T19:45:08.2129797Z 2025-05-07T19:45:08.2129800Z 2025-05-07T19:45:08.2129919Z 2025-05-07T19:45:08.2129925Z 2025-05-07T19:45:08.2129930Z 2025-05-07T19:45:08.2129935Z 2025-05-07T19:45:08.2130188Z  2025-05-07T19:45:08.2130441Z 2025-05-07T19:45:08.2130445Z 2025-05-07T19:45:08.2130449Z 2025-05-07T19:45:08.2130453Z 2025-05-07T19:45:08.2130566Z 2025-05-07T19:45:08.2130571Z 2025-05-07T19:45:08.2130602Z 2025-05-07T19:45:08.2130606Z 2025-05-07T19:45:08.2130609Z 2025-05-07T19:45:08.2130613Z 2025-05-07T19:45:08.2130740Z 2025-05-07T19:45:08.2130743Z 2025-05-07T19:45:08.2130747Z 2025-05-07T19:45:08.2130750Z 2025-05-07T19:45:08.2130754Z 2025-05-07T19:45:08.2130981Z  2025-05-07T19:45:08.2131261Z 2025-05-07T19:45:08.2131265Z 2025-05-07T19:45:08.2131268Z 2025-05-07T19:45:08.2131272Z 2025-05-07T19:45:08.2131275Z 2025-05-07T19:45:08.2131279Z 2025-05-07T19:45:08.2131282Z 2025-05-07T19:45:08.2131285Z 2025-05-07T19:45:08.2131289Z 2025-05-07T19:45:08.2131292Z 2025-05-07T19:45:08.2131295Z 2025-05-07T19:45:08.2131299Z 2025-05-07T19:45:08.2131302Z 2025-05-07T19:45:08.2131306Z 2025-05-07T19:45:08.2131309Z 2025-05-07T19:45:08.2131312Z 2025-05-07T19:45:08.2131545Z  2025-05-07T19:45:08.2131821Z 2025-05-07T19:45:08.2131825Z 2025-05-07T19:45:08.2131828Z 2025-05-07T19:45:08.2131832Z 2025-05-07T19:45:08.2131835Z 2025-05-07T19:45:08.2131843Z 2025-05-07T19:45:08.2131847Z 2025-05-07T19:45:08.2131851Z 2025-05-07T19:45:08.2131854Z 2025-05-07T19:45:08.2131858Z 2025-05-07T19:45:08.2131861Z 2025-05-07T19:45:08.2131864Z 2025-05-07T19:45:08.2131868Z 2025-05-07T19:45:08.2131872Z 2025-05-07T19:45:08.2131875Z 2025-05-07T19:45:08.2131878Z 2025-05-07T19:45:08.2131881Z 2025-05-07T19:45:08.2132139Z  2025-05-07T19:45:08.2132396Z 2025-05-07T19:45:08.2132400Z 2025-05-07T19:45:08.2132403Z 2025-05-07T19:45:08.2132406Z 2025-05-07T19:45:08.2132410Z 2025-05-07T19:45:08.2132414Z 2025-05-07T19:45:08.2132417Z 2025-05-07T19:45:08.2132420Z 2025-05-07T19:45:08.2132424Z 2025-05-07T19:45:08.2132427Z 2025-05-07T19:45:08.2132455Z 2025-05-07T19:45:08.2132459Z 2025-05-07T19:45:08.2132466Z 2025-05-07T19:45:08.2132470Z 2025-05-07T19:45:08.2132474Z 2025-05-07T19:45:08.2132477Z 2025-05-07T19:45:08.2132481Z 2025-05-07T19:45:08.2132484Z 2025-05-07T19:45:08.2132725Z  2025-05-07T19:45:08.2132986Z 2025-05-07T19:45:08.2132990Z 2025-05-07T19:45:08.2133131Z  2025-05-07T19:45:08.2133249Z 2025-05-07T19:45:08.2133252Z 2025-05-07T19:45:08.2133363Z  2025-05-07T19:45:08.2133504Z 2025-05-07T19:45:08.2133508Z 2025-05-07T19:45:08.2133511Z 2025-05-07T19:45:08.2133621Z  2025-05-07T19:45:08.2133742Z 2025-05-07T19:45:08.2133746Z 2025-05-07T19:45:08.2133749Z 2025-05-07T19:45:08.2133753Z 2025-05-07T19:45:08.2133885Z  2025-05-07T19:45:08.2134006Z 2025-05-07T19:45:08.2134010Z 2025-05-07T19:45:08.2134013Z 2025-05-07T19:45:08.2134017Z 2025-05-07T19:45:08.2134020Z 2025-05-07T19:45:08.2134128Z  2025-05-07T19:45:08.2134275Z 2025-05-07T19:45:08.2134283Z 2025-05-07T19:45:08.2134286Z 2025-05-07T19:45:08.2134290Z 2025-05-07T19:45:08.2134294Z 2025-05-07T19:45:08.2134297Z 2025-05-07T19:45:08.2134409Z  2025-05-07T19:45:08.2134563Z 2025-05-07T19:45:08.2134567Z 2025-05-07T19:45:08.2134570Z 2025-05-07T19:45:08.2134574Z 2025-05-07T19:45:08.2134577Z 2025-05-07T19:45:08.2134581Z 2025-05-07T19:45:08.2134584Z 2025-05-07T19:45:08.2134796Z  2025-05-07T19:45:08.2134939Z 2025-05-07T19:45:08.2134943Z 2025-05-07T19:45:08.2134946Z 2025-05-07T19:45:08.2134949Z 2025-05-07T19:45:08.2134953Z 2025-05-07T19:45:08.2134956Z 2025-05-07T19:45:08.2134959Z 2025-05-07T19:45:08.2134963Z 2025-05-07T19:45:08.2135104Z  2025-05-07T19:45:08.2135258Z 2025-05-07T19:45:08.2135262Z 2025-05-07T19:45:08.2135265Z 2025-05-07T19:45:08.2135269Z 2025-05-07T19:45:08.2135272Z 2025-05-07T19:45:08.2135276Z 2025-05-07T19:45:08.2135279Z 2025-05-07T19:45:08.2135282Z 2025-05-07T19:45:08.2135286Z 2025-05-07T19:45:08.2135476Z  2025-05-07T19:45:08.2135658Z 2025-05-07T19:45:08.2135662Z 2025-05-07T19:45:08.2135666Z 2025-05-07T19:45:08.2135669Z 2025-05-07T19:45:08.2135735Z 2025-05-07T19:45:08.2135739Z 2025-05-07T19:45:08.2135742Z 2025-05-07T19:45:08.2135746Z 2025-05-07T19:45:08.2135749Z 2025-05-07T19:45:08.2135752Z 2025-05-07T19:45:08.2135887Z  2025-05-07T19:45:08.2136196Z 2025-05-07T19:45:08.2136199Z 2025-05-07T19:45:08.2136203Z 2025-05-07T19:45:08.2136206Z 2025-05-07T19:45:08.2136209Z 2025-05-07T19:45:08.2136213Z 2025-05-07T19:45:08.2136216Z 2025-05-07T19:45:08.2136219Z 2025-05-07T19:45:08.2136222Z 2025-05-07T19:45:08.2136226Z 2025-05-07T19:45:08.2136230Z 2025-05-07T19:45:08.2136358Z  2025-05-07T19:45:08.2136552Z 2025-05-07T19:45:08.2136557Z 2025-05-07T19:45:08.2136560Z 2025-05-07T19:45:08.2136563Z 2025-05-07T19:45:08.2136567Z 2025-05-07T19:45:08.2136570Z 2025-05-07T19:45:08.2136573Z 2025-05-07T19:45:08.2136581Z 2025-05-07T19:45:08.2136585Z 2025-05-07T19:45:08.2136588Z 2025-05-07T19:45:08.2136591Z 2025-05-07T19:45:08.2136594Z 2025-05-07T19:45:08.2136724Z  2025-05-07T19:45:08.2136937Z 2025-05-07T19:45:08.2136942Z 2025-05-07T19:45:08.2136945Z 2025-05-07T19:45:08.2136949Z 2025-05-07T19:45:08.2136952Z 2025-05-07T19:45:08.2136955Z 2025-05-07T19:45:08.2136959Z 2025-05-07T19:45:08.2136962Z 2025-05-07T19:45:08.2136965Z 2025-05-07T19:45:08.2136968Z 2025-05-07T19:45:08.2136971Z 2025-05-07T19:45:08.2136975Z 2025-05-07T19:45:08.2136978Z 2025-05-07T19:45:08.2137140Z  2025-05-07T19:45:08.2137332Z 2025-05-07T19:45:08.2137335Z 2025-05-07T19:45:08.2137339Z 2025-05-07T19:45:08.2137342Z 2025-05-07T19:45:08.2137345Z 2025-05-07T19:45:08.2137348Z 2025-05-07T19:45:08.2137352Z 2025-05-07T19:45:08.2137355Z 2025-05-07T19:45:08.2137358Z 2025-05-07T19:45:08.2137361Z 2025-05-07T19:45:08.2137365Z 2025-05-07T19:45:08.2137372Z 2025-05-07T19:45:08.2137376Z 2025-05-07T19:45:08.2137379Z 2025-05-07T19:45:08.2137536Z  2025-05-07T19:45:08.2137735Z 2025-05-07T19:45:08.2137744Z 2025-05-07T19:45:08.2137747Z 2025-05-07T19:45:08.2137751Z 2025-05-07T19:45:08.2137754Z 2025-05-07T19:45:08.2137757Z 2025-05-07T19:45:08.2137760Z 2025-05-07T19:45:08.2137764Z 2025-05-07T19:45:08.2137767Z 2025-05-07T19:45:08.2137770Z 2025-05-07T19:45:08.2137773Z 2025-05-07T19:45:08.2137776Z 2025-05-07T19:45:08.2137797Z 2025-05-07T19:45:08.2137801Z 2025-05-07T19:45:08.2137804Z 2025-05-07T19:45:08.2137950Z  2025-05-07T19:45:08.2138152Z 2025-05-07T19:45:08.2138156Z 2025-05-07T19:45:08.2138159Z 2025-05-07T19:45:08.2138162Z 2025-05-07T19:45:08.2138166Z 2025-05-07T19:45:08.2138169Z 2025-05-07T19:45:08.2138172Z 2025-05-07T19:45:08.2138175Z 2025-05-07T19:45:08.2138197Z 2025-05-07T19:45:08.2138200Z 2025-05-07T19:45:08.2138203Z 2025-05-07T19:45:08.2138206Z 2025-05-07T19:45:08.2138213Z 2025-05-07T19:45:08.2138217Z 2025-05-07T19:45:08.2138220Z 2025-05-07T19:45:08.2138224Z 2025-05-07T19:45:08.2138382Z  2025-05-07T19:45:08.2138601Z 2025-05-07T19:45:08.2138604Z 2025-05-07T19:45:08.2138607Z 2025-05-07T19:45:08.2138636Z 2025-05-07T19:45:08.2138639Z 2025-05-07T19:45:08.2138642Z 2025-05-07T19:45:08.2138645Z 2025-05-07T19:45:08.2138649Z 2025-05-07T19:45:08.2138652Z 2025-05-07T19:45:08.2138655Z 2025-05-07T19:45:08.2138659Z 2025-05-07T19:45:08.2138662Z 2025-05-07T19:45:08.2138665Z 2025-05-07T19:45:08.2138668Z 2025-05-07T19:45:08.2138671Z 2025-05-07T19:45:08.2138675Z 2025-05-07T19:45:08.2138678Z 2025-05-07T19:45:08.2138843Z  2025-05-07T19:45:08.2139106Z 2025-05-07T19:45:08.2139110Z 2025-05-07T19:45:08.2139113Z 2025-05-07T19:45:08.2139117Z 2025-05-07T19:45:08.2139120Z 2025-05-07T19:45:08.2139123Z 2025-05-07T19:45:08.2139126Z 2025-05-07T19:45:08.2139191Z 2025-05-07T19:45:08.2139195Z 2025-05-07T19:45:08.2139199Z 2025-05-07T19:45:08.2139202Z 2025-05-07T19:45:08.2139205Z 2025-05-07T19:45:08.2139209Z 2025-05-07T19:45:08.2139212Z 2025-05-07T19:45:08.2139271Z 2025-05-07T19:45:08.2139274Z 2025-05-07T19:45:08.2139277Z 2025-05-07T19:45:08.2139281Z 2025-05-07T19:45:08.2139484Z  2025-05-07T19:45:08.2139712Z 2025-05-07T19:45:08.2139716Z 2025-05-07T19:45:08.2139826Z  2025-05-07T19:45:08.2139974Z 2025-05-07T19:45:08.2139978Z 2025-05-07T19:45:08.2140082Z  2025-05-07T19:45:08.2140200Z 2025-05-07T19:45:08.2140204Z 2025-05-07T19:45:08.2140207Z 2025-05-07T19:45:08.2140344Z  2025-05-07T19:45:08.2140461Z 2025-05-07T19:45:08.2140465Z 2025-05-07T19:45:08.2140469Z 2025-05-07T19:45:08.2140472Z 2025-05-07T19:45:08.2140586Z  2025-05-07T19:45:08.2140742Z 2025-05-07T19:45:08.2140746Z 2025-05-07T19:45:08.2140749Z 2025-05-07T19:45:08.2140752Z 2025-05-07T19:45:08.2140756Z 2025-05-07T19:45:08.2140875Z  2025-05-07T19:45:08.2141010Z 2025-05-07T19:45:08.2141014Z 2025-05-07T19:45:08.2141017Z 2025-05-07T19:45:08.2141021Z 2025-05-07T19:45:08.2141028Z 2025-05-07T19:45:08.2141063Z 2025-05-07T19:45:08.2141185Z  2025-05-07T19:45:08.2141332Z 2025-05-07T19:45:08.2141335Z 2025-05-07T19:45:08.2141339Z 2025-05-07T19:45:08.2141342Z 2025-05-07T19:45:08.2141345Z 2025-05-07T19:45:08.2141349Z 2025-05-07T19:45:08.2141352Z 2025-05-07T19:45:08.2141507Z  2025-05-07T19:45:08.2141661Z 2025-05-07T19:45:08.2141666Z 2025-05-07T19:45:08.2141669Z 2025-05-07T19:45:08.2141672Z 2025-05-07T19:45:08.2141676Z 2025-05-07T19:45:08.2141679Z 2025-05-07T19:45:08.2141682Z 2025-05-07T19:45:08.2141686Z 2025-05-07T19:45:08.2141840Z  2025-05-07T19:45:08.2141997Z 2025-05-07T19:45:08.2142001Z 2025-05-07T19:45:08.2142005Z 2025-05-07T19:45:08.2142008Z 2025-05-07T19:45:08.2142012Z 2025-05-07T19:45:08.2142015Z 2025-05-07T19:45:08.2142022Z 2025-05-07T19:45:08.2142026Z 2025-05-07T19:45:08.2142029Z 2025-05-07T19:45:08.2142165Z  2025-05-07T19:45:08.2142352Z 2025-05-07T19:45:08.2142355Z 2025-05-07T19:45:08.2142363Z 2025-05-07T19:45:08.2142366Z 2025-05-07T19:45:08.2142370Z 2025-05-07T19:45:08.2142373Z 2025-05-07T19:45:08.2142377Z 2025-05-07T19:45:08.2142380Z 2025-05-07T19:45:08.2142383Z 2025-05-07T19:45:08.2142386Z 2025-05-07T19:45:08.2142527Z  2025-05-07T19:45:08.2142742Z 2025-05-07T19:45:08.2142745Z 2025-05-07T19:45:08.2142748Z 2025-05-07T19:45:08.2142752Z 2025-05-07T19:45:08.2142755Z 2025-05-07T19:45:08.2142759Z 2025-05-07T19:45:08.2142762Z 2025-05-07T19:45:08.2142766Z 2025-05-07T19:45:08.2142769Z 2025-05-07T19:45:08.2142772Z 2025-05-07T19:45:08.2142775Z 2025-05-07T19:45:08.2142904Z  2025-05-07T19:45:08.2143098Z 2025-05-07T19:45:08.2143101Z 2025-05-07T19:45:08.2143105Z 2025-05-07T19:45:08.2143108Z 2025-05-07T19:45:08.2143115Z 2025-05-07T19:45:08.2143119Z 2025-05-07T19:45:08.2143123Z 2025-05-07T19:45:08.2143126Z 2025-05-07T19:45:08.2143129Z 2025-05-07T19:45:08.2143133Z 2025-05-07T19:45:08.2143136Z 2025-05-07T19:45:08.2143143Z 2025-05-07T19:45:08.2143274Z  2025-05-07T19:45:08.2143472Z 2025-05-07T19:45:08.2143476Z 2025-05-07T19:45:08.2143479Z 2025-05-07T19:45:08.2143483Z 2025-05-07T19:45:08.2143487Z 2025-05-07T19:45:08.2143490Z 2025-05-07T19:45:08.2143493Z 2025-05-07T19:45:08.2143497Z 2025-05-07T19:45:08.2143500Z 2025-05-07T19:45:08.2143503Z 2025-05-07T19:45:08.2143632Z 2025-05-07T19:45:08.2143635Z 2025-05-07T19:45:08.2143639Z 2025-05-07T19:45:08.2143790Z  2025-05-07T19:45:08.2143978Z 2025-05-07T19:45:08.2143982Z 2025-05-07T19:45:08.2143986Z 2025-05-07T19:45:08.2143989Z 2025-05-07T19:45:08.2143992Z 2025-05-07T19:45:08.2143995Z 2025-05-07T19:45:08.2143999Z 2025-05-07T19:45:08.2144002Z 2025-05-07T19:45:08.2144005Z 2025-05-07T19:45:08.2144066Z 2025-05-07T19:45:08.2144070Z 2025-05-07T19:45:08.2144074Z 2025-05-07T19:45:08.2144077Z 2025-05-07T19:45:08.2144081Z 2025-05-07T19:45:08.2144241Z  2025-05-07T19:45:08.2144502Z 2025-05-07T19:45:08.2144506Z 2025-05-07T19:45:08.2144509Z 2025-05-07T19:45:08.2144512Z 2025-05-07T19:45:08.2144516Z 2025-05-07T19:45:08.2144519Z 2025-05-07T19:45:08.2144522Z 2025-05-07T19:45:08.2144526Z 2025-05-07T19:45:08.2144530Z 2025-05-07T19:45:08.2144533Z 2025-05-07T19:45:08.2144537Z 2025-05-07T19:45:08.2144558Z 2025-05-07T19:45:08.2144561Z 2025-05-07T19:45:08.2144564Z 2025-05-07T19:45:08.2144567Z 2025-05-07T19:45:08.2144716Z  2025-05-07T19:45:08.2144920Z 2025-05-07T19:45:08.2144923Z 2025-05-07T19:45:08.2144927Z 2025-05-07T19:45:08.2144930Z 2025-05-07T19:45:08.2144933Z 2025-05-07T19:45:08.2144937Z 2025-05-07T19:45:08.2144940Z 2025-05-07T19:45:08.2144960Z 2025-05-07T19:45:08.2144963Z 2025-05-07T19:45:08.2144971Z 2025-05-07T19:45:08.2144974Z 2025-05-07T19:45:08.2144978Z 2025-05-07T19:45:08.2144981Z 2025-05-07T19:45:08.2144984Z 2025-05-07T19:45:08.2144988Z 2025-05-07T19:45:08.2144995Z 2025-05-07T19:45:08.2145146Z  2025-05-07T19:45:08.2145354Z 2025-05-07T19:45:08.2145374Z 2025-05-07T19:45:08.2145377Z 2025-05-07T19:45:08.2145380Z 2025-05-07T19:45:08.2145384Z 2025-05-07T19:45:08.2145387Z 2025-05-07T19:45:08.2145391Z 2025-05-07T19:45:08.2145394Z 2025-05-07T19:45:08.2145397Z 2025-05-07T19:45:08.2145400Z 2025-05-07T19:45:08.2145404Z 2025-05-07T19:45:08.2145407Z 2025-05-07T19:45:08.2145411Z 2025-05-07T19:45:08.2145414Z 2025-05-07T19:45:08.2145417Z 2025-05-07T19:45:08.2145420Z 2025-05-07T19:45:08.2145424Z 2025-05-07T19:45:08.2145580Z  2025-05-07T19:45:08.2145808Z 2025-05-07T19:45:08.2145811Z 2025-05-07T19:45:08.2145815Z 2025-05-07T19:45:08.2145819Z 2025-05-07T19:45:08.2145822Z 2025-05-07T19:45:08.2145829Z 2025-05-07T19:45:08.2145833Z 2025-05-07T19:45:08.2145836Z 2025-05-07T19:45:08.2145839Z 2025-05-07T19:45:08.2145842Z 2025-05-07T19:45:08.2145845Z 2025-05-07T19:45:08.2145852Z 2025-05-07T19:45:08.2145856Z 2025-05-07T19:45:08.2145859Z 2025-05-07T19:45:08.2145862Z 2025-05-07T19:45:08.2145866Z 2025-05-07T19:45:08.2145869Z 2025-05-07T19:45:08.2145872Z 2025-05-07T19:45:08.2146051Z  2025-05-07T19:45:08.2146265Z 2025-05-07T19:45:08.2146268Z 2025-05-07T19:45:08.2146364Z  2025-05-07T19:45:08.2146484Z 2025-05-07T19:45:08.2146487Z 2025-05-07T19:45:08.2146583Z  2025-05-07T19:45:08.2146688Z 2025-05-07T19:45:08.2146692Z 2025-05-07T19:45:08.2146696Z 2025-05-07T19:45:08.2146811Z  2025-05-07T19:45:08.2146920Z 2025-05-07T19:45:08.2146923Z 2025-05-07T19:45:08.2146927Z 2025-05-07T19:45:08.2146930Z 2025-05-07T19:45:08.2147032Z  2025-05-07T19:45:08.2147164Z 2025-05-07T19:45:08.2147168Z 2025-05-07T19:45:08.2147171Z 2025-05-07T19:45:08.2147177Z 2025-05-07T19:45:08.2147181Z 2025-05-07T19:45:08.2147287Z  2025-05-07T19:45:08.2147411Z 2025-05-07T19:45:08.2147414Z 2025-05-07T19:45:08.2147421Z 2025-05-07T19:45:08.2147444Z 2025-05-07T19:45:08.2147448Z 2025-05-07T19:45:08.2147451Z 2025-05-07T19:45:08.2147571Z  2025-05-07T19:45:08.2147707Z 2025-05-07T19:45:08.2147711Z 2025-05-07T19:45:08.2147714Z 2025-05-07T19:45:08.2147718Z 2025-05-07T19:45:08.2147721Z 2025-05-07T19:45:08.2147725Z 2025-05-07T19:45:08.2147728Z 2025-05-07T19:45:08.2147872Z  2025-05-07T19:45:08.2148020Z 2025-05-07T19:45:08.2148023Z 2025-05-07T19:45:08.2148027Z 2025-05-07T19:45:08.2148030Z 2025-05-07T19:45:08.2148033Z 2025-05-07T19:45:08.2148037Z 2025-05-07T19:45:08.2148040Z 2025-05-07T19:45:08.2148043Z 2025-05-07T19:45:08.2148190Z  2025-05-07T19:45:08.2148348Z 2025-05-07T19:45:08.2148352Z 2025-05-07T19:45:08.2148356Z 2025-05-07T19:45:08.2148359Z 2025-05-07T19:45:08.2148439Z 2025-05-07T19:45:08.2148443Z 2025-05-07T19:45:08.2148446Z 2025-05-07T19:45:08.2148449Z 2025-05-07T19:45:08.2148453Z 2025-05-07T19:45:08.2148584Z  2025-05-07T19:45:08.2148834Z 2025-05-07T19:45:08.2148838Z 2025-05-07T19:45:08.2148841Z 2025-05-07T19:45:08.2148844Z 2025-05-07T19:45:08.2148847Z 2025-05-07T19:45:08.2148851Z 2025-05-07T19:45:08.2148854Z 2025-05-07T19:45:08.2148858Z 2025-05-07T19:45:08.2148861Z 2025-05-07T19:45:08.2148864Z 2025-05-07T19:45:08.2148996Z  2025-05-07T19:45:08.2149197Z 2025-05-07T19:45:08.2149201Z 2025-05-07T19:45:08.2149205Z 2025-05-07T19:45:08.2149208Z 2025-05-07T19:45:08.2149211Z 2025-05-07T19:45:08.2149215Z 2025-05-07T19:45:08.2149218Z 2025-05-07T19:45:08.2149222Z 2025-05-07T19:45:08.2149225Z 2025-05-07T19:45:08.2149228Z 2025-05-07T19:45:08.2149231Z 2025-05-07T19:45:08.2149371Z  2025-05-07T19:45:08.2149579Z 2025-05-07T19:45:08.2149583Z 2025-05-07T19:45:08.2149590Z 2025-05-07T19:45:08.2149594Z 2025-05-07T19:45:08.2149597Z 2025-05-07T19:45:08.2149600Z 2025-05-07T19:45:08.2149603Z 2025-05-07T19:45:08.2149607Z 2025-05-07T19:45:08.2149614Z 2025-05-07T19:45:08.2149617Z 2025-05-07T19:45:08.2149621Z 2025-05-07T19:45:08.2149624Z 2025-05-07T19:45:08.2149787Z  2025-05-07T19:45:08.2149980Z 2025-05-07T19:45:08.2149984Z 2025-05-07T19:45:08.2149987Z 2025-05-07T19:45:08.2149991Z 2025-05-07T19:45:08.2149994Z 2025-05-07T19:45:08.2149997Z 2025-05-07T19:45:08.2150001Z 2025-05-07T19:45:08.2150004Z 2025-05-07T19:45:08.2150007Z 2025-05-07T19:45:08.2150011Z 2025-05-07T19:45:08.2150014Z 2025-05-07T19:45:08.2150017Z 2025-05-07T19:45:08.2150020Z 2025-05-07T19:45:08.2150189Z  2025-05-07T19:45:08.2150390Z 2025-05-07T19:45:08.2150393Z 2025-05-07T19:45:08.2150397Z 2025-05-07T19:45:08.2150400Z 2025-05-07T19:45:08.2150403Z 2025-05-07T19:45:08.2150407Z 2025-05-07T19:45:08.2150410Z 2025-05-07T19:45:08.2150417Z 2025-05-07T19:45:08.2150420Z 2025-05-07T19:45:08.2150424Z 2025-05-07T19:45:08.2150427Z 2025-05-07T19:45:08.2150430Z 2025-05-07T19:45:08.2150434Z 2025-05-07T19:45:08.2150440Z 2025-05-07T19:45:08.2150619Z  2025-05-07T19:45:08.2150824Z 2025-05-07T19:45:08.2150828Z 2025-05-07T19:45:08.2150831Z 2025-05-07T19:45:08.2150835Z 2025-05-07T19:45:08.2150839Z 2025-05-07T19:45:08.2150842Z 2025-05-07T19:45:08.2150846Z 2025-05-07T19:45:08.2150850Z 2025-05-07T19:45:08.2150853Z 2025-05-07T19:45:08.2150856Z 2025-05-07T19:45:08.2150860Z 2025-05-07T19:45:08.2150892Z 2025-05-07T19:45:08.2150896Z 2025-05-07T19:45:08.2150899Z 2025-05-07T19:45:08.2150902Z 2025-05-07T19:45:08.2151057Z  2025-05-07T19:45:08.2151271Z 2025-05-07T19:45:08.2151275Z 2025-05-07T19:45:08.2151279Z 2025-05-07T19:45:08.2151282Z 2025-05-07T19:45:08.2151285Z 2025-05-07T19:45:08.2151289Z 2025-05-07T19:45:08.2151292Z 2025-05-07T19:45:08.2151327Z 2025-05-07T19:45:08.2151331Z 2025-05-07T19:45:08.2151334Z 2025-05-07T19:45:08.2151337Z 2025-05-07T19:45:08.2151341Z 2025-05-07T19:45:08.2151344Z 2025-05-07T19:45:08.2151350Z 2025-05-07T19:45:08.2151353Z 2025-05-07T19:45:08.2151357Z 2025-05-07T19:45:08.2151520Z  2025-05-07T19:45:08.2151736Z 2025-05-07T19:45:08.2151764Z 2025-05-07T19:45:08.2151768Z 2025-05-07T19:45:08.2151771Z 2025-05-07T19:45:08.2151774Z 2025-05-07T19:45:08.2151778Z 2025-05-07T19:45:08.2151781Z 2025-05-07T19:45:08.2151784Z 2025-05-07T19:45:08.2151788Z 2025-05-07T19:45:08.2151791Z 2025-05-07T19:45:08.2151794Z 2025-05-07T19:45:08.2151797Z 2025-05-07T19:45:08.2151801Z 2025-05-07T19:45:08.2151804Z 2025-05-07T19:45:08.2151807Z 2025-05-07T19:45:08.2151811Z 2025-05-07T19:45:08.2151814Z 2025-05-07T19:45:08.2151975Z  2025-05-07T19:45:08.2152226Z 2025-05-07T19:45:08.2152229Z 2025-05-07T19:45:08.2152232Z 2025-05-07T19:45:08.2152294Z 2025-05-07T19:45:08.2152298Z 2025-05-07T19:45:08.2152302Z 2025-05-07T19:45:08.2152305Z 2025-05-07T19:45:08.2152309Z 2025-05-07T19:45:08.2152312Z 2025-05-07T19:45:08.2152376Z 2025-05-07T19:45:08.2152379Z 2025-05-07T19:45:08.2152382Z 2025-05-07T19:45:08.2152386Z 2025-05-07T19:45:08.2152389Z 2025-05-07T19:45:08.2152392Z 2025-05-07T19:45:08.2152395Z 2025-05-07T19:45:08.2152399Z 2025-05-07T19:45:08.2152402Z 2025-05-07T19:45:08.2152600Z  2025-05-07T19:45:08.2152828Z 2025-05-07T19:45:08.2152831Z 2025-05-07T19:45:08.2152936Z  2025-05-07T19:45:08.2153073Z 2025-05-07T19:45:08.2153077Z 2025-05-07T19:45:08.2153180Z  2025-05-07T19:45:08.2153292Z 2025-05-07T19:45:08.2153296Z 2025-05-07T19:45:08.2153299Z 2025-05-07T19:45:08.2153430Z  2025-05-07T19:45:08.2153548Z 2025-05-07T19:45:08.2153551Z 2025-05-07T19:45:08.2153555Z 2025-05-07T19:45:08.2153558Z 2025-05-07T19:45:08.2153673Z  2025-05-07T19:45:08.2153835Z 2025-05-07T19:45:08.2153839Z 2025-05-07T19:45:08.2153842Z 2025-05-07T19:45:08.2153846Z 2025-05-07T19:45:08.2153849Z 2025-05-07T19:45:08.2153967Z  2025-05-07T19:45:08.2154106Z 2025-05-07T19:45:08.2154109Z 2025-05-07T19:45:08.2154113Z 2025-05-07T19:45:08.2154146Z 2025-05-07T19:45:08.2154150Z 2025-05-07T19:45:08.2154153Z 2025-05-07T19:45:08.2154276Z  2025-05-07T19:45:08.2154417Z 2025-05-07T19:45:08.2154420Z 2025-05-07T19:45:08.2154423Z 2025-05-07T19:45:08.2154427Z 2025-05-07T19:45:08.2154430Z 2025-05-07T19:45:08.2154433Z 2025-05-07T19:45:08.2154436Z 2025-05-07T19:45:08.2154582Z  2025-05-07T19:45:08.2154728Z 2025-05-07T19:45:08.2154732Z 2025-05-07T19:45:08.2154735Z 2025-05-07T19:45:08.2154739Z 2025-05-07T19:45:08.2154742Z 2025-05-07T19:45:08.2154745Z 2025-05-07T19:45:08.2154748Z 2025-05-07T19:45:08.2154752Z 2025-05-07T19:45:08.2154900Z  2025-05-07T19:45:08.2155059Z 2025-05-07T19:45:08.2155063Z 2025-05-07T19:45:08.2155070Z 2025-05-07T19:45:08.2155074Z 2025-05-07T19:45:08.2155077Z 2025-05-07T19:45:08.2155080Z 2025-05-07T19:45:08.2155084Z 2025-05-07T19:45:08.2155087Z 2025-05-07T19:45:08.2155093Z 2025-05-07T19:45:08.2155222Z  2025-05-07T19:45:08.2155414Z 2025-05-07T19:45:08.2155417Z 2025-05-07T19:45:08.2155420Z 2025-05-07T19:45:08.2155424Z 2025-05-07T19:45:08.2155427Z 2025-05-07T19:45:08.2155431Z 2025-05-07T19:45:08.2155434Z 2025-05-07T19:45:08.2155438Z 2025-05-07T19:45:08.2155441Z 2025-05-07T19:45:08.2155445Z 2025-05-07T19:45:08.2155577Z  2025-05-07T19:45:08.2155776Z 2025-05-07T19:45:08.2155780Z 2025-05-07T19:45:08.2155783Z 2025-05-07T19:45:08.2155786Z 2025-05-07T19:45:08.2155789Z 2025-05-07T19:45:08.2155793Z 2025-05-07T19:45:08.2155796Z 2025-05-07T19:45:08.2155799Z 2025-05-07T19:45:08.2155802Z 2025-05-07T19:45:08.2155806Z 2025-05-07T19:45:08.2155809Z 2025-05-07T19:45:08.2155941Z  2025-05-07T19:45:08.2156151Z 2025-05-07T19:45:08.2156154Z 2025-05-07T19:45:08.2156158Z 2025-05-07T19:45:08.2156161Z 2025-05-07T19:45:08.2156164Z 2025-05-07T19:45:08.2156168Z 2025-05-07T19:45:08.2156174Z 2025-05-07T19:45:08.2156177Z 2025-05-07T19:45:08.2156181Z 2025-05-07T19:45:08.2156184Z 2025-05-07T19:45:08.2156187Z 2025-05-07T19:45:08.2156191Z 2025-05-07T19:45:08.2156355Z  2025-05-07T19:45:08.2156542Z 2025-05-07T19:45:08.2156546Z 2025-05-07T19:45:08.2156549Z 2025-05-07T19:45:08.2156553Z 2025-05-07T19:45:08.2156556Z 2025-05-07T19:45:08.2156560Z 2025-05-07T19:45:08.2156563Z 2025-05-07T19:45:08.2156566Z 2025-05-07T19:45:08.2156569Z 2025-05-07T19:45:08.2156573Z 2025-05-07T19:45:08.2156576Z 2025-05-07T19:45:08.2156579Z 2025-05-07T19:45:08.2156583Z 2025-05-07T19:45:08.2156752Z  2025-05-07T19:45:08.2156946Z 2025-05-07T19:45:08.2156949Z 2025-05-07T19:45:08.2156953Z 2025-05-07T19:45:08.2156956Z 2025-05-07T19:45:08.2156960Z 2025-05-07T19:45:08.2157021Z 2025-05-07T19:45:08.2157025Z 2025-05-07T19:45:08.2157028Z 2025-05-07T19:45:08.2157031Z 2025-05-07T19:45:08.2157035Z 2025-05-07T19:45:08.2157038Z 2025-05-07T19:45:08.2157147Z 2025-05-07T19:45:08.2157150Z 2025-05-07T19:45:08.2157154Z 2025-05-07T19:45:08.2157335Z  2025-05-07T19:45:08.2157535Z 2025-05-07T19:45:08.2157539Z 2025-05-07T19:45:08.2157542Z 2025-05-07T19:45:08.2157545Z 2025-05-07T19:45:08.2157549Z 2025-05-07T19:45:08.2157552Z 2025-05-07T19:45:08.2157555Z 2025-05-07T19:45:08.2157558Z 2025-05-07T19:45:08.2157561Z 2025-05-07T19:45:08.2157565Z 2025-05-07T19:45:08.2157568Z 2025-05-07T19:45:08.2157591Z 2025-05-07T19:45:08.2157594Z 2025-05-07T19:45:08.2157597Z 2025-05-07T19:45:08.2157600Z 2025-05-07T19:45:08.2157750Z  2025-05-07T19:45:08.2157962Z 2025-05-07T19:45:08.2157965Z 2025-05-07T19:45:08.2157968Z 2025-05-07T19:45:08.2157972Z 2025-05-07T19:45:08.2157975Z 2025-05-07T19:45:08.2157983Z 2025-05-07T19:45:08.2158013Z 2025-05-07T19:45:08.2158016Z 2025-05-07T19:45:08.2158020Z 2025-05-07T19:45:08.2158023Z 2025-05-07T19:45:08.2158026Z 2025-05-07T19:45:08.2158033Z 2025-05-07T19:45:08.2158036Z 2025-05-07T19:45:08.2158040Z 2025-05-07T19:45:08.2158043Z 2025-05-07T19:45:08.2158047Z 2025-05-07T19:45:08.2158205Z  2025-05-07T19:45:08.2158419Z 2025-05-07T19:45:08.2158450Z 2025-05-07T19:45:08.2158453Z 2025-05-07T19:45:08.2158456Z 2025-05-07T19:45:08.2158459Z 2025-05-07T19:45:08.2158463Z 2025-05-07T19:45:08.2158466Z 2025-05-07T19:45:08.2158469Z 2025-05-07T19:45:08.2158473Z 2025-05-07T19:45:08.2158476Z 2025-05-07T19:45:08.2158480Z 2025-05-07T19:45:08.2158483Z 2025-05-07T19:45:08.2158486Z 2025-05-07T19:45:08.2158489Z 2025-05-07T19:45:08.2158492Z 2025-05-07T19:45:08.2158495Z 2025-05-07T19:45:08.2158498Z 2025-05-07T19:45:08.2158662Z  2025-05-07T19:45:08.2158916Z 2025-05-07T19:45:08.2158924Z 2025-05-07T19:45:08.2158927Z 2025-05-07T19:45:08.2158931Z 2025-05-07T19:45:08.2158934Z 2025-05-07T19:45:08.2158938Z 2025-05-07T19:45:08.2158941Z 2025-05-07T19:45:08.2158948Z 2025-05-07T19:45:08.2158951Z 2025-05-07T19:45:08.2158954Z 2025-05-07T19:45:08.2158958Z 2025-05-07T19:45:08.2158961Z 2025-05-07T19:45:08.2158964Z 2025-05-07T19:45:08.2158967Z 2025-05-07T19:45:08.2158971Z 2025-05-07T19:45:08.2158974Z 2025-05-07T19:45:08.2158978Z 2025-05-07T19:45:08.2158982Z 2025-05-07T19:45:08.2159183Z  2025-05-07T19:45:08.2159412Z 2025-05-07T19:45:08.2159415Z 2025-05-07T19:45:08.2159517Z  2025-05-07T19:45:08.2159651Z 2025-05-07T19:45:08.2159655Z 2025-05-07T19:45:08.2159759Z  2025-05-07T19:45:08.2159876Z 2025-05-07T19:45:08.2159880Z 2025-05-07T19:45:08.2159883Z 2025-05-07T19:45:08.2160015Z  2025-05-07T19:45:08.2160131Z 2025-05-07T19:45:08.2160135Z 2025-05-07T19:45:08.2160139Z 2025-05-07T19:45:08.2160146Z 2025-05-07T19:45:08.2160277Z  2025-05-07T19:45:08.2160429Z 2025-05-07T19:45:08.2160432Z 2025-05-07T19:45:08.2160435Z 2025-05-07T19:45:08.2160440Z 2025-05-07T19:45:08.2160446Z 2025-05-07T19:45:08.2160572Z  done 2025-05-07T19:45:08.5258961Z Preparing transaction: | / - done 2025-05-07T19:45:12.3926034Z Verifying transaction: | / - \ | / - \ | / - \ | / - \ | / - \ | / - \ | / - \ | / - done 2025-05-07T19:45:15.2166191Z Executing transaction: | / - \ | / - \ | / - \ | / - \ | / - \ | / - \ | / - \ done 2025-05-07T19:45:15.6330886Z [INSTALL] Adding symlink librhash.so.0, which is needed by CMake ... 2025-05-07T19:45:17.4924059Z + ln -s /github/home/miniconda/envs/build_binary/lib/librhash.so /github/home/miniconda/envs/build_binary/lib/librhash.so.0 2025-05-07T19:45:17.4924672Z 2025-05-07T19:45:17.4935994Z 2025-05-07T19:45:17.4964924Z [EXEC] [ATTEMPT 0/3] + conda run -n build_binary pip install build 2025-05-07T19:45:19.8273822Z WARNING: Running pip as the 'root' user can result in broken permissions and conflicting behaviour with the system package manager, possibly rendering your system unusable. It is recommended to use a virtual environment instead: https://pip.pypa.io/warnings/venv. Use the --root-user-action option if you know what you are doing and want to suppress this warning. 2025-05-07T19:45:19.8275910Z 2025-05-07T19:45:19.8276022Z Collecting build 2025-05-07T19:45:19.8276439Z Downloading build-1.2.2.post1-py3-none-any.whl.metadata (6.5 kB) 2025-05-07T19:45:19.8277296Z Requirement already satisfied: packaging>=19.1 in /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages (from build) (25.0) 2025-05-07T19:45:19.8278069Z Collecting pyproject_hooks (from build) 2025-05-07T19:45:19.8278555Z Downloading pyproject_hooks-1.2.0-py3-none-any.whl.metadata (1.3 kB) 2025-05-07T19:45:19.8279105Z Downloading build-1.2.2.post1-py3-none-any.whl (22 kB) 2025-05-07T19:45:19.8279606Z Downloading pyproject_hooks-1.2.0-py3-none-any.whl (10 kB) 2025-05-07T19:45:19.8280071Z Installing collected packages: pyproject_hooks, build 2025-05-07T19:45:19.8280383Z 2025-05-07T19:45:19.8280591Z Successfully installed build-1.2.2.post1 pyproject_hooks-1.2.0 2025-05-07T19:45:19.8280908Z 2025-05-07T19:45:21.6909694Z /github/home/miniconda/envs/build_binary/bin/make 2025-05-07T19:45:21.6910042Z 2025-05-07T19:45:21.7486375Z [CHECK] Binary make found in PATH 2025-05-07T19:45:23.5244910Z /github/home/miniconda/envs/build_binary/bin/cmake 2025-05-07T19:45:23.5245760Z 2025-05-07T19:45:23.6006238Z [CHECK] Binary cmake found in PATH 2025-05-07T19:45:25.4095853Z /github/home/miniconda/envs/build_binary/bin/ninja 2025-05-07T19:45:25.4096704Z 2025-05-07T19:45:25.4772166Z [CHECK] Binary ninja found in PATH 2025-05-07T19:45:27.3683978Z [CHECK] Python (sub-)package 'click' found ... 2025-05-07T19:45:29.3619007Z [CHECK] Python (sub-)package 'hypothesis' found ... 2025-05-07T19:45:31.2725231Z [CHECK] Python (sub-)package 'jinja2' found ... 2025-05-07T19:45:33.2857826Z [CHECK] Python (sub-)package 'skbuild' found ... 2025-05-07T19:45:35.1843821Z [CHECK] Python (sub-)package 'wheel' found ... 2025-05-07T19:45:35.1844439Z [INSTALL] Successfully installed all the build tools 2025-05-07T19:45:35.1922205Z ##[group]Run . $PRELUDE; install_cuda $BUILD_ENV 12.6.3 2025-05-07T19:45:35.1922686Z . $PRELUDE; install_cuda $BUILD_ENV 12.6.3 2025-05-07T19:45:35.1923294Z shell: bash --noprofile --norc -e -o pipefail {0} 2025-05-07T19:45:35.1923653Z env: 2025-05-07T19:45:35.1923896Z PRELUDE: .github/scripts/setup_env.bash 2025-05-07T19:45:35.1924241Z BUILD_ENV: build_binary 2025-05-07T19:45:35.1924495Z BUILD_TARGET: default 2025-05-07T19:45:35.1924765Z BUILD_VARIANT: cuda 2025-05-07T19:45:35.1925038Z BUILD_CUDA_VERSION: 12.6.3 2025-05-07T19:45:35.1925296Z ##[endgroup] 2025-05-07T19:45:35.6012939Z ################################################################################ 2025-05-07T19:45:35.6013363Z # Install CUDA 2025-05-07T19:45:35.6013631Z # 2025-05-07T19:45:35.6029566Z # [2025-05-07T19:45:35.602Z] + install_cuda build_binary 12.6.3 2025-05-07T19:45:35.6030096Z ################################################################################ 2025-05-07T19:45:35.6030500Z 2025-05-07T19:45:35.6053823Z [EXEC] [ATTEMPT 0/3] + wget -q --timeout 1 pypi.org -O /dev/null 2025-05-07T19:45:35.6906277Z [CHECK] Network does not appear to be blocked. 2025-05-07T19:45:35.6907354Z [SETUP] Cleaning up Conda packages ... 2025-05-07T19:45:35.6916714Z + conda clean --packages --tarball -y 2025-05-07T19:45:35.6917058Z 2025-05-07T19:45:36.1518904Z Will remove 130 (465.2 MB) tarball(s). 2025-05-07T19:45:36.1519468Z Will remove 14 (1.7 MB) package(s). 2025-05-07T19:45:36.2074520Z 2025-05-07T19:45:36.2091560Z + conda clean --all -y 2025-05-07T19:45:36.2092085Z 2025-05-07T19:45:36.8180968Z There are no unused tarball(s) to remove. 2025-05-07T19:45:36.8181978Z Will remove 1 index cache(s). 2025-05-07T19:45:36.8182806Z There are no unused package(s) to remove. 2025-05-07T19:45:36.8183721Z There are no tempfile(s) to remove. 2025-05-07T19:45:36.8184288Z There are no logfile(s) to remove. 2025-05-07T19:45:36.8753906Z 2025-05-07T19:45:36.8762129Z [INSTALL] Installing CUDA 12.6.3 ... 2025-05-07T19:45:36.8790908Z [EXEC] [ATTEMPT 0/3] + conda install --force-reinstall -n build_binary -c conda-forge --override-channels -y cuda=12.6.3 2025-05-07T19:45:37.7042951Z Channels: 2025-05-07T19:45:37.7043607Z - conda-forge 2025-05-07T19:45:37.7044356Z Platform: linux-64 2025-05-07T19:45:47.5677978Z Collecting package metadata (repodata.json): - \ | / - \ | / - \ | / - \ | / - done 2025-05-07T19:45:49.0113335Z Solving environment: | / - \ done 2025-05-07T19:45:49.1416049Z 2025-05-07T19:45:49.1416460Z ## Package Plan ## 2025-05-07T19:45:49.1416652Z 2025-05-07T19:45:49.1417002Z environment location: /github/home/miniconda/envs/build_binary 2025-05-07T19:45:49.1417391Z 2025-05-07T19:45:49.1417527Z added / updated specs: 2025-05-07T19:45:49.1417791Z - cuda=12.6.3 2025-05-07T19:45:49.1417952Z 2025-05-07T19:45:49.1417957Z 2025-05-07T19:45:49.1418113Z The following packages will be downloaded: 2025-05-07T19:45:49.1418341Z 2025-05-07T19:45:49.1418476Z package | build 2025-05-07T19:45:49.1418829Z ---------------------------|----------------- 2025-05-07T19:45:49.1419210Z attr-2.5.1 | h166bdaf_1 69 KB conda-forge 2025-05-07T19:45:49.1419638Z binutils-2.40 | h4852527_7 31 KB conda-forge 2025-05-07T19:45:49.1420138Z c-compiler-1.5.2 | h0b41bf4_0 6 KB conda-forge 2025-05-07T19:45:49.1420685Z cuda-12.6.3 | ha804496_0 26 KB conda-forge 2025-05-07T19:45:49.1421320Z cuda-cccl_linux-64-12.6.77 | ha770c72_0 1.0 MB conda-forge 2025-05-07T19:45:49.1421844Z cuda-command-line-tools-12.6.3| ha770c72_0 20 KB conda-forge 2025-05-07T19:45:49.1422383Z cuda-compiler-12.6.3 | hbad6d8a_0 20 KB conda-forge 2025-05-07T19:45:49.1422894Z cuda-crt-dev_linux-64-12.6.85| ha770c72_0 87 KB conda-forge 2025-05-07T19:45:49.1423781Z cuda-crt-tools-12.6.85 | ha770c72_0 26 KB conda-forge 2025-05-07T19:45:49.1424399Z cuda-cudart-12.6.77 | h5888daf_0 22 KB conda-forge 2025-05-07T19:45:49.1424986Z cuda-cudart-dev-12.6.77 | h5888daf_0 22 KB conda-forge 2025-05-07T19:45:49.1425497Z cuda-cudart-dev_linux-64-12.6.77| h3f2d84a_0 357 KB conda-forge 2025-05-07T19:45:49.1425997Z cuda-cudart-static-12.6.77 | h5888daf_0 22 KB conda-forge 2025-05-07T19:45:49.1426529Z cuda-cudart-static_linux-64-12.6.77| h3f2d84a_0 744 KB conda-forge 2025-05-07T19:45:49.1427034Z cuda-cudart_linux-64-12.6.77| h3f2d84a_0 184 KB conda-forge 2025-05-07T19:45:49.1427533Z cuda-cuobjdump-12.6.77 | hbd13f7d_1 241 KB conda-forge 2025-05-07T19:45:49.1428001Z cuda-cupti-12.6.80 | hbd13f7d_0 1.9 MB conda-forge 2025-05-07T19:45:49.1429004Z cuda-cupti-dev-12.6.80 | h5888daf_0 3.4 MB conda-forge 2025-05-07T19:45:49.1429523Z cuda-cuxxfilt-12.6.77 | hbd13f7d_1 211 KB conda-forge 2025-05-07T19:45:49.1430017Z cuda-driver-dev-12.6.77 | h5888daf_0 22 KB conda-forge 2025-05-07T19:45:49.1430560Z cuda-driver-dev_linux-64-12.6.77| h3f2d84a_0 35 KB conda-forge 2025-05-07T19:45:49.1431051Z cuda-gdb-12.6.77 | h50b4baa_1 370 KB conda-forge 2025-05-07T19:45:49.1431538Z cuda-libraries-12.6.3 | ha770c72_0 20 KB conda-forge 2025-05-07T19:45:49.1432063Z cuda-libraries-dev-12.6.3 | ha770c72_0 20 KB conda-forge 2025-05-07T19:45:49.1432553Z cuda-nsight-12.6.77 | h7938cbb_0 113.2 MB conda-forge 2025-05-07T19:45:49.1433027Z cuda-nvcc-12.6.85 | hcdd1206_0 23 KB conda-forge 2025-05-07T19:45:49.1433674Z cuda-nvcc-dev_linux-64-12.6.85| he91c749_0 10.8 MB conda-forge 2025-05-07T19:45:49.1434203Z cuda-nvcc-impl-12.6.85 | h85509e4_0 25 KB conda-forge 2025-05-07T19:45:49.1434686Z cuda-nvcc-tools-12.6.85 | he02047a_0 23.0 MB conda-forge 2025-05-07T19:45:49.1435192Z cuda-nvcc_linux-64-12.6.85 | h04802cd_0 25 KB conda-forge 2025-05-07T19:45:49.1435692Z cuda-nvdisasm-12.6.77 | hbd13f7d_1 47.6 MB conda-forge 2025-05-07T19:45:49.1436166Z cuda-nvml-dev-12.6.77 | hbd13f7d_1 159 KB conda-forge 2025-05-07T19:45:49.1436654Z cuda-nvprof-12.6.80 | hbd13f7d_0 2.6 MB conda-forge 2025-05-07T19:45:49.1437117Z cuda-nvprune-12.6.77 | hbd13f7d_1 66 KB conda-forge 2025-05-07T19:45:49.1437591Z cuda-nvrtc-12.6.85 | hbd13f7d_0 17.3 MB conda-forge 2025-05-07T19:45:49.1438135Z cuda-nvrtc-dev-12.6.85 | h5888daf_0 31 KB conda-forge 2025-05-07T19:45:49.1438617Z cuda-nvtx-12.6.77 | hbd13f7d_0 31 KB conda-forge 2025-05-07T19:45:49.1439097Z cuda-nvvm-dev_linux-64-12.6.85| ha770c72_0 25 KB conda-forge 2025-05-07T19:45:49.1439614Z cuda-nvvm-impl-12.6.85 | he02047a_0 7.7 MB conda-forge 2025-05-07T19:45:49.1440092Z cuda-nvvm-tools-12.6.85 | he02047a_0 10.4 MB conda-forge 2025-05-07T19:45:49.1440570Z cuda-nvvp-12.6.80 | hbd13f7d_1 109.3 MB conda-forge 2025-05-07T19:45:49.1441035Z cuda-opencl-12.6.77 | hbd13f7d_0 29 KB conda-forge 2025-05-07T19:45:49.1441608Z cuda-opencl-dev-12.6.77 | h5888daf_0 93 KB conda-forge 2025-05-07T19:45:49.1442096Z cuda-profiler-api-12.6.77 | h7938cbb_0 22 KB conda-forge 2025-05-07T19:45:49.1442549Z cuda-runtime-12.6.3 | ha804496_0 19 KB conda-forge 2025-05-07T19:45:49.1443024Z cuda-sanitizer-api-12.6.77 | hbd13f7d_1 8.9 MB conda-forge 2025-05-07T19:45:49.1443628Z cuda-toolkit-12.6.3 | ha804496_0 19 KB conda-forge 2025-05-07T19:45:49.1444076Z cuda-tools-12.6.3 | ha770c72_0 19 KB conda-forge 2025-05-07T19:45:49.1444516Z cuda-version-12.6 | h7480c83_3 20 KB conda-forge 2025-05-07T19:45:49.1444962Z cuda-visual-tools-12.6.3 | ha770c72_0 19 KB conda-forge 2025-05-07T19:45:49.1445427Z cxx-compiler-1.5.2 | hf52228f_0 6 KB conda-forge 2025-05-07T19:45:49.1445826Z dbus-1.13.6 | h5008d03_3 604 KB conda-forge 2025-05-07T19:45:49.1446211Z gcc-11.4.0 | h602e360_13 49 KB conda-forge 2025-05-07T19:45:49.1446614Z gds-tools-1.11.1.6 | h5888daf_4 37.8 MB conda-forge 2025-05-07T19:45:49.1447001Z gmp-6.3.0 | hac33072_2 449 KB conda-forge 2025-05-07T19:45:49.1447382Z gxx-11.4.0 | h602e360_13 49 KB conda-forge 2025-05-07T19:45:49.1447757Z libcap-2.75 | h39aace5_0 118 KB conda-forge 2025-05-07T19:45:49.1448183Z libcublas-12.6.4.1 | h5888daf_1 256.2 MB conda-forge 2025-05-07T19:45:49.1448618Z libcublas-dev-12.6.4.1 | h5888daf_1 88 KB conda-forge 2025-05-07T19:45:49.1449065Z libcufft-11.3.0.4 | hbd13f7d_0 156.2 MB conda-forge 2025-05-07T19:45:49.1449506Z libcufft-dev-11.3.0.4 | h5888daf_0 33 KB conda-forge 2025-05-07T19:45:49.1450234Z libcufile-1.11.1.6 | h12f29b5_4 900 KB conda-forge 2025-05-07T19:45:49.1450815Z libcufile-dev-1.11.1.6 | h5888daf_4 35 KB conda-forge 2025-05-07T19:45:49.1451289Z libcurand-10.3.7.77 | hbd13f7d_0 39.9 MB conda-forge 2025-05-07T19:45:49.1451781Z libcurand-dev-10.3.7.77 | h5888daf_0 262 KB conda-forge 2025-05-07T19:45:49.1452344Z libcusolver-11.7.1.2 | h5888daf_1 95.8 MB conda-forge 2025-05-07T19:45:49.1452848Z libcusolver-dev-11.7.1.2 | h5888daf_1 59 KB conda-forge 2025-05-07T19:45:49.1453352Z libcusparse-12.5.4.2 | hbd13f7d_0 118.6 MB conda-forge 2025-05-07T19:45:49.1453843Z libcusparse-dev-12.5.4.2 | h5888daf_0 51 KB conda-forge 2025-05-07T19:45:49.1454354Z libgcrypt-lib-1.11.0 | hb9d3cd8_2 572 KB conda-forge 2025-05-07T19:45:49.1454830Z libgpg-error-1.55 | h3f2d84a_0 305 KB conda-forge 2025-05-07T19:45:49.1455290Z libnl-3.11.0 | hb9d3cd8_0 724 KB conda-forge 2025-05-07T19:45:49.1455733Z libnpp-12.3.1.54 | h5888daf_0 93.4 MB conda-forge 2025-05-07T19:45:49.1456176Z libnpp-dev-12.3.1.54 | h5888daf_0 441 KB conda-forge 2025-05-07T19:45:49.1456642Z libnuma-2.0.18 | h4ab18f5_2 42 KB conda-forge 2025-05-07T19:45:49.1457098Z libnvfatbin-12.6.77 | hbd13f7d_0 783 KB conda-forge 2025-05-07T19:45:49.1457599Z libnvfatbin-dev-12.6.77 | h5888daf_0 26 KB conda-forge 2025-05-07T19:45:49.1458085Z libnvjitlink-12.6.85 | hbd13f7d_0 14.9 MB conda-forge 2025-05-07T19:45:49.1458593Z libnvjitlink-dev-12.6.85 | h5888daf_0 25 KB conda-forge 2025-05-07T19:45:49.1459089Z libnvjpeg-12.3.3.54 | h5888daf_0 2.4 MB conda-forge 2025-05-07T19:45:49.1459559Z libnvjpeg-dev-12.3.3.54 | ha770c72_0 31 KB conda-forge 2025-05-07T19:45:49.1460049Z libsystemd0-257.4 | h4e0b6ca_1 477 KB conda-forge 2025-05-07T19:45:49.1460503Z libudev1-257.4 | hbe16f8c_1 141 KB conda-forge 2025-05-07T19:45:49.1460970Z libxkbcommon-1.9.2 | h65c71a3_0 660 KB conda-forge 2025-05-07T19:45:49.1461516Z libxkbfile-1.1.0 | h166bdaf_1 111 KB conda-forge 2025-05-07T19:45:49.1461976Z libxml2-2.13.8 | h4bc477f_0 675 KB conda-forge 2025-05-07T19:45:49.1462404Z lz4-c-1.10.0 | h5888daf_1 163 KB conda-forge 2025-05-07T19:45:49.1462946Z nsight-compute-2024.3.2.3 | hb5ebaad_0 443.1 MB conda-forge 2025-05-07T19:45:49.1463387Z nspr-4.36 | h5888daf_0 225 KB conda-forge 2025-05-07T19:45:49.1463751Z nss-3.111 | h159eef7_0 1.9 MB conda-forge 2025-05-07T19:45:49.1464142Z ocl-icd-2.3.3 | hb9d3cd8_0 104 KB conda-forge 2025-05-07T19:45:49.1464593Z opencl-headers-2024.10.24 | h5888daf_0 53 KB conda-forge 2025-05-07T19:45:49.1465028Z rdma-core-57.0 | h5888daf_0 1.2 MB conda-forge 2025-05-07T19:45:49.1465446Z wayland-1.23.1 | h3e06ad9_0 314 KB conda-forge 2025-05-07T19:45:49.1465843Z xcb-util-0.4.1 | hb711507_2 19 KB conda-forge 2025-05-07T19:45:49.1466290Z xcb-util-cursor-0.1.5 | hb9d3cd8_0 20 KB conda-forge 2025-05-07T19:45:49.1466738Z xcb-util-image-0.4.0 | hb711507_2 24 KB conda-forge 2025-05-07T19:45:49.1467206Z xcb-util-keysyms-0.4.1 | hb711507_0 14 KB conda-forge 2025-05-07T19:45:49.1467700Z xcb-util-renderutil-0.3.10 | hb711507_0 17 KB conda-forge 2025-05-07T19:45:49.1468149Z xcb-util-wm-0.4.2 | hb711507_0 50 KB conda-forge 2025-05-07T19:45:49.1468604Z xkeyboard-config-2.44 | hb9d3cd8_0 384 KB conda-forge 2025-05-07T19:45:49.1469079Z xorg-libxcomposite-0.4.6 | hb9d3cd8_2 13 KB conda-forge 2025-05-07T19:45:49.1469564Z xorg-libxdamage-1.1.6 | hb9d3cd8_0 13 KB conda-forge 2025-05-07T19:45:49.1470047Z ------------------------------------------------------------ 2025-05-07T19:45:49.1470394Z Total: 1.59 GB 2025-05-07T19:45:49.1470599Z 2025-05-07T19:45:49.1470745Z The following NEW packages will be INSTALLED: 2025-05-07T19:45:49.1470965Z 2025-05-07T19:45:49.1471145Z attr conda-forge/linux-64::attr-2.5.1-h166bdaf_1 2025-05-07T19:45:49.1471570Z binutils conda-forge/linux-64::binutils-2.40-h4852527_7 2025-05-07T19:45:49.1472020Z c-compiler conda-forge/linux-64::c-compiler-1.5.2-h0b41bf4_0 2025-05-07T19:45:49.1472466Z cuda conda-forge/noarch::cuda-12.6.3-ha804496_0 2025-05-07T19:45:49.1472953Z cuda-cccl_linux-64 conda-forge/noarch::cuda-cccl_linux-64-12.6.77-ha770c72_0 2025-05-07T19:45:49.1473550Z cuda-command-line~ conda-forge/linux-64::cuda-command-line-tools-12.6.3-ha770c72_0 2025-05-07T19:45:49.1474146Z cuda-compiler conda-forge/noarch::cuda-compiler-12.6.3-hbad6d8a_0 2025-05-07T19:45:49.1474697Z cuda-crt-dev_linu~ conda-forge/noarch::cuda-crt-dev_linux-64-12.6.85-ha770c72_0 2025-05-07T19:45:49.1475272Z cuda-crt-tools conda-forge/linux-64::cuda-crt-tools-12.6.85-ha770c72_0 2025-05-07T19:45:49.1475803Z cuda-cudart conda-forge/linux-64::cuda-cudart-12.6.77-h5888daf_0 2025-05-07T19:45:49.1476319Z cuda-cudart-dev conda-forge/linux-64::cuda-cudart-dev-12.6.77-h5888daf_0 2025-05-07T19:45:49.1476916Z cuda-cudart-dev_l~ conda-forge/noarch::cuda-cudart-dev_linux-64-12.6.77-h3f2d84a_0 2025-05-07T19:45:49.1477520Z cuda-cudart-static conda-forge/linux-64::cuda-cudart-static-12.6.77-h5888daf_0 2025-05-07T19:45:49.1478162Z cuda-cudart-stati~ conda-forge/noarch::cuda-cudart-static_linux-64-12.6.77-h3f2d84a_0 2025-05-07T19:45:49.1478789Z cuda-cudart_linux~ conda-forge/noarch::cuda-cudart_linux-64-12.6.77-h3f2d84a_0 2025-05-07T19:45:49.1479348Z cuda-cuobjdump conda-forge/linux-64::cuda-cuobjdump-12.6.77-hbd13f7d_1 2025-05-07T19:45:49.1479886Z cuda-cupti conda-forge/linux-64::cuda-cupti-12.6.80-hbd13f7d_0 2025-05-07T19:45:49.1480482Z cuda-cupti-dev conda-forge/linux-64::cuda-cupti-dev-12.6.80-h5888daf_0 2025-05-07T19:45:49.1481024Z cuda-cuxxfilt conda-forge/linux-64::cuda-cuxxfilt-12.6.77-hbd13f7d_1 2025-05-07T19:45:49.1481575Z cuda-driver-dev conda-forge/linux-64::cuda-driver-dev-12.6.77-h5888daf_0 2025-05-07T19:45:49.1482151Z cuda-driver-dev_l~ conda-forge/noarch::cuda-driver-dev_linux-64-12.6.77-h3f2d84a_0 2025-05-07T19:45:49.1482690Z cuda-gdb conda-forge/linux-64::cuda-gdb-12.6.77-h50b4baa_1 2025-05-07T19:45:49.1483168Z cuda-libraries conda-forge/linux-64::cuda-libraries-12.6.3-ha770c72_0 2025-05-07T19:45:49.1483744Z cuda-libraries-dev conda-forge/linux-64::cuda-libraries-dev-12.6.3-ha770c72_0 2025-05-07T19:45:49.1484301Z cuda-nsight conda-forge/linux-64::cuda-nsight-12.6.77-h7938cbb_0 2025-05-07T19:45:49.1484768Z cuda-nvcc conda-forge/linux-64::cuda-nvcc-12.6.85-hcdd1206_0 2025-05-07T19:45:49.1485304Z cuda-nvcc-dev_lin~ conda-forge/noarch::cuda-nvcc-dev_linux-64-12.6.85-he91c749_0 2025-05-07T19:45:49.1485874Z cuda-nvcc-impl conda-forge/linux-64::cuda-nvcc-impl-12.6.85-h85509e4_0 2025-05-07T19:45:49.1486404Z cuda-nvcc-tools conda-forge/linux-64::cuda-nvcc-tools-12.6.85-he02047a_0 2025-05-07T19:45:49.1486966Z cuda-nvcc_linux-64 conda-forge/linux-64::cuda-nvcc_linux-64-12.6.85-h04802cd_0 2025-05-07T19:45:49.1487498Z cuda-nvdisasm conda-forge/linux-64::cuda-nvdisasm-12.6.77-hbd13f7d_1 2025-05-07T19:45:49.1488022Z cuda-nvml-dev conda-forge/linux-64::cuda-nvml-dev-12.6.77-hbd13f7d_1 2025-05-07T19:45:49.1488534Z cuda-nvprof conda-forge/linux-64::cuda-nvprof-12.6.80-hbd13f7d_0 2025-05-07T19:45:49.1489028Z cuda-nvprune conda-forge/linux-64::cuda-nvprune-12.6.77-hbd13f7d_1 2025-05-07T19:45:49.1489528Z cuda-nvrtc conda-forge/linux-64::cuda-nvrtc-12.6.85-hbd13f7d_0 2025-05-07T19:45:49.1490403Z cuda-nvrtc-dev conda-forge/linux-64::cuda-nvrtc-dev-12.6.85-h5888daf_0 2025-05-07T19:45:49.1490992Z cuda-nvtx conda-forge/linux-64::cuda-nvtx-12.6.77-hbd13f7d_0 2025-05-07T19:45:49.1491547Z cuda-nvvm-dev_lin~ conda-forge/noarch::cuda-nvvm-dev_linux-64-12.6.85-ha770c72_0 2025-05-07T19:45:49.1492161Z cuda-nvvm-impl conda-forge/linux-64::cuda-nvvm-impl-12.6.85-he02047a_0 2025-05-07T19:45:49.1492756Z cuda-nvvm-tools conda-forge/linux-64::cuda-nvvm-tools-12.6.85-he02047a_0 2025-05-07T19:45:49.1493291Z cuda-nvvp conda-forge/linux-64::cuda-nvvp-12.6.80-hbd13f7d_1 2025-05-07T19:45:49.1493820Z cuda-opencl conda-forge/linux-64::cuda-opencl-12.6.77-hbd13f7d_0 2025-05-07T19:45:49.1514052Z cuda-opencl-dev conda-forge/linux-64::cuda-opencl-dev-12.6.77-h5888daf_0 2025-05-07T19:45:49.1514800Z cuda-profiler-api conda-forge/linux-64::cuda-profiler-api-12.6.77-h7938cbb_0 2025-05-07T19:45:49.1515438Z cuda-runtime conda-forge/noarch::cuda-runtime-12.6.3-ha804496_0 2025-05-07T19:45:49.1516073Z cuda-sanitizer-api conda-forge/linux-64::cuda-sanitizer-api-12.6.77-hbd13f7d_1 2025-05-07T19:45:49.1516665Z cuda-toolkit conda-forge/noarch::cuda-toolkit-12.6.3-ha804496_0 2025-05-07T19:45:49.1517196Z cuda-tools conda-forge/linux-64::cuda-tools-12.6.3-ha770c72_0 2025-05-07T19:45:49.1517721Z cuda-version conda-forge/noarch::cuda-version-12.6-h7480c83_3 2025-05-07T19:45:49.1518287Z cuda-visual-tools conda-forge/linux-64::cuda-visual-tools-12.6.3-ha770c72_0 2025-05-07T19:45:49.1518885Z cxx-compiler conda-forge/linux-64::cxx-compiler-1.5.2-hf52228f_0 2025-05-07T19:45:49.1519362Z dbus conda-forge/linux-64::dbus-1.13.6-h5008d03_3 2025-05-07T19:45:49.1519793Z gcc conda-forge/linux-64::gcc-11.4.0-h602e360_13 2025-05-07T19:45:49.1520259Z gds-tools conda-forge/linux-64::gds-tools-1.11.1.6-h5888daf_4 2025-05-07T19:45:49.1520707Z gmp conda-forge/linux-64::gmp-6.3.0-hac33072_2 2025-05-07T19:45:49.1521132Z gxx conda-forge/linux-64::gxx-11.4.0-h602e360_13 2025-05-07T19:45:49.1521740Z libcap conda-forge/linux-64::libcap-2.75-h39aace5_0 2025-05-07T19:45:49.1522238Z libcublas conda-forge/linux-64::libcublas-12.6.4.1-h5888daf_1 2025-05-07T19:45:49.1522795Z libcublas-dev conda-forge/linux-64::libcublas-dev-12.6.4.1-h5888daf_1 2025-05-07T19:45:49.1523322Z libcufft conda-forge/linux-64::libcufft-11.3.0.4-hbd13f7d_0 2025-05-07T19:45:49.1523855Z libcufft-dev conda-forge/linux-64::libcufft-dev-11.3.0.4-h5888daf_0 2025-05-07T19:45:49.1524380Z libcufile conda-forge/linux-64::libcufile-1.11.1.6-h12f29b5_4 2025-05-07T19:45:49.1524935Z libcufile-dev conda-forge/linux-64::libcufile-dev-1.11.1.6-h5888daf_4 2025-05-07T19:45:49.1525490Z libcurand conda-forge/linux-64::libcurand-10.3.7.77-hbd13f7d_0 2025-05-07T19:45:49.1526030Z libcurand-dev conda-forge/linux-64::libcurand-dev-10.3.7.77-h5888daf_0 2025-05-07T19:45:49.1526615Z libcusolver conda-forge/linux-64::libcusolver-11.7.1.2-h5888daf_1 2025-05-07T19:45:49.1527181Z libcusolver-dev conda-forge/linux-64::libcusolver-dev-11.7.1.2-h5888daf_1 2025-05-07T19:45:49.1527770Z libcusparse conda-forge/linux-64::libcusparse-12.5.4.2-hbd13f7d_0 2025-05-07T19:45:49.1528351Z libcusparse-dev conda-forge/linux-64::libcusparse-dev-12.5.4.2-h5888daf_0 2025-05-07T19:45:49.1529477Z libgcrypt-lib conda-forge/linux-64::libgcrypt-lib-1.11.0-hb9d3cd8_2 2025-05-07T19:45:49.1530342Z libgpg-error conda-forge/linux-64::libgpg-error-1.55-h3f2d84a_0 2025-05-07T19:45:49.1531089Z libnl conda-forge/linux-64::libnl-3.11.0-hb9d3cd8_0 2025-05-07T19:45:49.1531664Z libnpp conda-forge/linux-64::libnpp-12.3.1.54-h5888daf_0 2025-05-07T19:45:49.1532174Z libnpp-dev conda-forge/linux-64::libnpp-dev-12.3.1.54-h5888daf_0 2025-05-07T19:45:49.1532666Z libnuma conda-forge/linux-64::libnuma-2.0.18-h4ab18f5_2 2025-05-07T19:45:49.1533367Z libnvfatbin conda-forge/linux-64::libnvfatbin-12.6.77-hbd13f7d_0 2025-05-07T19:45:49.1533933Z libnvfatbin-dev conda-forge/linux-64::libnvfatbin-dev-12.6.77-h5888daf_0 2025-05-07T19:45:49.1534524Z libnvjitlink conda-forge/linux-64::libnvjitlink-12.6.85-hbd13f7d_0 2025-05-07T19:45:49.1535123Z libnvjitlink-dev conda-forge/linux-64::libnvjitlink-dev-12.6.85-h5888daf_0 2025-05-07T19:45:49.1535689Z libnvjpeg conda-forge/linux-64::libnvjpeg-12.3.3.54-h5888daf_0 2025-05-07T19:45:49.1536252Z libnvjpeg-dev conda-forge/linux-64::libnvjpeg-dev-12.3.3.54-ha770c72_0 2025-05-07T19:45:49.1536799Z libsystemd0 conda-forge/linux-64::libsystemd0-257.4-h4e0b6ca_1 2025-05-07T19:45:49.1537307Z libudev1 conda-forge/linux-64::libudev1-257.4-hbe16f8c_1 2025-05-07T19:45:49.1537824Z libxkbcommon conda-forge/linux-64::libxkbcommon-1.9.2-h65c71a3_0 2025-05-07T19:45:49.1538348Z libxkbfile conda-forge/linux-64::libxkbfile-1.1.0-h166bdaf_1 2025-05-07T19:45:49.1538852Z libxml2 conda-forge/linux-64::libxml2-2.13.8-h4bc477f_0 2025-05-07T19:45:49.1539291Z lz4-c conda-forge/linux-64::lz4-c-1.10.0-h5888daf_1 2025-05-07T19:45:49.1539829Z nsight-compute conda-forge/linux-64::nsight-compute-2024.3.2.3-hb5ebaad_0 2025-05-07T19:45:49.1540360Z nspr conda-forge/linux-64::nspr-4.36-h5888daf_0 2025-05-07T19:45:49.1540792Z nss conda-forge/linux-64::nss-3.111-h159eef7_0 2025-05-07T19:45:49.1541222Z ocl-icd conda-forge/linux-64::ocl-icd-2.3.3-hb9d3cd8_0 2025-05-07T19:45:49.1541744Z opencl-headers conda-forge/linux-64::opencl-headers-2024.10.24-h5888daf_0 2025-05-07T19:45:49.1542299Z rdma-core conda-forge/linux-64::rdma-core-57.0-h5888daf_0 2025-05-07T19:45:49.1542773Z wayland conda-forge/linux-64::wayland-1.23.1-h3e06ad9_0 2025-05-07T19:45:49.1543392Z xcb-util conda-forge/linux-64::xcb-util-0.4.1-hb711507_2 2025-05-07T19:45:49.1544047Z xcb-util-cursor conda-forge/linux-64::xcb-util-cursor-0.1.5-hb9d3cd8_0 2025-05-07T19:45:49.1544614Z xcb-util-image conda-forge/linux-64::xcb-util-image-0.4.0-hb711507_2 2025-05-07T19:45:49.1545205Z xcb-util-keysyms conda-forge/linux-64::xcb-util-keysyms-0.4.1-hb711507_0 2025-05-07T19:45:49.1545841Z xcb-util-renderut~ conda-forge/linux-64::xcb-util-renderutil-0.3.10-hb711507_0 2025-05-07T19:45:49.1546415Z xcb-util-wm conda-forge/linux-64::xcb-util-wm-0.4.2-hb711507_0 2025-05-07T19:45:49.1546984Z xkeyboard-config conda-forge/linux-64::xkeyboard-config-2.44-hb9d3cd8_0 2025-05-07T19:45:49.1547598Z xorg-libxcomposite conda-forge/linux-64::xorg-libxcomposite-0.4.6-hb9d3cd8_2 2025-05-07T19:45:49.1548226Z xorg-libxdamage conda-forge/linux-64::xorg-libxdamage-1.1.6-hb9d3cd8_0 2025-05-07T19:45:49.1548570Z 2025-05-07T19:45:49.1548677Z 2025-05-07T19:45:49.1548680Z 2025-05-07T19:45:49.1548850Z Downloading and Extracting Packages: ...working... 2025-05-07T19:45:49.1549257Z nsight-compute-2024. | 443.1 MB | | 0% 2025-05-07T19:45:49.1549532Z 2025-05-07T19:45:49.1549861Z libcublas-12.6.4.1 | 256.2 MB | | 0%  2025-05-07T19:45:49.1550119Z 2025-05-07T19:45:49.1550123Z 2025-05-07T19:45:49.1550362Z libcufft-11.3.0.4 | 156.2 MB | | 0%  2025-05-07T19:45:49.1550624Z 2025-05-07T19:45:49.1550627Z 2025-05-07T19:45:49.1550631Z 2025-05-07T19:45:49.1550867Z libcusparse-12.5.4.2 | 118.6 MB | | 0%  2025-05-07T19:45:49.1551164Z 2025-05-07T19:45:49.1551168Z 2025-05-07T19:45:49.1551171Z 2025-05-07T19:45:49.1551175Z 2025-05-07T19:45:49.1561889Z cuda-nsight-12.6.77 | 113.2 MB | | 0%  2025-05-07T19:45:49.1562198Z 2025-05-07T19:45:49.1562202Z 2025-05-07T19:45:49.1562205Z 2025-05-07T19:45:49.1562463Z 2025-05-07T19:45:49.1562801Z 2025-05-07T19:45:49.1563473Z cuda-nvvp-12.6.80 | 109.3 MB | | 0%  2025-05-07T19:45:49.1565378Z 2025-05-07T19:45:49.1565383Z 2025-05-07T19:45:49.1565387Z 2025-05-07T19:45:49.1565408Z 2025-05-07T19:45:49.1565412Z 2025-05-07T19:45:49.1565442Z 2025-05-07T19:45:49.1565764Z libcusolver-11.7.1.2 | 95.8 MB | | 0%  2025-05-07T19:45:49.1566064Z 2025-05-07T19:45:49.1566067Z 2025-05-07T19:45:49.1566072Z 2025-05-07T19:45:49.1566076Z 2025-05-07T19:45:49.1566081Z 2025-05-07T19:45:49.1566085Z 2025-05-07T19:45:49.1566089Z 2025-05-07T19:45:49.1566356Z libnpp-12.3.1.54 | 93.4 MB | | 0%  2025-05-07T19:45:49.1566635Z 2025-05-07T19:45:49.1566639Z 2025-05-07T19:45:49.1566642Z 2025-05-07T19:45:49.1566646Z 2025-05-07T19:45:49.1566649Z 2025-05-07T19:45:49.1566653Z 2025-05-07T19:45:49.1566656Z 2025-05-07T19:45:49.1566660Z 2025-05-07T19:45:49.1566961Z cuda-nvdisasm-12.6.7 | 47.6 MB | | 0%  2025-05-07T19:45:49.1567266Z 2025-05-07T19:45:49.1567269Z 2025-05-07T19:45:49.1567280Z 2025-05-07T19:45:49.1567284Z 2025-05-07T19:45:49.1567287Z 2025-05-07T19:45:49.1567290Z 2025-05-07T19:45:49.1567299Z 2025-05-07T19:45:49.1567302Z 2025-05-07T19:45:49.1567306Z 2025-05-07T19:45:49.1567596Z libcurand-10.3.7.77 | 39.9 MB | | 0%  2025-05-07T19:45:49.1567900Z 2025-05-07T19:45:49.1567903Z 2025-05-07T19:45:49.1567907Z 2025-05-07T19:45:49.1567910Z 2025-05-07T19:45:49.1567913Z 2025-05-07T19:45:49.1567917Z 2025-05-07T19:45:49.1567920Z 2025-05-07T19:45:49.1567924Z 2025-05-07T19:45:49.1567927Z 2025-05-07T19:45:49.1567931Z 2025-05-07T19:45:49.1568431Z gds-tools-1.11.1.6 | 37.8 MB | | 0%  2025-05-07T19:45:49.1568739Z 2025-05-07T19:45:49.1568758Z 2025-05-07T19:45:49.1568763Z 2025-05-07T19:45:49.1568766Z 2025-05-07T19:45:49.1568770Z 2025-05-07T19:45:49.1568774Z 2025-05-07T19:45:49.1568777Z 2025-05-07T19:45:49.1568780Z 2025-05-07T19:45:49.1568801Z 2025-05-07T19:45:49.1568806Z 2025-05-07T19:45:49.1568823Z 2025-05-07T19:45:49.1569602Z cuda-nvcc-tools-12.6 | 23.0 MB | | 0%  2025-05-07T19:45:49.1570051Z 2025-05-07T19:45:49.1570055Z 2025-05-07T19:45:49.1570058Z 2025-05-07T19:45:49.1570062Z 2025-05-07T19:45:49.1570084Z 2025-05-07T19:45:49.1570088Z 2025-05-07T19:45:49.1570091Z 2025-05-07T19:45:49.1570095Z 2025-05-07T19:45:49.1570098Z 2025-05-07T19:45:49.1570102Z 2025-05-07T19:45:49.1570105Z 2025-05-07T19:45:49.1570108Z 2025-05-07T19:45:49.1578534Z cuda-nvrtc-12.6.85 | 17.3 MB | | 0%  2025-05-07T19:45:49.1578875Z 2025-05-07T19:45:49.1578878Z 2025-05-07T19:45:49.1578881Z 2025-05-07T19:45:49.1578885Z 2025-05-07T19:45:49.1578888Z 2025-05-07T19:45:49.1578892Z 2025-05-07T19:45:49.1578895Z 2025-05-07T19:45:49.1578899Z 2025-05-07T19:45:49.1578902Z 2025-05-07T19:45:49.1578905Z 2025-05-07T19:45:49.1578909Z 2025-05-07T19:45:49.1578912Z 2025-05-07T19:45:49.1578915Z 2025-05-07T19:45:49.1579216Z libnvjitlink-12.6.85 | 14.9 MB | | 0%  2025-05-07T19:45:49.1579554Z 2025-05-07T19:45:49.1579558Z 2025-05-07T19:45:49.1579561Z 2025-05-07T19:45:49.1579565Z 2025-05-07T19:45:49.1579568Z 2025-05-07T19:45:49.1579572Z 2025-05-07T19:45:49.1579575Z 2025-05-07T19:45:49.1579578Z 2025-05-07T19:45:49.1579582Z 2025-05-07T19:45:49.1579585Z 2025-05-07T19:45:49.1579588Z 2025-05-07T19:45:49.1579600Z 2025-05-07T19:45:49.1579603Z 2025-05-07T19:45:49.1579607Z 2025-05-07T19:45:49.1580774Z cuda-nvcc-dev_linux- | 10.8 MB | | 0%  2025-05-07T19:45:49.1581098Z 2025-05-07T19:45:49.1581118Z 2025-05-07T19:45:49.1581122Z 2025-05-07T19:45:49.1581125Z 2025-05-07T19:45:49.1581129Z 2025-05-07T19:45:49.1581132Z 2025-05-07T19:45:49.1581136Z 2025-05-07T19:45:49.1581140Z 2025-05-07T19:45:49.1581143Z 2025-05-07T19:45:49.1581146Z 2025-05-07T19:45:49.1581149Z 2025-05-07T19:45:49.1581153Z 2025-05-07T19:45:49.1581263Z 2025-05-07T19:45:49.1581266Z 2025-05-07T19:45:49.1581270Z 2025-05-07T19:45:49.1581889Z cuda-nvvm-tools-12.6 | 10.4 MB | | 0%  2025-05-07T19:45:49.1582217Z 2025-05-07T19:45:49.1582236Z 2025-05-07T19:45:49.1582240Z 2025-05-07T19:45:49.1582243Z 2025-05-07T19:45:49.1582264Z 2025-05-07T19:45:49.1582267Z 2025-05-07T19:45:49.1582270Z 2025-05-07T19:45:49.1582274Z 2025-05-07T19:45:49.1582277Z 2025-05-07T19:45:49.1582280Z 2025-05-07T19:45:49.1582284Z 2025-05-07T19:45:49.1582287Z 2025-05-07T19:45:49.1582290Z 2025-05-07T19:45:49.1582294Z 2025-05-07T19:45:49.1582297Z 2025-05-07T19:45:49.1582756Z 2025-05-07T19:45:49.1583109Z cuda-sanitizer-api-1 | 8.9 MB | | 0%  2025-05-07T19:45:49.1583481Z 2025-05-07T19:45:49.1583504Z 2025-05-07T19:45:49.1583507Z 2025-05-07T19:45:49.1583510Z 2025-05-07T19:45:49.1583514Z 2025-05-07T19:45:49.1583517Z 2025-05-07T19:45:49.1583520Z 2025-05-07T19:45:49.1583530Z 2025-05-07T19:45:49.1583533Z 2025-05-07T19:45:49.1583537Z 2025-05-07T19:45:49.1583541Z 2025-05-07T19:45:49.1583550Z 2025-05-07T19:45:49.1583553Z 2025-05-07T19:45:49.1583556Z 2025-05-07T19:45:49.1583560Z 2025-05-07T19:45:49.1583563Z 2025-05-07T19:45:49.1583566Z 2025-05-07T19:45:49.1583894Z cuda-nvvm-impl-12.6. | 7.7 MB | | 0%  2025-05-07T19:45:49.1584225Z 2025-05-07T19:45:49.1584229Z 2025-05-07T19:45:49.1584232Z 2025-05-07T19:45:49.1584236Z 2025-05-07T19:45:49.1584239Z 2025-05-07T19:45:49.1584243Z 2025-05-07T19:45:49.1584246Z 2025-05-07T19:45:49.1584249Z 2025-05-07T19:45:49.1584253Z 2025-05-07T19:45:49.1584256Z 2025-05-07T19:45:49.1584277Z 2025-05-07T19:45:49.1584281Z 2025-05-07T19:45:49.1584284Z 2025-05-07T19:45:49.1584297Z 2025-05-07T19:45:49.1584301Z 2025-05-07T19:45:49.1584304Z 2025-05-07T19:45:49.1584307Z 2025-05-07T19:45:49.1585860Z 2025-05-07T19:45:49.1587053Z cuda-cupti-dev-12.6. | 3.4 MB | | 0%  2025-05-07T19:45:49.1587427Z 2025-05-07T19:45:49.1588320Z 2025-05-07T19:45:49.1588326Z 2025-05-07T19:45:49.1588329Z 2025-05-07T19:45:49.1588333Z 2025-05-07T19:45:49.1588336Z 2025-05-07T19:45:49.1588340Z 2025-05-07T19:45:49.1588343Z 2025-05-07T19:45:49.1588347Z 2025-05-07T19:45:49.1588350Z 2025-05-07T19:45:49.1588353Z 2025-05-07T19:45:49.1588357Z 2025-05-07T19:45:49.1588360Z 2025-05-07T19:45:49.1588364Z 2025-05-07T19:45:49.1588367Z 2025-05-07T19:45:49.1588371Z 2025-05-07T19:45:49.1588374Z 2025-05-07T19:45:49.1588377Z 2025-05-07T19:45:49.1588380Z 2025-05-07T19:45:49.2519709Z ... (more hidden) ... 2025-05-07T19:45:49.2520719Z nsight-compute-2024. | 443.1 MB | | 0% 2025-05-07T19:45:49.2521021Z 2025-05-07T19:45:49.2529586Z libcublas-12.6.4.1 | 256.2 MB | | 0%  2025-05-07T19:45:49.2529994Z 2025-05-07T19:45:49.2530012Z 2025-05-07T19:45:49.2539365Z libcufft-11.3.0.4 | 156.2 MB | 2 | 3%  2025-05-07T19:45:49.2540213Z 2025-05-07T19:45:49.2540257Z 2025-05-07T19:45:49.2540268Z 2025-05-07T19:45:49.2556594Z libcusparse-12.5.4.2 | 118.6 MB | 1 | 2%  2025-05-07T19:45:49.2557504Z 2025-05-07T19:45:49.2557516Z 2025-05-07T19:45:49.2557527Z 2025-05-07T19:45:49.2557538Z 2025-05-07T19:45:49.3522530Z cuda-nsight-12.6.77 | 113.2 MB | 1 | 2%  2025-05-07T19:45:49.3523000Z nsight-compute-2024. | 443.1 MB | | 1% 2025-05-07T19:45:49.3523253Z 2025-05-07T19:45:49.3532588Z libcublas-12.6.4.1 | 256.2 MB | 2 | 3%  2025-05-07T19:45:49.3532864Z 2025-05-07T19:45:49.3532875Z 2025-05-07T19:45:49.3538028Z libcufft-11.3.0.4 | 156.2 MB | 6 | 7%  2025-05-07T19:45:49.3538289Z 2025-05-07T19:45:49.3538299Z 2025-05-07T19:45:49.3538303Z 2025-05-07T19:45:49.3559145Z libcusparse-12.5.4.2 | 118.6 MB | 7 | 7%  2025-05-07T19:45:49.3559686Z 2025-05-07T19:45:49.3559692Z 2025-05-07T19:45:49.3559697Z 2025-05-07T19:45:49.3559700Z 2025-05-07T19:45:49.4526264Z cuda-nsight-12.6.77 | 113.2 MB | 7 | 7%  2025-05-07T19:45:49.4527197Z nsight-compute-2024. | 443.1 MB | 1 | 2% 2025-05-07T19:45:49.4527461Z 2025-05-07T19:45:49.4538926Z libcublas-12.6.4.1 | 256.2 MB | 4 | 5%  2025-05-07T19:45:49.4539196Z 2025-05-07T19:45:49.4539200Z 2025-05-07T19:45:49.4539977Z 2025-05-07T19:45:49.4560445Z libcusparse-12.5.4.2 | 118.6 MB | #1 | 11%  2025-05-07T19:45:49.4561332Z 2025-05-07T19:45:49.4561345Z 2025-05-07T19:45:49.4561376Z 2025-05-07T19:45:49.4561386Z 2025-05-07T19:45:49.4656371Z cuda-nsight-12.6.77 | 113.2 MB | #1 | 12%  2025-05-07T19:45:49.4656660Z 2025-05-07T19:45:49.4656665Z 2025-05-07T19:45:49.5529916Z libcufft-11.3.0.4 | 156.2 MB | # | 10%  2025-05-07T19:45:49.5530534Z nsight-compute-2024. | 443.1 MB | 2 | 3% 2025-05-07T19:45:49.5531878Z 2025-05-07T19:45:49.5541559Z libcublas-12.6.4.1 | 256.2 MB | 6 | 6%  2025-05-07T19:45:49.5541836Z 2025-05-07T19:45:49.5541840Z 2025-05-07T19:45:49.5542969Z 2025-05-07T19:45:49.5561561Z libcusparse-12.5.4.2 | 118.6 MB | #5 | 15%  2025-05-07T19:45:49.5562464Z 2025-05-07T19:45:49.5562478Z 2025-05-07T19:45:49.5562488Z 2025-05-07T19:45:49.5562498Z 2025-05-07T19:45:49.5700741Z cuda-nsight-12.6.77 | 113.2 MB | #5 | 16%  2025-05-07T19:45:49.5701064Z 2025-05-07T19:45:49.5701483Z 2025-05-07T19:45:49.6537081Z libcufft-11.3.0.4 | 156.2 MB | #3 | 14%  2025-05-07T19:45:49.6537367Z 2025-05-07T19:45:49.6540463Z libcublas-12.6.4.1 | 256.2 MB | 8 | 8%  2025-05-07T19:45:49.6544766Z nsight-compute-2024. | 443.1 MB | 3 | 4% 2025-05-07T19:45:49.6545033Z 2025-05-07T19:45:49.6545037Z 2025-05-07T19:45:49.6545040Z 2025-05-07T19:45:49.6562543Z libcusparse-12.5.4.2 | 118.6 MB | #9 | 19%  2025-05-07T19:45:49.6563450Z 2025-05-07T19:45:49.6563463Z 2025-05-07T19:45:49.6563474Z 2025-05-07T19:45:49.6563927Z 2025-05-07T19:45:49.6701684Z cuda-nsight-12.6.77 | 113.2 MB | ## | 20%  2025-05-07T19:45:49.6702598Z 2025-05-07T19:45:49.6702612Z 2025-05-07T19:45:49.7546972Z libcufft-11.3.0.4 | 156.2 MB | #6 | 17%  2025-05-07T19:45:49.7556632Z nsight-compute-2024. | 443.1 MB | 4 | 5% 2025-05-07T19:45:49.7556918Z 2025-05-07T19:45:49.7556927Z 2025-05-07T19:45:49.7559940Z 2025-05-07T19:45:49.7603757Z libcusparse-12.5.4.2 | 118.6 MB | ##3 | 23%  2025-05-07T19:45:49.7604100Z 2025-05-07T19:45:49.7632799Z libcublas-12.6.4.1 | 256.2 MB | 9 | 10%  2025-05-07T19:45:49.7633081Z 2025-05-07T19:45:49.7633085Z 2025-05-07T19:45:49.7633088Z 2025-05-07T19:45:49.7633092Z 2025-05-07T19:45:49.7747715Z cuda-nsight-12.6.77 | 113.2 MB | ##4 | 24%  2025-05-07T19:45:49.7748037Z 2025-05-07T19:45:49.7748054Z 2025-05-07T19:45:49.8547631Z libcufft-11.3.0.4 | 156.2 MB | ## | 20%  2025-05-07T19:45:49.8564591Z nsight-compute-2024. | 443.1 MB | 5 | 6% 2025-05-07T19:45:49.8565401Z 2025-05-07T19:45:49.8565415Z 2025-05-07T19:45:49.8565426Z 2025-05-07T19:45:49.8605083Z libcusparse-12.5.4.2 | 118.6 MB | ##7 | 28%  2025-05-07T19:45:49.8605400Z 2025-05-07T19:45:49.8630919Z libcublas-12.6.4.1 | 256.2 MB | #1 | 12%  2025-05-07T19:45:49.8631218Z 2025-05-07T19:45:49.8631223Z 2025-05-07T19:45:49.8631227Z 2025-05-07T19:45:49.8632334Z 2025-05-07T19:45:49.8747307Z cuda-nsight-12.6.77 | 113.2 MB | ##8 | 29%  2025-05-07T19:45:49.8747619Z 2025-05-07T19:45:49.8747671Z 2025-05-07T19:45:49.9564953Z libcufft-11.3.0.4 | 156.2 MB | ##3 | 24%  2025-05-07T19:45:49.9565244Z 2025-05-07T19:45:49.9565400Z 2025-05-07T19:45:49.9565408Z 2025-05-07T19:45:49.9606188Z libcusparse-12.5.4.2 | 118.6 MB | ###2 | 32%  2025-05-07T19:45:49.9607507Z 2025-05-07T19:45:49.9630970Z libcublas-12.6.4.1 | 256.2 MB | #3 | 14%  2025-05-07T19:45:49.9631251Z 2025-05-07T19:45:49.9631256Z 2025-05-07T19:45:49.9631259Z 2025-05-07T19:45:49.9632391Z 2025-05-07T19:45:49.9637209Z cuda-nsight-12.6.77 | 113.2 MB | ###4 | 34%  2025-05-07T19:45:49.9746846Z nsight-compute-2024. | 443.1 MB | 6 | 7% 2025-05-07T19:45:49.9747209Z 2025-05-07T19:45:49.9747285Z 2025-05-07T19:45:50.0568909Z libcufft-11.3.0.4 | 156.2 MB | ##7 | 27%  2025-05-07T19:45:50.0569230Z 2025-05-07T19:45:50.0569454Z 2025-05-07T19:45:50.0569458Z 2025-05-07T19:45:50.0608455Z libcusparse-12.5.4.2 | 118.6 MB | ###6 | 37%  2025-05-07T19:45:50.0608765Z 2025-05-07T19:45:50.0636824Z libcublas-12.6.4.1 | 256.2 MB | #5 | 15%  2025-05-07T19:45:50.0637116Z 2025-05-07T19:45:50.0637121Z 2025-05-07T19:45:50.0637125Z 2025-05-07T19:45:50.0637129Z 2025-05-07T19:45:50.0747324Z cuda-nsight-12.6.77 | 113.2 MB | ###8 | 39%  2025-05-07T19:45:50.0747806Z nsight-compute-2024. | 443.1 MB | 7 | 8% 2025-05-07T19:45:50.0748064Z 2025-05-07T19:45:50.0748069Z 2025-05-07T19:45:50.1567768Z libcufft-11.3.0.4 | 156.2 MB | ### | 31%  2025-05-07T19:45:50.1568077Z 2025-05-07T19:45:50.1568184Z 2025-05-07T19:45:50.1568193Z 2025-05-07T19:45:50.1608663Z libcusparse-12.5.4.2 | 118.6 MB | ####1 | 41%  2025-05-07T19:45:50.1608966Z 2025-05-07T19:45:50.1637425Z libcublas-12.6.4.1 | 256.2 MB | #7 | 17%  2025-05-07T19:45:50.1637725Z 2025-05-07T19:45:50.1637729Z 2025-05-07T19:45:50.1637733Z 2025-05-07T19:45:50.1638796Z 2025-05-07T19:45:50.1748073Z cuda-nsight-12.6.77 | 113.2 MB | ####4 | 44%  2025-05-07T19:45:50.1748442Z 2025-05-07T19:45:50.1748447Z 2025-05-07T19:45:50.2076928Z libcufft-11.3.0.4 | 156.2 MB | ###4 | 34%  2025-05-07T19:45:50.2612527Z nsight-compute-2024. | 443.1 MB | 8 | 8% 2025-05-07T19:45:50.2613345Z 2025-05-07T19:45:50.2613359Z 2025-05-07T19:45:50.2613371Z 2025-05-07T19:45:50.2614078Z libcusparse-12.5.4.2 | 118.6 MB | ####5 | 46%  2025-05-07T19:45:50.2614372Z 2025-05-07T19:45:50.2638046Z libcublas-12.6.4.1 | 256.2 MB | #9 | 19%  2025-05-07T19:45:50.2638373Z 2025-05-07T19:45:50.2638378Z 2025-05-07T19:45:50.2638382Z 2025-05-07T19:45:50.2638545Z 2025-05-07T19:45:50.2777775Z cuda-nsight-12.6.77 | 113.2 MB | ####9 | 49%  2025-05-07T19:45:50.2778098Z 2025-05-07T19:45:50.2778103Z 2025-05-07T19:45:50.3164799Z libcufft-11.3.0.4 | 156.2 MB | ###7 | 38%  2025-05-07T19:45:50.3611175Z nsight-compute-2024. | 443.1 MB | 9 | 9% 2025-05-07T19:45:50.3611453Z 2025-05-07T19:45:50.3611458Z 2025-05-07T19:45:50.3611463Z 2025-05-07T19:45:50.3614491Z libcusparse-12.5.4.2 | 118.6 MB | ##### | 50%  2025-05-07T19:45:50.3615520Z 2025-05-07T19:45:50.3638625Z libcublas-12.6.4.1 | 256.2 MB | ##1 | 21%  2025-05-07T19:45:50.3638924Z 2025-05-07T19:45:50.3638928Z 2025-05-07T19:45:50.3638940Z 2025-05-07T19:45:50.3638949Z 2025-05-07T19:45:50.3778484Z cuda-nsight-12.6.77 | 113.2 MB | #####4 | 54%  2025-05-07T19:45:50.3778799Z 2025-05-07T19:45:50.3778804Z 2025-05-07T19:45:50.4416763Z libcufft-11.3.0.4 | 156.2 MB | ####1 | 42%  2025-05-07T19:45:50.4612273Z nsight-compute-2024. | 443.1 MB | # | 10% 2025-05-07T19:45:50.4612575Z 2025-05-07T19:45:50.4612581Z 2025-05-07T19:45:50.4612754Z 2025-05-07T19:45:50.4615170Z libcusparse-12.5.4.2 | 118.6 MB | #####4 | 55%  2025-05-07T19:45:50.4616204Z 2025-05-07T19:45:50.4639978Z libcublas-12.6.4.1 | 256.2 MB | ##3 | 23%  2025-05-07T19:45:50.4640249Z 2025-05-07T19:45:50.4640253Z 2025-05-07T19:45:50.4640271Z 2025-05-07T19:45:50.4641147Z 2025-05-07T19:45:50.4781237Z cuda-nsight-12.6.77 | 113.2 MB | #####9 | 60%  2025-05-07T19:45:50.4782451Z 2025-05-07T19:45:50.4782477Z 2025-05-07T19:45:50.5614377Z libcufft-11.3.0.4 | 156.2 MB | ####5 | 46%  2025-05-07T19:45:50.5614668Z 2025-05-07T19:45:50.5614673Z 2025-05-07T19:45:50.5614677Z 2025-05-07T19:45:50.5616433Z libcusparse-12.5.4.2 | 118.6 MB | #####9 | 59%  2025-05-07T19:45:50.5616727Z 2025-05-07T19:45:50.5645726Z libcublas-12.6.4.1 | 256.2 MB | ##5 | 26%  2025-05-07T19:45:50.5646026Z 2025-05-07T19:45:50.5646031Z 2025-05-07T19:45:50.5646034Z 2025-05-07T19:45:50.5646052Z 2025-05-07T19:45:50.5786535Z cuda-nsight-12.6.77 | 113.2 MB | ######5 | 65%  2025-05-07T19:45:50.5786837Z 2025-05-07T19:45:50.5786841Z 2025-05-07T19:45:50.5812495Z libcufft-11.3.0.4 | 156.2 MB | ####9 | 50%  2025-05-07T19:45:50.6615629Z nsight-compute-2024. | 443.1 MB | # | 11% 2025-05-07T19:45:50.6615947Z 2025-05-07T19:45:50.6615968Z 2025-05-07T19:45:50.6615980Z 2025-05-07T19:45:50.6626678Z libcusparse-12.5.4.2 | 118.6 MB | ######4 | 64%  2025-05-07T19:45:50.6628066Z 2025-05-07T19:45:50.6645252Z libcublas-12.6.4.1 | 256.2 MB | ##7 | 28%  2025-05-07T19:45:50.6645534Z 2025-05-07T19:45:50.6645607Z 2025-05-07T19:45:50.6645611Z 2025-05-07T19:45:50.6645681Z 2025-05-07T19:45:50.6825885Z cuda-nsight-12.6.77 | 113.2 MB | ####### | 71%  2025-05-07T19:45:50.6826215Z 2025-05-07T19:45:50.6826220Z 2025-05-07T19:45:50.6927388Z libcufft-11.3.0.4 | 156.2 MB | #####3 | 53%  2025-05-07T19:45:50.7800152Z nsight-compute-2024. | 443.1 MB | #1 | 12% 2025-05-07T19:45:50.7800486Z 2025-05-07T19:45:50.7800716Z 2025-05-07T19:45:50.7800724Z 2025-05-07T19:45:50.7851699Z libcusparse-12.5.4.2 | 118.6 MB | ######8 | 69%  2025-05-07T19:45:50.7852022Z 2025-05-07T19:45:50.7852026Z 2025-05-07T19:45:50.7852030Z 2025-05-07T19:45:50.7852033Z 2025-05-07T19:45:50.7876212Z cuda-nsight-12.6.77 | 113.2 MB | #######5 | 76%  2025-05-07T19:45:50.7878426Z 2025-05-07T19:45:50.7926851Z libcublas-12.6.4.1 | 256.2 MB | ##9 | 30%  2025-05-07T19:45:50.8075018Z nsight-compute-2024. | 443.1 MB | #2 | 12% 2025-05-07T19:45:50.8075313Z 2025-05-07T19:45:50.8075318Z 2025-05-07T19:45:50.8811753Z libcufft-11.3.0.4 | 156.2 MB | #####7 | 57%  2025-05-07T19:45:50.8812580Z 2025-05-07T19:45:50.8812594Z 2025-05-07T19:45:50.8812605Z 2025-05-07T19:45:50.8858893Z libcusparse-12.5.4.2 | 118.6 MB | #######3 | 73%  2025-05-07T19:45:50.8859218Z 2025-05-07T19:45:50.8859222Z 2025-05-07T19:45:50.8859226Z 2025-05-07T19:45:50.8859230Z 2025-05-07T19:45:50.8921155Z cuda-nsight-12.6.77 | 113.2 MB | ######## | 81%  2025-05-07T19:45:50.8921474Z 2025-05-07T19:45:50.9075915Z libcublas-12.6.4.1 | 256.2 MB | ###1 | 32%  2025-05-07T19:45:50.9076701Z 2025-05-07T19:45:50.9076729Z 2025-05-07T19:45:50.9812323Z libcufft-11.3.0.4 | 156.2 MB | ######1 | 61%  2025-05-07T19:45:50.9812613Z 2025-05-07T19:45:50.9812631Z 2025-05-07T19:45:50.9812635Z 2025-05-07T19:45:50.9860853Z libcusparse-12.5.4.2 | 118.6 MB | #######8 | 78%  2025-05-07T19:45:50.9861761Z 2025-05-07T19:45:50.9861775Z 2025-05-07T19:45:50.9861786Z 2025-05-07T19:45:50.9861796Z 2025-05-07T19:45:50.9925421Z cuda-nsight-12.6.77 | 113.2 MB | ########7 | 88%  2025-05-07T19:45:50.9925749Z 2025-05-07T19:45:51.0075657Z libcublas-12.6.4.1 | 256.2 MB | ###4 | 34%  2025-05-07T19:45:51.0075955Z 2025-05-07T19:45:51.0076016Z 2025-05-07T19:45:51.0489988Z libcufft-11.3.0.4 | 156.2 MB | ######6 | 66%  2025-05-07T19:45:51.0879646Z nsight-compute-2024. | 443.1 MB | #3 | 13% 2025-05-07T19:45:51.0879944Z 2025-05-07T19:45:51.0880004Z 2025-05-07T19:45:51.0880008Z 2025-05-07T19:45:51.0880011Z 2025-05-07T19:45:51.0927733Z cuda-nsight-12.6.77 | 113.2 MB | #########3 | 93%  2025-05-07T19:45:51.0929046Z 2025-05-07T19:45:51.0980271Z libcublas-12.6.4.1 | 256.2 MB | ###6 | 36%  2025-05-07T19:45:51.0981499Z 2025-05-07T19:45:51.0981514Z 2025-05-07T19:45:51.0981525Z 2025-05-07T19:45:51.1076039Z libcusparse-12.5.4.2 | 118.6 MB | ########2 | 83%  2025-05-07T19:45:51.1076350Z 2025-05-07T19:45:51.1076355Z 2025-05-07T19:45:51.1879818Z libcufft-11.3.0.4 | 156.2 MB | ####### | 70%  2025-05-07T19:45:51.1880128Z 2025-05-07T19:45:51.1880132Z 2025-05-07T19:45:51.1880136Z 2025-05-07T19:45:51.1880139Z 2025-05-07T19:45:51.1930240Z cuda-nsight-12.6.77 | 113.2 MB | #########9 | 99%  2025-05-07T19:45:51.1931153Z 2025-05-07T19:45:51.1980194Z libcublas-12.6.4.1 | 256.2 MB | ###8 | 39%  2025-05-07T19:45:51.1980482Z 2025-05-07T19:45:51.1980486Z 2025-05-07T19:45:51.1980490Z 2025-05-07T19:45:51.2009454Z libcusparse-12.5.4.2 | 118.6 MB | ########8 | 88%  2025-05-07T19:45:51.2077645Z nsight-compute-2024. | 443.1 MB | #3 | 14% 2025-05-07T19:45:51.2078073Z 2025-05-07T19:45:51.2078119Z 2025-05-07T19:45:51.2929977Z libcufft-11.3.0.4 | 156.2 MB | #######4 | 75%  2025-05-07T19:45:51.2930302Z 2025-05-07T19:45:51.2983367Z libcublas-12.6.4.1 | 256.2 MB | #### | 41%  2025-05-07T19:45:51.2983659Z 2025-05-07T19:45:51.2983663Z 2025-05-07T19:45:51.2985290Z 2025-05-07T19:45:51.3010792Z libcusparse-12.5.4.2 | 118.6 MB | #########3 | 93%  2025-05-07T19:45:51.3079207Z nsight-compute-2024. | 443.1 MB | #4 | 15% 2025-05-07T19:45:51.3079602Z 2025-05-07T19:45:51.3079969Z 2025-05-07T19:45:51.3955642Z libcufft-11.3.0.4 | 156.2 MB | #######8 | 79%  2025-05-07T19:45:51.3955930Z 2025-05-07T19:45:51.3986036Z libcublas-12.6.4.1 | 256.2 MB | ####2 | 43%  2025-05-07T19:45:51.3986333Z 2025-05-07T19:45:51.3986619Z 2025-05-07T19:45:51.3986627Z 2025-05-07T19:45:51.4013804Z libcusparse-12.5.4.2 | 118.6 MB | #########8 | 99%  2025-05-07T19:45:51.4080135Z nsight-compute-2024. | 443.1 MB | #6 | 16% 2025-05-07T19:45:51.4080586Z 2025-05-07T19:45:51.4080637Z 2025-05-07T19:45:51.4956934Z libcufft-11.3.0.4 | 156.2 MB | ########2 | 83%  2025-05-07T19:45:51.4957412Z 2025-05-07T19:45:51.5012153Z libcublas-12.6.4.1 | 256.2 MB | ####5 | 46%  2025-05-07T19:45:51.5173307Z nsight-compute-2024. | 443.1 MB | #7 | 18% 2025-05-07T19:45:51.5173585Z 2025-05-07T19:45:51.5173589Z 2025-05-07T19:45:51.5958254Z libcufft-11.3.0.4 | 156.2 MB | ########7 | 87%  2025-05-07T19:45:51.5958546Z 2025-05-07T19:45:51.6013373Z libcublas-12.6.4.1 | 256.2 MB | ####8 | 49%  2025-05-07T19:45:51.6174221Z nsight-compute-2024. | 443.1 MB | #9 | 20% 2025-05-07T19:45:51.6174499Z 2025-05-07T19:45:51.6174612Z 2025-05-07T19:45:51.6959686Z libcufft-11.3.0.4 | 156.2 MB | #########2 | 92%  2025-05-07T19:45:51.6959991Z 2025-05-07T19:45:51.7061921Z libcublas-12.6.4.1 | 256.2 MB | #####2 | 52%  2025-05-07T19:45:51.7175874Z nsight-compute-2024. | 443.1 MB | ##1 | 21% 2025-05-07T19:45:51.7176170Z 2025-05-07T19:45:51.7176192Z 2025-05-07T19:45:51.7959964Z libcufft-11.3.0.4 | 156.2 MB | #########8 | 98%  2025-05-07T19:45:51.7960278Z 2025-05-07T19:45:51.8064590Z libcublas-12.6.4.1 | 256.2 MB | #####6 | 56%  2025-05-07T19:45:51.8960846Z nsight-compute-2024. | 443.1 MB | ##3 | 23% 2025-05-07T19:45:51.8961279Z 2025-05-07T19:45:51.9066228Z libcublas-12.6.4.1 | 256.2 MB | #####9 | 60%  2025-05-07T19:45:52.0066499Z nsight-compute-2024. | 443.1 MB | ##5 | 25% 2025-05-07T19:45:52.0274441Z nsight-compute-2024. | 443.1 MB | ##7 | 28% 2025-05-07T19:45:52.0274920Z 2025-05-07T19:45:52.1070274Z libcublas-12.6.4.1 | 256.2 MB | ######5 | 65%  2025-05-07T19:45:52.1225988Z nsight-compute-2024. | 443.1 MB | ### | 31% 2025-05-07T19:45:52.1226488Z 2025-05-07T19:45:52.1226555Z 2025-05-07T19:45:52.1226561Z 2025-05-07T19:45:52.1226566Z 2025-05-07T19:45:52.1686613Z cuda-nsight-12.6.77 | 113.2 MB | ########## | 100%  2025-05-07T19:45:52.1687205Z 2025-05-07T19:45:52.1687210Z 2025-05-07T19:45:52.1687214Z 2025-05-07T19:45:52.1687219Z 2025-05-07T19:45:52.1687239Z 2025-05-07T19:45:52.2070352Z cuda-nvvp-12.6.80 | 109.3 MB | | 0%  2025-05-07T19:45:52.2209589Z nsight-compute-2024. | 443.1 MB | ###2 | 33% 2025-05-07T19:45:52.2210095Z 2025-05-07T19:45:52.2693352Z libcublas-12.6.4.1 | 256.2 MB | ######8 | 69%  2025-05-07T19:45:52.2694180Z 2025-05-07T19:45:52.2694194Z 2025-05-07T19:45:52.2694206Z 2025-05-07T19:45:52.2694217Z 2025-05-07T19:45:52.2694227Z 2025-05-07T19:45:52.3299929Z cuda-nvvp-12.6.80 | 109.3 MB | 8 | 8%  2025-05-07T19:45:52.3300806Z 2025-05-07T19:45:52.3300819Z 2025-05-07T19:45:52.3300867Z 2025-05-07T19:45:52.3455424Z libcusparse-12.5.4.2 | 118.6 MB | ########## | 100%  2025-05-07T19:45:52.3577639Z nsight-compute-2024. | 443.1 MB | ###5 | 35% 2025-05-07T19:45:52.3577930Z 2025-05-07T19:45:52.3691778Z libcublas-12.6.4.1 | 256.2 MB | #######1 | 72%  2025-05-07T19:45:52.3692096Z 2025-05-07T19:45:52.3692115Z 2025-05-07T19:45:52.3692128Z 2025-05-07T19:45:52.3692132Z 2025-05-07T19:45:52.3692135Z 2025-05-07T19:45:52.3770072Z cuda-nvvp-12.6.80 | 109.3 MB | #4 | 14%  2025-05-07T19:45:52.3770972Z 2025-05-07T19:45:52.3770986Z 2025-05-07T19:45:52.3770997Z 2025-05-07T19:45:52.3771008Z 2025-05-07T19:45:52.3771042Z 2025-05-07T19:45:52.3771053Z 2025-05-07T19:45:52.4769628Z libcusolver-11.7.1.2 | 95.8 MB | | 0%  2025-05-07T19:45:52.4770067Z 2025-05-07T19:45:52.4770072Z 2025-05-07T19:45:52.4770076Z 2025-05-07T19:45:52.4770080Z 2025-05-07T19:45:52.4770083Z 2025-05-07T19:45:52.4770142Z 2025-05-07T19:45:52.4820889Z libcusolver-11.7.1.2 | 95.8 MB | 5 | 5%  2025-05-07T19:45:52.4821212Z 2025-05-07T19:45:52.4821217Z 2025-05-07T19:45:52.4821221Z 2025-05-07T19:45:52.4821224Z 2025-05-07T19:45:52.4825890Z 2025-05-07T19:45:52.5045362Z cuda-nvvp-12.6.80 | 109.3 MB | #9 | 19%  2025-05-07T19:45:52.5045671Z 2025-05-07T19:45:52.5291020Z libcublas-12.6.4.1 | 256.2 MB | #######4 | 75%  2025-05-07T19:45:52.5773630Z nsight-compute-2024. | 443.1 MB | ###7 | 37% 2025-05-07T19:45:52.5774411Z 2025-05-07T19:45:52.5774426Z 2025-05-07T19:45:52.5774437Z 2025-05-07T19:45:52.5774448Z 2025-05-07T19:45:52.5774458Z 2025-05-07T19:45:52.5774468Z 2025-05-07T19:45:52.5916984Z libcusolver-11.7.1.2 | 95.8 MB | # | 10%  2025-05-07T19:45:52.5917352Z 2025-05-07T19:45:52.5917357Z 2025-05-07T19:45:52.5917361Z 2025-05-07T19:45:52.5917366Z 2025-05-07T19:45:52.5917644Z 2025-05-07T19:45:52.6389753Z cuda-nvvp-12.6.80 | 109.3 MB | ##4 | 25%  2025-05-07T19:45:52.6391258Z 2025-05-07T19:45:52.6771916Z libcublas-12.6.4.1 | 256.2 MB | #######7 | 77%  2025-05-07T19:45:52.6772210Z 2025-05-07T19:45:52.6772215Z 2025-05-07T19:45:52.6772233Z 2025-05-07T19:45:52.6772261Z 2025-05-07T19:45:52.6772266Z 2025-05-07T19:45:52.6772278Z 2025-05-07T19:45:52.6920606Z libcusolver-11.7.1.2 | 95.8 MB | #6 | 16%  2025-05-07T19:45:52.6921523Z 2025-05-07T19:45:52.6921535Z 2025-05-07T19:45:52.6921545Z 2025-05-07T19:45:52.6921555Z 2025-05-07T19:45:52.6921604Z 2025-05-07T19:45:52.6999910Z cuda-nvvp-12.6.80 | 109.3 MB | ##9 | 30%  2025-05-07T19:45:52.7618353Z nsight-compute-2024. | 443.1 MB | ###8 | 39% 2025-05-07T19:45:52.7618632Z 2025-05-07T19:45:52.7772497Z libcublas-12.6.4.1 | 256.2 MB | #######9 | 80%  2025-05-07T19:45:52.7772799Z 2025-05-07T19:45:52.7772804Z 2025-05-07T19:45:52.7772807Z 2025-05-07T19:45:52.7772811Z 2025-05-07T19:45:52.7772814Z 2025-05-07T19:45:52.7772817Z 2025-05-07T19:45:52.7935247Z libcusolver-11.7.1.2 | 95.8 MB | ##1 | 21%  2025-05-07T19:45:52.7935585Z 2025-05-07T19:45:52.7935590Z 2025-05-07T19:45:52.7935594Z 2025-05-07T19:45:52.7935597Z 2025-05-07T19:45:52.7939031Z 2025-05-07T19:45:52.8493732Z cuda-nvvp-12.6.80 | 109.3 MB | ###4 | 35%  2025-05-07T19:45:52.8773149Z nsight-compute-2024. | 443.1 MB | #### | 40% 2025-05-07T19:45:52.8773447Z 2025-05-07T19:45:52.8773451Z 2025-05-07T19:45:52.8773455Z 2025-05-07T19:45:52.8773458Z 2025-05-07T19:45:52.8773462Z 2025-05-07T19:45:52.8773615Z 2025-05-07T19:45:52.8853254Z libcusolver-11.7.1.2 | 95.8 MB | ##7 | 27%  2025-05-07T19:45:52.8853663Z 2025-05-07T19:45:52.8956536Z libcublas-12.6.4.1 | 256.2 MB | ########1 | 82%  2025-05-07T19:45:52.8957352Z 2025-05-07T19:45:52.8957357Z 2025-05-07T19:45:52.8957360Z 2025-05-07T19:45:52.8957364Z 2025-05-07T19:45:52.8957367Z 2025-05-07T19:45:52.9776552Z cuda-nvvp-12.6.80 | 109.3 MB | ###9 | 40%  2025-05-07T19:45:52.9776868Z 2025-05-07T19:45:52.9776873Z 2025-05-07T19:45:52.9776876Z 2025-05-07T19:45:52.9776880Z 2025-05-07T19:45:52.9776884Z 2025-05-07T19:45:52.9777141Z 2025-05-07T19:45:52.9808091Z libcusolver-11.7.1.2 | 95.8 MB | ###2 | 33%  2025-05-07T19:45:52.9963734Z nsight-compute-2024. | 443.1 MB | ####1 | 42% 2025-05-07T19:45:52.9964139Z 2025-05-07T19:45:53.0042178Z libcublas-12.6.4.1 | 256.2 MB | ########3 | 84%  2025-05-07T19:45:53.0042470Z 2025-05-07T19:45:53.0042474Z 2025-05-07T19:45:53.0042478Z 2025-05-07T19:45:53.0042482Z 2025-05-07T19:45:53.0042485Z 2025-05-07T19:45:53.0777615Z cuda-nvvp-12.6.80 | 109.3 MB | ####4 | 44%  2025-05-07T19:45:53.0777940Z 2025-05-07T19:45:53.0777945Z 2025-05-07T19:45:53.0777948Z 2025-05-07T19:45:53.0777952Z 2025-05-07T19:45:53.0777955Z 2025-05-07T19:45:53.0778955Z 2025-05-07T19:45:53.0953083Z libcusolver-11.7.1.2 | 95.8 MB | ###8 | 38%  2025-05-07T19:45:53.1031386Z nsight-compute-2024. | 443.1 MB | ####2 | 43% 2025-05-07T19:45:53.1031674Z 2025-05-07T19:45:53.1087247Z libcublas-12.6.4.1 | 256.2 MB | ########5 | 86%  2025-05-07T19:45:53.1087579Z 2025-05-07T19:45:53.1087583Z 2025-05-07T19:45:53.1087587Z 2025-05-07T19:45:53.1087813Z 2025-05-07T19:45:53.1087819Z 2025-05-07T19:45:53.1818397Z cuda-nvvp-12.6.80 | 109.3 MB | ####9 | 49%  2025-05-07T19:45:53.1818719Z 2025-05-07T19:45:53.1818724Z 2025-05-07T19:45:53.1818727Z 2025-05-07T19:45:53.1818731Z 2025-05-07T19:45:53.1818734Z 2025-05-07T19:45:53.1819212Z 2025-05-07T19:45:53.2037904Z libcusolver-11.7.1.2 | 95.8 MB | ####3 | 43%  2025-05-07T19:45:53.2038238Z 2025-05-07T19:45:53.2137846Z libcublas-12.6.4.1 | 256.2 MB | ########7 | 88%  2025-05-07T19:45:53.2138129Z 2025-05-07T19:45:53.2138134Z 2025-05-07T19:45:53.2138138Z 2025-05-07T19:45:53.2138143Z 2025-05-07T19:45:53.2138241Z 2025-05-07T19:45:53.2166926Z cuda-nvvp-12.6.80 | 109.3 MB | #####4 | 54%  2025-05-07T19:45:53.2852254Z nsight-compute-2024. | 443.1 MB | ####4 | 44% 2025-05-07T19:45:53.2852552Z 2025-05-07T19:45:53.2852573Z 2025-05-07T19:45:53.2852577Z 2025-05-07T19:45:53.2852580Z 2025-05-07T19:45:53.2852584Z 2025-05-07T19:45:53.2852597Z 2025-05-07T19:45:53.3047864Z libcusolver-11.7.1.2 | 95.8 MB | ####8 | 49%  2025-05-07T19:45:53.3048229Z 2025-05-07T19:45:53.3217907Z libcublas-12.6.4.1 | 256.2 MB | ########9 | 90%  2025-05-07T19:45:53.3456643Z nsight-compute-2024. | 443.1 MB | ####5 | 45% 2025-05-07T19:45:53.3456914Z 2025-05-07T19:45:53.3456924Z 2025-05-07T19:45:53.3456928Z 2025-05-07T19:45:53.3456948Z 2025-05-07T19:45:53.3457041Z 2025-05-07T19:45:53.3853738Z cuda-nvvp-12.6.80 | 109.3 MB | #####8 | 59%  2025-05-07T19:45:53.3854069Z 2025-05-07T19:45:53.3854075Z 2025-05-07T19:45:53.3854079Z 2025-05-07T19:45:53.3854083Z 2025-05-07T19:45:53.3854087Z 2025-05-07T19:45:53.3854811Z 2025-05-07T19:45:53.4052570Z libcusolver-11.7.1.2 | 95.8 MB | #####5 | 55%  2025-05-07T19:45:53.4053516Z 2025-05-07T19:45:53.4228267Z libcublas-12.6.4.1 | 256.2 MB | #########2 | 92%  2025-05-07T19:45:53.4661275Z nsight-compute-2024. | 443.1 MB | ####6 | 46% 2025-05-07T19:45:53.4662062Z 2025-05-07T19:45:53.4662077Z 2025-05-07T19:45:53.4662088Z 2025-05-07T19:45:53.4662098Z 2025-05-07T19:45:53.4662108Z 2025-05-07T19:45:53.4855555Z cuda-nvvp-12.6.80 | 109.3 MB | ######3 | 63%  2025-05-07T19:45:53.4856027Z 2025-05-07T19:45:53.4856033Z 2025-05-07T19:45:53.4856037Z 2025-05-07T19:45:53.4856043Z 2025-05-07T19:45:53.4856049Z 2025-05-07T19:45:53.4856064Z 2025-05-07T19:45:53.5054987Z libcusolver-11.7.1.2 | 95.8 MB | ######1 | 61%  2025-05-07T19:45:53.5056507Z 2025-05-07T19:45:53.5227130Z libcublas-12.6.4.1 | 256.2 MB | #########4 | 94%  2025-05-07T19:45:53.5953018Z nsight-compute-2024. | 443.1 MB | ####7 | 48% 2025-05-07T19:45:53.5953373Z 2025-05-07T19:45:53.5953380Z 2025-05-07T19:45:53.5953386Z 2025-05-07T19:45:53.5953392Z 2025-05-07T19:45:53.5953398Z 2025-05-07T19:45:53.6052368Z cuda-nvvp-12.6.80 | 109.3 MB | ######7 | 67%  2025-05-07T19:45:53.6052699Z 2025-05-07T19:45:53.6228299Z libcublas-12.6.4.1 | 256.2 MB | #########6 | 96%  2025-05-07T19:45:53.6343153Z nsight-compute-2024. | 443.1 MB | ####9 | 49% 2025-05-07T19:45:53.6343469Z 2025-05-07T19:45:53.6343474Z 2025-05-07T19:45:53.6343478Z 2025-05-07T19:45:53.6343482Z 2025-05-07T19:45:53.6343485Z 2025-05-07T19:45:53.6343489Z 2025-05-07T19:45:53.6670440Z libcusolver-11.7.1.2 | 95.8 MB | ######7 | 67%  2025-05-07T19:45:53.6670777Z 2025-05-07T19:45:53.6670782Z 2025-05-07T19:45:53.6670785Z 2025-05-07T19:45:53.6670789Z 2025-05-07T19:45:53.6829196Z cuda-nsight-12.6.77 | 113.2 MB | ########## | 100%  2025-05-07T19:45:53.6829678Z 2025-05-07T19:45:53.6829683Z 2025-05-07T19:45:53.6977432Z libcufft-11.3.0.4 | 156.2 MB | ########## | 100%  2025-05-07T19:45:53.6978297Z 2025-05-07T19:45:53.6978302Z 2025-05-07T19:45:53.6978322Z 2025-05-07T19:45:53.6978325Z 2025-05-07T19:45:53.6978334Z 2025-05-07T19:45:53.7055172Z cuda-nvvp-12.6.80 | 109.3 MB | ####### | 71%  2025-05-07T19:45:53.7057686Z 2025-05-07T19:45:53.7231749Z libcublas-12.6.4.1 | 256.2 MB | #########8 | 98%  2025-05-07T19:45:53.7346679Z nsight-compute-2024. | 443.1 MB | ##### | 50% 2025-05-07T19:45:53.7347064Z 2025-05-07T19:45:53.7347135Z 2025-05-07T19:45:53.7347140Z 2025-05-07T19:45:53.7347144Z 2025-05-07T19:45:53.7347163Z 2025-05-07T19:45:53.7347166Z 2025-05-07T19:45:53.7452639Z libcusolver-11.7.1.2 | 95.8 MB | #######3 | 73%  2025-05-07T19:45:53.7452972Z 2025-05-07T19:45:53.7452976Z 2025-05-07T19:45:53.7452980Z 2025-05-07T19:45:53.7452984Z 2025-05-07T19:45:53.7452988Z 2025-05-07T19:45:53.7452991Z 2025-05-07T19:45:53.7453017Z 2025-05-07T19:45:53.8233478Z libnpp-12.3.1.54 | 93.4 MB | | 0%  2025-05-07T19:45:53.8345941Z nsight-compute-2024. | 443.1 MB | #####1 | 52% 2025-05-07T19:45:53.8346350Z 2025-05-07T19:45:53.8346663Z 2025-05-07T19:45:53.8346679Z 2025-05-07T19:45:53.8346701Z 2025-05-07T19:45:53.8346705Z 2025-05-07T19:45:53.8346712Z 2025-05-07T19:45:53.8456235Z libcusolver-11.7.1.2 | 95.8 MB | #######9 | 80%  2025-05-07T19:45:53.8456553Z 2025-05-07T19:45:53.8456569Z 2025-05-07T19:45:53.8456573Z 2025-05-07T19:45:53.8456577Z 2025-05-07T19:45:53.8456580Z 2025-05-07T19:45:53.8456583Z 2025-05-07T19:45:53.8456587Z 2025-05-07T19:45:53.8837873Z libnpp-12.3.1.54 | 93.4 MB | 6 | 7%  2025-05-07T19:45:53.8838183Z 2025-05-07T19:45:53.8838188Z 2025-05-07T19:45:53.8838191Z 2025-05-07T19:45:53.8838195Z 2025-05-07T19:45:53.8838198Z 2025-05-07T19:45:53.9314255Z cuda-nvvp-12.6.80 | 109.3 MB | #######4 | 75%  2025-05-07T19:45:53.9349952Z nsight-compute-2024. | 443.1 MB | #####2 | 53% 2025-05-07T19:45:53.9350319Z 2025-05-07T19:45:53.9350359Z 2025-05-07T19:45:53.9350585Z 2025-05-07T19:45:53.9350595Z 2025-05-07T19:45:53.9350599Z 2025-05-07T19:45:53.9350602Z 2025-05-07T19:45:53.9457565Z libcusolver-11.7.1.2 | 95.8 MB | ########5 | 86%  2025-05-07T19:45:53.9457909Z 2025-05-07T19:45:53.9457914Z 2025-05-07T19:45:53.9457918Z 2025-05-07T19:45:53.9457921Z 2025-05-07T19:45:53.9457924Z 2025-05-07T19:45:53.9457928Z 2025-05-07T19:45:53.9457931Z 2025-05-07T19:45:53.9847099Z libnpp-12.3.1.54 | 93.4 MB | #2 | 13%  2025-05-07T19:45:53.9847992Z 2025-05-07T19:45:53.9848005Z 2025-05-07T19:45:53.9848016Z 2025-05-07T19:45:53.9848026Z 2025-05-07T19:45:53.9848036Z 2025-05-07T19:45:54.0349753Z cuda-nvvp-12.6.80 | 109.3 MB | #######9 | 80%  2025-05-07T19:45:54.0350082Z 2025-05-07T19:45:54.0350086Z 2025-05-07T19:45:54.0350090Z 2025-05-07T19:45:54.0350093Z 2025-05-07T19:45:54.0350097Z 2025-05-07T19:45:54.0350100Z 2025-05-07T19:45:54.0350590Z libcusolver-11.7.1.2 | 95.8 MB | #########1 | 92%  2025-05-07T19:45:54.0461361Z nsight-compute-2024. | 443.1 MB | #####4 | 54% 2025-05-07T19:45:54.0462173Z 2025-05-07T19:45:54.0462188Z 2025-05-07T19:45:54.0462200Z 2025-05-07T19:45:54.0462210Z 2025-05-07T19:45:54.0462243Z 2025-05-07T19:45:54.0462253Z 2025-05-07T19:45:54.0462274Z 2025-05-07T19:45:54.0847762Z libnpp-12.3.1.54 | 93.4 MB | #9 | 19%  2025-05-07T19:45:54.0848073Z 2025-05-07T19:45:54.0848078Z 2025-05-07T19:45:54.0848082Z 2025-05-07T19:45:54.0848085Z 2025-05-07T19:45:54.0848089Z 2025-05-07T19:45:54.1356486Z cuda-nvvp-12.6.80 | 109.3 MB | ########5 | 85%  2025-05-07T19:45:54.1356802Z 2025-05-07T19:45:54.1356807Z 2025-05-07T19:45:54.1356811Z 2025-05-07T19:45:54.1356815Z 2025-05-07T19:45:54.1356819Z 2025-05-07T19:45:54.1356822Z 2025-05-07T19:45:54.1455226Z libcusolver-11.7.1.2 | 95.8 MB | #########7 | 98%  2025-05-07T19:45:54.1468434Z nsight-compute-2024. | 443.1 MB | #####5 | 55% 2025-05-07T19:45:54.1468742Z 2025-05-07T19:45:54.1468803Z 2025-05-07T19:45:54.1468976Z 2025-05-07T19:45:54.1469224Z 2025-05-07T19:45:54.1847195Z 2025-05-07T19:45:54.1847228Z 2025-05-07T19:45:54.1847233Z 2025-05-07T19:45:54.1847654Z libnpp-12.3.1.54 | 93.4 MB | ##5 | 25%  2025-05-07T19:45:54.1847965Z 2025-05-07T19:45:54.1847969Z 2025-05-07T19:45:54.1847972Z 2025-05-07T19:45:54.1847976Z 2025-05-07T19:45:54.1847979Z 2025-05-07T19:45:54.2454075Z cuda-nvvp-12.6.80 | 109.3 MB | ######### | 91%  2025-05-07T19:45:54.2471209Z nsight-compute-2024. | 443.1 MB | #####6 | 57% 2025-05-07T19:45:54.2472049Z 2025-05-07T19:45:54.2472063Z 2025-05-07T19:45:54.2472074Z 2025-05-07T19:45:54.2472084Z 2025-05-07T19:45:54.2472095Z 2025-05-07T19:45:54.2472105Z 2025-05-07T19:45:54.2472116Z 2025-05-07T19:45:54.2847835Z libnpp-12.3.1.54 | 93.4 MB | ###2 | 32%  2025-05-07T19:45:54.2848166Z 2025-05-07T19:45:54.2848183Z 2025-05-07T19:45:54.2848187Z 2025-05-07T19:45:54.2848191Z 2025-05-07T19:45:54.2848195Z 2025-05-07T19:45:54.3455335Z cuda-nvvp-12.6.80 | 109.3 MB | #########7 | 97%  2025-05-07T19:45:54.3829399Z nsight-compute-2024. | 443.1 MB | #####8 | 59% 2025-05-07T19:45:54.3829780Z 2025-05-07T19:45:54.3829854Z 2025-05-07T19:45:54.3829857Z 2025-05-07T19:45:54.3829876Z 2025-05-07T19:45:54.3829880Z 2025-05-07T19:45:54.3829884Z 2025-05-07T19:45:54.3829887Z 2025-05-07T19:45:54.4459423Z libnpp-12.3.1.54 | 93.4 MB | ###8 | 39%  2025-05-07T19:45:54.4829947Z nsight-compute-2024. | 443.1 MB | ###### | 60% 2025-05-07T19:45:54.4830286Z 2025-05-07T19:45:54.4830439Z 2025-05-07T19:45:54.4830446Z 2025-05-07T19:45:54.4830454Z 2025-05-07T19:45:54.4830458Z 2025-05-07T19:45:54.4830461Z 2025-05-07T19:45:54.4830488Z 2025-05-07T19:45:54.5460242Z libnpp-12.3.1.54 | 93.4 MB | ####7 | 47%  2025-05-07T19:45:54.5924009Z nsight-compute-2024. | 443.1 MB | ######1 | 62% 2025-05-07T19:45:54.5924512Z 2025-05-07T19:45:54.5924519Z 2025-05-07T19:45:54.5924539Z 2025-05-07T19:45:54.5924545Z 2025-05-07T19:45:54.5924549Z 2025-05-07T19:45:54.5924554Z 2025-05-07T19:45:54.5924558Z 2025-05-07T19:45:54.6852396Z libnpp-12.3.1.54 | 93.4 MB | #####3 | 54%  2025-05-07T19:45:54.6926049Z nsight-compute-2024. | 443.1 MB | ######3 | 63% 2025-05-07T19:45:54.6926305Z 2025-05-07T19:45:54.6926309Z 2025-05-07T19:45:54.6926313Z 2025-05-07T19:45:54.6926380Z 2025-05-07T19:45:54.6926487Z 2025-05-07T19:45:54.6926492Z 2025-05-07T19:45:54.6926500Z 2025-05-07T19:45:54.7852693Z libnpp-12.3.1.54 | 93.4 MB | ######1 | 62%  2025-05-07T19:45:54.8180900Z nsight-compute-2024. | 443.1 MB | ######5 | 66% 2025-05-07T19:45:54.8181194Z 2025-05-07T19:45:54.8181200Z 2025-05-07T19:45:54.8181204Z 2025-05-07T19:45:54.8181209Z 2025-05-07T19:45:54.8181214Z 2025-05-07T19:45:54.8181220Z 2025-05-07T19:45:54.8181246Z 2025-05-07T19:45:54.8882073Z libnpp-12.3.1.54 | 93.4 MB | ######8 | 69%  2025-05-07T19:45:55.0012306Z nsight-compute-2024. | 443.1 MB | ######7 | 67% 2025-05-07T19:45:55.0252387Z nsight-compute-2024. | 443.1 MB | ######9 | 69% 2025-05-07T19:45:55.0252685Z 2025-05-07T19:45:55.0252690Z 2025-05-07T19:45:55.0252693Z 2025-05-07T19:45:55.0252698Z 2025-05-07T19:45:55.0252702Z 2025-05-07T19:45:55.0252706Z 2025-05-07T19:45:55.0252797Z 2025-05-07T19:45:55.1066998Z libnpp-12.3.1.54 | 93.4 MB | #######5 | 75%  2025-05-07T19:45:55.1253239Z nsight-compute-2024. | 443.1 MB | #######1 | 71% 2025-05-07T19:45:55.1253527Z 2025-05-07T19:45:55.1253533Z 2025-05-07T19:45:55.1253538Z 2025-05-07T19:45:55.1253542Z 2025-05-07T19:45:55.1253546Z 2025-05-07T19:45:55.1253551Z 2025-05-07T19:45:55.1253560Z 2025-05-07T19:45:55.1467922Z libnpp-12.3.1.54 | 93.4 MB | ########5 | 85%  2025-05-07T19:45:55.1468263Z 2025-05-07T19:45:55.1468268Z 2025-05-07T19:45:55.1468272Z 2025-05-07T19:45:55.1468276Z 2025-05-07T19:45:55.1468553Z 2025-05-07T19:45:55.1468879Z 2025-05-07T19:45:55.1865394Z libcusolver-11.7.1.2 | 95.8 MB | ########## | 100%  2025-05-07T19:45:55.1865959Z 2025-05-07T19:45:55.1866018Z 2025-05-07T19:45:55.1866024Z 2025-05-07T19:45:55.1866027Z 2025-05-07T19:45:55.1866031Z 2025-05-07T19:45:55.1866035Z 2025-05-07T19:45:55.1866038Z 2025-05-07T19:45:55.1866042Z 2025-05-07T19:45:55.2100424Z cuda-nvdisasm-12.6.7 | 47.6 MB | | 0%  2025-05-07T19:45:55.2293665Z nsight-compute-2024. | 443.1 MB | #######2 | 73% 2025-05-07T19:45:55.2293937Z 2025-05-07T19:45:55.2293942Z 2025-05-07T19:45:55.2293945Z 2025-05-07T19:45:55.2293949Z 2025-05-07T19:45:55.2293952Z 2025-05-07T19:45:55.2293956Z 2025-05-07T19:45:55.2293965Z 2025-05-07T19:45:55.2867069Z libnpp-12.3.1.54 | 93.4 MB | #########2 | 92%  2025-05-07T19:45:55.2867432Z 2025-05-07T19:45:55.2867437Z 2025-05-07T19:45:55.2867441Z 2025-05-07T19:45:55.2867445Z 2025-05-07T19:45:55.2867463Z 2025-05-07T19:45:55.2867467Z 2025-05-07T19:45:55.2867470Z 2025-05-07T19:45:55.2867474Z 2025-05-07T19:45:55.3294471Z cuda-nvdisasm-12.6.7 | 47.6 MB | #3 | 13%  2025-05-07T19:45:55.3294804Z 2025-05-07T19:45:55.3294809Z 2025-05-07T19:45:55.3294813Z 2025-05-07T19:45:55.3294816Z 2025-05-07T19:45:55.3294819Z 2025-05-07T19:45:55.3294823Z 2025-05-07T19:45:55.3294827Z 2025-05-07T19:45:55.3359312Z libnpp-12.3.1.54 | 93.4 MB | #########9 | 99%  2025-05-07T19:45:55.3868712Z nsight-compute-2024. | 443.1 MB | #######4 | 75% 2025-05-07T19:45:55.3869019Z 2025-05-07T19:45:55.3869024Z 2025-05-07T19:45:55.3869027Z 2025-05-07T19:45:55.3869031Z 2025-05-07T19:45:55.3869034Z 2025-05-07T19:45:55.3869037Z 2025-05-07T19:45:55.3869041Z 2025-05-07T19:45:55.3869044Z 2025-05-07T19:45:55.4358757Z cuda-nvdisasm-12.6.7 | 47.6 MB | ##7 | 28%  2025-05-07T19:45:55.4868963Z nsight-compute-2024. | 443.1 MB | #######6 | 76% 2025-05-07T19:45:55.4869484Z 2025-05-07T19:45:55.4988943Z 2025-05-07T19:45:55.4988950Z 2025-05-07T19:45:55.4988955Z 2025-05-07T19:45:55.4988960Z 2025-05-07T19:45:55.4988965Z 2025-05-07T19:45:55.4988970Z 2025-05-07T19:45:55.4988974Z 2025-05-07T19:45:55.4989443Z cuda-nvdisasm-12.6.7 | 47.6 MB | ####4 | 44%  2025-05-07T19:45:55.4989774Z 2025-05-07T19:45:55.4989779Z 2025-05-07T19:45:55.4989782Z 2025-05-07T19:45:55.4989786Z 2025-05-07T19:45:55.4989790Z 2025-05-07T19:45:55.5462515Z cuda-nvvp-12.6.80 | 109.3 MB | ########## | 100%  2025-05-07T19:45:55.5463382Z 2025-05-07T19:45:55.5463397Z 2025-05-07T19:45:55.5463408Z 2025-05-07T19:45:55.5463419Z 2025-05-07T19:45:55.5463429Z 2025-05-07T19:45:55.5463439Z 2025-05-07T19:45:55.5463449Z 2025-05-07T19:45:55.5463472Z 2025-05-07T19:45:55.5463483Z 2025-05-07T19:45:55.5499747Z libcurand-10.3.7.77 | 39.9 MB | | 0%  2025-05-07T19:45:55.5869026Z nsight-compute-2024. | 443.1 MB | #######7 | 78% 2025-05-07T19:45:55.5869407Z 2025-05-07T19:45:55.5869560Z 2025-05-07T19:45:55.5869569Z 2025-05-07T19:45:55.5869602Z 2025-05-07T19:45:55.5869607Z 2025-05-07T19:45:55.5869612Z 2025-05-07T19:45:55.5869617Z 2025-05-07T19:45:55.5869621Z 2025-05-07T19:45:55.6460752Z cuda-nvdisasm-12.6.7 | 47.6 MB | ###### | 60%  2025-05-07T19:45:55.6461089Z 2025-05-07T19:45:55.6461093Z 2025-05-07T19:45:55.6461097Z 2025-05-07T19:45:55.6461113Z 2025-05-07T19:45:55.6461117Z 2025-05-07T19:45:55.6461120Z 2025-05-07T19:45:55.6461123Z 2025-05-07T19:45:55.6461127Z 2025-05-07T19:45:55.6461131Z 2025-05-07T19:45:55.6652312Z libcurand-10.3.7.77 | 39.9 MB | #6 | 16%  2025-05-07T19:45:55.6931128Z nsight-compute-2024. | 443.1 MB | #######9 | 80% 2025-05-07T19:45:55.6931422Z 2025-05-07T19:45:55.6931439Z 2025-05-07T19:45:55.6931443Z 2025-05-07T19:45:55.6931447Z 2025-05-07T19:45:55.6931451Z 2025-05-07T19:45:55.6931659Z 2025-05-07T19:45:55.6931665Z 2025-05-07T19:45:55.6931668Z 2025-05-07T19:45:55.7461502Z cuda-nvdisasm-12.6.7 | 47.6 MB | #######4 | 74%  2025-05-07T19:45:55.7461839Z 2025-05-07T19:45:55.7461843Z 2025-05-07T19:45:55.7461847Z 2025-05-07T19:45:55.7461850Z 2025-05-07T19:45:55.7461854Z 2025-05-07T19:45:55.7461857Z 2025-05-07T19:45:55.7461860Z 2025-05-07T19:45:55.7461864Z 2025-05-07T19:45:55.7461867Z 2025-05-07T19:45:55.7806577Z libcurand-10.3.7.77 | 39.9 MB | ###2 | 33%  2025-05-07T19:45:55.7977717Z nsight-compute-2024. | 443.1 MB | ########1 | 81% 2025-05-07T19:45:55.7978044Z 2025-05-07T19:45:55.7978048Z 2025-05-07T19:45:55.7978052Z 2025-05-07T19:45:55.7978056Z 2025-05-07T19:45:55.7978060Z 2025-05-07T19:45:55.7978063Z 2025-05-07T19:45:55.7978067Z 2025-05-07T19:45:55.7978070Z 2025-05-07T19:45:55.8216297Z cuda-nvdisasm-12.6.7 | 47.6 MB | ########7 | 88%  2025-05-07T19:45:55.8216647Z 2025-05-07T19:45:55.8216664Z 2025-05-07T19:45:55.8216668Z 2025-05-07T19:45:55.8463614Z libcusparse-12.5.4.2 | 118.6 MB | ########## | 100%  2025-05-07T19:45:55.8463924Z 2025-05-07T19:45:55.8464039Z 2025-05-07T19:45:55.8464047Z 2025-05-07T19:45:55.8464052Z 2025-05-07T19:45:55.8464057Z 2025-05-07T19:45:55.8464061Z 2025-05-07T19:45:55.8464066Z 2025-05-07T19:45:55.8464070Z 2025-05-07T19:45:55.8464242Z 2025-05-07T19:45:55.9001423Z libcurand-10.3.7.77 | 39.9 MB | ####8 | 48%  2025-05-07T19:45:55.9465299Z nsight-compute-2024. | 443.1 MB | ########2 | 83% 2025-05-07T19:45:55.9465805Z 2025-05-07T19:45:55.9465822Z 2025-05-07T19:45:55.9465827Z 2025-05-07T19:45:55.9465831Z 2025-05-07T19:45:55.9465863Z 2025-05-07T19:45:55.9465868Z 2025-05-07T19:45:55.9465872Z 2025-05-07T19:45:55.9465877Z 2025-05-07T19:45:55.9465881Z 2025-05-07T19:45:56.0004075Z libcurand-10.3.7.77 | 39.9 MB | ######5 | 66%  2025-05-07T19:45:56.0468896Z nsight-compute-2024. | 443.1 MB | ########4 | 84% 2025-05-07T19:45:56.0469376Z 2025-05-07T19:45:56.0469392Z 2025-05-07T19:45:56.0469397Z 2025-05-07T19:45:56.0469402Z 2025-05-07T19:45:56.0469406Z 2025-05-07T19:45:56.0469411Z 2025-05-07T19:45:56.0469437Z 2025-05-07T19:45:56.0469443Z 2025-05-07T19:45:56.0469448Z 2025-05-07T19:45:56.1008871Z libcurand-10.3.7.77 | 39.9 MB | ########4 | 85%  2025-05-07T19:45:56.2007529Z nsight-compute-2024. | 443.1 MB | ########5 | 86% 2025-05-07T19:45:56.3427114Z nsight-compute-2024. | 443.1 MB | ########7 | 88% 2025-05-07T19:45:56.4714518Z nsight-compute-2024. | 443.1 MB | ########9 | 90% 2025-05-07T19:45:56.5028086Z nsight-compute-2024. | 443.1 MB | #########1 | 91% 2025-05-07T19:45:56.5029332Z 2025-05-07T19:45:56.5029338Z 2025-05-07T19:45:56.5029342Z 2025-05-07T19:45:56.5029346Z 2025-05-07T19:45:56.5029365Z 2025-05-07T19:45:56.5029370Z 2025-05-07T19:45:56.5029373Z 2025-05-07T19:45:56.5029377Z 2025-05-07T19:45:56.5188589Z cuda-nvdisasm-12.6.7 | 47.6 MB | ########## | 100%  2025-05-07T19:45:56.5189145Z 2025-05-07T19:45:56.5189174Z 2025-05-07T19:45:56.5481253Z libcufft-11.3.0.4 | 156.2 MB | ########## | 100%  2025-05-07T19:45:56.5481625Z 2025-05-07T19:45:56.5481631Z 2025-05-07T19:45:56.5481636Z 2025-05-07T19:45:56.5481643Z 2025-05-07T19:45:56.5481648Z 2025-05-07T19:45:56.5481653Z 2025-05-07T19:45:56.5481660Z 2025-05-07T19:45:56.5481683Z 2025-05-07T19:45:56.5481689Z 2025-05-07T19:45:56.5481697Z 2025-05-07T19:45:56.6111392Z gds-tools-1.11.1.6 | 37.8 MB | | 0%  2025-05-07T19:45:56.6111723Z 2025-05-07T19:45:56.6111727Z 2025-05-07T19:45:56.6111731Z 2025-05-07T19:45:56.6111734Z 2025-05-07T19:45:56.6111738Z 2025-05-07T19:45:56.6111741Z 2025-05-07T19:45:56.6111745Z 2025-05-07T19:45:56.6111761Z 2025-05-07T19:45:56.6111778Z 2025-05-07T19:45:56.6237666Z libcurand-10.3.7.77 | 39.9 MB | ########## | 100%  2025-05-07T19:45:56.6483117Z nsight-compute-2024. | 443.1 MB | #########2 | 93% 2025-05-07T19:45:56.6483412Z 2025-05-07T19:45:56.6483563Z 2025-05-07T19:45:56.6483570Z 2025-05-07T19:45:56.6483573Z 2025-05-07T19:45:56.6483577Z 2025-05-07T19:45:56.6483581Z 2025-05-07T19:45:56.6483585Z 2025-05-07T19:45:56.6483588Z 2025-05-07T19:45:56.6483592Z 2025-05-07T19:45:56.6483595Z 2025-05-07T19:45:56.6557367Z gds-tools-1.11.1.6 | 37.8 MB | ##2 | 23%  2025-05-07T19:45:56.6557792Z 2025-05-07T19:45:56.6557796Z 2025-05-07T19:45:56.6557799Z 2025-05-07T19:45:56.6557803Z 2025-05-07T19:45:56.6557807Z 2025-05-07T19:45:56.6557810Z 2025-05-07T19:45:56.6557814Z 2025-05-07T19:45:56.6557817Z 2025-05-07T19:45:56.6557820Z 2025-05-07T19:45:56.6557824Z 2025-05-07T19:45:56.6557827Z 2025-05-07T19:45:56.6934855Z cuda-nvcc-tools-12.6 | 23.0 MB | | 0%  2025-05-07T19:45:56.6935199Z 2025-05-07T19:45:56.6935203Z 2025-05-07T19:45:56.6935215Z 2025-05-07T19:45:56.6935218Z 2025-05-07T19:45:56.6935222Z 2025-05-07T19:45:56.6935225Z 2025-05-07T19:45:56.6935234Z 2025-05-07T19:45:56.7388636Z libnpp-12.3.1.54 | 93.4 MB | ########## | 100%  2025-05-07T19:45:56.7388929Z 2025-05-07T19:45:56.7388933Z 2025-05-07T19:45:56.7388936Z 2025-05-07T19:45:56.7388940Z 2025-05-07T19:45:56.7388943Z 2025-05-07T19:45:56.7388947Z 2025-05-07T19:45:56.7388950Z 2025-05-07T19:45:56.7388954Z 2025-05-07T19:45:56.7388957Z 2025-05-07T19:45:56.7388960Z 2025-05-07T19:45:56.7388963Z 2025-05-07T19:45:56.7388967Z 2025-05-07T19:45:56.7478607Z cuda-nvrtc-12.6.85 | 17.3 MB | | 0%  2025-05-07T19:45:56.7483360Z nsight-compute-2024. | 443.1 MB | #########3 | 94% 2025-05-07T19:45:56.7484481Z 2025-05-07T19:45:56.7484492Z 2025-05-07T19:45:56.7484503Z 2025-05-07T19:45:56.7484942Z 2025-05-07T19:45:56.7484962Z 2025-05-07T19:45:56.7484978Z 2025-05-07T19:45:56.7485012Z 2025-05-07T19:45:56.7485050Z 2025-05-07T19:45:56.7485072Z 2025-05-07T19:45:56.7485094Z 2025-05-07T19:45:56.7564723Z gds-tools-1.11.1.6 | 37.8 MB | ###9 | 39%  2025-05-07T19:45:56.7565191Z 2025-05-07T19:45:56.7565233Z 2025-05-07T19:45:56.7565241Z 2025-05-07T19:45:56.7565248Z 2025-05-07T19:45:56.7565256Z 2025-05-07T19:45:56.7565261Z 2025-05-07T19:45:56.7565266Z 2025-05-07T19:45:56.7565271Z 2025-05-07T19:45:56.7565277Z 2025-05-07T19:45:56.7565283Z 2025-05-07T19:45:56.7565288Z 2025-05-07T19:45:56.8391292Z cuda-nvcc-tools-12.6 | 23.0 MB | ##3 | 24%  2025-05-07T19:45:56.8391633Z 2025-05-07T19:45:56.8391637Z 2025-05-07T19:45:56.8391641Z 2025-05-07T19:45:56.8391644Z 2025-05-07T19:45:56.8391647Z 2025-05-07T19:45:56.8391651Z 2025-05-07T19:45:56.8391655Z 2025-05-07T19:45:56.8391658Z 2025-05-07T19:45:56.8391661Z 2025-05-07T19:45:56.8391680Z 2025-05-07T19:45:56.8391684Z 2025-05-07T19:45:56.8391687Z 2025-05-07T19:45:56.8567204Z cuda-nvrtc-12.6.85 | 17.3 MB | ###1 | 31%  2025-05-07T19:45:56.8567544Z 2025-05-07T19:45:56.8567551Z 2025-05-07T19:45:56.8567557Z 2025-05-07T19:45:56.8567562Z 2025-05-07T19:45:56.8567568Z 2025-05-07T19:45:56.8567573Z 2025-05-07T19:45:56.8567581Z 2025-05-07T19:45:56.8567587Z 2025-05-07T19:45:56.8567590Z 2025-05-07T19:45:56.8567594Z 2025-05-07T19:45:56.8567597Z 2025-05-07T19:45:56.8610548Z cuda-nvcc-tools-12.6 | 23.0 MB | ####5 | 46%  2025-05-07T19:45:56.8610864Z 2025-05-07T19:45:56.8610936Z 2025-05-07T19:45:56.8610997Z 2025-05-07T19:45:56.8611000Z 2025-05-07T19:45:56.8611014Z 2025-05-07T19:45:56.8611017Z 2025-05-07T19:45:56.8611126Z 2025-05-07T19:45:56.8611134Z 2025-05-07T19:45:56.8611141Z 2025-05-07T19:45:56.8611146Z 2025-05-07T19:45:56.8635121Z gds-tools-1.11.1.6 | 37.8 MB | #####3 | 54%  2025-05-07T19:45:56.9392209Z nsight-compute-2024. | 443.1 MB | #########5 | 95% 2025-05-07T19:45:56.9392667Z 2025-05-07T19:45:56.9392672Z 2025-05-07T19:45:56.9392676Z 2025-05-07T19:45:56.9392679Z 2025-05-07T19:45:56.9392682Z 2025-05-07T19:45:56.9392687Z 2025-05-07T19:45:56.9392690Z 2025-05-07T19:45:56.9392694Z 2025-05-07T19:45:56.9392698Z 2025-05-07T19:45:56.9392702Z 2025-05-07T19:45:56.9392705Z 2025-05-07T19:45:56.9392795Z 2025-05-07T19:45:56.9568508Z cuda-nvrtc-12.6.85 | 17.3 MB | ######6 | 67%  2025-05-07T19:45:56.9568864Z 2025-05-07T19:45:56.9568869Z 2025-05-07T19:45:56.9568873Z 2025-05-07T19:45:56.9568876Z 2025-05-07T19:45:56.9568880Z 2025-05-07T19:45:56.9568884Z 2025-05-07T19:45:56.9568887Z 2025-05-07T19:45:56.9568890Z 2025-05-07T19:45:56.9568894Z 2025-05-07T19:45:56.9568897Z 2025-05-07T19:45:56.9568900Z 2025-05-07T19:45:56.9613098Z cuda-nvcc-tools-12.6 | 23.0 MB | #######2 | 72%  2025-05-07T19:45:56.9613470Z 2025-05-07T19:45:56.9613475Z 2025-05-07T19:45:56.9613478Z 2025-05-07T19:45:56.9613491Z 2025-05-07T19:45:56.9613495Z 2025-05-07T19:45:56.9613498Z 2025-05-07T19:45:56.9613502Z 2025-05-07T19:45:56.9613505Z 2025-05-07T19:45:56.9613508Z 2025-05-07T19:45:56.9613512Z 2025-05-07T19:45:57.0034591Z gds-tools-1.11.1.6 | 37.8 MB | ######8 | 69%  2025-05-07T19:45:57.0394030Z nsight-compute-2024. | 443.1 MB | #########6 | 96% 2025-05-07T19:45:57.0395025Z 2025-05-07T19:45:57.0395039Z 2025-05-07T19:45:57.0395049Z 2025-05-07T19:45:57.0395060Z 2025-05-07T19:45:57.0395070Z 2025-05-07T19:45:57.0395081Z 2025-05-07T19:45:57.0395091Z 2025-05-07T19:45:57.0395102Z 2025-05-07T19:45:57.0395111Z 2025-05-07T19:45:57.0395122Z 2025-05-07T19:45:57.0395143Z 2025-05-07T19:45:57.0395154Z 2025-05-07T19:45:57.0567494Z cuda-nvrtc-12.6.85 | 17.3 MB | #########6 | 96%  2025-05-07T19:45:57.0568036Z 2025-05-07T19:45:57.0568043Z 2025-05-07T19:45:57.0568049Z 2025-05-07T19:45:57.0568055Z 2025-05-07T19:45:57.0568072Z 2025-05-07T19:45:57.0568076Z 2025-05-07T19:45:57.0568080Z 2025-05-07T19:45:57.0568083Z 2025-05-07T19:45:57.0568086Z 2025-05-07T19:45:57.0568089Z 2025-05-07T19:45:57.0568092Z 2025-05-07T19:45:57.0688679Z cuda-nvcc-tools-12.6 | 23.0 MB | #########5 | 95%  2025-05-07T19:45:57.0689007Z 2025-05-07T19:45:57.0689013Z 2025-05-07T19:45:57.0689019Z 2025-05-07T19:45:57.0689024Z 2025-05-07T19:45:57.0689031Z 2025-05-07T19:45:57.0689036Z 2025-05-07T19:45:57.0689041Z 2025-05-07T19:45:57.0689046Z 2025-05-07T19:45:57.0689051Z 2025-05-07T19:45:57.0689056Z 2025-05-07T19:45:57.1037438Z gds-tools-1.11.1.6 | 37.8 MB | ########2 | 83%  2025-05-07T19:45:57.2039094Z nsight-compute-2024. | 443.1 MB | #########7 | 97% 2025-05-07T19:45:57.2068940Z nsight-compute-2024. | 443.1 MB | #########9 | 99% 2025-05-07T19:45:57.2069281Z 2025-05-07T19:45:57.2069285Z 2025-05-07T19:45:57.2069289Z 2025-05-07T19:45:57.2069293Z 2025-05-07T19:45:57.2069305Z 2025-05-07T19:45:57.2069309Z 2025-05-07T19:45:57.2069312Z 2025-05-07T19:45:57.2069316Z 2025-05-07T19:45:57.3069621Z cuda-nvdisasm-12.6.7 | 47.6 MB | ########## | 100%  2025-05-07T19:45:57.3069978Z 2025-05-07T19:45:57.3069982Z 2025-05-07T19:45:57.3069986Z 2025-05-07T19:45:57.3069990Z 2025-05-07T19:45:57.3069993Z 2025-05-07T19:45:57.3069997Z 2025-05-07T19:45:57.3070000Z 2025-05-07T19:45:57.3070003Z 2025-05-07T19:45:57.3070007Z 2025-05-07T19:45:57.3070010Z 2025-05-07T19:45:57.3070013Z 2025-05-07T19:45:57.3070016Z 2025-05-07T19:45:57.3398944Z cuda-nvrtc-12.6.85 | 17.3 MB | ########## | 100%  2025-05-07T19:45:57.3399286Z 2025-05-07T19:45:57.3399291Z 2025-05-07T19:45:57.3399295Z 2025-05-07T19:45:57.3399299Z 2025-05-07T19:45:57.3399302Z 2025-05-07T19:45:57.3399306Z 2025-05-07T19:45:57.3399309Z 2025-05-07T19:45:57.3399326Z 2025-05-07T19:45:57.3399329Z 2025-05-07T19:45:57.3399333Z 2025-05-07T19:45:57.3399336Z 2025-05-07T19:45:57.3399535Z 2025-05-07T19:45:57.3399540Z 2025-05-07T19:45:57.3844880Z libnvjitlink-12.6.85 | 14.9 MB | | 0%  2025-05-07T19:45:57.3845240Z 2025-05-07T19:45:57.3875962Z libcublas-12.6.4.1 | 256.2 MB | ########## | 100%  2025-05-07T19:45:57.3876255Z 2025-05-07T19:45:57.3876488Z 2025-05-07T19:45:57.3876496Z 2025-05-07T19:45:57.3876502Z 2025-05-07T19:45:57.3876506Z 2025-05-07T19:45:57.3876511Z 2025-05-07T19:45:57.3876516Z 2025-05-07T19:45:57.3876522Z 2025-05-07T19:45:57.3876527Z 2025-05-07T19:45:57.3876531Z 2025-05-07T19:45:57.3876536Z 2025-05-07T19:45:57.4292527Z cuda-nvcc-tools-12.6 | 23.0 MB | ########## | 100%  2025-05-07T19:45:57.4292886Z 2025-05-07T19:45:57.4292891Z 2025-05-07T19:45:57.4292895Z 2025-05-07T19:45:57.4292899Z 2025-05-07T19:45:57.4292902Z 2025-05-07T19:45:57.4292917Z 2025-05-07T19:45:57.4292921Z 2025-05-07T19:45:57.4292924Z 2025-05-07T19:45:57.4292928Z 2025-05-07T19:45:57.4292936Z 2025-05-07T19:45:57.4292940Z 2025-05-07T19:45:57.4292943Z 2025-05-07T19:45:57.4292947Z 2025-05-07T19:45:57.4292950Z 2025-05-07T19:45:57.4401348Z cuda-nvcc-dev_linux- | 10.8 MB | | 0%  2025-05-07T19:45:57.4402395Z 2025-05-07T19:45:57.4402408Z 2025-05-07T19:45:57.4402419Z 2025-05-07T19:45:57.4402429Z 2025-05-07T19:45:57.4402439Z 2025-05-07T19:45:57.4402450Z 2025-05-07T19:45:57.4402461Z 2025-05-07T19:45:57.4402471Z 2025-05-07T19:45:57.4402481Z 2025-05-07T19:45:57.4402491Z 2025-05-07T19:45:57.4402501Z 2025-05-07T19:45:57.4402511Z 2025-05-07T19:45:57.4402545Z 2025-05-07T19:45:57.4403421Z libnvjitlink-12.6.85 | 14.9 MB | ######6 | 67%  2025-05-07T19:45:57.4404364Z 2025-05-07T19:45:57.4404374Z 2025-05-07T19:45:57.4404384Z 2025-05-07T19:45:57.4404395Z 2025-05-07T19:45:57.4404807Z 2025-05-07T19:45:57.4404818Z 2025-05-07T19:45:57.4404829Z 2025-05-07T19:45:57.4404839Z 2025-05-07T19:45:57.4404866Z 2025-05-07T19:45:57.4404895Z 2025-05-07T19:45:57.4404906Z 2025-05-07T19:45:57.4404917Z 2025-05-07T19:45:57.4404927Z 2025-05-07T19:45:57.4404937Z 2025-05-07T19:45:57.4404947Z 2025-05-07T19:45:57.5293608Z cuda-nvvm-tools-12.6 | 10.4 MB | | 0%  2025-05-07T19:45:57.5293977Z 2025-05-07T19:45:57.5293981Z 2025-05-07T19:45:57.5293984Z 2025-05-07T19:45:57.5293988Z 2025-05-07T19:45:57.5293991Z 2025-05-07T19:45:57.5293995Z 2025-05-07T19:45:57.5293998Z 2025-05-07T19:45:57.5294002Z 2025-05-07T19:45:57.5294006Z 2025-05-07T19:45:57.5294010Z 2025-05-07T19:45:57.5294013Z 2025-05-07T19:45:57.5294017Z 2025-05-07T19:45:57.5294021Z 2025-05-07T19:45:57.5294025Z 2025-05-07T19:45:57.5952475Z cuda-nvcc-dev_linux- | 10.8 MB | #######4 | 74%  2025-05-07T19:45:57.5952835Z 2025-05-07T19:45:57.5952856Z 2025-05-07T19:45:57.5952860Z 2025-05-07T19:45:57.5952863Z 2025-05-07T19:45:57.5952867Z 2025-05-07T19:45:57.5952880Z 2025-05-07T19:45:57.5952883Z 2025-05-07T19:45:57.5952887Z 2025-05-07T19:45:57.5952890Z 2025-05-07T19:45:57.5952893Z 2025-05-07T19:45:57.5952897Z 2025-05-07T19:45:57.5952900Z 2025-05-07T19:45:57.5952917Z 2025-05-07T19:45:57.5952921Z 2025-05-07T19:45:57.5952924Z 2025-05-07T19:45:57.6462090Z cuda-nvvm-tools-12.6 | 10.4 MB | #2 | 12%  2025-05-07T19:45:57.6462496Z 2025-05-07T19:45:57.6462501Z 2025-05-07T19:45:57.6462506Z 2025-05-07T19:45:57.6462511Z 2025-05-07T19:45:57.6462529Z 2025-05-07T19:45:57.6462533Z 2025-05-07T19:45:57.6462538Z 2025-05-07T19:45:57.6462542Z 2025-05-07T19:45:57.6462546Z 2025-05-07T19:45:57.6462550Z 2025-05-07T19:45:57.6462826Z gds-tools-1.11.1.6 | 37.8 MB | ########## | 100%  2025-05-07T19:45:57.6463117Z 2025-05-07T19:45:57.6463120Z 2025-05-07T19:45:57.6463136Z 2025-05-07T19:45:57.6463140Z 2025-05-07T19:45:57.6463157Z 2025-05-07T19:45:57.6463160Z 2025-05-07T19:45:57.6463366Z 2025-05-07T19:45:57.6463372Z 2025-05-07T19:45:57.6463375Z 2025-05-07T19:45:57.6463379Z 2025-05-07T19:45:57.6876058Z gds-tools-1.11.1.6 | 37.8 MB | ########## | 100%  2025-05-07T19:45:57.6876391Z 2025-05-07T19:45:57.6876410Z 2025-05-07T19:45:57.6876413Z 2025-05-07T19:45:57.6876417Z 2025-05-07T19:45:57.6876421Z 2025-05-07T19:45:57.6876424Z 2025-05-07T19:45:57.6876428Z 2025-05-07T19:45:57.6876431Z 2025-05-07T19:45:57.6876434Z 2025-05-07T19:45:57.6876438Z 2025-05-07T19:45:57.6876441Z 2025-05-07T19:45:57.6876444Z 2025-05-07T19:45:57.6876448Z 2025-05-07T19:45:57.6876451Z 2025-05-07T19:45:57.6913474Z cuda-nvcc-dev_linux- | 10.8 MB | ########## | 100%  2025-05-07T19:45:57.6913854Z 2025-05-07T19:45:57.6913859Z 2025-05-07T19:45:57.6913862Z 2025-05-07T19:45:57.6913866Z 2025-05-07T19:45:57.6913881Z 2025-05-07T19:45:57.6913884Z 2025-05-07T19:45:57.6913888Z 2025-05-07T19:45:57.6913891Z 2025-05-07T19:45:57.6913902Z 2025-05-07T19:45:57.6913905Z 2025-05-07T19:45:57.6913909Z 2025-05-07T19:45:57.6913912Z 2025-05-07T19:45:57.6913915Z 2025-05-07T19:45:57.6952101Z libnvjitlink-12.6.85 | 14.9 MB | ########## | 100%  2025-05-07T19:45:57.6952449Z 2025-05-07T19:45:57.6952485Z 2025-05-07T19:45:57.6952488Z 2025-05-07T19:45:57.6952492Z 2025-05-07T19:45:57.6952496Z 2025-05-07T19:45:57.6952613Z 2025-05-07T19:45:57.6952622Z 2025-05-07T19:45:57.6952626Z 2025-05-07T19:45:57.6952629Z 2025-05-07T19:45:57.6952633Z 2025-05-07T19:45:57.6952636Z 2025-05-07T19:45:57.6952640Z 2025-05-07T19:45:57.6952644Z 2025-05-07T19:45:57.6952648Z 2025-05-07T19:45:57.6952652Z 2025-05-07T19:45:57.7082071Z cuda-nvvm-tools-12.6 | 10.4 MB | #########6 | 97%  2025-05-07T19:45:57.7082422Z 2025-05-07T19:45:57.7082427Z 2025-05-07T19:45:57.7082626Z 2025-05-07T19:45:57.7082630Z 2025-05-07T19:45:57.7082633Z 2025-05-07T19:45:57.7082637Z 2025-05-07T19:45:57.7082648Z 2025-05-07T19:45:57.7082653Z 2025-05-07T19:45:57.7082659Z 2025-05-07T19:45:57.7082665Z 2025-05-07T19:45:57.7082671Z 2025-05-07T19:45:57.7082676Z 2025-05-07T19:45:57.7082681Z 2025-05-07T19:45:57.7082686Z 2025-05-07T19:45:57.7082711Z 2025-05-07T19:45:57.7082716Z 2025-05-07T19:45:57.7288003Z cuda-sanitizer-api-1 | 8.9 MB | | 0%  2025-05-07T19:45:57.7288459Z 2025-05-07T19:45:57.7288463Z 2025-05-07T19:45:57.7288467Z 2025-05-07T19:45:57.7288470Z 2025-05-07T19:45:57.7288473Z 2025-05-07T19:45:57.7288477Z 2025-05-07T19:45:57.7288480Z 2025-05-07T19:45:57.7288483Z 2025-05-07T19:45:57.7288487Z 2025-05-07T19:45:57.7288490Z 2025-05-07T19:45:57.7288494Z 2025-05-07T19:45:57.7288497Z 2025-05-07T19:45:57.7288500Z 2025-05-07T19:45:57.7288503Z 2025-05-07T19:45:57.7288507Z 2025-05-07T19:45:57.7288510Z 2025-05-07T19:45:57.7288524Z 2025-05-07T19:45:57.7288528Z 2025-05-07T19:45:57.7539083Z cuda-cupti-dev-12.6. | 3.4 MB | | 0%  2025-05-07T19:45:57.7539452Z 2025-05-07T19:45:57.7539457Z 2025-05-07T19:45:57.7539460Z 2025-05-07T19:45:57.7539464Z 2025-05-07T19:45:57.7539467Z 2025-05-07T19:45:57.7539470Z 2025-05-07T19:45:57.7539475Z 2025-05-07T19:45:57.7539478Z 2025-05-07T19:45:57.7539481Z 2025-05-07T19:45:57.7539498Z 2025-05-07T19:45:57.7539501Z 2025-05-07T19:45:57.7539505Z 2025-05-07T19:45:57.7539508Z 2025-05-07T19:45:57.7539511Z 2025-05-07T19:45:57.7539515Z 2025-05-07T19:45:57.7539518Z 2025-05-07T19:45:57.7539521Z 2025-05-07T19:45:57.8083041Z cuda-nvvm-impl-12.6. | 7.7 MB | | 0%  2025-05-07T19:45:57.8083417Z 2025-05-07T19:45:57.8083421Z 2025-05-07T19:45:57.8083426Z 2025-05-07T19:45:57.8083429Z 2025-05-07T19:45:57.8083433Z 2025-05-07T19:45:57.8083436Z 2025-05-07T19:45:57.8083452Z 2025-05-07T19:45:57.8083455Z 2025-05-07T19:45:57.8083459Z 2025-05-07T19:45:57.8083462Z 2025-05-07T19:45:57.8083691Z 2025-05-07T19:45:57.8083696Z 2025-05-07T19:45:57.8083699Z 2025-05-07T19:45:57.8083703Z 2025-05-07T19:45:57.8083707Z 2025-05-07T19:45:57.8083710Z 2025-05-07T19:45:57.8237739Z cuda-sanitizer-api-1 | 8.9 MB | ######9 | 69%  2025-05-07T19:45:57.8238113Z 2025-05-07T19:45:57.8238117Z 2025-05-07T19:45:57.8238120Z 2025-05-07T19:45:57.8238124Z 2025-05-07T19:45:57.8238128Z 2025-05-07T19:45:57.8238131Z 2025-05-07T19:45:57.8238135Z 2025-05-07T19:45:57.8238138Z 2025-05-07T19:45:57.8238141Z 2025-05-07T19:45:57.8238144Z 2025-05-07T19:45:57.8238148Z 2025-05-07T19:45:57.8238151Z 2025-05-07T19:45:57.8238154Z 2025-05-07T19:45:57.8238170Z 2025-05-07T19:45:57.8238174Z 2025-05-07T19:45:57.8238177Z 2025-05-07T19:45:57.8238180Z 2025-05-07T19:45:57.8238184Z 2025-05-07T19:45:57.8357274Z cuda-cupti-dev-12.6. | 3.4 MB | ########## | 100%  2025-05-07T19:45:57.8357669Z 2025-05-07T19:45:57.8357680Z 2025-05-07T19:45:57.8357699Z 2025-05-07T19:45:57.8357702Z 2025-05-07T19:45:57.8357706Z 2025-05-07T19:45:57.8357709Z 2025-05-07T19:45:57.8357713Z 2025-05-07T19:45:57.8357716Z 2025-05-07T19:45:57.8357719Z 2025-05-07T19:45:57.8357723Z 2025-05-07T19:45:57.8357726Z 2025-05-07T19:45:57.8357730Z 2025-05-07T19:45:57.8357733Z 2025-05-07T19:45:57.8357736Z 2025-05-07T19:45:57.8357739Z 2025-05-07T19:45:57.8540179Z cuda-nvvm-tools-12.6 | 10.4 MB | ########## | 100%  2025-05-07T19:45:57.8540556Z 2025-05-07T19:45:57.8540561Z 2025-05-07T19:45:57.8540567Z 2025-05-07T19:45:57.8540570Z 2025-05-07T19:45:57.8540574Z 2025-05-07T19:45:57.8540577Z 2025-05-07T19:45:57.8540581Z 2025-05-07T19:45:57.8540584Z 2025-05-07T19:45:57.8540587Z 2025-05-07T19:45:57.8540591Z 2025-05-07T19:45:57.8540594Z 2025-05-07T19:45:57.8540597Z 2025-05-07T19:45:57.8540803Z 2025-05-07T19:45:57.8540807Z 2025-05-07T19:45:57.8540810Z 2025-05-07T19:45:57.8540814Z 2025-05-07T19:45:57.8540824Z 2025-05-07T19:45:57.8557971Z cuda-nvvm-impl-12.6. | 7.7 MB | ######6 | 66%  2025-05-07T19:45:57.8558348Z 2025-05-07T19:45:57.8558353Z 2025-05-07T19:45:57.8558357Z 2025-05-07T19:45:57.8558360Z 2025-05-07T19:45:57.8558364Z 2025-05-07T19:45:57.8558367Z 2025-05-07T19:45:57.8558654Z libcusolver-11.7.1.2 | 95.8 MB | ########## | 100%  2025-05-07T19:45:57.8558952Z 2025-05-07T19:45:57.8558956Z 2025-05-07T19:45:57.8558959Z 2025-05-07T19:45:57.8558962Z 2025-05-07T19:45:57.8558966Z 2025-05-07T19:45:57.8558969Z 2025-05-07T19:45:57.8558972Z 2025-05-07T19:45:57.8558976Z 2025-05-07T19:45:57.8558979Z 2025-05-07T19:45:57.8558982Z 2025-05-07T19:45:57.8558986Z 2025-05-07T19:45:57.8559004Z 2025-05-07T19:45:57.8559007Z 2025-05-07T19:45:57.8559010Z 2025-05-07T19:45:57.8559014Z 2025-05-07T19:45:57.8559029Z 2025-05-07T19:45:57.8559032Z 2025-05-07T19:45:57.8559036Z 2025-05-07T19:45:57.8559540Z 2025-05-07T19:45:57.9275896Z ... (more hidden) ... 2025-05-07T19:45:57.9276237Z 2025-05-07T19:45:57.9276241Z 2025-05-07T19:45:57.9276245Z 2025-05-07T19:45:57.9276248Z 2025-05-07T19:45:57.9276252Z 2025-05-07T19:45:57.9276255Z 2025-05-07T19:45:57.9276259Z 2025-05-07T19:45:57.9276262Z 2025-05-07T19:45:57.9276266Z 2025-05-07T19:45:57.9276269Z 2025-05-07T19:45:57.9276272Z 2025-05-07T19:45:57.9276276Z 2025-05-07T19:45:57.9276279Z 2025-05-07T19:45:57.9276282Z 2025-05-07T19:45:57.9276285Z 2025-05-07T19:45:57.9276289Z 2025-05-07T19:45:57.9276292Z 2025-05-07T19:45:57.9276295Z 2025-05-07T19:45:57.9276298Z 2025-05-07T19:45:57.9652270Z ... (more hidden) ... 2025-05-07T19:45:57.9652590Z 2025-05-07T19:45:57.9652595Z 2025-05-07T19:45:57.9652599Z 2025-05-07T19:45:57.9652602Z 2025-05-07T19:45:57.9652618Z 2025-05-07T19:45:57.9652622Z 2025-05-07T19:45:57.9652625Z 2025-05-07T19:45:57.9652628Z 2025-05-07T19:45:57.9652817Z 2025-05-07T19:45:57.9652822Z 2025-05-07T19:45:57.9652842Z 2025-05-07T19:45:57.9652845Z 2025-05-07T19:45:57.9652849Z 2025-05-07T19:45:57.9652852Z 2025-05-07T19:45:57.9652855Z 2025-05-07T19:45:57.9652863Z 2025-05-07T19:45:57.9927236Z cuda-sanitizer-api-1 | 8.9 MB | ########## | 100%  2025-05-07T19:45:57.9927628Z 2025-05-07T19:45:57.9927633Z 2025-05-07T19:45:57.9927636Z 2025-05-07T19:45:57.9927640Z 2025-05-07T19:45:57.9927644Z 2025-05-07T19:45:57.9927647Z 2025-05-07T19:45:57.9927651Z 2025-05-07T19:45:57.9927654Z 2025-05-07T19:45:57.9927658Z 2025-05-07T19:45:57.9927661Z 2025-05-07T19:45:57.9927665Z 2025-05-07T19:45:57.9927668Z 2025-05-07T19:45:57.9927671Z 2025-05-07T19:45:57.9927675Z 2025-05-07T19:45:57.9927678Z 2025-05-07T19:45:57.9927681Z 2025-05-07T19:45:57.9927685Z 2025-05-07T19:45:58.0180847Z cuda-nvvm-impl-12.6. | 7.7 MB | ########## | 100%  2025-05-07T19:45:58.0181229Z 2025-05-07T19:45:58.0181234Z 2025-05-07T19:45:58.0181238Z 2025-05-07T19:45:58.0181241Z 2025-05-07T19:45:58.0181245Z 2025-05-07T19:45:58.0181248Z 2025-05-07T19:45:58.0181251Z 2025-05-07T19:45:58.0181255Z 2025-05-07T19:45:58.0181258Z 2025-05-07T19:45:58.1621200Z libcurand-10.3.7.77 | 39.9 MB | ########## | 100%  2025-05-07T19:45:58.1621535Z 2025-05-07T19:45:58.1621539Z 2025-05-07T19:45:58.1621543Z 2025-05-07T19:45:58.1621546Z 2025-05-07T19:45:58.1621550Z 2025-05-07T19:45:58.3975336Z cuda-nvvp-12.6.80 | 109.3 MB | ########## | 100%  2025-05-07T19:45:58.3975678Z 2025-05-07T19:45:58.3975683Z 2025-05-07T19:45:58.3975687Z 2025-05-07T19:45:58.3975692Z 2025-05-07T19:45:58.3975695Z 2025-05-07T19:45:58.3975700Z 2025-05-07T19:45:58.3975704Z 2025-05-07T19:45:58.3975708Z 2025-05-07T19:45:58.3975712Z 2025-05-07T19:45:58.3975715Z 2025-05-07T19:45:58.3975913Z 2025-05-07T19:45:58.3975917Z 2025-05-07T19:45:58.8294454Z cuda-nvrtc-12.6.85 | 17.3 MB | ########## | 100%  2025-05-07T19:45:58.8294792Z 2025-05-07T19:45:58.8294797Z 2025-05-07T19:45:58.8294801Z 2025-05-07T19:45:58.8294805Z 2025-05-07T19:45:58.8294810Z 2025-05-07T19:45:58.8294815Z 2025-05-07T19:45:58.8294821Z 2025-05-07T19:45:58.8294827Z 2025-05-07T19:45:58.8294846Z 2025-05-07T19:45:58.8294852Z 2025-05-07T19:45:58.8294857Z 2025-05-07T19:45:59.0755829Z cuda-nvcc-tools-12.6 | 23.0 MB | ########## | 100%  2025-05-07T19:45:59.0756171Z 2025-05-07T19:45:59.0756176Z 2025-05-07T19:45:59.0756179Z 2025-05-07T19:45:59.0756183Z 2025-05-07T19:45:59.0756187Z 2025-05-07T19:45:59.0756191Z 2025-05-07T19:45:59.0756195Z 2025-05-07T19:45:59.0756199Z 2025-05-07T19:45:59.0756202Z 2025-05-07T19:45:59.0756205Z 2025-05-07T19:45:59.1738871Z gds-tools-1.11.1.6 | 37.8 MB | ########## | 100%  2025-05-07T19:45:59.1739214Z 2025-05-07T19:45:59.1739219Z 2025-05-07T19:45:59.1739223Z 2025-05-07T19:45:59.1739234Z 2025-05-07T19:45:59.1739237Z 2025-05-07T19:45:59.1739241Z 2025-05-07T19:45:59.1739244Z 2025-05-07T19:45:59.2792986Z libnpp-12.3.1.54 | 93.4 MB | ########## | 100%  2025-05-07T19:45:59.2793303Z 2025-05-07T19:45:59.2793307Z 2025-05-07T19:45:59.2793311Z 2025-05-07T19:45:59.2793315Z 2025-05-07T19:45:59.2793333Z 2025-05-07T19:45:59.2793336Z 2025-05-07T19:45:59.2793340Z 2025-05-07T19:45:59.2793343Z 2025-05-07T19:45:59.2793347Z 2025-05-07T19:45:59.2793350Z 2025-05-07T19:45:59.2793353Z 2025-05-07T19:45:59.2793357Z 2025-05-07T19:45:59.2793360Z 2025-05-07T19:45:59.2793363Z 2025-05-07T19:45:59.3865114Z cuda-nvcc-dev_linux- | 10.8 MB | ########## | 100%  2025-05-07T19:45:59.3865493Z 2025-05-07T19:45:59.3865498Z 2025-05-07T19:45:59.3865502Z 2025-05-07T19:45:59.3865505Z 2025-05-07T19:45:59.3865509Z 2025-05-07T19:45:59.3865525Z 2025-05-07T19:45:59.3865528Z 2025-05-07T19:45:59.3865532Z 2025-05-07T19:45:59.3865535Z 2025-05-07T19:45:59.3865766Z 2025-05-07T19:45:59.3865771Z 2025-05-07T19:45:59.3865774Z 2025-05-07T19:45:59.3865778Z 2025-05-07T19:45:59.3865781Z 2025-05-07T19:45:59.3865784Z 2025-05-07T19:45:59.3865788Z 2025-05-07T19:45:59.3865791Z 2025-05-07T19:45:59.3865794Z 2025-05-07T19:45:59.3866163Z cuda-cupti-dev-12.6. | 3.4 MB | ########## | 100%  2025-05-07T19:45:59.3866507Z 2025-05-07T19:45:59.3866510Z 2025-05-07T19:45:59.3866514Z 2025-05-07T19:45:59.3866517Z 2025-05-07T19:45:59.3866521Z 2025-05-07T19:45:59.3866524Z 2025-05-07T19:45:59.3866527Z 2025-05-07T19:45:59.3866531Z 2025-05-07T19:45:59.3866535Z 2025-05-07T19:45:59.3866538Z 2025-05-07T19:45:59.3866542Z 2025-05-07T19:45:59.3866558Z 2025-05-07T19:45:59.3866562Z 2025-05-07T19:45:59.3866565Z 2025-05-07T19:45:59.3866568Z 2025-05-07T19:45:59.3866571Z 2025-05-07T19:45:59.3866580Z 2025-05-07T19:45:59.3866583Z 2025-05-07T19:45:59.3971111Z cuda-cupti-dev-12.6. | 3.4 MB | ########## | 100%  2025-05-07T19:45:59.3971481Z 2025-05-07T19:45:59.3971500Z 2025-05-07T19:45:59.3971503Z 2025-05-07T19:45:59.3971507Z 2025-05-07T19:45:59.3971522Z 2025-05-07T19:45:59.3971525Z 2025-05-07T19:45:59.3971529Z 2025-05-07T19:45:59.3971532Z 2025-05-07T19:45:59.3971536Z 2025-05-07T19:45:59.3971539Z 2025-05-07T19:45:59.3971543Z 2025-05-07T19:45:59.3971546Z 2025-05-07T19:45:59.3971549Z 2025-05-07T19:45:59.4512592Z libnvjitlink-12.6.85 | 14.9 MB | ########## | 100%  2025-05-07T19:45:59.4512969Z 2025-05-07T19:45:59.4512973Z 2025-05-07T19:45:59.4512977Z 2025-05-07T19:45:59.4512980Z 2025-05-07T19:45:59.4512984Z 2025-05-07T19:45:59.4512987Z 2025-05-07T19:45:59.4512990Z 2025-05-07T19:45:59.4512993Z 2025-05-07T19:45:59.4512997Z 2025-05-07T19:45:59.4513000Z 2025-05-07T19:45:59.4513003Z 2025-05-07T19:45:59.4513177Z 2025-05-07T19:45:59.4513181Z 2025-05-07T19:45:59.4513184Z 2025-05-07T19:45:59.4513188Z 2025-05-07T19:45:59.4513210Z 2025-05-07T19:45:59.4513214Z 2025-05-07T19:45:59.4513217Z 2025-05-07T19:45:59.4513220Z 2025-05-07T19:45:59.4513488Z ... (more hidden) ... 2025-05-07T19:45:59.4513779Z 2025-05-07T19:45:59.4513782Z 2025-05-07T19:45:59.4513786Z 2025-05-07T19:45:59.4513789Z 2025-05-07T19:45:59.4513792Z 2025-05-07T19:45:59.4513796Z 2025-05-07T19:45:59.4513811Z 2025-05-07T19:45:59.4513815Z 2025-05-07T19:45:59.4513818Z 2025-05-07T19:45:59.4513821Z 2025-05-07T19:45:59.4513824Z 2025-05-07T19:45:59.4513828Z 2025-05-07T19:45:59.4513831Z 2025-05-07T19:45:59.4513834Z 2025-05-07T19:45:59.4513837Z 2025-05-07T19:45:59.4513841Z 2025-05-07T19:45:59.4513844Z 2025-05-07T19:45:59.4513847Z 2025-05-07T19:45:59.4513851Z 2025-05-07T19:45:59.5580084Z ... (more hidden) ... 2025-05-07T19:45:59.5580441Z 2025-05-07T19:45:59.5580445Z 2025-05-07T19:45:59.5580449Z 2025-05-07T19:45:59.5580453Z 2025-05-07T19:45:59.5580463Z 2025-05-07T19:45:59.5580467Z 2025-05-07T19:45:59.5580470Z 2025-05-07T19:45:59.5580474Z 2025-05-07T19:45:59.5580477Z 2025-05-07T19:45:59.5580480Z 2025-05-07T19:45:59.5580484Z 2025-05-07T19:45:59.5580487Z 2025-05-07T19:45:59.5580490Z 2025-05-07T19:45:59.5580493Z 2025-05-07T19:45:59.5580497Z 2025-05-07T19:45:59.6641625Z cuda-nvvm-tools-12.6 | 10.4 MB | ########## | 100%  2025-05-07T19:45:59.6641985Z 2025-05-07T19:45:59.6641990Z 2025-05-07T19:45:59.6641993Z 2025-05-07T19:45:59.6641997Z 2025-05-07T19:45:59.6642000Z 2025-05-07T19:45:59.6642003Z 2025-05-07T19:45:59.6642007Z 2025-05-07T19:45:59.6642010Z 2025-05-07T19:45:59.6642014Z 2025-05-07T19:45:59.6642017Z 2025-05-07T19:45:59.6642020Z 2025-05-07T19:45:59.6642024Z 2025-05-07T19:45:59.6642027Z 2025-05-07T19:45:59.6642030Z 2025-05-07T19:45:59.6642056Z 2025-05-07T19:45:59.6642060Z 2025-05-07T19:45:59.6949784Z cuda-sanitizer-api-1 | 8.9 MB | ########## | 100%  2025-05-07T19:45:59.6950181Z 2025-05-07T19:45:59.6950185Z 2025-05-07T19:45:59.6950189Z 2025-05-07T19:45:59.6950192Z 2025-05-07T19:45:59.6950208Z 2025-05-07T19:45:59.6950212Z 2025-05-07T19:45:59.6950215Z 2025-05-07T19:45:59.6950219Z 2025-05-07T19:45:59.6950222Z 2025-05-07T19:45:59.6950225Z 2025-05-07T19:45:59.6950229Z 2025-05-07T19:45:59.6950232Z 2025-05-07T19:45:59.6950235Z 2025-05-07T19:45:59.6950239Z 2025-05-07T19:45:59.6950242Z 2025-05-07T19:45:59.6950245Z 2025-05-07T19:45:59.6950249Z 2025-05-07T19:46:00.5602339Z cuda-nvvm-impl-12.6. | 7.7 MB | ########## | 100%  2025-05-07T19:46:01.3872840Z nsight-compute-2024. | 443.1 MB | ########## | 100% 2025-05-07T19:46:01.3873151Z 2025-05-07T19:46:03.9813577Z libcublas-12.6.4.1 | 256.2 MB | ########## | 100%  2025-05-07T19:46:03.9819142Z nsight-compute-2024. | 443.1 MB | ########## | 100% 2025-05-07T19:46:03.9819527Z 2025-05-07T19:46:03.9819651Z 2025-05-07T19:46:03.9819688Z 2025-05-07T19:46:03.9819692Z 2025-05-07T19:46:03.9819720Z 2025-05-07T19:46:03.9819723Z 2025-05-07T19:46:03.9819758Z 2025-05-07T19:46:03.9819764Z 2025-05-07T19:46:03.9819819Z 2025-05-07T19:46:03.9819823Z 2025-05-07T19:46:03.9819829Z 2025-05-07T19:46:03.9819860Z 2025-05-07T19:46:03.9819929Z 2025-05-07T19:46:03.9819933Z 2025-05-07T19:46:03.9819966Z 2025-05-07T19:46:03.9819990Z 2025-05-07T19:46:03.9820040Z 2025-05-07T19:46:03.9820046Z 2025-05-07T19:46:03.9820050Z 2025-05-07T19:46:03.9820171Z 2025-05-07T19:46:03.9820546Z  2025-05-07T19:46:03.9820889Z 2025-05-07T19:46:03.9821095Z 2025-05-07T19:46:03.9822229Z  2025-05-07T19:46:03.9822461Z 2025-05-07T19:46:03.9822728Z 2025-05-07T19:46:03.9822924Z  2025-05-07T19:46:03.9823150Z 2025-05-07T19:46:03.9823154Z 2025-05-07T19:46:03.9823158Z 2025-05-07T19:46:03.9823336Z  2025-05-07T19:46:03.9823572Z 2025-05-07T19:46:03.9823576Z 2025-05-07T19:46:03.9823579Z 2025-05-07T19:46:03.9823583Z 2025-05-07T19:46:03.9823758Z  2025-05-07T19:46:03.9823988Z 2025-05-07T19:46:03.9823992Z 2025-05-07T19:46:03.9823995Z 2025-05-07T19:46:03.9823998Z 2025-05-07T19:46:03.9824002Z 2025-05-07T19:46:03.9824193Z  2025-05-07T19:46:03.9824414Z 2025-05-07T19:46:03.9824418Z 2025-05-07T19:46:03.9824421Z 2025-05-07T19:46:03.9824425Z 2025-05-07T19:46:03.9824428Z 2025-05-07T19:46:03.9824432Z 2025-05-07T19:46:03.9824634Z  2025-05-07T19:46:03.9824865Z 2025-05-07T19:46:03.9824869Z 2025-05-07T19:46:03.9824874Z 2025-05-07T19:46:03.9824878Z 2025-05-07T19:46:03.9824886Z 2025-05-07T19:46:03.9824889Z 2025-05-07T19:46:03.9824893Z 2025-05-07T19:46:03.9825134Z  2025-05-07T19:46:03.9825368Z 2025-05-07T19:46:03.9825372Z 2025-05-07T19:46:03.9825376Z 2025-05-07T19:46:03.9825379Z 2025-05-07T19:46:03.9825382Z 2025-05-07T19:46:03.9825386Z 2025-05-07T19:46:03.9825389Z 2025-05-07T19:46:03.9825392Z 2025-05-07T19:46:03.9825601Z  2025-05-07T19:46:03.9825828Z 2025-05-07T19:46:03.9825832Z 2025-05-07T19:46:03.9825835Z 2025-05-07T19:46:03.9825839Z 2025-05-07T19:46:03.9825842Z 2025-05-07T19:46:03.9825846Z 2025-05-07T19:46:03.9825849Z 2025-05-07T19:46:03.9825852Z 2025-05-07T19:46:03.9825856Z 2025-05-07T19:46:03.9826044Z  2025-05-07T19:46:03.9826296Z 2025-05-07T19:46:03.9826300Z 2025-05-07T19:46:03.9826304Z 2025-05-07T19:46:03.9826433Z 2025-05-07T19:46:03.9826437Z 2025-05-07T19:46:03.9826441Z 2025-05-07T19:46:03.9826444Z 2025-05-07T19:46:03.9826447Z 2025-05-07T19:46:03.9826451Z 2025-05-07T19:46:03.9826454Z 2025-05-07T19:46:03.9826674Z  2025-05-07T19:46:03.9826910Z 2025-05-07T19:46:03.9826913Z 2025-05-07T19:46:03.9826916Z 2025-05-07T19:46:03.9826921Z 2025-05-07T19:46:03.9826924Z 2025-05-07T19:46:03.9826927Z 2025-05-07T19:46:03.9826931Z 2025-05-07T19:46:03.9826935Z 2025-05-07T19:46:03.9826938Z 2025-05-07T19:46:03.9826942Z 2025-05-07T19:46:03.9826945Z 2025-05-07T19:46:03.9827176Z  2025-05-07T19:46:03.9827414Z 2025-05-07T19:46:03.9827418Z 2025-05-07T19:46:03.9827421Z 2025-05-07T19:46:03.9827425Z 2025-05-07T19:46:03.9827435Z 2025-05-07T19:46:03.9827438Z 2025-05-07T19:46:03.9827442Z 2025-05-07T19:46:03.9827445Z 2025-05-07T19:46:03.9827452Z 2025-05-07T19:46:03.9827456Z 2025-05-07T19:46:03.9827459Z 2025-05-07T19:46:03.9827463Z 2025-05-07T19:46:03.9827686Z  2025-05-07T19:46:03.9827926Z 2025-05-07T19:46:03.9827930Z 2025-05-07T19:46:03.9827933Z 2025-05-07T19:46:03.9827937Z 2025-05-07T19:46:03.9827988Z 2025-05-07T19:46:03.9827992Z 2025-05-07T19:46:03.9828014Z 2025-05-07T19:46:03.9828018Z 2025-05-07T19:46:03.9828021Z 2025-05-07T19:46:03.9828024Z 2025-05-07T19:46:03.9828028Z 2025-05-07T19:46:03.9828031Z 2025-05-07T19:46:03.9828034Z 2025-05-07T19:46:03.9828240Z  2025-05-07T19:46:03.9828723Z 2025-05-07T19:46:03.9828728Z 2025-05-07T19:46:03.9828732Z 2025-05-07T19:46:03.9828754Z 2025-05-07T19:46:03.9828757Z 2025-05-07T19:46:03.9828760Z 2025-05-07T19:46:03.9828870Z 2025-05-07T19:46:03.9828874Z 2025-05-07T19:46:03.9828878Z 2025-05-07T19:46:03.9828882Z 2025-05-07T19:46:03.9828889Z 2025-05-07T19:46:03.9828893Z 2025-05-07T19:46:03.9828896Z 2025-05-07T19:46:03.9828899Z 2025-05-07T19:46:03.9829117Z  2025-05-07T19:46:03.9829385Z 2025-05-07T19:46:03.9829388Z 2025-05-07T19:46:03.9829391Z 2025-05-07T19:46:03.9829395Z 2025-05-07T19:46:03.9829398Z 2025-05-07T19:46:03.9829401Z 2025-05-07T19:46:03.9829405Z 2025-05-07T19:46:03.9829408Z 2025-05-07T19:46:03.9829411Z 2025-05-07T19:46:03.9829414Z 2025-05-07T19:46:03.9829418Z 2025-05-07T19:46:03.9829421Z 2025-05-07T19:46:03.9829425Z 2025-05-07T19:46:03.9829428Z 2025-05-07T19:46:03.9829432Z 2025-05-07T19:46:03.9829647Z  2025-05-07T19:46:03.9829910Z 2025-05-07T19:46:03.9829913Z 2025-05-07T19:46:03.9829921Z 2025-05-07T19:46:03.9829925Z 2025-05-07T19:46:03.9829928Z 2025-05-07T19:46:03.9829932Z 2025-05-07T19:46:03.9829939Z 2025-05-07T19:46:03.9829943Z 2025-05-07T19:46:03.9829946Z 2025-05-07T19:46:03.9829950Z 2025-05-07T19:46:03.9829953Z 2025-05-07T19:46:03.9829956Z 2025-05-07T19:46:03.9829960Z 2025-05-07T19:46:03.9829963Z 2025-05-07T19:46:03.9829967Z 2025-05-07T19:46:03.9829970Z 2025-05-07T19:46:03.9830213Z  2025-05-07T19:46:03.9830462Z 2025-05-07T19:46:03.9830466Z 2025-05-07T19:46:03.9830469Z 2025-05-07T19:46:03.9830473Z 2025-05-07T19:46:03.9830476Z 2025-05-07T19:46:03.9830480Z 2025-05-07T19:46:03.9830483Z 2025-05-07T19:46:03.9830486Z 2025-05-07T19:46:03.9830492Z 2025-05-07T19:46:03.9830495Z 2025-05-07T19:46:03.9830499Z 2025-05-07T19:46:03.9830521Z 2025-05-07T19:46:03.9830525Z 2025-05-07T19:46:03.9830528Z 2025-05-07T19:46:03.9830531Z 2025-05-07T19:46:03.9830534Z 2025-05-07T19:46:03.9830544Z 2025-05-07T19:46:03.9830853Z  2025-05-07T19:46:03.9831105Z 2025-05-07T19:46:03.9831109Z 2025-05-07T19:46:03.9831113Z 2025-05-07T19:46:03.9831133Z 2025-05-07T19:46:03.9831136Z 2025-05-07T19:46:03.9831139Z 2025-05-07T19:46:03.9831145Z 2025-05-07T19:46:03.9831148Z 2025-05-07T19:46:03.9831151Z 2025-05-07T19:46:03.9831155Z 2025-05-07T19:46:03.9831158Z 2025-05-07T19:46:03.9831162Z 2025-05-07T19:46:03.9831165Z 2025-05-07T19:46:03.9831168Z 2025-05-07T19:46:03.9831174Z 2025-05-07T19:46:03.9831178Z 2025-05-07T19:46:03.9831181Z 2025-05-07T19:46:03.9831184Z 2025-05-07T19:46:03.9831426Z  2025-05-07T19:46:03.9831697Z 2025-05-07T19:46:03.9831700Z 2025-05-07T19:46:03.9831800Z  2025-05-07T19:46:03.9831909Z 2025-05-07T19:46:03.9831913Z 2025-05-07T19:46:03.9832033Z  2025-05-07T19:46:03.9832159Z 2025-05-07T19:46:03.9832162Z 2025-05-07T19:46:03.9832166Z 2025-05-07T19:46:03.9832269Z  2025-05-07T19:46:03.9832409Z 2025-05-07T19:46:03.9832416Z 2025-05-07T19:46:03.9832419Z 2025-05-07T19:46:03.9832423Z 2025-05-07T19:46:03.9832529Z  2025-05-07T19:46:03.9832649Z 2025-05-07T19:46:03.9832653Z 2025-05-07T19:46:03.9832656Z 2025-05-07T19:46:03.9832660Z 2025-05-07T19:46:03.9832666Z 2025-05-07T19:46:03.9832787Z  2025-05-07T19:46:03.9832916Z 2025-05-07T19:46:03.9832919Z 2025-05-07T19:46:03.9832922Z 2025-05-07T19:46:03.9832926Z 2025-05-07T19:46:03.9832929Z 2025-05-07T19:46:03.9832933Z 2025-05-07T19:46:03.9833125Z  2025-05-07T19:46:03.9833255Z 2025-05-07T19:46:03.9833259Z 2025-05-07T19:46:03.9833263Z 2025-05-07T19:46:03.9833284Z 2025-05-07T19:46:03.9833287Z 2025-05-07T19:46:03.9833290Z 2025-05-07T19:46:03.9833294Z 2025-05-07T19:46:03.9833408Z  2025-05-07T19:46:03.9833551Z 2025-05-07T19:46:03.9833622Z 2025-05-07T19:46:03.9833626Z 2025-05-07T19:46:03.9833629Z 2025-05-07T19:46:03.9833633Z 2025-05-07T19:46:03.9833640Z 2025-05-07T19:46:03.9833644Z 2025-05-07T19:46:03.9833664Z 2025-05-07T19:46:03.9833784Z  2025-05-07T19:46:03.9833939Z 2025-05-07T19:46:03.9833942Z 2025-05-07T19:46:03.9833945Z 2025-05-07T19:46:03.9833949Z 2025-05-07T19:46:03.9833953Z 2025-05-07T19:46:03.9833956Z 2025-05-07T19:46:03.9833960Z 2025-05-07T19:46:03.9833963Z 2025-05-07T19:46:03.9833967Z 2025-05-07T19:46:03.9834108Z  2025-05-07T19:46:03.9834268Z 2025-05-07T19:46:03.9834272Z 2025-05-07T19:46:03.9834275Z 2025-05-07T19:46:03.9834278Z 2025-05-07T19:46:03.9834282Z 2025-05-07T19:46:03.9834285Z 2025-05-07T19:46:03.9834289Z 2025-05-07T19:46:03.9834292Z 2025-05-07T19:46:03.9834295Z 2025-05-07T19:46:03.9834299Z 2025-05-07T19:46:03.9834449Z  2025-05-07T19:46:03.9834620Z 2025-05-07T19:46:03.9834623Z 2025-05-07T19:46:03.9834626Z 2025-05-07T19:46:03.9834633Z 2025-05-07T19:46:03.9834637Z 2025-05-07T19:46:03.9834640Z 2025-05-07T19:46:03.9834644Z 2025-05-07T19:46:03.9834650Z 2025-05-07T19:46:03.9834654Z 2025-05-07T19:46:03.9834657Z 2025-05-07T19:46:03.9834661Z 2025-05-07T19:46:03.9834807Z  2025-05-07T19:46:03.9834989Z 2025-05-07T19:46:03.9834992Z 2025-05-07T19:46:03.9834996Z 2025-05-07T19:46:03.9834999Z 2025-05-07T19:46:03.9835002Z 2025-05-07T19:46:03.9835006Z 2025-05-07T19:46:03.9835009Z 2025-05-07T19:46:03.9835013Z 2025-05-07T19:46:03.9835016Z 2025-05-07T19:46:03.9835019Z 2025-05-07T19:46:03.9835022Z 2025-05-07T19:46:03.9835025Z 2025-05-07T19:46:03.9835176Z  2025-05-07T19:46:03.9835366Z 2025-05-07T19:46:03.9835369Z 2025-05-07T19:46:03.9835373Z 2025-05-07T19:46:03.9835376Z 2025-05-07T19:46:03.9835379Z 2025-05-07T19:46:03.9835382Z 2025-05-07T19:46:03.9835386Z 2025-05-07T19:46:03.9835389Z 2025-05-07T19:46:03.9835392Z 2025-05-07T19:46:03.9835396Z 2025-05-07T19:46:03.9835403Z 2025-05-07T19:46:03.9835423Z 2025-05-07T19:46:03.9835426Z 2025-05-07T19:46:03.9835662Z  2025-05-07T19:46:03.9835871Z 2025-05-07T19:46:03.9835875Z 2025-05-07T19:46:03.9835878Z 2025-05-07T19:46:03.9835882Z 2025-05-07T19:46:03.9835885Z 2025-05-07T19:46:03.9835888Z 2025-05-07T19:46:03.9835891Z 2025-05-07T19:46:03.9835895Z 2025-05-07T19:46:03.9835898Z 2025-05-07T19:46:03.9835918Z 2025-05-07T19:46:03.9835921Z 2025-05-07T19:46:03.9835924Z 2025-05-07T19:46:03.9835928Z 2025-05-07T19:46:03.9835931Z 2025-05-07T19:46:03.9836075Z  2025-05-07T19:46:03.9836276Z 2025-05-07T19:46:03.9836279Z 2025-05-07T19:46:03.9836282Z 2025-05-07T19:46:03.9836286Z 2025-05-07T19:46:03.9836289Z 2025-05-07T19:46:03.9836309Z 2025-05-07T19:46:03.9836312Z 2025-05-07T19:46:03.9836316Z 2025-05-07T19:46:03.9836319Z 2025-05-07T19:46:03.9836322Z 2025-05-07T19:46:03.9836326Z 2025-05-07T19:46:03.9836329Z 2025-05-07T19:46:03.9836337Z 2025-05-07T19:46:03.9836340Z 2025-05-07T19:46:03.9836344Z 2025-05-07T19:46:03.9836500Z  2025-05-07T19:46:03.9836708Z 2025-05-07T19:46:03.9836732Z 2025-05-07T19:46:03.9836735Z 2025-05-07T19:46:03.9836739Z 2025-05-07T19:46:03.9836742Z 2025-05-07T19:46:03.9836745Z 2025-05-07T19:46:03.9836749Z 2025-05-07T19:46:03.9836752Z 2025-05-07T19:46:03.9836755Z 2025-05-07T19:46:03.9836759Z 2025-05-07T19:46:03.9836762Z 2025-05-07T19:46:03.9836766Z 2025-05-07T19:46:03.9836769Z 2025-05-07T19:46:03.9836772Z 2025-05-07T19:46:03.9836775Z 2025-05-07T19:46:03.9836779Z 2025-05-07T19:46:03.9836943Z  2025-05-07T19:46:03.9837174Z 2025-05-07T19:46:03.9837177Z 2025-05-07T19:46:03.9837181Z 2025-05-07T19:46:03.9837184Z 2025-05-07T19:46:03.9837187Z 2025-05-07T19:46:03.9837190Z 2025-05-07T19:46:03.9837194Z 2025-05-07T19:46:03.9837197Z 2025-05-07T19:46:03.9837200Z 2025-05-07T19:46:03.9837203Z 2025-05-07T19:46:03.9837273Z 2025-05-07T19:46:03.9837277Z 2025-05-07T19:46:03.9837280Z 2025-05-07T19:46:03.9837283Z 2025-05-07T19:46:03.9837290Z 2025-05-07T19:46:03.9837294Z 2025-05-07T19:46:03.9837297Z 2025-05-07T19:46:03.9837480Z  2025-05-07T19:46:03.9837700Z 2025-05-07T19:46:03.9837703Z 2025-05-07T19:46:03.9837706Z 2025-05-07T19:46:03.9837710Z 2025-05-07T19:46:03.9837714Z 2025-05-07T19:46:03.9837717Z 2025-05-07T19:46:03.9837720Z 2025-05-07T19:46:03.9837723Z 2025-05-07T19:46:03.9837726Z 2025-05-07T19:46:03.9837730Z 2025-05-07T19:46:03.9837733Z 2025-05-07T19:46:03.9837737Z 2025-05-07T19:46:03.9837740Z 2025-05-07T19:46:03.9837760Z 2025-05-07T19:46:03.9837763Z 2025-05-07T19:46:03.9837767Z 2025-05-07T19:46:03.9837770Z 2025-05-07T19:46:03.9837773Z 2025-05-07T19:46:03.9837943Z  2025-05-07T19:46:03.9838173Z 2025-05-07T19:46:03.9838177Z 2025-05-07T19:46:03.9838293Z  2025-05-07T19:46:03.9838409Z 2025-05-07T19:46:03.9838416Z 2025-05-07T19:46:03.9838514Z  2025-05-07T19:46:03.9838648Z 2025-05-07T19:46:03.9838652Z 2025-05-07T19:46:03.9838659Z 2025-05-07T19:46:03.9838828Z  2025-05-07T19:46:03.9838956Z 2025-05-07T19:46:03.9838960Z 2025-05-07T19:46:03.9838963Z 2025-05-07T19:46:03.9838967Z 2025-05-07T19:46:03.9839071Z  2025-05-07T19:46:03.9839193Z 2025-05-07T19:46:03.9839196Z 2025-05-07T19:46:03.9839215Z 2025-05-07T19:46:03.9839219Z 2025-05-07T19:46:03.9839222Z 2025-05-07T19:46:03.9839327Z  2025-05-07T19:46:03.9839454Z 2025-05-07T19:46:03.9839458Z 2025-05-07T19:46:03.9839461Z 2025-05-07T19:46:03.9839465Z 2025-05-07T19:46:03.9839468Z 2025-05-07T19:46:03.9839472Z 2025-05-07T19:46:03.9839599Z  2025-05-07T19:46:03.9839728Z 2025-05-07T19:46:03.9839731Z 2025-05-07T19:46:03.9839735Z 2025-05-07T19:46:03.9839739Z 2025-05-07T19:46:03.9839742Z 2025-05-07T19:46:03.9839745Z 2025-05-07T19:46:03.9839749Z 2025-05-07T19:46:03.9839863Z  2025-05-07T19:46:03.9840026Z 2025-05-07T19:46:03.9840030Z 2025-05-07T19:46:03.9840034Z 2025-05-07T19:46:03.9840099Z 2025-05-07T19:46:03.9840103Z 2025-05-07T19:46:03.9840107Z 2025-05-07T19:46:03.9840110Z 2025-05-07T19:46:03.9840113Z 2025-05-07T19:46:03.9840231Z  2025-05-07T19:46:03.9840414Z 2025-05-07T19:46:03.9840418Z 2025-05-07T19:46:03.9840421Z 2025-05-07T19:46:03.9840424Z 2025-05-07T19:46:03.9840428Z 2025-05-07T19:46:03.9840431Z 2025-05-07T19:46:03.9840434Z 2025-05-07T19:46:03.9840438Z 2025-05-07T19:46:03.9840441Z 2025-05-07T19:46:03.9840564Z  2025-05-07T19:46:03.9840742Z 2025-05-07T19:46:03.9840745Z 2025-05-07T19:46:03.9840749Z 2025-05-07T19:46:03.9840752Z 2025-05-07T19:46:03.9840756Z 2025-05-07T19:46:03.9840759Z 2025-05-07T19:46:03.9840762Z 2025-05-07T19:46:03.9840766Z 2025-05-07T19:46:03.9840769Z 2025-05-07T19:46:03.9840772Z 2025-05-07T19:46:03.9840900Z  2025-05-07T19:46:03.9841090Z 2025-05-07T19:46:03.9841098Z 2025-05-07T19:46:03.9841101Z 2025-05-07T19:46:03.9841105Z 2025-05-07T19:46:03.9841108Z 2025-05-07T19:46:03.9841114Z 2025-05-07T19:46:03.9841118Z 2025-05-07T19:46:03.9841121Z 2025-05-07T19:46:03.9841124Z 2025-05-07T19:46:03.9841128Z 2025-05-07T19:46:03.9841131Z 2025-05-07T19:46:03.9841263Z  2025-05-07T19:46:03.9841461Z 2025-05-07T19:46:03.9841464Z 2025-05-07T19:46:03.9841467Z 2025-05-07T19:46:03.9841471Z 2025-05-07T19:46:03.9841474Z 2025-05-07T19:46:03.9841477Z 2025-05-07T19:46:03.9841481Z 2025-05-07T19:46:03.9841484Z 2025-05-07T19:46:03.9841487Z 2025-05-07T19:46:03.9841490Z 2025-05-07T19:46:03.9841494Z 2025-05-07T19:46:03.9841498Z 2025-05-07T19:46:03.9841631Z  2025-05-07T19:46:03.9841837Z 2025-05-07T19:46:03.9841841Z 2025-05-07T19:46:03.9841844Z 2025-05-07T19:46:03.9841848Z 2025-05-07T19:46:03.9841851Z 2025-05-07T19:46:03.9841854Z 2025-05-07T19:46:03.9841857Z 2025-05-07T19:46:03.9841924Z 2025-05-07T19:46:03.9841927Z 2025-05-07T19:46:03.9841931Z 2025-05-07T19:46:03.9841934Z 2025-05-07T19:46:03.9841941Z 2025-05-07T19:46:03.9841945Z 2025-05-07T19:46:03.9842086Z  2025-05-07T19:46:03.9842300Z 2025-05-07T19:46:03.9842304Z 2025-05-07T19:46:03.9842308Z 2025-05-07T19:46:03.9842311Z 2025-05-07T19:46:03.9842314Z 2025-05-07T19:46:03.9842317Z 2025-05-07T19:46:03.9842321Z 2025-05-07T19:46:03.9842324Z 2025-05-07T19:46:03.9842327Z 2025-05-07T19:46:03.9842330Z 2025-05-07T19:46:03.9842334Z 2025-05-07T19:46:03.9842337Z 2025-05-07T19:46:03.9842340Z 2025-05-07T19:46:03.9842343Z 2025-05-07T19:46:03.9842515Z  2025-05-07T19:46:03.9842705Z 2025-05-07T19:46:03.9842708Z 2025-05-07T19:46:03.9842760Z 2025-05-07T19:46:03.9842765Z 2025-05-07T19:46:03.9842768Z 2025-05-07T19:46:03.9842771Z 2025-05-07T19:46:03.9842775Z 2025-05-07T19:46:03.9842778Z 2025-05-07T19:46:03.9842781Z 2025-05-07T19:46:03.9842784Z 2025-05-07T19:46:03.9842792Z 2025-05-07T19:46:03.9842795Z 2025-05-07T19:46:03.9842799Z 2025-05-07T19:46:03.9842802Z 2025-05-07T19:46:03.9842809Z 2025-05-07T19:46:03.9842961Z  2025-05-07T19:46:03.9843188Z 2025-05-07T19:46:03.9843192Z 2025-05-07T19:46:03.9843195Z 2025-05-07T19:46:03.9843199Z 2025-05-07T19:46:03.9843202Z 2025-05-07T19:46:03.9843205Z 2025-05-07T19:46:03.9843209Z 2025-05-07T19:46:03.9843212Z 2025-05-07T19:46:03.9843215Z 2025-05-07T19:46:03.9843218Z 2025-05-07T19:46:03.9843222Z 2025-05-07T19:46:03.9843225Z 2025-05-07T19:46:03.9843229Z 2025-05-07T19:46:03.9843232Z 2025-05-07T19:46:03.9843235Z 2025-05-07T19:46:03.9843238Z 2025-05-07T19:46:03.9843417Z  2025-05-07T19:46:03.9843633Z 2025-05-07T19:46:03.9843637Z 2025-05-07T19:46:03.9843641Z 2025-05-07T19:46:03.9843644Z 2025-05-07T19:46:03.9843648Z 2025-05-07T19:46:03.9843651Z 2025-05-07T19:46:03.9843655Z 2025-05-07T19:46:03.9843658Z 2025-05-07T19:46:03.9843665Z 2025-05-07T19:46:03.9843668Z 2025-05-07T19:46:03.9843671Z 2025-05-07T19:46:03.9843675Z 2025-05-07T19:46:03.9843735Z 2025-05-07T19:46:03.9843739Z 2025-05-07T19:46:03.9843742Z 2025-05-07T19:46:03.9843746Z 2025-05-07T19:46:03.9843777Z 2025-05-07T19:46:03.9843940Z  2025-05-07T19:46:03.9844162Z 2025-05-07T19:46:03.9844165Z 2025-05-07T19:46:03.9844169Z 2025-05-07T19:46:03.9844172Z 2025-05-07T19:46:03.9844175Z 2025-05-07T19:46:03.9844179Z 2025-05-07T19:46:03.9844182Z 2025-05-07T19:46:03.9844185Z 2025-05-07T19:46:03.9844189Z 2025-05-07T19:46:03.9844192Z 2025-05-07T19:46:03.9844214Z 2025-05-07T19:46:03.9844217Z 2025-05-07T19:46:03.9844221Z 2025-05-07T19:46:03.9844224Z 2025-05-07T19:46:03.9844228Z 2025-05-07T19:46:03.9844231Z 2025-05-07T19:46:03.9844235Z 2025-05-07T19:46:03.9844239Z 2025-05-07T19:46:03.9844403Z  2025-05-07T19:46:03.9844630Z 2025-05-07T19:46:03.9844638Z 2025-05-07T19:46:03.9844753Z  2025-05-07T19:46:03.9844860Z 2025-05-07T19:46:03.9844863Z 2025-05-07T19:46:03.9844967Z  2025-05-07T19:46:03.9845097Z 2025-05-07T19:46:03.9845100Z 2025-05-07T19:46:03.9845104Z 2025-05-07T19:46:03.9845220Z  2025-05-07T19:46:03.9845333Z 2025-05-07T19:46:03.9845336Z 2025-05-07T19:46:03.9845340Z 2025-05-07T19:46:03.9845344Z 2025-05-07T19:46:03.9845475Z  2025-05-07T19:46:03.9845595Z 2025-05-07T19:46:03.9845599Z 2025-05-07T19:46:03.9845602Z 2025-05-07T19:46:03.9845606Z 2025-05-07T19:46:03.9845609Z 2025-05-07T19:46:03.9845715Z  2025-05-07T19:46:03.9845859Z 2025-05-07T19:46:03.9845863Z 2025-05-07T19:46:03.9845866Z 2025-05-07T19:46:03.9845870Z 2025-05-07T19:46:03.9845874Z 2025-05-07T19:46:03.9845877Z 2025-05-07T19:46:03.9846096Z  2025-05-07T19:46:03.9846247Z 2025-05-07T19:46:03.9846250Z 2025-05-07T19:46:03.9846254Z 2025-05-07T19:46:03.9846257Z 2025-05-07T19:46:03.9846261Z 2025-05-07T19:46:03.9846264Z 2025-05-07T19:46:03.9846329Z 2025-05-07T19:46:03.9846446Z  2025-05-07T19:46:03.9846586Z 2025-05-07T19:46:03.9846594Z 2025-05-07T19:46:03.9846618Z 2025-05-07T19:46:03.9846621Z 2025-05-07T19:46:03.9846624Z 2025-05-07T19:46:03.9846628Z 2025-05-07T19:46:03.9846631Z 2025-05-07T19:46:03.9846634Z 2025-05-07T19:46:03.9846753Z  2025-05-07T19:46:03.9846910Z 2025-05-07T19:46:03.9846913Z 2025-05-07T19:46:03.9846917Z 2025-05-07T19:46:03.9846921Z 2025-05-07T19:46:03.9846925Z 2025-05-07T19:46:03.9846945Z 2025-05-07T19:46:03.9846949Z 2025-05-07T19:46:03.9846953Z 2025-05-07T19:46:03.9846956Z 2025-05-07T19:46:03.9847080Z  2025-05-07T19:46:03.9847240Z 2025-05-07T19:46:03.9847243Z 2025-05-07T19:46:03.9847247Z 2025-05-07T19:46:03.9847250Z 2025-05-07T19:46:03.9847253Z 2025-05-07T19:46:03.9847257Z 2025-05-07T19:46:03.9847260Z 2025-05-07T19:46:03.9847282Z 2025-05-07T19:46:03.9847286Z 2025-05-07T19:46:03.9847289Z 2025-05-07T19:46:03.9847422Z  2025-05-07T19:46:03.9847596Z 2025-05-07T19:46:03.9847599Z 2025-05-07T19:46:03.9847607Z 2025-05-07T19:46:03.9847610Z 2025-05-07T19:46:03.9847614Z 2025-05-07T19:46:03.9847617Z 2025-05-07T19:46:03.9847621Z 2025-05-07T19:46:03.9847625Z 2025-05-07T19:46:03.9847645Z 2025-05-07T19:46:03.9847648Z 2025-05-07T19:46:03.9847651Z 2025-05-07T19:46:03.9847780Z  2025-05-07T19:46:03.9847960Z 2025-05-07T19:46:03.9847964Z 2025-05-07T19:46:03.9847967Z 2025-05-07T19:46:03.9847971Z 2025-05-07T19:46:03.9847975Z 2025-05-07T19:46:03.9847979Z 2025-05-07T19:46:03.9847982Z 2025-05-07T19:46:03.9847986Z 2025-05-07T19:46:03.9848006Z 2025-05-07T19:46:03.9848009Z 2025-05-07T19:46:03.9848012Z 2025-05-07T19:46:03.9848016Z 2025-05-07T19:46:03.9848148Z  2025-05-07T19:46:03.9848334Z 2025-05-07T19:46:03.9848337Z 2025-05-07T19:46:03.9848341Z 2025-05-07T19:46:03.9848345Z 2025-05-07T19:46:03.9848348Z 2025-05-07T19:46:03.9848355Z 2025-05-07T19:46:03.9848375Z 2025-05-07T19:46:03.9848378Z 2025-05-07T19:46:03.9848381Z 2025-05-07T19:46:03.9848443Z 2025-05-07T19:46:03.9848447Z 2025-05-07T19:46:03.9848451Z 2025-05-07T19:46:03.9848454Z 2025-05-07T19:46:03.9848597Z  2025-05-07T19:46:03.9848793Z 2025-05-07T19:46:03.9848797Z 2025-05-07T19:46:03.9848800Z 2025-05-07T19:46:03.9848804Z 2025-05-07T19:46:03.9848824Z 2025-05-07T19:46:03.9848828Z 2025-05-07T19:46:03.9848831Z 2025-05-07T19:46:03.9848834Z 2025-05-07T19:46:03.9848838Z 2025-05-07T19:46:03.9848841Z 2025-05-07T19:46:03.9848844Z 2025-05-07T19:46:03.9848848Z 2025-05-07T19:46:03.9848851Z 2025-05-07T19:46:03.9848854Z 2025-05-07T19:46:03.9849002Z  2025-05-07T19:46:03.9849205Z 2025-05-07T19:46:03.9849227Z 2025-05-07T19:46:03.9849230Z 2025-05-07T19:46:03.9849233Z 2025-05-07T19:46:03.9849237Z 2025-05-07T19:46:03.9849240Z 2025-05-07T19:46:03.9849243Z 2025-05-07T19:46:03.9849247Z 2025-05-07T19:46:03.9849253Z 2025-05-07T19:46:03.9849257Z 2025-05-07T19:46:03.9849260Z 2025-05-07T19:46:03.9849264Z 2025-05-07T19:46:03.9849270Z 2025-05-07T19:46:03.9849274Z 2025-05-07T19:46:03.9849277Z 2025-05-07T19:46:03.9849427Z  2025-05-07T19:46:03.9849651Z 2025-05-07T19:46:03.9849654Z 2025-05-07T19:46:03.9849659Z 2025-05-07T19:46:03.9849662Z 2025-05-07T19:46:03.9849666Z 2025-05-07T19:46:03.9849669Z 2025-05-07T19:46:03.9849672Z 2025-05-07T19:46:03.9849676Z 2025-05-07T19:46:03.9849679Z 2025-05-07T19:46:03.9849682Z 2025-05-07T19:46:03.9849686Z 2025-05-07T19:46:03.9849689Z 2025-05-07T19:46:03.9849693Z 2025-05-07T19:46:03.9849696Z 2025-05-07T19:46:03.9849699Z 2025-05-07T19:46:03.9849702Z 2025-05-07T19:46:03.9850010Z  2025-05-07T19:46:03.9850224Z 2025-05-07T19:46:03.9850228Z 2025-05-07T19:46:03.9850231Z 2025-05-07T19:46:03.9850235Z 2025-05-07T19:46:03.9850239Z 2025-05-07T19:46:03.9850242Z 2025-05-07T19:46:03.9850335Z 2025-05-07T19:46:03.9850338Z 2025-05-07T19:46:03.9850341Z 2025-05-07T19:46:03.9850345Z 2025-05-07T19:46:03.9850351Z 2025-05-07T19:46:03.9850355Z 2025-05-07T19:46:03.9850358Z 2025-05-07T19:46:03.9850361Z 2025-05-07T19:46:03.9850365Z 2025-05-07T19:46:03.9850385Z 2025-05-07T19:46:03.9850388Z 2025-05-07T19:46:03.9850550Z  2025-05-07T19:46:03.9850767Z 2025-05-07T19:46:03.9850770Z 2025-05-07T19:46:03.9850774Z 2025-05-07T19:46:03.9850777Z 2025-05-07T19:46:03.9850781Z 2025-05-07T19:46:03.9850784Z 2025-05-07T19:46:03.9850787Z 2025-05-07T19:46:03.9850791Z 2025-05-07T19:46:03.9850811Z 2025-05-07T19:46:03.9850815Z 2025-05-07T19:46:03.9850818Z 2025-05-07T19:46:03.9850822Z 2025-05-07T19:46:03.9850825Z 2025-05-07T19:46:03.9850828Z 2025-05-07T19:46:03.9850832Z 2025-05-07T19:46:03.9850835Z 2025-05-07T19:46:03.9850839Z 2025-05-07T19:46:03.9850842Z 2025-05-07T19:46:03.9851011Z  2025-05-07T19:46:03.9851236Z 2025-05-07T19:46:03.9851256Z 2025-05-07T19:46:03.9851351Z  2025-05-07T19:46:03.9851464Z 2025-05-07T19:46:03.9851468Z 2025-05-07T19:46:03.9851564Z  2025-05-07T19:46:03.9851691Z 2025-05-07T19:46:03.9851695Z 2025-05-07T19:46:03.9851698Z 2025-05-07T19:46:03.9851799Z  2025-05-07T19:46:03.9851908Z 2025-05-07T19:46:03.9851912Z 2025-05-07T19:46:03.9851916Z 2025-05-07T19:46:03.9851919Z 2025-05-07T19:46:03.9852042Z  2025-05-07T19:46:03.9852159Z 2025-05-07T19:46:03.9852162Z 2025-05-07T19:46:03.9852166Z 2025-05-07T19:46:03.9852169Z 2025-05-07T19:46:03.9852174Z 2025-05-07T19:46:03.9852295Z  2025-05-07T19:46:03.9852420Z 2025-05-07T19:46:03.9852424Z 2025-05-07T19:46:03.9852427Z 2025-05-07T19:46:03.9852430Z 2025-05-07T19:46:03.9852434Z 2025-05-07T19:46:03.9852437Z 2025-05-07T19:46:03.9852547Z  2025-05-07T19:46:03.9852693Z 2025-05-07T19:46:03.9852696Z 2025-05-07T19:46:03.9852699Z 2025-05-07T19:46:03.9852707Z 2025-05-07T19:46:03.9852711Z 2025-05-07T19:46:03.9852715Z 2025-05-07T19:46:03.9852718Z 2025-05-07T19:46:03.9854149Z  2025-05-07T19:46:03.9854300Z 2025-05-07T19:46:03.9854304Z 2025-05-07T19:46:03.9854325Z 2025-05-07T19:46:03.9854328Z 2025-05-07T19:46:03.9854332Z 2025-05-07T19:46:03.9854335Z 2025-05-07T19:46:03.9854339Z 2025-05-07T19:46:03.9854342Z 2025-05-07T19:46:03.9854466Z  2025-05-07T19:46:03.9854617Z 2025-05-07T19:46:03.9854621Z 2025-05-07T19:46:03.9854624Z 2025-05-07T19:46:03.9854628Z 2025-05-07T19:46:03.9854631Z 2025-05-07T19:46:03.9854651Z 2025-05-07T19:46:03.9854655Z 2025-05-07T19:46:03.9854658Z 2025-05-07T19:46:03.9854661Z 2025-05-07T19:46:03.9854781Z  2025-05-07T19:46:03.9854943Z 2025-05-07T19:46:03.9854947Z 2025-05-07T19:46:03.9854950Z 2025-05-07T19:46:03.9854953Z 2025-05-07T19:46:03.9854957Z 2025-05-07T19:46:03.9854960Z 2025-05-07T19:46:03.9854964Z 2025-05-07T19:46:03.9854984Z 2025-05-07T19:46:03.9854995Z 2025-05-07T19:46:03.9854998Z 2025-05-07T19:46:03.9855123Z  2025-05-07T19:46:03.9855296Z 2025-05-07T19:46:03.9855300Z 2025-05-07T19:46:03.9855304Z 2025-05-07T19:46:03.9855308Z 2025-05-07T19:46:03.9855311Z 2025-05-07T19:46:03.9855314Z 2025-05-07T19:46:03.9855318Z 2025-05-07T19:46:03.9855321Z 2025-05-07T19:46:03.9855342Z 2025-05-07T19:46:03.9855345Z 2025-05-07T19:46:03.9855349Z 2025-05-07T19:46:03.9855480Z  2025-05-07T19:46:03.9855662Z 2025-05-07T19:46:03.9855666Z 2025-05-07T19:46:03.9855669Z 2025-05-07T19:46:03.9855673Z 2025-05-07T19:46:03.9855676Z 2025-05-07T19:46:03.9855680Z 2025-05-07T19:46:03.9855684Z 2025-05-07T19:46:03.9855687Z 2025-05-07T19:46:03.9855710Z 2025-05-07T19:46:03.9855713Z 2025-05-07T19:46:03.9855717Z 2025-05-07T19:46:03.9855720Z 2025-05-07T19:46:03.9855851Z  2025-05-07T19:46:03.9856041Z 2025-05-07T19:46:03.9856044Z 2025-05-07T19:46:03.9856048Z 2025-05-07T19:46:03.9856123Z 2025-05-07T19:46:03.9856126Z 2025-05-07T19:46:03.9856129Z 2025-05-07T19:46:03.9856151Z 2025-05-07T19:46:03.9856158Z 2025-05-07T19:46:03.9856162Z 2025-05-07T19:46:03.9856166Z 2025-05-07T19:46:03.9856169Z 2025-05-07T19:46:03.9856173Z 2025-05-07T19:46:03.9856176Z 2025-05-07T19:46:03.9856431Z  2025-05-07T19:46:03.9856627Z 2025-05-07T19:46:03.9856631Z 2025-05-07T19:46:03.9856635Z 2025-05-07T19:46:03.9856638Z 2025-05-07T19:46:03.9856659Z 2025-05-07T19:46:03.9856663Z 2025-05-07T19:46:03.9856666Z 2025-05-07T19:46:03.9856670Z 2025-05-07T19:46:03.9856673Z 2025-05-07T19:46:03.9856677Z 2025-05-07T19:46:03.9856680Z 2025-05-07T19:46:03.9856683Z 2025-05-07T19:46:03.9856687Z 2025-05-07T19:46:03.9856690Z 2025-05-07T19:46:03.9856833Z  2025-05-07T19:46:03.9857026Z 2025-05-07T19:46:03.9857045Z 2025-05-07T19:46:03.9857049Z 2025-05-07T19:46:03.9857052Z 2025-05-07T19:46:03.9857055Z 2025-05-07T19:46:03.9857063Z 2025-05-07T19:46:03.9857066Z 2025-05-07T19:46:03.9857070Z 2025-05-07T19:46:03.9857073Z 2025-05-07T19:46:03.9857080Z 2025-05-07T19:46:03.9857083Z 2025-05-07T19:46:03.9857086Z 2025-05-07T19:46:03.9857090Z 2025-05-07T19:46:03.9857093Z 2025-05-07T19:46:03.9857096Z 2025-05-07T19:46:03.9857245Z  2025-05-07T19:46:03.9857460Z 2025-05-07T19:46:03.9857463Z 2025-05-07T19:46:03.9857467Z 2025-05-07T19:46:03.9857470Z 2025-05-07T19:46:03.9857473Z 2025-05-07T19:46:03.9857476Z 2025-05-07T19:46:03.9857480Z 2025-05-07T19:46:03.9857483Z 2025-05-07T19:46:03.9857487Z 2025-05-07T19:46:03.9857490Z 2025-05-07T19:46:03.9857493Z 2025-05-07T19:46:03.9857497Z 2025-05-07T19:46:03.9857500Z 2025-05-07T19:46:03.9857503Z 2025-05-07T19:46:03.9857507Z 2025-05-07T19:46:03.9857510Z 2025-05-07T19:46:03.9857678Z  2025-05-07T19:46:03.9857884Z 2025-05-07T19:46:03.9857887Z 2025-05-07T19:46:03.9857891Z 2025-05-07T19:46:03.9857898Z 2025-05-07T19:46:03.9857902Z 2025-05-07T19:46:03.9857905Z 2025-05-07T19:46:03.9857908Z 2025-05-07T19:46:03.9857973Z 2025-05-07T19:46:03.9857977Z 2025-05-07T19:46:03.9857980Z 2025-05-07T19:46:03.9857983Z 2025-05-07T19:46:03.9857988Z 2025-05-07T19:46:03.9857991Z 2025-05-07T19:46:03.9857995Z 2025-05-07T19:46:03.9857998Z 2025-05-07T19:46:03.9858018Z 2025-05-07T19:46:03.9858021Z 2025-05-07T19:46:03.9858177Z  2025-05-07T19:46:03.9858391Z 2025-05-07T19:46:03.9858394Z 2025-05-07T19:46:03.9858397Z 2025-05-07T19:46:03.9858401Z 2025-05-07T19:46:03.9858404Z 2025-05-07T19:46:03.9858407Z 2025-05-07T19:46:03.9858411Z 2025-05-07T19:46:03.9858414Z 2025-05-07T19:46:03.9858434Z 2025-05-07T19:46:03.9858437Z 2025-05-07T19:46:03.9858441Z 2025-05-07T19:46:03.9858444Z 2025-05-07T19:46:03.9858447Z 2025-05-07T19:46:03.9858450Z 2025-05-07T19:46:03.9858453Z 2025-05-07T19:46:03.9858456Z 2025-05-07T19:46:03.9858460Z 2025-05-07T19:46:03.9858463Z 2025-05-07T19:46:03.9858629Z  2025-05-07T19:46:03.9858846Z 2025-05-07T19:46:03.9858870Z 2025-05-07T19:46:03.9858964Z  2025-05-07T19:46:03.9859067Z 2025-05-07T19:46:03.9859070Z 2025-05-07T19:46:03.9859168Z  2025-05-07T19:46:03.9859293Z 2025-05-07T19:46:03.9859296Z 2025-05-07T19:46:03.9859300Z 2025-05-07T19:46:03.9859400Z  2025-05-07T19:46:03.9859510Z 2025-05-07T19:46:03.9859513Z 2025-05-07T19:46:03.9859516Z 2025-05-07T19:46:03.9859519Z 2025-05-07T19:46:03.9859639Z  2025-05-07T19:46:03.9859756Z 2025-05-07T19:46:03.9859759Z 2025-05-07T19:46:03.9859762Z 2025-05-07T19:46:03.9859766Z 2025-05-07T19:46:03.9859769Z 2025-05-07T19:46:03.9859893Z  2025-05-07T19:46:03.9860015Z 2025-05-07T19:46:03.9860019Z 2025-05-07T19:46:03.9860022Z 2025-05-07T19:46:03.9860025Z 2025-05-07T19:46:03.9860029Z 2025-05-07T19:46:03.9860032Z 2025-05-07T19:46:03.9860139Z  2025-05-07T19:46:03.9860285Z 2025-05-07T19:46:03.9860352Z 2025-05-07T19:46:03.9860356Z 2025-05-07T19:46:03.9860359Z 2025-05-07T19:46:03.9860362Z 2025-05-07T19:46:03.9860369Z 2025-05-07T19:46:03.9860372Z 2025-05-07T19:46:03.9860509Z  done 2025-05-07T19:46:04.1911276Z Preparing transaction: / - done 2025-05-07T19:46:04.9934680Z Verifying transaction: | / - \ | / - \ done 2025-05-07T19:46:05.2980849Z Executing transaction: / - \ done 2025-05-07T19:46:07.2885272Z [INSTALL] Fixing file placements for CUDA 12.6.3+ ... 2025-05-07T19:46:07.2885705Z [INSTALL] Creating symlinks: libnvToolsExt.so 2025-05-07T19:46:07.2886474Z + ln -sf /github/home/miniconda/envs/build_binary/lib/libnvToolsExt.so.1 /github/home/miniconda/envs/build_binary/lib/libnvToolsExt.so 2025-05-07T19:46:07.2887080Z 2025-05-07T19:46:07.2896393Z 2025-05-07T19:46:07.2897583Z + ln -sf /github/home/miniconda/envs/build_binary/targets/x86_64-linux/lib/libnvToolsExt.so.1 /github/home/miniconda/envs/build_binary/targets/x86_64-linux/lib/libnvToolsExt.so 2025-05-07T19:46:07.2898424Z 2025-05-07T19:46:07.2912632Z 2025-05-07T19:46:07.2912868Z [INSTALL] Copying nvtx3 headers ... 2025-05-07T19:46:07.2917241Z + cp -r /github/home/miniconda/envs/build_binary/nsight-compute-2024.3.2/host/target-linux-x64/nvtx/include/nvtx3/nvToolsExt.h /github/home/miniconda/envs/build_binary/nsight-compute-2024.3.2/host/target-linux-x64/nvtx/include/nvtx3/nvToolsExtCuda.h /github/home/miniconda/envs/build_binary/nsight-compute-2024.3.2/host/target-linux-x64/nvtx/include/nvtx3/nvToolsExtCudaRt.h /github/home/miniconda/envs/build_binary/nsight-compute-2024.3.2/host/target-linux-x64/nvtx/include/nvtx3/nvToolsExtOpenCL.h /github/home/miniconda/envs/build_binary/nsight-compute-2024.3.2/host/target-linux-x64/nvtx/include/nvtx3/nvToolsExtSync.h /github/home/miniconda/envs/build_binary/nsight-compute-2024.3.2/host/target-linux-x64/nvtx/include/nvtx3/nvtx3.hpp /github/home/miniconda/envs/build_binary/nsight-compute-2024.3.2/host/target-linux-x64/nvtx/include/nvtx3/nvtxDetail /github/home/miniconda/envs/build_binary/include/ 2025-05-07T19:46:07.2921574Z 2025-05-07T19:46:07.3130826Z 2025-05-07T19:46:07.3135542Z + cp -r /github/home/miniconda/envs/build_binary/nsight-compute-2024.3.2/host/target-linux-x64/nvtx/include/nvtx3/nvToolsExt.h /github/home/miniconda/envs/build_binary/nsight-compute-2024.3.2/host/target-linux-x64/nvtx/include/nvtx3/nvToolsExtCuda.h /github/home/miniconda/envs/build_binary/nsight-compute-2024.3.2/host/target-linux-x64/nvtx/include/nvtx3/nvToolsExtCudaRt.h /github/home/miniconda/envs/build_binary/nsight-compute-2024.3.2/host/target-linux-x64/nvtx/include/nvtx3/nvToolsExtOpenCL.h /github/home/miniconda/envs/build_binary/nsight-compute-2024.3.2/host/target-linux-x64/nvtx/include/nvtx3/nvToolsExtSync.h /github/home/miniconda/envs/build_binary/nsight-compute-2024.3.2/host/target-linux-x64/nvtx/include/nvtx3/nvtx3.hpp /github/home/miniconda/envs/build_binary/nsight-compute-2024.3.2/host/target-linux-x64/nvtx/include/nvtx3/nvtxDetail /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include/ 2025-05-07T19:46:07.3139817Z 2025-05-07T19:46:07.3150905Z 2025-05-07T19:46:07.3151202Z [INSTALL] Appending libcuda.so path to LD_LIBRARY_PATH ... 2025-05-07T19:46:07.3550128Z [ENV] Appending to LD_LIBRARY_PATH: /github/home/miniconda/envs/build_binary/targets/x86_64-linux/lib/stubs ... 2025-05-07T19:46:09.1657627Z ERROR conda.cli.main_run:execute(125): `conda run printenv LD_LIBRARY_PATH` failed. (See above for error) 2025-05-07T19:46:09.2330802Z + conda env config vars set -n build_binary LD_LIBRARY_PATH=/github/home/miniconda/envs/build_binary/targets/x86_64-linux/lib/stubs 2025-05-07T19:46:09.2331369Z 2025-05-07T19:46:09.6451969Z 2025-05-07T19:46:09.6455882Z [INSTALL] Setting environment variable NVML_LIB_PATH ... 2025-05-07T19:46:09.6835221Z + conda env config vars set -n build_binary NVML_LIB_PATH=/github/home/miniconda/envs/build_binary/lib/stubs/libnvidia-ml.so 2025-05-07T19:46:09.6835772Z 2025-05-07T19:46:10.1179348Z 2025-05-07T19:46:10.1179965Z [INSTALL] Setting environment variable CUDA_INCLUDE_DIRS ... 2025-05-07T19:46:10.1181422Z + conda env config vars set -n build_binary CUDA_INCLUDE_DIRS="/github/home/miniconda/envs/build_binary/include/:/github/home/miniconda/envs/build_binary/targets/x86_64-linux/include/" 2025-05-07T19:46:10.1182204Z 2025-05-07T19:46:10.5395065Z 2025-05-07T19:46:12.5171760Z [CHECK] cuda_runtime.h found in CONDA_PREFIX PATH (file): /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include/cuda_runtime.h 2025-05-07T19:46:14.4354481Z [CHECK] libcuda.so found in CONDA_PREFIX PATH (file): /github/home/miniconda/envs/build_binary/targets/x86_64-linux/lib/stubs/libcuda.so 2025-05-07T19:46:16.3419306Z [CHECK] libnvToolsExt.so found in CONDA_PREFIX PATH (symbolic link): /github/home/miniconda/envs/build_binary/lib/libnvToolsExt.so 2025-05-07T19:46:16.3420194Z /github/home/miniconda/envs/build_binary/targets/x86_64-linux/lib/libnvToolsExt.so 2025-05-07T19:46:18.3019952Z [CHECK] libnvidia-ml.so found in CONDA_PREFIX PATH (file): /github/home/miniconda/envs/build_binary/targets/x86_64-linux/lib/stubs/libnvidia-ml.so 2025-05-07T19:46:20.0878117Z /github/home/miniconda/envs/build_binary/bin/nvcc 2025-05-07T19:46:20.0878509Z 2025-05-07T19:46:20.1437237Z [CHECK] Binary nvcc found in PATH 2025-05-07T19:46:23.8148593Z /tmp/tmpbdlpvare: line 3: clang: command not found 2025-05-07T19:46:23.8148963Z 2025-05-07T19:46:23.8149339Z ERROR conda.cli.main_run:execute(125): `conda run clang --version` failed. (See above for error) 2025-05-07T19:46:23.8733221Z + ls -la /github/home/miniconda/envs/build_binary/etc/conda/activate.d 2025-05-07T19:46:23.8734250Z 2025-05-07T19:46:23.8751616Z total 56 2025-05-07T19:46:23.8752389Z drwxr-xr-x. 2 root root 16384 May 7 19:46 . 2025-05-07T19:46:23.8753448Z drwxr-xr-x. 5 root root 62 May 7 19:44 .. 2025-05-07T19:46:23.8754681Z -rw-r--r--. 2 root root 3778 Jun 10 2024 activate-binutils_linux-64.sh 2025-05-07T19:46:23.8756078Z -rw-r--r--. 2 root root 11630 Jun 10 2024 activate-gcc_linux-64.sh 2025-05-07T19:46:23.8757442Z -rw-r--r--. 2 root root 5190 Jun 10 2024 activate-gxx_linux-64.sh 2025-05-07T19:46:23.8758909Z -rw-r--r--. 2 root root 136 Mar 27 01:27 libglib_activate.sh 2025-05-07T19:46:23.8759371Z -rw-r--r--. 2 root root 872 May 7 16:10 libxml2_activate.sh 2025-05-07T19:46:23.8759905Z -rw-r--r--. 2 root root 499 Mar 28 22:35 openjdk_activate.sh 2025-05-07T19:46:23.8760341Z -rw-r--r--. 2 root root 2932 Nov 20 20:32 ~cuda-nvcc_activate.sh 2025-05-07T19:46:23.8760610Z 2025-05-07T19:46:23.8760847Z [INSTALL] Removing the -ccbin=CXX hook from NVCC activation scripts ... 2025-05-07T19:46:23.8761535Z + sed -i /-ccbin=/d /github/home/miniconda/envs/build_binary/etc/conda/activate.d/*cuda-nvcc_activate.sh 2025-05-07T19:46:23.8761994Z 2025-05-07T19:46:23.8770537Z 2025-05-07T19:46:23.8771222Z + conda run -n build_binary c++ --version | grep -i clang 2025-05-07T19:46:23.8772030Z 2025-05-07T19:46:25.7531282Z 2025-05-07T19:46:25.7532503Z [BUILD] Setting prepend flags for NVCC ... 2025-05-07T19:46:25.7534138Z + conda env config vars set -n build_binary NVCC_PREPEND_FLAGS="-allow-unsupported-compiler" 2025-05-07T19:46:25.7535354Z 2025-05-07T19:46:26.1675403Z 2025-05-07T19:46:26.1676151Z + conda run -n build_binary printenv NVCC_PREPEND_FLAGS 2025-05-07T19:46:26.1676974Z 2025-05-07T19:46:27.9663606Z -allow-unsupported-compiler 2025-05-07T19:46:27.9664279Z 2025-05-07T19:46:28.0239920Z 2025-05-07T19:46:28.0240744Z [INFO] Printing out all preprocessor defines in nvcc ... 2025-05-07T19:46:28.0242315Z + conda run -n build_binary nvcc --compiler-options -dM -E -x cu - < /dev/null 2025-05-07T19:46:28.0243341Z 2025-05-07T19:46:29.8941597Z #define _GLIBCXX_DEPRECATED_SUGGEST(ALT) __attribute__ ((__deprecated__ ("use '" ALT "' instead"))) 2025-05-07T19:46:29.8942204Z #define M_PIl 3.141592653589793238462643383279502884L 2025-05-07T19:46:29.8942590Z #define _IO_CURRENTLY_PUTTING 0x800 2025-05-07T19:46:29.8942936Z #define __W_EXITCODE(ret,sig) ((ret) << 8 | (sig)) 2025-05-07T19:46:29.8943355Z #define __DBL_MIN_EXP__ (-1021) 2025-05-07T19:46:29.8943923Z #define _STL_PAIR_H 1 2025-05-07T19:46:29.8944181Z #define __cpp_attributes 200809L 2025-05-07T19:46:29.8944541Z #define __cpp_nontype_template_parameter_auto 201606L 2025-05-07T19:46:29.8944890Z #define __DELETE_THROW throw() 2025-05-07T19:46:29.8945167Z #define _PTRDIFF_T_ 2025-05-07T19:46:29.8945415Z #define M_PI_4 0.78539816339744830962 2025-05-07T19:46:29.8945706Z #define __UINT_LEAST16_MAX__ 0xffff 2025-05-07T19:46:29.8945998Z #define _IO_LEFT 02 2025-05-07T19:46:29.8946228Z #define __ATOMIC_ACQUIRE 2 2025-05-07T19:46:29.8946499Z #define _POSIX2_BC_SCALE_MAX 99 2025-05-07T19:46:29.8946770Z #define _GLIBCXX_USE_RANDOM_TR1 1 2025-05-07T19:46:29.8947222Z #define _GLIBCXX_MOVE_BACKWARD3(_Tp,_Up,_Vp) std::move_backward(_Tp, _Up, _Vp) 2025-05-07T19:46:29.8947660Z #define __FLT128_MAX_10_EXP__ 4932 2025-05-07T19:46:29.8947948Z #define RE_DUP_MAX (0x7fff) 2025-05-07T19:46:29.8948233Z #define _IOS_OUTPUT 2 2025-05-07T19:46:29.8948545Z #define __FLT_MIN__ 1.17549435082228750796873653722224568e-38F 2025-05-07T19:46:29.8948983Z #define toascii_l(c,l) __toascii_l ((c), (l)) 2025-05-07T19:46:29.8949333Z #define __GCC_IEC_559_COMPLEX 2 2025-05-07T19:46:29.8949648Z #define _GLIBCXX_USE_FCHMOD 1 2025-05-07T19:46:29.8949953Z #define __cpp_aggregate_nsdmi 201304L 2025-05-07T19:46:29.8950832Z #define __bswap_16(x) (__extension__ ({ unsigned short int __v, __x = (unsigned short int) (x); if (__builtin_constant_p (__x)) __v = __bswap_constant_16 (__x); else __asm__ ("rorw $8, %w0" : "=r" (__v) : "0" (__x) : "cc"); __v; })) 2025-05-07T19:46:29.8951738Z #define __UINT_LEAST8_TYPE__ unsigned char 2025-05-07T19:46:29.8952074Z #define __SIZEOF_FLOAT80__ 16 2025-05-07T19:46:29.8952436Z #define cudaTextureTypeCubemapLayered 0xFC 2025-05-07T19:46:29.8952777Z #define _T_WCHAR_ 2025-05-07T19:46:29.8953048Z #define stdout stdout 2025-05-07T19:46:29.8953410Z #define _GLIBCXX_ABI_TAG_CXX11 __attribute ((__abi_tag__ ("cxx11"))) 2025-05-07T19:46:29.8953857Z #define CHAR_BIT __CHAR_BIT__ 2025-05-07T19:46:29.8954166Z #define __flexarr [] 2025-05-07T19:46:29.8954427Z #define _GLIBCXX_HAVE_FINITEF 1 2025-05-07T19:46:29.8954933Z #define __islower_l(c,l) __isctype_l((c), _ISlower, (l)) 2025-05-07T19:46:29.8955320Z #define _IO_FLAGS2_USER_WBUF 8 2025-05-07T19:46:29.8955621Z #define _MATH_H 1 2025-05-07T19:46:29.8955903Z #define cudaOccupancyDisableCachingOverride 0x01 2025-05-07T19:46:29.8956291Z #define __S64_TYPE long int 2025-05-07T19:46:29.8956559Z #define __stub_fchflags 2025-05-07T19:46:29.8956869Z #define cudaDeviceScheduleMask 0x07 2025-05-07T19:46:29.8957181Z #define __SQUAD_TYPE long int 2025-05-07T19:46:29.8957488Z #define __INTMAX_C(c) c ## L 2025-05-07T19:46:29.8957790Z #define _BSD_SIZE_T_DEFINED_ 2025-05-07T19:46:29.8958065Z #define NL_NMAX INT_MAX 2025-05-07T19:46:29.8958345Z #define _BITS_TIME_H 1 2025-05-07T19:46:29.8958644Z #define M_LN10l 2.302585092994045684017991454684364208L 2025-05-07T19:46:29.8959038Z #define _GLIBCXX_TXN_SAFE_DYN 2025-05-07T19:46:29.8959368Z #define cudaStreamTailLaunch ((cudaStream_t)0x3) 2025-05-07T19:46:29.8959888Z #define M_El 2.718281828459045235360287471352662498L 2025-05-07T19:46:29.8960318Z #define _PSTL_PRAGMA_DECLARE_SIMD _PSTL_PRAGMA(omp declare simd) 2025-05-07T19:46:29.8960742Z #define __CHAR_BIT__ 8 2025-05-07T19:46:29.8961054Z #define __FSWORD_T_TYPE __SYSCALL_SLONG_TYPE 2025-05-07T19:46:29.8961397Z #define _PSTL_STRING_CONCAT(x,y) x #y 2025-05-07T19:46:29.8961742Z #define _GLIBCXX98_USE_C99_MATH 1 2025-05-07T19:46:29.8962037Z #define FP_NAN 0 2025-05-07T19:46:29.8962352Z #define makedev(maj,min) gnu_dev_makedev (maj, min) 2025-05-07T19:46:29.8962832Z #define __glibcxx_requires_sorted_set_pred(_First1,_Last1,_First2,_Pred) 2025-05-07T19:46:29.8963400Z #define cudaGetDeviceProperties cudaGetDeviceProperties_v2 2025-05-07T19:46:29.8963819Z #define __cudaCDP2GetErrorString 2025-05-07T19:46:29.8964166Z #define SHRT_MAX __SHRT_MAX__ 2025-05-07T19:46:29.8964481Z #define _GLIBCXX_X86_RDSEED 1 2025-05-07T19:46:29.8964762Z #define __SM_80_RT_H__ 2025-05-07T19:46:29.8965110Z #define _NEW 2025-05-07T19:46:29.8965365Z #define CLOCK_PROCESS_CPUTIME_ID 2 2025-05-07T19:46:29.8965711Z #define __UINT8_MAX__ 0xff 2025-05-07T19:46:29.8966113Z #define _PSTL_ASSERT_MSG(_Condition,_Message) __glibcxx_assert(_Condition) 2025-05-07T19:46:29.8966587Z #define __SCHAR_WIDTH__ 8 2025-05-07T19:46:29.8966847Z #define __USE_ANSI 1 2025-05-07T19:46:29.8967220Z #define _IO_BE(expr,res) __builtin_expect ((expr), res) 2025-05-07T19:46:29.8967651Z #define __isupper_l(c,l) __isctype_l((c), _ISupper, (l)) 2025-05-07T19:46:29.8968079Z #define __cudaCDP2Memcpy2DAsync_ptsz 2025-05-07T19:46:29.8968420Z #define __WINT_MAX__ 0xffffffffU 2025-05-07T19:46:29.8968727Z #define __SIZEOF_PTHREAD_ATTR_T 56 2025-05-07T19:46:29.8969068Z #define __FLT32_MIN_EXP__ (-125) 2025-05-07T19:46:29.8969362Z #define _GLIBCXX_END_NAMESPACE_LDBL 2025-05-07T19:46:29.8969695Z #define PIPE_BUF 4096 2025-05-07T19:46:29.8970033Z #define _PSTL_PRAGMA_SIMD_ORDERED_MONOTONIC_2ARGS(PRM1,PRM2) 2025-05-07T19:46:29.8970552Z #define ADJ_TICK 0x4000 2025-05-07T19:46:29.8970850Z #define _PSTL_VERSION_PATCH (_PSTL_VERSION % 10) 2025-05-07T19:46:29.8971309Z #define MQ_PRIO_MAX 32768 2025-05-07T19:46:29.8971586Z #define __SIZEOF_PTHREAD_MUTEXATTR_T 4 2025-05-07T19:46:29.8971956Z #define __WAIT_INT(status) (*(int *) &(status)) 2025-05-07T19:46:29.8972477Z #define __GLIBC_PREREQ(maj,min) ((__GLIBC__ << 16) + __GLIBC_MINOR__ >= ((maj) << 16) + (min)) 2025-05-07T19:46:29.8973042Z #define cudaCooperativeLaunchMultiDeviceNoPreSync 0x01 2025-05-07T19:46:29.8973461Z #define _XOPEN_SOURCE 700 2025-05-07T19:46:29.8973732Z #define _POSIX2_BC_DIM_MAX 2048 2025-05-07T19:46:29.8974052Z #define __VECTOR_FUNCTIONS_HPP__ 2025-05-07T19:46:29.8974359Z #define __cpp_static_assert 201411L 2025-05-07T19:46:29.8974713Z #define __WEXITSTATUS(status) (((status) & 0xff00) >> 8) 2025-05-07T19:46:29.8975062Z #define _GLIBCXX_HAVE_STRXFRM_L 1 2025-05-07T19:46:29.8975356Z #define _POSIX_TTY_NAME_MAX 9 2025-05-07T19:46:29.8975649Z #define _GLIBCXX_USE_WEAK_REF __GXX_WEAK__ 2025-05-07T19:46:29.8975960Z #define __OFF_T_MATCHES_OFF64_T 1 2025-05-07T19:46:29.8976332Z #define __ORDER_LITTLE_ENDIAN__ 1234 2025-05-07T19:46:29.8976637Z #define __SIZE_MAX__ 0xffffffffffffffffUL 2025-05-07T19:46:29.8977015Z #define __ispunct_l(c,l) __isctype_l((c), _ISpunct, (l)) 2025-05-07T19:46:29.8977364Z #define __WCHAR_MAX__ 0x7fffffff 2025-05-07T19:46:29.8977663Z #define _GLIBCXX_USE_CLOCK_MONOTONIC 1 2025-05-07T19:46:29.8978002Z #define __BLKCNT_T_TYPE __SYSCALL_SLONG_TYPE 2025-05-07T19:46:29.8978417Z #define __isprint_l(c,l) __isctype_l((c), _ISprint, (l)) 2025-05-07T19:46:29.8978804Z #define cudaNvSciSyncAttrSignal 0x1 2025-05-07T19:46:29.8979101Z #define _GLIBCXX_USE_LONG_LONG 1 2025-05-07T19:46:29.8979421Z #define __GCC_HAVE_SYNC_COMPARE_AND_SWAP_1 1 2025-05-07T19:46:29.8979747Z #define __GCC_HAVE_SYNC_COMPARE_AND_SWAP_2 1 2025-05-07T19:46:29.8980086Z #define __GCC_HAVE_SYNC_COMPARE_AND_SWAP_4 1 2025-05-07T19:46:29.8980501Z #define __DBL_DENORM_MIN__ double(4.94065645841246544176568792868221372e-324L) 2025-05-07T19:46:29.8980941Z #define __GCC_HAVE_SYNC_COMPARE_AND_SWAP_8 1 2025-05-07T19:46:29.8981250Z #define ADJ_ESTERROR 0x0008 2025-05-07T19:46:29.8981528Z #define __GCC_ATOMIC_CHAR_LOCK_FREE 2 2025-05-07T19:46:29.8981821Z #define __GCC_IEC_559 2 2025-05-07T19:46:29.8982113Z #define __cpp_lib_transformation_trait_aliases 201304 2025-05-07T19:46:29.8982470Z #define _IO_flockfile(_fp) 2025-05-07T19:46:29.8982728Z #define CLOCK_MONOTONIC_RAW 4 2025-05-07T19:46:29.8983009Z #define __FLT32X_DECIMAL_DIG__ 17 2025-05-07T19:46:29.8983284Z #define _IOFBF 0 2025-05-07T19:46:29.8983528Z #define __USE_BSD 1 2025-05-07T19:46:29.8983767Z #define __FLT_EVAL_METHOD__ 0 2025-05-07T19:46:29.8984055Z #define SHRT_MIN (-SHRT_MAX - 1) 2025-05-07T19:46:29.8984328Z #define _IO_USER_LOCK 0x8000 2025-05-07T19:46:29.8984594Z #define _IO_NO_WRITES 8 2025-05-07T19:46:29.8984861Z #define _GLIBCXX_PSEUDO_VISIBILITY(V) 2025-05-07T19:46:29.8985575Z #define __ASMNAME2(prefix,cname) __STRING (prefix) cname 2025-05-07T19:46:29.8986025Z #define _GLIBCXX_HAVE_SYS_STAT_H 1 2025-05-07T19:46:29.8986328Z #define MB_CUR_MAX (__ctype_get_mb_cur_max ()) 2025-05-07T19:46:29.8986672Z #define __cpp_binary_literals 201304L 2025-05-07T19:46:29.8986965Z #define _CPP_TYPE_TRAITS_H 1 2025-05-07T19:46:29.8987246Z #define __BEGIN_NAMESPACE_C99 2025-05-07T19:46:29.8987511Z #define __FLT64_DECIMAL_DIG__ 17 2025-05-07T19:46:29.8987839Z #define _GLIBCXX_SYNCHRONIZATION_HAPPENS_AFTER(A) 2025-05-07T19:46:29.8988393Z #define _G_HAVE_ST_BLKSIZE defined (_STATBUF_ST_BLKSIZE) 2025-05-07T19:46:29.8988767Z #define __cpp_noexcept_function_type 201510L 2025-05-07T19:46:29.8989092Z #define M_PI 3.14159265358979323846 2025-05-07T19:46:29.8989399Z #define _GLIBCXX_PACKAGE_NAME "package-unused" 2025-05-07T19:46:29.8989742Z #define _GLIBCXX_HAVE_BUILTIN_IS_SAME 1 2025-05-07T19:46:29.8990050Z #define __GCC_ATOMIC_CHAR32_T_LOCK_FREE 2 2025-05-07T19:46:29.8990377Z #define _POSIX_DELAYTIMER_MAX 32 2025-05-07T19:46:29.8990654Z #define _GLIBCXX_USE_UTIME 1 2025-05-07T19:46:29.8990941Z #define _STL_ITERATOR_BASE_FUNCS_H 1 2025-05-07T19:46:29.8991565Z #define _IO_peekc_unlocked(_fp) (_IO_BE ((_fp)->_IO_read_ptr >= (_fp)->_IO_read_end, 0) && __underflow (_fp) == EOF ? EOF : *(unsigned char *) (_fp)->_IO_read_ptr) 2025-05-07T19:46:29.8992173Z #define _GLIBCXX_TR1_ELL_INTEGRAL_TCC 1 2025-05-07T19:46:29.8992513Z #define w_termsig __wait_terminated.__w_termsig 2025-05-07T19:46:29.8992842Z #define __FLOAT_WORD_ORDER __BYTE_ORDER 2025-05-07T19:46:29.8993156Z #define __cudaCDP2GetErrorName 2025-05-07T19:46:29.8993428Z #define XATTR_SIZE_MAX 65536 2025-05-07T19:46:29.8993703Z #define be64toh(x) __bswap_64 (x) 2025-05-07T19:46:29.8994023Z #define __ASSERT_VOID_CAST static_cast 2025-05-07T19:46:29.8994366Z #define __cpp_variadic_templates 200704L 2025-05-07T19:46:29.8994786Z #define RAND_MAX 2147483647 2025-05-07T19:46:29.8995043Z #define _GLIBCXX_USE_C99_COMPLEX_TR1 1 2025-05-07T19:46:29.8995381Z #define __UINT_FAST64_MAX__ 0xffffffffffffffffUL 2025-05-07T19:46:29.8995695Z #define __SM_90_RT_H__ 2025-05-07T19:46:29.8995949Z #define __SIG_ATOMIC_TYPE__ int 2025-05-07T19:46:29.8996283Z #define __COMPAR_FN_T 2025-05-07T19:46:29.8996534Z #define __GID_T_TYPE __U32_TYPE 2025-05-07T19:46:29.8996791Z #define _IO_BAD_SEEN 0x4000 2025-05-07T19:46:29.8997294Z #define _PSTL_PRAGMA_MESSAGE_IMPL(x) _PSTL_PRAGMA(message(_PSTL_STRING_CONCAT(_PSTL_PRAGMA_LOCATION, x))) 2025-05-07T19:46:29.8997822Z #define __DBL_MIN_10_EXP__ (-307) 2025-05-07T19:46:29.8998179Z #define __glibcxx_requires_sorted_pred(_First,_Last,_Pred) 2025-05-07T19:46:29.8998571Z #define __FINITE_MATH_ONLY__ 0 2025-05-07T19:46:29.8998865Z #define _PSTL_PRAGMA_SIMD_INCLUSIVE_SCAN(PRM) 2025-05-07T19:46:29.8999220Z #define cudaArrayColorAttachment 0x20 2025-05-07T19:46:29.8999530Z #define __cpp_variable_templates 201304L 2025-05-07T19:46:29.9000060Z #define cudaKernelNodeAttributeMemSyncDomainMap cudaLaunchAttributeMemSyncDomainMap 2025-05-07T19:46:29.9000615Z #define __cpp_lib_integral_constant_callable 201304 2025-05-07T19:46:29.9000964Z #define _GLIBCXX_HAVE_SINHF 1 2025-05-07T19:46:29.9001249Z #define MOD_TIMECONST ADJ_TIMECONST 2025-05-07T19:46:29.9001545Z #define __cpp_lib_result_of_sfinae 201210 2025-05-07T19:46:29.9001861Z #define __SM_30_INTRINSICS_H__ 2025-05-07T19:46:29.9002125Z #define __FLT32X_MAX_EXP__ 1024 2025-05-07T19:46:29.9002403Z #define _GLIBCXX_USE_WCHAR_T 1 2025-05-07T19:46:29.9002657Z #define _GLIBCXX_MATH_H 1 2025-05-07T19:46:29.9002909Z #define __u_char_defined 2025-05-07T19:46:29.9003213Z #define WIFEXITED(status) __WIFEXITED (__WAIT_INT (status)) 2025-05-07T19:46:29.9003584Z #define STA_PPSERROR 0x0800 2025-05-07T19:46:29.9003828Z #define _GLIBCXX_STD_A std 2025-05-07T19:46:29.9004083Z #define __FLT32_HAS_DENORM__ 1 2025-05-07T19:46:29.9004365Z #define _GLIBCXX_BEGIN_NAMESPACE_VERSION 2025-05-07T19:46:29.9004801Z #define __device_builtin_texture_type__ __location__(device_builtin_texture_type) 2025-05-07T19:46:29.9005242Z #define FP_INFINITE 1 2025-05-07T19:46:29.9005602Z #define _GLIBCXX11_DEPRECATED_SUGGEST(ALT) _GLIBCXX_DEPRECATED_SUGGEST(ALT) 2025-05-07T19:46:29.9006109Z #define _IO_pid_t __pid_t 2025-05-07T19:46:29.9006360Z #define __UINT_FAST8_MAX__ 0xff 2025-05-07T19:46:29.9006630Z #define __LEAF , __leaf__ 2025-05-07T19:46:29.9006867Z #define PATH_MAX 4096 2025-05-07T19:46:29.9026323Z #define __cpp_rvalue_reference 200610L 2025-05-07T19:46:29.9026922Z #define __LDBL_REDIR1(name,proto,alias) name proto 2025-05-07T19:46:29.9027273Z #define _LIMITS_H___ 2025-05-07T19:46:29.9027495Z #define __size_t 2025-05-07T19:46:29.9027738Z #define _GLIBCXX_HAVE_FREXPF 1 2025-05-07T19:46:29.9028294Z #define STA_RONLY (STA_PPSSIGNAL | STA_PPSJITTER | STA_PPSWANDER | STA_PPSERROR | STA_CLOCKERR | STA_NANO | STA_MODE | STA_CLK) 2025-05-07T19:46:29.9029303Z #define _GLIBCXX_HAVE_FREXPL 1 2025-05-07T19:46:29.9029681Z #define __cpp_nested_namespace_definitions 201411L 2025-05-07T19:46:29.9030065Z #define __DEC64_MAX_EXP__ 385 2025-05-07T19:46:29.9030348Z #define _WCHAR_T_DEFINED 2025-05-07T19:46:29.9030713Z #define __glibcxx_requires_can_decrement_range(_First1,_Last1,_First2) 2025-05-07T19:46:29.9031160Z #define MOD_STATUS ADJ_STATUS 2025-05-07T19:46:29.9031465Z #define _GLIBCXX_PURE __attribute__ ((__pure__)) 2025-05-07T19:46:29.9031814Z #define _GLIBCXX_HAVE_STDINT_H 1 2025-05-07T19:46:29.9032099Z #define __SIZEOF_PTHREAD_CONDATTR_T 4 2025-05-07T19:46:29.9032397Z #define __INT8_C(c) c 2025-05-07T19:46:29.9032652Z #define __cudaCDP2GetParameterBuffer 2025-05-07T19:46:29.9032974Z #define _GLIBCXX_HAVE_COSHF 1 2025-05-07T19:46:29.9033239Z #define _GLIBCXX_HAVE_COSHL 1 2025-05-07T19:46:29.9033512Z #define __SM_70_RT_HPP__ 2025-05-07T19:46:29.9033777Z #define __INT_LEAST8_WIDTH__ 8 2025-05-07T19:46:29.9034051Z #define __cpp_variadic_using 201611L 2025-05-07T19:46:29.9034393Z #define __UINT_LEAST64_MAX__ 0xffffffffffffffffUL 2025-05-07T19:46:29.9034731Z #define __INT_LEAST8_MAX__ 0x7f 2025-05-07T19:46:29.9035018Z #define __SM_61_INTRINSICS_HPP__ 2025-05-07T19:46:29.9035293Z #define _IO_FLAGS2_MMAP 1 2025-05-07T19:46:29.9035573Z #define __cpp_capture_star_this 201603L 2025-05-07T19:46:29.9035885Z #define __cudaCDP2LaunchDeviceV2_ptsz 2025-05-07T19:46:29.9036386Z #define _GLIBCXX_HAVE_ENDIAN_H 1 2025-05-07T19:46:29.9036757Z #define __always_inline __inline __attribute__ ((__always_inline__)) 2025-05-07T19:46:29.9037162Z #define NFDBITS __NFDBITS 2025-05-07T19:46:29.9037435Z #define _PSTL_PRAGMA_FORCEINLINE 2025-05-07T19:46:29.9037722Z #define _GLIBCXX_HAVE_SYS_STATVFS_H 1 2025-05-07T19:46:29.9038057Z #define __glibcxx_requires_sorted(_First,_Last) 2025-05-07T19:46:29.9038379Z #define __SHRT_MAX__ 0x7fff 2025-05-07T19:46:29.9038649Z #define _GLIBCXX_SYMVER_GNU 1 2025-05-07T19:46:29.9038939Z #define w_stopval __wait_stopped.__w_stopval 2025-05-07T19:46:29.9039259Z #define STA_UNSYNC 0x0040 2025-05-07T19:46:29.9039569Z #define __LDBL_MAX__ 1.18973149535723176502126385303097021e+4932L 2025-05-07T19:46:29.9040011Z #define _GLIBCXX_USE_C99_COMPLEX _GLIBCXX11_USE_C99_COMPLEX 2025-05-07T19:46:29.9040400Z #define __FLT64X_MAX_10_EXP__ 4932 2025-05-07T19:46:29.9040692Z #define __cpp_if_constexpr 201606L 2025-05-07T19:46:29.9041028Z #define __glibcxx_class_requires4(_a,_b,_c,_d,_e) 2025-05-07T19:46:29.9041410Z #define cudaStreamFireAndForget ((cudaStream_t)0x4) 2025-05-07T19:46:29.9041773Z #define _GLIBCXX_HAVE_WCHAR_H 1 2025-05-07T19:46:29.9042095Z #define _GLIBCXX_USE_C99_STDIO _GLIBCXX11_USE_C99_STDIO 2025-05-07T19:46:29.9042455Z #define __daddr_t_defined 2025-05-07T19:46:29.9042708Z #define __LDBL_IS_IEC_60559__ 2 2025-05-07T19:46:29.9042993Z #define _GLIBCXX_TR1_RIEMANN_ZETA_TCC 1 2025-05-07T19:46:29.9043324Z #define _GLIBCXX_HAVE_STRUCT_DIRENT_D_TYPE 1 2025-05-07T19:46:29.9043852Z #define _PSTL_CPP11_STD_ROTATE_BROKEN ((__GLIBCXX__ && __GLIBCXX__ < 20150716) || (_MSC_VER && _MSC_VER < 1800)) 2025-05-07T19:46:29.9044367Z #define _ACRTIMP 2025-05-07T19:46:29.9044588Z #define _IO_EOF_SEEN 0x10 2025-05-07T19:46:29.9044867Z #define _GLIBCXX_TR1_POLY_LAGUERRE_TCC 1 2025-05-07T19:46:29.9045160Z #define _IOS_BIN 128 2025-05-07T19:46:29.9045649Z #define __fortify_function __extern_always_inline __attribute_artificial__ 2025-05-07T19:46:29.9046086Z #define __FLT64X_HAS_QUIET_NAN__ 1 2025-05-07T19:46:29.9046375Z #define UNDERFLOW 4 2025-05-07T19:46:29.9046608Z #define NAME_MAX 255 2025-05-07T19:46:29.9046842Z #define SCHAR_MAX __SCHAR_MAX__ 2025-05-07T19:46:29.9047130Z #define __UINT_LEAST8_MAX__ 0xff 2025-05-07T19:46:29.9047407Z #define __GCC_ATOMIC_BOOL_LOCK_FREE 2 2025-05-07T19:46:29.9047716Z #define _IO_UNIFIED_JUMPTABLES 1 2025-05-07T19:46:29.9048103Z #define __FLT128_DENORM_MIN__ 6.47517511943802511092443895822764655e-4966F128 2025-05-07T19:46:29.9048522Z #define __ptr_t void * 2025-05-07T19:46:29.9048758Z #define M_E 2.7182818284590452354 2025-05-07T19:46:29.9049050Z #define cudaSurfaceType1D 0x01 2025-05-07T19:46:29.9049318Z #define __USE_ISOCXX11 1 2025-05-07T19:46:29.9049601Z #define __UINTMAX_TYPE__ long unsigned int 2025-05-07T19:46:29.9049937Z #define cudaDeviceBlockingSync 0x04 2025-05-07T19:46:29.9050317Z #define CLOCK_MONOTONIC_COARSE 6 2025-05-07T19:46:29.9050619Z #define _GLIBCXX_OS_DEFINES 1 2025-05-07T19:46:29.9050913Z #define _GLIBCXX_NODISCARD [[__nodiscard__]] 2025-05-07T19:46:29.9051246Z #define cudaSurfaceType2D 0x02 2025-05-07T19:46:29.9051506Z #define __linux 1 2025-05-07T19:46:29.9051746Z #define __DEC32_EPSILON__ 1E-6DF 2025-05-07T19:46:29.9052022Z #define cudaDeviceMask 0xff 2025-05-07T19:46:29.9052305Z #define _GLIBCXX_END_NAMESPACE_ALGO 2025-05-07T19:46:29.9052596Z #define __CUDA_API_VER_MAJOR__ 12 2025-05-07T19:46:29.9052893Z #define htobe16(x) __bswap_16 (x) 2025-05-07T19:46:29.9053193Z #define HUGE_VALF (__builtin_huge_valf()) 2025-05-07T19:46:29.9053504Z #define __FLT_EVAL_METHOD_TS_18661_3__ 0 2025-05-07T19:46:29.9053825Z #define HUGE_VALL (__builtin_huge_vall()) 2025-05-07T19:46:29.9054118Z #define _BITS_TYPES_H 1 2025-05-07T19:46:29.9054416Z #define ULONG_LONG_MAX (LONG_LONG_MAX * 2ULL + 1ULL) 2025-05-07T19:46:29.9054761Z #define _IO_cleanup_region_end(_Doit) 2025-05-07T19:46:29.9055081Z #define cudaSurfaceType3D 0x03 2025-05-07T19:46:29.9055361Z #define _GLIBCXX_HAVE_SYS_TIME_H 1 2025-05-07T19:46:29.9055744Z #define __cudaGet_blockIdx() blockIdx 2025-05-07T19:46:29.9056051Z #define _IO_DONT_CLOSE 0100000 2025-05-07T19:46:29.9056884Z #define __MATHDECLX(type,function,suffix,args,attrib) __MATHDECL_1(type, function,suffix, args) __attribute__ (attrib); __MATHDECL_1(type, __CONCAT(__,function),suffix, args) __attribute__ (attrib) 2025-05-07T19:46:29.9057768Z #define cudaHostRegisterDefault 0x00 2025-05-07T19:46:29.9058054Z #define __unix 1 2025-05-07T19:46:29.9058284Z #define MATH_ERRNO 1 2025-05-07T19:46:29.9058523Z #define _GLIBCXX_STDIO_SEEK_END 2 2025-05-07T19:46:29.9058820Z #define _GLIBCXX_USE_FCHMODAT 1 2025-05-07T19:46:29.9059091Z #define __UINT32_MAX__ 0xffffffffU 2025-05-07T19:46:29.9059394Z #define __GXX_EXPERIMENTAL_CXX0X__ 1 2025-05-07T19:46:29.9059693Z #define __UID_T_TYPE __U32_TYPE 2025-05-07T19:46:29.9059975Z #define _GLIBCXX_HAVE_ATOMIC_LOCK_POLICY 1 2025-05-07T19:46:29.9060466Z #define __CUDART_API_VERSION ((__CUDA_API_VER_MAJOR__ * 1000) + (__CUDA_API_VER_MINOR__ * 10)) 2025-05-07T19:46:29.9060959Z #define __nv_pure__ __location__(nv_pure) 2025-05-07T19:46:29.9061276Z #define CUDARTAPI_CDECL 2025-05-07T19:46:29.9061531Z #define _PSTL_USAGE_WARNINGS 0 2025-05-07T19:46:29.9061821Z #define _GLIBCXX98_USE_C99_COMPLEX 1 2025-05-07T19:46:29.9062107Z #define __cpp_lib_void_t 201411 2025-05-07T19:46:29.9062386Z #define _POSIX_AIO_MAX 1 2025-05-07T19:46:29.9062619Z #define __SIZE_T 2025-05-07T19:46:29.9062981Z #define isgraph_l(c,l) __isgraph_l ((c), (l)) 2025-05-07T19:46:29.9063297Z #define _GLIBCXX_FULLY_DYNAMIC_STRING 0 2025-05-07T19:46:29.9063576Z #define _POSIX_PIPE_BUF 512 2025-05-07T19:46:29.9063836Z #define _GLIBCXX_HAVE_STRTOLD 1 2025-05-07T19:46:29.9064081Z #define _ATFILE_SOURCE 1 2025-05-07T19:46:29.9064460Z #define __glibcxx_assert(cond) do { __glibcxx_constexpr_assert(cond); } while (false) 2025-05-07T19:46:29.9064882Z #define __WAIT_STATUS void * 2025-05-07T19:46:29.9065143Z #define __MATH_FUNCTIONS_H__ 2025-05-07T19:46:29.9065626Z #define _GLIBCXX_HAVE_WCSTOF 1 2025-05-07T19:46:29.9065909Z #define __FLT128_MIN_EXP__ (-16381) 2025-05-07T19:46:29.9066205Z #define _GLIBCXX_HAVE_LC_MESSAGES 1 2025-05-07T19:46:29.9066534Z #define __WINT_MIN__ 0U 2025-05-07T19:46:29.9067310Z #define _PSTL_CPP14_VARIABLE_TEMPLATES_PRESENT (!__INTEL_COMPILER || __INTEL_COMPILER >= 1700) && (_MSC_FULL_VER >= 190023918 || __cplusplus >= 201402L) 2025-05-07T19:46:29.9067997Z #define isdigit_l(c,l) __isdigit_l ((c), (l)) 2025-05-07T19:46:29.9068301Z #define WUNTRACED 2 2025-05-07T19:46:29.9068542Z #define _GLIBCXX_HAVE_SQRTF 1 2025-05-07T19:46:29.9068816Z #define __SIZEOF_PTHREAD_RWLOCKATTR_T 8 2025-05-07T19:46:29.9069117Z #define NZERO 20 2025-05-07T19:46:29.9069338Z #define _GLIBCXX_HAVE_MEMALIGN 1 2025-05-07T19:46:29.9069628Z #define _PSTL_PRAGMA(x) _Pragma(#x) 2025-05-07T19:46:29.9069921Z #define MOD_CLKA ADJ_OFFSET_SINGLESHOT 2025-05-07T19:46:29.9070221Z #define MOD_CLKB ADJ_TICK 2025-05-07T19:46:29.9070471Z #define __FLT128_MIN_10_EXP__ (-4931) 2025-05-07T19:46:29.9070773Z #define __FLT32X_IS_IEC_60559__ 2 2025-05-07T19:46:29.9071064Z #define __DEVICE_FUNCTIONS_H__ 2025-05-07T19:46:29.9071337Z #define SCHAR_MIN (-SCHAR_MAX - 1) 2025-05-07T19:46:29.9071626Z #define EXIT_FAILURE 1 2025-05-07T19:46:29.9071859Z #define ADJ_MAXERROR 0x0004 2025-05-07T19:46:29.9072128Z #define __INT_LEAST16_WIDTH__ 16 2025-05-07T19:46:29.9072391Z #define _SIZE_T_DEFINED_ 2025-05-07T19:46:29.9072653Z #define _POSIX_AIO_LISTIO_MAX 2 2025-05-07T19:46:29.9072930Z #define __cudaCDP2DeviceGetLimit 2025-05-07T19:46:29.9073284Z #define __LDBL_REDIR_NTH(name,proto) name proto __THROW 2025-05-07T19:46:29.9073651Z #define __cudaCDP2FuncGetAttributes 2025-05-07T19:46:29.9073954Z #define __SCHAR_MAX__ 0x7f 2025-05-07T19:46:29.9074215Z #define __FLT128_MANT_DIG__ 113 2025-05-07T19:46:29.9074481Z #define __USING_NAMESPACE_STD(name) 2025-05-07T19:46:29.9074787Z #define _GLIBCXX_HAVE_OBSOLETE_ISINF 1 2025-05-07T19:46:29.9075093Z #define __WCHAR_MIN__ (-__WCHAR_MAX__ - 1) 2025-05-07T19:46:29.9075403Z #define SEEK_DATA 3 2025-05-07T19:46:29.9075627Z #define __KERNEL_STRICT_NAMES 2025-05-07T19:46:29.9076026Z #define _IO_stderr ((_IO_FILE*)(&_IO_2_1_stderr_)) 2025-05-07T19:46:29.9076478Z #define _IO_ferror_unlocked(__fp) (((__fp)->_flags & _IO_ERR_SEEN) != 0) 2025-05-07T19:46:29.9076896Z #define _FUNCTEXCEPT_H 1 2025-05-07T19:46:29.9077158Z #define __INT64_C(c) c ## L 2025-05-07T19:46:29.9077426Z #define __NTH(fct) __LEAF_ATTR fct throw () 2025-05-07T19:46:29.9077774Z #define _GLIBCXX_CONST __attribute__ ((__const__)) 2025-05-07T19:46:29.9078104Z #define _GLIBCXX_HAVE_LINK 1 2025-05-07T19:46:29.9078397Z #define cudaNvSciSyncAttrWait 0x2 2025-05-07T19:46:29.9078697Z #define __GCC_ATOMIC_POINTER_LOCK_FREE 2 2025-05-07T19:46:29.9079016Z #define STA_PPSWANDER 0x0400 2025-05-07T19:46:29.9079273Z #define __INT_WCHAR_T_H 2025-05-07T19:46:29.9079522Z #define WSTOPPED 2 2025-05-07T19:46:29.9079755Z #define _POSIX_THREAD_THREADS_MAX 64 2025-05-07T19:46:29.9080052Z #define _POSIX_MQ_OPEN_MAX 8 2025-05-07T19:46:29.9080320Z #define FP_NORMAL 4 2025-05-07T19:46:29.9080560Z #define __cudaCDP2LaunchDevice_ptsz 2025-05-07T19:46:29.9080856Z #define _BITS_TIMEX_H 1 2025-05-07T19:46:29.9081092Z #define _POSIX_LINK_MAX 8 2025-05-07T19:46:29.9081360Z #define _GLIBCXX_HAVE_LIMIT_FSIZE 1 2025-05-07T19:46:29.9081642Z #define _GLIBCXX_HAVE_ATAN2F 1 2025-05-07T19:46:29.9081925Z #define cudaTextureType1D 0x01 2025-05-07T19:46:29.9082192Z #define _GLIBCXX_HAVE_ATAN2L 1 2025-05-07T19:46:29.9082468Z #define COLL_WEIGHTS_MAX 255 2025-05-07T19:46:29.9082735Z #define __isascii(c) (((c) & ~0x7f) == 0) 2025-05-07T19:46:29.9083050Z #define __toascii(c) ((c) & 0x7f) 2025-05-07T19:46:29.9083502Z #define __attribute_format_strfmon__(a,b) __attribute__ ((__format__ (__strfmon__, a, b))) 2025-05-07T19:46:29.9083967Z #define _IO_MAGIC 0xFBAD0000 2025-05-07T19:46:29.9084243Z #define _GLIBCXX_USE_SENDFILE 1 2025-05-07T19:46:29.9084502Z #define _POSIX_SOURCE 1 2025-05-07T19:46:29.9084761Z #define cudaTextureType2D 0x02 2025-05-07T19:46:29.9085098Z #define _PTR_TRAITS_H 1 2025-05-07T19:46:29.9085390Z #define _GLIBCXX_NOEXCEPT_QUAL noexcept (_NE) 2025-05-07T19:46:29.9085711Z #define _GLIBCXX_HAVE_POWF 1 2025-05-07T19:46:29.9086144Z #define _POSIX2_BC_STRING_MAX 1000 2025-05-07T19:46:29.9086466Z #define __attribute_used__ __attribute__ ((__used__)) 2025-05-07T19:46:29.9086828Z #define cudaTextureType3D 0x03 2025-05-07T19:46:29.9087217Z #define _STDIO_USES_IOSTREAM 2025-05-07T19:46:29.9087470Z #define CLOCK_REALTIME 0 2025-05-07T19:46:29.9087827Z #define __FLT32X_MANT_DIG__ 53 2025-05-07T19:46:29.9088080Z #define __GCC_ATOMIC_CHAR16_T_LOCK_FREE 2 2025-05-07T19:46:29.9088379Z #define __cpp_aligned_new 201606L 2025-05-07T19:46:29.9088640Z #define __USER_LABEL_PREFIX__ 2025-05-07T19:46:29.9088915Z #define cudaEventBlockingSync 0x01 2025-05-07T19:46:29.9089184Z #define _GLIBCXX_HAVE_TANL 1 2025-05-07T19:46:29.9089451Z #define _GLIBCXX_USE_PTHREAD_RWLOCK_T 1 2025-05-07T19:46:29.9089732Z #define _GLIBCXX_HAVE_LINUX_RANDOM_H 1 2025-05-07T19:46:29.9090023Z #define _GLIBCXX_USE_C99_FENV_TR1 1 2025-05-07T19:46:29.9090372Z #define __FLT32_MAX_10_EXP__ 38 2025-05-07T19:46:29.9090770Z #define __GLIBC__ 2 2025-05-07T19:46:29.9090999Z #define __END_DECLS } 2025-05-07T19:46:29.9091232Z #define FP_ILOGB0 (-2147483647 - 1) 2025-05-07T19:46:29.9091615Z #define __FLT64X_EPSILON__ 1.08420217248550443400745280086994171e-19F64x 2025-05-07T19:46:29.9092004Z #define __CONCAT(x,y) x ## y 2025-05-07T19:46:29.9092267Z #define WCONTINUED 8 2025-05-07T19:46:29.9092497Z #define __STDC_HOSTED__ 1 2025-05-07T19:46:29.9092762Z #define _GLIBCXX_HAVE_ARPA_INET_H 1 2025-05-07T19:46:29.9093031Z #define _ALLOCA_H 1 2025-05-07T19:46:29.9093270Z #define __host__ __location__(host) 2025-05-07T19:46:29.9093714Z #define __warndecl(name,msg) extern void name (void) __attribute__((__warning__ (msg))) 2025-05-07T19:46:29.9094171Z #define __SLONG32_TYPE int 2025-05-07T19:46:29.9094448Z #define _GLIBCXX_DEBUG_ASSERTIONS_H 1 2025-05-07T19:46:29.9094733Z #define _SYS_SELECT_H 1 2025-05-07T19:46:29.9094982Z #define _IO_LINE_BUF 0x200 2025-05-07T19:46:29.9095224Z #define _IOS_NOCREATE 32 2025-05-07T19:46:29.9097339Z #define __DEC64_MIN_EXP__ (-382) 2025-05-07T19:46:29.9097686Z #define __cudaGet_warpSize() warpSize 2025-05-07T19:46:29.9097994Z #define __SSIZE_T_TYPE __SWORD_TYPE 2025-05-07T19:46:29.9098294Z #define _GLIBCXX_HAVE_LIMIT_VMEM 0 2025-05-07T19:46:29.9098585Z #define __global__ __location__(global) 2025-05-07T19:46:29.9098891Z #define __GNU_LIBRARY__ 6 2025-05-07T19:46:29.9099146Z #define __cpp_decltype_auto 201304L 2025-05-07T19:46:29.9099433Z #define __DBL_DIG__ 15 2025-05-07T19:46:29.9099659Z #define TIME_UTC 1 2025-05-07T19:46:29.9099886Z #define __FLT32_DIG__ 6 2025-05-07T19:46:29.9100211Z #define __forceinline__ __inline__ __attribute__((always_inline)) 2025-05-07T19:46:29.9100631Z #define cudaHostAllocWriteCombined 0x04 2025-05-07T19:46:29.9100950Z #define cudaDeviceScheduleAuto 0x00 2025-05-07T19:46:29.9101274Z #define iscntrl_l(c,l) __iscntrl_l ((c), (l)) 2025-05-07T19:46:29.9101595Z #define _G_BUFSIZ 8192 2025-05-07T19:46:29.9101902Z #define __FLT_EPSILON__ 1.19209289550781250000000000000000000e-7F 2025-05-07T19:46:29.9102295Z #define cudaTextureTypeCubemap 0x0C 2025-05-07T19:46:29.9102593Z #define __cudaCDP2GetDevice 2025-05-07T19:46:29.9102882Z #define __cudaCDP2PeekAtLastError 2025-05-07T19:46:29.9103167Z #define STA_CLOCKERR 0x1000 2025-05-07T19:46:29.9103427Z #define __GXX_WEAK__ 1 2025-05-07T19:46:29.9103678Z #define __RLIM_T_TYPE __SYSCALL_ULONG_TYPE 2025-05-07T19:46:29.9103998Z #define _GLIBCXX_HAVE_ISNANF 1 2025-05-07T19:46:29.9104255Z #define __SHRT_WIDTH__ 16 2025-05-07T19:46:29.9104560Z #define __cpp_lib_robust_nonmodifying_seq_ops 201304 2025-05-07T19:46:29.9104915Z #define _GLIBCXX_BITS_SPECFUN_H 1 2025-05-07T19:46:29.9105193Z #define _GLIBCXX_HAVE_ISNANL 1 2025-05-07T19:46:29.9105493Z #define isblank_l(c,l) __isblank_l ((c), (l)) 2025-05-07T19:46:29.9105789Z #define _G_config_h 1 2025-05-07T19:46:29.9106077Z #define M_LOG2El 1.442695040888963407359924681001892137L 2025-05-07T19:46:29.9106495Z #define ADJ_OFFSET_SINGLESHOT 0x8001 2025-05-07T19:46:29.9106790Z #define _GCC_WCHAR_T 2025-05-07T19:46:29.9107013Z #define TMP_MAX 238328 2025-05-07T19:46:29.9107261Z #define __FLT32_IS_IEC_60559__ 2 2025-05-07T19:46:29.9107539Z #define __DEVICE_TYPES_H__ 2025-05-07T19:46:29.9107796Z #define __DEV_T_TYPE __UQUAD_TYPE 2025-05-07T19:46:29.9108081Z #define _EXT_NUMERIC_TRAITS 1 2025-05-07T19:46:29.9108353Z #define _GLIBCXX_BEGIN_NAMESPACE_ALGO 2025-05-07T19:46:29.9108649Z #define _IO_SKIPWS 01 2025-05-07T19:46:29.9109063Z #define cudaStreamGraphFireAndForgetAsSibling (cudaStream_t)0x0300000000000000 2025-05-07T19:46:29.9109553Z #define _IO_SCIENTIFIC 04000 2025-05-07T19:46:29.9109816Z #define _GLIBCXX_HAVE_STRING_H 1 2025-05-07T19:46:29.9110163Z #define __LDBL_MIN__ 3.36210314311209350626267781732175260e-4932L 2025-05-07T19:46:29.9110538Z #define cudaDeviceScheduleSpin 0x01 2025-05-07T19:46:29.9110928Z #define __nonnull(params) __attribute__ ((__nonnull__ params)) 2025-05-07T19:46:29.9111320Z #define __DBL_IS_IEC_60559__ 2 2025-05-07T19:46:29.9111574Z #define le32toh(x) (x) 2025-05-07T19:46:29.9111824Z #define _SIZE_T_DEFINED 2025-05-07T19:46:29.9112074Z #define _GLIBCXX_HAVE_XLOCALE_H 1 2025-05-07T19:46:29.9112536Z #define cudaArraySparsePropertiesSingleMipTail 0x1 2025-05-07T19:46:29.9112986Z #define __DEC32_MAX__ 9.999999E96DF 2025-05-07T19:46:29.9113373Z #define __WIFSIGNALED(status) (((signed char) (((status) & 0x7f) + 1) >> 1) > 0) 2025-05-07T19:46:29.9113772Z #define _GLIBCXX_HAVE_FMODL 1 2025-05-07T19:46:29.9114034Z #define _GLIBCXX_HAVE_POLL 1 2025-05-07T19:46:29.9114290Z #define __SM_32_INTRINSICS_H__ 2025-05-07T19:46:29.9114535Z #define _POSIX_NAME_MAX 14 2025-05-07T19:46:29.9114807Z #define __cpp_threadsafe_static_init 200806L 2025-05-07T19:46:29.9115308Z #define _GLIBCXX_MAKE_MOVE_IF_NOEXCEPT_ITERATOR(_Iter) std::__make_move_if_noexcept_iterator(_Iter) 2025-05-07T19:46:29.9115804Z #define _GLIBCXX_USE_CLOCK_REALTIME 1 2025-05-07T19:46:29.9116096Z #define __cpp_enumerator_attributes 201411L 2025-05-07T19:46:29.9116439Z #define __WCOREDUMP(status) ((status) & __WCOREFLAG) 2025-05-07T19:46:29.9116807Z #define _WCHAR_T_ 2025-05-07T19:46:29.9117038Z #define _GLIBCXX_FAST_MATH 0 2025-05-07T19:46:29.9117399Z #define __FLT64X_DENORM_MIN__ 3.64519953188247460252840593361941982e-4951F64x 2025-05-07T19:46:29.9117770Z #define RTSIG_MAX 32 2025-05-07T19:46:29.9117993Z #define _STDDEF_H 2025-05-07T19:46:29.9118207Z #define CU_UUID_HAS_BEEN_DEFINED 2025-05-07T19:46:29.9118474Z #define _VA_LIST_DEFINED 2025-05-07T19:46:29.9118707Z #define __FLT32X_HAS_INFINITY__ 1 2025-05-07T19:46:29.9119032Z #define __glibcxx_requires_non_empty_range(_First,_Last) 2025-05-07T19:46:29.9119403Z #define __grid_constant__ __location__(grid_constant) 2025-05-07T19:46:29.9119734Z #define __INT32_MAX__ 0x7fffffff 2025-05-07T19:46:29.9120020Z #define _GLIBCXX_BEGIN_EXTERN_C extern "C" { 2025-05-07T19:46:29.9120465Z #define _PSTL_CPP14_INTEGER_SEQUENCE_PRESENT (_MSC_VER >= 1900 || __cplusplus >= 201402L) 2025-05-07T19:46:29.9120987Z #define __glibcxx_digits_b(T,B) (B - __glibcxx_signed_b (T,B)) 2025-05-07T19:46:29.9121339Z #define __SIZEOF_PTHREAD_COND_T 48 2025-05-07T19:46:29.9121652Z #define _PSTL_PRAGMA_SIMD_ORDERED_MONOTONIC(PRM) 2025-05-07T19:46:29.9121946Z #define __unix__ 1 2025-05-07T19:46:29.9122171Z #define __SM_60_ATOMIC_FUNCTIONS_H__ 2025-05-07T19:46:29.9122434Z #define __INT_WIDTH__ 32 2025-05-07T19:46:29.9122671Z #define __SIZEOF_LONG__ 8 2025-05-07T19:46:29.9122902Z #define _IONBF 2 2025-05-07T19:46:29.9123326Z #define __MATHCALLX(function,suffix,args,attrib) __MATHDECLX (_Mdouble_,function,suffix, args, attrib) 2025-05-07T19:46:29.9124079Z #define _IO_getc_unlocked(_fp) (_IO_BE ((_fp)->_IO_read_ptr >= (_fp)->_IO_read_end, 0) ? __uflow (_fp) : *(unsigned char *) (_fp)->_IO_read_ptr++) 2025-05-07T19:46:29.9124595Z #define __STDC_IEC_559__ 1 2025-05-07T19:46:29.9124844Z #define __STDC_ISO_10646__ 201103L 2025-05-07T19:46:29.9125091Z #define __UINT16_C(c) c 2025-05-07T19:46:29.9125391Z #define M_2_PI 0.63661977236758134308 2025-05-07T19:46:29.9125656Z #define STA_DEL 0x0020 2025-05-07T19:46:29.9125880Z #define __CUDACC_VER_MINOR__ 6 2025-05-07T19:46:29.9126133Z #define __id_t_defined 2025-05-07T19:46:29.9126382Z #define w_retcode __wait_terminated.__w_retcode 2025-05-07T19:46:29.9126824Z #define _IO_PENDING_OUTPUT_COUNT(_fp) ((_fp)->_IO_write_ptr - (_fp)->_IO_write_base) 2025-05-07T19:46:29.9127229Z #define _GLIBCXX_HAVE_MODFF 1 2025-05-07T19:46:29.9127488Z #define _GLIBCXX_HAVE_MODFL 1 2025-05-07T19:46:29.9127727Z #define __DECIMAL_DIG__ 21 2025-05-07T19:46:29.9127975Z #define _POSIX2_RE_DUP_MAX 255 2025-05-07T19:46:29.9128215Z #define __USE_FORTIFY_LEVEL 0 2025-05-07T19:46:29.9128610Z #define __STDC_IEC_559_COMPLEX__ 1 2025-05-07T19:46:29.9129048Z #define SING 2 2025-05-07T19:46:29.9129258Z #define STA_FREQHOLD 0x0080 2025-05-07T19:46:29.9129561Z #define __SM_32_ATOMIC_FUNCTIONS_HPP__ 2025-05-07T19:46:29.9129858Z #define cudaStreamDefault 0x00 2025-05-07T19:46:29.9130322Z #define __FLT64_EPSILON__ 2.22044604925031308084726333618164062e-16F64 2025-05-07T19:46:29.9130708Z #define _GLIBCXX_HAVE_HYPOTL 1 2025-05-07T19:46:29.9130994Z #define _GLIBCXX_HAVE_SYS_UIO_H 1 2025-05-07T19:46:29.9131260Z #define __gnu_linux__ 1 2025-05-07T19:46:29.9131510Z #define __INT16_MAX__ 0x7fff 2025-05-07T19:46:29.9131763Z #define _LARGEFILE_SOURCE 1 2025-05-07T19:46:29.9132026Z #define MAX_INPUT 255 2025-05-07T19:46:29.9132277Z #define __FLT64_MIN_EXP__ (-1021) 2025-05-07T19:46:29.9132608Z #define __isalpha_l(c,l) __isctype_l((c), _ISalpha, (l)) 2025-05-07T19:46:29.9133004Z #define __glibcxx_requires_heap(_First,_Last) 2025-05-07T19:46:29.9133329Z #define _GLIBCXX_CPU_DEFINES 1 2025-05-07T19:46:29.9133613Z #define _GLIBCXX_HAVE_POLL_H 1 2025-05-07T19:46:29.9134017Z #define __attribute_warn_unused_result__ __attribute__ ((__warn_unused_result__)) 2025-05-07T19:46:29.9134471Z #define _IO_SHOWPOS 02000 2025-05-07T19:46:29.9134797Z #define _GLIBCXX_HAVE_SYMVER_SYMBOL_RENAMING_RUNTIME_SUPPORT 1 2025-05-07T19:46:29.9135184Z #define _Mfloat_ float 2025-05-07T19:46:29.9135459Z #define __glibcxx_requires_cond(_Cond,_Msg) 2025-05-07T19:46:29.9135901Z #define __FLT64X_MIN_10_EXP__ (-4931) 2025-05-07T19:46:29.9136212Z #define DELAYTIMER_MAX 2147483647 2025-05-07T19:46:29.9136718Z #define __glibcxx_max_b(T,B) (__glibcxx_signed_b (T,B) ? (((((T)1 << (__glibcxx_digits_b (T,B) - 1)) - 1) << 1) + 1) : ~(T)0) 2025-05-07T19:46:29.9137252Z #define __LDBL_HAS_QUIET_NAN__ 1 2025-05-07T19:46:29.9137532Z #define _GLIBCXX98_USE_C99_STDIO 1 2025-05-07T19:46:29.9137884Z #define cudaKernelNodeAttrID cudaLaunchAttributeID 2025-05-07T19:46:29.9138255Z #define __glibcxx_class_requires2(_a,_b,_c) 2025-05-07T19:46:29.9138573Z #define __USE_ISOC11 1 2025-05-07T19:46:29.9138813Z #define _BSD_SIZE_T_ 2025-05-07T19:46:29.9139040Z #define ADJ_MICRO 0x1000 2025-05-07T19:46:29.9139300Z #define _GLIBCXX_HAVE_FABSF 1 2025-05-07T19:46:29.9139558Z #define _GLIBCXX_HAVE_FABSL 1 2025-05-07T19:46:29.9139866Z #define _PSTL_PRAGMA_SIMD _PSTL_PRAGMA(omp simd) 2025-05-07T19:46:29.9140193Z #define __FLT64_MANT_DIG__ 53 2025-05-07T19:46:29.9140518Z #define __attribute_const__ __attribute__ ((__const__)) 2025-05-07T19:46:29.9140853Z #define __THROW throw () 2025-05-07T19:46:29.9141119Z #define __cudaGet_gridDim() gridDim 2025-05-07T19:46:29.9141411Z #define __SM_60_ATOMIC_FUNCTIONS_HPP__ 2025-05-07T19:46:29.9141783Z #define __glibcxx_requires_heap_pred(_First,_Last,_Pred) 2025-05-07T19:46:29.9142158Z #define htobe32(x) __bswap_32 (x) 2025-05-07T19:46:29.9142432Z #define _GLIBCXX_HAVE_POWL 1 2025-05-07T19:46:29.9142709Z #define __FLT64X_MANT_DIG__ 64 2025-05-07T19:46:29.9142975Z #define __GLIBC_HAVE_LONG_LONG 1 2025-05-07T19:46:29.9143240Z #define L_tmpnam 20 2025-05-07T19:46:29.9143500Z #define ___int_wchar_t_h 2025-05-07T19:46:29.9143872Z #define WIFCONTINUED(status) __WIFCONTINUED (__WAIT_INT (status)) 2025-05-07T19:46:29.9144285Z #define isascii(c) __isascii (c) 2025-05-07T19:46:29.9144577Z #define _T_PTRDIFF 2025-05-07T19:46:29.9144892Z #define _GLIBCXX_MOVE3(_Tp,_Up,_Vp) std::move(_Tp, _Up, _Vp) 2025-05-07T19:46:29.9145389Z #define toascii(c) __toascii (c) 2025-05-07T19:46:29.9145888Z #define __GNUC__ 11 2025-05-07T19:46:29.9146144Z #define __SYSCALL_ULONG_TYPE __ULONGWORD_TYPE 2025-05-07T19:46:29.9146458Z #define __GXX_RTTI 1 2025-05-07T19:46:29.9146692Z #define __pie__ 2 2025-05-07T19:46:29.9146921Z #define __MMX__ 1 2025-05-07T19:46:29.9147149Z #define __cudaCDP2Malloc 2025-05-07T19:46:29.9147414Z #define __timespec_defined 1 2025-05-07T19:46:29.9147656Z #define L_ctermid 9 2025-05-07T19:46:29.9147914Z #define __OFF64_T_TYPE __SQUAD_TYPE 2025-05-07T19:46:29.9148218Z #define __cudaCDP2GetParameterBufferV2 2025-05-07T19:46:29.9148605Z #define offsetof(TYPE,MEMBER) __builtin_offsetof (TYPE, MEMBER) 2025-05-07T19:46:29.9148982Z #define _BITS_POSIX2_LIM_H 1 2025-05-07T19:46:29.9149251Z #define _GLIBCXX98_USE_C99_STDLIB 1 2025-05-07T19:46:29.9149534Z #define cudaMemAttachGlobal 0x01 2025-05-07T19:46:29.9149848Z #define FD_SET(fd,fdsetp) __FD_SET (fd, fdsetp) 2025-05-07T19:46:29.9150172Z #define __FLT_HAS_DENORM__ 1 2025-05-07T19:46:29.9150423Z #define __SIZEOF_LONG_DOUBLE__ 16 2025-05-07T19:46:29.9150869Z #define _GLIBCXX_NATIVE_THREAD_ID (__gthread_active_p() ? __gthread_self() : (__gthread_t)1) 2025-05-07T19:46:29.9151613Z #define assert_perror(errnum) (!(errnum) ? __ASSERT_VOID_CAST (0) : __assert_perror_fail ((errnum), __FILE__, __LINE__, __ASSERT_FUNCTION)) 2025-05-07T19:46:29.9152239Z #define _IO_HAVE_ST_BLKSIZE _G_HAVE_ST_BLKSIZE 2025-05-07T19:46:29.9152535Z #define __USE_SVID 1 2025-05-07T19:46:29.9152789Z #define __constant__ __location__(constant) 2025-05-07T19:46:29.9153088Z #define _GLIBCXX_HAVE_POSIX_MEMALIGN 1 2025-05-07T19:46:29.9153383Z #define __device__ __location__(device) 2025-05-07T19:46:29.9153708Z #define _GLIBCXX_HAVE_EXCEPTION_PTR_SINCE_GCC46 1 2025-05-07T19:46:29.9154016Z #define _GLIBCXX_RES_LIMITS 1 2025-05-07T19:46:29.9154279Z #define M_1_PI 0.31830988618379067154 2025-05-07T19:46:29.9154542Z #define CUDART_DEVICE __device__ 2025-05-07T19:46:29.9154902Z #define __LDBL_REDIR1_NTH(name,proto,alias) name proto __THROW 2025-05-07T19:46:29.9155330Z #define M_PI_2 1.57079632679489661923 2025-05-07T19:46:29.9155621Z #define __BIGGEST_ALIGNMENT__ 16 2025-05-07T19:46:29.9155983Z #define cudaExternalSemaphoreWaitSkipNvSciBufMemSync 0x02 2025-05-07T19:46:29.9156359Z #define __STDC_UTF_16__ 1 2025-05-07T19:46:29.9156596Z #define LONG_MAX __LONG_MAX__ 2025-05-07T19:46:29.9156936Z #define __glibcxx_digits10_b(T,B) (__glibcxx_digits_b (T,B) * 643L / 2136) 2025-05-07T19:46:29.9157345Z #define _POSIX_THREAD_DESTRUCTOR_ITERATIONS 4 2025-05-07T19:46:29.9157639Z #define _POSIX_HOST_NAME_MAX 255 2025-05-07T19:46:29.9157892Z #define __FLT64_MAX_10_EXP__ 308 2025-05-07T19:46:29.9158134Z #define NGROUPS_MAX 65536 2025-05-07T19:46:29.9158366Z #define _GLIBCXX_NAMESPACE_LDBL 2025-05-07T19:46:29.9158605Z #define __USE_ISOC95 1 2025-05-07T19:46:29.9158814Z #define _TIME_H 1 2025-05-07T19:46:29.9159051Z #define M_LOG10El 0.434294481903251827651128918916605082L 2025-05-07T19:46:29.9159371Z #define __USE_ISOC99 1 2025-05-07T19:46:29.9159696Z #define __ASMNAME(cname) __ASMNAME2 (__USER_LABEL_PREFIX__, cname) 2025-05-07T19:46:29.9160045Z #define HOST_NAME_MAX 64 2025-05-07T19:46:29.9160285Z #define _POSIX_SEM_NSEMS_MAX 256 2025-05-07T19:46:29.9160523Z #define _IOS_ATEND 4 2025-05-07T19:46:29.9160749Z #define __SM_35_INTRINSICS_H__ 2025-05-07T19:46:29.9161050Z #define WTERMSIG(status) __WTERMSIG (__WAIT_INT (status)) 2025-05-07T19:46:29.9161437Z #define cudaStreamAttrValue cudaLaunchAttributeValue 2025-05-07T19:46:29.9161762Z #define _GLIBCXX_HAVE_S_ISREG 1 2025-05-07T19:46:29.9162026Z #define cudaSurfaceTypeCubemap 0x0C 2025-05-07T19:46:29.9162329Z #define __cpp_delegating_constructors 200604L 2025-05-07T19:46:29.9162620Z #define __FLT32_HAS_INFINITY__ 1 2025-05-07T19:46:29.9162862Z #define _STDIO_H 1 2025-05-07T19:46:29.9163233Z #define __isctype_l(c,type,locale) ((locale)->__ctype_b[(int) (c)] & (unsigned short int) type) 2025-05-07T19:46:29.9163686Z #define _GLIBCXX_PREDEFINED_OPS_H 1 2025-05-07T19:46:29.9164104Z #define __DBL_MAX__ double(1.79769313486231570814527423731704357e+308L) 2025-05-07T19:46:29.9164474Z #define _G_IO_IO_FILE_VERSION 0x20001 2025-05-07T19:46:29.9164767Z #define _POSIX_SIGQUEUE_MAX 32 2025-05-07T19:46:29.9165036Z #define _GLIBCXX_HAVE_GETS 1 2025-05-07T19:46:29.9165309Z #define _GLIBCXX_HAVE_LINUX_TYPES_H 1 2025-05-07T19:46:29.9165574Z #define __cpp_raw_strings 200710L 2025-05-07T19:46:29.9165863Z #define __INT_FAST32_MAX__ 0x7fffffffffffffffL 2025-05-07T19:46:29.9166155Z #define _GLIBCXX_HAVE_VFWSCANF 1 2025-05-07T19:46:29.9166417Z #define __DBL_HAS_INFINITY__ 1 2025-05-07T19:46:29.9166664Z #define __STDCPP_MATH_SPEC_FUNCS__ 201003L 2025-05-07T19:46:29.9166941Z #define _GLIBCXX_STDIO_EOF -1 2025-05-07T19:46:29.9167184Z #define __SIZEOF_PTHREAD_MUTEX_T 40 2025-05-07T19:46:29.9167449Z #define __CHANNEL_DESCRIPTOR_H__ 2025-05-07T19:46:29.9167774Z #define _ISbit(bit) ((bit) < 8 ? ((1 << (bit)) << 8) : ((1 << (bit)) >> 8)) 2025-05-07T19:46:29.9168125Z #define __SIZEOF_FLOAT__ 4 2025-05-07T19:46:29.9168534Z #define __USE_XOPEN 1 2025-05-07T19:46:29.9168770Z #define __SIZEOF_PTHREAD_RWLOCK_T 56 2025-05-07T19:46:29.9169212Z #define cudaStreamAttributeMemSyncDomain cudaLaunchAttributeMemSyncDomain 2025-05-07T19:46:29.9169834Z #define __USE_XOPEN2K 1 2025-05-07T19:46:29.9170076Z #define _PSTL_UDR_PRESENT 1 2025-05-07T19:46:29.9170428Z #define __HAVE_SPECULATION_SAFE_VALUE 1 2025-05-07T19:46:29.9170729Z #define _GLIBCXX_HAVE_COSF 1 2025-05-07T19:46:29.9170998Z #define __cpp_fold_expressions 201603L 2025-05-07T19:46:29.9171547Z #define cudaWaitExternalSemaphoresAsync __CUDART_API_PTSZ(cudaWaitExternalSemaphoresAsync_v2) 2025-05-07T19:46:29.9172103Z #define NL_LANGMAX _POSIX2_LINE_MAX 2025-05-07T19:46:29.9172382Z #define __DEC32_MIN_EXP__ (-94) 2025-05-07T19:46:29.9172752Z #define __glibcxx_requires_partitioned_upper(_First,_Last,_Value) 2025-05-07T19:46:29.9173144Z #define __DADDR_T_TYPE __S32_TYPE 2025-05-07T19:46:29.9173536Z #define cudaExternalSemaphoreSignalSkipNvSciBufMemSync 0x01 2025-05-07T19:46:29.9173946Z #define __END_NAMESPACE_C99 2025-05-07T19:46:29.9174289Z #define __glibcxx_integral_traps true 2025-05-07T19:46:29.9174574Z #define _POSIX_PATH_MAX 256 2025-05-07T19:46:29.9174829Z #define __INTPTR_WIDTH__ 64 2025-05-07T19:46:29.9175091Z #define __FLT64X_HAS_INFINITY__ 1 2025-05-07T19:46:29.9175359Z #define _ISOC11_SOURCE 1 2025-05-07T19:46:29.9175615Z #define _GLIBCXX_HAVE_LINUX_FUTEX 1 2025-05-07T19:46:29.9175907Z #define __UINT_LEAST32_MAX__ 0xffffffffU 2025-05-07T19:46:29.9176213Z #define _GLIBCXX_HAVE_QUICK_EXIT 1 2025-05-07T19:46:29.9176581Z #define __glibcxx_requires_irreflexive_pred2(_First,_Last,_Pred) 2025-05-07T19:46:29.9176980Z #define LONG_MIN (-LONG_MAX - 1L) 2025-05-07T19:46:29.9177256Z #define _GLIBCXX_HAVE_SINCOSF 1 2025-05-07T19:46:29.9177528Z #define _IO_UNITBUF 020000 2025-05-07T19:46:29.9177775Z #define _GLIBCXX_HAVE_SINCOSL 1 2025-05-07T19:46:29.9178044Z #define __FD_SETSIZE 1024 2025-05-07T19:46:29.9178297Z #define getc(_fp) _IO_getc (_fp) 2025-05-07T19:46:29.9178573Z #define be32toh(x) __bswap_32 (x) 2025-05-07T19:46:29.9178927Z #define _GLIBCXX_PACKAGE__GLIBCXX_VERSION "version-unused" 2025-05-07T19:46:29.9179300Z #define __FLT32X_HAS_DENORM__ 1 2025-05-07T19:46:29.9179576Z #define __INT_FAST16_TYPE__ long int 2025-05-07T19:46:29.9179885Z #define isxdigit_l(c,l) __isxdigit_l ((c), (l)) 2025-05-07T19:46:29.9180221Z #define _GLIBCXX_HAVE_GETIPINFO 1 2025-05-07T19:46:29.9180487Z #define __MMX_WITH_SSE__ 1 2025-05-07T19:46:29.9180802Z #define __isalnum_l(c,l) __isctype_l((c), _ISalnum, (l)) 2025-05-07T19:46:29.9181152Z #define _WCHAR_T_DEFINED_ 2025-05-07T19:46:29.9181434Z #define cudaIpcMemLazyEnablePeerAccess 0x01 2025-05-07T19:46:29.9181766Z #define _GLIBCXX_HAVE_AT_QUICK_EXIT 1 2025-05-07T19:46:29.9182059Z #define __INO_T_MATCHES_INO64_T 1 2025-05-07T19:46:29.9182337Z #define __USE_POSIX199506 1 2025-05-07T19:46:29.9182677Z #define _FEATURES_H 1 2025-05-07T19:46:29.9182901Z #define __LDBL_HAS_DENORM__ 1 2025-05-07T19:46:29.9183269Z #define _PSTL_PRAGMA_SIMD_REDUCTION(PRM) _PSTL_PRAGMA(omp simd reduction(PRM)) 2025-05-07T19:46:29.9183738Z #define __stub_getmsg 2025-05-07T19:46:29.9183944Z #define _IO_FIXED 010000 2025-05-07T19:46:29.9184202Z #define __cpp_lib_addressof_constexpr 201603 2025-05-07T19:46:29.9184498Z #define _GLIBCXX11_USE_C99_STDIO 1 2025-05-07T19:46:29.9184748Z #define __stub_setlogin 2025-05-07T19:46:29.9184969Z #define __stub_fattach 2025-05-07T19:46:29.9185329Z #define __cplusplus 201703L 2025-05-07T19:46:29.9185578Z #define __cpp_ref_qualifiers 200710L 2025-05-07T19:46:29.9185830Z #define _STRUCT_TIMEVAL 1 2025-05-07T19:46:29.9186067Z #define INFINITY (__builtin_inff()) 2025-05-07T19:46:29.9186318Z #define _IO_UNBUFFERED 2 2025-05-07T19:46:29.9186797Z #define cudaStreamAttributeSynchronizationPolicy cudaLaunchAttributeSynchronizationPolicy 2025-05-07T19:46:29.9187304Z #define _IO_INTERNAL 010 2025-05-07T19:46:29.9187545Z #define __DEC32_MIN__ 1E-95DF 2025-05-07T19:46:29.9187880Z #define cudaKernelNodeAttrValue cudaLaunchAttributeValue 2025-05-07T19:46:29.9188223Z #define __dev_t_defined 2025-05-07T19:46:29.9188466Z #define __DEPRECATED 1 2025-05-07T19:46:29.9188679Z #define __S32_TYPE int 2025-05-07T19:46:29.9188918Z #define __cpp_rvalue_references 200610L 2025-05-07T19:46:29.9189191Z #define __DBL_MAX_EXP__ 1024 2025-05-07T19:46:29.9189443Z #define _IO_fpos_t _G_fpos_t 2025-05-07T19:46:29.9189678Z #define __WCHAR_WIDTH__ 32 2025-05-07T19:46:29.9190284Z #define cudaKernelNodeAttributePreferredSharedMemoryCarveout cudaLaunchAttributePreferredSharedMemoryCarveout 2025-05-07T19:46:29.9190922Z #define _G_HAVE_MREMAP 1 2025-05-07T19:46:29.9191208Z #define __FLT32_MAX__ 3.40282346638528859811704183484516925e+38F32 2025-05-07T19:46:29.9191549Z #define OVERFLOW 3 2025-05-07T19:46:29.9191781Z #define __toascii_l(c,l) ((l), __toascii (c)) 2025-05-07T19:46:29.9192086Z #define __DEC128_EPSILON__ 1E-33DL 2025-05-07T19:46:29.9192347Z #define __SM_32_ATOMIC_FUNCTIONS_H__ 2025-05-07T19:46:29.9192683Z #define _GLIBCXX_DEFAULT_ABI_TAG _GLIBCXX_ABI_TAG_CXX11 2025-05-07T19:46:29.9193003Z #define __SSE2_MATH__ 1 2025-05-07T19:46:29.9193298Z #define __ATOMIC_HLE_RELEASE 131072 2025-05-07T19:46:29.9193587Z #define __FSFILCNT_T_TYPE __SYSCALL_ULONG_TYPE 2025-05-07T19:46:29.9193884Z #define _IO_STDIO_H 2025-05-07T19:46:29.9194140Z #define PDP_ENDIAN __PDP_ENDIAN 2025-05-07T19:46:29.9194417Z #define isspace_l(c,l) __isspace_l ((c), (l)) 2025-05-07T19:46:29.9194732Z #define __cudaCDP2Memcpy2DAsync 2025-05-07T19:46:29.9195013Z #define __PTRDIFF_MAX__ 0x7fffffffffffffffL 2025-05-07T19:46:29.9195318Z #define _GLIBCXX_HAVE_STRERROR_R 1 2025-05-07T19:46:29.9195735Z #define __amd64 1 2025-05-07T19:46:29.9195963Z #define _POSIX_TZNAME_MAX 6 2025-05-07T19:46:29.9196218Z #define __cudaCDP2Memset3DAsync 2025-05-07T19:46:29.9196497Z #define __SYSCALL_WORDSIZE 64 2025-05-07T19:46:29.9196793Z #define _GLIBCXX_HAVE_ATTRIBUTE_VISIBILITY 1 2025-05-07T19:46:29.9197263Z #define _EXT_TYPE_TRAITS 1 2025-05-07T19:46:29.9197539Z #define _GLIBCXX_HAVE_POSIX_SEMAPHORE 1 2025-05-07T19:46:29.9197843Z #define _POSIX_RE_DUP_MAX 255 2025-05-07T19:46:29.9198114Z #define __STDC_NO_THREADS__ 1 2025-05-07T19:46:29.9198373Z #define __bounded 2025-05-07T19:46:29.9198615Z #define __USECONDS_T_TYPE __U32_TYPE 2025-05-07T19:46:29.9198902Z #define _IO_DELETE_DONT_CLOSE 0x40 2025-05-07T19:46:29.9199195Z #define __BEGIN_NAMESPACE_STD 2025-05-07T19:46:29.9199459Z #define _PTRDIFF_T_DECLARED 2025-05-07T19:46:29.9199743Z #define __OFF_T_TYPE __SYSCALL_SLONG_TYPE 2025-05-07T19:46:29.9200077Z #define __W_STOPCODE(sig) ((sig) << 8 | 0x7f) 2025-05-07T19:46:29.9200503Z #define cudaStreamAttributePriority cudaLaunchAttributePriority 2025-05-07T19:46:29.9200932Z #define _GLIBCXX_HAVE_NETDB_H 1 2025-05-07T19:46:29.9201201Z #define __SM_20_INTRINSICS_HPP__ 2025-05-07T19:46:29.9201554Z #define __cpp_lib_has_unique_object_representations 201606 2025-05-07T19:46:29.9201914Z #define STA_PLL 0x0001 2025-05-07T19:46:29.9202183Z #define __ATOMIC_HLE_ACQUIRE 65536 2025-05-07T19:46:29.9202447Z #define __GNUG__ 11 2025-05-07T19:46:29.9202759Z #define _GLIBCXX_USE_GET_NPROCS 1 2025-05-07T19:46:29.9203023Z #define _T_WCHAR 2025-05-07T19:46:29.9203276Z #define __cudaCDP2GetDeviceCount 2025-05-07T19:46:29.9203581Z #define __specialization_static 2025-05-07T19:46:29.9203887Z #define __LONG_LONG_MAX__ 0x7fffffffffffffffLL 2025-05-07T19:46:29.9204218Z #define __SIZEOF_SIZE_T__ 8 2025-05-07T19:46:29.9204477Z #define cudaArraySparse 0x40 2025-05-07T19:46:29.9204754Z #define STA_PPSFREQ 0x0002 2025-05-07T19:46:29.9204999Z #define __GLIBCXX__ 20230528 2025-05-07T19:46:29.9205293Z #define _IO_stdin ((_IO_FILE*)(&_IO_2_1_stdin_)) 2025-05-07T19:46:29.9205598Z #define _WCHAR_T 2025-05-07T19:46:29.9205826Z #define __cudaCDP2Free 2025-05-07T19:46:29.9206514Z #define __FD_ZERO(fdsp) do { int __d0, __d1; __asm__ __volatile__ ("cld; rep; " __FD_ZERO_STOS : "=c" (__d0), "=D" (__d1) : "a" (0), "0" (sizeof (fd_set) / sizeof (__fd_mask)), "1" (&__FDS_BITS (fdsp)[0]) : "memory"); } while (0) 2025-05-07T19:46:29.9207239Z #define __cpp_nsdmi 200809L 2025-05-07T19:46:29.9207696Z #define __glibcxx_min_b(T,B) (__glibcxx_signed_b (T,B) ? -__glibcxx_max_b (T,B) - 1 : (T)0) 2025-05-07T19:46:29.9208155Z #define __FLT64X_MIN_EXP__ (-16381) 2025-05-07T19:46:29.9208453Z #define __SIZEOF_WINT_T__ 4 2025-05-07T19:46:29.9208715Z #define cudaArrayCubemap 0x04 2025-05-07T19:46:29.9209067Z #define _PSTL_MONOTONIC_PRESENT (__INTEL_COMPILER >= 1800) 2025-05-07T19:46:29.9209442Z #define _GLIBCXX_UTILITY 1 2025-05-07T19:46:29.9209688Z #define __NO_CTYPE 1 2025-05-07T19:46:29.9209935Z #define __stub_bdflush 2025-05-07T19:46:29.9210381Z #define _GLIBCXX_MAKE_MOVE_ITERATOR(_Iter) std::make_move_iterator(_Iter) 2025-05-07T19:46:29.9210838Z #define __CORRECT_ISO_CPP_STRING_H_PROTO 2025-05-07T19:46:29.9211204Z #define _GLIBCXX_STDC_HEADERS 1 2025-05-07T19:46:29.9211490Z #define __LONG_LONG_WIDTH__ 64 2025-05-07T19:46:29.9211767Z #define __cpp_initializer_lists 200806L 2025-05-07T19:46:29.9212096Z #define _GLIBCXX_HAVE_NETINET_TCP_H 1 2025-05-07T19:46:29.9212399Z #define __U16_TYPE unsigned short int 2025-05-07T19:46:29.9212853Z #define __glibcxx_requires_can_increment(_First,_Size) 2025-05-07T19:46:29.9213226Z #define _GLIBCXX_HAVE_SYS_PARAM_H 1 2025-05-07T19:46:29.9213508Z #define __FLT32_MAX_EXP__ 128 2025-05-07T19:46:29.9213808Z #define cudaHostRegisterIoMemory 0x04 2025-05-07T19:46:29.9214153Z #define __FD_MASK(d) ((__fd_mask) 1 << ((d) % __NFDBITS)) 2025-05-07T19:46:29.9214615Z #define __cpp_lib_is_invocable 201703 2025-05-07T19:46:29.9214901Z #define _IO_STDIO 040000 2025-05-07T19:46:29.9215245Z #define _SIGSET_NWORDS (1024 / (8 * sizeof (unsigned long int))) 2025-05-07T19:46:29.9215637Z #define cudaSurfaceType1DLayered 0xF1 2025-05-07T19:46:29.9215969Z #define cudaArraySurfaceLoadStore 0x02 2025-05-07T19:46:29.9216272Z #define _PTRDIFF_T 2025-05-07T19:46:29.9216483Z #define _MOVE_H 1 2025-05-07T19:46:29.9216716Z #define __cpp_hex_float 201603L 2025-05-07T19:46:29.9216974Z #define ADJ_TAI 0x0080 2025-05-07T19:46:29.9217212Z #define __ptrvalue 2025-05-07T19:46:29.9217438Z #define _GLIBCXX_HOSTED 1 2025-05-07T19:46:29.9217699Z #define __GXX_ABI_VERSION 1016 2025-05-07T19:46:29.9217984Z #define __WTERMSIG(status) ((status) & 0x7f) 2025-05-07T19:46:29.9218300Z #define MATH_ERREXCEPT 2 2025-05-07T19:46:29.9218550Z #define _GLIBCXX_HAS_GTHREADS 1 2025-05-07T19:46:29.9218846Z #define cudaTextureType2DLayered 0xF2 2025-05-07T19:46:29.9219264Z #define __isleap(year) ((year) % 4 == 0 && ((year) % 100 != 0 || (year) % 400 == 0)) 2025-05-07T19:46:29.9219657Z #define __USE_GNU 1 2025-05-07T19:46:29.9219902Z #define __FLT128_HAS_INFINITY__ 1 2025-05-07T19:46:29.9220181Z #define __FLT_MIN_EXP__ (-125) 2025-05-07T19:46:29.9220459Z #define __GCC_HAVE_DWARF2_CFI_ASM 1 2025-05-07T19:46:29.9220852Z #define __FD_CLR(d,set) ((void) (__FDS_BITS (set)[__FD_ELT (d)] &= ~__FD_MASK (d))) 2025-05-07T19:46:29.9221266Z #define WEXITED 4 2025-05-07T19:46:29.9221479Z #define _IO_NO_READS 4 2025-05-07T19:46:29.9221792Z #define cudaGraphKernelNodePortLaunchCompletion 2 2025-05-07T19:46:29.9222235Z #define M_LOG2E 1.4426950408889634074 2025-05-07T19:46:29.9222520Z #define _POSIX_SYMLINK_MAX 255 2025-05-07T19:46:29.9222938Z #define _GLIBCXX_HAVE_BUILTIN_HAS_UNIQ_OBJ_REP 1 2025-05-07T19:46:29.9223239Z #define __uid_t_defined 2025-05-07T19:46:29.9223490Z #define __FD_ELT(d) ((d) / __NFDBITS) 2025-05-07T19:46:29.9223765Z #define _GLIBCXX_USE_STD_SPEC_FUNCS 1 2025-05-07T19:46:29.9224037Z #define WNOHANG 1 2025-05-07T19:46:29.9224263Z #define alloca(size) __builtin_alloca (size) 2025-05-07T19:46:29.9224564Z #define _GLIBCXX_HAVE_HYPOTF 1 2025-05-07T19:46:29.9224816Z #define cudaEventDefault 0x00 2025-05-07T19:46:29.9225109Z #define __maxnreg__(a) __attribute__((maxnreg(a))) 2025-05-07T19:46:29.9225423Z #define NL_SETMAX INT_MAX 2025-05-07T19:46:29.9225641Z #define __x86_64 1 2025-05-07T19:46:29.9225866Z #define __cudaCDP2LaunchDevice 2025-05-07T19:46:29.9226239Z #define __REDIRECT(name,proto,alias) name proto __asm__ (__ASMNAME (#alias)) 2025-05-07T19:46:29.9226710Z #define _GLIBCXX_BEGIN_NAMESPACE_CXX11 namespace __cxx11 { 2025-05-07T19:46:29.9227199Z #define __extern_always_inline extern __always_inline __attribute__ ((__gnu_inline__)) 2025-05-07T19:46:29.9227632Z #define __PTRDIFF_T 2025-05-07T19:46:29.9227933Z #define __exctype_l(name) extern int name (int, __locale_t) __THROW 2025-05-07T19:46:29.9228310Z #define _GLIBCXX_HAVE_FINITEL 1 2025-05-07T19:46:29.9228732Z #define __SM_35_ATOMIC_FUNCTIONS_H__ 2025-05-07T19:46:29.9229188Z #define _Mlong_double_ long double 2025-05-07T19:46:29.9229484Z #define __cpp_lambdas 200907L 2025-05-07T19:46:29.9229786Z #define _IO_DEC 020 2025-05-07T19:46:29.9230056Z #define _GLIBCXX_HAVE_SINHL 1 2025-05-07T19:46:29.9230329Z #define _POSIX_CLOCKRES_MIN 20000000 2025-05-07T19:46:29.9230630Z #define __INT_FAST64_TYPE__ long int 2025-05-07T19:46:29.9230909Z #define ADJ_TIMECONST 0x0020 2025-05-07T19:46:29.9231180Z #define _GLIBCXX_HAVE_SQRTL 1 2025-05-07T19:46:29.9231476Z #define __cudaCDP2DeviceGetSharedMemConfig 2025-05-07T19:46:29.9231817Z #define _GLIBCXX_HAVE_STDALIGN_H 1 2025-05-07T19:46:29.9232106Z #define _ANSI_STDDEF_H 2025-05-07T19:46:29.9232500Z #define _GLIBCXX_MOVE(__val) std::move(__val) 2025-05-07T19:46:29.9232835Z #define _GLIBCXX_HAVE_STRERROR_L 1 2025-05-07T19:46:29.9233204Z #define __FLT64_DENORM_MIN__ 4.94065645841246544176568792868221372e-324F64 2025-05-07T19:46:29.9233618Z #define _GLIBCXX_USE_DEV_RANDOM 1 2025-05-07T19:46:29.9233899Z #define _STL_ITERATOR_BASE_TYPES_H 1 2025-05-07T19:46:29.9234204Z #define __cpp_template_auto 201606L 2025-05-07T19:46:29.9234569Z #define __DBL_MIN__ double(2.22507385850720138309023271733240406e-308L) 2025-05-07T19:46:29.9234964Z #define _GLIBCXX_HAVE_SYS_SEM_H 1 2025-05-07T19:46:29.9235248Z #define __key_t_defined 2025-05-07T19:46:29.9235495Z #define _IO_MAGIC_MASK 0xFFFF0000 2025-05-07T19:46:29.9235887Z #define __cluster_dims__(...) __attribute__((cluster_dims(__VA_ARGS__))) 2025-05-07T19:46:29.9236376Z #define __FLT128_EPSILON__ 1.92592994438723585305597794258492732e-34F128 2025-05-07T19:46:29.9236772Z #define __GNUC_VA_LIST 2025-05-07T19:46:29.9237117Z #define __FLT64X_NORM_MAX__ 1.18973149535723176502126385303097021e+4932F64x 2025-05-07T19:46:29.9237530Z #define __SIZEOF_POINTER__ 8 2025-05-07T19:46:29.9237800Z #define CLOCK_REALTIME_COARSE 5 2025-05-07T19:46:29.9238131Z #define _GLIBCXX14_CONSTEXPR constexpr 2025-05-07T19:46:29.9238475Z #define __USE_XOPEN2KXSI 1 2025-05-07T19:46:29.9238742Z #define __WCOREFLAG 0x80 2025-05-07T19:46:29.9239040Z #define M_2_SQRTPI 1.12837916709551257390 2025-05-07T19:46:29.9239373Z #define cudaEventDisableTiming 0x02 2025-05-07T19:46:29.9239706Z #define __LP64__ 1 2025-05-07T19:46:29.9239975Z #define __isascii_l(c,l) ((l), __isascii (c)) 2025-05-07T19:46:29.9240352Z #define cudaStreamNonBlocking 0x01 2025-05-07T19:46:29.9240666Z #define _IO_off64_t __off64_t 2025-05-07T19:46:29.9240981Z #define __DBL_HAS_QUIET_NAN__ 1 2025-05-07T19:46:29.9241268Z #define __time_t_defined 1 2025-05-07T19:46:29.9241571Z #define _POSIX_SYMLOOP_MAX 8 2025-05-07T19:46:29.9242057Z #define __FLT32X_EPSILON__ 2.22044604925031308084726333618164062e-16F32x 2025-05-07T19:46:29.9242529Z #define __USE_UNIX98 1 2025-05-07T19:46:29.9242777Z #define __MODE_T_TYPE __U32_TYPE 2025-05-07T19:46:29.9243035Z #define CLOCK_REALTIME_ALARM 8 2025-05-07T19:46:29.9243305Z #define _GLIBCXX_HAVE_STRINGS_H 1 2025-05-07T19:46:29.9243590Z #define __LEAF_ATTR __attribute__ ((__leaf__)) 2025-05-07T19:46:29.9243904Z #define __DECIMAL_BID_FORMAT__ 1 2025-05-07T19:46:29.9244147Z #define SEEK_CUR 1 2025-05-07T19:46:29.9244377Z #define __RLIM64_T_TYPE __UQUAD_TYPE 2025-05-07T19:46:29.9244632Z #define _ASSERT_H 1 2025-05-07T19:46:29.9245202Z #define _PSTL_PRAGMA_DECLARE_REDUCTION(NAME,OP) _PSTL_PRAGMA(omp declare reduction(NAME:OP : omp_out(omp_in)) initializer(omp_priv = omp_orig)) 2025-05-07T19:46:29.9245834Z #define _GLIBCXX_USE_DEPRECATED 1 2025-05-07T19:46:29.9246092Z #define CHAR_MAX SCHAR_MAX 2025-05-07T19:46:29.9246341Z #define _GLIBCXX_HAVE_SETENV 1 2025-05-07T19:46:29.9246590Z #define NL_ARGMAX _POSIX_ARG_MAX 2025-05-07T19:46:29.9246873Z #define _GLIBCXX_USE_UTIMENSAT 1 2025-05-07T19:46:29.9247268Z #define __extern_inline extern __inline __attribute__ ((__gnu_inline__)) 2025-05-07T19:46:29.9247729Z #define _GLIBCXX_DEBUG_ONLY(_Statement) 2025-05-07T19:46:29.9248428Z #define _IO_putc_unlocked(_ch,_fp) (_IO_BE ((_fp)->_IO_write_ptr >= (_fp)->_IO_write_end, 0) ? __overflow (_fp, (unsigned char) (_ch)) : (unsigned char) (*(_fp)->_IO_write_ptr++ = (_ch))) 2025-05-07T19:46:29.9249097Z #define _GLIBCXX_HAVE_BUILTIN_LAUNDER 1 2025-05-07T19:46:29.9249421Z #define _IO_BOOLALPHA 0200000 2025-05-07T19:46:29.9249770Z #define _PSTL_CPP17_EXECUTION_POLICIES_PRESENT (_MSC_VER >= 1912) 2025-05-07T19:46:29.9250249Z #define _GLIBCXX_PACKAGE_URL "" 2025-05-07T19:46:29.9250514Z #define __FLT64_MIN_10_EXP__ (-307) 2025-05-07T19:46:29.9250962Z #define cudaArrayDefault 0x00 2025-05-07T19:46:29.9251308Z #define __cudaCDP2LaunchDeviceV2 2025-05-07T19:46:29.9251635Z #define __FDS_BITS(set) ((set)->fds_bits) 2025-05-07T19:46:29.9251958Z #define TLOSS 5 2025-05-07T19:46:29.9252191Z #define __ssize_t_defined 2025-05-07T19:46:29.9252488Z #define __CUDACC_VER_BUILD__ 85 2025-05-07T19:46:29.9252852Z #define _GLIBCXX_HAVE_SYS_SOCKET_H 1 2025-05-07T19:46:29.9253178Z #define ULONG_MAX (LONG_MAX * 2UL + 1UL) 2025-05-07T19:46:29.9253471Z #define __FLT64X_DECIMAL_DIG__ 21 2025-05-07T19:46:29.9253857Z #define _GLIBCXX_NAMESPACE_LDBL_OR_CXX11 _GLIBCXX_NAMESPACE_CXX11 2025-05-07T19:46:29.9254252Z #define _POSIX_HIWAT _POSIX_PIPE_BUF 2025-05-07T19:46:29.9254547Z #define __DEC128_MIN__ 1E-6143DL 2025-05-07T19:46:29.9254836Z #define __cudaCDP2EventRecordWithFlags 2025-05-07T19:46:29.9255160Z #define _GLIBCXX_ATOMIC_BUILTINS 1 2025-05-07T19:46:29.9255463Z #define cudaPeerAccessDefault 0x00 2025-05-07T19:46:29.9255737Z #define __REGISTER_PREFIX__ 2025-05-07T19:46:29.9256004Z #define __UINT16_MAX__ 0xffff 2025-05-07T19:46:29.9256333Z #define __glibcxx_requires_sorted_set(_First1,_Last1,_First2) 2025-05-07T19:46:29.9256706Z #define _IOS_NOREPLACE 64 2025-05-07T19:46:29.9256934Z #define __cdecl 2025-05-07T19:46:29.9257180Z #define cudaEventInterprocess 0x04 2025-05-07T19:46:29.9257510Z #define M_SQRT1_2l 0.707106781186547524400844362104849039L 2025-05-07T19:46:29.9257862Z #define LOGIN_NAME_MAX 256 2025-05-07T19:46:29.9258153Z #define _IO_TIED_PUT_GET 0x400 2025-05-07T19:46:29.9258431Z #define X_TLOSS 1.41484755040568800000e+16 2025-05-07T19:46:29.9258752Z #define CUDA_IPC_HANDLE_SIZE 64 2025-05-07T19:46:29.9259036Z #define __LDBL_HAS_INFINITY__ 1 2025-05-07T19:46:29.9259367Z #define __attribute_pure__ __attribute__ ((__pure__)) 2025-05-07T19:46:29.9259716Z #define __TEXTURE_TYPES_H__ 2025-05-07T19:46:29.9260149Z #define __NV_GLIBCXX_VERSION (__GNUC__ * 10000 + __GNUC_MINOR__ * 100 + __GNUC_PATCHLEVEL__) 2025-05-07T19:46:29.9260617Z #define ADJ_NANO 0x2000 2025-05-07T19:46:29.9260956Z #define __FLT32_MIN__ 1.17549435082228750796873653722224568e-38F32 2025-05-07T19:46:29.9261346Z #define __UINT8_TYPE__ unsigned char 2025-05-07T19:46:29.9261636Z #define _GLIBCXX_HAVE_ISWBLANK 1 2025-05-07T19:46:29.9262063Z #define __FLT_DIG__ 6 2025-05-07T19:46:29.9262424Z #define __REDIRECT_LDBL(name,proto,alias) __REDIRECT (name, proto, alias) 2025-05-07T19:46:29.9262870Z #define __NO_INLINE__ 1 2025-05-07T19:46:29.9263180Z #define _PSTL_EARLYEXIT_PRESENT (__INTEL_COMPILER >= 1800) 2025-05-07T19:46:29.9263567Z #define _POSIX_NGROUPS_MAX 8 2025-05-07T19:46:29.9263825Z #define ADJ_STATUS 0x0010 2025-05-07T19:46:29.9264113Z #define __cudaCDP2MemcpyAsync_ptsz 2025-05-07T19:46:29.9264402Z #define CLOCK_BOOTTIME_ALARM 9 2025-05-07T19:46:29.9264700Z #define LONG_LONG_MAX __LONG_LONG_MAX__ 2025-05-07T19:46:29.9265028Z #define _GLIBCXX_HAVE_OBSOLETE_ISNAN 1 2025-05-07T19:46:29.9265325Z #define __DEC_EVAL_METHOD__ 2 2025-05-07T19:46:29.9265734Z #define cudaStreamGraphFireAndForget (cudaStream_t)0x0200000000000000 2025-05-07T19:46:29.9266187Z #define _GLIBCXX_HAVE_ALIGNED_ALLOC 1 2025-05-07T19:46:29.9266559Z #define __DEC128_MAX__ 9.999999999999999999999999999999999E6144DL 2025-05-07T19:46:29.9266919Z #define CHAR_MIN SCHAR_MIN 2025-05-07T19:46:29.9267292Z #define MAX_CANON 255 2025-05-07T19:46:29.9267605Z #define __FLT_MANT_DIG__ 24 2025-05-07T19:46:29.9267872Z #define __LDBL_DECIMAL_DIG__ 21 2025-05-07T19:46:29.9268151Z #define _GLIBCXX_HAVE_COMPLEX_H 1 2025-05-07T19:46:29.9268418Z #define _PSTL_PRAGMA_VECTOR_UNALIGNED 2025-05-07T19:46:29.9268733Z #define _POSIX_FD_SETSIZE _POSIX_OPEN_MAX 2025-05-07T19:46:29.9269028Z #define _GLIBCXX_HAVE_HYPOT 1 2025-05-07T19:46:29.9269323Z #define __cudaCDP2Memset2DAsync_ptsz 2025-05-07T19:46:29.9269639Z #define _GLIBCXX_TR1_MODIFIED_BESSEL_FUNC_TCC 1 2025-05-07T19:46:29.9269981Z #define __VERSION__ "11.4.0" 2025-05-07T19:46:29.9270241Z #define _GLIBCXX11_USE_C99_STDLIB 1 2025-05-07T19:46:29.9270553Z #define cudaHostRegisterMapped 0x02 2025-05-07T19:46:29.9270847Z #define _GLIBCXX_HAVE_INT64_T 1 2025-05-07T19:46:29.9271137Z #define _GLIBCXX_USE_CONSTEXPR constexpr 2025-05-07T19:46:29.9271441Z #define FD_ZERO(fdsetp) __FD_ZERO (fdsetp) 2025-05-07T19:46:29.9271723Z #define __UINT64_C(c) c ## UL 2025-05-07T19:46:29.9271996Z #define MOD_OFFSET ADJ_OFFSET 2025-05-07T19:46:29.9272232Z #define _SYS_TYPES_H 1 2025-05-07T19:46:29.9272533Z #define AIO_PRIO_DELTA_MAX 20 2025-05-07T19:46:29.9272788Z #define _GLIBCXX_HAVE_TANHF 1 2025-05-07T19:46:29.9273047Z #define _SYS_CDEFS_H 1 2025-05-07T19:46:29.9273268Z #define _GLIBCXX_HAVE_TANHL 1 2025-05-07T19:46:29.9273543Z #define __cpp_unicode_characters 201411L 2025-05-07T19:46:29.9273823Z #define _IO_ERR_SEEN 0x20 2025-05-07T19:46:29.9274076Z #define _GLIBCXX_USE_DECIMAL_FLOAT 1 2025-05-07T19:46:29.9274370Z #define __cudaCDP2StreamDestroy 2025-05-07T19:46:29.9274625Z #define FP_SUBNORMAL 3 2025-05-07T19:46:29.9274875Z #define cudaOccupancyDefault 0x00 2025-05-07T19:46:29.9275141Z #define _INITIALIZER_LIST 2025-05-07T19:46:29.9275391Z #define _STDC_PREDEF_H 1 2025-05-07T19:46:29.9275628Z #define __CUDA_RUNTIME_API_H__ 2025-05-07T19:46:29.9275904Z #define _GLIBCXX_PACKAGE_BUGREPORT "" 2025-05-07T19:46:29.9276180Z #define _GLIBCXX_HAVE_MODF 1 2025-05-07T19:46:29.9276447Z #define _IO_file_flags _flags 2025-05-07T19:46:29.9276689Z #define __USE_XOPEN2K8 1 2025-05-07T19:46:29.9276940Z #define htobe64(x) __bswap_64 (x) 2025-05-07T19:46:29.9277217Z #define _OLD_STDIO_MAGIC 0xFABC0000 2025-05-07T19:46:29.9277474Z #define HUGE 3.40282347e+38F 2025-05-07T19:46:29.9277742Z #define __cpp_lib_is_null_pointer 201309 2025-05-07T19:46:29.9278101Z #define WEXITSTATUS(status) __WEXITSTATUS (__WAIT_INT (status)) 2025-05-07T19:46:29.9278492Z #define islower_l(c,l) __islower_l ((c), (l)) 2025-05-07T19:46:29.9278782Z #define _GLIBCXX_USE_CXX11_ABI 1 2025-05-07T19:46:29.9279052Z #define _GLIBCXX_HAVE_SYMLINK 1 2025-05-07T19:46:29.9279291Z #define _BSD_SOURCE 1 2025-05-07T19:46:29.9279524Z #define _GLIBCXX_THROW(_EXC) 2025-05-07T19:46:29.9280365Z #define _GLIBCXX_HAS_NESTED_TYPE(_NTYPE) template> struct __has_ ##_NTYPE : false_type { }; template struct __has_ ##_NTYPE<_Tp, __void_t> : true_type { }; 2025-05-07T19:46:29.9281268Z #define __catch(X) catch(X) 2025-05-07T19:46:29.9281526Z #define __INT_LEAST32_MAX__ 0x7fffffff 2025-05-07T19:46:29.9281803Z #define LINE_MAX _POSIX2_LINE_MAX 2025-05-07T19:46:29.9282073Z #define __TIMER_T_TYPE void * 2025-05-07T19:46:29.9282307Z #define __STRING(x) #x 2025-05-07T19:46:29.9282552Z #define __GCC_ATOMIC_INT_LOCK_FREE 2 2025-05-07T19:46:29.9282807Z #define _T_PTRDIFF_ 2025-05-07T19:46:29.9283047Z #define _GLIBCXX_USE_NOEXCEPT noexcept 2025-05-07T19:46:29.9283345Z #define cudaEventWaitExternal 0x01 2025-05-07T19:46:29.9283606Z #define __unbounded 2025-05-07T19:46:29.9283846Z #define __DEVICE_ATOMIC_FUNCTIONS_H__ 2025-05-07T19:46:29.9284117Z #define __FLT128_MAX_EXP__ 16384 2025-05-07T19:46:29.9284400Z #define __INO_T_TYPE __SYSCALL_ULONG_TYPE 2025-05-07T19:46:29.9284687Z #define be16toh(x) __bswap_16 (x) 2025-05-07T19:46:29.9284961Z #define __cpp_lib_is_final 201402L 2025-05-07T19:46:29.9285385Z #define _GLIBCXX_BEGIN_NAMESPACE_CONTAINER 2025-05-07T19:46:29.9285708Z #define LONG_LONG_MIN (-LONG_LONG_MAX - 1LL) 2025-05-07T19:46:29.9286002Z #define __MATH_DECLARE_LDOUBLE 1 2025-05-07T19:46:29.9286280Z #define __managed__ __location__(managed) 2025-05-07T19:46:29.9286745Z #define _POSIX2_EXPR_NEST_MAX 32 2025-05-07T19:46:29.9287164Z #define __GNUC_PREREQ(maj,min) ((__GNUC__ << 16) + __GNUC_MINOR__ >= ((maj) << 16) + (min)) 2025-05-07T19:46:29.9287601Z #define _POSIX_STREAM_MAX 8 2025-05-07T19:46:29.9288021Z #define __LIBRARY_TYPES_H__ 2025-05-07T19:46:29.9288457Z #define _GLIBCXX_END_NAMESPACE_LDBL_OR_CXX11 _GLIBCXX_END_NAMESPACE_CXX11 2025-05-07T19:46:29.9288872Z #define __FLT32_MANT_DIG__ 24 2025-05-07T19:46:29.9289139Z #define _SYS_SIZE_T_H 2025-05-07T19:46:29.9289431Z #define _PSTL_VERSION_MINOR ((_PSTL_VERSION % 1000) / 10) 2025-05-07T19:46:29.9289795Z #define _GLIBCXX_STDLIB_H 1 2025-05-07T19:46:29.9290089Z #define isupper_l(c,l) __isupper_l ((c), (l)) 2025-05-07T19:46:29.9290474Z #define _CRTIMP 2025-05-07T19:46:29.9290716Z #define _GLIBCXX_CXX_CONFIG_H 1 2025-05-07T19:46:29.9291027Z #define __FLOAT_WORD_ORDER__ __ORDER_LITTLE_ENDIAN__ 2025-05-07T19:46:29.9291479Z #define STA_PPSJITTER 0x0200 2025-05-07T19:46:29.9291840Z #define _IO_feof_unlocked(__fp) (((__fp)->_flags & _IO_EOF_SEEN) != 0) 2025-05-07T19:46:29.9292283Z #define __SUSECONDS_T_TYPE __SYSCALL_SLONG_TYPE 2025-05-07T19:46:29.9292612Z #define _GLIBCXX_HAVE_ISINFF 1 2025-05-07T19:46:29.9292910Z #define __glibcxx_requires_subscript(_N) 2025-05-07T19:46:29.9293219Z #define __SIZE_T__ 2025-05-07T19:46:29.9293428Z #define __stub_gtty 2025-05-07T19:46:29.9293666Z #define __pid_t_defined 2025-05-07T19:46:29.9294036Z #define _GLIBCXX_FWDREF(_Tp) _Tp&& 2025-05-07T19:46:29.9294350Z #define __NLINK_T_TYPE __SYSCALL_ULONG_TYPE 2025-05-07T19:46:29.9294671Z #define __glibcxx_function_requires(...) 2025-05-07T19:46:29.9294985Z #define __SM_80_RT_HPP__ 2025-05-07T19:46:29.9295230Z #define __need_clockid_t 2025-05-07T19:46:29.9295490Z #define SSIZE_MAX LONG_MAX 2025-05-07T19:46:29.9295758Z #define _GLIBCXX_HAVE_USELOCALE 1 2025-05-07T19:46:29.9296100Z #define __glibcxx_requires_string_len(_String,_Len) 2025-05-07T19:46:29.9296444Z #define _IO_HEX 0100 2025-05-07T19:46:29.9296701Z #define __NFDBITS (8 * (int) sizeof (__fd_mask)) 2025-05-07T19:46:29.9297057Z #define cudaExternalMemoryDedicated 0x1 2025-05-07T19:46:29.9297372Z #define _GLIBCXX_HAVE_TGMATH_H 1 2025-05-07T19:46:29.9297663Z #define _GLIBCXX11_USE_C99_COMPLEX 1 2025-05-07T19:46:29.9298081Z #define _GLIBCXX17_DEPRECATED_SUGGEST(ALT) _GLIBCXX_DEPRECATED_SUGGEST(ALT) 2025-05-07T19:46:29.9298551Z #define ispunct_l(c,l) __ispunct_l ((c), (l)) 2025-05-07T19:46:29.9298873Z #define __cpp_aggregate_bases 201603L 2025-05-07T19:46:29.9299181Z #define __cudaGet_blockDim() blockDim 2025-05-07T19:46:29.9299286Z #define __cudaCDP2Memcpy3DAsync 2025-05-07T19:46:29.9299403Z #define __cudaCDP2MemcpyAsync 2025-05-07T19:46:29.9299487Z #define __stub_sstk 2025-05-07T19:46:29.9299579Z #define _IO_IN_BACKUP 0x100 2025-05-07T19:46:29.9299735Z #define _GLIBCXX_USE_C99_STDLIB _GLIBCXX11_USE_C99_STDLIB 2025-05-07T19:46:29.9299891Z #define __wur 2025-05-07T19:46:29.9300015Z #define isprint_l(c,l) __isprint_l ((c), (l)) 2025-05-07T19:46:29.9300102Z #define _G_HAVE_MMAP 1 2025-05-07T19:46:29.9300199Z #define _IO_OCT 040 2025-05-07T19:46:29.9300295Z #define __FLT128_HAS_DENORM__ 1 2025-05-07T19:46:29.9300385Z #define NL_MSGMAX INT_MAX 2025-05-07T19:46:29.9300478Z #define _GLIBCXX_USE_LFS 1 2025-05-07T19:46:29.9300624Z #define cudaDeviceScheduleBlockingSync 0x04 2025-05-07T19:46:29.9300722Z #define _POSIX_RTSIG_MAX 8 2025-05-07T19:46:29.9300825Z #define _GLIBCXX_NOEXCEPT noexcept 2025-05-07T19:46:29.9301036Z #define __glibcxx_requires_partitioned_lower(_First,_Last,_Value) 2025-05-07T19:46:29.9301131Z #define __FLT32_DECIMAL_DIG__ 9 2025-05-07T19:46:29.9301223Z #define _STL_ALGOBASE_H 1 2025-05-07T19:46:29.9301333Z #define __cudaCDP2MemsetAsync_ptsz 2025-05-07T19:46:29.9301443Z #define __off64_t_defined 2025-05-07T19:46:29.9301544Z #define _GLIBCXX_WEAK_DEFINITION 2025-05-07T19:46:29.9301635Z #define __FLT128_DIG__ 33 2025-05-07T19:46:29.9301754Z #define _GLIBCXX_USE_C99_INTTYPES_TR1 1 2025-05-07T19:46:29.9301854Z #define _GLIBCXX_HAVE_LOCALE_H 1 2025-05-07T19:46:29.9301939Z #define __INT32_C(c) c 2025-05-07T19:46:29.9302036Z #define __DEC64_EPSILON__ 1E-15DD 2025-05-07T19:46:29.9302148Z #define __ORDER_PDP_ENDIAN__ 3412 2025-05-07T19:46:29.9302241Z #define __DEC128_MIN_EXP__ (-6142) 2025-05-07T19:46:29.9302331Z #define __PDP_ENDIAN 3412 2025-05-07T19:46:29.9302543Z #define _ISOC95_SOURCE 1 2025-05-07T19:46:29.9302635Z #define _IO_fpos64_t _G_fpos64_t 2025-05-07T19:46:29.9302765Z #define M_PI_2l 1.570796326794896619231321691639751442L 2025-05-07T19:46:29.9302861Z #define BYTE_ORDER __BYTE_ORDER 2025-05-07T19:46:29.9302959Z #define __SM_90_RT_HPP__ 2025-05-07T19:46:29.9303056Z #define __INT_FAST32_TYPE__ long int 2025-05-07T19:46:29.9303149Z #define __have_pthread_attr_t 1 2025-05-07T19:46:29.9303260Z #define _GLIBCXX_HAVE_LIMIT_DATA 1 2025-05-07T19:46:29.9303486Z #define _GLIBCXX_BEGIN_NAMESPACE_LDBL_OR_CXX11 _GLIBCXX_BEGIN_NAMESPACE_CXX11 2025-05-07T19:46:29.9303596Z #define __cudaCDP2StreamWaitEvent 2025-05-07T19:46:29.9303767Z #define __cudaCDP2EventRecord 2025-05-07T19:46:29.9303877Z #define _BITS_TYPESIZES_H 1 2025-05-07T19:46:29.9303964Z #define htole32(x) (x) 2025-05-07T19:46:29.9304220Z #define __cudaCDP2OccupancyMaxActiveBlocksPerMultiprocessorWithFlags 2025-05-07T19:46:29.9304359Z #define __SYSCALL_SLONG_TYPE __SLONGWORD_TYPE 2025-05-07T19:46:29.9304459Z #define _GLIBCXX_USE_C99_MATH_TR1 1 2025-05-07T19:46:29.9304614Z #define WSTOPSIG(status) __WSTOPSIG (__WAIT_INT (status)) 2025-05-07T19:46:29.9304769Z #define _GLIBCXX_USE_C99_MATH _GLIBCXX11_USE_C99_MATH 2025-05-07T19:46:29.9304895Z #define __UINT_LEAST16_TYPE__ short unsigned int 2025-05-07T19:46:29.9305034Z #define __WIFEXITED(status) (__WTERMSIG(status) == 0) 2025-05-07T19:46:29.9305124Z #define ADJ_OFFSET 0x0001 2025-05-07T19:46:29.9305246Z #define cudaArrayLayered 0x01 2025-05-07T19:46:29.9305414Z #define _PSTL_ICC_18_OMP_SIMD_BROKEN (__INTEL_COMPILER == 1800) 2025-05-07T19:46:29.9305528Z #define cudaEventRecordDefault 0x00 2025-05-07T19:46:29.9305642Z #define _GLIBCXX_HAVE_FMODF 1 2025-05-07T19:46:29.9305742Z #define _PSTL_PRAGMA_MESSAGE(x) 2025-05-07T19:46:29.9305824Z #define unix 1 2025-05-07T19:46:29.9305917Z #define __DBL_HAS_DENORM__ 1 2025-05-07T19:46:29.9306026Z #define _POSIX_CHILD_MAX 25 2025-05-07T19:46:29.9306121Z #define _POSIX_MAX_INPUT 255 2025-05-07T19:46:29.9306237Z #define __cudaCDP2DeviceGetCacheConfig 2025-05-07T19:46:29.9306340Z #define __USE_POSIX 1 2025-05-07T19:46:29.9306435Z #define __FD_ZERO_STOS "stosq" 2025-05-07T19:46:29.9306566Z #define _PSTL_VERSION_MAJOR (_PSTL_VERSION / 1000) 2025-05-07T19:46:29.9306660Z #define __THROWNL throw () 2025-05-07T19:46:29.9306764Z #define __cpp_rtti 199711L 2025-05-07T19:46:29.9306866Z #define __SIZE_TYPE__ long unsigned int 2025-05-07T19:46:29.9306955Z #define __PMT(args) args 2025-05-07T19:46:29.9307083Z #define __UINT64_MAX__ 0xffffffffffffffffUL 2025-05-07T19:46:29.9307277Z #define __va_arg_pack_len() __builtin_va_arg_pack_len () 2025-05-07T19:46:29.9307398Z #define __ULONGWORD_TYPE unsigned long int 2025-05-07T19:46:29.9307486Z #define _SIZE_T_DECLARED 2025-05-07T19:46:29.9307595Z #define _PSTL_STRING_AUX(x) #x 2025-05-07T19:46:29.9307684Z #define __FLT_IS_IEC_60559__ 2 2025-05-07T19:46:29.9308088Z #define _PSTL_CPP14_MAKE_REVERSE_ITERATOR_PRESENT (_MSC_VER >= 1900 || __cplusplus >= 201402L || __cpp_lib_make_reverse_iterator == 201402) 2025-05-07T19:46:29.9308201Z #define _GLIBCXX_HAVE_LIMIT_AS 1 2025-05-07T19:46:29.9308294Z #define XATTR_LIST_MAX 65536 2025-05-07T19:46:29.9308386Z #define __CUDACC_VER_MAJOR__ 12 2025-05-07T19:46:29.9308541Z #define __GNUC_WIDE_EXECUTION_CHARSET_NAME "UTF-32LE" 2025-05-07T19:46:29.9308625Z #define _WCHAR_T_H 2025-05-07T19:46:29.9308710Z #define __FLT64X_DIG__ 18 2025-05-07T19:46:29.9308798Z #define _IO_SHOWBASE 0200 2025-05-07T19:46:29.9308901Z #define _POSIX_QLIMIT 1 2025-05-07T19:46:29.9308995Z #define __INT8_TYPE__ signed char 2025-05-07T19:46:29.9309090Z #define __SURFACE_TYPES_H__ 2025-05-07T19:46:29.9309190Z #define __CUDA_ARCH__ 520 2025-05-07T19:46:29.9309297Z #define __cpp_digit_separators 201309L 2025-05-07T19:46:29.9309375Z #define __ELF__ 1 2025-05-07T19:46:29.9309471Z #define CLOCK_THREAD_CPUTIME_ID 3 2025-05-07T19:46:29.9309579Z #define __GCC_ASM_FLAG_OUTPUTS__ 1 2025-05-07T19:46:29.9309664Z #define STA_INS 0x0010 2025-05-07T19:46:29.9309759Z #define __UINT32_TYPE__ unsigned int 2025-05-07T19:46:29.9309941Z #define _toupper(c) ((int) (*__ctype_toupper_loc ())[(int) (c)]) 2025-05-07T19:46:29.9310032Z #define _BITS_BYTESWAP_H 1 2025-05-07T19:46:29.9310123Z #define __ID_T_TYPE __U32_TYPE 2025-05-07T19:46:29.9310230Z #define __TIME_T_TYPE __SYSCALL_SLONG_TYPE 2025-05-07T19:46:29.9310348Z #define __DEVICE_DOUBLE_FUNCTIONS_HPP__ 2025-05-07T19:46:29.9310443Z #define _GLIBCXX_HAVE_MBSTATE_T 1 2025-05-07T19:46:29.9310543Z #define __cpp_lib_logical_traits 201510 2025-05-07T19:46:29.9310650Z #define ADJ_OFFSET_SS_READ 0xa001 2025-05-07T19:46:29.9310805Z #define __warnattr(msg) __attribute__((__warning__ (msg))) 2025-05-07T19:46:29.9311003Z #define _PSTL_PRAGMA_LOCATION " [Parallel STL message]: " 2025-05-07T19:46:29.9311100Z #define _IO_funlockfile(_fp) 2025-05-07T19:46:29.9311449Z #define cudaKernelNodeAttributeAccessPolicyWindow cudaLaunchAttributeAccessPolicyWindow 2025-05-07T19:46:29.9311579Z #define M_2_PIl 0.636619772367581343075535053490057448L 2025-05-07T19:46:29.9311672Z #define __DRIVER_TYPES_H__ 2025-05-07T19:46:29.9311771Z #define __FLT_RADIX__ 2 2025-05-07T19:46:29.9311871Z #define __INT_LEAST16_TYPE__ short int 2025-05-07T19:46:29.9312035Z #define __LDBL_EPSILON__ 1.08420217248550443400745280086994171e-19L 2025-05-07T19:46:29.9329367Z #define __UINTMAX_C(c) c ## UL 2025-05-07T19:46:29.9329565Z #define _GLIBCXX_USE_LSTAT 1 2025-05-07T19:46:29.9329676Z #define minor(dev) gnu_dev_minor (dev) 2025-05-07T19:46:29.9329777Z #define _POSIX_C_SOURCE 200809L 2025-05-07T19:46:29.9329876Z #define _GLIBCXX_HAVE_DIRENT_H 1 2025-05-07T19:46:29.9330011Z #define __GLIBCXX_BITSIZE_INT_N_0 128 2025-05-07T19:46:29.9330097Z #define WORD_BIT 32 2025-05-07T19:46:29.9330269Z #define _IO_USER_BUF 1 2025-05-07T19:46:29.9330383Z #define __VECTOR_TYPES_H__ 2025-05-07T19:46:29.9330489Z #define __SM_20_ATOMIC_FUNCTIONS_HPP__ 2025-05-07T19:46:29.9330599Z #define cudaHostAllocPortable 0x01 2025-05-07T19:46:29.9330699Z #define PTHREAD_STACK_MIN 16384 2025-05-07T19:46:29.9330808Z #define __long_double_t long double 2025-05-07T19:46:29.9330904Z #define _GLIBCXX_HAVE_ISINF 1 2025-05-07T19:46:29.9330994Z #define _POSIX_ARG_MAX 4096 2025-05-07T19:46:29.9331454Z #define cudaKernelNodeAttributeDeviceUpdatableKernelNode cudaLaunchAttributeDeviceUpdatableKernelNode 2025-05-07T19:46:29.9331533Z #define __k8 1 2025-05-07T19:46:29.9331734Z #define _GLIBCXX_NO_OBSOLETE_ISINF_ISNAN_DYNAMIC __GLIBC_PREREQ(2,23) 2025-05-07T19:46:29.9331922Z #define __FLT32X_MIN__ 2.22507385850720138309023271733240406e-308F32x 2025-05-07T19:46:29.9332036Z #define __LDBL_REDIR(name,proto) name proto 2025-05-07T19:46:29.9333044Z #define __SIG_ATOMIC_MAX__ 0x7fffffff 2025-05-07T19:46:29.9333150Z #define __SM_30_INTRINSICS_HPP__ 2025-05-07T19:46:29.9333267Z #define _GLIBCXX_EXTERN_TEMPLATE 1 2025-05-07T19:46:29.9333360Z #define __blksize_t_defined 2025-05-07T19:46:29.9333453Z #define _IO_SHOWPOINT 0400 2025-05-07T19:46:29.9333565Z #define _GLIBCXX_HAVE_LIMIT_RSS 1 2025-05-07T19:46:29.9333678Z #define cudaDeviceLmemResizeToMax 0x10 2025-05-07T19:46:29.9333773Z #define _GLIBCXX_X86_RDRAND 1 2025-05-07T19:46:29.9333880Z #define __GCC_ATOMIC_WCHAR_T_LOCK_FREE 2 2025-05-07T19:46:29.9333988Z #define _IO_IS_FILEBUF 0x2000 2025-05-07T19:46:29.9334084Z #define _GLIBCXX_USE_DUAL_ABI 1 2025-05-07T19:46:29.9334353Z #define __bswap_constant_16(x) ((unsigned short int) ((((x) >> 8) & 0xff) | (((x) & 0xff) << 8))) 2025-05-07T19:46:29.9334729Z #define cudaSignalExternalSemaphoresAsync __CUDART_API_PTSZ(cudaSignalExternalSemaphoresAsync_v2) 2025-05-07T19:46:29.9334830Z #define UCHAR_MAX (SCHAR_MAX * 2 + 1) 2025-05-07T19:46:29.9334928Z #define __SIZEOF_PTRDIFF_T__ 8 2025-05-07T19:46:29.9335026Z #define SEEK_SET 0 2025-05-07T19:46:29.9335129Z #define _GLIBCXX_TR1_GAMMA_TCC 1 2025-05-07T19:46:29.9335221Z #define __CUDA_API_VER_MINOR__ 6 2025-05-07T19:46:29.9335418Z #define _GLIBCXX_VISIBILITY(V) __attribute__ ((__visibility__ (#V))) 2025-05-07T19:46:29.9335533Z #define _GLIBCXX20_DEPRECATED(MSG) 2025-05-07T19:46:29.9335635Z #define __cudaCDP2GetLastError 2025-05-07T19:46:29.9335727Z #define _GLIBCXX_HAVE_COSL 1 2025-05-07T19:46:29.9335831Z #define _MATH_H_MATHDEF 1 2025-05-07T19:46:29.9336167Z #define __bswap_constant_32(x) ((((x) & 0xff000000) >> 24) | (((x) & 0x00ff0000) >> 8) | (((x) & 0x0000ff00) << 8) | (((x) & 0x000000ff) << 24)) 2025-05-07T19:46:29.9336263Z #define _GLIBCXX_USE_FLOAT128 1 2025-05-07T19:46:29.9336359Z #define _IO_FLAGS2_NOTCANCEL 2 2025-05-07T19:46:29.9336463Z #define __stub_sigreturn 2025-05-07T19:46:29.9336714Z #define __errordecl(name,msg) extern void name (void) __attribute__((__error__ (msg))) 2025-05-07T19:46:29.9336811Z #define _GLIBCXX_HAVE_UTIME_H 1 2025-05-07T19:46:29.9336994Z #define __HOST_CONFIG_H__ 2025-05-07T19:46:29.9337095Z #define _XOPEN_SOURCE_EXTENDED 1 2025-05-07T19:46:29.9337179Z #define CLOCK_TAI 11 2025-05-07T19:46:29.9337284Z #define _GLIBCXX_END_NAMESPACE_VERSION 2025-05-07T19:46:29.9337384Z #define __restrict_arr 2025-05-07T19:46:29.9337498Z #define _PSTL_PRAGMA_MESSAGE_POLICIES(x) 2025-05-07T19:46:29.9337637Z #define __glibcxx_requires_valid_range(_First,_Last) 2025-05-07T19:46:29.9338206Z #define strndupa(s,n) (__extension__ ({ const char *__old = (s); size_t __len = strnlen (__old, (n)); char *__new = (char *) __builtin_alloca (__len + 1); __new[__len] = '\0'; (char *) memcpy (__new, __old, __len); })) 2025-05-07T19:46:29.9338393Z #define __attribute_artificial__ __attribute__ ((__artificial__)) 2025-05-07T19:46:29.9338479Z #define __USE_MISC 1 2025-05-07T19:46:29.9338596Z #define __UWORD_TYPE unsigned long int 2025-05-07T19:46:29.9338692Z #define _EXCEPTION_DEFINES_H 1 2025-05-07T19:46:29.9338783Z #define _GCC_LIMITS_H_ 2025-05-07T19:46:29.9338867Z #define __LDBL_DIG__ 18 2025-05-07T19:46:29.9338974Z #define __BIT_TYPES_DEFINED__ 1 2025-05-07T19:46:29.9339074Z #define __malloc_and_calloc_defined 2025-05-07T19:46:29.9339165Z #define __FLT64_IS_IEC_60559__ 2 2025-05-07T19:46:29.9339276Z #define _GLIBCXX_HAVE_SYS_SYSINFO_H 1 2025-05-07T19:46:29.9339356Z #define __x86_64__ 1 2025-05-07T19:46:29.9339438Z #define _SIZE_T_ 2025-05-07T19:46:29.9340410Z #define __bswap_constant_64(x) (__extension__ ((((x) & 0xff00000000000000ull) >> 56) | (((x) & 0x00ff000000000000ull) >> 40) | (((x) & 0x0000ff0000000000ull) >> 24) | (((x) & 0x000000ff00000000ull) >> 8) | (((x) & 0x00000000ff000000ull) << 8) | (((x) & 0x0000000000ff0000ull) << 24) | (((x) & 0x000000000000ff00ull) << 40) | (((x) & 0x00000000000000ffull) << 56))) 2025-05-07T19:46:29.9340512Z #define _POSIX2_COLL_WEIGHTS_MAX 2 2025-05-07T19:46:29.9340607Z #define __FLT32X_MIN_EXP__ (-1021) 2025-05-07T19:46:29.9340735Z #define __PTHREAD_RWLOCK_INT_FLAGS_SHARED 1 2025-05-07T19:46:29.9340901Z #define __DEC32_SUBNORMAL_MIN__ 0.000001E-95DF 2025-05-07T19:46:29.9340998Z #define _IO_iconv_t _G_iconv_t 2025-05-07T19:46:29.9341106Z #define _GLIBCXX_FLOAT_IS_IEEE_BINARY32 1 2025-05-07T19:46:29.9341240Z #define __cpp_lib_make_reverse_iterator 201402 2025-05-07T19:46:29.9341383Z #define _GLIBCXX_SYNCHRONIZATION_HAPPENS_BEFORE(A) 2025-05-07T19:46:29.9341480Z #define _GLIBCXX_HAVE_DLFCN_H 1 2025-05-07T19:46:29.9341990Z #define strdupa(s) (__extension__ ({ const char *__old = (s); size_t __len = strlen (__old) + 1; char *__new = (char *) __builtin_alloca (__len); (char *) memcpy (__new, __old, __len); })) 2025-05-07T19:46:29.9342217Z #define __no_return__ __attribute__((noreturn)) 2025-05-07T19:46:29.9342351Z #define __device_builtin__ __location__(device_builtin) 2025-05-07T19:46:29.9342444Z #define _PSTL_HIDE_FROM_ABI_POP 2025-05-07T19:46:29.9342546Z #define _GLIBCXX_HAVE_ACOSF 1 2025-05-07T19:46:29.9342628Z #define STA_FLL 0x0008 2025-05-07T19:46:29.9342764Z #define _GLIBCXX_HAVE_BUILTIN_IS_CONSTANT_EVALUATED 1 2025-05-07T19:46:29.9342872Z #define _GLIBCXX_END_EXTERN_C } 2025-05-07T19:46:29.9342980Z #define __INT_FAST16_MAX__ 0x7fffffffffffffffL 2025-05-07T19:46:29.9343080Z #define __cpp_lib_integer_sequence 201304 2025-05-07T19:46:29.9343174Z #define __stub_revoke 2025-05-07T19:46:29.9343258Z #define __timer_t_defined 1 2025-05-07T19:46:29.9343383Z #define _GLIBCXX11_DEPRECATED _GLIBCXX_DEPRECATED 2025-05-07T19:46:29.9343466Z #define INT_MAX __INT_MAX__ 2025-05-07T19:46:29.9343576Z #define ULLONG_MAX (LLONG_MAX * 2ULL + 1) 2025-05-07T19:46:29.9343672Z #define _GLIBCXX_END_NAMESPACE_CXX11 } 2025-05-07T19:46:29.9343759Z #define _GLIBCXX_ICONV_CONST 2025-05-07T19:46:29.9343865Z #define major(dev) gnu_dev_major (dev) 2025-05-07T19:46:29.9343964Z #define cudaArrayTextureGather 0x08 2025-05-07T19:46:29.9344053Z #define _GLIBCXX_LT_OBJDIR ".libs/" 2025-05-07T19:46:29.9344189Z #define __inline_hint__ __attribute__((nv_inline_hint)) 2025-05-07T19:46:29.9344294Z #define __NV_LEGACY_LAUNCH 1 2025-05-07T19:46:29.9344377Z #define _IO_off_t __off_t 2025-05-07T19:46:29.9344506Z #define __FLT64_DIG__ 15 2025-05-07T19:46:29.9344733Z #define PTHREAD_DESTRUCTOR_ITERATIONS _POSIX_THREAD_DESTRUCTOR_ITERATIONS 2025-05-07T19:46:29.9344823Z #define _POSIX2_LINE_MAX 2048 2025-05-07T19:46:29.9344940Z #define __UINT_FAST32_MAX__ 0xffffffffffffffffUL 2025-05-07T19:46:29.9345054Z #define __UINT_LEAST64_TYPE__ long unsigned int 2025-05-07T19:46:29.9345155Z #define ADJ_FREQUENCY 0x0002 2025-05-07T19:46:29.9345249Z #define __CUDART_API_PTDS(api) api 2025-05-07T19:46:29.9345326Z #define NULL __null 2025-05-07T19:46:29.9345460Z #define cudaStreamPerThread ((cudaStream_t)0x2) 2025-05-07T19:46:29.9345555Z #define _GLIBCXX_CONSTEXPR constexpr 2025-05-07T19:46:29.9345643Z #define __U64_TYPE unsigned long int 2025-05-07T19:46:29.9345729Z #define __FLT_HAS_QUIET_NAN__ 1 2025-05-07T19:46:29.9345829Z #define __FLT_MAX_10_EXP__ 38 2025-05-07T19:46:29.9345907Z #define FP_ZERO 2 2025-05-07T19:46:29.9345996Z #define _GLIBCXX_HAVE_FLOORL 1 2025-05-07T19:46:29.9346152Z #define __isgraph_l(c,l) __isctype_l((c), _ISgraph, (l)) 2025-05-07T19:46:29.9346254Z #define __LONG_MAX__ 0x7fffffffffffffffL 2025-05-07T19:46:29.9346330Z #define __WCHAR_T__ 2025-05-07T19:46:29.9346415Z #define __FLT64X_HAS_DENORM__ 1 2025-05-07T19:46:29.9346614Z #define __DEC128_SUBNORMAL_MIN__ 0.000000000000000000000000000000001E-6143DL 2025-05-07T19:46:29.9346922Z #define _GLIBCXX_NORETURN __attribute__ ((__noreturn__)) 2025-05-07T19:46:29.9347013Z #define __FLT_HAS_INFINITY__ 1 2025-05-07T19:46:29.9347143Z #define __GNUC_EXECUTION_CHARSET_NAME "UTF-8" 2025-05-07T19:46:29.9347255Z #define _GLIBCXX20_DEPRECATED_SUGGEST(ALT) 2025-05-07T19:46:29.9347382Z #define __WSTOPSIG(status) __WEXITSTATUS(status) 2025-05-07T19:46:29.9347519Z #define cudaSurfaceTypeCubemapLayered 0xFC 2025-05-07T19:46:29.9347608Z #define _BSD_PTRDIFF_T_ 2025-05-07T19:46:29.9347696Z #define _SIGSET_H_types 1 2025-05-07T19:46:29.9347801Z #define cudaTextureType1DLayered 0xF1 2025-05-07T19:46:29.9348130Z #define __cpp_unicode_literals 200710L 2025-05-07T19:46:29.9348283Z #define __isdigit_l(c,l) __isctype_l((c), _ISdigit, (l)) 2025-05-07T19:46:29.9348385Z #define __LONG_LONG_PAIR(HI,LO) LO, HI 2025-05-07T19:46:29.9348554Z #define __UINT_FAST16_TYPE__ long unsigned int 2025-05-07T19:46:29.9348686Z #define __bos0(ptr) __builtin_object_size (ptr, 0) 2025-05-07T19:46:29.9348790Z #define __DEC64_MAX__ 9.999999999999999E384DD 2025-05-07T19:46:29.9348921Z #define M_1_PIl 0.318309886183790671537767526745028724L 2025-05-07T19:46:29.9349105Z #define WIFSTOPPED(status) __WIFSTOPPED (__WAIT_INT (status)) 2025-05-07T19:46:29.9349197Z #define __INT_FAST32_WIDTH__ 64 2025-05-07T19:46:29.9349300Z #define _POSIX2_CHARCLASS_NAME_MAX 14 2025-05-07T19:46:29.9349410Z #define _GLIBCXX_BITS_STD_ABS_H 2025-05-07T19:46:29.9349500Z #define STA_MODE 0x4000 2025-05-07T19:46:29.9349608Z #define __CHAR16_TYPE__ short unsigned int 2025-05-07T19:46:29.9349722Z #define __PRAGMA_REDEFINE_EXTNAME 1 2025-05-07T19:46:29.9349836Z #define __glibcxx_signed_b(T,B) ((T)(-1) < 0) 2025-05-07T19:46:29.9349938Z #define __USING_NAMESPACE_C99(name) 2025-05-07T19:46:29.9350031Z #define BIG_ENDIAN __BIG_ENDIAN 2025-05-07T19:46:29.9350149Z #define __cudaCDP2EventRecord_ptsz 2025-05-07T19:46:29.9350239Z #define _GLIBCXX_HAVE_SINL 1 2025-05-07T19:46:29.9350351Z #define EXPR_NEST_MAX _POSIX2_EXPR_NEST_MAX 2025-05-07T19:46:29.9350458Z #define __SIZE_WIDTH__ 64 2025-05-07T19:46:29.9350575Z #define __BLKSIZE_T_TYPE __SYSCALL_SLONG_TYPE 2025-05-07T19:46:29.9350654Z #define __SEG_FS 1 2025-05-07T19:46:29.9350743Z #define _IO_size_t size_t 2025-05-07T19:46:29.9350848Z #define __INT_LEAST16_MAX__ 0x7fff 2025-05-07T19:46:29.9350944Z #define INT_MIN (-INT_MAX - 1) 2025-05-07T19:46:29.9351029Z #define __stub_lchmod 2025-05-07T19:46:29.9351140Z #define __DEC64_MANT_DIG__ 16 2025-05-07T19:46:29.9351243Z #define __INT64_MAX__ 0x7fffffffffffffffL 2025-05-07T19:46:29.9351337Z #define _GLIBCXX_MANGLE_SIZE_T m 2025-05-07T19:46:29.9351419Z #define __SEG_GS 1 2025-05-07T19:46:29.9351684Z #define __FLT32_DENORM_MIN__ 1.40129846432481707092372958328991613e-45F32 2025-05-07T19:46:29.9351772Z #define _IOS_APPEND 8 2025-05-07T19:46:29.9351866Z #define __SIG_ATOMIC_WIDTH__ 32 2025-05-07T19:46:29.9351969Z #define _GLIBCXX_RELEASE 11 2025-05-07T19:46:29.9352063Z #define _GLIBCXX98_USE_C99_WCHAR 1 2025-05-07T19:46:29.9352158Z #define _IO_IS_APPENDING 0x1000 2025-05-07T19:46:29.9352256Z #define __INT_LEAST64_TYPE__ long int 2025-05-07T19:46:29.9352355Z #define htole16(x) (x) 2025-05-07T19:46:29.9352461Z #define __TEXTURE_INDIRECT_FUNCTIONS_H__ 2025-05-07T19:46:29.9352556Z #define _GLIBCXX_HAVE_FCNTL_H 1 2025-05-07T19:46:29.9352663Z #define __INT16_TYPE__ short int 2025-05-07T19:46:29.9352764Z #define __INT_LEAST8_TYPE__ signed char 2025-05-07T19:46:29.9352869Z #define __glibcxx_class_requires(_a,_b) 2025-05-07T19:46:29.9352977Z #define __cpp_structured_bindings 201606L 2025-05-07T19:46:29.9353115Z #define __align__(n) __attribute__((aligned(n))) 2025-05-07T19:46:29.9353204Z #define __SIZEOF_INT__ 4 2025-05-07T19:46:29.9353292Z #define __WCLONE 0x80000000 2025-05-07T19:46:29.9353401Z #define __DEC32_MAX_EXP__ 97 2025-05-07T19:46:29.9353482Z #define SEEK_HOLE 4 2025-05-07T19:46:29.9353566Z #define TIMER_ABSTIME 1 2025-05-07T19:46:29.9353656Z #define __INT_FAST8_MAX__ 0x7f 2025-05-07T19:46:29.9353757Z #define __CUDA_MATH_CRTIMP 2025-05-07T19:46:29.9353931Z #define __FLT128_MAX__ 1.18973149535723176508575932662800702e+4932F128 2025-05-07T19:46:29.9354045Z #define __INTPTR_MAX__ 0x7fffffffffffffffL 2025-05-07T19:46:29.9354151Z #define __DRIVER_FUNCTIONS_H__ 2025-05-07T19:46:29.9354258Z #define __cpp_sized_deallocation 201309L 2025-05-07T19:46:29.9354350Z #define __MATH_FUNCTIONS_HPP__ 2025-05-07T19:46:29.9354469Z #define __cpp_guaranteed_copy_elision 201606L 2025-05-07T19:46:29.9354571Z #define _LINUX_LIMITS_H 2025-05-07T19:46:29.9354648Z #define linux 1 2025-05-07T19:46:29.9354738Z #define MOD_MICRO ADJ_MICRO 2025-05-07T19:46:29.9354862Z #define _GLIBCXX_DEBUG_ASSERT(_Condition) 2025-05-07T19:46:29.9355010Z #define _GLIBCXX_HAVE_VSWSCANF 1 2025-05-07T19:46:29.9355106Z #define _GLIBCXX_HAVE_ISNAN 1 2025-05-07T19:46:29.9355207Z #define _XOPEN_IOV_MAX _POSIX_UIO_MAXIOV 2025-05-07T19:46:29.9355368Z #define __cudart_builtin__ __location__(cudart_builtin) 2025-05-07T19:46:29.9355464Z #define __cpp_lib_hypot 201603 2025-05-07T19:46:29.9355556Z #define __FLT64_HAS_QUIET_NAN__ 1 2025-05-07T19:46:29.9355668Z #define _GLIBCXX_HAVE_WCTYPE_H 1 2025-05-07T19:46:29.9355759Z #define MOD_NANO ADJ_NANO 2025-05-07T19:46:29.9355843Z #define htole64(x) (x) 2025-05-07T19:46:29.9355957Z #define FP_ILOGBNAN (-2147483647 - 1) 2025-05-07T19:46:29.9356082Z #define _IO_stdout ((_IO_FILE*)(&_IO_2_1_stdout_)) 2025-05-07T19:46:29.9356175Z #define _IO_UPPERCASE 01000 2025-05-07T19:46:29.9356711Z #define cudaKernelNodeAttributeClusterSchedulingPolicyPreference cudaLaunchAttributeClusterSchedulingPolicyPreference 2025-05-07T19:46:29.9356809Z #define __USE_POSIX2 1 2025-05-07T19:46:29.9356911Z #define MOD_ESTERROR ADJ_ESTERROR 2025-05-07T19:46:29.9356999Z #define __WALL 0x40000000 2025-05-07T19:46:29.9357111Z #define _GLIBCXX_HAVE_LDEXPF 1 2025-05-07T19:46:29.9357196Z #define _XLOCALE_H 1 2025-05-07T19:46:29.9357291Z #define _GLIBCXX_USE_TMPNAM 1 2025-05-07T19:46:29.9357382Z #define __FLT32_MIN_10_EXP__ (-37) 2025-05-07T19:46:29.9357488Z #define __KEY_T_TYPE __S32_TYPE 2025-05-07T19:46:29.9357591Z #define __cudaGet_threadIdx() threadIdx 2025-05-07T19:46:29.9357680Z #define __EXCEPTIONS 1 2025-05-07T19:46:29.9357793Z #define __CUDART_API_PTSZ(api) api 2025-05-07T19:46:29.9357986Z #define __launch_bounds__(...) __annotate__(launch_bounds(__VA_ARGS__)) 2025-05-07T19:46:29.9358072Z #define __WORDSIZE 64 2025-05-07T19:46:29.9358163Z #define CLOCK_MONOTONIC 1 2025-05-07T19:46:29.9358264Z #define _STL_RELOPS_H 1 2025-05-07T19:46:29.9358361Z #define __PTRDIFF_WIDTH__ 64 2025-05-07T19:46:29.9358563Z #define __BEGIN_DECLS extern "C" { 2025-05-07T19:46:29.9358665Z #define _GLIBCXX_HAVE_SYS_IPC_H 1 2025-05-07T19:46:29.9358758Z #define __LDBL_MANT_DIG__ 64 2025-05-07T19:46:29.9358851Z #define _GLIBCXX_HAVE_TRUNCATE 1 2025-05-07T19:46:29.9359214Z #define cudaKernelNodeAttributeClusterDimension cudaLaunchAttributeClusterDimension 2025-05-07T19:46:29.9359462Z #define _PSTL_GCC_VERSION (__GNUC__ * 10000 + __GNUC_MINOR__ * 100 + __GNUC_PATCHLEVEL__) 2025-05-07T19:46:29.9359583Z #define _GLIBCXX_NAMESPACE_CXX11 __cxx11:: 2025-05-07T19:46:29.9359785Z #define _GLIBCXX_NUMERIC_LIMITS 1 2025-05-07T19:46:29.9359888Z #define __cpp_range_based_for 201603L 2025-05-07T19:46:29.9359987Z #define __cpp_lib_exchange_function 201304 2025-05-07T19:46:29.9360078Z #define _GLIBCXX_HAVE_INTTYPES_H 1 2025-05-07T19:46:29.9360190Z #define _GLIBCXX_DARWIN_USE_64_BIT_INODE 1 2025-05-07T19:46:29.9360364Z #define cudaCooperativeLaunchMultiDeviceNoPostSync 0x02 2025-05-07T19:46:29.9360454Z #define __FLT64_HAS_INFINITY__ 1 2025-05-07T19:46:29.9360540Z #define _GLIBCXX_CSTDLIB 1 2025-05-07T19:46:29.9360649Z #define _GLIBCXX_DEBUG_MACRO_SWITCH_H 1 2025-05-07T19:46:29.9360810Z #define __FLT64X_MAX__ 1.18973149535723176502126385303097021e+4932F64x 2025-05-07T19:46:29.9360936Z #define __STDCPP_DEFAULT_NEW_ALIGNMENT__ 16 2025-05-07T19:46:29.9361015Z #define _STRING_H 1 2025-05-07T19:46:29.9361107Z #define _BITS_PTHREADTYPES_H 1 2025-05-07T19:46:29.9361188Z #define _GCC_MAX_ALIGN_T 2025-05-07T19:46:29.9361285Z #define __SM_32_INTRINSICS_HPP__ 2025-05-07T19:46:29.9361410Z #define __SIG_ATOMIC_MIN__ (-__SIG_ATOMIC_MAX__ - 1) 2025-05-07T19:46:29.9361498Z #define __code_model_small__ 1 2025-05-07T19:46:29.9361587Z #define _PSTL_CONFIG_H 2025-05-07T19:46:29.9361681Z #define __GCC_ATOMIC_LONG_LOCK_FREE 2 2025-05-07T19:46:29.9361787Z #define __cpp_nontype_template_args 201411L 2025-05-07T19:46:29.9361875Z #define __SM_20_INTRINSICS_H__ 2025-05-07T19:46:29.9361975Z #define cudaCpuDeviceId ((int)-1) 2025-05-07T19:46:29.9362303Z #define assert(expr) ((expr) ? __ASSERT_VOID_CAST (0) : __assert_fail (__STRING(expr), __FILE__, __LINE__, __ASSERT_FUNCTION)) 2025-05-07T19:46:29.9362441Z #define __DEC32_MANT_DIG__ 7 2025-05-07T19:46:29.9362531Z #define le64toh(x) (x) 2025-05-07T19:46:29.9362619Z #define FILENAME_MAX 4096 2025-05-07T19:46:29.9362762Z #define __iscntrl_l(c,l) __isctype_l((c), _IScntrl, (l)) 2025-05-07T19:46:29.9362879Z #define __cpp_return_type_deduction 201304L 2025-05-07T19:46:29.9362955Z #define L_cuserid 9 2025-05-07T19:46:29.9363035Z #define __ino_t_defined 2025-05-07T19:46:29.9363109Z #define __k8__ 1 2025-05-07T19:46:29.9363211Z #define __INTPTR_TYPE__ long int 2025-05-07T19:46:29.9363310Z #define __UINT16_TYPE__ short unsigned int 2025-05-07T19:46:29.9363392Z #define __int8_t_defined 2025-05-07T19:46:29.9363492Z #define __WCHAR_TYPE__ int 2025-05-07T19:46:29.9363585Z #define __CLOCKID_T_TYPE __S32_TYPE 2025-05-07T19:46:29.9363689Z #define cudaHostRegisterPortable 0x01 2025-05-07T19:46:29.9363781Z #define __SLONGWORD_TYPE long int 2025-05-07T19:46:29.9363873Z #define _IOS_TRUNC 16 2025-05-07T19:46:29.9363982Z #define _GLIBCXX_PACKAGE_TARNAME "libstdc++" 2025-05-07T19:46:29.9364125Z #define __isblank_l(c,l) __isctype_l((c), _ISblank, (l)) 2025-05-07T19:46:29.9364225Z #define __HAVE_COLUMN 2025-05-07T19:46:29.9364301Z #define __stub_fdetach 2025-05-07T19:46:29.9364699Z #define __CUDACC_VER__ "__CUDACC_VER__ is no longer supported. Use __CUDACC_VER_MAJOR__, __CUDACC_VER_MINOR__, and __CUDACC_VER_BUILD__ instead." 2025-05-07T19:46:29.9364774Z #define __pic__ 2 2025-05-07T19:46:29.9364897Z #define __UINTPTR_MAX__ 0xffffffffffffffffUL 2025-05-07T19:46:29.9364985Z #define CLOCKS_PER_SEC 1000000l 2025-05-07T19:46:29.9365070Z #define __INT_FAST64_WIDTH__ 64 2025-05-07T19:46:29.9365174Z #define _GLIBCXX_HAVE_SOCKATMARK 1 2025-05-07T19:46:29.9365253Z #define __stub_chflags 2025-05-07T19:46:29.9365334Z #define CLOCK_BOOTTIME 7 2025-05-07T19:46:29.9365411Z #define __need_IOV_MAX 2025-05-07T19:46:29.9365525Z #define putc(_ch,_fp) _IO_putc (_ch, _fp) 2025-05-07T19:46:29.9365620Z #define __UQUAD_TYPE unsigned long int 2025-05-07T19:46:29.9365709Z #define __cpp_decltype 200707L 2025-05-07T19:46:29.9365815Z #define __BYTE_ORDER __LITTLE_ENDIAN 2025-05-07T19:46:29.9365945Z #define _GLIBCXX_USE_C99 1 2025-05-07T19:46:29.9366044Z #define _GLIBCXX_TR1_BETA_FUNCTION_TCC 1 2025-05-07T19:46:29.9366124Z #define TTY_NAME_MAX 32 2025-05-07T19:46:29.9366294Z #define _GLIBCXX_FORWARD(_Tp,__val) std::forward<_Tp>(__val) 2025-05-07T19:46:29.9366405Z #define __INT_FAST64_MAX__ 0x7fffffffffffffffL 2025-05-07T19:46:29.9366560Z #define _PSTL_ASSERT(_Condition) __glibcxx_assert(_Condition) 2025-05-07T19:46:29.9366676Z #define __GCC_ATOMIC_TEST_AND_SET_TRUEVAL 1 2025-05-07T19:46:29.9366761Z #define __LITTLE_ENDIAN 1234 2025-05-07T19:46:29.9366848Z #define STA_PPSTIME 0x0004 2025-05-07T19:46:29.9366935Z #define __import__ 2025-05-07T19:46:29.9367019Z #define BUFSIZ _IO_BUFSIZ 2025-05-07T19:46:29.9367144Z #define M_SQRT2l 1.414213562373095048801688724209698079L 2025-05-07T19:46:29.9367220Z #define __export__ 2025-05-07T19:46:29.9367340Z #define __FSID_T_TYPE struct { int __val[2]; } 2025-05-07T19:46:29.9367438Z #define cudaMemAttachHost 0x02 2025-05-07T19:46:29.9367591Z #define __FLT_NORM_MAX__ 3.40282346638528859811704183484516925e+38F 2025-05-07T19:46:29.9367691Z #define _GLIBCXX_HAVE_ICONV 1 2025-05-07T19:46:29.9367774Z #define _GLIBCXX_SYMVER 1 2025-05-07T19:46:29.9367861Z #define __FLT64X_MAX_EXP__ 16384 2025-05-07T19:46:29.9367944Z #define _WCHAR_T_DECLARED 2025-05-07T19:46:29.9368066Z #define __UINT_FAST64_TYPE__ long unsigned int 2025-05-07T19:46:29.9368174Z #define isalpha_l(c,l) __isalpha_l ((c), (l)) 2025-05-07T19:46:29.9368268Z #define __cpp_inline_variables 201606L 2025-05-07T19:46:29.9368364Z #define WNOWAIT 0x01000000 2025-05-07T19:46:29.9368438Z #define PLOSS 6 2025-05-07T19:46:29.9368523Z #define M_LN10 2.30258509299404568402 2025-05-07T19:46:29.9368776Z #define _PSTL_UDS_PRESENT (__INTEL_COMPILER >= 1900 && __INTEL_COMPILER_BUILD_DATE >= 20180626) 2025-05-07T19:46:29.9368868Z #define EXIT_SUCCESS 0 2025-05-07T19:46:29.9368958Z #define __LDBL_REDIR_DECL(name) 2025-05-07T19:46:29.9369090Z #define _GLIBCXX_HAVE_STRTOF 1 2025-05-07T19:46:29.9369195Z #define MOD_FREQUENCY ADJ_FREQUENCY 2025-05-07T19:46:29.9369281Z #define __thread__ __thread 2025-05-07T19:46:29.9369370Z #define _GLIBCXX_HAVE_MEMORY_H 1 2025-05-07T19:46:29.9369454Z #define __INT_MAX__ 0x7fffffff 2025-05-07T19:46:29.9369559Z #define __SIZEOF_PTHREAD_BARRIER_T 32 2025-05-07T19:46:29.9369775Z #define __glibcxx_requires_partitioned_upper_pred(_First,_Last,_Value,_Pred) 2025-05-07T19:46:29.9369885Z #define __cudaCDP2StreamWaitEvent_ptsz 2025-05-07T19:46:29.9369986Z #define _GLIBCXX_HAVE_SINF 1 2025-05-07T19:46:29.9370061Z #define __linux__ 1 2025-05-07T19:46:29.9370149Z #define STA_PPSSIGNAL 0x0100 2025-05-07T19:46:29.9370355Z #define M_LN2l 0.693147180559945309417232121458176568L 2025-05-07T19:46:29.9370442Z #define __S16_TYPE short int 2025-05-07T19:46:29.9370964Z #define __glibcxx_constexpr_assert(cond) if (__builtin_is_constant_evaluated() && !bool(cond)) __builtin_unreachable() 2025-05-07T19:46:29.9371076Z #define __NVCC_DIAG_PRAGMA_SUPPORT__ 1 2025-05-07T19:46:29.9371330Z #define __bos(ptr) __builtin_object_size (ptr, __USE_FORTIFY_LEVEL > 1) 2025-05-07T19:46:29.9371432Z #define __COMMON_FUNCTIONS_H__ 2025-05-07T19:46:29.9371530Z #define UINT_MAX (INT_MAX * 2U + 1U) 2025-05-07T19:46:29.9371623Z #define _T_SIZE_ 2025-05-07T19:46:29.9371719Z #define LLONG_MAX __LONG_LONG_MAX__ 2025-05-07T19:46:29.9371838Z #define __cudaCDP2StreamCreateWithFlags 2025-05-07T19:46:29.9371931Z #define _PSTL_VERSION 12000 2025-05-07T19:46:29.9372061Z #define __noinline__ __attribute__((noinline)) 2025-05-07T19:46:29.9372155Z #define __WNOTHREAD 0x20000000 2025-05-07T19:46:29.9372248Z #define _G_va_list __gnuc_va_list 2025-05-07T19:46:29.9372391Z #define M_PI_4l 0.785398163397448309615660845819875721L 2025-05-07T19:46:29.9372473Z #define _IOS_INPUT 1 2025-05-07T19:46:29.9372565Z #define __USE_LARGEFILE64 1 2025-05-07T19:46:29.9372667Z #define _GLIBCXX_TR1_EXP_INTEGRAL_TCC 1 2025-05-07T19:46:29.9372769Z #define __INT64_TYPE__ long int 2025-05-07T19:46:29.9372867Z #define _POSIX_SSIZE_MAX 32767 2025-05-07T19:46:29.9373023Z #define __shared__ __location__(shared) 2025-05-07T19:46:29.9373128Z #define __FLT_MAX_EXP__ 128 2025-05-07T19:46:29.9373282Z #define __glibc_unlikely(cond) __builtin_expect((cond), 0) 2025-05-07T19:46:29.9373371Z #define __gid_t_defined 2025-05-07T19:46:29.9373482Z #define _GLIBCXX_USE_SC_NPROCESSORS_ONLN 1 2025-05-07T19:46:29.9373589Z #define __ORDER_BIG_ENDIAN__ 4321 2025-05-07T19:46:29.9373792Z #define __glibcxx_requires_can_increment_range(_First1,_Last1,_First2) 2025-05-07T19:46:29.9373887Z #define _GLIBCXX17_INLINE inline 2025-05-07T19:46:29.9373987Z #define __DBL_MANT_DIG__ 53 2025-05-07T19:46:29.9374067Z #define ___int_size_t_h 2025-05-07T19:46:29.9374172Z #define __FSBLKCNT64_T_TYPE __UQUAD_TYPE 2025-05-07T19:46:29.9374308Z #define __cpp_inheriting_constructors 201511L 2025-05-07T19:46:29.9374463Z #define __WIFCONTINUED(status) ((status) == __W_CONTINUED) 2025-05-07T19:46:29.9374561Z #define CUDA_DOUBLE_MATH_FUNCTIONS 1 2025-05-07T19:46:29.9374659Z #define _GLIBCXX_HAVE_FENV_H 1 2025-05-07T19:46:29.9374772Z #define _GLIBCXX_HAVE_STDBOOL_H 1 2025-05-07T19:46:29.9374863Z #define __SIZEOF_FLOAT128__ 16 2025-05-07T19:46:29.9374987Z #define __INT_LEAST64_MAX__ 0x7fffffffffffffffL 2025-05-07T19:46:29.9375112Z #define _GLIBCXX_TR1_HYPERGEOMETRIC_TCC 1 2025-05-07T19:46:29.9375232Z #define _GLIBCXX_DEBUG_PEDASSERT(_Condition) 2025-05-07T19:46:29.9375323Z #define __clock_t_defined 1 2025-05-07T19:46:29.9375420Z #define _POSIX_SEM_VALUE_MAX 32767 2025-05-07T19:46:29.9375541Z #define __cudaCDP2RuntimeGetVersion 2025-05-07T19:46:29.9375632Z #define __GLIBC_MINOR__ 17 2025-05-07T19:46:29.9375722Z #define __DEC64_MIN__ 1E-383DD 2025-05-07T19:46:29.9375834Z #define __WINT_TYPE__ unsigned int 2025-05-07T19:46:29.9375941Z #define __UINT_LEAST32_TYPE__ unsigned int 2025-05-07T19:46:29.9376031Z #define __SIZEOF_SHORT__ 2 2025-05-07T19:46:29.9376206Z #define __FLT32_NORM_MAX__ 3.40282346638528859811704183484516925e+38F32 2025-05-07T19:46:29.9376367Z #define __SSE__ 1 2025-05-07T19:46:29.9376461Z #define SEM_VALUE_MAX (2147483647) 2025-05-07T19:46:29.9376559Z #define M_SQRT1_2 0.70710678118654752440 2025-05-07T19:46:29.9376656Z #define _CTYPE_H 1 2025-05-07T19:46:29.9376747Z #define __sigset_t_defined 2025-05-07T19:46:29.9376837Z #define __LDBL_MIN_EXP__ (-16381) 2025-05-07T19:46:29.9376931Z #define _GLIBCXX_HAVE_LOGF 1 2025-05-07T19:46:29.9377032Z #define MOD_TAI ADJ_TAI 2025-05-07T19:46:29.9377127Z #define _IO_va_list __gnuc_va_list 2025-05-07T19:46:29.9377217Z #define _GLIBCXX_HAVE_LOGL 1 2025-05-07T19:46:29.9377313Z #define __SM_70_RT_H__ 2025-05-07T19:46:29.9377409Z #define _GLIBCXX_HAVE_WRITEV 1 2025-05-07T19:46:29.9377516Z #define cudaEventWaitDefault 0x00 2025-05-07T19:46:29.9377610Z #define _GLIBCXX_HAVE_EXPL 1 2025-05-07T19:46:29.9377788Z #define __FLT64_MAX__ 1.79769313486231570814527423731704357e+308F64 2025-05-07T19:46:29.9377882Z #define _POSIX_MAX_CANON 255 2025-05-07T19:46:29.9377992Z #define _GLIBCXX_NOEXCEPT_PARM , bool _NE 2025-05-07T19:46:29.9378102Z #define FD_SETSIZE __FD_SETSIZE 2025-05-07T19:46:29.9378196Z #define _GLIBCXX_TXN_SAFE 2025-05-07T19:46:29.9378279Z #define __amd64__ 1 2025-05-07T19:46:29.9378367Z #define __WINT_WIDTH__ 32 2025-05-07T19:46:29.9378481Z #define __CUDA_DEVICE_RUNTIME_API_H__ 2025-05-07T19:46:29.9378761Z #define __REDIRECT_NTHNL(name,proto,alias) name proto __THROWNL __asm__ (__ASMNAME (#alias)) 2025-05-07T19:46:29.9378858Z #define _GLIBCXX_STDIO_SEEK_CUR 1 2025-05-07T19:46:29.9378953Z #define EOF (-1) 2025-05-07T19:46:29.9379047Z #define __WAIT_STATUS_DEFN void * 2025-05-07T19:46:29.9379138Z #define __USE_POSIX199309 1 2025-05-07T19:46:29.9379242Z #define __INT_LEAST64_WIDTH__ 64 2025-05-07T19:46:29.9379337Z #define __LDBL_MAX_EXP__ 16384 2025-05-07T19:46:29.9379429Z #define __FLT32X_MAX_10_EXP__ 308 2025-05-07T19:46:29.9379523Z #define LLONG_MIN (-LLONG_MAX-1) 2025-05-07T19:46:29.9379644Z #define cudaSurfaceType2DLayered 0xF2 2025-05-07T19:46:29.9379737Z #define ____mbstate_t_defined 1 2025-05-07T19:46:29.9379826Z #define STA_NANO 0x2000 2025-05-07T19:46:29.9379931Z #define _GLIBCXX_HAVE_LOG10F 1 2025-05-07T19:46:29.9380072Z #define _GLIBCXX_HAVE_LOG10L 1 2025-05-07T19:46:29.9380160Z #define _IO_LINKED 0x80 2025-05-07T19:46:29.9380254Z #define __cpp_lib_launder 201606 2025-05-07T19:46:29.9380358Z #define __SIZEOF_INT128__ 16 2025-05-07T19:46:29.9380460Z #define __PTHREAD_MUTEX_HAVE_PREV 1 2025-05-07T19:46:29.9380551Z #define __FLT64X_IS_IEC_60559__ 2 2025-05-07T19:46:29.9380656Z #define _GLIBCXX_TYPE_TRAITS 1 2025-05-07T19:46:29.9380796Z #define cudaGraphKernelNodePortProgrammatic 1 2025-05-07T19:46:29.9380903Z #define __DEVICE_ATOMIC_FUNCTIONS_HPP__ 2025-05-07T19:46:29.9381003Z #define __BLKCNT64_T_TYPE __SQUAD_TYPE 2025-05-07T19:46:29.9381106Z #define __LDBL_MAX_10_EXP__ 4932 2025-05-07T19:46:29.9381199Z #define __W_CONTINUED 0xffff 2025-05-07T19:46:29.9381288Z #define __ATOMIC_RELAXED 0 2025-05-07T19:46:29.9381433Z #define w_coredump __wait_terminated.__w_coredump 2025-05-07T19:46:29.9381558Z #define __FSBLKCNT_T_TYPE __SYSCALL_ULONG_TYPE 2025-05-07T19:46:29.9381768Z #define __cudaCDP2OccupancyMaxActiveBlocksPerMultiprocessor 2025-05-07T19:46:29.9381958Z #define __DBL_EPSILON__ double(2.22044604925031308084726333618164062e-16L) 2025-05-07T19:46:29.9382055Z #define __stub_stty 2025-05-07T19:46:29.9382226Z #define _tolower(c) ((int) (*__ctype_tolower_loc ())[(int) (c)]) 2025-05-07T19:46:29.9382312Z #define le16toh(x) (x) 2025-05-07T19:46:29.9382434Z #define BC_SCALE_MAX _POSIX2_BC_SCALE_MAX 2025-05-07T19:46:29.9382610Z #define __FLT128_MIN__ 3.36210314311209350626267781732175260e-4932F128 2025-05-07T19:46:29.9382690Z #define _SIZET_ 2025-05-07T19:46:29.9382795Z #define XATTR_NAME_MAX 255 2025-05-07T19:46:29.9382878Z #define _SVID_SOURCE 1 2025-05-07T19:46:29.9382956Z #define _LP64 1 2025-05-07T19:46:29.9383043Z #define _LIBC_LIMITS_H_ 1 2025-05-07T19:46:29.9383299Z #define __REDIRECT_NTH_LDBL(name,proto,alias) __REDIRECT_NTH (name, proto, alias) 2025-05-07T19:46:29.9383409Z #define _GLIBCXX_TR1_BESSEL_FUNCTION_TCC 1 2025-05-07T19:46:29.9383569Z #define __UINT8_C(c) c 2025-05-07T19:46:29.9383678Z #define _GLIBCXX_HAVE_CEILF 1 2025-05-07T19:46:29.9383767Z #define _GLIBCXX_HAVE_CEILL 1 2025-05-07T19:46:29.9383874Z #define __cudaCDP2Memset3DAsync_ptsz 2025-05-07T19:46:29.9383964Z #define __CUDA_ARCH_LIST__ 520 2025-05-07T19:46:29.9384067Z #define __FLT64_MAX_EXP__ 1024 2025-05-07T19:46:29.9384161Z #define MOD_MAXERROR ADJ_MAXERROR 2025-05-07T19:46:29.9384242Z #define CUDARTAPI 2025-05-07T19:46:29.9384338Z #define IOV_MAX 1024 2025-05-07T19:46:29.9384484Z #define __glibcxx_requires_irreflexive2(_First,_Last) 2025-05-07T19:46:29.9384579Z #define __INT_LEAST32_TYPE__ int 2025-05-07T19:46:29.9384679Z #define cudaMemAttachSingle 0x04 2025-05-07T19:46:29.9384777Z #define __wchar_t__ 2025-05-07T19:46:29.9384876Z #define __cpp_lib_is_aggregate 201703 2025-05-07T19:46:29.9384956Z #define SEEK_END 2 2025-05-07T19:46:29.9385058Z #define __SIZEOF_WCHAR_T__ 4 2025-05-07T19:46:29.9385231Z #define _GLIBCXX_USE_TBB_PAR_BACKEND __has_include() 2025-05-07T19:46:29.9385330Z #define _IO_ftrylockfile(_fp) 2025-05-07T19:46:29.9385479Z #define _GLIBCXX_USE_C99_WCHAR _GLIBCXX11_USE_C99_WCHAR 2025-05-07T19:46:29.9385582Z #define ____FILE_defined 1 2025-05-07T19:46:29.9385698Z #define _GLIBCXX_HAVE_BUILTIN_IS_AGGREGATE 1 2025-05-07T19:46:29.9385792Z #define __GNUC_PATCHLEVEL__ 0 2025-05-07T19:46:29.9385893Z #define _ISOC99_SOURCE 1 2025-05-07T19:46:29.9385989Z #define __VECTOR_FUNCTIONS_H__ 2025-05-07T19:46:29.9386249Z #define __REDIRECT_NTH(name,proto,alias) name proto __THROW __asm__ (__ASMNAME (#alias)) 2025-05-07T19:46:29.9386524Z #define _PSTL_USE_NONTEMPORAL_STORES_IF_ALLOWED 2025-05-07T19:46:29.9386622Z #define _IO_RIGHT 04 2025-05-07T19:46:29.9386715Z #define __END_NAMESPACE_STD 2025-05-07T19:46:29.9386903Z #define __FLT128_NORM_MAX__ 1.18973149535723176508575932662800702e+4932F128 2025-05-07T19:46:29.9387008Z #define _GLIBCXX_STD_C std 2025-05-07T19:46:29.9387128Z #define cudaInitDeviceFlagsAreValid 0x01 2025-05-07T19:46:29.9387228Z #define _LARGEFILE64_SOURCE 1 2025-05-07T19:46:29.9387375Z #define _GLIBCXX_USE_C99_STDINT_TR1 1 2025-05-07T19:46:29.9387472Z #define _STDDEF_H_ 2025-05-07T19:46:29.9387650Z #define __FLT64_NORM_MAX__ 1.79769313486231570814527423731704357e+308F64 2025-05-07T19:46:29.9387743Z #define __FLT128_HAS_QUIET_NAN__ 1 2025-05-07T19:46:29.9387866Z #define isalnum_l(c,l) __isalnum_l ((c), (l)) 2025-05-07T19:46:29.9388177Z #define __FD_ISSET(d,set) ((__FDS_BITS (set)[__FD_ELT (d)] & __FD_MASK (d)) != 0) 2025-05-07T19:46:29.9388286Z #define __INTMAX_MAX__ 0x7fffffffffffffffL 2025-05-07T19:46:29.9388436Z #define __glibcxx_requires_irreflexive(_First,_Last) 2025-05-07T19:46:29.9388554Z #define cudaGraphKernelNodePortDefault 0 2025-05-07T19:46:29.9388652Z #define __INT_FAST8_TYPE__ signed char 2025-05-07T19:46:29.9388757Z #define __cudaCDP2Memcpy3DAsync_ptsz 2025-05-07T19:46:29.9388859Z #define __PID_T_TYPE __S32_TYPE 2025-05-07T19:46:29.9388969Z #define __cpp_namespace_attributes 201411L 2025-05-07T19:46:29.9389066Z #define CHARCLASS_NAME_MAX 2048 2025-05-07T19:46:29.9389170Z #define _GLIBCXX_HAVE_TANF 1 2025-05-07T19:46:29.9389262Z #define _GLIBCXX_USE_ST_MTIM 1 2025-05-07T19:46:29.9389435Z #define __FLT64X_MIN__ 3.36210314311209350626267781732175260e-4932F64x 2025-05-07T19:46:29.9389524Z #define __CUDA_RUNTIME_H__ 2025-05-07T19:46:29.9389716Z #define WIFSIGNALED(status) __WIFSIGNALED (__WAIT_INT (status)) 2025-05-07T19:46:29.9389813Z #define _GLIBCXX_HAVE_STDLIB_H 1 2025-05-07T19:46:29.9389903Z #define __STDCPP_THREADS__ 1 2025-05-07T19:46:29.9390054Z #define M_2_SQRTPIl 1.128379167095512573896158903121545172L 2025-05-07T19:46:29.9390144Z #define __GNUC_STDC_INLINE__ 1 2025-05-07T19:46:29.9390234Z #define _POSIX_UIO_MAXIOV 16 2025-05-07T19:46:29.9390341Z #define _PSTL_PAR_BACKEND_SERIAL 2025-05-07T19:46:29.9390431Z #define P_tmpdir "/tmp" 2025-05-07T19:46:29.9390547Z #define __ASSERT_FUNCTION __PRETTY_FUNCTION__ 2025-05-07T19:46:29.9390637Z #define __FLT64_HAS_DENORM__ 1 2025-05-07T19:46:29.9390801Z #define __WORDSIZE_TIME64_COMPAT32 1 2025-05-07T19:46:29.9390964Z #define _GLIBCXX_DEPRECATED __attribute__ ((__deprecated__)) 2025-05-07T19:46:29.9391131Z #define __FLT32_EPSILON__ 1.19209289550781250000000000000000000e-7F32 2025-05-07T19:46:29.9391238Z #define _PSTL_HIDE_FROM_ABI_PUSH 2025-05-07T19:46:29.9391353Z #define cudaStreamLegacy ((cudaStream_t)0x1) 2025-05-07T19:46:29.9391463Z #define _IO_cleanup_region_start(_fct,_fp) 2025-05-07T19:46:29.9391561Z #define __location__(a) __annotate__(a) 2025-05-07T19:46:29.9391800Z #define __device_builtin_surface_type__ __location__(device_builtin_surface_type) 2025-05-07T19:46:29.9391894Z #define _POSIX2_BC_BASE_MAX 99 2025-05-07T19:46:29.9392002Z #define __cudaCDP2DeviceGetAttribute 2025-05-07T19:46:29.9392102Z #define __DBL_DECIMAL_DIG__ 17 2025-05-07T19:46:29.9392185Z #define __STDC_UTF_32__ 1 2025-05-07T19:46:29.9392274Z #define __INT_FAST8_WIDTH__ 8 2025-05-07T19:46:29.9392365Z #define NAN (__builtin_nanf ("")) 2025-05-07T19:46:29.9392467Z #define _POSIX_MQ_PRIO_MAX 32 2025-05-07T19:46:29.9392546Z #define __FXSR__ 1 2025-05-07T19:46:29.9392626Z #define _SIZE_T 2025-05-07T19:46:29.9392739Z #define _GLIBCXX_USE_GETTIMEOFDAY 1 2025-05-07T19:46:29.9392845Z #define cudaHostRegisterReadOnly 0x08 2025-05-07T19:46:29.9393011Z #define __FLT32X_MAX__ 1.79769313486231570814527423731704357e+308F32x 2025-05-07T19:46:29.9393154Z #define __WIFSTOPPED(status) (((status) & 0xff) == 0x7f) 2025-05-07T19:46:29.9393253Z #define _IO_ssize_t __ssize_t 2025-05-07T19:46:29.9393347Z #define __ULONG32_TYPE unsigned int 2025-05-07T19:46:29.9393531Z #define __DBL_NORM_MAX__ double(1.79769313486231570814527423731704357e+308L) 2025-05-07T19:46:29.9393744Z #define cudaStreamGraphTailLaunch (cudaStream_t)0x0100000000000000 2025-05-07T19:46:29.9393828Z #define _GXX_NULLPTR_T 2025-05-07T19:46:29.9393946Z #define __glibcxx_class_requires3(_a,_b,_c,_d) 2025-05-07T19:46:29.9394040Z #define FOPEN_MAX 16 2025-05-07T19:46:29.9394126Z #define __BIG_ENDIAN 4321 2025-05-07T19:46:29.9394247Z #define __BYTE_ORDER__ __ORDER_LITTLE_ENDIAN__ 2025-05-07T19:46:29.9394338Z #define __suseconds_t_defined 2025-05-07T19:46:29.9394487Z #define __off_t_defined 2025-05-07T19:46:29.9394567Z #define stderr stderr 2025-05-07T19:46:29.9394655Z #define M_LOG10E 0.43429448190325182765 2025-05-07T19:46:29.9394777Z #define __glibcxx_requires_string(_String) 2025-05-07T19:46:29.9394871Z #define _GLIBCXX_HAVE_LDEXPL 1 2025-05-07T19:46:29.9394958Z #define __INTMAX_WIDTH__ 64 2025-05-07T19:46:29.9395379Z #define _PSTL_CPP14_2RANGE_MISMATCH_EQUAL_PRESENT (_MSC_VER >= 1900 || __cplusplus >= 201300L || __cpp_lib_robust_nonmodifying_seq_ops == 201304) 2025-05-07T19:46:29.9395476Z #define __mode_t_defined 2025-05-07T19:46:29.9395553Z #define _GCC_SIZE_T 2025-05-07T19:46:29.9395752Z #define __INO64_T_TYPE __UQUAD_TYPE 2025-05-07T19:46:29.9395858Z #define __cpp_runtime_arrays 198712L 2025-05-07T19:46:29.9395957Z #define __UINT64_TYPE__ long unsigned int 2025-05-07T19:46:29.9396041Z #define __USE_XOPEN2K8XSI 1 2025-05-07T19:46:29.9396127Z #define __UINT32_C(c) c ## U 2025-05-07T19:46:29.9396233Z #define __cpp_alias_templates 200704L 2025-05-07T19:46:29.9396333Z #define cudaHostAllocMapped 0x02 2025-05-07T19:46:29.9396430Z #define __DEVICE_LAUNCH_PARAMETERS_H__ 2025-05-07T19:46:29.9396516Z #define _STL_ITERATOR_H 1 2025-05-07T19:46:29.9396605Z #define __size_t__ 2025-05-07T19:46:29.9396726Z #define cudaStreamAttrID cudaLaunchAttributeID 2025-05-07T19:46:29.9396812Z #define _GLIBCXX_HAVE_ATANF 1 2025-05-07T19:46:29.9396927Z #define cudaEventRecordExternal 0x01 2025-05-07T19:46:29.9397066Z #define __isspace_l(c,l) __isctype_l((c), _ISspace, (l)) 2025-05-07T19:46:29.9397150Z #define _IO_BUFSIZ _G_BUFSIZ 2025-05-07T19:46:29.9397307Z #define __FLT_DENORM_MIN__ 1.40129846432481707092372958328991613e-45F 2025-05-07T19:46:29.9397401Z #define _ENDIAN_H 1 2025-05-07T19:46:29.9397496Z #define __builtin_align__(a) __align__(a) 2025-05-07T19:46:29.9397583Z #define _GLIBCXX20_CONSTEXPR 2025-05-07T19:46:29.9397691Z #define __NV_NO_HOST_COMPILER_CHECK 1 2025-05-07T19:46:29.9397818Z #define __try try 2025-05-07T19:46:29.9397909Z #define _GLIBCXX_HAVE_FINITE 1 2025-05-07T19:46:29.9398006Z #define __FLT128_IS_IEC_60559__ 2 2025-05-07T19:46:29.9398089Z #define __INT8_MAX__ 0x7f 2025-05-07T19:46:29.9398339Z #define cudaStreamGetCaptureInfo __CUDART_API_PTSZ(cudaStreamGetCaptureInfo_v2) 2025-05-07T19:46:29.9398418Z #define __LONG_WIDTH__ 64 2025-05-07T19:46:29.9398505Z #define __PIC__ 2 2025-05-07T19:46:29.9398611Z #define BC_STRING_MAX _POSIX2_BC_STRING_MAX 2025-05-07T19:46:29.9398721Z #define __UINT_FAST32_TYPE__ long unsigned int 2025-05-07T19:46:29.9398853Z #define FD_ISSET(fd,fdsetp) __FD_ISSET (fd, fdsetp) 2025-05-07T19:46:29.9398943Z #define _GLIBCXX_HAVE_FLOAT_H 1 2025-05-07T19:46:29.9399028Z #define _GLIBCXX_HAVE_ATANL 1 2025-05-07T19:46:29.9399201Z #define __FLT32X_NORM_MAX__ 1.79769313486231570814527423731704357e+308F32x 2025-05-07T19:46:29.9399303Z #define __DEVICE_FUNCTIONS_HPP__ 2025-05-07T19:46:29.9399393Z #define __CHAR32_TYPE__ unsigned int 2025-05-07T19:46:29.9399477Z #define _IO_uid_t __uid_t 2025-05-07T19:46:29.9399578Z #define _GLIBCXX_HAVE_READLINK 1 2025-05-07T19:46:29.9399695Z #define __cudaCDP2EventRecordWithFlags_ptsz 2025-05-07T19:46:29.9399780Z #define _CONCEPT_CHECK_H 1 2025-05-07T19:46:29.9399913Z #define __FLT_MAX__ 3.40282346638528859811704183484516925e+38F 2025-05-07T19:46:29.9400015Z #define _GLIBCXX_HAVE_NETINET_IN_H 1 2025-05-07T19:46:29.9400128Z #define _GLIBCXX_TR1_SPECIAL_FUNCTION_UTIL_H 1 2025-05-07T19:46:29.9400204Z #define LONG_BIT 64 2025-05-07T19:46:29.9400313Z #define __SIZEOF_PTHREAD_BARRIERATTR_T 4 2025-05-07T19:46:29.9400405Z #define _GLIBCXX_USE_ALLOCATOR_NEW 1 2025-05-07T19:46:29.9400524Z #define __cpp_lib_math_special_functions 201603L 2025-05-07T19:46:29.9400615Z #define __fsfilcnt_t_defined 2025-05-07T19:46:29.9400709Z #define __blkcnt_t_defined 2025-05-07T19:46:29.9400973Z #define cudaKernelNodeAttributeMemSyncDomain cudaLaunchAttributeMemSyncDomain 2025-05-07T19:46:29.9401053Z #define __USE_LARGEFILE 1 2025-05-07T19:46:29.9401154Z #define __cpp_constexpr 201603L 2025-05-07T19:46:29.9401315Z #define CUDART_VERSION 12060 2025-05-07T19:46:29.9401401Z #define NL_TEXTMAX INT_MAX 2025-05-07T19:46:29.9401505Z #define cudaDeviceMapHost 0x08 2025-05-07T19:46:29.9401586Z #define _GLIBCXX_CMATH 1 2025-05-07T19:46:29.9401773Z #define __attribute_format_arg__(x) __attribute__ ((__format_arg__ (x))) 2025-05-07T19:46:29.9401857Z #define __lldiv_t_defined 1 2025-05-07T19:46:29.9401943Z #define __SSE2__ 1 2025-05-07T19:46:29.9402018Z #define _IOLBF 1 2025-05-07T19:46:29.9402111Z #define _GLIBCXX_HAVE_SYS_TYPES_H 1 2025-05-07T19:46:29.9402210Z #define _GLIBCXX_HAVE_FLOORF 1 2025-05-07T19:46:29.9402304Z #define __cpp_deduction_guides 201703L 2025-05-07T19:46:29.9402392Z #define _GLIBCXX_HAVE_EXPF 1 2025-05-07T19:46:29.9402494Z #define __annotate__(a) __attribute__((a)) 2025-05-07T19:46:29.9402589Z #define __INT32_TYPE__ int 2025-05-07T19:46:29.9402672Z #define __SIZEOF_DOUBLE__ 8 2025-05-07T19:46:29.9402771Z #define cudaDeviceSyncMemops 0x80 2025-05-07T19:46:29.9402873Z #define __cpp_exceptions 199711L 2025-05-07T19:46:29.9402960Z #define __FLT_MIN_10_EXP__ (-37) 2025-05-07T19:46:29.9403060Z #define cudaDeviceScheduleYield 0x02 2025-05-07T19:46:29.9403144Z #define _SYS_SYSMACROS_H 1 2025-05-07T19:46:29.9403259Z #define _GLIBCXX_TR1_LEGENDRE_FUNCTION_TCC 1 2025-05-07T19:46:29.9403408Z #define __FLT64_MIN__ 2.22507385850720138309023271733240406e-308F64 2025-05-07T19:46:29.9403497Z #define __INT_LEAST32_WIDTH__ 32 2025-05-07T19:46:29.9403593Z #define __SWORD_TYPE long int 2025-05-07T19:46:29.9403676Z #define __INTMAX_TYPE__ long int 2025-05-07T19:46:29.9403761Z #define _GLIBCXX11_USE_C99_MATH 1 2025-05-07T19:46:29.9403848Z #define __PTHREAD_SPINS 0, 0 2025-05-07T19:46:29.9403941Z #define _BITS_POSIX1_LIM_H 1 2025-05-07T19:46:29.9404384Z #define cudaStreamAttributeMemSyncDomainMap cudaLaunchAttributeMemSyncDomainMap 2025-05-07T19:46:29.9404474Z #define __DEC128_MAX_EXP__ 6145 2025-05-07T19:46:29.9404626Z #define math_errhandling (MATH_ERRNO | MATH_ERREXCEPT) 2025-05-07T19:46:29.9404765Z #define _T_SIZE 2025-05-07T19:46:29.9404872Z #define cudaHostAllocDefault 0x00 2025-05-07T19:46:29.9404993Z #define _PSTL_PRAGMA_SIMD_EXCLUSIVE_SCAN(PRM) 2025-05-07T19:46:29.9405123Z #define __va_arg_pack() __builtin_va_arg_pack () 2025-05-07T19:46:29.9405210Z #define _POSIX_TIMER_MAX 32 2025-05-07T19:46:29.9405300Z #define _GLIBCXX_HAVE_TLS 1 2025-05-07T19:46:29.9405593Z #define _GLIBCXX_NOTHROW _GLIBCXX_USE_NOEXCEPT 2025-05-07T19:46:29.9405685Z #define _GLIBCXX_HAVE_ACOSL 1 2025-05-07T19:46:29.9405777Z #define __FLT32X_HAS_QUIET_NAN__ 1 2025-05-07T19:46:29.9405867Z #define __ATOMIC_CONSUME 1 2025-05-07T19:46:29.9406058Z #define __CUDA_ARCH_HAS_FEATURE__(_FEAT) __CUDA_ARCH_FEAT_ ##_FEAT 2025-05-07T19:46:29.9406148Z #define __GNUC_MINOR__ 4 2025-05-07T19:46:29.9406248Z #define __GLIBCXX_TYPE_INT_N_0 __int128 2025-05-07T19:46:29.9406355Z #define __INT_FAST16_WIDTH__ 64 2025-05-07T19:46:29.9406470Z #define __UINTMAX_MAX__ 0xffffffffffffffffUL 2025-05-07T19:46:29.9406555Z #define __PIE__ 2 2025-05-07T19:46:29.9406670Z #define LITTLE_ENDIAN __LITTLE_ENDIAN 2025-05-07T19:46:29.9406772Z #define _GLIBCXX_HAVE_INT64_T_LONG 1 2025-05-07T19:46:29.9406968Z #define __FLT32X_DENORM_MIN__ 4.94065645841246544176568792868221372e-324F32x 2025-05-07T19:46:29.9407194Z #define __intN_t(N,MODE) typedef int int ##N ##_t __attribute__ ((__mode__ (MODE))) 2025-05-07T19:46:29.9407299Z #define __nlink_t_defined 2025-05-07T19:46:29.9407426Z #define _GLIBCXX17_DEPRECATED [[__deprecated__]] 2025-05-07T19:46:29.9407534Z #define _PSTL_STRING(x) _PSTL_STRING_AUX(x) 2025-05-07T19:46:29.9407628Z #define _XOPEN_LIM_H 1 2025-05-07T19:46:29.9407902Z #define __u_intN_t(N,MODE) typedef unsigned int u_int ##N ##_t __attribute__ ((__mode__ (MODE))) 2025-05-07T19:46:29.9408017Z #define __cpp_template_template_args 201611L 2025-05-07T19:46:29.9408119Z #define _GTHREAD_USE_MUTEX_TIMEDLOCK 1 2025-05-07T19:46:29.9408232Z #define BC_DIM_MAX _POSIX2_BC_DIM_MAX 2025-05-07T19:46:29.9408326Z #define __DBL_MAX_10_EXP__ 308 2025-05-07T19:46:29.9408409Z #define __FILE_defined 1 2025-05-07T19:46:29.9408655Z #define __LDBL_DENORM_MIN__ 3.64519953188247460252840593361941982e-4951L 2025-05-07T19:46:29.9408753Z #define _GLIBCXX_HAVE_SINCOS 1 2025-05-07T19:46:29.9408846Z #define __USE_XOPEN_EXTENDED 1 2025-05-07T19:46:29.9408962Z #define __cpp_lib_tuple_element_t 201402L 2025-05-07T19:46:29.9409076Z #define isascii_l(c,l) __isascii_l ((c), (l)) 2025-05-07T19:46:29.9409184Z #define cudaInvalidDeviceId ((int)-2) 2025-05-07T19:46:29.9409282Z #define _GLIBCXX_HAVE_SYS_RESOURCE_H 1 2025-05-07T19:46:29.9409378Z #define __INT16_C(c) c 2025-05-07T19:46:29.9409472Z #define __U32_TYPE unsigned int 2025-05-07T19:46:29.9409569Z #define _GLIBCXX_HAVE_SYS_IOCTL_H 1 2025-05-07T19:46:29.9409702Z #define FD_CLR(fd,fdsetp) __FD_CLR (fd, fdsetp) 2025-05-07T19:46:29.9409784Z #define __STDC__ 1 2025-05-07T19:46:29.9409879Z #define _GLIBCXX_HAVE_VWSCANF 1 2025-05-07T19:46:29.9409979Z #define _GLIBCXX_HAVE_EXECINFO_H 1 2025-05-07T19:46:29.9410088Z #define _GLIBCXX_USE_REALPATH 1 2025-05-07T19:46:29.9410326Z #define __attribute_malloc__ __attribute__ ((__malloc__)) 2025-05-07T19:46:29.9410416Z #define __FLT32X_DIG__ 15 2025-05-07T19:46:29.9410532Z #define _GLIBCXX_USE_C99_CTYPE_TR1 1 2025-05-07T19:46:29.9410633Z #define __PTRDIFF_TYPE__ long int 2025-05-07T19:46:29.9410746Z #define cudaArrayDeferredMapping 0x80 2025-05-07T19:46:29.9410856Z #define _GLIBCXX_END_NAMESPACE_CONTAINER 2025-05-07T19:46:29.9410968Z #define USHRT_MAX (SHRT_MAX * 2 + 1) 2025-05-07T19:46:29.9411068Z #define __cpp_lib_is_swappable 201603 2025-05-07T19:46:29.9411153Z #define stdin stdin 2025-05-07T19:46:29.9411254Z #define __ino64_t_defined 2025-05-07T19:46:29.9411341Z #define STA_CLK 0x8000 2025-05-07T19:46:29.9411553Z #define __clockid_t_defined 1 2025-05-07T19:46:29.9411701Z #define _GLIBCXX_NOEXCEPT_IF(...) noexcept(__VA_ARGS__) 2025-05-07T19:46:29.9411878Z #define __attribute_noinline__ __attribute__ ((__noinline__)) 2025-05-07T19:46:29.9411979Z #define __cudaCDP2MemsetAsync 2025-05-07T19:46:29.9412136Z #define _PSTL_PRAGMA_SIMD_SCAN(PRM) 2025-05-07T19:46:29.9412247Z #define _GLIBCXX_BEGIN_NAMESPACE_LDBL 2025-05-07T19:46:29.9412347Z #define _GLIBCXX_TR1_POLY_HERMITE_TCC 1 2025-05-07T19:46:29.9412551Z #define __FD_SET(d,set) ((void) (__FDS_BITS (set)[__FD_ELT (d)] |= __FD_MASK (d))) 2025-05-07T19:46:29.9412650Z #define __ATOMIC_SEQ_CST 5 2025-05-07T19:46:29.9413214Z #define __tobody(c,f,a,args) (__extension__ ({ int __res; if (sizeof (c) > 1) { if (__builtin_constant_p (c)) { int __c = (c); __res = __c < -128 || __c > 255 ? __c : (a)[__c]; } else __res = f args; } else __res = (a)[(int) (c)]; __res; })) 2025-05-07T19:46:29.9413297Z #define DOMAIN 1 2025-05-07T19:46:29.9413387Z #define M_LN2 0.69314718055994530942 2025-05-07T19:46:29.9413474Z #define __NVCC__ 1 2025-05-07T19:46:29.9413578Z #define __cudaCDP2Memset2DAsync 2025-05-07T19:46:29.9413689Z #define __CLOCK_T_TYPE __SYSCALL_SLONG_TYPE 2025-05-07T19:46:29.9413798Z #define _PSTL_PRAGMA_SIMD_EARLYEXIT 2025-05-07T19:46:29.9413902Z #define __throw_exception_again throw 2025-05-07T19:46:29.9413998Z #define M_SQRT2 1.41421356237309504880 2025-05-07T19:46:29.9414086Z #define __EXCEPTION_H 1 2025-05-07T19:46:29.9414188Z #define __FLT32X_MIN_10_EXP__ (-307) 2025-05-07T19:46:29.9414286Z #define HUGE_VAL (__builtin_huge_val()) 2025-05-07T19:46:29.9414606Z #define cudaStreamAttributeAccessPolicyWindow cudaLaunchAttributeAccessPolicyWindow 2025-05-07T19:46:29.9414727Z #define __UINTPTR_TYPE__ long unsigned int 2025-05-07T19:46:29.9414835Z #define _GLIBCXX_INLINE_VERSION 0 2025-05-07T19:46:29.9414924Z #define _GLIBCXX_USE_INT128 1 2025-05-07T19:46:29.9415019Z #define __cpp_lib_bool_constant 201505 2025-05-07T19:46:29.9415121Z #define PTHREAD_KEYS_MAX 1024 2025-05-07T19:46:29.9415259Z #define __DEC64_SUBNORMAL_MIN__ 0.000000000000001E-383DD 2025-05-07T19:46:29.9415360Z #define __FSFILCNT64_T_TYPE __UQUAD_TYPE 2025-05-07T19:46:29.9415468Z #define _GLIBCXX_DOUBLE_IS_IEEE_BINARY64 1 2025-05-07T19:46:29.9415569Z #define __DEC128_MANT_DIG__ 34 2025-05-07T19:46:29.9415675Z #define __cpp_lib_tuples_by_type 201304 2025-05-07T19:46:29.9415883Z #define __LDBL_MIN_10_EXP__ (-4931) 2025-05-07T19:46:29.9415993Z #define __cpp_generic_lambdas 201304L 2025-05-07T19:46:29.9416126Z #define _GLIBCXX_THROW_OR_ABORT(_EXC) (throw (_EXC)) 2025-05-07T19:46:29.9416219Z #define __useconds_t_defined 2025-05-07T19:46:29.9416317Z #define _GLIBCXX_USE_SCHED_YIELD 1 2025-05-07T19:46:29.9416516Z #define __attribute_deprecated__ __attribute__ ((__deprecated__)) 2025-05-07T19:46:29.9416664Z #define __cpp_lib_type_trait_variable_templates 201510L 2025-05-07T19:46:29.9416751Z #define __SSE_MATH__ 1 2025-05-07T19:46:29.9416850Z #define _IO_wint_t wint_t 2025-05-07T19:46:29.9416941Z #define __SIZEOF_LONG_LONG__ 8 2025-05-07T19:46:29.9417030Z #define _GLIBCXX_VERBOSE 1 2025-05-07T19:46:29.9417123Z #define _GLIBCXX_HAVE_ASINF 1 2025-05-07T19:46:29.9417249Z #define __cpp_user_defined_literals 200809L 2025-05-07T19:46:29.9417343Z #define _GLIBCXX_HAVE_ISINFL 1 2025-05-07T19:46:29.9417439Z #define _GLIBCXX_HAVE_ASINL 1 2025-05-07T19:46:29.9417533Z #define __USE_ATFILE 1 2025-05-07T19:46:29.9417626Z #define _POSIX_OPEN_MAX 20 2025-05-07T19:46:29.9417720Z #define _POSIX_LOGIN_NAME_MAX 9 2025-05-07T19:46:29.9417806Z #define _GCC_PTRDIFF_T 2025-05-07T19:46:29.9418051Z #define cudaKernelNodeAttributePriority cudaLaunchAttributePriority 2025-05-07T19:46:29.9418144Z #define __FLT128_DECIMAL_DIG__ 36 2025-05-07T19:46:29.9418241Z #define _POSIX_THREAD_KEYS_MAX 128 2025-05-07T19:46:29.9418356Z #define __GCC_ATOMIC_LLONG_LOCK_FREE 2 2025-05-07T19:46:29.9418461Z #define __cpp_lib_array_constexpr 201803L 2025-05-07T19:46:29.9418543Z #define _STDLIB_H 1 2025-05-07T19:46:29.9418693Z #define __exctype(name) extern int name (int) __THROW 2025-05-07T19:46:29.9418787Z #define __FLT32_HAS_QUIET_NAN__ 1 2025-05-07T19:46:29.9418873Z #define __FLT_DECIMAL_DIG__ 9 2025-05-07T19:46:29.9419001Z #define __UINT_FAST16_MAX__ 0xffffffffffffffffUL 2025-05-07T19:46:29.9419117Z #define __SURFACE_INDIRECT_FUNCTIONS_H__ 2025-05-07T19:46:29.9419257Z #define __SM_61_INTRINSICS_H__ 2025-05-07T19:46:29.9419448Z #define _GLIBCXX_PACKAGE_STRING "package-unused version-unused" 2025-05-07T19:46:29.9419615Z #define __isxdigit_l(c,l) __isctype_l((c), _ISxdigit, (l)) 2025-05-07T19:46:29.9419717Z #define __glibcxx_requires_nonempty() 2025-05-07T19:46:29.9419830Z #define w_stopsig __wait_stopped.__w_stopsig 2025-05-07T19:46:29.9419920Z #define __ldiv_t_defined 1 2025-05-07T19:46:29.9420112Z #define __glibcxx_requires_irreflexive_pred(_First,_Last,_Pred) 2025-05-07T19:46:29.9420201Z #define ___int_ptrdiff_t_h 2025-05-07T19:46:29.9420369Z #define __LDBL_NORM_MAX__ 1.18973149535723176502126385303097021e+4932L 2025-05-07T19:46:29.9420485Z #define __cudaCDP2EventDestroy 2025-05-07T19:46:29.9420573Z #define __HOST_DEFINES_H__ 2025-05-07T19:46:29.9420668Z #define __GCC_ATOMIC_SHORT_LOCK_FREE 2 2025-05-07T19:46:29.9420767Z #define __SM_20_ATOMIC_FUNCTIONS_H__ 2025-05-07T19:46:29.9420875Z #define _GLIBCXX_USE_NANOSLEEP 1 2025-05-07T19:46:29.9420958Z #define CUDART_CB 2025-05-07T19:46:29.9421058Z #define BC_BASE_MAX _POSIX2_BC_BASE_MAX 2025-05-07T19:46:29.9421192Z #define _GLIBCXX_USE_C99_INTTYPES_WCHAR_T_TR1 1 2025-05-07T19:46:29.9421274Z #define MB_LEN_MAX 16 2025-05-07T19:46:29.9421504Z #define __glibcxx_requires_partitioned_lower_pred(_First,_Last,_Value,_Pred) 2025-05-07T19:46:29.9421606Z #define _GLIBCXX11_USE_C99_WCHAR 1 2025-05-07T19:46:29.9421726Z #define _IO_peekc(_fp) _IO_peekc_unlocked (_fp) 2025-05-07T19:46:29.9421838Z #define _GLIBCXX_HAVE_AS_SYMVER_DIRECTIVE 1 2025-05-07T19:46:29.9421932Z #define _GLIBCXX_HAVE_UNISTD_H 1 2025-05-07T19:46:29.9422094Z #define __glibc_likely(cond) __builtin_expect((cond), 1) 2025-05-07T19:46:29.9422201Z #define __UINT_FAST8_TYPE__ unsigned char 2025-05-07T19:46:29.9422282Z #define _GNU_SOURCE 1 2025-05-07T19:46:29.9422374Z #define __stub_putmsg 2025-05-07T19:46:29.9422456Z #define __CUDACC__ 1 2025-05-07T19:46:29.9422544Z #define __N(msgid) (msgid) 2025-05-07T19:46:29.9422629Z #define __P(args) args 2025-05-07T19:46:29.9423056Z #define cudaKernelNodeAttributeCooperative cudaLaunchAttributeCooperative 2025-05-07T19:46:29.9423151Z #define __cpp_init_captures 201304L 2025-05-07T19:46:29.9423248Z #define _GLIBCXX17_CONSTEXPR constexpr 2025-05-07T19:46:29.9423342Z #define __ATOMIC_ACQ_REL 4 2025-05-07T19:46:29.9423430Z #define __cpp_lib_as_const 201510 2025-05-07T19:46:29.9423505Z #define __WCHAR_T 2025-05-07T19:46:29.9423585Z #define __ATOMIC_RELEASE 3 2025-05-07T19:46:29.9423680Z #define __fsblkcnt_t_defined 2025-05-07T19:46:29.9423789Z #define __cudaCDP2EventCreateWithFlags 2025-05-07T19:46:29.9423883Z #define __DEVICE_DOUBLE_FUNCTIONS_H__ 2025-05-07T19:46:29.9423888Z 2025-05-07T19:46:29.9754619Z 2025-05-07T19:46:29.9755404Z + conda run -n build_binary nvcc --version 2025-05-07T19:46:29.9756139Z 2025-05-07T19:46:31.8178695Z nvcc: NVIDIA (R) Cuda compiler driver 2025-05-07T19:46:31.8179185Z Copyright (c) 2005-2024 NVIDIA Corporation 2025-05-07T19:46:31.8179521Z Built on Tue_Oct_29_23:50:19_PDT_2024 2025-05-07T19:46:31.8179901Z Cuda compilation tools, release 12.6, V12.6.85 2025-05-07T19:46:31.8180273Z Build cuda_12.6.r12.6/compiler.35059454_0 2025-05-07T19:46:31.8180516Z 2025-05-07T19:46:31.8912281Z 2025-05-07T19:46:31.8920039Z which: no nvidia-smi in (CONDA=/github/home/miniconda:/github/home/miniconda/bin:/usr/local/sbin:/usr/local/bin:/usr/sbin:/usr/bin:/sbin:/bin) 2025-05-07T19:46:31.8920831Z [CHECK] nvidia-smi not found 2025-05-07T19:46:31.8921134Z [INSTALL] Successfully installed CUDA 12.6.3 2025-05-07T19:46:31.9016770Z ##[group]Run . $PRELUDE; install_pytorch_pip $BUILD_ENV nightly cuda/12.6.3 2025-05-07T19:46:31.9017428Z . $PRELUDE; install_pytorch_pip $BUILD_ENV nightly cuda/12.6.3 2025-05-07T19:46:31.9018125Z shell: bash --noprofile --norc -e -o pipefail {0} 2025-05-07T19:46:31.9018506Z env: 2025-05-07T19:46:31.9018785Z PRELUDE: .github/scripts/setup_env.bash 2025-05-07T19:46:31.9019108Z BUILD_ENV: build_binary 2025-05-07T19:46:31.9019400Z BUILD_TARGET: default 2025-05-07T19:46:31.9019841Z BUILD_VARIANT: cuda 2025-05-07T19:46:31.9020133Z BUILD_CUDA_VERSION: 12.6.3 2025-05-07T19:46:31.9020407Z ##[endgroup] 2025-05-07T19:46:32.3529687Z ################################################################################ 2025-05-07T19:46:32.3530308Z # Install PyTorch (PIP) 2025-05-07T19:46:32.3530595Z # 2025-05-07T19:46:32.3542244Z # [2025-05-07T19:46:32.353Z] + install_pytorch_pip build_binary nightly cuda/12.6.3 2025-05-07T19:46:32.3542888Z ################################################################################ 2025-05-07T19:46:32.3543141Z 2025-05-07T19:46:32.3572121Z [EXEC] [ATTEMPT 0/3] + conda install -n build_binary -c conda-forge --override-channels -y numpy 2025-05-07T19:46:33.2597372Z Channels: 2025-05-07T19:46:33.2598070Z - conda-forge 2025-05-07T19:46:33.2598703Z Platform: linux-64 2025-05-07T19:46:36.3967932Z Collecting package metadata (repodata.json): - \ | / done 2025-05-07T19:46:37.9883781Z Solving environment: \ | / - done 2025-05-07T19:46:38.2786724Z 2025-05-07T19:46:38.2787355Z ## Package Plan ## 2025-05-07T19:46:38.2787828Z 2025-05-07T19:46:38.2788421Z environment location: /github/home/miniconda/envs/build_binary 2025-05-07T19:46:38.2789331Z 2025-05-07T19:46:38.2789599Z added / updated specs: 2025-05-07T19:46:38.2790305Z - numpy 2025-05-07T19:46:38.2790638Z 2025-05-07T19:46:38.2790651Z 2025-05-07T19:46:38.2791042Z The following packages will be downloaded: 2025-05-07T19:46:38.2791266Z 2025-05-07T19:46:38.2791413Z package | build 2025-05-07T19:46:38.2791761Z ---------------------------|----------------- 2025-05-07T19:46:38.2792186Z libblas-3.9.0 |31_h59b9bed_openblas 16 KB conda-forge 2025-05-07T19:46:38.2792709Z libcblas-3.9.0 |31_he106b2a_openblas 16 KB conda-forge 2025-05-07T19:46:38.2793220Z liblapack-3.9.0 |31_h7ac8fdf_openblas 16 KB conda-forge 2025-05-07T19:46:38.2793684Z numpy-2.2.5 | py313h17eae1a_0 8.1 MB conda-forge 2025-05-07T19:46:38.2794123Z ------------------------------------------------------------ 2025-05-07T19:46:38.2794502Z Total: 8.2 MB 2025-05-07T19:46:38.2794721Z 2025-05-07T19:46:38.2794857Z The following NEW packages will be INSTALLED: 2025-05-07T19:46:38.2795119Z 2025-05-07T19:46:38.2795358Z libblas conda-forge/linux-64::libblas-3.9.0-31_h59b9bed_openblas 2025-05-07T19:46:38.2795901Z libcblas conda-forge/linux-64::libcblas-3.9.0-31_he106b2a_openblas 2025-05-07T19:46:38.2796475Z liblapack conda-forge/linux-64::liblapack-3.9.0-31_h7ac8fdf_openblas 2025-05-07T19:46:38.2797005Z numpy conda-forge/linux-64::numpy-2.2.5-py313h17eae1a_0 2025-05-07T19:46:38.2797299Z 2025-05-07T19:46:38.2797303Z 2025-05-07T19:46:38.2797307Z 2025-05-07T19:46:38.2797461Z Downloading and Extracting Packages: ...working... 2025-05-07T19:46:38.2797872Z numpy-2.2.5 | 8.1 MB | | 0% 2025-05-07T19:46:38.2798109Z 2025-05-07T19:46:38.2798463Z libblas-3.9.0 | 16 KB | | 0%  2025-05-07T19:46:38.2798711Z 2025-05-07T19:46:38.2798714Z 2025-05-07T19:46:38.2803853Z libcblas-3.9.0 | 16 KB | | 0%  2025-05-07T19:46:38.2804225Z 2025-05-07T19:46:38.2804477Z 2025-05-07T19:46:38.2804651Z 2025-05-07T19:46:38.4323956Z liblapack-3.9.0 | 16 KB | | 0%  2025-05-07T19:46:38.4324851Z 2025-05-07T19:46:38.4325412Z 2025-05-07T19:46:38.4326104Z libcblas-3.9.0 | 16 KB | #########7 | 98%  2025-05-07T19:46:38.4326858Z 2025-05-07T19:46:38.4326870Z 2025-05-07T19:46:38.4326890Z 2025-05-07T19:46:38.4351757Z liblapack-3.9.0 | 16 KB | #########7 | 98%  2025-05-07T19:46:38.4352077Z 2025-05-07T19:46:38.4352082Z 2025-05-07T19:46:38.4352345Z libcblas-3.9.0 | 16 KB | ########## | 100%  2025-05-07T19:46:38.4352606Z 2025-05-07T19:46:38.4352891Z 2025-05-07T19:46:38.4352895Z 2025-05-07T19:46:38.4638724Z liblapack-3.9.0 | 16 KB | ########## | 100%  2025-05-07T19:46:38.4639352Z 2025-05-07T19:46:38.4639374Z 2025-05-07T19:46:38.4639377Z 2025-05-07T19:46:38.4639610Z liblapack-3.9.0 | 16 KB | ########## | 100%  2025-05-07T19:46:38.4639875Z 2025-05-07T19:46:38.4639879Z 2025-05-07T19:46:38.5050043Z libcblas-3.9.0 | 16 KB | ########## | 100%  2025-05-07T19:46:38.5050541Z 2025-05-07T19:46:38.5051106Z libblas-3.9.0 | 16 KB | #########7 | 97%  2025-05-07T19:46:38.5053308Z 2025-05-07T19:46:38.5158263Z libblas-3.9.0 | 16 KB | ########## | 100%  2025-05-07T19:46:38.5324977Z numpy-2.2.5 | 8.1 MB | | 0% 2025-05-07T19:46:38.5325427Z 2025-05-07T19:46:38.5671220Z libblas-3.9.0 | 16 KB | ########## | 100%  2025-05-07T19:46:38.8931846Z numpy-2.2.5 | 8.1 MB | ########## | 100% 2025-05-07T19:46:38.8932967Z numpy-2.2.5 | 8.1 MB | ########## | 100% 2025-05-07T19:46:38.8935566Z numpy-2.2.5 | 8.1 MB | ########## | 100% 2025-05-07T19:46:38.8936535Z 2025-05-07T19:46:38.8937119Z 2025-05-07T19:46:38.8937743Z  2025-05-07T19:46:38.8938367Z 2025-05-07T19:46:38.8938379Z 2025-05-07T19:46:38.8938853Z  2025-05-07T19:46:38.8939578Z 2025-05-07T19:46:38.8939582Z 2025-05-07T19:46:38.8939594Z 2025-05-07T19:46:38.8939801Z  done 2025-05-07T19:46:38.9947640Z Preparing transaction: | done 2025-05-07T19:46:39.1959470Z Verifying transaction: - \ done 2025-05-07T19:46:39.2971671Z Executing transaction: / done 2025-05-07T19:46:39.4053795Z ################################################################################ 2025-05-07T19:46:39.4055251Z # Install Package From PyTorch PIP: torch 2025-05-07T19:46:39.4056146Z # 2025-05-07T19:46:39.4080590Z # [2025-05-07T19:46:39.407Z] + install_from_pytorch_pip build_binary torch nightly cuda/12.6.3 2025-05-07T19:46:39.4081586Z ################################################################################ 2025-05-07T19:46:39.4081915Z 2025-05-07T19:46:39.4098338Z [EXEC] [ATTEMPT 0/3] + wget -q --timeout 1 pypi.org -O /dev/null 2025-05-07T19:46:39.5001398Z [CHECK] Network does not appear to be blocked. 2025-05-07T19:46:39.5001935Z ################################################################################ 2025-05-07T19:46:39.5002431Z # Prepare PIP Arguments (PyTorch PIP) 2025-05-07T19:46:39.5002728Z # 2025-05-07T19:46:39.5020211Z # [2025-05-07T19:46:39.501Z] + __prepare_pip_arguments torch nightly cuda/12.6.3 2025-05-07T19:46:39.5020771Z ################################################################################ 2025-05-07T19:46:39.5021008Z 2025-05-07T19:46:39.5044951Z [INSTALL] Extracted package (channel, version): (nightly, LATEST) 2025-05-07T19:46:39.5069744Z [INSTALL] Extracted package variant: cu126 2025-05-07T19:46:39.5085151Z [INSTALL] Using a non-RELEASE channel: nightly ... 2025-05-07T19:46:39.5086863Z [INSTALL] Extracted the full PIP channel: https://download.pytorch.org/whl/nightly/cu126/ 2025-05-07T19:46:39.5091815Z [INSTALL] Extracted the full PIP package: --pre torch 2025-05-07T19:46:39.5100624Z [INSTALL] Attempting to install [torch, LATEST] from PyTorch PIP using channel https://download.pytorch.org/whl/nightly/cu126/ ... 2025-05-07T19:46:39.5133809Z [EXEC] [ATTEMPT 0/3] + conda run -n build_binary pip install --pre torch --index-url https://download.pytorch.org/whl/nightly/cu126/ 2025-05-07T19:48:20.5037538Z WARNING: Running pip as the 'root' user can result in broken permissions and conflicting behaviour with the system package manager, possibly rendering your system unusable. It is recommended to use a virtual environment instead: https://pip.pypa.io/warnings/venv. Use the --root-user-action option if you know what you are doing and want to suppress this warning. 2025-05-07T19:48:20.5039814Z 2025-05-07T19:48:20.5040046Z Looking in indexes: https://download.pytorch.org/whl/nightly/cu126/ 2025-05-07T19:48:20.5040500Z Collecting torch 2025-05-07T19:48:20.5041205Z Downloading https://download.pytorch.org/whl/nightly/cu126/torch-2.8.0.dev20250507%2Bcu126-cp313-cp313-manylinux_2_28_x86_64.whl.metadata (30 kB) 2025-05-07T19:48:20.5042260Z Collecting filelock (from torch) 2025-05-07T19:48:20.5045388Z Downloading https://download.pytorch.org/whl/nightly/filelock-3.16.1-py3-none-any.whl (16 kB) 2025-05-07T19:48:20.5046415Z Requirement already satisfied: typing-extensions>=4.10.0 in /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages (from torch) (4.13.2) 2025-05-07T19:48:20.5047736Z Requirement already satisfied: setuptools in /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages (from torch) (78.1.1) 2025-05-07T19:48:20.5048475Z Collecting sympy>=1.13.3 (from torch) 2025-05-07T19:48:20.5049024Z Downloading https://download.pytorch.org/whl/nightly/sympy-1.13.3-py3-none-any.whl (6.2 MB) 2025-05-07T19:48:20.5049945Z ━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━ 6.2/6.2 MB 28.5 MB/s eta 0:00:00 2025-05-07T19:48:20.5050430Z Collecting networkx (from torch) 2025-05-07T19:48:20.5051178Z Downloading https://download.pytorch.org/whl/nightly/networkx-3.4.2-py3-none-any.whl (1.7 MB) 2025-05-07T19:48:20.5051871Z ━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━ 1.7/1.7 MB 11.0 MB/s eta 0:00:00 2025-05-07T19:48:20.5052665Z Requirement already satisfied: jinja2 in /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages (from torch) (3.1.6) 2025-05-07T19:48:20.5053367Z Collecting fsspec (from torch) 2025-05-07T19:48:20.5053892Z Downloading https://download.pytorch.org/whl/nightly/fsspec-2024.10.0-py3-none-any.whl (179 kB) 2025-05-07T19:48:20.5054519Z Collecting nvidia-cuda-nvrtc-cu12==12.6.77 (from torch) 2025-05-07T19:48:20.5055288Z Downloading https://download.pytorch.org/whl/nightly/cu126/nvidia_cuda_nvrtc_cu12-12.6.77-py3-none-manylinux2014_x86_64.whl (23.7 MB) 2025-05-07T19:48:20.5056167Z ━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━ 23.7/23.7 MB 43.2 MB/s eta 0:00:00 2025-05-07T19:48:20.5056615Z Collecting nvidia-cuda-runtime-cu12==12.6.77 (from torch) 2025-05-07T19:48:20.5057470Z Downloading https://download.pytorch.org/whl/nightly/cu126/nvidia_cuda_runtime_cu12-12.6.77-py3-none-manylinux2014_x86_64.whl (897 kB) 2025-05-07T19:48:20.5058285Z ━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━ 897.7/897.7 kB 4.9 MB/s eta 0:00:00 2025-05-07T19:48:20.5058666Z Collecting nvidia-cuda-cupti-cu12==12.6.80 (from torch) 2025-05-07T19:48:20.5059377Z Downloading https://download.pytorch.org/whl/nightly/cu126/nvidia_cuda_cupti_cu12-12.6.80-py3-none-manylinux2014_x86_64.whl (8.9 MB) 2025-05-07T19:48:20.5060160Z ━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━ 8.9/8.9 MB 30.6 MB/s eta 0:00:00 2025-05-07T19:48:20.5060522Z Collecting nvidia-cudnn-cu12==9.5.1.17 (from torch) 2025-05-07T19:48:20.5061210Z Downloading https://download.pytorch.org/whl/nightly/cu126/nvidia_cudnn_cu12-9.5.1.17-py3-none-manylinux_2_28_x86_64.whl (571.0 MB) 2025-05-07T19:48:20.5062004Z ━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━ 571.0/571.0 MB 35.3 MB/s eta 0:00:00 2025-05-07T19:48:20.5062426Z Collecting nvidia-cublas-cu12==12.6.4.1 (from torch) 2025-05-07T19:48:20.5063213Z Downloading https://download.pytorch.org/whl/nightly/cu126/nvidia_cublas_cu12-12.6.4.1-py3-none-manylinux2014_x86_64.manylinux_2_17_x86_64.whl (393.1 MB) 2025-05-07T19:48:20.5064329Z ━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━ 393.1/393.1 MB 51.1 MB/s eta 0:00:00 2025-05-07T19:48:20.5064749Z Collecting nvidia-cufft-cu12==11.3.0.4 (from torch) 2025-05-07T19:48:20.5065447Z Downloading https://download.pytorch.org/whl/nightly/cu126/nvidia_cufft_cu12-11.3.0.4-py3-none-manylinux2014_x86_64.whl (200.2 MB) 2025-05-07T19:48:20.5066260Z ━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━ 200.2/200.2 MB 59.8 MB/s eta 0:00:00 2025-05-07T19:48:20.5066655Z Collecting nvidia-curand-cu12==10.3.7.77 (from torch) 2025-05-07T19:48:20.5067465Z Downloading https://download.pytorch.org/whl/nightly/cu126/nvidia_curand_cu12-10.3.7.77-py3-none-manylinux2014_x86_64.whl (56.3 MB) 2025-05-07T19:48:20.5068295Z ━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━ 56.3/56.3 MB 53.2 MB/s eta 0:00:00 2025-05-07T19:48:20.5068892Z Collecting nvidia-cusolver-cu12==11.7.1.2 (from torch) 2025-05-07T19:48:20.5069689Z Downloading https://download.pytorch.org/whl/nightly/cu126/nvidia_cusolver_cu12-11.7.1.2-py3-none-manylinux2014_x86_64.whl (158.2 MB) 2025-05-07T19:48:20.5070525Z ━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━ 158.2/158.2 MB 56.8 MB/s eta 0:00:00 2025-05-07T19:48:20.5070959Z Collecting nvidia-cusparse-cu12==12.5.4.2 (from torch) 2025-05-07T19:48:20.5071771Z Downloading https://download.pytorch.org/whl/nightly/cu126/nvidia_cusparse_cu12-12.5.4.2-py3-none-manylinux2014_x86_64.whl (216.6 MB) 2025-05-07T19:48:20.5072589Z ━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━ 216.6/216.6 MB 62.7 MB/s eta 0:00:00 2025-05-07T19:48:20.5073008Z Collecting nvidia-cusparselt-cu12==0.6.3 (from torch) 2025-05-07T19:48:20.5073737Z Downloading https://download.pytorch.org/whl/nightly/cu126/nvidia_cusparselt_cu12-0.6.3-py3-none-manylinux2014_x86_64.whl (156.8 MB) 2025-05-07T19:48:20.5074563Z ━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━ 156.8/156.8 MB 60.5 MB/s eta 0:00:00 2025-05-07T19:48:20.5074931Z Collecting nvidia-nccl-cu12==2.26.2 (from torch) 2025-05-07T19:48:20.5075764Z Downloading https://download.pytorch.org/whl/nightly/cu126/nvidia_nccl_cu12-2.26.2-py3-none-manylinux2014_x86_64.manylinux_2_17_x86_64.whl.metadata (2.0 kB) 2025-05-07T19:48:20.5076693Z Collecting nvidia-nvtx-cu12==12.6.77 (from torch) 2025-05-07T19:48:20.5077343Z Downloading https://download.pytorch.org/whl/nightly/cu126/nvidia_nvtx_cu12-12.6.77-py3-none-manylinux2014_x86_64.whl (89 kB) 2025-05-07T19:48:20.5078026Z Collecting nvidia-nvjitlink-cu12==12.6.85 (from torch) 2025-05-07T19:48:20.5078800Z Downloading https://download.pytorch.org/whl/nightly/cu126/nvidia_nvjitlink_cu12-12.6.85-py3-none-manylinux2010_x86_64.manylinux_2_12_x86_64.whl (19.7 MB) 2025-05-07T19:48:20.5079684Z ━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━ 19.7/19.7 MB 53.9 MB/s eta 0:00:00 2025-05-07T19:48:20.5080071Z Collecting nvidia-cufile-cu12==1.11.1.6 (from torch) 2025-05-07T19:48:20.5080866Z Downloading https://download.pytorch.org/whl/nightly/cu126/nvidia_cufile_cu12-1.11.1.6-py3-none-manylinux2014_x86_64.manylinux_2_17_x86_64.whl.metadata (1.5 kB) 2025-05-07T19:48:20.5081706Z Collecting pytorch-triton==3.3.0+git96316ce5 (from torch) 2025-05-07T19:48:20.5082542Z Downloading https://download.pytorch.org/whl/nightly/pytorch_triton-3.3.0%2Bgit96316ce5-cp313-cp313-manylinux_2_27_x86_64.manylinux_2_28_x86_64.whl.metadata (1.6 kB) 2025-05-07T19:48:20.5083397Z Collecting mpmath<1.4,>=1.1.0 (from sympy>=1.13.3->torch) 2025-05-07T19:48:20.5083982Z Downloading https://download.pytorch.org/whl/nightly/mpmath-1.3.0-py3-none-any.whl (536 kB) 2025-05-07T19:48:20.5084618Z ━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━ 536.2/536.2 kB 2.2 MB/s eta 0:00:00 2025-05-07T19:48:20.5085469Z Requirement already satisfied: MarkupSafe>=2.0 in /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages (from jinja2->torch) (3.0.2) 2025-05-07T19:48:20.5086574Z Downloading https://download.pytorch.org/whl/nightly/cu126/torch-2.8.0.dev20250507%2Bcu126-cp313-cp313-manylinux_2_28_x86_64.whl (825.4 MB) 2025-05-07T19:48:20.5087418Z ━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━ 825.4/825.4 MB 29.8 MB/s eta 0:00:00 2025-05-07T19:48:20.5088320Z Downloading https://download.pytorch.org/whl/nightly/cu126/nvidia_cufile_cu12-1.11.1.6-py3-none-manylinux2014_x86_64.manylinux_2_17_x86_64.whl (1.1 MB) 2025-05-07T19:48:20.5089187Z ━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━ 1.1/1.1 MB 7.4 MB/s eta 0:00:00 2025-05-07T19:48:20.5089976Z Downloading https://download.pytorch.org/whl/nightly/cu126/nvidia_nccl_cu12-2.26.2-py3-none-manylinux2014_x86_64.manylinux_2_17_x86_64.whl (201.3 MB) 2025-05-07T19:48:20.5091167Z ━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━ 201.3/201.3 MB 55.4 MB/s eta 0:00:00 2025-05-07T19:48:20.5092078Z Downloading https://download.pytorch.org/whl/nightly/pytorch_triton-3.3.0%2Bgit96316ce5-cp313-cp313-manylinux_2_27_x86_64.manylinux_2_28_x86_64.whl (153.5 MB) 2025-05-07T19:48:20.5093078Z ━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━ 153.5/153.5 MB 59.5 MB/s eta 0:00:00 2025-05-07T19:48:20.5094916Z Installing collected packages: nvidia-cusparselt-cu12, mpmath, sympy, pytorch-triton, nvidia-nvtx-cu12, nvidia-nvjitlink-cu12, nvidia-nccl-cu12, nvidia-curand-cu12, nvidia-cufile-cu12, nvidia-cuda-runtime-cu12, nvidia-cuda-nvrtc-cu12, nvidia-cuda-cupti-cu12, nvidia-cublas-cu12, networkx, fsspec, filelock, nvidia-cusparse-cu12, nvidia-cufft-cu12, nvidia-cudnn-cu12, nvidia-cusolver-cu12, torch 2025-05-07T19:48:20.5096623Z 2025-05-07T19:48:20.5098723Z Successfully installed filelock-3.16.1 fsspec-2024.10.0 mpmath-1.3.0 networkx-3.4.2 nvidia-cublas-cu12-12.6.4.1 nvidia-cuda-cupti-cu12-12.6.80 nvidia-cuda-nvrtc-cu12-12.6.77 nvidia-cuda-runtime-cu12-12.6.77 nvidia-cudnn-cu12-9.5.1.17 nvidia-cufft-cu12-11.3.0.4 nvidia-cufile-cu12-1.11.1.6 nvidia-curand-cu12-10.3.7.77 nvidia-cusolver-cu12-11.7.1.2 nvidia-cusparse-cu12-12.5.4.2 nvidia-cusparselt-cu12-0.6.3 nvidia-nccl-cu12-2.26.2 nvidia-nvjitlink-cu12-12.6.85 nvidia-nvtx-cu12-12.6.77 pytorch-triton-3.3.0+git96316ce5 sympy-1.13.3 torch-2.8.0.dev20250507+cu126 2025-05-07T19:48:20.5100931Z 2025-05-07T19:48:22.6335947Z torch 2.8.0.dev20250507+cu126 2025-05-07T19:48:22.6336612Z [CHECK] The installed package [torch, nightly/LATEST] is the correct variant (cu126) 2025-05-07T19:48:25.8489629Z [CHECK] Python (sub-)package 'torch.distributed' found ... 2025-05-07T19:48:29.0463005Z [CHECK] NOTE: The installed version is: 2.8.0.dev20250507+cu126 2025-05-07T19:48:29.0464314Z [CHECK] NOTE: Checking _GLIBCXX_USE_CXX11_ABI ... 2025-05-07T19:48:32.1823069Z True 2025-05-07T19:48:32.1823687Z True 2025-05-07T19:48:32.1824023Z 2025-05-07T19:48:32.2402029Z [INSTALL] Successfully installed PyTorch through PyTorch PIP 2025-05-07T19:48:32.2478673Z ##[group]Run if . $PRELUDE && which conda; then collect_pytorch_env_info $BUILD_ENV; fi 2025-05-07T19:48:32.2479312Z if . $PRELUDE && which conda; then collect_pytorch_env_info $BUILD_ENV; fi 2025-05-07T19:48:32.2480191Z shell: bash --noprofile --norc -e -o pipefail {0} 2025-05-07T19:48:32.2480503Z env: 2025-05-07T19:48:32.2480714Z PRELUDE: .github/scripts/setup_env.bash 2025-05-07T19:48:32.2481014Z BUILD_ENV: build_binary 2025-05-07T19:48:32.2481256Z BUILD_TARGET: default 2025-05-07T19:48:32.2481471Z BUILD_VARIANT: cuda 2025-05-07T19:48:32.2481706Z BUILD_CUDA_VERSION: 12.6.3 2025-05-07T19:48:32.2481961Z ##[endgroup] 2025-05-07T19:48:32.6995816Z /github/home/miniconda/bin/conda 2025-05-07T19:48:32.6996805Z ################################################################################ 2025-05-07T19:48:32.6998031Z # Collect PyTorch Environment Information (for Reporting Issues) 2025-05-07T19:48:32.6999136Z # 2025-05-07T19:48:32.7019434Z # [2025-05-07T19:48:32.701Z] + collect_pytorch_env_info build_binary 2025-05-07T19:48:32.7019962Z ################################################################################ 2025-05-07T19:48:32.7020199Z 2025-05-07T19:48:32.7037510Z [EXEC] [ATTEMPT 0/3] + wget -q --timeout 1 pypi.org -O /dev/null 2025-05-07T19:48:32.7916614Z [CHECK] Network does not appear to be blocked. 2025-05-07T19:48:32.7919776Z [INFO] Downloading the PyTorch environment info collection script ... 2025-05-07T19:48:32.7920455Z + wget -q https://raw.githubusercontent.com/pytorch/pytorch/main/torch/utils/collect_env.py 2025-05-07T19:48:32.7920885Z 2025-05-07T19:48:32.8750572Z 2025-05-07T19:48:32.8751144Z [INFO] Collecting PyTorch environment info (will be needed for reporting issues to PyTorch) ... 2025-05-07T19:48:32.8772136Z [EXEC] [ATTEMPT 0/3] + conda run -n build_binary python collect_env.py 2025-05-07T19:48:38.2343422Z Collecting environment information... 2025-05-07T19:48:38.2344645Z PyTorch version: 2.8.0.dev20250507+cu126 2025-05-07T19:48:38.2345637Z Is debug build: False 2025-05-07T19:48:38.2346370Z CUDA used to build PyTorch: 12.6 2025-05-07T19:48:38.2347232Z ROCM used to build PyTorch: N/A 2025-05-07T19:48:38.2347756Z 2025-05-07T19:48:38.2348082Z OS: Amazon Linux 2023.7.20250428 (x86_64) 2025-05-07T19:48:38.2349038Z GCC version: (conda-forge gcc 11.4.0-13) 11.4.0 2025-05-07T19:48:38.2349826Z Clang version: Could not collect 2025-05-07T19:48:38.2350133Z CMake version: version 4.0.2 2025-05-07T19:48:38.2350453Z Libc version: glibc-2.34 2025-05-07T19:48:38.2350627Z 2025-05-07T19:48:38.2350964Z Python version: 3.13.2 | packaged by conda-forge | (main, Feb 17 2025, 14:10:22) [GCC 13.3.0] (64-bit runtime) 2025-05-07T19:48:38.2351670Z Python platform: Linux-6.1.130-139.222.amzn2023.x86_64-x86_64-with-glibc2.34 2025-05-07T19:48:38.2352130Z Is CUDA available: False 2025-05-07T19:48:38.2352441Z CUDA runtime version: 12.6.85 2025-05-07T19:48:38.2352764Z CUDA_MODULE_LOADING set to: N/A 2025-05-07T19:48:38.2353100Z GPU models and configuration: Could not collect 2025-05-07T19:48:38.2353498Z Nvidia driver version: Could not collect 2025-05-07T19:48:38.2353829Z cuDNN version: Could not collect 2025-05-07T19:48:38.2354154Z HIP runtime version: N/A 2025-05-07T19:48:38.2354430Z MIOpen runtime version: N/A 2025-05-07T19:48:38.2354734Z Is XNNPACK available: True 2025-05-07T19:48:38.2354908Z 2025-05-07T19:48:38.2354994Z CPU: 2025-05-07T19:48:38.2355246Z Architecture: x86_64 2025-05-07T19:48:38.2355636Z CPU op-mode(s): 32-bit, 64-bit 2025-05-07T19:48:38.2356055Z Address sizes: 46 bits physical, 48 bits virtual 2025-05-07T19:48:38.2356622Z Byte Order: Little Endian 2025-05-07T19:48:38.2356962Z CPU(s): 96 2025-05-07T19:48:38.2357449Z On-line CPU(s) list: 0-95 2025-05-07T19:48:38.2357775Z Vendor ID: GenuineIntel 2025-05-07T19:48:38.2358520Z Model name: Intel(R) Xeon(R) Platinum 8275CL CPU @ 3.00GHz 2025-05-07T19:48:38.2358928Z CPU family: 6 2025-05-07T19:48:38.2359225Z Model: 85 2025-05-07T19:48:38.2359524Z Thread(s) per core: 2 2025-05-07T19:48:38.2359803Z Core(s) per socket: 24 2025-05-07T19:48:38.2360096Z Socket(s): 2 2025-05-07T19:48:38.2360366Z Stepping: 7 2025-05-07T19:48:38.2360669Z BogoMIPS: 5999.98 2025-05-07T19:48:38.2362862Z Flags: fpu vme de pse tsc msr pae mce cx8 apic sep mtrr pge mca cmov pat pse36 clflush mmx fxsr sse sse2 ss ht syscall nx pdpe1gb rdtscp lm constant_tsc arch_perfmon rep_good nopl xtopology nonstop_tsc cpuid aperfmperf tsc_known_freq pni pclmulqdq monitor ssse3 fma cx16 pcid sse4_1 sse4_2 x2apic movbe popcnt tsc_deadline_timer aes xsave avx f16c rdrand hypervisor lahf_lm abm 3dnowprefetch invpcid_single pti fsgsbase tsc_adjust bmi1 avx2 smep bmi2 erms invpcid mpx avx512f avx512dq rdseed adx smap clflushopt clwb avx512cd avx512bw avx512vl xsaveopt xsavec xgetbv1 xsaves ida arat pku ospke avx512_vnni 2025-05-07T19:48:38.2365240Z Hypervisor vendor: KVM 2025-05-07T19:48:38.2365542Z Virtualization type: full 2025-05-07T19:48:38.2365887Z L1d cache: 1.5 MiB (48 instances) 2025-05-07T19:48:38.2366259Z L1i cache: 1.5 MiB (48 instances) 2025-05-07T19:48:38.2366641Z L2 cache: 48 MiB (48 instances) 2025-05-07T19:48:38.2366980Z L3 cache: 71.5 MiB (2 instances) 2025-05-07T19:48:38.2367306Z NUMA node(s): 2 2025-05-07T19:48:38.2367610Z NUMA node0 CPU(s): 0-23,48-71 2025-05-07T19:48:38.2367945Z NUMA node1 CPU(s): 24-47,72-95 2025-05-07T19:48:38.2368399Z Vulnerability Gather data sampling: Unknown: Dependent on hypervisor status 2025-05-07T19:48:38.2368938Z Vulnerability Itlb multihit: KVM: Mitigation: VMX unsupported 2025-05-07T19:48:38.2369428Z Vulnerability L1tf: Mitigation; PTE Inversion 2025-05-07T19:48:38.2370004Z Vulnerability Mds: Vulnerable: Clear CPU buffers attempted, no microcode; SMT Host state unknown 2025-05-07T19:48:38.2370878Z Vulnerability Meltdown: Mitigation; PTI 2025-05-07T19:48:38.2371498Z Vulnerability Mmio stale data: Vulnerable: Clear CPU buffers attempted, no microcode; SMT Host state unknown 2025-05-07T19:48:38.2372129Z Vulnerability Reg file data sampling: Not affected 2025-05-07T19:48:38.2372502Z Vulnerability Retbleed: Vulnerable 2025-05-07T19:48:38.2372896Z Vulnerability Spec rstack overflow: Not affected 2025-05-07T19:48:38.2373293Z Vulnerability Spec store bypass: Vulnerable 2025-05-07T19:48:38.2373860Z Vulnerability Spectre v1: Mitigation; usercopy/swapgs barriers and __user pointer sanitization 2025-05-07T19:48:38.2374728Z Vulnerability Spectre v2: Mitigation; Retpolines; STIBP disabled; RSB filling; PBRSB-eIBRS Not affected; BHI Retpoline 2025-05-07T19:48:38.2375382Z Vulnerability Srbds: Not affected 2025-05-07T19:48:38.2375777Z Vulnerability Tsx async abort: Not affected 2025-05-07T19:48:38.2376025Z 2025-05-07T19:48:38.2376132Z Versions of relevant libraries: 2025-05-07T19:48:38.2376416Z [pip3] numpy==2.2.5 2025-05-07T19:48:38.2376672Z [pip3] nvidia-cublas-cu12==12.6.4.1 2025-05-07T19:48:38.2377089Z [pip3] nvidia-cuda-cupti-cu12==12.6.80 2025-05-07T19:48:38.2377397Z [pip3] nvidia-cuda-nvrtc-cu12==12.6.77 2025-05-07T19:48:38.2377695Z [pip3] nvidia-cuda-runtime-cu12==12.6.77 2025-05-07T19:48:38.2378008Z [pip3] nvidia-cudnn-cu12==9.5.1.17 2025-05-07T19:48:38.2378286Z [pip3] nvidia-cufft-cu12==11.3.0.4 2025-05-07T19:48:38.2378575Z [pip3] nvidia-curand-cu12==10.3.7.77 2025-05-07T19:48:38.2378858Z [pip3] nvidia-cusolver-cu12==11.7.1.2 2025-05-07T19:48:38.2379161Z [pip3] nvidia-cusparse-cu12==12.5.4.2 2025-05-07T19:48:38.2379601Z [pip3] nvidia-cusparselt-cu12==0.6.3 2025-05-07T19:48:38.2379894Z [pip3] nvidia-nccl-cu12==2.26.2 2025-05-07T19:48:38.2380192Z [pip3] nvidia-nvjitlink-cu12==12.6.85 2025-05-07T19:48:38.2380479Z [pip3] nvidia-nvtx-cu12==12.6.77 2025-05-07T19:48:38.2380777Z [pip3] pytorch-triton==3.3.0+git96316ce5 2025-05-07T19:48:38.2381068Z [pip3] torch==2.8.0.dev20250507+cu126 2025-05-07T19:48:38.2381444Z [conda] cuda-cudart 12.6.77 h5888daf_0 conda-forge 2025-05-07T19:48:38.2381918Z [conda] cuda-cudart-dev 12.6.77 h5888daf_0 conda-forge 2025-05-07T19:48:38.2382628Z [conda] cuda-cudart-dev_linux-64 12.6.77 h3f2d84a_0 conda-forge 2025-05-07T19:48:38.2383182Z [conda] cuda-cudart-static 12.6.77 h5888daf_0 conda-forge 2025-05-07T19:48:38.2383731Z [conda] cuda-cudart-static_linux-64 12.6.77 h3f2d84a_0 conda-forge 2025-05-07T19:48:38.2384297Z [conda] cuda-cudart_linux-64 12.6.77 h3f2d84a_0 conda-forge 2025-05-07T19:48:38.2384864Z [conda] cuda-cupti 12.6.80 hbd13f7d_0 conda-forge 2025-05-07T19:48:38.2385354Z [conda] cuda-cupti-dev 12.6.80 h5888daf_0 conda-forge 2025-05-07T19:48:38.2385845Z [conda] cuda-libraries 12.6.3 ha770c72_0 conda-forge 2025-05-07T19:48:38.2386364Z [conda] cuda-libraries-dev 12.6.3 ha770c72_0 conda-forge 2025-05-07T19:48:38.2386874Z [conda] cuda-nvrtc 12.6.85 hbd13f7d_0 conda-forge 2025-05-07T19:48:38.2387344Z [conda] cuda-nvrtc-dev 12.6.85 h5888daf_0 conda-forge 2025-05-07T19:48:38.2387831Z [conda] cuda-nvtx 12.6.77 hbd13f7d_0 conda-forge 2025-05-07T19:48:38.2388291Z [conda] cuda-opencl 12.6.77 hbd13f7d_0 conda-forge 2025-05-07T19:48:38.2388790Z [conda] cuda-opencl-dev 12.6.77 h5888daf_0 conda-forge 2025-05-07T19:48:38.2389306Z [conda] cuda-runtime 12.6.3 ha804496_0 conda-forge 2025-05-07T19:48:38.2389770Z [conda] libcublas 12.6.4.1 h5888daf_1 conda-forge 2025-05-07T19:48:38.2390260Z [conda] libcublas-dev 12.6.4.1 h5888daf_1 conda-forge 2025-05-07T19:48:38.2390727Z [conda] libcufft 11.3.0.4 hbd13f7d_0 conda-forge 2025-05-07T19:48:38.2391204Z [conda] libcufft-dev 11.3.0.4 h5888daf_0 conda-forge 2025-05-07T19:48:38.2391672Z [conda] libcurand 10.3.7.77 hbd13f7d_0 conda-forge 2025-05-07T19:48:38.2392160Z [conda] libcurand-dev 10.3.7.77 h5888daf_0 conda-forge 2025-05-07T19:48:38.2392655Z [conda] libcusolver 11.7.1.2 h5888daf_1 conda-forge 2025-05-07T19:48:38.2393143Z [conda] libcusolver-dev 11.7.1.2 h5888daf_1 conda-forge 2025-05-07T19:48:38.2393656Z [conda] libcusparse 12.5.4.2 hbd13f7d_0 conda-forge 2025-05-07T19:48:38.2394262Z [conda] libcusparse-dev 12.5.4.2 h5888daf_0 conda-forge 2025-05-07T19:48:38.2394744Z [conda] libnvjitlink 12.6.85 hbd13f7d_0 conda-forge 2025-05-07T19:48:38.2395232Z [conda] libnvjitlink-dev 12.6.85 h5888daf_0 conda-forge 2025-05-07T19:48:38.2395680Z [conda] numpy 2.2.5 py313h17eae1a_0 conda-forge 2025-05-07T19:48:38.2396141Z [conda] nvidia-cublas-cu12 12.6.4.1 pypi_0 pypi 2025-05-07T19:48:38.2396626Z [conda] nvidia-cuda-cupti-cu12 12.6.80 pypi_0 pypi 2025-05-07T19:48:38.2397131Z [conda] nvidia-cuda-nvrtc-cu12 12.6.77 pypi_0 pypi 2025-05-07T19:48:38.2397621Z [conda] nvidia-cuda-runtime-cu12 12.6.77 pypi_0 pypi 2025-05-07T19:48:38.2398120Z [conda] nvidia-cudnn-cu12 9.5.1.17 pypi_0 pypi 2025-05-07T19:48:38.2398688Z [conda] nvidia-cufft-cu12 11.3.0.4 pypi_0 pypi 2025-05-07T19:48:38.2399157Z [conda] nvidia-curand-cu12 10.3.7.77 pypi_0 pypi 2025-05-07T19:48:38.2399647Z [conda] nvidia-cusolver-cu12 11.7.1.2 pypi_0 pypi 2025-05-07T19:48:38.2400126Z [conda] nvidia-cusparse-cu12 12.5.4.2 pypi_0 pypi 2025-05-07T19:48:38.2400631Z [conda] nvidia-cusparselt-cu12 0.6.3 pypi_0 pypi 2025-05-07T19:48:38.2401108Z [conda] nvidia-nccl-cu12 2.26.2 pypi_0 pypi 2025-05-07T19:48:38.2401596Z [conda] nvidia-nvjitlink-cu12 12.6.85 pypi_0 pypi 2025-05-07T19:48:38.2402080Z [conda] nvidia-nvtx-cu12 12.6.77 pypi_0 pypi 2025-05-07T19:48:38.2402544Z [conda] pytorch-triton 3.3.0+git96316ce5 pypi_0 pypi 2025-05-07T19:48:38.2403013Z [conda] torch 2.8.0.dev20250507+cu126 pypi_0 pypi 2025-05-07T19:48:38.2403350Z 2025-05-07T19:48:38.3219653Z ##[group]Run . $PRELUDE; install_cudnn $BUILD_ENV "$(pwd)/build_only/cudnn" 12.6.3 2025-05-07T19:48:38.3220322Z . $PRELUDE; install_cudnn $BUILD_ENV "$(pwd)/build_only/cudnn" 12.6.3 2025-05-07T19:48:38.3220928Z shell: bash --noprofile --norc -e -o pipefail {0} 2025-05-07T19:48:38.3221289Z env: 2025-05-07T19:48:38.3221564Z PRELUDE: .github/scripts/setup_env.bash 2025-05-07T19:48:38.3221873Z BUILD_ENV: build_binary 2025-05-07T19:48:38.3222150Z BUILD_TARGET: default 2025-05-07T19:48:38.3222390Z BUILD_VARIANT: cuda 2025-05-07T19:48:38.3222659Z BUILD_CUDA_VERSION: 12.6.3 2025-05-07T19:48:38.3222912Z ##[endgroup] 2025-05-07T19:48:38.7895608Z ################################################################################ 2025-05-07T19:48:38.7896010Z # Install cuDNN 2025-05-07T19:48:38.7896253Z # 2025-05-07T19:48:38.7913157Z # [2025-05-07T19:48:38.790Z] + install_cudnn build_binary /__w/FBGEMM/FBGEMM/build_only/cudnn 12.6.3 2025-05-07T19:48:38.7914889Z ################################################################################ 2025-05-07T19:48:38.7915556Z 2025-05-07T19:48:38.7933690Z [EXEC] [ATTEMPT 0/3] + wget -q --timeout 1 pypi.org -O /dev/null 2025-05-07T19:48:38.8793513Z [CHECK] Network does not appear to be blocked. 2025-05-07T19:48:38.8793972Z [INSTALL] cuda_concat_version is determined to be: 126 2025-05-07T19:48:38.8794397Z + rm -rf /__w/FBGEMM/FBGEMM/build_only/cudnn 2025-05-07T19:48:38.8794645Z 2025-05-07T19:48:38.8812067Z 2025-05-07T19:48:38.8812325Z + mkdir -p /__w/FBGEMM/FBGEMM/build_only/cudnn 2025-05-07T19:48:38.8812604Z 2025-05-07T19:48:38.8830296Z 2025-05-07T19:48:38.8855825Z [INSTALL] Downloading cuDNN to /tmp/tmp.pSztf9l9cU ... 2025-05-07T19:48:38.8881208Z [EXEC] [ATTEMPT 0/3] + wget -q https://developer.download.nvidia.com/compute/cudnn/redist/cudnn/linux-x86_64/cudnn-linux-x86_64-9.5.1.17_cuda12-archive.tar.xz -O cudnn.tar.xz 2025-05-07T19:48:44.7685915Z [INSTALL] Unpacking cuDNN ... 2025-05-07T19:48:44.7686348Z + tar -xvf cudnn.tar.xz 2025-05-07T19:48:44.7686531Z 2025-05-07T19:48:44.7714393Z cudnn-linux-x86_64-9.5.1.17_cuda12-archive/ 2025-05-07T19:48:44.7715402Z cudnn-linux-x86_64-9.5.1.17_cuda12-archive/lib/ 2025-05-07T19:48:44.7715876Z cudnn-linux-x86_64-9.5.1.17_cuda12-archive/lib/libcudnn_adv_static_v9.a 2025-05-07T19:48:49.2818304Z cudnn-linux-x86_64-9.5.1.17_cuda12-archive/lib/libcudnn_cnn_static_v9.a 2025-05-07T19:48:49.3435775Z cudnn-linux-x86_64-9.5.1.17_cuda12-archive/lib/libcudnn_engines_precompiled_static_v9.a 2025-05-07T19:48:56.7151637Z cudnn-linux-x86_64-9.5.1.17_cuda12-archive/lib/libcudnn_engines_runtime_compiled_static_v9.a 2025-05-07T19:48:56.9562855Z cudnn-linux-x86_64-9.5.1.17_cuda12-archive/lib/libcudnn_graph_static_v9.a 2025-05-07T19:48:56.9936695Z cudnn-linux-x86_64-9.5.1.17_cuda12-archive/lib/libcudnn_heuristic_static_v9.a 2025-05-07T19:48:57.5259749Z cudnn-linux-x86_64-9.5.1.17_cuda12-archive/lib/libcudnn_ops_static_v9.a 2025-05-07T19:48:59.5950373Z cudnn-linux-x86_64-9.5.1.17_cuda12-archive/lib/libcudnn_adv_static.a 2025-05-07T19:48:59.5951955Z cudnn-linux-x86_64-9.5.1.17_cuda12-archive/lib/libcudnn_cnn_static.a 2025-05-07T19:48:59.5953617Z cudnn-linux-x86_64-9.5.1.17_cuda12-archive/lib/libcudnn_engines_precompiled_static.a 2025-05-07T19:48:59.5955510Z cudnn-linux-x86_64-9.5.1.17_cuda12-archive/lib/libcudnn_engines_runtime_compiled_static.a 2025-05-07T19:48:59.5956613Z cudnn-linux-x86_64-9.5.1.17_cuda12-archive/lib/libcudnn_graph_static.a 2025-05-07T19:48:59.5957169Z cudnn-linux-x86_64-9.5.1.17_cuda12-archive/lib/libcudnn_heuristic_static.a 2025-05-07T19:48:59.5957698Z cudnn-linux-x86_64-9.5.1.17_cuda12-archive/lib/libcudnn_ops_static.a 2025-05-07T19:48:59.5958193Z cudnn-linux-x86_64-9.5.1.17_cuda12-archive/lib/libcudnn.so 2025-05-07T19:48:59.5958656Z cudnn-linux-x86_64-9.5.1.17_cuda12-archive/lib/libcudnn.so.9 2025-05-07T19:48:59.5959130Z cudnn-linux-x86_64-9.5.1.17_cuda12-archive/lib/libcudnn.so.9.5.1 2025-05-07T19:48:59.5960292Z cudnn-linux-x86_64-9.5.1.17_cuda12-archive/lib/libcudnn_adv.so 2025-05-07T19:48:59.5961445Z cudnn-linux-x86_64-9.5.1.17_cuda12-archive/lib/libcudnn_adv.so.9 2025-05-07T19:48:59.5961949Z cudnn-linux-x86_64-9.5.1.17_cuda12-archive/lib/libcudnn_adv.so.9.5.1 2025-05-07T19:49:04.0955824Z cudnn-linux-x86_64-9.5.1.17_cuda12-archive/lib/libcudnn_cnn.so 2025-05-07T19:49:04.0956374Z cudnn-linux-x86_64-9.5.1.17_cuda12-archive/lib/libcudnn_cnn.so.9.5.1 2025-05-07T19:49:04.1571844Z cudnn-linux-x86_64-9.5.1.17_cuda12-archive/lib/libcudnn_cnn.so.9 2025-05-07T19:49:04.1573550Z cudnn-linux-x86_64-9.5.1.17_cuda12-archive/lib/libcudnn_engines_precompiled.so.9.5.1 2025-05-07T19:49:11.3238530Z cudnn-linux-x86_64-9.5.1.17_cuda12-archive/lib/libcudnn_engines_precompiled.so.9 2025-05-07T19:49:11.3239156Z cudnn-linux-x86_64-9.5.1.17_cuda12-archive/lib/libcudnn_engines_precompiled.so 2025-05-07T19:49:11.3239758Z cudnn-linux-x86_64-9.5.1.17_cuda12-archive/lib/libcudnn_engines_runtime_compiled.so 2025-05-07T19:49:11.3240393Z cudnn-linux-x86_64-9.5.1.17_cuda12-archive/lib/libcudnn_engines_runtime_compiled.so.9.5.1 2025-05-07T19:49:11.5137158Z cudnn-linux-x86_64-9.5.1.17_cuda12-archive/lib/libcudnn_engines_runtime_compiled.so.9 2025-05-07T19:49:11.5137742Z cudnn-linux-x86_64-9.5.1.17_cuda12-archive/lib/libcudnn_graph.so.9 2025-05-07T19:49:11.5138265Z cudnn-linux-x86_64-9.5.1.17_cuda12-archive/lib/libcudnn_graph.so 2025-05-07T19:49:11.5138770Z cudnn-linux-x86_64-9.5.1.17_cuda12-archive/lib/libcudnn_graph.so.9.5.1 2025-05-07T19:49:11.5490372Z cudnn-linux-x86_64-9.5.1.17_cuda12-archive/lib/libcudnn_heuristic.so.9.5.1 2025-05-07T19:49:12.0740821Z cudnn-linux-x86_64-9.5.1.17_cuda12-archive/lib/libcudnn_heuristic.so.9 2025-05-07T19:49:12.0741414Z cudnn-linux-x86_64-9.5.1.17_cuda12-archive/lib/libcudnn_heuristic.so 2025-05-07T19:49:12.0741920Z cudnn-linux-x86_64-9.5.1.17_cuda12-archive/lib/libcudnn_ops.so.9 2025-05-07T19:49:12.0742426Z cudnn-linux-x86_64-9.5.1.17_cuda12-archive/lib/libcudnn_ops.so 2025-05-07T19:49:12.0742939Z cudnn-linux-x86_64-9.5.1.17_cuda12-archive/lib/libcudnn_ops.so.9.5.1 2025-05-07T19:49:14.1371628Z cudnn-linux-x86_64-9.5.1.17_cuda12-archive/include/ 2025-05-07T19:49:14.1372129Z cudnn-linux-x86_64-9.5.1.17_cuda12-archive/include/cudnn_v9.h 2025-05-07T19:49:14.1372644Z cudnn-linux-x86_64-9.5.1.17_cuda12-archive/include/cudnn_adv_v9.h 2025-05-07T19:49:14.1373141Z cudnn-linux-x86_64-9.5.1.17_cuda12-archive/include/cudnn_backend_v9.h 2025-05-07T19:49:14.1373658Z cudnn-linux-x86_64-9.5.1.17_cuda12-archive/include/cudnn_cnn_v9.h 2025-05-07T19:49:14.1374154Z cudnn-linux-x86_64-9.5.1.17_cuda12-archive/include/cudnn_graph_v9.h 2025-05-07T19:49:14.1374678Z cudnn-linux-x86_64-9.5.1.17_cuda12-archive/include/cudnn_ops_v9.h 2025-05-07T19:49:14.1377958Z cudnn-linux-x86_64-9.5.1.17_cuda12-archive/include/cudnn_version_v9.h 2025-05-07T19:49:14.1378457Z cudnn-linux-x86_64-9.5.1.17_cuda12-archive/include/cudnn.h 2025-05-07T19:49:14.1378954Z cudnn-linux-x86_64-9.5.1.17_cuda12-archive/include/cudnn_adv.h 2025-05-07T19:49:14.1379470Z cudnn-linux-x86_64-9.5.1.17_cuda12-archive/include/cudnn_backend.h 2025-05-07T19:49:14.1379990Z cudnn-linux-x86_64-9.5.1.17_cuda12-archive/include/cudnn_cnn.h 2025-05-07T19:49:14.1380518Z cudnn-linux-x86_64-9.5.1.17_cuda12-archive/include/cudnn_graph.h 2025-05-07T19:49:14.1381000Z cudnn-linux-x86_64-9.5.1.17_cuda12-archive/include/cudnn_ops.h 2025-05-07T19:49:14.1381505Z cudnn-linux-x86_64-9.5.1.17_cuda12-archive/include/cudnn_version.h 2025-05-07T19:49:14.1381947Z cudnn-linux-x86_64-9.5.1.17_cuda12-archive/LICENSE 2025-05-07T19:49:14.1391731Z 2025-05-07T19:49:14.1392217Z [INSTALL] Moving cuDNN files to /__w/FBGEMM/FBGEMM/build_only/cudnn ... 2025-05-07T19:49:14.1392714Z + rm -rf /__w/FBGEMM/FBGEMM/build_only/cudnn/include 2025-05-07T19:49:14.1392958Z 2025-05-07T19:49:14.1411817Z 2025-05-07T19:49:14.1412678Z + rm -rf /__w/FBGEMM/FBGEMM/build_only/cudnn/lib 2025-05-07T19:49:14.1413427Z 2025-05-07T19:49:14.1424002Z 2025-05-07T19:49:14.1425252Z + mv cudnn-linux-x86_64-9.5.1.17_cuda12-archive/include /__w/FBGEMM/FBGEMM/build_only/cudnn 2025-05-07T19:49:14.1427375Z 2025-05-07T19:49:14.1452936Z 2025-05-07T19:49:14.1455338Z + mv cudnn-linux-x86_64-9.5.1.17_cuda12-archive/lib /__w/FBGEMM/FBGEMM/build_only/cudnn 2025-05-07T19:49:14.1456549Z 2025-05-07T19:49:15.0929451Z 2025-05-07T19:49:15.0929970Z /__w/FBGEMM/FBGEMM 2025-05-07T19:49:15.0930881Z + rm -rf /tmp/tmp.pSztf9l9cU 2025-05-07T19:49:15.0931138Z 2025-05-07T19:49:15.5935726Z 2025-05-07T19:49:15.5944021Z [INSTALL] Set environment variables CUDNN_INCLUDE_DIR and CUDNN_LIBRARY ... 2025-05-07T19:49:15.5946735Z + conda env config vars set -n build_binary CUDNN_INCLUDE_DIR=/__w/FBGEMM/FBGEMM/build_only/cudnn/include CUDNN_LIBRARY=/__w/FBGEMM/FBGEMM/build_only/cudnn/lib 2025-05-07T19:49:15.5948445Z 2025-05-07T19:49:16.0074204Z 2025-05-07T19:49:16.0074662Z [INSTALL] Successfully installed cuDNN (for CUDA 12.6.3) 2025-05-07T19:49:16.0145512Z ##[group]Run . $PRELUDE; cd fbgemm_gpu; prepare_fbgemm_gpu_build $BUILD_ENV 2025-05-07T19:49:16.0146116Z . $PRELUDE; cd fbgemm_gpu; prepare_fbgemm_gpu_build $BUILD_ENV 2025-05-07T19:49:16.0146802Z shell: bash --noprofile --norc -e -o pipefail {0} 2025-05-07T19:49:16.0147203Z env: 2025-05-07T19:49:16.0147457Z PRELUDE: .github/scripts/setup_env.bash 2025-05-07T19:49:16.0147810Z BUILD_ENV: build_binary 2025-05-07T19:49:16.0148074Z BUILD_TARGET: default 2025-05-07T19:49:16.0148360Z BUILD_VARIANT: cuda 2025-05-07T19:49:16.0148617Z BUILD_CUDA_VERSION: 12.6.3 2025-05-07T19:49:16.0148914Z ##[endgroup] 2025-05-07T19:49:16.4103303Z ################################################################################ 2025-05-07T19:49:16.4104417Z # Prepare FBGEMM-GPU Build 2025-05-07T19:49:16.4105174Z # 2025-05-07T19:49:16.4120389Z # [2025-05-07T19:49:16.411Z] + prepare_fbgemm_gpu_build build_binary 2025-05-07T19:49:16.4121254Z ################################################################################ 2025-05-07T19:49:16.4121493Z 2025-05-07T19:49:16.4134126Z [EXEC] [ATTEMPT 0/3] + wget -q --timeout 1 pypi.org -O /dev/null 2025-05-07T19:49:16.4987554Z [CHECK] Network does not appear to be blocked. 2025-05-07T19:49:16.5007122Z [BUILD] Running git submodules update ... 2025-05-07T19:49:16.5040490Z [EXEC] [ATTEMPT 0/3] + git submodule sync 2025-05-07T19:49:16.5335464Z Synchronizing submodule url for '../external/asmjit' 2025-05-07T19:49:16.5336033Z Synchronizing submodule url for '../external/composable_kernel' 2025-05-07T19:49:16.5336529Z Synchronizing submodule url for '../external/cpuinfo' 2025-05-07T19:49:16.5336964Z Synchronizing submodule url for '../external/cutlass' 2025-05-07T19:49:16.5337402Z Synchronizing submodule url for '../external/googletest' 2025-05-07T19:49:16.5337859Z Synchronizing submodule url for '../external/hipify_torch' 2025-05-07T19:49:16.5338330Z Synchronizing submodule url for '../external/json' 2025-05-07T19:49:16.5373255Z [EXEC] [ATTEMPT 0/3] + git submodule update --init --recursive 2025-05-07T19:49:16.5800284Z [BUILD] Installing other build dependencies ... 2025-05-07T19:49:16.5831833Z [EXEC] [ATTEMPT 0/3] + conda run --no-capture-output -n build_binary python -m pip install -r requirements.txt 2025-05-07T19:49:18.6832425Z Collecting backports.tarfile (from -r requirements.txt (line 13)) 2025-05-07T19:49:18.7000069Z Downloading backports.tarfile-1.2.0-py3-none-any.whl.metadata (2.0 kB) 2025-05-07T19:49:18.7085393Z Requirement already satisfied: build in /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages (from -r requirements.txt (line 14)) (1.2.2.post1) 2025-05-07T19:49:18.8113189Z Collecting cmake (from -r requirements.txt (line 15)) 2025-05-07T19:49:18.8138727Z Downloading cmake-4.0.0-py3-none-manylinux_2_17_x86_64.manylinux2014_x86_64.whl.metadata (6.3 kB) 2025-05-07T19:49:18.8229186Z Requirement already satisfied: click in /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages (from -r requirements.txt (line 16)) (8.1.8) 2025-05-07T19:49:18.8230596Z Requirement already satisfied: hypothesis in /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages (from -r requirements.txt (line 17)) (6.131.14) 2025-05-07T19:49:18.8232467Z Requirement already satisfied: jinja2 in /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages (from -r requirements.txt (line 18)) (3.1.6) 2025-05-07T19:49:18.8234790Z Requirement already satisfied: mpmath==1.3.0 in /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages (from -r requirements.txt (line 19)) (1.3.0) 2025-05-07T19:49:18.8498563Z Collecting ninja (from -r requirements.txt (line 20)) 2025-05-07T19:49:18.8528096Z Downloading ninja-1.11.1.4-py3-none-manylinux_2_12_x86_64.manylinux2010_x86_64.whl.metadata (5.0 kB) 2025-05-07T19:49:18.8603228Z Requirement already satisfied: numpy>=2.0.2 in /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages (from -r requirements.txt (line 21)) (2.2.5) 2025-05-07T19:49:18.8737204Z Collecting pyre-extensions (from -r requirements.txt (line 22)) 2025-05-07T19:49:18.8767178Z Downloading pyre_extensions-0.0.32-py3-none-any.whl.metadata (4.0 kB) 2025-05-07T19:49:18.8830882Z Requirement already satisfied: pyyaml in /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages (from -r requirements.txt (line 23)) (6.0.2) 2025-05-07T19:49:18.8832223Z Requirement already satisfied: scikit-build in /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages (from -r requirements.txt (line 24)) (0.18.1) 2025-05-07T19:49:18.8839884Z Requirement already satisfied: setuptools in /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages (from -r requirements.txt (line 25)) (78.1.1) 2025-05-07T19:49:18.9028682Z Collecting setuptools_git_versioning (from -r requirements.txt (line 26)) 2025-05-07T19:49:18.9110042Z Downloading setuptools_git_versioning-2.1.0-py3-none-any.whl.metadata (6.1 kB) 2025-05-07T19:49:18.9287557Z Collecting tabulate (from -r requirements.txt (line 27)) 2025-05-07T19:49:18.9328261Z Downloading tabulate-0.9.0-py3-none-any.whl.metadata (34 kB) 2025-05-07T19:49:18.9560812Z Collecting patchelf (from -r requirements.txt (line 28)) 2025-05-07T19:49:18.9605726Z Downloading patchelf-0.17.2.2-py3-none-manylinux1_x86_64.manylinux_2_5_x86_64.musllinux_1_1_x86_64.whl.metadata (3.5 kB) 2025-05-07T19:49:18.9696454Z Requirement already satisfied: packaging>=19.1 in /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages (from build->-r requirements.txt (line 14)) (25.0) 2025-05-07T19:49:18.9700859Z Requirement already satisfied: pyproject_hooks in /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages (from build->-r requirements.txt (line 14)) (1.2.0) 2025-05-07T19:49:18.9748311Z Requirement already satisfied: attrs>=22.2.0 in /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages (from hypothesis->-r requirements.txt (line 17)) (25.3.0) 2025-05-07T19:49:18.9750446Z Requirement already satisfied: sortedcontainers<3.0.0,>=2.1.0 in /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages (from hypothesis->-r requirements.txt (line 17)) (2.4.0) 2025-05-07T19:49:18.9799124Z Requirement already satisfied: MarkupSafe>=2.0 in /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages (from jinja2->-r requirements.txt (line 18)) (3.0.2) 2025-05-07T19:49:18.9923841Z Collecting typing-inspect (from pyre-extensions->-r requirements.txt (line 22)) 2025-05-07T19:49:18.9951181Z Downloading typing_inspect-0.9.0-py3-none-any.whl.metadata (1.5 kB) 2025-05-07T19:49:19.0027948Z Requirement already satisfied: typing-extensions in /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages (from pyre-extensions->-r requirements.txt (line 22)) (4.13.2) 2025-05-07T19:49:19.0038167Z Requirement already satisfied: distro in /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages (from scikit-build->-r requirements.txt (line 24)) (1.9.0) 2025-05-07T19:49:19.0050516Z Requirement already satisfied: wheel>=0.32.0 in /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages (from scikit-build->-r requirements.txt (line 24)) (0.45.1) 2025-05-07T19:49:19.0371066Z Collecting mypy-extensions>=0.3.0 (from typing-inspect->pyre-extensions->-r requirements.txt (line 22)) 2025-05-07T19:49:19.0400156Z Downloading mypy_extensions-1.1.0-py3-none-any.whl.metadata (1.1 kB) 2025-05-07T19:49:19.0499128Z Downloading backports.tarfile-1.2.0-py3-none-any.whl (30 kB) 2025-05-07T19:49:19.0630063Z Downloading cmake-4.0.0-py3-none-manylinux_2_17_x86_64.manylinux2014_x86_64.whl (27.9 MB) 2025-05-07T19:49:19.2531050Z ━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━ 27.9/27.9 MB 148.3 MB/s eta 0:00:00 2025-05-07T19:49:19.2563183Z Downloading ninja-1.11.1.4-py3-none-manylinux_2_12_x86_64.manylinux2010_x86_64.whl (422 kB) 2025-05-07T19:49:19.2648741Z Downloading pyre_extensions-0.0.32-py3-none-any.whl (12 kB) 2025-05-07T19:49:19.2722174Z Downloading setuptools_git_versioning-2.1.0-py3-none-any.whl (10 kB) 2025-05-07T19:49:19.2778663Z Downloading tabulate-0.9.0-py3-none-any.whl (35 kB) 2025-05-07T19:49:19.2861915Z Downloading patchelf-0.17.2.2-py3-none-manylinux1_x86_64.manylinux_2_5_x86_64.musllinux_1_1_x86_64.whl (466 kB) 2025-05-07T19:49:19.2959179Z Downloading typing_inspect-0.9.0-py3-none-any.whl (8.8 kB) 2025-05-07T19:49:19.3015727Z Downloading mypy_extensions-1.1.0-py3-none-any.whl (5.0 kB) 2025-05-07T19:49:19.4537934Z Installing collected packages: tabulate, setuptools_git_versioning, patchelf, ninja, mypy-extensions, cmake, backports.tarfile, typing-inspect, pyre-extensions 2025-05-07T19:49:20.2833494Z 2025-05-07T19:49:20.2883034Z Successfully installed backports.tarfile-1.2.0 cmake-4.0.0 mypy-extensions-1.1.0 ninja-1.11.1.4 patchelf-0.17.2.2 pyre-extensions-0.0.32 setuptools_git_versioning-2.1.0 tabulate-0.9.0 typing-inspect-0.9.0 2025-05-07T19:49:20.2885325Z WARNING: Running pip as the 'root' user can result in broken permissions and conflicting behaviour with the system package manager, possibly rendering your system unusable. It is recommended to use a virtual environment instead: https://pip.pypa.io/warnings/venv. Use the --root-user-action option if you know what you are doing and want to suppress this warning. 2025-05-07T19:49:20.4345580Z ################################################################################ 2025-05-07T19:49:20.4346057Z # Install PyTorch (PyTorch PIP) 2025-05-07T19:49:20.4346377Z # 2025-05-07T19:49:20.4370716Z # [2025-05-07T19:49:20.436Z] + install_triton_pip build_binary 2025-05-07T19:49:20.4371296Z ################################################################################ 2025-05-07T19:49:20.4371565Z 2025-05-07T19:49:20.4371815Z [BUILD] Installing pytorch-triton nightly/3.2.0+git4b3bb1f8 from PIP ... 2025-05-07T19:49:20.4372290Z ################################################################################ 2025-05-07T19:49:20.4372705Z # Install Package From PyTorch PIP: pytorch-triton 2025-05-07T19:49:20.4373092Z # 2025-05-07T19:49:20.4389123Z # [2025-05-07T19:49:20.438Z] + install_from_pytorch_pip build_binary pytorch-triton nightly/3.2.0+git4b3bb1f8 2025-05-07T19:49:20.4389778Z ################################################################################ 2025-05-07T19:49:20.4390033Z 2025-05-07T19:49:20.4408156Z [EXEC] [ATTEMPT 0/3] + wget -q --timeout 1 pypi.org -O /dev/null 2025-05-07T19:49:20.5231969Z [CHECK] Network does not appear to be blocked. 2025-05-07T19:49:20.5232386Z ################################################################################ 2025-05-07T19:49:20.5232800Z # Prepare PIP Arguments (PyTorch PIP) 2025-05-07T19:49:20.5233142Z # 2025-05-07T19:49:20.5249685Z # [2025-05-07T19:49:20.524Z] + __prepare_pip_arguments pytorch-triton nightly/3.2.0+git4b3bb1f8 2025-05-07T19:49:20.5251586Z ################################################################################ 2025-05-07T19:49:20.5252262Z 2025-05-07T19:49:20.5314370Z [INSTALL] Extracted package (channel, version): (nightly, 3.2.0+git4b3bb1f8) 2025-05-07T19:49:20.5328971Z [INSTALL] Using a non-RELEASE channel: nightly ... 2025-05-07T19:49:20.5330379Z [INSTALL] Extracted the full PIP channel: https://download.pytorch.org/whl/nightly/ 2025-05-07T19:49:20.5333856Z [INSTALL] Extracted the full PIP package: --pre pytorch-triton==3.2.0+git4b3bb1f8 2025-05-07T19:49:20.5340324Z [INSTALL] Attempting to install [pytorch-triton, 3.2.0+git4b3bb1f8] from PyTorch PIP using channel https://download.pytorch.org/whl/nightly/ ... 2025-05-07T19:49:20.5366325Z [EXEC] [ATTEMPT 0/3] + conda run -n build_binary pip install --pre pytorch-triton==3.2.0+git4b3bb1f8 --index-url https://download.pytorch.org/whl/nightly/ 2025-05-07T19:49:26.0276048Z ERROR: pip's dependency resolver does not currently take into account all the packages that are installed. This behaviour is the source of the following dependency conflicts. 2025-05-07T19:49:26.0277110Z Looking in indexes: https://download.pytorch.org/whl/nightly/ 2025-05-07T19:49:26.0277569Z Collecting pytorch-triton==3.2.0+git4b3bb1f8 2025-05-07T19:49:26.0278773Z Downloading https://download.pytorch.org/whl/nightly/pytorch_triton-3.2.0%2Bgit4b3bb1f8-cp313-cp313-manylinux_2_27_x86_64.manylinux_2_28_x86_64.whl.metadata (1.3 kB) 2025-05-07T19:49:26.0280164Z Downloading https://download.pytorch.org/whl/nightly/pytorch_triton-3.2.0%2Bgit4b3bb1f8-cp313-cp313-manylinux_2_27_x86_64.manylinux_2_28_x86_64.whl (166.5 MB) 2025-05-07T19:49:26.0281405Z ━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━ 166.5/166.5 MB 179.1 MB/s eta 0:00:00 2025-05-07T19:49:26.0281848Z Installing collected packages: pytorch-triton 2025-05-07T19:49:26.0282221Z Attempting uninstall: pytorch-triton 2025-05-07T19:49:26.0282663Z Found existing installation: pytorch-triton 3.3.0+git96316ce5 2025-05-07T19:49:26.0283123Z Uninstalling pytorch-triton-3.3.0+git96316ce5: 2025-05-07T19:49:26.0283602Z Successfully uninstalled pytorch-triton-3.3.0+git96316ce5 2025-05-07T19:49:26.0284082Z Successfully installed pytorch-triton-3.2.0+git4b3bb1f8 2025-05-07T19:49:26.0284398Z 2025-05-07T19:49:26.0285121Z torch 2.8.0.dev20250507+cu126 requires pytorch-triton==3.3.0+git96316ce5; platform_system == "Linux" and platform_machine == "x86_64", but you have pytorch-triton 3.2.0+git4b3bb1f8 which is incompatible. 2025-05-07T19:49:26.0287292Z WARNING: Running pip as the 'root' user can result in broken permissions and conflicting behaviour with the system package manager, possibly rendering your system unusable. It is recommended to use a virtual environment instead: https://pip.pypa.io/warnings/venv. Use the --root-user-action option if you know what you are doing and want to suppress this warning. 2025-05-07T19:49:26.0288823Z 2025-05-07T19:49:28.1653165Z [CHECK] Python (sub-)package 'triton' found ... 2025-05-07T19:49:28.1653650Z [CHECK] Printing out the pytorch-triton version ... 2025-05-07T19:49:30.2254064Z ################################################################################ 2025-05-07T19:49:30.2254553Z [CHECK] The installed VERSION of pytorch-triton is: 3.2.0 2025-05-07T19:49:30.2254963Z ################################################################################ 2025-05-07T19:49:30.2255208Z 2025-05-07T19:49:32.2116279Z [CHECK] Python (sub-)package 'numpy' found ... 2025-05-07T19:49:34.2978952Z [CHECK] Python (sub-)package 'skbuild' found ... 2025-05-07T19:49:34.2984573Z [BUILD] Successfully ran git submodules update 2025-05-07T19:49:34.3060650Z ##[group]Run . $PRELUDE; cd fbgemm_gpu; build_fbgemm_gpu_package $BUILD_ENV nightly default/cuda 2025-05-07T19:49:34.3061379Z . $PRELUDE; cd fbgemm_gpu; build_fbgemm_gpu_package $BUILD_ENV nightly default/cuda 2025-05-07T19:49:34.3062073Z shell: bash --noprofile --norc -e -o pipefail {0} 2025-05-07T19:49:34.3062412Z env: 2025-05-07T19:49:34.3062641Z PRELUDE: .github/scripts/setup_env.bash 2025-05-07T19:49:34.3063069Z BUILD_ENV: build_binary 2025-05-07T19:49:34.3063308Z BUILD_TARGET: default 2025-05-07T19:49:34.3063543Z BUILD_VARIANT: cuda 2025-05-07T19:49:34.3063786Z BUILD_CUDA_VERSION: 12.6.3 2025-05-07T19:49:34.3064138Z ##[endgroup] 2025-05-07T19:49:34.7532613Z [BUILD] BUILD_TARGET_VARIANT: default/cuda 2025-05-07T19:49:34.7533649Z [BUILD] Extracted build target: default 2025-05-07T19:49:34.7534553Z [BUILD] Extracted build variant: cuda 2025-05-07T19:49:36.5611534Z /github/home/miniconda/envs/build_binary/bin/cc 2025-05-07T19:49:36.5612322Z 2025-05-07T19:49:36.6358025Z [CHECK] Binary cc found in PATH 2025-05-07T19:49:38.4587893Z /github/home/miniconda/envs/build_binary/bin/gcc 2025-05-07T19:49:38.4588685Z 2025-05-07T19:49:38.5174286Z [CHECK] Binary gcc found in PATH 2025-05-07T19:49:40.3224691Z /github/home/miniconda/envs/build_binary/bin/c++ 2025-05-07T19:49:40.3224985Z 2025-05-07T19:49:40.4054357Z [CHECK] Binary c++ found in PATH 2025-05-07T19:49:42.2212469Z /github/home/miniconda/envs/build_binary/bin/g++ 2025-05-07T19:49:42.2212772Z 2025-05-07T19:49:42.2952744Z [CHECK] Binary g++ found in PATH 2025-05-07T19:49:44.1891958Z [BUILD] Extracted and set Python tag: py313 2025-05-07T19:49:44.1892450Z [BUILD] Extracted and set Python platform name: manylinux_2_28_x86_64 2025-05-07T19:49:44.2130541Z core = 24 2025-05-07T19:49:44.2357215Z sockets = 2 2025-05-07T19:49:44.2358231Z [BUILD] Set multicore run option for setup.py: -j 48 2025-05-07T19:49:44.2359300Z [CHECK] LD_LIBRARY_PATH = 2025-05-07T19:49:44.2360101Z [BUILD] Running pre-build cleanups ... 2025-05-07T19:49:44.2360958Z + rm -rf dist 2025-05-07T19:49:44.2361308Z 2025-05-07T19:49:44.2372547Z 2025-05-07T19:49:44.2373022Z + conda run --no-capture-output -n build_binary python setup.py clean 2025-05-07T19:49:44.2373365Z 2025-05-07T19:49:47.3007186Z INFO:root:running clean 2025-05-07T19:49:47.3008096Z [SETUP.PY] ARGV: ['setup.py', 'clean'] 2025-05-07T19:49:47.3012834Z [SETUP.PY] Parsed setup.py arguments: Namespace(verbose=False, debug=False, dryrun=False, build_target='default', build_variant='cuda', package_channel='nightly', nvml_lib_path=None, nccl_lib_path=None, use_fb_only=False, cxxprefix=None) 2025-05-07T19:49:47.3014628Z [SETUP.PY] Other arguments: ['clean'] 2025-05-07T19:49:47.3015337Z [SETUP.PY] CUDA CUB directory environment variable not set. Using default CUB location. 2025-05-07T19:49:47.3016123Z [SETUP.PY] Using CUDA = /github/home/miniconda/envs/build_binary 2025-05-07T19:49:47.3016976Z [SETUP.PY] Generating version file at: /__w/FBGEMM/FBGEMM/fbgemm_gpu/fbgemm_gpu/docs/version.py 2025-05-07T19:49:47.3017697Z [SETUP.PY] Setting the FBGEMM build target: default ... 2025-05-07T19:49:47.3018224Z [SETUP.PY] Setting the FBGEMM build variant: cuda ... 2025-05-07T19:49:47.3019777Z [SETUP.PY] Passing CMake arguments: ['-DCMAKE_PREFIX_PATH=/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch', '-D_GLIBCXX_USE_CXX11_ABI=1', '-DFBGEMM_BUILD_TARGET=default', '-DFBGEMM_BUILD_VARIANT=cuda', "-DCMAKE_C_FLAGS=''", "-DCMAKE_CXX_FLAGS=''"] 2025-05-07T19:49:47.6288810Z 2025-05-07T19:49:47.6289637Z [BUILD] Printing git status ... 2025-05-07T19:49:47.6290717Z + git status 2025-05-07T19:49:47.6291077Z 2025-05-07T19:49:48.1012903Z HEAD detached at pull/4066/merge 2025-05-07T19:49:48.1013252Z Untracked files: 2025-05-07T19:49:48.1013569Z (use "git add ..." to include in what will be committed) 2025-05-07T19:49:48.1013953Z ../build_only/ 2025-05-07T19:49:48.1014199Z ../collect_env.py 2025-05-07T19:49:48.1014460Z fbgemm_gpu/docs/version.py 2025-05-07T19:49:48.1014633Z 2025-05-07T19:49:48.1015232Z nothing added to commit but untracked files present (use "git add" to track) 2025-05-07T19:49:48.1015604Z 2025-05-07T19:49:48.1015689Z + git diff 2025-05-07T19:49:48.1015826Z 2025-05-07T19:49:48.1289917Z 2025-05-07T19:49:48.1291102Z ################################################################################ 2025-05-07T19:49:48.1291557Z # Configure FBGEMM-GPU Build 2025-05-07T19:49:48.1291835Z # 2025-05-07T19:49:48.1308585Z # [2025-05-07T19:49:48.130Z] + __configure_fbgemm_gpu_build 2025-05-07T19:49:48.1309142Z ################################################################################ 2025-05-07T19:49:48.1309381Z 2025-05-07T19:49:48.1312190Z [BUILD] Setting the build target: default ... 2025-05-07T19:49:48.1312676Z [BUILD] Configuring build as CUDA variant (this is the default behavior) ... 2025-05-07T19:49:49.9538571Z /github/home/miniconda/envs/build_binary/bin/nvcc 2025-05-07T19:49:49.9539134Z 2025-05-07T19:49:50.0309055Z [CHECK] Binary nvcc found in PATH 2025-05-07T19:49:51.8419533Z /__w/FBGEMM/FBGEMM/build_only/cudnn/include 2025-05-07T19:49:51.8420261Z 2025-05-07T19:49:51.9001760Z [CHECK] Environment variable CUDNN_INCLUDE_DIR is defined in the Conda environment 2025-05-07T19:49:53.7081277Z /__w/FBGEMM/FBGEMM/build_only/cudnn/lib 2025-05-07T19:49:53.7081982Z 2025-05-07T19:49:53.7657534Z [CHECK] Environment variable CUDNN_LIBRARY is defined in the Conda environment 2025-05-07T19:49:55.5813798Z /github/home/miniconda/envs/build_binary/lib/stubs/libnvidia-ml.so 2025-05-07T19:49:55.5814796Z 2025-05-07T19:49:55.6401937Z [CHECK] Environment variable NVML_LIB_PATH is defined in the Conda environment 2025-05-07T19:49:57.5050832Z [BUILD] Using the default architectures for CUDA nvcc: NVIDIA (R) Cuda compiler driver 2025-05-07T19:49:57.5052360Z Copyright (c) 2005-2024 NVIDIA Corporation 2025-05-07T19:49:57.5053308Z Built on Tue_Oct_29_23:50:19_PDT_2024 2025-05-07T19:49:57.5054254Z Cuda compilation tools, release 12.6, V12.6.85 2025-05-07T19:49:57.5054934Z Build cuda_12.6.r12.6/compiler.35059454_0 ... 2025-05-07T19:49:57.5055324Z [BUILD] Setting the following CUDA targets: 7.0;8.0;9.0;9.0a 2025-05-07T19:49:57.5055729Z [BUILD] Looking up NVML filepath ... 2025-05-07T19:49:59.3760447Z [BUILD] Looking up NCCL filepath ... 2025-05-07T19:50:03.1843175Z [BUILD] Setting NVCC verbose mode ... 2025-05-07T19:50:03.1843667Z + conda env config vars set -n build_binary NVCC_VERBOSE=1 2025-05-07T19:50:03.1843996Z 2025-05-07T19:50:03.5926676Z 2025-05-07T19:50:03.5927108Z [BUILD] Setting CUDA build args ... 2025-05-07T19:50:05.4731081Z [BUILD] Looking up CUDA version ... 2025-05-07T19:50:09.2382406Z + conda run -n build_binary c++ --version | grep -i clang 2025-05-07T19:50:09.2383361Z 2025-05-07T19:50:11.0890761Z 2025-05-07T19:50:11.0891319Z [BUILD] Setting NVCC flags ... 2025-05-07T19:50:11.0893829Z + conda env config vars set -n build_binary NVCC_PREPEND_FLAGS="-std=c++20 -Xcompiler -std=c++20 -ccbin /github/home/miniconda/envs/build_binary/bin/c++ -allow-unsupported-compiler" 2025-05-07T19:50:11.0896069Z 2025-05-07T19:50:11.4999984Z 2025-05-07T19:50:11.5000853Z + conda run -n build_binary printenv NVCC_PREPEND_FLAGS 2025-05-07T19:50:11.5001658Z 2025-05-07T19:50:13.3043364Z -std=c++20 -Xcompiler -std=c++20 -ccbin /github/home/miniconda/envs/build_binary/bin/c++ -allow-unsupported-compiler 2025-05-07T19:50:13.3044922Z 2025-05-07T19:50:13.3736580Z 2025-05-07T19:50:13.3737123Z [BUILD] Setting CUDA build args ... 2025-05-07T19:50:13.3737658Z + conda run -n build_binary c++ --version 2025-05-07T19:50:13.3737880Z 2025-05-07T19:50:15.1864190Z c++ (conda-forge gcc 11.4.0-13) 11.4.0 2025-05-07T19:50:15.1864626Z Copyright (C) 2021 Free Software Foundation, Inc. 2025-05-07T19:50:15.1865109Z This is free software; see the source for copying conditions. There is NO 2025-05-07T19:50:15.1865677Z warranty; not even for MERCHANTABILITY or FITNESS FOR A PARTICULAR PURPOSE. 2025-05-07T19:50:15.1866066Z 2025-05-07T19:50:15.1866071Z 2025-05-07T19:50:15.2630162Z 2025-05-07T19:50:15.2632099Z + conda run -n build_binary c++ --version | grep -i clang 2025-05-07T19:50:15.2633031Z 2025-05-07T19:50:17.1514703Z .github/scripts/fbgemm_gpu_build.bash: line 370: [: : integer expression expected 2025-05-07T19:50:17.1515265Z 2025-05-07T19:50:17.1515613Z [BUILD] Enabling debug features in the build ... 2025-05-07T19:50:17.1518055Z [BUILD] FBGEMM_GPU build arguments have been set: --verbose --build-target=default --build-variant=cuda --nvml_lib_path=/github/home/miniconda/envs/build_binary/lib/stubs/libnvidia-ml.so --nccl_lib_path=/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/lib/libnccl.so.2 -DTORCH_CUDA_ARCH_LIST='7.0;8.0;9.0;9.0a' -DCUDA_TOOLKIT_ROOT_DIR=/github/home/miniconda/envs/build_binary/targets/x86_64-linux -DCUDAToolkit_ROOT=/github/home/miniconda/envs/build_binary/targets/x86_64-linux -DCMAKE_CXX_STANDARD=20 --debug 2025-05-07T19:50:17.1520949Z ################################################################################ 2025-05-07T19:50:17.1521283Z # Build FBGEMM-GPU Package (Wheel) 2025-05-07T19:50:17.1521561Z # 2025-05-07T19:50:17.1533941Z # [2025-05-07T19:50:17.152Z] + build_fbgemm_gpu_package build_binary nightly default/cuda 2025-05-07T19:50:17.1535362Z ################################################################################ 2025-05-07T19:50:17.1536011Z 2025-05-07T19:50:17.1536583Z [BUILD] Building FBGEMM wheel (TARGET=default, VARIANT=cuda) ... 2025-05-07T19:50:17.1541758Z + conda run --no-capture-output -n build_binary python -m build --wheel --no-isolation --config-setting=--build-option=--verbose --config-setting=--build-option=--build-target=default --config-setting=--build-option=--build-variant=cuda --config-setting=--build-option=--nvml_lib_path=/github/home/miniconda/envs/build_binary/lib/stubs/libnvidia-ml.so --config-setting=--build-option=--nccl_lib_path=/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/lib/libnccl.so.2 --config-setting=--build-option=-DTORCH_CUDA_ARCH_LIST='7.0;8.0;9.0;9.0a' --config-setting=--build-option=-DCUDA_TOOLKIT_ROOT_DIR=/github/home/miniconda/envs/build_binary/targets/x86_64-linux --config-setting=--build-option=-DCUDAToolkit_ROOT=/github/home/miniconda/envs/build_binary/targets/x86_64-linux --config-setting=--build-option=-DCMAKE_CXX_STANDARD=20 --config-setting=--build-option=--debug --config-setting=--build-option=--package_channel=nightly --config-setting=--build-option=--python-tag=py313 --config-setting=--build-option=--plat-name=manylinux_2_28_x86_64 2025-05-07T19:50:17.1546089Z 2025-05-07T19:50:18.9968716Z * Getting build dependencies for wheel... 2025-05-07T19:50:20.2829777Z INFO:root:running egg_info 2025-05-07T19:50:20.2868534Z INFO:root:creating fbgemm_gpu_nightly.egg-info 2025-05-07T19:50:20.2869808Z INFO:root:writing fbgemm_gpu_nightly.egg-info/PKG-INFO 2025-05-07T19:50:20.2871405Z INFO:root:writing dependency_links to fbgemm_gpu_nightly.egg-info/dependency_links.txt 2025-05-07T19:50:20.2873256Z INFO:root:writing requirements to fbgemm_gpu_nightly.egg-info/requires.txt 2025-05-07T19:50:20.2874987Z INFO:root:writing top-level names to fbgemm_gpu_nightly.egg-info/top_level.txt 2025-05-07T19:50:20.2875702Z INFO:root:writing manifest file 'fbgemm_gpu_nightly.egg-info/SOURCES.txt' 2025-05-07T19:50:20.2936829Z INFO:root:reading manifest file 'fbgemm_gpu_nightly.egg-info/SOURCES.txt' 2025-05-07T19:50:20.2950858Z INFO:root:writing manifest file 'fbgemm_gpu_nightly.egg-info/SOURCES.txt' 2025-05-07T19:50:20.2951905Z [SETUP.PY] ARGV: ['setup.py', 'egg_info'] 2025-05-07T19:50:20.2953056Z [SETUP.PY] Parsed setup.py arguments: Namespace(verbose=False, debug=False, dryrun=False, build_target='default', build_variant='cuda', package_channel='nightly', nvml_lib_path=None, nccl_lib_path=None, use_fb_only=False, cxxprefix=None) 2025-05-07T19:50:20.2954134Z [SETUP.PY] Other arguments: ['egg_info'] 2025-05-07T19:50:20.2954652Z [SETUP.PY] CUDA CUB directory environment variable not set. Using default CUB location. 2025-05-07T19:50:20.2955356Z [SETUP.PY] Using CUDA = /github/home/miniconda/envs/build_binary 2025-05-07T19:50:20.2956400Z [SETUP.PY] Generating version file at: /__w/FBGEMM/FBGEMM/fbgemm_gpu/fbgemm_gpu/docs/version.py 2025-05-07T19:50:20.2956986Z [SETUP.PY] Setting the FBGEMM build target: default ... 2025-05-07T19:50:20.2957418Z [SETUP.PY] Setting the FBGEMM build variant: cuda ... 2025-05-07T19:50:20.2958687Z [SETUP.PY] Passing CMake arguments: ['-DCMAKE_PREFIX_PATH=/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch', '-D_GLIBCXX_USE_CXX11_ABI=1', '-DFBGEMM_BUILD_TARGET=default', '-DFBGEMM_BUILD_VARIANT=cuda', "-DCMAKE_C_FLAGS=''", "-DCMAKE_CXX_FLAGS=''"] 2025-05-07T19:50:20.5778615Z * Building wheel... 2025-05-07T19:50:21.8536100Z [SETUP.PY] ARGV: ['setup.py', 'bdist_wheel', '--dist-dir', '/__w/FBGEMM/FBGEMM/fbgemm_gpu/dist/.tmp-3xp3vcls', '--verbose', '--build-target=default', '--build-variant=cuda', '--nvml_lib_path=/github/home/miniconda/envs/build_binary/lib/stubs/libnvidia-ml.so', '--nccl_lib_path=/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/lib/libnccl.so.2', '-DTORCH_CUDA_ARCH_LIST=7.0;8.0;9.0;9.0a', '-DCUDA_TOOLKIT_ROOT_DIR=/github/home/miniconda/envs/build_binary/targets/x86_64-linux', '-DCUDAToolkit_ROOT=/github/home/miniconda/envs/build_binary/targets/x86_64-linux', '-DCMAKE_CXX_STANDARD=20', '--debug', '--package_channel=nightly', '--python-tag=py313', '--plat-name=manylinux_2_28_x86_64'] 2025-05-07T19:50:21.8540538Z [SETUP.PY] Parsed setup.py arguments: Namespace(verbose=True, debug=True, dryrun=False, build_target='default', build_variant='cuda', package_channel='nightly', nvml_lib_path='/github/home/miniconda/envs/build_binary/lib/stubs/libnvidia-ml.so', nccl_lib_path='/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/lib/libnccl.so.2', use_fb_only=False, cxxprefix=None) 2025-05-07T19:50:21.8543413Z [SETUP.PY] Other arguments: ['bdist_wheel', '--dist-dir', '/__w/FBGEMM/FBGEMM/fbgemm_gpu/dist/.tmp-3xp3vcls', '-DTORCH_CUDA_ARCH_LIST=7.0;8.0;9.0;9.0a', '-DCUDA_TOOLKIT_ROOT_DIR=/github/home/miniconda/envs/build_binary/targets/x86_64-linux', '-DCUDAToolkit_ROOT=/github/home/miniconda/envs/build_binary/targets/x86_64-linux', '-DCMAKE_CXX_STANDARD=20', '--python-tag=py313', '--plat-name=manylinux_2_28_x86_64'] 2025-05-07T19:50:21.8545123Z [SETUP.PY] CUDA CUB directory environment variable not set. Using default CUB location. 2025-05-07T19:50:21.8545673Z [SETUP.PY] Using CUDA = /github/home/miniconda/envs/build_binary 2025-05-07T19:50:21.8546213Z [SETUP.PY] Generating version file at: /__w/FBGEMM/FBGEMM/fbgemm_gpu/fbgemm_gpu/docs/version.py 2025-05-07T19:50:21.8546744Z [SETUP.PY] Setting the FBGEMM build target: default ... 2025-05-07T19:50:21.8547119Z [SETUP.PY] Setting the FBGEMM build variant: cuda ... 2025-05-07T19:50:21.8551603Z [SETUP.PY] Passing CMake arguments: ['-DCMAKE_PREFIX_PATH=/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch', '-D_GLIBCXX_USE_CXX11_ABI=1', '-DCMAKE_VERBOSE_MAKEFILE=ON', '-DCMAKE_EXPORT_COMPILE_COMMANDS=TRUE', '-DFBGEMM_BUILD_TARGET=default', '-DFBGEMM_BUILD_VARIANT=cuda', '-DNVML_LIB_PATH=/github/home/miniconda/envs/build_binary/lib/stubs/libnvidia-ml.so', '-DNCCL_INCLUDE_DIRS=/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include', '-DNCCL_LIBRARIES=/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/lib/libnccl.so.2', "-DCMAKE_C_FLAGS='-DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/lib'", "-DCMAKE_CXX_FLAGS='-DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/lib'", '-DTORCH_CUDA_ARCH_LIST=7.0;8.0;9.0;9.0a', '-DCUDA_TOOLKIT_ROOT_DIR=/github/home/miniconda/envs/build_binary/targets/x86_64-linux', '-DCUDAToolkit_ROOT=/github/home/miniconda/envs/build_binary/targets/x86_64-linux', '-DCMAKE_CXX_STANDARD=20'] 2025-05-07T19:50:21.8556085Z 2025-05-07T19:50:21.8556090Z 2025-05-07T19:50:21.8556399Z -------------------------------------------------------------------------------- 2025-05-07T19:50:21.8556776Z -- Trying 'Ninja' generator 2025-05-07T19:50:21.8557024Z -------------------------------- 2025-05-07T19:50:21.8557286Z --------------------------- 2025-05-07T19:50:21.8557516Z ---------------------- 2025-05-07T19:50:21.8557742Z ----------------- 2025-05-07T19:50:21.8557943Z ------------ 2025-05-07T19:50:21.8558149Z ------- 2025-05-07T19:50:21.8558328Z -- 2025-05-07T19:50:21.8941043Z CMake Deprecation Warning at CMakeLists.txt:1 (cmake_minimum_required): 2025-05-07T19:50:21.8942599Z Not searching for unused variables given on the command line. 2025-05-07T19:50:21.8944143Z Compatibility with CMake < 3.10 will be removed from a future version of 2025-05-07T19:50:21.8945337Z CMake. 2025-05-07T19:50:21.8945649Z 2025-05-07T19:50:21.8946364Z Update the VERSION argument value. Or, use the ... syntax 2025-05-07T19:50:21.8947301Z to tell CMake that the project requires at least but has been updated 2025-05-07T19:50:21.8947754Z to work with policies introduced by or earlier. 2025-05-07T19:50:21.8948006Z 2025-05-07T19:50:21.8948011Z 2025-05-07T19:50:21.9380560Z -- The C compiler identification is GNU 11.4.0 2025-05-07T19:50:21.9453946Z -- Detecting C compiler ABI info 2025-05-07T19:50:22.0336149Z -- Detecting C compiler ABI info - done 2025-05-07T19:50:22.0515034Z -- Check for working C compiler: /github/home/miniconda/envs/build_binary/bin/x86_64-conda-linux-gnu-cc - skipped 2025-05-07T19:50:22.0516133Z -- Detecting C compile features 2025-05-07T19:50:22.0518881Z -- Detecting C compile features - done 2025-05-07T19:50:22.1296414Z -- The CXX compiler identification is GNU 11.4.0 2025-05-07T19:50:22.1368470Z -- Detecting CXX compiler ABI info 2025-05-07T19:50:22.2325353Z -- Detecting CXX compiler ABI info - done 2025-05-07T19:50:22.2514528Z -- Check for working CXX compiler: /github/home/miniconda/envs/build_binary/bin/x86_64-conda-linux-gnu-c++ - skipped 2025-05-07T19:50:22.2516000Z -- Detecting CXX compile features 2025-05-07T19:50:22.2523950Z -- Detecting CXX compile features - done 2025-05-07T19:50:22.2589068Z -- Configuring done (0.4s) 2025-05-07T19:50:22.2635564Z -- Generating done (0.0s) 2025-05-07T19:50:22.2654020Z -- Build files have been written to: /__w/FBGEMM/FBGEMM/fbgemm_gpu/_cmake_test_compile/build 2025-05-07T19:50:22.2689386Z -- 2025-05-07T19:50:22.2690013Z ------- 2025-05-07T19:50:22.2690857Z ------------ 2025-05-07T19:50:22.2691425Z ----------------- 2025-05-07T19:50:22.2692036Z ---------------------- 2025-05-07T19:50:22.2692676Z --------------------------- 2025-05-07T19:50:22.2693389Z -------------------------------- 2025-05-07T19:50:22.2694173Z -- Trying 'Ninja' generator - success 2025-05-07T19:50:22.2695176Z -------------------------------------------------------------------------------- 2025-05-07T19:50:22.2695459Z 2025-05-07T19:50:22.2701359Z Configuring Project 2025-05-07T19:50:22.2701911Z Working directory: 2025-05-07T19:50:22.2702740Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build 2025-05-07T19:50:22.2703205Z Command: 2025-05-07T19:50:22.2721548Z /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/cmake/data/bin/cmake /__w/FBGEMM/FBGEMM/fbgemm_gpu -G Ninja -DCMAKE_MAKE_PROGRAM:FILEPATH=/github/home/miniconda/envs/build_binary/bin/ninja --no-warn-unused-cli -DCMAKE_INSTALL_PREFIX:PATH=/__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-install -DPYTHON_VERSION_STRING:STRING=3.13.2 -DSKBUILD:INTERNAL=TRUE -DCMAKE_MODULE_PATH:PATH=/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/skbuild/resources/cmake -DPYTHON_EXECUTABLE:PATH=/github/home/miniconda/envs/build_binary/bin/python -DPYTHON_INCLUDE_DIR:PATH=/github/home/miniconda/envs/build_binary/include/python3.13 -DPYTHON_LIBRARY:PATH=/github/home/miniconda/envs/build_binary/lib/libpython3.13.so -DPython_EXECUTABLE:PATH=/github/home/miniconda/envs/build_binary/bin/python -DPython_ROOT_DIR:PATH=/github/home/miniconda/envs/build_binary -DPython_FIND_REGISTRY:STRING=NEVER -DPython_INCLUDE_DIR:PATH=/github/home/miniconda/envs/build_binary/include/python3.13 -DPython_NumPy_INCLUDE_DIRS:PATH=/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/numpy/_core/include -DPython3_EXECUTABLE:PATH=/github/home/miniconda/envs/build_binary/bin/python -DPython3_ROOT_DIR:PATH=/github/home/miniconda/envs/build_binary -DPython3_FIND_REGISTRY:STRING=NEVER -DPython3_INCLUDE_DIR:PATH=/github/home/miniconda/envs/build_binary/include/python3.13 -DPython3_NumPy_INCLUDE_DIRS:PATH=/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/numpy/_core/include -DCMAKE_MAKE_PROGRAM:FILEPATH=/github/home/miniconda/envs/build_binary/bin/ninja -DCMAKE_AR=/github/home/miniconda/envs/build_binary/bin/x86_64-conda-linux-gnu-ar -DCMAKE_CXX_COMPILER_AR=/github/home/miniconda/envs/build_binary/bin/x86_64-conda-linux-gnu-gcc-ar -DCMAKE_C_COMPILER_AR=/github/home/miniconda/envs/build_binary/bin/x86_64-conda-linux-gnu-gcc-ar -DCMAKE_RANLIB=/github/home/miniconda/envs/build_binary/bin/x86_64-conda-linux-gnu-ranlib -DCMAKE_CXX_COMPILER_RANLIB=/github/home/miniconda/envs/build_binary/bin/x86_64-conda-linux-gnu-gcc-ranlib -DCMAKE_C_COMPILER_RANLIB=/github/home/miniconda/envs/build_binary/bin/x86_64-conda-linux-gnu-gcc-ranlib -DCMAKE_LINKER=/github/home/miniconda/envs/build_binary/bin/x86_64-conda-linux-gnu-ld -DCMAKE_STRIP=/github/home/miniconda/envs/build_binary/bin/x86_64-conda-linux-gnu-strip -DCMAKE_BUILD_TYPE=Release -DCMAKE_PREFIX_PATH=/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch -D_GLIBCXX_USE_CXX11_ABI=1 -DCMAKE_VERBOSE_MAKEFILE=ON -DCMAKE_EXPORT_COMPILE_COMMANDS=TRUE -DFBGEMM_BUILD_TARGET=default -DFBGEMM_BUILD_VARIANT=cuda -DNVML_LIB_PATH=/github/home/miniconda/envs/build_binary/lib/stubs/libnvidia-ml.so -DNCCL_INCLUDE_DIRS=/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -DNCCL_LIBRARIES=/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/lib/libnccl.so.2 '-DCMAKE_C_FLAGS='"'"'-DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/lib'"'"'' '-DCMAKE_CXX_FLAGS='"'"'-DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/lib'"'"'' '-DTORCH_CUDA_ARCH_LIST=7.0;8.0;9.0;9.0a' -DCUDA_TOOLKIT_ROOT_DIR=/github/home/miniconda/envs/build_binary/targets/x86_64-linux -DCUDAToolkit_ROOT=/github/home/miniconda/envs/build_binary/targets/x86_64-linux -DCMAKE_CXX_STANDARD=20 '-DTORCH_CUDA_ARCH_LIST=7.0;8.0;9.0;9.0a' -DCUDA_TOOLKIT_ROOT_DIR=/github/home/miniconda/envs/build_binary/targets/x86_64-linux -DCUDAToolkit_ROOT=/github/home/miniconda/envs/build_binary/targets/x86_64-linux -DCMAKE_CXX_STANDARD=20 -DCMAKE_AR=/github/home/miniconda/envs/build_binary/bin/x86_64-conda-linux-gnu-ar -DCMAKE_CXX_COMPILER_AR=/github/home/miniconda/envs/build_binary/bin/x86_64-conda-linux-gnu-gcc-ar -DCMAKE_C_COMPILER_AR=/github/home/miniconda/envs/build_binary/bin/x86_64-conda-linux-gnu-gcc-ar -DCMAKE_RANLIB=/github/home/miniconda/envs/build_binary/bin/x86_64-conda-linux-gnu-ranlib -DCMAKE_CXX_COMPILER_RANLIB=/github/home/miniconda/envs/build_binary/bin/x86_64-conda-linux-gnu-gcc-ranlib -DCMAKE_C_COMPILER_RANLIB=/github/home/miniconda/envs/build_binary/bin/x86_64-conda-linux-gnu-gcc-ranlib -DCMAKE_LINKER=/github/home/miniconda/envs/build_binary/bin/x86_64-conda-linux-gnu-ld -DCMAKE_STRIP=/github/home/miniconda/envs/build_binary/bin/x86_64-conda-linux-gnu-strip -DCMAKE_BUILD_TYPE=Release 2025-05-07T19:50:22.2741362Z 2025-05-07T19:50:22.3127523Z 2025-05-07T19:50:22.3128278Z Not searching for unused variables given on the command line. 2025-05-07T19:50:22.3129652Z 2025-05-07T19:50:22.3129982Z ================================================================================ 2025-05-07T19:50:22.3131121Z Default C compiler flags 2025-05-07T19:50:22.3132153Z (values may be overridden by CMAKE_CXX_STANDARD and CXX_STANDARD): 2025-05-07T19:50:22.3133064Z 2025-05-07T19:50:22.3134831Z -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/lib 2025-05-07T19:50:22.3136799Z ================================================================================ 2025-05-07T19:50:22.3155873Z 2025-05-07T19:50:22.3155880Z 2025-05-07T19:50:22.3155906Z 2025-05-07T19:50:22.3156070Z ================================================================================ 2025-05-07T19:50:22.3156473Z Default C++ compiler flags 2025-05-07T19:50:22.3156842Z (values may be overridden by CMAKE_CXX_STANDARD and CXX_STANDARD): 2025-05-07T19:50:22.3157127Z 2025-05-07T19:50:22.3157576Z -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/lib 2025-05-07T19:50:22.3158198Z ================================================================================ 2025-05-07T19:50:22.3158843Z 2025-05-07T19:50:22.3158847Z 2025-05-07T19:50:22.3158852Z 2025-05-07T19:50:22.3158972Z ================================================================================ 2025-05-07T19:50:22.3159279Z AVX2_FLAGS: 2025-05-07T19:50:22.3159420Z 2025-05-07T19:50:22.3159503Z -mavx2 2025-05-07T19:50:22.3159709Z -mf16c 2025-05-07T19:50:22.3159890Z -mfma 2025-05-07T19:50:22.3160094Z -fopenmp 2025-05-07T19:50:22.3160314Z ================================================================================ 2025-05-07T19:50:22.3160533Z 2025-05-07T19:50:22.3160555Z 2025-05-07T19:50:22.3160559Z 2025-05-07T19:50:22.3160669Z ================================================================================ 2025-05-07T19:50:22.3160970Z AVX512_FLAGS: 2025-05-07T19:50:22.3161116Z 2025-05-07T19:50:22.3161198Z -mavx2 2025-05-07T19:50:22.3161398Z -mf16c 2025-05-07T19:50:22.3161577Z -mfma 2025-05-07T19:50:22.3161781Z -mavx512f 2025-05-07T19:50:22.3161972Z -mavx512bw 2025-05-07T19:50:22.3162180Z -mavx512dq 2025-05-07T19:50:22.3162373Z -mavx512vl 2025-05-07T19:50:22.3162585Z -fopenmp 2025-05-07T19:50:22.3162805Z ================================================================================ 2025-05-07T19:50:22.3163045Z 2025-05-07T19:50:22.3163049Z 2025-05-07T19:50:22.3163052Z 2025-05-07T19:50:22.3163163Z ================================================================================ 2025-05-07T19:50:22.3163514Z The project is built using scikit-build 2025-05-07T19:50:22.3163827Z ================================================================================ 2025-05-07T19:50:22.3164045Z 2025-05-07T19:50:22.3164049Z 2025-05-07T19:50:22.3164054Z 2025-05-07T19:50:22.3164181Z ================================================================================ 2025-05-07T19:50:22.3164593Z Build Settings 2025-05-07T19:50:22.3164728Z 2025-05-07T19:50:22.3164823Z FBGEMM_BUILD_TARGET : default 2025-05-07T19:50:22.3165085Z FBGEMM_BUILD_VARIANT : cuda 2025-05-07T19:50:22.3165267Z 2025-05-07T19:50:22.3165360Z NVCC_VERBOSE : 2025-05-07T19:50:22.3165617Z CUDNN_INCLUDE_DIR : 2025-05-07T19:50:22.3165847Z CUDNN_LIBRARY : 2025-05-07T19:50:22.3166264Z NVML_LIB_PATH : /github/home/miniconda/envs/build_binary/lib/stubs/libnvidia-ml.so 2025-05-07T19:50:22.3166711Z TORCH_CUDA_ARCH_LIST : 7.0 2025-05-07T19:50:22.3166964Z 8.0 2025-05-07T19:50:22.3167136Z 9.0 2025-05-07T19:50:22.3167320Z 9.0a 2025-05-07T19:50:22.3167425Z 2025-05-07T19:50:22.3167513Z HIP_ROOT_DIR : 2025-05-07T19:50:22.3167762Z HIPCC_VERBOSE : 2025-05-07T19:50:22.3168011Z AMDGPU_TARGETS : 2025-05-07T19:50:22.3168241Z PYTORCH_ROCM_ARCH : 2025-05-07T19:50:22.3168511Z ================================================================================ 2025-05-07T19:50:22.3168719Z 2025-05-07T19:50:22.3893973Z -- The CXX compiler identification is GNU 11.4.0 2025-05-07T19:50:22.4285875Z -- The C compiler identification is GNU 11.4.0 2025-05-07T19:50:23.3633428Z -- The CUDA compiler identification is NVIDIA 12.6.85 with host compiler GNU 11.4.0 2025-05-07T19:50:23.3730528Z -- Detecting CXX compiler ABI info 2025-05-07T19:50:23.4695135Z -- Detecting CXX compiler ABI info - done 2025-05-07T19:50:23.4893772Z -- Check for working CXX compiler: /github/home/miniconda/envs/build_binary/bin/x86_64-conda-linux-gnu-c++ - skipped 2025-05-07T19:50:23.4895637Z -- Detecting CXX compile features 2025-05-07T19:50:23.4902229Z -- Detecting CXX compile features - done 2025-05-07T19:50:23.5022872Z -- Detecting C compiler ABI info 2025-05-07T19:50:23.5905394Z -- Detecting C compiler ABI info - done 2025-05-07T19:50:23.6093925Z -- Check for working C compiler: /github/home/miniconda/envs/build_binary/bin/x86_64-conda-linux-gnu-cc - skipped 2025-05-07T19:50:23.6095225Z -- Detecting C compile features 2025-05-07T19:50:23.6098171Z -- Detecting C compile features - done 2025-05-07T19:50:23.6201465Z -- Detecting CUDA compiler ABI info 2025-05-07T19:50:24.5346507Z -- Detecting CUDA compiler ABI info - done 2025-05-07T19:50:24.5915191Z -- Check for working CUDA compiler: /github/home/miniconda/envs/build_binary/bin/nvcc - skipped 2025-05-07T19:50:24.5936434Z -- Detecting CUDA compile features 2025-05-07T19:50:24.5939741Z -- Detecting CUDA compile features - done 2025-05-07T19:50:24.6016961Z -- Performing Test C_HAS_AVX_1 2025-05-07T19:50:24.8593540Z -- Performing Test C_HAS_AVX_1 - Failed 2025-05-07T19:50:24.8594524Z -- Performing Test C_HAS_AVX_2 2025-05-07T19:50:25.1368624Z -- Performing Test C_HAS_AVX_2 - Success 2025-05-07T19:50:25.1369138Z -- Performing Test C_HAS_AVX2_1 2025-05-07T19:50:25.3962160Z -- Performing Test C_HAS_AVX2_1 - Failed 2025-05-07T19:50:25.3963183Z -- Performing Test C_HAS_AVX2_2 2025-05-07T19:50:25.6693852Z -- Performing Test C_HAS_AVX2_2 - Success 2025-05-07T19:50:25.6694884Z -- Performing Test C_HAS_AVX512_1 2025-05-07T19:50:25.9288337Z -- Performing Test C_HAS_AVX512_1 - Failed 2025-05-07T19:50:25.9289354Z -- Performing Test C_HAS_AVX512_2 2025-05-07T19:50:26.1491668Z -- Performing Test C_HAS_AVX512_2 - Success 2025-05-07T19:50:26.1492711Z -- Performing Test CXX_HAS_AVX_1 2025-05-07T19:50:26.4090687Z -- Performing Test CXX_HAS_AVX_1 - Failed 2025-05-07T19:50:26.4091760Z -- Performing Test CXX_HAS_AVX_2 2025-05-07T19:50:26.6849920Z -- Performing Test CXX_HAS_AVX_2 - Success 2025-05-07T19:50:26.6851212Z -- Performing Test CXX_HAS_AVX2_1 2025-05-07T19:50:26.9438768Z -- Performing Test CXX_HAS_AVX2_1 - Failed 2025-05-07T19:50:26.9439731Z -- Performing Test CXX_HAS_AVX2_2 2025-05-07T19:50:27.2166145Z -- Performing Test CXX_HAS_AVX2_2 - Success 2025-05-07T19:50:27.2166547Z -- Performing Test CXX_HAS_AVX512_1 2025-05-07T19:50:27.4779383Z -- Performing Test CXX_HAS_AVX512_1 - Failed 2025-05-07T19:50:27.4780433Z -- Performing Test CXX_HAS_AVX512_2 2025-05-07T19:50:27.7005162Z -- Performing Test CXX_HAS_AVX512_2 - Success 2025-05-07T19:50:27.7179862Z -- Found CUDA: /github/home/miniconda/envs/build_binary/targets/x86_64-linux (found version "12.6") 2025-05-07T19:50:27.7214169Z -- Found CUDAToolkit: /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include (found version "12.6.85") 2025-05-07T19:50:27.7293434Z -- Performing Test CMAKE_HAVE_LIBC_PTHREAD 2025-05-07T19:50:27.8210976Z -- Performing Test CMAKE_HAVE_LIBC_PTHREAD - Failed 2025-05-07T19:50:27.8212120Z -- Looking for pthread_create in pthreads 2025-05-07T19:50:27.8991854Z -- Looking for pthread_create in pthreads - not found 2025-05-07T19:50:27.8992991Z -- Looking for pthread_create in pthread 2025-05-07T19:50:27.9901219Z -- Looking for pthread_create in pthread - found 2025-05-07T19:50:27.9911358Z -- Found Threads: TRUE 2025-05-07T19:50:28.1513521Z -- PyTorch: CUDA detected: 12.6 2025-05-07T19:50:28.1515053Z -- PyTorch: CUDA nvcc is: /github/home/miniconda/envs/build_binary/targets/x86_64-linux/bin/nvcc 2025-05-07T19:50:28.1517228Z -- PyTorch: CUDA toolkit directory: /github/home/miniconda/envs/build_binary/targets/x86_64-linux 2025-05-07T19:50:28.2727820Z -- PyTorch: Header version is: 12.6 2025-05-07T19:50:28.3716239Z -- Found Python: /github/home/miniconda/envs/build_binary/bin/python (found version "3.13.2") found components: Interpreter 2025-05-07T19:50:28.3733011Z CMake Warning at /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/share/cmake/Caffe2/public/cuda.cmake:140 (message): 2025-05-07T19:50:28.3735220Z -- USE_CUDNN is set to 0. Compiling without cuDNN support 2025-05-07T19:50:28.3735715Z -- USE_CUSPARSELT is set to 0. Compiling without cuSPARSELt support 2025-05-07T19:50:28.3736183Z Failed to compute shorthash for libnvrtc.so 2025-05-07T19:50:28.3736537Z Call Stack (most recent call first): 2025-05-07T19:50:28.3737252Z /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/share/cmake/Caffe2/Caffe2Config.cmake:86 (include) 2025-05-07T19:50:28.3738481Z /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/share/cmake/Torch/TorchConfig.cmake:68 (find_package) 2025-05-07T19:50:28.3739424Z /__w/FBGEMM/FBGEMM/cmake/modules/PyTorchSetup.cmake:14 (find_package) 2025-05-07T19:50:28.3739839Z CMakeLists.txt:112 (include) 2025-05-07T19:50:28.3740125Z 2025-05-07T19:50:28.3740129Z 2025-05-07T19:50:28.3740282Z -- USE_CUDSS is set to 0. Compiling without cuDSS support 2025-05-07T19:50:28.3740698Z -- USE_CUFILE is set to 0. Compiling without cuFile support 2025-05-07T19:50:28.3741477Z -- Added CUDA NVCC flags for: -gencode;arch=compute_70,code=sm_70;-gencode;arch=compute_80,code=sm_80;-gencode;arch=compute_90,code=sm_90;-gencode;arch=compute_90a,code=sm_90a 2025-05-07T19:50:28.4071055Z CMake Warning at /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/share/cmake/Torch/TorchConfig.cmake:22 (message): 2025-05-07T19:50:28.4073537Z static library kineto_LIBRARY-NOTFOUND not found. 2025-05-07T19:50:28.4074563Z Call Stack (most recent call first): 2025-05-07T19:50:28.4076659Z /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/share/cmake/Torch/TorchConfig.cmake:125 (append_torchlib_if_found) 2025-05-07T19:50:28.4077590Z /__w/FBGEMM/FBGEMM/cmake/modules/PyTorchSetup.cmake:14 (find_package) 2025-05-07T19:50:28.4078048Z CMakeLists.txt:112 (include) 2025-05-07T19:50:28.4078228Z 2025-05-07T19:50:28.4078257Z 2025-05-07T19:50:28.4078288Z 2025-05-07T19:50:28.4078292Z 2025-05-07T19:50:28.4078408Z ================================================================================ 2025-05-07T19:50:28.4078749Z PyTorch Flags: 2025-05-07T19:50:28.4078956Z 2025-05-07T19:50:28.4079164Z TORCH_INCLUDE_DIRS: 2025-05-07T19:50:28.4079594Z /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include 2025-05-07T19:50:28.4080398Z /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include 2025-05-07T19:50:28.4080979Z 2025-05-07T19:50:28.4081182Z TORCH_LIBRARIES: 2025-05-07T19:50:28.4081396Z torch 2025-05-07T19:50:28.4081602Z torch_library 2025-05-07T19:50:28.4082038Z /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/lib/libc10.so 2025-05-07T19:50:28.4082725Z /github/home/miniconda/envs/build_binary/targets/x86_64-linux/lib/libnvrtc.so 2025-05-07T19:50:28.4083446Z /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/lib/libc10_cuda.so 2025-05-07T19:50:28.4083963Z 2025-05-07T19:50:28.4084179Z TORCH_CUDA_OPTIONS: 2025-05-07T19:50:28.4084423Z --expt-relaxed-constexpr 2025-05-07T19:50:28.4084712Z -D__CUDA_NO_HALF_OPERATORS__ 2025-05-07T19:50:28.4084991Z -D__CUDA_NO_BFLOAT16_CONVERSIONS__ 2025-05-07T19:50:28.4085302Z -D__CUDA_NO_HALF2_OPERATORS__ 2025-05-07T19:50:28.4085589Z ================================================================================ 2025-05-07T19:50:28.4085835Z 2025-05-07T19:50:28.4086330Z -- Found Torch: /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/lib/libtorch.so 2025-05-07T19:50:28.4086809Z 2025-05-07T19:50:28.4086813Z 2025-05-07T19:50:28.4086922Z ================================================================================ 2025-05-07T19:50:28.4087213Z NCCL Flags 2025-05-07T19:50:28.4087341Z 2025-05-07T19:50:28.4087696Z NCCL_INCLUDE_DIRS=/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include 2025-05-07T19:50:28.4088763Z NCCL_LIBRARIES=/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/lib/libnccl.so.2 2025-05-07T19:50:28.4089360Z ================================================================================ 2025-05-07T19:50:28.4089587Z 2025-05-07T19:50:28.4089591Z 2025-05-07T19:50:28.4089595Z 2025-05-07T19:50:28.4089698Z ================================================================================ 2025-05-07T19:50:28.4090005Z CUDA Driver Path 2025-05-07T19:50:28.4090134Z 2025-05-07T19:50:28.4090778Z CUDA_DRIVER_LIBRARIES=/github/home/miniconda/envs/build_binary/targets/x86_64-linux/lib/stubs/libcuda.so 2025-05-07T19:50:28.4091440Z ================================================================================ 2025-05-07T19:50:28.4091665Z 2025-05-07T19:50:28.4091961Z -- Found NVML_LIB_PATH: /github/home/miniconda/envs/build_binary/lib/stubs/libnvidia-ml.so 2025-05-07T19:50:28.4105632Z 2025-05-07T19:50:28.4105718Z 2025-05-07T19:50:28.4106327Z ================================================================================ 2025-05-07T19:50:28.4106807Z GPU CPP Library Target: asmjit (SHARED) 2025-05-07T19:50:28.4107105Z 2025-05-07T19:50:28.4107323Z CPU_SRCS: 2025-05-07T19:50:28.4107440Z 2025-05-07T19:50:28.4107518Z 2025-05-07T19:50:28.4107784Z GPU_SRCS: 2025-05-07T19:50:28.4107896Z 2025-05-07T19:50:28.4107992Z 2025-05-07T19:50:28.4108181Z CUDA_SPECIFIC_SRCS: 2025-05-07T19:50:28.4108326Z 2025-05-07T19:50:28.4108422Z 2025-05-07T19:50:28.4108609Z HIP_SPECIFIC_SRCS: 2025-05-07T19:50:28.4108755Z 2025-05-07T19:50:28.4108850Z 2025-05-07T19:50:28.4109032Z OTHER_SRCS: 2025-05-07T19:50:28.4109426Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src/asmjit/arm/a64assembler.cpp 2025-05-07T19:50:28.4110043Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src/asmjit/arm/a64builder.cpp 2025-05-07T19:50:28.4110658Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src/asmjit/arm/a64compiler.cpp 2025-05-07T19:50:28.4111299Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src/asmjit/arm/a64emithelper.cpp 2025-05-07T19:50:28.4111910Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src/asmjit/arm/a64formatter.cpp 2025-05-07T19:50:28.4112511Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src/asmjit/arm/a64func.cpp 2025-05-07T19:50:28.4113087Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src/asmjit/arm/a64instapi.cpp 2025-05-07T19:50:28.4113682Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src/asmjit/arm/a64instdb.cpp 2025-05-07T19:50:28.4114367Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src/asmjit/arm/a64operand.cpp 2025-05-07T19:50:28.4114954Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src/asmjit/arm/a64rapass.cpp 2025-05-07T19:50:28.4115546Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src/asmjit/arm/armformatter.cpp 2025-05-07T19:50:28.4116131Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src/asmjit/core/archtraits.cpp 2025-05-07T19:50:28.4116735Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src/asmjit/core/assembler.cpp 2025-05-07T19:50:28.4117303Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src/asmjit/core/builder.cpp 2025-05-07T19:50:28.4117949Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src/asmjit/core/codeholder.cpp 2025-05-07T19:50:28.4118812Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src/asmjit/core/codewriter.cpp 2025-05-07T19:50:28.4119410Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src/asmjit/core/compiler.cpp 2025-05-07T19:50:28.4120054Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src/asmjit/core/constpool.cpp 2025-05-07T19:50:28.4120651Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src/asmjit/core/cpuinfo.cpp 2025-05-07T19:50:28.4121241Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src/asmjit/core/emithelper.cpp 2025-05-07T19:50:28.4121839Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src/asmjit/core/emitter.cpp 2025-05-07T19:50:28.4122428Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src/asmjit/core/emitterutils.cpp 2025-05-07T19:50:28.4123227Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src/asmjit/core/environment.cpp 2025-05-07T19:50:28.4123846Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src/asmjit/core/errorhandler.cpp 2025-05-07T19:50:28.4124462Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src/asmjit/core/formatter.cpp 2025-05-07T19:50:28.4125047Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src/asmjit/core/func.cpp 2025-05-07T19:50:28.4125642Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src/asmjit/core/funcargscontext.cpp 2025-05-07T19:50:28.4126264Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src/asmjit/core/globals.cpp 2025-05-07T19:50:28.4126824Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src/asmjit/core/inst.cpp 2025-05-07T19:50:28.4127396Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src/asmjit/core/instdb.cpp 2025-05-07T19:50:28.4127984Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src/asmjit/core/jitallocator.cpp 2025-05-07T19:50:28.4128915Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src/asmjit/core/jitruntime.cpp 2025-05-07T19:50:28.4129526Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src/asmjit/core/logger.cpp 2025-05-07T19:50:28.4130095Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src/asmjit/core/operand.cpp 2025-05-07T19:50:28.4130772Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src/asmjit/core/osutils.cpp 2025-05-07T19:50:28.4131421Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src/asmjit/core/ralocal.cpp 2025-05-07T19:50:28.4132003Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src/asmjit/core/rapass.cpp 2025-05-07T19:50:28.4132583Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src/asmjit/core/rastack.cpp 2025-05-07T19:50:28.4133146Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src/asmjit/core/string.cpp 2025-05-07T19:50:28.4133726Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src/asmjit/core/support.cpp 2025-05-07T19:50:28.4134287Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src/asmjit/core/target.cpp 2025-05-07T19:50:28.4134873Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src/asmjit/core/type.cpp 2025-05-07T19:50:28.4135431Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src/asmjit/core/virtmem.cpp 2025-05-07T19:50:28.4136003Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src/asmjit/core/zone.cpp 2025-05-07T19:50:28.4136586Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src/asmjit/core/zonehash.cpp 2025-05-07T19:50:28.4137167Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src/asmjit/core/zonelist.cpp 2025-05-07T19:50:28.4137767Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src/asmjit/core/zonestack.cpp 2025-05-07T19:50:28.4138351Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src/asmjit/core/zonetree.cpp 2025-05-07T19:50:28.4138955Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src/asmjit/core/zonevector.cpp 2025-05-07T19:50:28.4139557Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src/asmjit/x86/x86assembler.cpp 2025-05-07T19:50:28.4140168Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src/asmjit/x86/x86builder.cpp 2025-05-07T19:50:28.4140777Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src/asmjit/x86/x86compiler.cpp 2025-05-07T19:50:28.4141378Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src/asmjit/x86/x86emithelper.cpp 2025-05-07T19:50:28.4141998Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src/asmjit/x86/x86formatter.cpp 2025-05-07T19:50:28.4142575Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src/asmjit/x86/x86func.cpp 2025-05-07T19:50:28.4143268Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src/asmjit/x86/x86instapi.cpp 2025-05-07T19:50:28.4143847Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src/asmjit/x86/x86instdb.cpp 2025-05-07T19:50:28.4144411Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src/asmjit/x86/x86operand.cpp 2025-05-07T19:50:28.4144993Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src/asmjit/x86/x86rapass.cpp 2025-05-07T19:50:28.4145404Z 2025-05-07T19:50:28.4145602Z CC_FLAGS: 2025-05-07T19:50:28.4145717Z 2025-05-07T19:50:28.4145793Z 2025-05-07T19:50:28.4145985Z NVCC_FLAGS: 2025-05-07T19:50:28.4146101Z 2025-05-07T19:50:28.4146343Z 2025-05-07T19:50:28.4146545Z HIPCC_FLAGS: 2025-05-07T19:50:28.4146667Z 2025-05-07T19:50:28.4146762Z 2025-05-07T19:50:28.4146946Z INCLUDE_DIRS: 2025-05-07T19:50:28.4147168Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/../include 2025-05-07T19:50:28.4147482Z /__w/FBGEMM/FBGEMM/fbgemm_gpu 2025-05-07T19:50:28.4147753Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include 2025-05-07T19:50:28.4148063Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/../include 2025-05-07T19:50:28.4148557Z /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include 2025-05-07T19:50:28.4149320Z /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include 2025-05-07T19:50:28.4149964Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src 2025-05-07T19:50:28.4150363Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include 2025-05-07T19:50:28.4150796Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include 2025-05-07T19:50:28.4151344Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include 2025-05-07T19:50:28.4151862Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include 2025-05-07T19:50:28.4152331Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include 2025-05-07T19:50:28.4152873Z /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include 2025-05-07T19:50:28.4153374Z 2025-05-07T19:50:28.4153565Z Selected Source Files: 2025-05-07T19:50:28.4153733Z 2025-05-07T19:50:28.4153808Z 2025-05-07T19:50:28.4153992Z HIPified Source Files: 2025-05-07T19:50:28.4154155Z 2025-05-07T19:50:28.4154231Z 2025-05-07T19:50:28.4154419Z Library Dependencies: 2025-05-07T19:50:28.4154657Z torch 2025-05-07T19:50:28.4154855Z torch_library 2025-05-07T19:50:28.4155278Z /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/lib/libc10.so 2025-05-07T19:50:28.4155952Z /github/home/miniconda/envs/build_binary/targets/x86_64-linux/lib/libnvrtc.so 2025-05-07T19:50:28.4156637Z /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/lib/libc10_cuda.so 2025-05-07T19:50:28.4157430Z /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/lib/libnccl.so.2 2025-05-07T19:50:28.4158148Z /github/home/miniconda/envs/build_binary/targets/x86_64-linux/lib/stubs/libcuda.so 2025-05-07T19:50:28.4158731Z /github/home/miniconda/envs/build_binary/lib/stubs/libnvidia-ml.so 2025-05-07T19:50:28.4159138Z 2025-05-07T19:50:28.4159322Z Output Library: 2025-05-07T19:50:28.4159540Z asmjit 2025-05-07T19:50:28.4159717Z 2025-05-07T19:50:28.4159920Z Destination Directory: 2025-05-07T19:50:28.4160146Z fbgemm_gpu 2025-05-07T19:50:28.4160385Z ================================================================================ 2025-05-07T19:50:28.4160608Z 2025-05-07T19:50:28.4160647Z 2025-05-07T19:50:28.4160651Z 2025-05-07T19:50:28.4160775Z ================================================================================ 2025-05-07T19:50:28.4161106Z GPU CPP Library Target: fbgemm (SHARED) 2025-05-07T19:50:28.4161402Z 2025-05-07T19:50:28.4161583Z CPU_SRCS: 2025-05-07T19:50:28.4161713Z 2025-05-07T19:50:28.4161789Z 2025-05-07T19:50:28.4161964Z GPU_SRCS: 2025-05-07T19:50:28.4162092Z 2025-05-07T19:50:28.4162166Z 2025-05-07T19:50:28.4162368Z CUDA_SPECIFIC_SRCS: 2025-05-07T19:50:28.4162505Z 2025-05-07T19:50:28.4162583Z 2025-05-07T19:50:28.4162784Z HIP_SPECIFIC_SRCS: 2025-05-07T19:50:28.4162920Z 2025-05-07T19:50:28.4162993Z 2025-05-07T19:50:28.4163184Z OTHER_SRCS: 2025-05-07T19:50:28.4163446Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/../src/EmbeddingSpMDM.cc 2025-05-07T19:50:28.4163893Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/../src/EmbeddingSpMDMAutovec.cc 2025-05-07T19:50:28.4164342Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/../src/EmbeddingSpMDMNBit.cc 2025-05-07T19:50:28.4164756Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/../src/QuantUtils.cc 2025-05-07T19:50:28.4165161Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/../src/RefImplementations.cc 2025-05-07T19:50:28.4165628Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/../src/RowWiseSparseAdagradFused.cc 2025-05-07T19:50:28.4166158Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/../src/SparseAdagrad.cc 2025-05-07T19:50:28.4166521Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/../src/Utils.cc 2025-05-07T19:50:28.4166917Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/../src/EmbeddingSpMDMAvx2.cc 2025-05-07T19:50:28.4167324Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/../src/QuantUtilsAvx2.cc 2025-05-07T19:50:28.4167749Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/../src/EmbeddingSpMDMAvx2.cc 2025-05-07T19:50:28.4168174Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/../src/QuantUtilsAvx2.cc 2025-05-07T19:50:28.4168591Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/../src/EmbeddingSpMDMAvx512.cc 2025-05-07T19:50:28.4168956Z 2025-05-07T19:50:28.4169135Z CC_FLAGS: 2025-05-07T19:50:28.4169246Z 2025-05-07T19:50:28.4169339Z 2025-05-07T19:50:28.4169520Z NVCC_FLAGS: 2025-05-07T19:50:28.4169650Z 2025-05-07T19:50:28.4169725Z 2025-05-07T19:50:28.4169906Z HIPCC_FLAGS: 2025-05-07T19:50:28.4170045Z 2025-05-07T19:50:28.4170119Z 2025-05-07T19:50:28.4170469Z INCLUDE_DIRS: 2025-05-07T19:50:28.4170893Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/../include 2025-05-07T19:50:28.4171286Z /__w/FBGEMM/FBGEMM/fbgemm_gpu 2025-05-07T19:50:28.4171567Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include 2025-05-07T19:50:28.4171891Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/../include 2025-05-07T19:50:28.4172387Z /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include 2025-05-07T19:50:28.4173186Z /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include 2025-05-07T19:50:28.4173832Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src 2025-05-07T19:50:28.4174259Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include 2025-05-07T19:50:28.4174699Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include 2025-05-07T19:50:28.4175165Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include 2025-05-07T19:50:28.4175696Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include 2025-05-07T19:50:28.4176160Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include 2025-05-07T19:50:28.4176736Z /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include 2025-05-07T19:50:28.4177236Z 2025-05-07T19:50:28.4177446Z Selected Source Files: 2025-05-07T19:50:28.4177597Z 2025-05-07T19:50:28.4177674Z 2025-05-07T19:50:28.4177880Z HIPified Source Files: 2025-05-07T19:50:28.4178030Z 2025-05-07T19:50:28.4178125Z 2025-05-07T19:50:28.4178316Z Library Dependencies: 2025-05-07T19:50:28.4178561Z torch 2025-05-07T19:50:28.4178747Z torch_library 2025-05-07T19:50:28.4179196Z /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/lib/libc10.so 2025-05-07T19:50:28.4179870Z /github/home/miniconda/envs/build_binary/targets/x86_64-linux/lib/libnvrtc.so 2025-05-07T19:50:28.4180575Z /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/lib/libc10_cuda.so 2025-05-07T19:50:28.4181363Z /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/lib/libnccl.so.2 2025-05-07T19:50:28.4182113Z /github/home/miniconda/envs/build_binary/targets/x86_64-linux/lib/stubs/libcuda.so 2025-05-07T19:50:28.4182591Z asmjit 2025-05-07T19:50:28.4183025Z /github/home/miniconda/envs/build_binary/lib/stubs/libnvidia-ml.so 2025-05-07T19:50:28.4183412Z 2025-05-07T19:50:28.4183586Z Output Library: 2025-05-07T19:50:28.4183797Z fbgemm 2025-05-07T19:50:28.4183965Z 2025-05-07T19:50:28.4184160Z Destination Directory: 2025-05-07T19:50:28.4184378Z fbgemm_gpu 2025-05-07T19:50:28.4184603Z ================================================================================ 2025-05-07T19:50:28.4184816Z 2025-05-07T19:50:28.4184820Z 2025-05-07T19:50:28.4184823Z 2025-05-07T19:50:28.4184941Z ================================================================================ 2025-05-07T19:50:28.4185246Z Running code generation script ... 2025-05-07T19:50:28.4185950Z /github/home/miniconda/envs/build_binary/bin/python /__w/FBGEMM/FBGEMM/fbgemm_gpu/codegen/genscript/generate_backward_split.py --opensource 2025-05-07T19:50:28.4186665Z ================================================================================ 2025-05-07T19:50:28.4186973Z 2025-05-07T19:50:28.9574358Z [ARGS PARSE] Parsed arguments: Namespace(install_dir='.', is_fbcode=False, is_rocm=False) 2025-05-07T19:50:28.9576994Z [GENERAATE BACKWARD SPLIT]: ['/__w/FBGEMM/FBGEMM/fbgemm_gpu/codegen/genscript/generate_backward_split.py', '--opensource'] 2025-05-07T19:50:28.9579145Z Written: gen_embedding_backward_dense_split_weighted_vbe_cuda.cu 2025-05-07T19:50:28.9580521Z Written: gen_embedding_backward_dense_split_weighted_cuda.cu 2025-05-07T19:50:28.9581960Z Written: gen_embedding_backward_dense_split_unweighted_nobag_cuda.cu 2025-05-07T19:50:28.9583439Z Written: gen_embedding_backward_dense_split_unweighted_vbe_cuda.cu 2025-05-07T19:50:28.9584863Z Written: gen_embedding_backward_dense_split_unweighted_cuda.cu 2025-05-07T19:50:28.9585548Z Written: gen_embedding_backward_dense_split_weighted_vbe_meta.cpp 2025-05-07T19:50:28.9586048Z Written: gen_embedding_backward_dense_split_weighted_meta.cpp 2025-05-07T19:50:28.9586846Z Written: gen_embedding_backward_dense_split_unweighted_nobag_meta.cpp 2025-05-07T19:50:28.9587371Z Written: gen_embedding_backward_dense_split_unweighted_vbe_meta.cpp 2025-05-07T19:50:28.9588378Z Written: gen_embedding_backward_dense_split_unweighted_meta.cpp 2025-05-07T19:50:28.9588872Z Written: gen_embedding_backward_dense_split_weighted_vbe_kernel_cta.cu 2025-05-07T19:50:28.9589502Z Written: gen_embedding_backward_dense_split_weighted_kernel_cta.cu 2025-05-07T19:50:28.9589997Z Written: gen_embedding_backward_dense_split_unweighted_nobag_kernel_cta.cu 2025-05-07T19:50:28.9590534Z Written: gen_embedding_backward_dense_split_unweighted_vbe_kernel_cta.cu 2025-05-07T19:50:28.9591049Z Written: gen_embedding_backward_dense_split_unweighted_kernel_cta.cu 2025-05-07T19:50:28.9591540Z Written: gen_embedding_backward_dense_split_weighted_vbe_kernel_warp.cu 2025-05-07T19:50:28.9592045Z Written: gen_embedding_backward_dense_split_weighted_kernel_warp.cu 2025-05-07T19:50:28.9592545Z Written: gen_embedding_backward_dense_split_unweighted_nobag_kernel_warp.cu 2025-05-07T19:50:28.9593087Z Written: gen_embedding_backward_dense_split_unweighted_vbe_kernel_warp.cu 2025-05-07T19:50:28.9593591Z Written: gen_embedding_backward_dense_split_unweighted_kernel_warp.cu 2025-05-07T19:50:28.9594073Z Written: gen_embedding_optimizer_dense_split_device_kernel.cuh 2025-05-07T19:50:28.9594487Z Written: gen_embedding_backward_split_dense.cpp 2025-05-07T19:50:28.9594837Z Written: gen_embedding_backward_dense_split_cpu.cpp 2025-05-07T19:50:28.9595246Z Written: gen_embedding_backward_adagrad_split_weighted_cuda.cu 2025-05-07T19:50:28.9595712Z Written: gen_embedding_backward_adagrad_split_unweighted_nobag_cuda.cu 2025-05-07T19:50:28.9596203Z Written: gen_embedding_backward_adagrad_split_unweighted_cuda.cu 2025-05-07T19:50:28.9596645Z Written: gen_embedding_backward_adagrad_split_weighted_meta.cpp 2025-05-07T19:50:28.9597135Z Written: gen_embedding_backward_adagrad_split_unweighted_nobag_meta.cpp 2025-05-07T19:50:28.9597638Z Written: gen_embedding_backward_adagrad_split_unweighted_meta.cpp 2025-05-07T19:50:28.9598110Z Written: gen_embedding_backward_adagrad_split_weighted_kernel_cta.cu 2025-05-07T19:50:28.9598640Z Written: gen_embedding_backward_adagrad_split_unweighted_nobag_kernel_cta.cu 2025-05-07T19:50:28.9599158Z Written: gen_embedding_backward_adagrad_split_unweighted_kernel_cta.cu 2025-05-07T19:50:28.9599668Z Written: gen_embedding_backward_adagrad_split_weighted_kernel_warp.cu 2025-05-07T19:50:28.9600191Z Written: gen_embedding_backward_adagrad_split_unweighted_nobag_kernel_warp.cu 2025-05-07T19:50:28.9600719Z Written: gen_embedding_backward_adagrad_split_unweighted_kernel_warp.cu 2025-05-07T19:50:28.9601212Z Written: gen_embedding_optimizer_adagrad_split_device_kernel.cuh 2025-05-07T19:50:28.9601619Z Written: gen_embedding_backward_split_adagrad.cpp 2025-05-07T19:50:28.9602003Z Written: gen_embedding_split_adagrad_pt2_autograd.cpp 2025-05-07T19:50:28.9602427Z Written: gen_embedding_backward_split_adagrad_pt2_cuda_wrapper.cpp 2025-05-07T19:50:28.9602822Z Written: lookup_adagrad.py 2025-05-07T19:50:28.9603254Z Written: gen_embedding_backward_adagrad_split_cpu.cpp 2025-05-07T19:50:28.9603665Z Written: gen_embedding_backward_split_adagrad_cpu.cpp 2025-05-07T19:50:28.9604084Z Written: gen_embedding_backward_split_adagrad_pt2_cpu_wrapper.cpp 2025-05-07T19:50:28.9604563Z Written: gen_embedding_backward_adam_split_weighted_vbe_cuda.cu 2025-05-07T19:50:28.9605015Z Written: gen_embedding_backward_adam_split_weighted_cuda.cu 2025-05-07T19:50:28.9605465Z Written: gen_embedding_backward_adam_split_unweighted_nobag_cuda.cu 2025-05-07T19:50:28.9605950Z Written: gen_embedding_backward_adam_split_unweighted_vbe_cuda.cu 2025-05-07T19:50:28.9606394Z Written: gen_embedding_backward_adam_split_unweighted_cuda.cu 2025-05-07T19:50:28.9606847Z Written: gen_embedding_backward_adam_split_weighted_vbe_meta.cpp 2025-05-07T19:50:28.9607280Z Written: gen_embedding_backward_adam_split_weighted_meta.cpp 2025-05-07T19:50:28.9607811Z Written: gen_embedding_backward_adam_split_unweighted_nobag_meta.cpp 2025-05-07T19:50:28.9608304Z Written: gen_embedding_backward_adam_split_unweighted_vbe_meta.cpp 2025-05-07T19:50:28.9608750Z Written: gen_embedding_backward_adam_split_unweighted_meta.cpp 2025-05-07T19:50:28.9609225Z Written: gen_embedding_backward_adam_split_weighted_vbe_kernel_cta.cu 2025-05-07T19:50:28.9609702Z Written: gen_embedding_backward_adam_split_weighted_kernel_cta.cu 2025-05-07T19:50:28.9610302Z Written: gen_embedding_backward_adam_split_unweighted_nobag_kernel_cta.cu 2025-05-07T19:50:28.9610999Z Written: gen_embedding_backward_adam_split_unweighted_vbe_kernel_cta.cu 2025-05-07T19:50:28.9611531Z Written: gen_embedding_backward_adam_split_unweighted_kernel_cta.cu 2025-05-07T19:50:28.9612059Z Written: gen_embedding_backward_adam_split_weighted_vbe_kernel_warp.cu 2025-05-07T19:50:28.9612566Z Written: gen_embedding_backward_adam_split_weighted_kernel_warp.cu 2025-05-07T19:50:28.9613095Z Written: gen_embedding_backward_adam_split_unweighted_nobag_kernel_warp.cu 2025-05-07T19:50:28.9613656Z Written: gen_embedding_backward_adam_split_unweighted_vbe_kernel_warp.cu 2025-05-07T19:50:28.9614194Z Written: gen_embedding_backward_adam_split_unweighted_kernel_warp.cu 2025-05-07T19:50:28.9614676Z Written: gen_embedding_optimizer_adam_split_device_kernel.cuh 2025-05-07T19:50:28.9615096Z Written: gen_embedding_backward_split_adam.cpp 2025-05-07T19:50:28.9615471Z Written: gen_embedding_split_adam_pt2_autograd.cpp 2025-05-07T19:50:28.9615887Z Written: gen_embedding_backward_split_adam_pt2_cuda_wrapper.cpp 2025-05-07T19:50:28.9616279Z Written: lookup_adam.py 2025-05-07T19:50:28.9616560Z Written: gen_embedding_backward_split_adam_cpu.cpp 2025-05-07T19:50:28.9617095Z Written: gen_embedding_backward_split_adam_pt2_cpu_wrapper.cpp 2025-05-07T19:50:28.9617512Z Written: gen_embedding_backward_lamb_split_weighted_cuda.cu 2025-05-07T19:50:28.9617956Z Written: gen_embedding_backward_lamb_split_unweighted_nobag_cuda.cu 2025-05-07T19:50:28.9618414Z Written: gen_embedding_backward_lamb_split_unweighted_cuda.cu 2025-05-07T19:50:28.9618835Z Written: gen_embedding_backward_lamb_split_weighted_meta.cpp 2025-05-07T19:50:28.9619282Z Written: gen_embedding_backward_lamb_split_unweighted_nobag_meta.cpp 2025-05-07T19:50:28.9619734Z Written: gen_embedding_backward_lamb_split_unweighted_meta.cpp 2025-05-07T19:50:28.9620185Z Written: gen_embedding_backward_lamb_split_weighted_kernel_cta.cu 2025-05-07T19:50:28.9620663Z Written: gen_embedding_backward_lamb_split_unweighted_nobag_kernel_cta.cu 2025-05-07T19:50:28.9621157Z Written: gen_embedding_backward_lamb_split_unweighted_kernel_cta.cu 2025-05-07T19:50:28.9621624Z Written: gen_embedding_backward_lamb_split_weighted_kernel_warp.cu 2025-05-07T19:50:28.9622105Z Written: gen_embedding_backward_lamb_split_unweighted_nobag_kernel_warp.cu 2025-05-07T19:50:28.9622609Z Written: gen_embedding_backward_lamb_split_unweighted_kernel_warp.cu 2025-05-07T19:50:28.9623063Z Written: gen_embedding_optimizer_lamb_split_device_kernel.cuh 2025-05-07T19:50:28.9623459Z Written: gen_embedding_backward_split_lamb.cpp 2025-05-07T19:50:28.9623899Z Written: gen_embedding_split_lamb_pt2_autograd.cpp 2025-05-07T19:50:28.9624303Z Written: gen_embedding_backward_split_lamb_pt2_cuda_wrapper.cpp 2025-05-07T19:50:28.9624656Z Written: lookup_lamb.py 2025-05-07T19:50:28.9624928Z Written: gen_embedding_backward_split_lamb_cpu.cpp 2025-05-07T19:50:28.9625319Z Written: gen_embedding_backward_split_lamb_pt2_cpu_wrapper.cpp 2025-05-07T19:50:28.9625748Z Written: gen_embedding_backward_lars_sgd_split_weighted_cuda.cu 2025-05-07T19:50:28.9626217Z Written: gen_embedding_backward_lars_sgd_split_unweighted_nobag_cuda.cu 2025-05-07T19:50:28.9626687Z Written: gen_embedding_backward_lars_sgd_split_unweighted_cuda.cu 2025-05-07T19:50:28.9627142Z Written: gen_embedding_backward_lars_sgd_split_weighted_meta.cpp 2025-05-07T19:50:28.9627612Z Written: gen_embedding_backward_lars_sgd_split_unweighted_nobag_meta.cpp 2025-05-07T19:50:28.9628099Z Written: gen_embedding_backward_lars_sgd_split_unweighted_meta.cpp 2025-05-07T19:50:28.9629115Z Written: gen_embedding_backward_lars_sgd_split_weighted_kernel_cta.cu 2025-05-07T19:50:28.9629726Z Written: gen_embedding_backward_lars_sgd_split_unweighted_nobag_kernel_cta.cu 2025-05-07T19:50:28.9630301Z Written: gen_embedding_backward_lars_sgd_split_unweighted_kernel_cta.cu 2025-05-07T19:50:28.9630833Z Written: gen_embedding_backward_lars_sgd_split_weighted_kernel_warp.cu 2025-05-07T19:50:28.9631401Z Written: gen_embedding_backward_lars_sgd_split_unweighted_nobag_kernel_warp.cu 2025-05-07T19:50:28.9631962Z Written: gen_embedding_backward_lars_sgd_split_unweighted_kernel_warp.cu 2025-05-07T19:50:28.9632496Z Written: gen_embedding_optimizer_lars_sgd_split_device_kernel.cuh 2025-05-07T19:50:28.9632943Z Written: gen_embedding_backward_split_lars_sgd.cpp 2025-05-07T19:50:28.9633329Z Written: gen_embedding_split_lars_sgd_pt2_autograd.cpp 2025-05-07T19:50:28.9633793Z Written: gen_embedding_backward_split_lars_sgd_pt2_cuda_wrapper.cpp 2025-05-07T19:50:28.9634201Z Written: lookup_lars_sgd.py 2025-05-07T19:50:28.9634529Z Written: gen_embedding_backward_split_lars_sgd_cpu.cpp 2025-05-07T19:50:28.9634973Z Written: gen_embedding_backward_split_lars_sgd_pt2_cpu_wrapper.cpp 2025-05-07T19:50:28.9635599Z Written: gen_embedding_backward_partial_rowwise_adam_split_weighted_cuda.cu 2025-05-07T19:50:28.9636159Z Written: gen_embedding_backward_partial_rowwise_adam_split_unweighted_nobag_cuda.cu 2025-05-07T19:50:28.9636714Z Written: gen_embedding_backward_partial_rowwise_adam_split_unweighted_cuda.cu 2025-05-07T19:50:28.9637256Z Written: gen_embedding_backward_partial_rowwise_adam_split_weighted_meta.cpp 2025-05-07T19:50:28.9637812Z Written: gen_embedding_backward_partial_rowwise_adam_split_unweighted_nobag_meta.cpp 2025-05-07T19:50:28.9638385Z Written: gen_embedding_backward_partial_rowwise_adam_split_unweighted_meta.cpp 2025-05-07T19:50:28.9638932Z Written: gen_embedding_backward_partial_rowwise_adam_split_weighted_kernel_cta.cu 2025-05-07T19:50:28.9639532Z Written: gen_embedding_backward_partial_rowwise_adam_split_unweighted_nobag_kernel_cta.cu 2025-05-07T19:50:28.9640145Z Written: gen_embedding_backward_partial_rowwise_adam_split_unweighted_kernel_cta.cu 2025-05-07T19:50:28.9640714Z Written: gen_embedding_backward_partial_rowwise_adam_split_weighted_kernel_warp.cu 2025-05-07T19:50:28.9641312Z Written: gen_embedding_backward_partial_rowwise_adam_split_unweighted_nobag_kernel_warp.cu 2025-05-07T19:50:28.9641905Z Written: gen_embedding_backward_partial_rowwise_adam_split_unweighted_kernel_warp.cu 2025-05-07T19:50:29.0453219Z Written: gen_embedding_optimizer_partial_rowwise_adam_split_device_kernel.cuh 2025-05-07T19:50:29.0453844Z Written: gen_embedding_backward_split_partial_rowwise_adam.cpp 2025-05-07T19:50:29.0454378Z Written: gen_embedding_split_partial_rowwise_adam_pt2_autograd.cpp 2025-05-07T19:50:29.0454941Z Written: gen_embedding_backward_split_partial_rowwise_adam_pt2_cuda_wrapper.cpp 2025-05-07T19:50:29.0455439Z Written: lookup_partial_rowwise_adam.py 2025-05-07T19:50:29.0455880Z Written: gen_embedding_backward_split_partial_rowwise_adam_cpu.cpp 2025-05-07T19:50:29.0456668Z Written: gen_embedding_backward_split_partial_rowwise_adam_pt2_cpu_wrapper.cpp 2025-05-07T19:50:29.0457294Z Written: gen_embedding_backward_partial_rowwise_lamb_split_weighted_cuda.cu 2025-05-07T19:50:29.0457897Z Written: gen_embedding_backward_partial_rowwise_lamb_split_unweighted_nobag_cuda.cu 2025-05-07T19:50:29.0458739Z Written: gen_embedding_backward_partial_rowwise_lamb_split_unweighted_cuda.cu 2025-05-07T19:50:29.0459284Z Written: gen_embedding_backward_partial_rowwise_lamb_split_weighted_meta.cpp 2025-05-07T19:50:29.0459861Z Written: gen_embedding_backward_partial_rowwise_lamb_split_unweighted_nobag_meta.cpp 2025-05-07T19:50:29.0460447Z Written: gen_embedding_backward_partial_rowwise_lamb_split_unweighted_meta.cpp 2025-05-07T19:50:29.0461010Z Written: gen_embedding_backward_partial_rowwise_lamb_split_weighted_kernel_cta.cu 2025-05-07T19:50:29.0461629Z Written: gen_embedding_backward_partial_rowwise_lamb_split_unweighted_nobag_kernel_cta.cu 2025-05-07T19:50:29.0462354Z Written: gen_embedding_backward_partial_rowwise_lamb_split_unweighted_kernel_cta.cu 2025-05-07T19:50:29.0462957Z Written: gen_embedding_backward_partial_rowwise_lamb_split_weighted_kernel_warp.cu 2025-05-07T19:50:29.0463582Z Written: gen_embedding_backward_partial_rowwise_lamb_split_unweighted_nobag_kernel_warp.cu 2025-05-07T19:50:29.0464193Z Written: gen_embedding_backward_partial_rowwise_lamb_split_unweighted_kernel_warp.cu 2025-05-07T19:50:29.0464776Z Written: gen_embedding_optimizer_partial_rowwise_lamb_split_device_kernel.cuh 2025-05-07T19:50:29.0465270Z Written: gen_embedding_backward_split_partial_rowwise_lamb.cpp 2025-05-07T19:50:29.0465738Z Written: gen_embedding_split_partial_rowwise_lamb_pt2_autograd.cpp 2025-05-07T19:50:29.0466250Z Written: gen_embedding_backward_split_partial_rowwise_lamb_pt2_cuda_wrapper.cpp 2025-05-07T19:50:29.0466706Z Written: lookup_partial_rowwise_lamb.py 2025-05-07T19:50:29.0467107Z Written: gen_embedding_backward_split_partial_rowwise_lamb_cpu.cpp 2025-05-07T19:50:29.0467616Z Written: gen_embedding_backward_split_partial_rowwise_lamb_pt2_cpu_wrapper.cpp 2025-05-07T19:50:29.0468164Z Written: gen_embedding_backward_rowwise_adagrad_ssd_weighted_vbe_cuda.cu 2025-05-07T19:50:29.0468678Z Written: gen_embedding_backward_rowwise_adagrad_split_weighted_vbe_cuda.cu 2025-05-07T19:50:29.0469195Z Written: gen_embedding_backward_rowwise_adagrad_ssd_weighted_cuda.cu 2025-05-07T19:50:29.0469684Z Written: gen_embedding_backward_rowwise_adagrad_split_weighted_cuda.cu 2025-05-07T19:50:29.0470210Z Written: gen_embedding_backward_rowwise_adagrad_ssd_unweighted_nobag_cuda.cu 2025-05-07T19:50:29.0470766Z Written: gen_embedding_backward_rowwise_adagrad_split_unweighted_nobag_cuda.cu 2025-05-07T19:50:29.0471300Z Written: gen_embedding_backward_rowwise_adagrad_ssd_unweighted_vbe_cuda.cu 2025-05-07T19:50:29.0471841Z Written: gen_embedding_backward_rowwise_adagrad_split_unweighted_vbe_cuda.cu 2025-05-07T19:50:29.0472363Z Written: gen_embedding_backward_rowwise_adagrad_ssd_unweighted_cuda.cu 2025-05-07T19:50:29.0472891Z Written: gen_embedding_backward_rowwise_adagrad_split_unweighted_cuda.cu 2025-05-07T19:50:29.0473421Z Written: gen_embedding_backward_rowwise_adagrad_ssd_weighted_vbe_meta.cpp 2025-05-07T19:50:29.0473967Z Written: gen_embedding_backward_rowwise_adagrad_split_weighted_vbe_meta.cpp 2025-05-07T19:50:29.0474494Z Written: gen_embedding_backward_rowwise_adagrad_ssd_weighted_meta.cpp 2025-05-07T19:50:29.0475002Z Written: gen_embedding_backward_rowwise_adagrad_split_weighted_meta.cpp 2025-05-07T19:50:29.0475527Z Written: gen_embedding_backward_rowwise_adagrad_ssd_unweighted_nobag_meta.cpp 2025-05-07T19:50:29.0476095Z Written: gen_embedding_backward_rowwise_adagrad_split_unweighted_nobag_meta.cpp 2025-05-07T19:50:29.0476643Z Written: gen_embedding_backward_rowwise_adagrad_ssd_unweighted_vbe_meta.cpp 2025-05-07T19:50:29.0477196Z Written: gen_embedding_backward_rowwise_adagrad_split_unweighted_vbe_meta.cpp 2025-05-07T19:50:29.0477725Z Written: gen_embedding_backward_rowwise_adagrad_ssd_unweighted_meta.cpp 2025-05-07T19:50:29.0478390Z Written: gen_embedding_backward_rowwise_adagrad_split_unweighted_meta.cpp 2025-05-07T19:50:29.0478951Z Written: gen_embedding_backward_rowwise_adagrad_ssd_weighted_vbe_kernel_cta.cu 2025-05-07T19:50:29.0479512Z Written: gen_embedding_backward_rowwise_adagrad_split_weighted_vbe_kernel_cta.cu 2025-05-07T19:50:29.0480078Z Written: gen_embedding_backward_rowwise_adagrad_ssd_weighted_kernel_cta.cu 2025-05-07T19:50:29.0480604Z Written: gen_embedding_backward_rowwise_adagrad_split_weighted_kernel_cta.cu 2025-05-07T19:50:29.0481178Z Written: gen_embedding_backward_rowwise_adagrad_ssd_unweighted_nobag_kernel_cta.cu 2025-05-07T19:50:29.0481761Z Written: gen_embedding_backward_rowwise_adagrad_split_unweighted_nobag_kernel_cta.cu 2025-05-07T19:50:29.0482362Z Written: gen_embedding_backward_rowwise_adagrad_ssd_unweighted_vbe_kernel_cta.cu 2025-05-07T19:50:29.0482954Z Written: gen_embedding_backward_rowwise_adagrad_split_unweighted_vbe_kernel_cta.cu 2025-05-07T19:50:29.0483577Z Written: gen_embedding_backward_rowwise_adagrad_ssd_unweighted_kernel_cta.cu 2025-05-07T19:50:29.0484136Z Written: gen_embedding_backward_rowwise_adagrad_split_unweighted_kernel_cta.cu 2025-05-07T19:50:29.0484691Z Written: gen_embedding_backward_rowwise_adagrad_ssd_weighted_vbe_kernel_warp.cu 2025-05-07T19:50:29.0485273Z Written: gen_embedding_backward_rowwise_adagrad_split_weighted_vbe_kernel_warp.cu 2025-05-07T19:50:29.0485840Z Written: gen_embedding_backward_rowwise_adagrad_ssd_weighted_kernel_warp.cu 2025-05-07T19:50:29.0486375Z Written: gen_embedding_backward_rowwise_adagrad_split_weighted_kernel_warp.cu 2025-05-07T19:50:29.0486960Z Written: gen_embedding_backward_rowwise_adagrad_ssd_unweighted_nobag_kernel_warp.cu 2025-05-07T19:50:29.0487549Z Written: gen_embedding_backward_rowwise_adagrad_split_unweighted_nobag_kernel_warp.cu 2025-05-07T19:50:29.0488152Z Written: gen_embedding_backward_rowwise_adagrad_ssd_unweighted_vbe_kernel_warp.cu 2025-05-07T19:50:29.0488735Z Written: gen_embedding_backward_rowwise_adagrad_split_unweighted_vbe_kernel_warp.cu 2025-05-07T19:50:29.0489319Z Written: gen_embedding_backward_rowwise_adagrad_ssd_unweighted_kernel_warp.cu 2025-05-07T19:50:29.0489888Z Written: gen_embedding_backward_rowwise_adagrad_split_unweighted_kernel_warp.cu 2025-05-07T19:50:29.0490759Z Written: gen_embedding_backward_rowwise_adagrad_split_weighted_vbe_gwd_kernel_cta.cu 2025-05-07T19:50:29.0491407Z Written: gen_embedding_backward_rowwise_adagrad_split_weighted_gwd_kernel_cta.cu 2025-05-07T19:50:29.0492066Z Written: gen_embedding_backward_rowwise_adagrad_split_unweighted_vbe_gwd_kernel_cta.cu 2025-05-07T19:50:29.0492711Z Written: gen_embedding_backward_rowwise_adagrad_split_unweighted_gwd_kernel_cta.cu 2025-05-07T19:50:29.0493366Z Written: gen_embedding_backward_rowwise_adagrad_split_weighted_vbe_gwd_kernel_warp.cu 2025-05-07T19:50:29.0493995Z Written: gen_embedding_backward_rowwise_adagrad_split_weighted_gwd_kernel_warp.cu 2025-05-07T19:50:29.0494662Z Written: gen_embedding_backward_rowwise_adagrad_split_unweighted_vbe_gwd_kernel_warp.cu 2025-05-07T19:50:29.0495334Z Written: gen_embedding_backward_rowwise_adagrad_split_unweighted_gwd_kernel_warp.cu 2025-05-07T19:50:29.0495948Z Written: gen_embedding_backward_rowwise_adagrad_split_weighted_vbe_gwd_cuda.cu 2025-05-07T19:50:29.0496549Z Written: gen_embedding_backward_rowwise_adagrad_split_weighted_gwd_cuda.cu 2025-05-07T19:50:29.0497130Z Written: gen_embedding_backward_rowwise_adagrad_split_unweighted_vbe_gwd_cuda.cu 2025-05-07T19:50:29.0497742Z Written: gen_embedding_backward_rowwise_adagrad_split_unweighted_gwd_cuda.cu 2025-05-07T19:50:29.0498303Z Written: gen_embedding_optimizer_rowwise_adagrad_ssd_device_kernel.cuh 2025-05-07T19:50:29.0498869Z Written: gen_embedding_optimizer_rowwise_adagrad_split_device_kernel.cuh 2025-05-07T19:50:29.0499387Z Written: gen_embedding_backward_ssd_rowwise_adagrad.cpp 2025-05-07T19:50:29.0499828Z Written: gen_embedding_ssd_rowwise_adagrad_pt2_autograd.cpp 2025-05-07T19:50:29.0500354Z Written: gen_embedding_backward_ssd_rowwise_adagrad_pt2_cuda_wrapper.cpp 2025-05-07T19:50:29.0500885Z Written: lookup_rowwise_adagrad_ssd.py 2025-05-07T19:50:29.0501283Z Written: gen_embedding_backward_split_rowwise_adagrad.cpp 2025-05-07T19:50:29.0501734Z Written: gen_embedding_split_rowwise_adagrad_pt2_autograd.cpp 2025-05-07T19:50:29.0502271Z Written: gen_embedding_backward_split_rowwise_adagrad_pt2_cuda_wrapper.cpp 2025-05-07T19:50:29.0502744Z Written: lookup_rowwise_adagrad.py 2025-05-07T19:50:29.0503226Z Written: gen_embedding_backward_rowwise_adagrad_split_cpu.cpp 2025-05-07T19:50:29.0503679Z Written: gen_embedding_backward_split_rowwise_adagrad_cpu.cpp 2025-05-07T19:50:29.0504151Z Written: gen_embedding_backward_split_rowwise_adagrad_pt2_cpu_wrapper.cpp 2025-05-07T19:50:29.0504707Z Written: gen_embedding_optimizer_approx_rowwise_adagrad_split_device_kernel.cuh 2025-05-07T19:50:29.0505216Z Written: gen_embedding_backward_split_approx_rowwise_adagrad.cpp 2025-05-07T19:50:29.0505791Z Written: gen_embedding_split_approx_rowwise_adagrad_pt2_autograd.cpp 2025-05-07T19:50:29.0506319Z Written: gen_embedding_backward_split_approx_rowwise_adagrad_pt2_cuda_wrapper.cpp 2025-05-07T19:50:29.0506861Z Written: gen_embedding_backward_split_approx_rowwise_adagrad_cpu.cpp 2025-05-07T19:50:29.0507391Z Written: gen_embedding_backward_split_approx_rowwise_adagrad_pt2_cpu_wrapper.cpp 2025-05-07T19:50:29.0507992Z Written: gen_embedding_optimizer_rowwise_adagrad_with_weight_decay_split_device_kernel.cuh 2025-05-07T19:50:29.0508588Z Written: gen_embedding_backward_split_rowwise_adagrad_with_weight_decay.cpp 2025-05-07T19:50:29.0509130Z Written: gen_embedding_split_rowwise_adagrad_with_weight_decay_pt2_autograd.cpp 2025-05-07T19:50:29.0509748Z Written: gen_embedding_backward_split_rowwise_adagrad_with_weight_decay_pt2_cuda_wrapper.cpp 2025-05-07T19:50:29.0510369Z Written: gen_embedding_backward_split_rowwise_adagrad_with_weight_decay_cpu.cpp 2025-05-07T19:50:29.0510966Z Written: gen_embedding_backward_split_rowwise_adagrad_with_weight_decay_pt2_cpu_wrapper.cpp 2025-05-07T19:50:29.0511662Z Written: gen_embedding_optimizer_approx_rowwise_adagrad_with_weight_decay_split_device_kernel.cuh 2025-05-07T19:50:29.0512294Z Written: gen_embedding_backward_split_approx_rowwise_adagrad_with_weight_decay.cpp 2025-05-07T19:50:29.0512897Z Written: gen_embedding_split_approx_rowwise_adagrad_with_weight_decay_pt2_autograd.cpp 2025-05-07T19:50:29.0513568Z Written: gen_embedding_backward_split_approx_rowwise_adagrad_with_weight_decay_pt2_cuda_wrapper.cpp 2025-05-07T19:50:29.0514226Z Written: gen_embedding_backward_split_approx_rowwise_adagrad_with_weight_decay_cpu.cpp 2025-05-07T19:50:29.1491966Z Written: gen_embedding_backward_split_approx_rowwise_adagrad_with_weight_decay_pt2_cpu_wrapper.cpp 2025-05-07T19:50:29.1494224Z Written: gen_embedding_backward_rowwise_adagrad_with_counter_split_weighted_vbe_cuda.cu 2025-05-07T19:50:29.1496114Z Written: gen_embedding_backward_rowwise_adagrad_with_counter_split_weighted_cuda.cu 2025-05-07T19:50:29.1497623Z Written: gen_embedding_backward_rowwise_adagrad_with_counter_split_unweighted_nobag_cuda.cu 2025-05-07T19:50:29.1498315Z Written: gen_embedding_backward_rowwise_adagrad_with_counter_split_unweighted_vbe_cuda.cu 2025-05-07T19:50:29.1498992Z Written: gen_embedding_backward_rowwise_adagrad_with_counter_split_unweighted_cuda.cu 2025-05-07T19:50:29.1499884Z Written: gen_embedding_backward_rowwise_adagrad_with_counter_split_weighted_vbe_meta.cpp 2025-05-07T19:50:29.1500492Z Written: gen_embedding_backward_rowwise_adagrad_with_counter_split_weighted_meta.cpp 2025-05-07T19:50:29.1501130Z Written: gen_embedding_backward_rowwise_adagrad_with_counter_split_unweighted_nobag_meta.cpp 2025-05-07T19:50:29.1501768Z Written: gen_embedding_backward_rowwise_adagrad_with_counter_split_unweighted_vbe_meta.cpp 2025-05-07T19:50:29.1502403Z Written: gen_embedding_backward_rowwise_adagrad_with_counter_split_unweighted_meta.cpp 2025-05-07T19:50:29.1503034Z Written: gen_embedding_backward_rowwise_adagrad_with_counter_split_weighted_vbe_kernel_cta.cu 2025-05-07T19:50:29.1503925Z Written: gen_embedding_backward_rowwise_adagrad_with_counter_split_weighted_kernel_cta.cu 2025-05-07T19:50:29.1504612Z Written: gen_embedding_backward_rowwise_adagrad_with_counter_split_unweighted_nobag_kernel_cta.cu 2025-05-07T19:50:29.1505295Z Written: gen_embedding_backward_rowwise_adagrad_with_counter_split_unweighted_vbe_kernel_cta.cu 2025-05-07T19:50:29.1505969Z Written: gen_embedding_backward_rowwise_adagrad_with_counter_split_unweighted_kernel_cta.cu 2025-05-07T19:50:29.1506618Z Written: gen_embedding_backward_rowwise_adagrad_with_counter_split_weighted_vbe_kernel_warp.cu 2025-05-07T19:50:29.1507281Z Written: gen_embedding_backward_rowwise_adagrad_with_counter_split_weighted_kernel_warp.cu 2025-05-07T19:50:29.1507959Z Written: gen_embedding_backward_rowwise_adagrad_with_counter_split_unweighted_nobag_kernel_warp.cu 2025-05-07T19:50:29.1508638Z Written: gen_embedding_backward_rowwise_adagrad_with_counter_split_unweighted_vbe_kernel_warp.cu 2025-05-07T19:50:29.1509411Z Written: gen_embedding_backward_rowwise_adagrad_with_counter_split_unweighted_kernel_warp.cu 2025-05-07T19:50:29.1510038Z Written: gen_embedding_optimizer_rowwise_adagrad_with_counter_split_device_kernel.cuh 2025-05-07T19:50:29.1510617Z Written: gen_embedding_backward_split_rowwise_adagrad_with_counter.cpp 2025-05-07T19:50:29.1511145Z Written: gen_embedding_split_rowwise_adagrad_with_counter_pt2_autograd.cpp 2025-05-07T19:50:29.1511714Z Written: gen_embedding_backward_split_rowwise_adagrad_with_counter_pt2_cuda_wrapper.cpp 2025-05-07T19:50:29.1512223Z Written: lookup_rowwise_adagrad_with_counter.py 2025-05-07T19:50:29.1512651Z Written: gen_embedding_backward_split_rowwise_adagrad_with_counter_cpu.cpp 2025-05-07T19:50:29.1513233Z Written: gen_embedding_backward_split_rowwise_adagrad_with_counter_pt2_cpu_wrapper.cpp 2025-05-07T19:50:29.1513861Z Written: gen_embedding_optimizer_approx_rowwise_adagrad_with_counter_split_device_kernel.cuh 2025-05-07T19:50:29.1514475Z Written: gen_embedding_backward_split_approx_rowwise_adagrad_with_counter.cpp 2025-05-07T19:50:29.1515054Z Written: gen_embedding_split_approx_rowwise_adagrad_with_counter_pt2_autograd.cpp 2025-05-07T19:50:29.1515675Z Written: gen_embedding_backward_split_approx_rowwise_adagrad_with_counter_pt2_cuda_wrapper.cpp 2025-05-07T19:50:29.1516308Z Written: gen_embedding_backward_split_approx_rowwise_adagrad_with_counter_cpu.cpp 2025-05-07T19:50:29.1516919Z Written: gen_embedding_backward_split_approx_rowwise_adagrad_with_counter_pt2_cpu_wrapper.cpp 2025-05-07T19:50:29.1517554Z Written: gen_embedding_optimizer_rowwise_weighted_adagrad_split_device_kernel.cuh 2025-05-07T19:50:29.1518094Z Written: gen_embedding_backward_split_rowwise_weighted_adagrad.cpp 2025-05-07T19:50:29.1518572Z Written: gen_embedding_split_rowwise_weighted_adagrad_pt2_autograd.cpp 2025-05-07T19:50:29.1519126Z Written: gen_embedding_backward_split_rowwise_weighted_adagrad_pt2_cuda_wrapper.cpp 2025-05-07T19:50:29.1519668Z Written: gen_embedding_backward_split_rowwise_weighted_adagrad_cpu.cpp 2025-05-07T19:50:29.1520223Z Written: gen_embedding_backward_split_rowwise_weighted_adagrad_pt2_cpu_wrapper.cpp 2025-05-07T19:50:29.1520737Z Written: gen_embedding_backward_sgd_split_weighted_vbe_cuda.cu 2025-05-07T19:50:29.1521180Z Written: gen_embedding_backward_sgd_split_weighted_cuda.cu 2025-05-07T19:50:29.1521630Z Written: gen_embedding_backward_sgd_split_unweighted_nobag_cuda.cu 2025-05-07T19:50:29.1522085Z Written: gen_embedding_backward_sgd_split_unweighted_vbe_cuda.cu 2025-05-07T19:50:29.1522540Z Written: gen_embedding_backward_sgd_split_unweighted_cuda.cu 2025-05-07T19:50:29.1522977Z Written: gen_embedding_backward_sgd_split_weighted_vbe_meta.cpp 2025-05-07T19:50:29.1523436Z Written: gen_embedding_backward_sgd_split_weighted_meta.cpp 2025-05-07T19:50:29.1523884Z Written: gen_embedding_backward_sgd_split_unweighted_nobag_meta.cpp 2025-05-07T19:50:29.1524370Z Written: gen_embedding_backward_sgd_split_unweighted_vbe_meta.cpp 2025-05-07T19:50:29.1524835Z Written: gen_embedding_backward_sgd_split_unweighted_meta.cpp 2025-05-07T19:50:29.1525386Z Written: gen_embedding_backward_sgd_split_weighted_vbe_kernel_cta.cu 2025-05-07T19:50:29.1525903Z Written: gen_embedding_backward_sgd_split_weighted_kernel_cta.cu 2025-05-07T19:50:29.1526397Z Written: gen_embedding_backward_sgd_split_unweighted_nobag_kernel_cta.cu 2025-05-07T19:50:29.1526942Z Written: gen_embedding_backward_sgd_split_unweighted_vbe_kernel_cta.cu 2025-05-07T19:50:29.1527437Z Written: gen_embedding_backward_sgd_split_unweighted_kernel_cta.cu 2025-05-07T19:50:29.1527962Z Written: gen_embedding_backward_sgd_split_weighted_vbe_kernel_warp.cu 2025-05-07T19:50:29.1528992Z Written: gen_embedding_backward_sgd_split_weighted_kernel_warp.cu 2025-05-07T19:50:29.1529544Z Written: gen_embedding_backward_sgd_split_unweighted_nobag_kernel_warp.cu 2025-05-07T19:50:29.1530214Z Written: gen_embedding_backward_sgd_split_unweighted_vbe_kernel_warp.cu 2025-05-07T19:50:29.1530781Z Written: gen_embedding_backward_sgd_split_unweighted_kernel_warp.cu 2025-05-07T19:50:29.1531437Z Written: gen_embedding_optimizer_sgd_split_device_kernel.cuh 2025-05-07T19:50:29.1531878Z Written: gen_embedding_backward_split_sgd.cpp 2025-05-07T19:50:29.1532298Z Written: gen_embedding_split_sgd_pt2_autograd.cpp 2025-05-07T19:50:29.1532751Z Written: gen_embedding_backward_split_sgd_pt2_cuda_wrapper.cpp 2025-05-07T19:50:29.1533146Z Written: lookup_sgd.py 2025-05-07T19:50:29.1533463Z Written: gen_embedding_backward_sgd_split_cpu.cpp 2025-05-07T19:50:29.1533845Z Written: gen_embedding_backward_split_sgd_cpu.cpp 2025-05-07T19:50:29.1534291Z Written: gen_embedding_backward_split_sgd_pt2_cpu_wrapper.cpp 2025-05-07T19:50:29.1534790Z Written: gen_embedding_optimizer_approx_sgd_split_device_kernel.cuh 2025-05-07T19:50:29.1535274Z Written: gen_embedding_backward_split_approx_sgd.cpp 2025-05-07T19:50:29.1535693Z Written: gen_embedding_split_approx_sgd_pt2_autograd.cpp 2025-05-07T19:50:29.1536192Z Written: gen_embedding_backward_split_approx_sgd_pt2_cuda_wrapper.cpp 2025-05-07T19:50:29.1536799Z Written: gen_embedding_backward_split_approx_sgd_cpu.cpp 2025-05-07T19:50:29.1537238Z Written: gen_embedding_backward_split_approx_sgd_pt2_cpu_wrapper.cpp 2025-05-07T19:50:29.1537710Z Written: gen_embedding_backward_none_split_weighted_cuda.cu 2025-05-07T19:50:29.1538153Z Written: gen_embedding_backward_none_split_unweighted_nobag_cuda.cu 2025-05-07T19:50:29.1538623Z Written: gen_embedding_backward_none_split_unweighted_cuda.cu 2025-05-07T19:50:29.1539057Z Written: gen_embedding_backward_none_split_weighted_meta.cpp 2025-05-07T19:50:29.1539545Z Written: gen_embedding_backward_none_split_unweighted_nobag_meta.cpp 2025-05-07T19:50:29.1540060Z Written: gen_embedding_backward_none_split_unweighted_meta.cpp 2025-05-07T19:50:29.1540525Z Written: gen_embedding_backward_none_split_weighted_kernel_cta.cu 2025-05-07T19:50:29.1541067Z Written: gen_embedding_backward_none_split_unweighted_nobag_kernel_cta.cu 2025-05-07T19:50:29.1541577Z Written: gen_embedding_backward_none_split_unweighted_kernel_cta.cu 2025-05-07T19:50:29.1542100Z Written: gen_embedding_backward_none_split_weighted_kernel_warp.cu 2025-05-07T19:50:29.1542608Z Written: gen_embedding_backward_none_split_unweighted_nobag_kernel_warp.cu 2025-05-07T19:50:29.1543155Z Written: gen_embedding_backward_none_split_unweighted_kernel_warp.cu 2025-05-07T19:50:29.1543646Z Written: gen_embedding_optimizer_none_split_device_kernel.cuh 2025-05-07T19:50:29.1544047Z Written: gen_embedding_backward_split_none.cpp 2025-05-07T19:50:29.1544429Z Written: gen_embedding_split_none_pt2_autograd.cpp 2025-05-07T19:50:29.1544845Z Written: gen_embedding_backward_split_none_pt2_cuda_wrapper.cpp 2025-05-07T19:50:29.1545247Z Written: lookup_none.py 2025-05-07T19:50:29.1545531Z Written: gen_embedding_backward_split_none_cpu.cpp 2025-05-07T19:50:29.1545966Z Written: gen_embedding_backward_split_none_pt2_cpu_wrapper.cpp 2025-05-07T19:50:29.1546434Z Written: gen_embedding_backward_split_weighted_device_kernel_hip.hip 2025-05-07T19:50:29.1546958Z Written: gen_embedding_backward_split_unweighted_nobag_device_kernel_hip.hip 2025-05-07T19:50:29.1547591Z Written: gen_embedding_backward_split_unweighted_device_kernel_hip.hip 2025-05-07T19:50:29.1548080Z Written: gen_embedding_backward_ssd_weighted_vbe_device_kernel.cuh 2025-05-07T19:50:29.1548571Z Written: gen_embedding_backward_split_weighted_vbe_device_kernel.cuh 2025-05-07T19:50:29.1549029Z Written: gen_embedding_backward_ssd_weighted_device_kernel.cuh 2025-05-07T19:50:29.1549491Z Written: gen_embedding_backward_split_weighted_device_kernel.cuh 2025-05-07T19:50:29.1549962Z Written: gen_embedding_backward_ssd_unweighted_nobag_device_kernel.cuh 2025-05-07T19:50:29.1550482Z Written: gen_embedding_backward_split_unweighted_nobag_device_kernel.cuh 2025-05-07T19:50:29.1550994Z Written: gen_embedding_backward_ssd_unweighted_vbe_device_kernel.cuh 2025-05-07T19:50:29.1551483Z Written: gen_embedding_backward_split_unweighted_vbe_device_kernel.cuh 2025-05-07T19:50:29.1551967Z Written: gen_embedding_backward_ssd_unweighted_device_kernel.cuh 2025-05-07T19:50:29.1552484Z Written: gen_embedding_backward_split_unweighted_device_kernel.cuh 2025-05-07T19:50:29.1552953Z Written: gen_embedding_backward_split_common_device_kernel.cuh 2025-05-07T19:50:29.1553383Z Written: gen_embedding_backward_split_grad_embedding_ops.cu 2025-05-07T19:50:29.1553851Z Written: gen_embedding_backward_dense_indice_weights_codegen_cuda.cu 2025-05-07T19:50:29.1554372Z Written: gen_embedding_backward_ssd_indice_weights_codegen_cuda.cu 2025-05-07T19:50:29.1554861Z Written: gen_embedding_backward_split_indice_weights_codegen_cuda.cu 2025-05-07T19:50:29.1555289Z Written: pt2_arg_utils.h 2025-05-07T19:50:29.1555536Z Written: __init__.py 2025-05-07T19:50:29.1555803Z Written: lookup_args_ssd.py 2025-05-07T19:50:29.1556059Z Written: lookup_args.py 2025-05-07T19:50:29.1632756Z 2025-05-07T19:50:29.1633217Z 2025-05-07T19:50:29.1633733Z ================================================================================ 2025-05-07T19:50:29.1634790Z Running code generation script ... 2025-05-07T19:50:29.1635842Z /github/home/miniconda/envs/build_binary/bin/python /__w/FBGEMM/FBGEMM/fbgemm_gpu/codegen/genscript/generate_embedding_optimizer.py --opensource 2025-05-07T19:50:29.1636681Z ================================================================================ 2025-05-07T19:50:29.1636921Z 2025-05-07T19:50:29.2700716Z [ARGS PARSE] Parsed arguments: Namespace(install_dir='.', is_fbcode=False, is_rocm=False) 2025-05-07T19:50:29.2703294Z [GENERATE OPTIMIZERS]: ['/__w/FBGEMM/FBGEMM/fbgemm_gpu/codegen/genscript/generate_embedding_optimizer.py', '--opensource'] 2025-05-07T19:50:29.2705479Z Written: gen_embedding_optimizer_rowwise_adagrad_split_cuda.cu 2025-05-07T19:50:29.2706889Z Written: gen_embedding_optimizer_rowwise_adagrad_split_kernel.cu 2025-05-07T19:50:29.2707946Z Written: gen_embedding_optimizer_rowwise_adagrad_split.cpp 2025-05-07T19:50:29.2708422Z Written: gen_embedding_optimizer_rowwise_adagrad_split_device_kernel.cuh 2025-05-07T19:50:29.2708927Z Written: split_embedding_optimizer_rowwise_adagrad.py 2025-05-07T19:50:29.2709294Z Written: optimizer_args.py 2025-05-07T19:50:29.2812761Z 2025-05-07T19:50:29.2813190Z 2025-05-07T19:50:29.2813919Z ================================================================================ 2025-05-07T19:50:29.2815033Z Running code generation script ... 2025-05-07T19:50:29.2816549Z /github/home/miniconda/envs/build_binary/bin/python /__w/FBGEMM/FBGEMM/fbgemm_gpu/codegen/genscript/generate_forward_quantized.py --opensource 2025-05-07T19:50:29.2817356Z ================================================================================ 2025-05-07T19:50:29.2817588Z 2025-05-07T19:50:29.3991276Z [ARGS PARSE] Parsed arguments: Namespace(install_dir='.', is_fbcode=False, is_rocm=False) 2025-05-07T19:50:29.3993893Z [GENERATE FORWARD QUANTIZED]: ['/__w/FBGEMM/FBGEMM/fbgemm_gpu/codegen/genscript/generate_forward_quantized.py', '--opensource'] 2025-05-07T19:50:29.3995745Z Written: gen_embedding_forward_quantized_split_nbit_kernel_weighted_fp32_codegen_cuda.cu 2025-05-07T19:50:29.3996428Z Written: gen_embedding_forward_quantized_split_nbit_kernel_weighted_fp16_codegen_cuda.cu 2025-05-07T19:50:29.3997349Z Written: gen_embedding_forward_quantized_split_nbit_kernel_weighted_fp8_codegen_cuda.cu 2025-05-07T19:50:29.3998057Z Written: gen_embedding_forward_quantized_split_nbit_kernel_weighted_int8_codegen_cuda.cu 2025-05-07T19:50:29.3998812Z Written: gen_embedding_forward_quantized_split_nbit_kernel_weighted_int4_codegen_cuda.cu 2025-05-07T19:50:29.3999445Z Written: gen_embedding_forward_quantized_split_nbit_kernel_weighted_int2_codegen_cuda.cu 2025-05-07T19:50:29.4000109Z Written: gen_embedding_forward_quantized_split_nbit_kernel_unweighted_nobag_fp32_codegen_cuda.cu 2025-05-07T19:50:29.4000826Z Written: gen_embedding_forward_quantized_split_nbit_kernel_unweighted_nobag_fp16_codegen_cuda.cu 2025-05-07T19:50:29.4001512Z Written: gen_embedding_forward_quantized_split_nbit_kernel_unweighted_nobag_fp8_codegen_cuda.cu 2025-05-07T19:50:29.4002221Z Written: gen_embedding_forward_quantized_split_nbit_kernel_unweighted_nobag_int8_codegen_cuda.cu 2025-05-07T19:50:29.4003057Z Written: gen_embedding_forward_quantized_split_nbit_kernel_unweighted_nobag_int4_codegen_cuda.cu 2025-05-07T19:50:29.4003745Z Written: gen_embedding_forward_quantized_split_nbit_kernel_unweighted_nobag_int2_codegen_cuda.cu 2025-05-07T19:50:29.4004439Z Written: gen_embedding_forward_quantized_split_nbit_kernel_unweighted_fp32_codegen_cuda.cu 2025-05-07T19:50:29.4005092Z Written: gen_embedding_forward_quantized_split_nbit_kernel_unweighted_fp16_codegen_cuda.cu 2025-05-07T19:50:29.4005760Z Written: gen_embedding_forward_quantized_split_nbit_kernel_unweighted_fp8_codegen_cuda.cu 2025-05-07T19:50:29.4006425Z Written: gen_embedding_forward_quantized_split_nbit_kernel_unweighted_int8_codegen_cuda.cu 2025-05-07T19:50:29.4007070Z Written: gen_embedding_forward_quantized_split_nbit_kernel_unweighted_int4_codegen_cuda.cu 2025-05-07T19:50:29.4007738Z Written: gen_embedding_forward_quantized_split_nbit_kernel_unweighted_int2_codegen_cuda.cu 2025-05-07T19:50:29.4008364Z Written: gen_embedding_forward_quantized_split_nbit_host_weighted_codegen_cuda.cu 2025-05-07T19:50:29.4009016Z Written: gen_embedding_forward_quantized_split_nbit_host_unweighted_nobag_codegen_cuda.cu 2025-05-07T19:50:29.4009660Z Written: gen_embedding_forward_quantized_split_nbit_host_unweighted_codegen_cuda.cu 2025-05-07T19:50:29.4010331Z Written: gen_embedding_forward_quantized_weighted_codegen_cpu.cpp 2025-05-07T19:50:29.4011054Z Written: gen_embedding_forward_quantized_unweighted_codegen_cpu.cpp 2025-05-07T19:50:29.4096072Z 2025-05-07T19:50:29.4096164Z 2025-05-07T19:50:29.4096676Z ================================================================================ 2025-05-07T19:50:29.4097803Z Running code generation script ... 2025-05-07T19:50:29.4098632Z /github/home/miniconda/envs/build_binary/bin/python /__w/FBGEMM/FBGEMM/fbgemm_gpu/codegen/genscript/generate_forward_split.py --opensource 2025-05-07T19:50:29.4099423Z ================================================================================ 2025-05-07T19:50:29.4099710Z 2025-05-07T19:50:29.7454248Z [ARGS PARSE] Parsed arguments: Namespace(install_dir='.', is_fbcode=False, is_rocm=False) 2025-05-07T19:50:29.7456075Z [GENERATE FORWARD SPLIT]: ['/__w/FBGEMM/FBGEMM/fbgemm_gpu/codegen/genscript/generate_forward_split.py', '--opensource'] 2025-05-07T19:50:29.7456846Z Written: gen_embedding_forward_dense_weighted_vbe_codegen_cuda.cu 2025-05-07T19:50:29.7457387Z Written: gen_embedding_forward_dense_weighted_codegen_cuda.cu 2025-05-07T19:50:29.7457928Z Written: gen_embedding_forward_dense_unweighted_vbe_codegen_cuda.cu 2025-05-07T19:50:29.7458444Z Written: gen_embedding_forward_dense_unweighted_codegen_cuda.cu 2025-05-07T19:50:29.7459085Z Written: gen_embedding_forward_ssd_weighted_vbe_codegen_cuda.cu 2025-05-07T19:50:29.7459555Z Written: gen_embedding_forward_split_weighted_vbe_codegen_cuda.cu 2025-05-07T19:50:29.7460032Z Written: gen_embedding_forward_ssd_weighted_codegen_cuda.cu 2025-05-07T19:50:29.7460475Z Written: gen_embedding_forward_split_weighted_codegen_cuda.cu 2025-05-07T19:50:29.7460979Z Written: gen_embedding_forward_ssd_unweighted_vbe_codegen_cuda.cu 2025-05-07T19:50:29.7461726Z Written: gen_embedding_forward_split_unweighted_vbe_codegen_cuda.cu 2025-05-07T19:50:29.7462244Z Written: gen_embedding_forward_ssd_unweighted_codegen_cuda.cu 2025-05-07T19:50:29.7462729Z Written: gen_embedding_forward_split_unweighted_codegen_cuda.cu 2025-05-07T19:50:29.7463215Z Written: gen_embedding_forward_split_weighted_vbe_gwd_codegen_cuda.cu 2025-05-07T19:50:29.7463726Z Written: gen_embedding_forward_split_weighted_gwd_codegen_cuda.cu 2025-05-07T19:50:29.7464218Z Written: gen_embedding_forward_split_unweighted_vbe_gwd_codegen_cuda.cu 2025-05-07T19:50:29.7464750Z Written: gen_embedding_forward_split_unweighted_gwd_codegen_cuda.cu 2025-05-07T19:50:29.7465232Z Written: gen_embedding_forward_dense_weighted_vbe_codegen_meta.cpp 2025-05-07T19:50:29.7465721Z Written: gen_embedding_forward_dense_weighted_codegen_meta.cpp 2025-05-07T19:50:29.7466307Z Written: gen_embedding_forward_dense_unweighted_vbe_codegen_meta.cpp 2025-05-07T19:50:29.7466790Z Written: gen_embedding_forward_dense_unweighted_codegen_meta.cpp 2025-05-07T19:50:29.7467277Z Written: gen_embedding_forward_ssd_weighted_vbe_codegen_meta.cpp 2025-05-07T19:50:29.7467735Z Written: gen_embedding_forward_split_weighted_vbe_codegen_meta.cpp 2025-05-07T19:50:29.7468193Z Written: gen_embedding_forward_ssd_weighted_codegen_meta.cpp 2025-05-07T19:50:29.7468619Z Written: gen_embedding_forward_split_weighted_codegen_meta.cpp 2025-05-07T19:50:29.7469079Z Written: gen_embedding_forward_ssd_unweighted_vbe_codegen_meta.cpp 2025-05-07T19:50:29.7469566Z Written: gen_embedding_forward_split_unweighted_vbe_codegen_meta.cpp 2025-05-07T19:50:29.7470023Z Written: gen_embedding_forward_ssd_unweighted_codegen_meta.cpp 2025-05-07T19:50:29.7470488Z Written: gen_embedding_forward_split_unweighted_codegen_meta.cpp 2025-05-07T19:50:29.7470925Z Written: gen_embedding_forward_dense_weighted_vbe_kernel.cu 2025-05-07T19:50:29.7471354Z Written: gen_embedding_forward_dense_weighted_kernel.cu 2025-05-07T19:50:29.7471776Z Written: gen_embedding_forward_dense_unweighted_nobag_kernel.cu 2025-05-07T19:50:29.7472227Z Written: gen_embedding_forward_dense_unweighted_vbe_kernel.cu 2025-05-07T19:50:29.7472679Z Written: gen_embedding_forward_dense_unweighted_kernel.cu 2025-05-07T19:50:29.7473090Z Written: gen_embedding_forward_ssd_weighted_vbe_kernel.cu 2025-05-07T19:50:29.7473510Z Written: gen_embedding_forward_split_weighted_vbe_kernel.cu 2025-05-07T19:50:29.7473905Z Written: gen_embedding_forward_ssd_weighted_kernel.cu 2025-05-07T19:50:29.7474309Z Written: gen_embedding_forward_split_weighted_kernel.cu 2025-05-07T19:50:29.7474711Z Written: gen_embedding_forward_ssd_unweighted_nobag_kernel.cu 2025-05-07T19:50:29.7475163Z Written: gen_embedding_forward_split_unweighted_nobag_kernel.cu 2025-05-07T19:50:29.7475608Z Written: gen_embedding_forward_ssd_unweighted_vbe_kernel.cu 2025-05-07T19:50:29.7476029Z Written: gen_embedding_forward_split_unweighted_vbe_kernel.cu 2025-05-07T19:50:29.7476459Z Written: gen_embedding_forward_ssd_unweighted_kernel.cu 2025-05-07T19:50:29.7476855Z Written: gen_embedding_forward_split_unweighted_kernel.cu 2025-05-07T19:50:29.7477291Z Written: gen_embedding_forward_split_weighted_vbe_gwd_kernel.cu 2025-05-07T19:50:29.7477716Z Written: gen_embedding_forward_split_weighted_gwd_kernel.cu 2025-05-07T19:50:29.7478164Z Written: gen_embedding_forward_split_unweighted_vbe_gwd_kernel.cu 2025-05-07T19:50:29.7478626Z Written: gen_embedding_forward_split_unweighted_gwd_kernel.cu 2025-05-07T19:50:29.7479044Z Written: gen_embedding_forward_split_weighted_v2_kernel.cu 2025-05-07T19:50:29.7479494Z Written: gen_embedding_forward_split_unweighted_v2_kernel.cu 2025-05-07T19:50:29.7479956Z Written: gen_embedding_forward_dense_unweighted_nobag_kernel_small.cu 2025-05-07T19:50:29.7480472Z Written: gen_embedding_forward_dense_unweighted_nobag_kernel_small.cu 2025-05-07T19:50:29.7480949Z Written: gen_embedding_forward_ssd_unweighted_nobag_kernel_small.cu 2025-05-07T19:50:29.7481447Z Written: gen_embedding_forward_split_unweighted_nobag_kernel_small.cu 2025-05-07T19:50:29.7481987Z Written: gen_embedding_forward_split_pt2_cuda_wrapper.cpp 2025-05-07T19:50:29.7482401Z Written: gen_embedding_forward_split_pt2_cpu_wrapper.cpp 2025-05-07T19:50:29.7482815Z Written: gen_embedding_forward_ssd_pt2_cuda_wrapper.cpp 2025-05-07T19:50:29.7577549Z 2025-05-07T19:50:29.7577803Z 2025-05-07T19:50:29.7578105Z ================================================================================ 2025-05-07T19:50:29.7578527Z Running code generation script ... 2025-05-07T19:50:29.7579313Z /github/home/miniconda/envs/build_binary/bin/python /__w/FBGEMM/FBGEMM/fbgemm_gpu/codegen/genscript/generate_index_select.py --opensource 2025-05-07T19:50:29.7580075Z ================================================================================ 2025-05-07T19:50:29.7580326Z 2025-05-07T19:50:30.0205927Z [ARGS PARSE] Parsed arguments: Namespace(install_dir='.', is_fbcode=False, is_rocm=False) 2025-05-07T19:50:30.0208829Z [INDEX SELECT GENERATOR]: ['/__w/FBGEMM/FBGEMM/fbgemm_gpu/codegen/genscript/generate_index_select.py', '--opensource'] 2025-05-07T19:50:30.0211069Z Written: gen_batch_index_select_dim0_forward_codegen_cuda.cu 2025-05-07T19:50:30.0212337Z Written: gen_batch_index_select_dim0_forward_kernel.cu 2025-05-07T19:50:30.0213557Z Written: gen_batch_index_select_dim0_forward_kernel_small.cu 2025-05-07T19:50:30.0214870Z Written: gen_batch_index_select_dim0_backward_codegen_cuda.cu 2025-05-07T19:50:30.0216155Z Written: gen_batch_index_select_dim0_backward_kernel_cta.cu 2025-05-07T19:50:30.0217450Z Written: gen_batch_index_select_dim0_backward_kernel_warp.cu 2025-05-07T19:50:30.0217915Z Written: gen_embedding_backward_split_batch_index_select_device_kernel.cuh 2025-05-07T19:50:30.0218400Z Written: gen_embedding_backward_split_grad_index_select.cu 2025-05-07T19:50:30.0218838Z Written: gen_embedding_backward_split_common_device_kernel.cuh 2025-05-07T19:50:30.0325898Z -- Adding merge_pooled_embeddings sources 2025-05-07T19:50:30.0338835Z 2025-05-07T19:50:30.0338952Z 2025-05-07T19:50:30.0339272Z ================================================================================ 2025-05-07T19:50:30.0339718Z GPU CPP Library Target: fbgemm_gpu_tbe_cache (SHARED) 2025-05-07T19:50:30.0340182Z 2025-05-07T19:50:30.0340393Z CPU_SRCS: 2025-05-07T19:50:30.0340823Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/src/split_embeddings_cache/lfu_cache_populate_byte.cpp 2025-05-07T19:50:30.0341499Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/src/split_embeddings_cache/linearize_cache_indices.cpp 2025-05-07T19:50:30.0342179Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/src/split_embeddings_cache/lru_cache_populate_byte.cpp 2025-05-07T19:50:30.0342780Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/src/split_embeddings_cache/lxu_cache.cpp 2025-05-07T19:50:30.0343415Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/src/split_embeddings_cache/split_embeddings_cache_ops.cpp 2025-05-07T19:50:30.0343898Z 2025-05-07T19:50:30.0344109Z GPU_SRCS: 2025-05-07T19:50:30.0344455Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/src/split_embeddings_cache/lfu_cache_find.cu 2025-05-07T19:50:30.0345080Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/src/split_embeddings_cache/lfu_cache_populate.cu 2025-05-07T19:50:30.0345731Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/src/split_embeddings_cache/lfu_cache_populate_byte.cu 2025-05-07T19:50:30.0346485Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/src/split_embeddings_cache/linearize_cache_indices.cu 2025-05-07T19:50:30.0347090Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/src/split_embeddings_cache/lru_cache_find.cu 2025-05-07T19:50:30.0347657Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/src/split_embeddings_cache/lru_cache_populate.cu 2025-05-07T19:50:30.0348293Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/src/split_embeddings_cache/lru_cache_populate_byte.cu 2025-05-07T19:50:30.0348857Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/src/split_embeddings_cache/lxu_cache.cu 2025-05-07T19:50:30.0349435Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/src/split_embeddings_cache/reset_weight_momentum.cu 2025-05-07T19:50:30.0350078Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/src/split_embeddings_cache/split_embeddings_cache_ops.cu 2025-05-07T19:50:30.0350535Z 2025-05-07T19:50:30.0350746Z CUDA_SPECIFIC_SRCS: 2025-05-07T19:50:30.0350886Z 2025-05-07T19:50:30.0351173Z 2025-05-07T19:50:30.0351389Z HIP_SPECIFIC_SRCS: 2025-05-07T19:50:30.0351527Z 2025-05-07T19:50:30.0351603Z 2025-05-07T19:50:30.0351799Z OTHER_SRCS: 2025-05-07T19:50:30.0351916Z 2025-05-07T19:50:30.0351992Z 2025-05-07T19:50:30.0352184Z CC_FLAGS: 2025-05-07T19:50:30.0352295Z 2025-05-07T19:50:30.0352387Z 2025-05-07T19:50:30.0352564Z NVCC_FLAGS: 2025-05-07T19:50:30.0352795Z --expt-relaxed-constexpr 2025-05-07T19:50:30.0353123Z -D__CUDA_NO_HALF_OPERATORS__ 2025-05-07T19:50:30.0353487Z -D__CUDA_NO_BFLOAT16_CONVERSIONS__ 2025-05-07T19:50:30.0353775Z -D__CUDA_NO_HALF2_OPERATORS__ 2025-05-07T19:50:30.0354038Z 2025-05-07T19:50:30.0354213Z HIPCC_FLAGS: 2025-05-07T19:50:30.0354351Z 2025-05-07T19:50:30.0354426Z 2025-05-07T19:50:30.0354668Z INCLUDE_DIRS: 2025-05-07T19:50:30.0354915Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/../include 2025-05-07T19:50:30.0438652Z /__w/FBGEMM/FBGEMM/fbgemm_gpu 2025-05-07T19:50:30.0439287Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include 2025-05-07T19:50:30.0439612Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/../include 2025-05-07T19:50:30.0440189Z /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include 2025-05-07T19:50:30.0440989Z /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include 2025-05-07T19:50:30.0441734Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src 2025-05-07T19:50:30.0442142Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include 2025-05-07T19:50:30.0442544Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include 2025-05-07T19:50:30.0443014Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include 2025-05-07T19:50:30.0443509Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include 2025-05-07T19:50:30.0443965Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include 2025-05-07T19:50:30.0444516Z /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include 2025-05-07T19:50:30.0444999Z 2025-05-07T19:50:30.0445201Z Selected Source Files: 2025-05-07T19:50:30.0445606Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/src/split_embeddings_cache/lfu_cache_populate_byte.cpp 2025-05-07T19:50:30.0446248Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/src/split_embeddings_cache/linearize_cache_indices.cpp 2025-05-07T19:50:30.0446868Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/src/split_embeddings_cache/lru_cache_populate_byte.cpp 2025-05-07T19:50:30.0447453Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/src/split_embeddings_cache/lxu_cache.cpp 2025-05-07T19:50:30.0448035Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/src/split_embeddings_cache/split_embeddings_cache_ops.cpp 2025-05-07T19:50:30.0448651Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/src/split_embeddings_cache/lfu_cache_find.cu 2025-05-07T19:50:30.0449230Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/src/split_embeddings_cache/lfu_cache_populate.cu 2025-05-07T19:50:30.0449825Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/src/split_embeddings_cache/lfu_cache_populate_byte.cu 2025-05-07T19:50:30.0450765Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/src/split_embeddings_cache/linearize_cache_indices.cu 2025-05-07T19:50:30.0451414Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/src/split_embeddings_cache/lru_cache_find.cu 2025-05-07T19:50:30.0452007Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/src/split_embeddings_cache/lru_cache_populate.cu 2025-05-07T19:50:30.0452632Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/src/split_embeddings_cache/lru_cache_populate_byte.cu 2025-05-07T19:50:30.0453207Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/src/split_embeddings_cache/lxu_cache.cu 2025-05-07T19:50:30.0453796Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/src/split_embeddings_cache/reset_weight_momentum.cu 2025-05-07T19:50:30.0454430Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/src/split_embeddings_cache/split_embeddings_cache_ops.cu 2025-05-07T19:50:30.0454895Z 2025-05-07T19:50:30.0455081Z HIPified Source Files: 2025-05-07T19:50:30.0455247Z 2025-05-07T19:50:30.0455322Z 2025-05-07T19:50:30.0455513Z Library Dependencies: 2025-05-07T19:50:30.0455782Z torch 2025-05-07T19:50:30.0455979Z torch_library 2025-05-07T19:50:30.0456414Z /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/lib/libc10.so 2025-05-07T19:50:30.0457216Z /github/home/miniconda/envs/build_binary/targets/x86_64-linux/lib/libnvrtc.so 2025-05-07T19:50:30.0457912Z /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/lib/libc10_cuda.so 2025-05-07T19:50:30.0458707Z /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/lib/libnccl.so.2 2025-05-07T19:50:30.0459426Z /github/home/miniconda/envs/build_binary/targets/x86_64-linux/lib/stubs/libcuda.so 2025-05-07T19:50:30.0460032Z /github/home/miniconda/envs/build_binary/lib/stubs/libnvidia-ml.so 2025-05-07T19:50:30.0460437Z 2025-05-07T19:50:30.0460622Z Output Library: 2025-05-07T19:50:30.0460843Z fbgemm_gpu_tbe_cache 2025-05-07T19:50:30.0461056Z 2025-05-07T19:50:30.0461256Z Destination Directory: 2025-05-07T19:50:30.0461480Z fbgemm_gpu 2025-05-07T19:50:30.0461716Z ================================================================================ 2025-05-07T19:50:30.0462010Z 2025-05-07T19:50:30.0907942Z 2025-05-07T19:50:30.0908192Z 2025-05-07T19:50:30.0908504Z ================================================================================ 2025-05-07T19:50:30.0908966Z GPU CPP Library Target: fbgemm_gpu_tbe_inference (SHARED) 2025-05-07T19:50:30.0909418Z 2025-05-07T19:50:30.0909605Z CPU_SRCS: 2025-05-07T19:50:30.0909897Z codegen/inference/embedding_forward_quantized_host_cpu.cpp 2025-05-07T19:50:30.0910357Z gen_embedding_forward_quantized_unweighted_codegen_cpu.cpp 2025-05-07T19:50:30.0910777Z gen_embedding_forward_quantized_weighted_codegen_cpu.cpp 2025-05-07T19:50:30.0911125Z 2025-05-07T19:50:30.0911301Z GPU_SRCS: 2025-05-07T19:50:30.0911574Z codegen/inference/embedding_forward_quantized_host.cpp 2025-05-07T19:50:30.0912023Z codegen/inference/embedding_forward_quantized_split_lookup.cu 2025-05-07T19:50:30.0912578Z gen_embedding_forward_quantized_split_nbit_kernel_weighted_fp32_codegen_cuda.cu 2025-05-07T19:50:30.0913191Z gen_embedding_forward_quantized_split_nbit_kernel_weighted_fp16_codegen_cuda.cu 2025-05-07T19:50:30.0913908Z gen_embedding_forward_quantized_split_nbit_kernel_weighted_fp8_codegen_cuda.cu 2025-05-07T19:50:30.0914593Z gen_embedding_forward_quantized_split_nbit_kernel_weighted_int8_codegen_cuda.cu 2025-05-07T19:50:30.0915143Z gen_embedding_forward_quantized_split_nbit_kernel_weighted_int4_codegen_cuda.cu 2025-05-07T19:50:30.0915709Z gen_embedding_forward_quantized_split_nbit_kernel_weighted_int2_codegen_cuda.cu 2025-05-07T19:50:30.0916280Z gen_embedding_forward_quantized_split_nbit_kernel_unweighted_nobag_fp32_codegen_cuda.cu 2025-05-07T19:50:30.0916896Z gen_embedding_forward_quantized_split_nbit_kernel_unweighted_nobag_fp16_codegen_cuda.cu 2025-05-07T19:50:30.0917508Z gen_embedding_forward_quantized_split_nbit_kernel_unweighted_nobag_fp8_codegen_cuda.cu 2025-05-07T19:50:30.0918105Z gen_embedding_forward_quantized_split_nbit_kernel_unweighted_nobag_int8_codegen_cuda.cu 2025-05-07T19:50:30.0918717Z gen_embedding_forward_quantized_split_nbit_kernel_unweighted_nobag_int4_codegen_cuda.cu 2025-05-07T19:50:30.0919326Z gen_embedding_forward_quantized_split_nbit_kernel_unweighted_nobag_int2_codegen_cuda.cu 2025-05-07T19:50:30.0919923Z gen_embedding_forward_quantized_split_nbit_kernel_unweighted_fp32_codegen_cuda.cu 2025-05-07T19:50:30.0920502Z gen_embedding_forward_quantized_split_nbit_kernel_unweighted_fp16_codegen_cuda.cu 2025-05-07T19:50:30.0921064Z gen_embedding_forward_quantized_split_nbit_kernel_unweighted_fp8_codegen_cuda.cu 2025-05-07T19:50:30.0921632Z gen_embedding_forward_quantized_split_nbit_kernel_unweighted_int8_codegen_cuda.cu 2025-05-07T19:50:30.0922188Z gen_embedding_forward_quantized_split_nbit_kernel_unweighted_int4_codegen_cuda.cu 2025-05-07T19:50:30.0922767Z gen_embedding_forward_quantized_split_nbit_kernel_unweighted_int2_codegen_cuda.cu 2025-05-07T19:50:30.0923297Z gen_embedding_forward_quantized_split_nbit_host_weighted_codegen_cuda.cu 2025-05-07T19:50:30.0923828Z gen_embedding_forward_quantized_split_nbit_host_unweighted_nobag_codegen_cuda.cu 2025-05-07T19:50:30.0924382Z gen_embedding_forward_quantized_split_nbit_host_unweighted_codegen_cuda.cu 2025-05-07T19:50:30.0924970Z 2025-05-07T19:50:30.0925164Z CUDA_SPECIFIC_SRCS: 2025-05-07T19:50:30.0925293Z 2025-05-07T19:50:30.0925357Z 2025-05-07T19:50:30.0925532Z HIP_SPECIFIC_SRCS: 2025-05-07T19:50:30.0925656Z 2025-05-07T19:50:30.0925727Z 2025-05-07T19:50:30.0925905Z OTHER_SRCS: 2025-05-07T19:50:30.0926013Z 2025-05-07T19:50:30.0926074Z 2025-05-07T19:50:30.0926222Z CC_FLAGS: 2025-05-07T19:50:30.0926317Z 2025-05-07T19:50:30.0926379Z 2025-05-07T19:50:30.0926527Z NVCC_FLAGS: 2025-05-07T19:50:30.0926707Z --expt-relaxed-constexpr 2025-05-07T19:50:30.0926942Z -D__CUDA_NO_HALF_OPERATORS__ 2025-05-07T19:50:30.0927181Z -D__CUDA_NO_BFLOAT16_CONVERSIONS__ 2025-05-07T19:50:30.0927437Z -D__CUDA_NO_HALF2_OPERATORS__ 2025-05-07T19:50:30.0927660Z 2025-05-07T19:50:30.0927830Z HIPCC_FLAGS: 2025-05-07T19:50:30.0927936Z 2025-05-07T19:50:30.0928002Z 2025-05-07T19:50:30.0928254Z INCLUDE_DIRS: 2025-05-07T19:50:30.0928979Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/../include 2025-05-07T19:50:30.0929482Z /__w/FBGEMM/FBGEMM/fbgemm_gpu 2025-05-07T19:50:30.0929820Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include 2025-05-07T19:50:30.0930103Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/../include 2025-05-07T19:50:30.0930723Z /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include 2025-05-07T19:50:30.0931600Z /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include 2025-05-07T19:50:30.0932264Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src 2025-05-07T19:50:30.0932661Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include 2025-05-07T19:50:30.0933083Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include 2025-05-07T19:50:30.0933559Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include 2025-05-07T19:50:30.0934063Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include 2025-05-07T19:50:30.0934509Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include 2025-05-07T19:50:30.0935065Z /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include 2025-05-07T19:50:30.0935566Z 2025-05-07T19:50:30.0935760Z Selected Source Files: 2025-05-07T19:50:30.0936081Z codegen/inference/embedding_forward_quantized_host_cpu.cpp 2025-05-07T19:50:30.0936647Z gen_embedding_forward_quantized_unweighted_codegen_cpu.cpp 2025-05-07T19:50:30.0937059Z gen_embedding_forward_quantized_weighted_codegen_cpu.cpp 2025-05-07T19:50:30.0937463Z codegen/inference/embedding_forward_quantized_host.cpp 2025-05-07T19:50:30.0937892Z codegen/inference/embedding_forward_quantized_split_lookup.cu 2025-05-07T19:50:30.0938433Z gen_embedding_forward_quantized_split_nbit_kernel_weighted_fp32_codegen_cuda.cu 2025-05-07T19:50:30.0939009Z gen_embedding_forward_quantized_split_nbit_kernel_weighted_fp16_codegen_cuda.cu 2025-05-07T19:50:30.0939582Z gen_embedding_forward_quantized_split_nbit_kernel_weighted_fp8_codegen_cuda.cu 2025-05-07T19:50:30.0940164Z gen_embedding_forward_quantized_split_nbit_kernel_weighted_int8_codegen_cuda.cu 2025-05-07T19:50:30.0940739Z gen_embedding_forward_quantized_split_nbit_kernel_weighted_int4_codegen_cuda.cu 2025-05-07T19:50:30.0941323Z gen_embedding_forward_quantized_split_nbit_kernel_weighted_int2_codegen_cuda.cu 2025-05-07T19:50:30.0941921Z gen_embedding_forward_quantized_split_nbit_kernel_unweighted_nobag_fp32_codegen_cuda.cu 2025-05-07T19:50:30.0942567Z gen_embedding_forward_quantized_split_nbit_kernel_unweighted_nobag_fp16_codegen_cuda.cu 2025-05-07T19:50:30.0943200Z gen_embedding_forward_quantized_split_nbit_kernel_unweighted_nobag_fp8_codegen_cuda.cu 2025-05-07T19:50:30.0943829Z gen_embedding_forward_quantized_split_nbit_kernel_unweighted_nobag_int8_codegen_cuda.cu 2025-05-07T19:50:30.0944490Z gen_embedding_forward_quantized_split_nbit_kernel_unweighted_nobag_int4_codegen_cuda.cu 2025-05-07T19:50:30.0945116Z gen_embedding_forward_quantized_split_nbit_kernel_unweighted_nobag_int2_codegen_cuda.cu 2025-05-07T19:50:30.0945744Z gen_embedding_forward_quantized_split_nbit_kernel_unweighted_fp32_codegen_cuda.cu 2025-05-07T19:50:30.0946468Z gen_embedding_forward_quantized_split_nbit_kernel_unweighted_fp16_codegen_cuda.cu 2025-05-07T19:50:30.0947067Z gen_embedding_forward_quantized_split_nbit_kernel_unweighted_fp8_codegen_cuda.cu 2025-05-07T19:50:30.0947663Z gen_embedding_forward_quantized_split_nbit_kernel_unweighted_int8_codegen_cuda.cu 2025-05-07T19:50:30.0948252Z gen_embedding_forward_quantized_split_nbit_kernel_unweighted_int4_codegen_cuda.cu 2025-05-07T19:50:30.0948851Z gen_embedding_forward_quantized_split_nbit_kernel_unweighted_int2_codegen_cuda.cu 2025-05-07T19:50:30.0949421Z gen_embedding_forward_quantized_split_nbit_host_weighted_codegen_cuda.cu 2025-05-07T19:50:30.0949971Z gen_embedding_forward_quantized_split_nbit_host_unweighted_nobag_codegen_cuda.cu 2025-05-07T19:50:30.0950543Z gen_embedding_forward_quantized_split_nbit_host_unweighted_codegen_cuda.cu 2025-05-07T19:50:30.0950942Z 2025-05-07T19:50:30.0951217Z HIPified Source Files: 2025-05-07T19:50:30.0951362Z 2025-05-07T19:50:30.0951433Z 2025-05-07T19:50:30.0951629Z Library Dependencies: 2025-05-07T19:50:30.0951843Z torch 2025-05-07T19:50:30.0952024Z torch_library 2025-05-07T19:50:30.0952451Z /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/lib/libc10.so 2025-05-07T19:50:30.0953099Z /github/home/miniconda/envs/build_binary/targets/x86_64-linux/lib/libnvrtc.so 2025-05-07T19:50:30.0953776Z /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/lib/libc10_cuda.so 2025-05-07T19:50:30.0954535Z /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/lib/libnccl.so.2 2025-05-07T19:50:30.0955246Z /github/home/miniconda/envs/build_binary/targets/x86_64-linux/lib/stubs/libcuda.so 2025-05-07T19:50:30.0955677Z asmjit 2025-05-07T19:50:30.0955851Z fbgemm 2025-05-07T19:50:30.0956024Z fbgemm_gpu_tbe_cache 2025-05-07T19:50:30.0956243Z fbgemm_gpu_config 2025-05-07T19:50:30.0956579Z /github/home/miniconda/envs/build_binary/lib/stubs/libnvidia-ml.so 2025-05-07T19:50:30.0956957Z 2025-05-07T19:50:30.0957135Z Output Library: 2025-05-07T19:50:30.0957346Z fbgemm_gpu_tbe_inference 2025-05-07T19:50:30.0957573Z 2025-05-07T19:50:30.0957750Z Destination Directory: 2025-05-07T19:50:30.0957969Z fbgemm_gpu 2025-05-07T19:50:30.0958178Z ================================================================================ 2025-05-07T19:50:30.0958402Z 2025-05-07T19:50:30.3619498Z 2025-05-07T19:50:30.3619620Z 2025-05-07T19:50:30.3620315Z ================================================================================ 2025-05-07T19:50:30.3621465Z GPU CPP Library Target: fbgemm_gpu_config (SHARED) 2025-05-07T19:50:30.3622427Z 2025-05-07T19:50:30.3622922Z CPU_SRCS: 2025-05-07T19:50:30.3623513Z src/config/feature_gates.cpp 2025-05-07T19:50:30.3624189Z 2025-05-07T19:50:30.3624667Z GPU_SRCS: 2025-05-07T19:50:30.3624973Z 2025-05-07T19:50:30.3625164Z 2025-05-07T19:50:30.3625674Z CUDA_SPECIFIC_SRCS: 2025-05-07T19:50:30.3626089Z 2025-05-07T19:50:30.3626277Z 2025-05-07T19:50:30.3626771Z HIP_SPECIFIC_SRCS: 2025-05-07T19:50:30.3627140Z 2025-05-07T19:50:30.3627343Z 2025-05-07T19:50:30.3627727Z OTHER_SRCS: 2025-05-07T19:50:30.3627839Z 2025-05-07T19:50:30.3627909Z 2025-05-07T19:50:30.3628084Z CC_FLAGS: 2025-05-07T19:50:30.3628189Z 2025-05-07T19:50:30.3628273Z 2025-05-07T19:50:30.3628741Z NVCC_FLAGS: 2025-05-07T19:50:30.3628862Z 2025-05-07T19:50:30.3628948Z 2025-05-07T19:50:30.3629188Z HIPCC_FLAGS: 2025-05-07T19:50:30.3629311Z 2025-05-07T19:50:30.3629485Z 2025-05-07T19:50:30.3629655Z INCLUDE_DIRS: 2025-05-07T19:50:30.3629890Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/../include 2025-05-07T19:50:30.3630191Z /__w/FBGEMM/FBGEMM/fbgemm_gpu 2025-05-07T19:50:30.3630473Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include 2025-05-07T19:50:30.3630766Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/../include 2025-05-07T19:50:30.3631258Z /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include 2025-05-07T19:50:30.3632045Z /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include 2025-05-07T19:50:30.3634173Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src 2025-05-07T19:50:30.3634610Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include 2025-05-07T19:50:30.3635031Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include 2025-05-07T19:50:30.3635504Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include 2025-05-07T19:50:30.3636011Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include 2025-05-07T19:50:30.3636472Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include 2025-05-07T19:50:30.3637028Z /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include 2025-05-07T19:50:30.3637797Z 2025-05-07T19:50:30.3638027Z Selected Source Files: 2025-05-07T19:50:30.3638268Z src/config/feature_gates.cpp 2025-05-07T19:50:30.3638529Z 2025-05-07T19:50:30.3638822Z HIPified Source Files: 2025-05-07T19:50:30.3638975Z 2025-05-07T19:50:30.3639166Z 2025-05-07T19:50:30.3639353Z Library Dependencies: 2025-05-07T19:50:30.3639580Z torch 2025-05-07T19:50:30.3639756Z torch_library 2025-05-07T19:50:30.3640193Z /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/lib/libc10.so 2025-05-07T19:50:30.3640871Z /github/home/miniconda/envs/build_binary/targets/x86_64-linux/lib/libnvrtc.so 2025-05-07T19:50:30.3641555Z /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/lib/libc10_cuda.so 2025-05-07T19:50:30.3642349Z /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/lib/libnccl.so.2 2025-05-07T19:50:30.3643072Z /github/home/miniconda/envs/build_binary/targets/x86_64-linux/lib/stubs/libcuda.so 2025-05-07T19:50:30.3643669Z /github/home/miniconda/envs/build_binary/lib/stubs/libnvidia-ml.so 2025-05-07T19:50:30.3644063Z 2025-05-07T19:50:30.3644240Z Output Library: 2025-05-07T19:50:30.3644456Z fbgemm_gpu_config 2025-05-07T19:50:30.3644653Z 2025-05-07T19:50:30.3644846Z Destination Directory: 2025-05-07T19:50:30.3645070Z fbgemm_gpu 2025-05-07T19:50:30.3645309Z ================================================================================ 2025-05-07T19:50:30.3645530Z 2025-05-07T19:50:30.3645613Z 2025-05-07T19:50:30.3645617Z 2025-05-07T19:50:30.3645725Z ================================================================================ 2025-05-07T19:50:30.3646099Z GPU CPP Library Target: fbgemm_gpu_tbe_utils (SHARED) 2025-05-07T19:50:30.3646417Z 2025-05-07T19:50:30.3646597Z CPU_SRCS: 2025-05-07T19:50:30.3646874Z src/split_embeddings_utils/split_embeddings_utils_cpu.cpp 2025-05-07T19:50:30.3647385Z src/split_embeddings_utils/split_embeddings_utils_meta.cpp 2025-05-07T19:50:30.3647737Z 2025-05-07T19:50:30.3647919Z GPU_SRCS: 2025-05-07T19:50:30.3648218Z src/split_embeddings_utils/split_embeddings_utils.cpp 2025-05-07T19:50:30.3648627Z src/split_embeddings_utils/generate_vbe_metadata.cu 2025-05-07T19:50:30.3649043Z src/split_embeddings_utils/get_infos_metadata.cu 2025-05-07T19:50:30.3649418Z src/split_embeddings_utils/radix_sort_pairs.cu 2025-05-07T19:50:30.3649809Z src/split_embeddings_utils/transpose_embedding_input.cu 2025-05-07T19:50:30.3650137Z 2025-05-07T19:50:30.3650443Z CUDA_SPECIFIC_SRCS: 2025-05-07T19:50:30.3650576Z 2025-05-07T19:50:30.3650649Z 2025-05-07T19:50:30.3650851Z HIP_SPECIFIC_SRCS: 2025-05-07T19:50:30.3650984Z 2025-05-07T19:50:30.3651056Z 2025-05-07T19:50:30.3651246Z OTHER_SRCS: 2025-05-07T19:50:30.3651374Z 2025-05-07T19:50:30.3651446Z 2025-05-07T19:50:30.3651625Z CC_FLAGS: 2025-05-07T19:50:30.3651732Z 2025-05-07T19:50:30.3651813Z 2025-05-07T19:50:30.3651978Z NVCC_FLAGS: 2025-05-07T19:50:30.3652087Z 2025-05-07T19:50:30.3652176Z 2025-05-07T19:50:30.3652344Z HIPCC_FLAGS: 2025-05-07T19:50:30.3652463Z 2025-05-07T19:50:30.3652544Z 2025-05-07T19:50:30.3652715Z INCLUDE_DIRS: 2025-05-07T19:50:30.3652938Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/../include 2025-05-07T19:50:30.3653232Z /__w/FBGEMM/FBGEMM/fbgemm_gpu 2025-05-07T19:50:30.3653510Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include 2025-05-07T19:50:30.3653807Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/../include 2025-05-07T19:50:30.3654394Z /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include 2025-05-07T19:50:30.3655175Z /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include 2025-05-07T19:50:30.3655801Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src 2025-05-07T19:50:30.3656204Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include 2025-05-07T19:50:30.3656740Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include 2025-05-07T19:50:30.3657373Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include 2025-05-07T19:50:30.3657881Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include 2025-05-07T19:50:30.3658408Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include 2025-05-07T19:50:30.3658961Z /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include 2025-05-07T19:50:30.3659449Z 2025-05-07T19:50:30.3659712Z Selected Source Files: 2025-05-07T19:50:30.3660024Z src/split_embeddings_utils/split_embeddings_utils_cpu.cpp 2025-05-07T19:50:30.3660474Z src/split_embeddings_utils/split_embeddings_utils_meta.cpp 2025-05-07T19:50:30.3660899Z src/split_embeddings_utils/split_embeddings_utils.cpp 2025-05-07T19:50:30.3661309Z src/split_embeddings_utils/generate_vbe_metadata.cu 2025-05-07T19:50:30.3661679Z src/split_embeddings_utils/get_infos_metadata.cu 2025-05-07T19:50:30.3662059Z src/split_embeddings_utils/radix_sort_pairs.cu 2025-05-07T19:50:30.3662449Z src/split_embeddings_utils/transpose_embedding_input.cu 2025-05-07T19:50:30.3662776Z 2025-05-07T19:50:30.3662969Z HIPified Source Files: 2025-05-07T19:50:30.3663118Z 2025-05-07T19:50:30.3663188Z 2025-05-07T19:50:30.3663383Z Library Dependencies: 2025-05-07T19:50:30.3663593Z torch 2025-05-07T19:50:30.3663782Z torch_library 2025-05-07T19:50:30.3664198Z /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/lib/libc10.so 2025-05-07T19:50:30.3664881Z /github/home/miniconda/envs/build_binary/targets/x86_64-linux/lib/libnvrtc.so 2025-05-07T19:50:30.3665582Z /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/lib/libc10_cuda.so 2025-05-07T19:50:30.3666362Z /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/lib/libnccl.so.2 2025-05-07T19:50:30.3667094Z /github/home/miniconda/envs/build_binary/targets/x86_64-linux/lib/stubs/libcuda.so 2025-05-07T19:50:30.3667678Z /github/home/miniconda/envs/build_binary/lib/stubs/libnvidia-ml.so 2025-05-07T19:50:30.3668066Z 2025-05-07T19:50:30.3668239Z Output Library: 2025-05-07T19:50:30.3668456Z fbgemm_gpu_tbe_utils 2025-05-07T19:50:30.3668673Z 2025-05-07T19:50:30.3668855Z Destination Directory: 2025-05-07T19:50:30.3669086Z fbgemm_gpu 2025-05-07T19:50:30.3669307Z ================================================================================ 2025-05-07T19:50:30.3669525Z 2025-05-07T19:50:30.3669540Z 2025-05-07T19:50:30.3669544Z 2025-05-07T19:50:30.3669661Z ================================================================================ 2025-05-07T19:50:30.3670058Z GPU CPP Library Target: fbgemm_gpu_sparse_async_cumsum (SHARED) 2025-05-07T19:50:30.3670430Z 2025-05-07T19:50:30.3670613Z CPU_SRCS: 2025-05-07T19:50:30.3670819Z src/sparse_ops/sparse_async_cumsum.cpp 2025-05-07T19:50:30.3671096Z 2025-05-07T19:50:30.3671267Z GPU_SRCS: 2025-05-07T19:50:30.3671488Z src/sparse_ops/sparse_async_cumsum.cu 2025-05-07T19:50:30.3671753Z 2025-05-07T19:50:30.3671940Z CUDA_SPECIFIC_SRCS: 2025-05-07T19:50:30.3672077Z 2025-05-07T19:50:30.3672148Z 2025-05-07T19:50:30.3672339Z HIP_SPECIFIC_SRCS: 2025-05-07T19:50:30.3672474Z 2025-05-07T19:50:30.3672542Z 2025-05-07T19:50:30.3672722Z OTHER_SRCS: 2025-05-07T19:50:30.3672833Z 2025-05-07T19:50:30.3672913Z 2025-05-07T19:50:30.3673080Z CC_FLAGS: 2025-05-07T19:50:30.3673185Z 2025-05-07T19:50:30.3673273Z 2025-05-07T19:50:30.3673446Z NVCC_FLAGS: 2025-05-07T19:50:30.3673555Z 2025-05-07T19:50:30.3673640Z 2025-05-07T19:50:30.3673813Z HIPCC_FLAGS: 2025-05-07T19:50:30.3673945Z 2025-05-07T19:50:30.3674016Z 2025-05-07T19:50:30.3674176Z INCLUDE_DIRS: 2025-05-07T19:50:30.3674487Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/../include 2025-05-07T19:50:30.3674786Z /__w/FBGEMM/FBGEMM/fbgemm_gpu 2025-05-07T19:50:30.3675081Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include 2025-05-07T19:50:30.3675390Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/../include 2025-05-07T19:50:30.3675870Z /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include 2025-05-07T19:50:30.3676669Z /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include 2025-05-07T19:50:30.3677296Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src 2025-05-07T19:50:30.3677764Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include 2025-05-07T19:50:30.3678186Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include 2025-05-07T19:50:30.3678649Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include 2025-05-07T19:50:30.3679227Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include 2025-05-07T19:50:30.3679672Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include 2025-05-07T19:50:30.3680223Z /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include 2025-05-07T19:50:30.3680720Z 2025-05-07T19:50:30.3680918Z Selected Source Files: 2025-05-07T19:50:30.3681171Z src/sparse_ops/sparse_async_cumsum.cpp 2025-05-07T19:50:30.3681489Z src/sparse_ops/sparse_async_cumsum.cu 2025-05-07T19:50:30.3681742Z 2025-05-07T19:50:30.3681942Z HIPified Source Files: 2025-05-07T19:50:30.3682089Z 2025-05-07T19:50:30.3682186Z 2025-05-07T19:50:30.3682367Z Library Dependencies: 2025-05-07T19:50:30.3682589Z torch 2025-05-07T19:50:30.3682758Z torch_library 2025-05-07T19:50:30.3683188Z /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/lib/libc10.so 2025-05-07T19:50:30.3683844Z /github/home/miniconda/envs/build_binary/targets/x86_64-linux/lib/libnvrtc.so 2025-05-07T19:50:30.3684521Z /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/lib/libc10_cuda.so 2025-05-07T19:50:30.3685304Z /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/lib/libnccl.so.2 2025-05-07T19:50:30.3686033Z /github/home/miniconda/envs/build_binary/targets/x86_64-linux/lib/stubs/libcuda.so 2025-05-07T19:50:30.3686497Z fbgemm_gpu_tbe_utils 2025-05-07T19:50:30.3686841Z /github/home/miniconda/envs/build_binary/lib/stubs/libnvidia-ml.so 2025-05-07T19:50:30.3687237Z 2025-05-07T19:50:30.3687410Z Output Library: 2025-05-07T19:50:30.3687644Z fbgemm_gpu_sparse_async_cumsum 2025-05-07T19:50:30.3687880Z 2025-05-07T19:50:30.3688075Z Destination Directory: 2025-05-07T19:50:30.3688287Z fbgemm_gpu 2025-05-07T19:50:30.3688512Z ================================================================================ 2025-05-07T19:50:30.3688728Z 2025-05-07T19:50:30.3688821Z 2025-05-07T19:50:30.3688825Z 2025-05-07T19:50:30.3688940Z ================================================================================ 2025-05-07T19:50:30.3689303Z GPU CPP Library Target: fbgemm_gpu_tbe_common (SHARED) 2025-05-07T19:50:30.3689628Z 2025-05-07T19:50:30.3689799Z CPU_SRCS: 2025-05-07T19:50:30.3690046Z codegen/utils/embedding_bounds_check_host_cpu.cpp 2025-05-07T19:50:30.3690609Z codegen/training/forward/embedding_forward_split_cpu.cpp 2025-05-07T19:50:30.3691004Z codegen/training/pt2/pt2_autograd_utils.cpp 2025-05-07T19:50:30.3691315Z 2025-05-07T19:50:30.3691500Z GPU_SRCS: 2025-05-07T19:50:30.3691735Z codegen/utils/embedding_bounds_check_v1.cu 2025-05-07T19:50:30.3692062Z codegen/utils/embedding_bounds_check_v2.cu 2025-05-07T19:50:30.3692399Z codegen/utils/embedding_bounds_check_host.cpp 2025-05-07T19:50:30.3692689Z 2025-05-07T19:50:30.3692872Z CUDA_SPECIFIC_SRCS: 2025-05-07T19:50:30.3693001Z 2025-05-07T19:50:30.3693073Z 2025-05-07T19:50:30.3693258Z HIP_SPECIFIC_SRCS: 2025-05-07T19:50:30.3693390Z 2025-05-07T19:50:30.3693459Z 2025-05-07T19:50:30.3693635Z OTHER_SRCS: 2025-05-07T19:50:30.3693747Z 2025-05-07T19:50:30.3693833Z 2025-05-07T19:50:30.3693996Z CC_FLAGS: 2025-05-07T19:50:30.3694102Z 2025-05-07T19:50:30.3694184Z 2025-05-07T19:50:30.3694437Z NVCC_FLAGS: 2025-05-07T19:50:30.3694662Z --expt-relaxed-constexpr 2025-05-07T19:50:30.3694919Z -D__CUDA_NO_HALF_OPERATORS__ 2025-05-07T19:50:30.3695189Z -D__CUDA_NO_BFLOAT16_CONVERSIONS__ 2025-05-07T19:50:30.3695468Z -D__CUDA_NO_HALF2_OPERATORS__ 2025-05-07T19:50:30.3695722Z 2025-05-07T19:50:30.3695891Z HIPCC_FLAGS: 2025-05-07T19:50:30.3696020Z 2025-05-07T19:50:30.3696091Z 2025-05-07T19:50:30.3696257Z INCLUDE_DIRS: 2025-05-07T19:50:30.3696485Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/../include 2025-05-07T19:50:30.3696902Z /__w/FBGEMM/FBGEMM/fbgemm_gpu 2025-05-07T19:50:30.3697164Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include 2025-05-07T19:50:30.3697458Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/../include 2025-05-07T19:50:30.3697917Z /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include 2025-05-07T19:50:30.3698878Z /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include 2025-05-07T19:50:30.3699603Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src 2025-05-07T19:50:30.3700011Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include 2025-05-07T19:50:30.3700433Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include 2025-05-07T19:50:30.3700891Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include 2025-05-07T19:50:30.3701403Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include 2025-05-07T19:50:30.3701837Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include 2025-05-07T19:50:30.3702417Z /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include 2025-05-07T19:50:30.3702905Z 2025-05-07T19:50:30.3703119Z Selected Source Files: 2025-05-07T19:50:30.3703392Z codegen/utils/embedding_bounds_check_host_cpu.cpp 2025-05-07T19:50:30.3703789Z codegen/training/forward/embedding_forward_split_cpu.cpp 2025-05-07T19:50:30.3704177Z codegen/training/pt2/pt2_autograd_utils.cpp 2025-05-07T19:50:30.3704525Z codegen/utils/embedding_bounds_check_host.cpp 2025-05-07T19:50:30.3704868Z codegen/utils/embedding_bounds_check_v1.cu 2025-05-07T19:50:30.3705194Z codegen/utils/embedding_bounds_check_v2.cu 2025-05-07T19:50:30.3705471Z 2025-05-07T19:50:30.3705691Z HIPified Source Files: 2025-05-07T19:50:30.3705853Z 2025-05-07T19:50:30.3705939Z 2025-05-07T19:50:30.3706143Z Library Dependencies: 2025-05-07T19:50:30.3706358Z torch 2025-05-07T19:50:30.3706567Z torch_library 2025-05-07T19:50:30.3706990Z /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/lib/libc10.so 2025-05-07T19:50:30.3707684Z /github/home/miniconda/envs/build_binary/targets/x86_64-linux/lib/libnvrtc.so 2025-05-07T19:50:30.3708366Z /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/lib/libc10_cuda.so 2025-05-07T19:50:30.3709271Z /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/lib/libnccl.so.2 2025-05-07T19:50:30.3710075Z /github/home/miniconda/envs/build_binary/targets/x86_64-linux/lib/stubs/libcuda.so 2025-05-07T19:50:30.3710561Z fbgemm 2025-05-07T19:50:30.3710794Z fbgemm_gpu_config 2025-05-07T19:50:30.3711128Z /github/home/miniconda/envs/build_binary/lib/stubs/libnvidia-ml.so 2025-05-07T19:50:30.3711534Z 2025-05-07T19:50:30.3711723Z Output Library: 2025-05-07T19:50:30.3711951Z fbgemm_gpu_tbe_common 2025-05-07T19:50:30.3712176Z 2025-05-07T19:50:30.3712361Z Destination Directory: 2025-05-07T19:50:30.3712600Z fbgemm_gpu 2025-05-07T19:50:30.3712917Z ================================================================================ 2025-05-07T19:50:30.3713131Z 2025-05-07T19:50:30.3713134Z 2025-05-07T19:50:30.3713138Z 2025-05-07T19:50:30.3713237Z ================================================================================ 2025-05-07T19:50:30.3713583Z GPU CPP Library Target: fbgemm_gpu_tbe_optimizers (SHARED) 2025-05-07T19:50:30.3713904Z 2025-05-07T19:50:30.3714069Z CPU_SRCS: 2025-05-07T19:50:30.3714166Z 2025-05-07T19:50:30.3714236Z 2025-05-07T19:50:30.3714401Z GPU_SRCS: 2025-05-07T19:50:30.3714617Z gen_embedding_optimizer_rowwise_adagrad_split.cpp 2025-05-07T19:50:30.3715093Z gen_embedding_optimizer_rowwise_adagrad_split_cuda.cu 2025-05-07T19:50:30.3715462Z gen_embedding_optimizer_rowwise_adagrad_split_kernel.cu 2025-05-07T19:50:30.3715767Z 2025-05-07T19:50:30.3715928Z CUDA_SPECIFIC_SRCS: 2025-05-07T19:50:30.3716057Z 2025-05-07T19:50:30.3716119Z 2025-05-07T19:50:30.3716275Z HIP_SPECIFIC_SRCS: 2025-05-07T19:50:30.3716398Z 2025-05-07T19:50:30.3716459Z 2025-05-07T19:50:30.3716615Z OTHER_SRCS: 2025-05-07T19:50:30.3716713Z 2025-05-07T19:50:30.3716775Z 2025-05-07T19:50:30.3716929Z CC_FLAGS: 2025-05-07T19:50:30.3717029Z 2025-05-07T19:50:30.3717090Z 2025-05-07T19:50:30.3717249Z NVCC_FLAGS: 2025-05-07T19:50:30.3717429Z --expt-relaxed-constexpr 2025-05-07T19:50:30.3717669Z -D__CUDA_NO_HALF_OPERATORS__ 2025-05-07T19:50:30.3717910Z -D__CUDA_NO_BFLOAT16_CONVERSIONS__ 2025-05-07T19:50:30.3718172Z -D__CUDA_NO_HALF2_OPERATORS__ 2025-05-07T19:50:30.3718454Z 2025-05-07T19:50:30.3718618Z HIPCC_FLAGS: 2025-05-07T19:50:30.3718722Z 2025-05-07T19:50:30.3718792Z 2025-05-07T19:50:30.3718945Z INCLUDE_DIRS: 2025-05-07T19:50:30.3719150Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/../include 2025-05-07T19:50:30.3719417Z /__w/FBGEMM/FBGEMM/fbgemm_gpu 2025-05-07T19:50:30.3719665Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include 2025-05-07T19:50:30.3719927Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/../include 2025-05-07T19:50:30.3720366Z /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include 2025-05-07T19:50:30.3721063Z /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include 2025-05-07T19:50:30.3721646Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src 2025-05-07T19:50:30.3722008Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include 2025-05-07T19:50:30.3722380Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include 2025-05-07T19:50:30.3722803Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include 2025-05-07T19:50:30.3723260Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include 2025-05-07T19:50:30.3723677Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include 2025-05-07T19:50:30.3724174Z /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include 2025-05-07T19:50:30.3724621Z 2025-05-07T19:50:30.3724786Z Selected Source Files: 2025-05-07T19:50:30.3725034Z gen_embedding_optimizer_rowwise_adagrad_split.cpp 2025-05-07T19:50:30.3725387Z gen_embedding_optimizer_rowwise_adagrad_split_cuda.cu 2025-05-07T19:50:30.3725745Z gen_embedding_optimizer_rowwise_adagrad_split_kernel.cu 2025-05-07T19:50:30.3726041Z 2025-05-07T19:50:30.3726200Z HIPified Source Files: 2025-05-07T19:50:30.3726336Z 2025-05-07T19:50:30.3726396Z 2025-05-07T19:50:30.3726553Z Library Dependencies: 2025-05-07T19:50:30.3726746Z torch 2025-05-07T19:50:30.3726900Z torch_library 2025-05-07T19:50:30.3727293Z /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/lib/libc10.so 2025-05-07T19:50:30.3727905Z /github/home/miniconda/envs/build_binary/targets/x86_64-linux/lib/libnvrtc.so 2025-05-07T19:50:30.3728731Z /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/lib/libc10_cuda.so 2025-05-07T19:50:30.3729689Z /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/lib/libnccl.so.2 2025-05-07T19:50:30.3730482Z /github/home/miniconda/envs/build_binary/targets/x86_64-linux/lib/stubs/libcuda.so 2025-05-07T19:50:30.3731068Z /github/home/miniconda/envs/build_binary/lib/stubs/libnvidia-ml.so 2025-05-07T19:50:30.3731441Z 2025-05-07T19:50:30.3731611Z Output Library: 2025-05-07T19:50:30.3731825Z fbgemm_gpu_tbe_optimizers 2025-05-07T19:50:30.3732038Z 2025-05-07T19:50:30.3732216Z Destination Directory: 2025-05-07T19:50:30.3732425Z fbgemm_gpu 2025-05-07T19:50:30.3732633Z ================================================================================ 2025-05-07T19:50:30.3732850Z 2025-05-07T19:50:30.3732854Z 2025-05-07T19:50:30.3732863Z 2025-05-07T19:50:30.3732962Z ================================================================================ 2025-05-07T19:50:30.3733495Z GPU CPP Library Target: fbgemm_gpu_tbe_training_forward (SHARED) 2025-05-07T19:50:30.3733856Z 2025-05-07T19:50:30.3734017Z CPU_SRCS: 2025-05-07T19:50:30.3734255Z gen_embedding_forward_split_pt2_cpu_wrapper.cpp 2025-05-07T19:50:30.3734545Z 2025-05-07T19:50:30.3734718Z GPU_SRCS: 2025-05-07T19:50:30.3734945Z gen_embedding_forward_split_weighted_kernel.cu 2025-05-07T19:50:30.3735302Z gen_embedding_forward_dense_weighted_kernel.cu 2025-05-07T19:50:30.3735632Z gen_embedding_forward_ssd_weighted_kernel.cu 2025-05-07T19:50:30.3736004Z gen_embedding_forward_split_unweighted_nobag_kernel.cu 2025-05-07T19:50:30.3736397Z gen_embedding_forward_dense_unweighted_nobag_kernel.cu 2025-05-07T19:50:30.3736805Z gen_embedding_forward_ssd_unweighted_nobag_kernel.cu 2025-05-07T19:50:30.3737182Z gen_embedding_forward_split_unweighted_kernel.cu 2025-05-07T19:50:30.3737683Z gen_embedding_forward_dense_unweighted_kernel.cu 2025-05-07T19:50:30.3738070Z gen_embedding_forward_ssd_unweighted_kernel.cu 2025-05-07T19:50:30.3738490Z gen_embedding_forward_split_weighted_codegen_cuda.cu 2025-05-07T19:50:30.3738898Z gen_embedding_forward_split_weighted_codegen_meta.cpp 2025-05-07T19:50:30.3739351Z gen_embedding_forward_split_unweighted_codegen_cuda.cu 2025-05-07T19:50:30.3739761Z gen_embedding_forward_split_unweighted_codegen_meta.cpp 2025-05-07T19:50:30.3740218Z gen_embedding_forward_dense_weighted_codegen_cuda.cu 2025-05-07T19:50:30.3740618Z gen_embedding_forward_dense_weighted_codegen_meta.cpp 2025-05-07T19:50:30.3741082Z gen_embedding_forward_dense_unweighted_codegen_cuda.cu 2025-05-07T19:50:30.3741527Z gen_embedding_forward_dense_unweighted_codegen_meta.cpp 2025-05-07T19:50:30.3741927Z gen_embedding_forward_ssd_weighted_codegen_cuda.cu 2025-05-07T19:50:30.3742365Z gen_embedding_forward_ssd_weighted_codegen_meta.cpp 2025-05-07T19:50:30.3742746Z gen_embedding_forward_ssd_unweighted_codegen_cuda.cu 2025-05-07T19:50:30.3743191Z gen_embedding_forward_ssd_unweighted_codegen_meta.cpp 2025-05-07T19:50:30.3743609Z gen_embedding_forward_split_weighted_vbe_codegen_cuda.cu 2025-05-07T19:50:30.3744100Z gen_embedding_forward_split_weighted_vbe_codegen_meta.cpp 2025-05-07T19:50:30.3744498Z gen_embedding_forward_split_weighted_vbe_kernel.cu 2025-05-07T19:50:30.3744869Z gen_embedding_forward_split_weighted_v2_kernel.cu 2025-05-07T19:50:30.3745249Z gen_embedding_forward_split_weighted_gwd_codegen_cuda.cu 2025-05-07T19:50:30.3745676Z gen_embedding_forward_split_weighted_vbe_gwd_codegen_cuda.cu 2025-05-07T19:50:30.3746082Z gen_embedding_forward_split_weighted_gwd_kernel.cu 2025-05-07T19:50:30.3746485Z gen_embedding_forward_dense_weighted_vbe_codegen_cuda.cu 2025-05-07T19:50:30.3746863Z gen_embedding_forward_dense_weighted_vbe_kernel.cu 2025-05-07T19:50:30.3747311Z gen_embedding_forward_split_weighted_vbe_gwd_kernel.cu 2025-05-07T19:50:30.3747709Z gen_embedding_forward_ssd_weighted_vbe_codegen_cuda.cu 2025-05-07T19:50:30.3748173Z gen_embedding_forward_ssd_weighted_vbe_codegen_meta.cpp 2025-05-07T19:50:30.3748622Z gen_embedding_forward_ssd_weighted_vbe_kernel.cu 2025-05-07T19:50:30.3749046Z gen_embedding_forward_split_unweighted_vbe_codegen_cuda.cu 2025-05-07T19:50:30.3749569Z gen_embedding_forward_split_unweighted_vbe_codegen_meta.cpp 2025-05-07T19:50:30.3750003Z gen_embedding_forward_split_unweighted_vbe_kernel.cu 2025-05-07T19:50:30.3750482Z gen_embedding_forward_split_unweighted_v2_kernel.cu 2025-05-07T19:50:30.3750969Z gen_embedding_forward_split_unweighted_gwd_codegen_cuda.cu 2025-05-07T19:50:30.3751431Z gen_embedding_forward_split_unweighted_vbe_gwd_codegen_cuda.cu 2025-05-07T19:50:30.3751959Z gen_embedding_forward_split_unweighted_gwd_kernel.cu 2025-05-07T19:50:30.3752404Z gen_embedding_forward_dense_unweighted_vbe_codegen_cuda.cu 2025-05-07T19:50:30.3752867Z gen_embedding_forward_dense_unweighted_vbe_kernel.cu 2025-05-07T19:50:30.3753332Z gen_embedding_forward_split_unweighted_vbe_gwd_kernel.cu 2025-05-07T19:50:30.3753771Z gen_embedding_forward_ssd_unweighted_vbe_codegen_cuda.cu 2025-05-07T19:50:30.3754370Z gen_embedding_forward_ssd_unweighted_vbe_codegen_meta.cpp 2025-05-07T19:50:30.3754838Z gen_embedding_forward_ssd_unweighted_vbe_kernel.cu 2025-05-07T19:50:30.3755279Z gen_embedding_forward_split_unweighted_nobag_kernel_small.cu 2025-05-07T19:50:30.3755741Z gen_embedding_forward_dense_unweighted_nobag_kernel_small.cu 2025-05-07T19:50:30.3756215Z gen_embedding_forward_ssd_unweighted_nobag_kernel_small.cu 2025-05-07T19:50:30.3756624Z gen_embedding_forward_split_pt2_cuda_wrapper.cpp 2025-05-07T19:50:30.3757016Z gen_embedding_forward_ssd_pt2_cuda_wrapper.cpp 2025-05-07T19:50:30.3757323Z 2025-05-07T19:50:30.3757530Z CUDA_SPECIFIC_SRCS: 2025-05-07T19:50:30.3757671Z 2025-05-07T19:50:30.3757764Z 2025-05-07T19:50:30.3757950Z HIP_SPECIFIC_SRCS: 2025-05-07T19:50:30.3758087Z 2025-05-07T19:50:30.3758183Z 2025-05-07T19:50:30.3758373Z OTHER_SRCS: 2025-05-07T19:50:30.3758488Z 2025-05-07T19:50:30.3758584Z 2025-05-07T19:50:30.3758835Z CC_FLAGS: 2025-05-07T19:50:30.3758968Z 2025-05-07T19:50:30.3759044Z 2025-05-07T19:50:30.3759224Z NVCC_FLAGS: 2025-05-07T19:50:30.3759465Z --expt-relaxed-constexpr 2025-05-07T19:50:30.3759734Z -D__CUDA_NO_HALF_OPERATORS__ 2025-05-07T19:50:30.3760033Z -D__CUDA_NO_BFLOAT16_CONVERSIONS__ 2025-05-07T19:50:30.3760343Z -D__CUDA_NO_HALF2_OPERATORS__ 2025-05-07T19:50:30.3760590Z 2025-05-07T19:50:30.3760790Z HIPCC_FLAGS: 2025-05-07T19:50:30.3760914Z 2025-05-07T19:50:30.3761003Z 2025-05-07T19:50:30.3761223Z INCLUDE_DIRS: 2025-05-07T19:50:30.3761451Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/../include 2025-05-07T19:50:30.3761774Z /__w/FBGEMM/FBGEMM/fbgemm_gpu 2025-05-07T19:50:30.3762052Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include 2025-05-07T19:50:30.3762370Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/../include 2025-05-07T19:50:30.3762858Z /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include 2025-05-07T19:50:30.3763826Z /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include 2025-05-07T19:50:30.3764482Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src 2025-05-07T19:50:30.3764888Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include 2025-05-07T19:50:30.3765323Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include 2025-05-07T19:50:30.3765786Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include 2025-05-07T19:50:30.3766321Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include 2025-05-07T19:50:30.3766776Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include 2025-05-07T19:50:30.3767346Z /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include 2025-05-07T19:50:30.3767859Z 2025-05-07T19:50:30.3768056Z Selected Source Files: 2025-05-07T19:50:30.3768364Z gen_embedding_forward_split_pt2_cpu_wrapper.cpp 2025-05-07T19:50:30.3768756Z gen_embedding_forward_split_weighted_codegen_meta.cpp 2025-05-07T19:50:30.3769184Z gen_embedding_forward_split_unweighted_codegen_meta.cpp 2025-05-07T19:50:30.3769602Z gen_embedding_forward_dense_weighted_codegen_meta.cpp 2025-05-07T19:50:30.3770027Z gen_embedding_forward_dense_unweighted_codegen_meta.cpp 2025-05-07T19:50:30.3770513Z gen_embedding_forward_ssd_weighted_codegen_meta.cpp 2025-05-07T19:50:30.3770925Z gen_embedding_forward_ssd_unweighted_codegen_meta.cpp 2025-05-07T19:50:30.3771357Z gen_embedding_forward_split_weighted_vbe_codegen_meta.cpp 2025-05-07T19:50:30.3771775Z gen_embedding_forward_ssd_weighted_vbe_codegen_meta.cpp 2025-05-07T19:50:30.3772222Z gen_embedding_forward_split_unweighted_vbe_codegen_meta.cpp 2025-05-07T19:50:30.3772667Z gen_embedding_forward_ssd_unweighted_vbe_codegen_meta.cpp 2025-05-07T19:50:30.3773064Z gen_embedding_forward_split_pt2_cuda_wrapper.cpp 2025-05-07T19:50:30.3773418Z gen_embedding_forward_ssd_pt2_cuda_wrapper.cpp 2025-05-07T19:50:30.3773771Z gen_embedding_forward_split_weighted_kernel.cu 2025-05-07T19:50:30.3774117Z gen_embedding_forward_dense_weighted_kernel.cu 2025-05-07T19:50:30.3774454Z gen_embedding_forward_ssd_weighted_kernel.cu 2025-05-07T19:50:30.3774833Z gen_embedding_forward_split_unweighted_nobag_kernel.cu 2025-05-07T19:50:30.3775326Z gen_embedding_forward_dense_unweighted_nobag_kernel.cu 2025-05-07T19:50:30.3775725Z gen_embedding_forward_ssd_unweighted_nobag_kernel.cu 2025-05-07T19:50:30.3776092Z gen_embedding_forward_split_unweighted_kernel.cu 2025-05-07T19:50:30.3776458Z gen_embedding_forward_dense_unweighted_kernel.cu 2025-05-07T19:50:30.3776807Z gen_embedding_forward_ssd_unweighted_kernel.cu 2025-05-07T19:50:30.3777176Z gen_embedding_forward_split_weighted_codegen_cuda.cu 2025-05-07T19:50:30.3777575Z gen_embedding_forward_split_unweighted_codegen_cuda.cu 2025-05-07T19:50:30.3777960Z gen_embedding_forward_dense_weighted_codegen_cuda.cu 2025-05-07T19:50:30.3778357Z gen_embedding_forward_dense_unweighted_codegen_cuda.cu 2025-05-07T19:50:30.3778730Z gen_embedding_forward_ssd_weighted_codegen_cuda.cu 2025-05-07T19:50:30.3779109Z gen_embedding_forward_ssd_unweighted_codegen_cuda.cu 2025-05-07T19:50:30.3779591Z gen_embedding_forward_split_weighted_vbe_codegen_cuda.cu 2025-05-07T19:50:30.3779993Z gen_embedding_forward_split_weighted_vbe_kernel.cu 2025-05-07T19:50:30.3780361Z gen_embedding_forward_split_weighted_v2_kernel.cu 2025-05-07T19:50:30.3780754Z gen_embedding_forward_split_weighted_gwd_codegen_cuda.cu 2025-05-07T19:50:30.3781186Z gen_embedding_forward_split_weighted_vbe_gwd_codegen_cuda.cu 2025-05-07T19:50:30.3781587Z gen_embedding_forward_split_weighted_gwd_kernel.cu 2025-05-07T19:50:30.3781988Z gen_embedding_forward_dense_weighted_vbe_codegen_cuda.cu 2025-05-07T19:50:30.3782363Z gen_embedding_forward_dense_weighted_vbe_kernel.cu 2025-05-07T19:50:30.3782746Z gen_embedding_forward_split_weighted_vbe_gwd_kernel.cu 2025-05-07T19:50:30.3783129Z gen_embedding_forward_ssd_weighted_vbe_codegen_cuda.cu 2025-05-07T19:50:30.3783500Z gen_embedding_forward_ssd_weighted_vbe_kernel.cu 2025-05-07T19:50:30.3783889Z gen_embedding_forward_split_unweighted_vbe_codegen_cuda.cu 2025-05-07T19:50:30.3784288Z gen_embedding_forward_split_unweighted_vbe_kernel.cu 2025-05-07T19:50:30.3784683Z gen_embedding_forward_split_unweighted_v2_kernel.cu 2025-05-07T19:50:30.3785079Z gen_embedding_forward_split_unweighted_gwd_codegen_cuda.cu 2025-05-07T19:50:30.3785522Z gen_embedding_forward_split_unweighted_vbe_gwd_codegen_cuda.cu 2025-05-07T19:50:30.3785949Z gen_embedding_forward_split_unweighted_gwd_kernel.cu 2025-05-07T19:50:30.3786355Z gen_embedding_forward_dense_unweighted_vbe_codegen_cuda.cu 2025-05-07T19:50:30.3786750Z gen_embedding_forward_dense_unweighted_vbe_kernel.cu 2025-05-07T19:50:30.3787152Z gen_embedding_forward_split_unweighted_vbe_gwd_kernel.cu 2025-05-07T19:50:30.3787557Z gen_embedding_forward_ssd_unweighted_vbe_codegen_cuda.cu 2025-05-07T19:50:30.3787942Z gen_embedding_forward_ssd_unweighted_vbe_kernel.cu 2025-05-07T19:50:30.3788346Z gen_embedding_forward_split_unweighted_nobag_kernel_small.cu 2025-05-07T19:50:30.3788785Z gen_embedding_forward_dense_unweighted_nobag_kernel_small.cu 2025-05-07T19:50:30.3789218Z gen_embedding_forward_ssd_unweighted_nobag_kernel_small.cu 2025-05-07T19:50:30.3789558Z 2025-05-07T19:50:30.3789744Z HIPified Source Files: 2025-05-07T19:50:30.3789897Z 2025-05-07T19:50:30.3789974Z 2025-05-07T19:50:30.3790154Z Library Dependencies: 2025-05-07T19:50:30.3790375Z torch 2025-05-07T19:50:30.3790553Z torch_library 2025-05-07T19:50:30.3790985Z /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/lib/libc10.so 2025-05-07T19:50:30.3791644Z /github/home/miniconda/envs/build_binary/targets/x86_64-linux/lib/libnvrtc.so 2025-05-07T19:50:30.3792329Z /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/lib/libc10_cuda.so 2025-05-07T19:50:30.3793110Z /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/lib/libnccl.so.2 2025-05-07T19:50:30.3793841Z /github/home/miniconda/envs/build_binary/targets/x86_64-linux/lib/stubs/libcuda.so 2025-05-07T19:50:30.3794311Z fbgemm_gpu_tbe_common 2025-05-07T19:50:30.3794663Z /github/home/miniconda/envs/build_binary/lib/stubs/libnvidia-ml.so 2025-05-07T19:50:30.3795051Z 2025-05-07T19:50:30.3795221Z Output Library: 2025-05-07T19:50:30.3795513Z fbgemm_gpu_tbe_training_forward 2025-05-07T19:50:30.3795754Z 2025-05-07T19:50:30.3795945Z Destination Directory: 2025-05-07T19:50:30.3796162Z fbgemm_gpu 2025-05-07T19:50:30.3796394Z ================================================================================ 2025-05-07T19:50:30.3796611Z 2025-05-07T19:50:30.3796834Z 2025-05-07T19:50:30.3796838Z 2025-05-07T19:50:30.3796957Z ================================================================================ 2025-05-07T19:50:30.3797382Z GPU CPP Library Target: fbgemm_gpu_tbe_training_backward_pt2 (SHARED) 2025-05-07T19:50:30.3797764Z 2025-05-07T19:50:30.3797930Z CPU_SRCS: 2025-05-07T19:50:30.3798168Z gen_embedding_split_adagrad_pt2_autograd.cpp 2025-05-07T19:50:30.3798530Z gen_embedding_split_rowwise_adagrad_pt2_autograd.cpp 2025-05-07T19:50:30.3798902Z gen_embedding_split_sgd_pt2_autograd.cpp 2025-05-07T19:50:30.3799292Z gen_embedding_split_adam_pt2_autograd.cpp 2025-05-07T19:50:30.3799602Z gen_embedding_split_lamb_pt2_autograd.cpp 2025-05-07T19:50:30.3799948Z gen_embedding_split_lars_sgd_pt2_autograd.cpp 2025-05-07T19:50:30.3800323Z gen_embedding_split_partial_rowwise_adam_pt2_autograd.cpp 2025-05-07T19:50:30.3800753Z gen_embedding_split_partial_rowwise_lamb_pt2_autograd.cpp 2025-05-07T19:50:30.3801125Z gen_embedding_split_none_pt2_autograd.cpp 2025-05-07T19:50:30.3801535Z gen_embedding_split_rowwise_adagrad_with_counter_pt2_autograd.cpp 2025-05-07T19:50:30.3801951Z gen_embedding_split_approx_sgd_pt2_autograd.cpp 2025-05-07T19:50:30.3802359Z gen_embedding_split_approx_rowwise_adagrad_pt2_autograd.cpp 2025-05-07T19:50:30.3802857Z gen_embedding_split_approx_rowwise_adagrad_with_counter_pt2_autograd.cpp 2025-05-07T19:50:30.3803410Z gen_embedding_split_approx_rowwise_adagrad_with_weight_decay_pt2_autograd.cpp 2025-05-07T19:50:30.3803970Z gen_embedding_split_rowwise_adagrad_with_weight_decay_pt2_autograd.cpp 2025-05-07T19:50:30.3804464Z gen_embedding_split_rowwise_weighted_adagrad_pt2_autograd.cpp 2025-05-07T19:50:30.3804893Z gen_embedding_ssd_rowwise_adagrad_pt2_autograd.cpp 2025-05-07T19:50:30.3805292Z gen_embedding_backward_split_adagrad_pt2_cpu_wrapper.cpp 2025-05-07T19:50:30.3805753Z gen_embedding_backward_split_rowwise_adagrad_pt2_cpu_wrapper.cpp 2025-05-07T19:50:30.3806201Z gen_embedding_backward_split_sgd_pt2_cpu_wrapper.cpp 2025-05-07T19:50:30.3806587Z gen_embedding_backward_split_adam_pt2_cpu_wrapper.cpp 2025-05-07T19:50:30.3806985Z gen_embedding_backward_split_lamb_pt2_cpu_wrapper.cpp 2025-05-07T19:50:30.3807382Z gen_embedding_backward_split_lars_sgd_pt2_cpu_wrapper.cpp 2025-05-07T19:50:30.3807854Z gen_embedding_backward_split_partial_rowwise_adam_pt2_cpu_wrapper.cpp 2025-05-07T19:50:30.3808379Z gen_embedding_backward_split_partial_rowwise_lamb_pt2_cpu_wrapper.cpp 2025-05-07T19:50:30.3808854Z gen_embedding_backward_split_none_pt2_cpu_wrapper.cpp 2025-05-07T19:50:30.3809349Z gen_embedding_backward_split_rowwise_adagrad_with_counter_pt2_cpu_wrapper.cpp 2025-05-07T19:50:30.3809866Z gen_embedding_backward_split_approx_sgd_pt2_cpu_wrapper.cpp 2025-05-07T19:50:30.3810469Z gen_embedding_backward_split_approx_rowwise_adagrad_pt2_cpu_wrapper.cpp 2025-05-07T19:50:30.3811056Z gen_embedding_backward_split_approx_rowwise_adagrad_with_counter_pt2_cpu_wrapper.cpp 2025-05-07T19:50:30.3811721Z gen_embedding_backward_split_approx_rowwise_adagrad_with_weight_decay_pt2_cpu_wrapper.cpp 2025-05-07T19:50:30.3812365Z gen_embedding_backward_split_rowwise_adagrad_with_weight_decay_pt2_cpu_wrapper.cpp 2025-05-07T19:50:30.3812962Z gen_embedding_backward_split_rowwise_weighted_adagrad_pt2_cpu_wrapper.cpp 2025-05-07T19:50:30.3813377Z 2025-05-07T19:50:30.3813548Z GPU_SRCS: 2025-05-07T19:50:30.3813829Z gen_embedding_backward_split_adagrad_pt2_cuda_wrapper.cpp 2025-05-07T19:50:30.3814278Z gen_embedding_backward_split_rowwise_adagrad_pt2_cuda_wrapper.cpp 2025-05-07T19:50:30.3814720Z gen_embedding_backward_split_sgd_pt2_cuda_wrapper.cpp 2025-05-07T19:50:30.3815112Z gen_embedding_backward_split_adam_pt2_cuda_wrapper.cpp 2025-05-07T19:50:30.3815593Z gen_embedding_backward_split_lamb_pt2_cuda_wrapper.cpp 2025-05-07T19:50:30.3815998Z gen_embedding_backward_split_lars_sgd_pt2_cuda_wrapper.cpp 2025-05-07T19:50:30.3816482Z gen_embedding_backward_split_partial_rowwise_adam_pt2_cuda_wrapper.cpp 2025-05-07T19:50:30.3817016Z gen_embedding_backward_split_partial_rowwise_lamb_pt2_cuda_wrapper.cpp 2025-05-07T19:50:30.3817476Z gen_embedding_backward_split_none_pt2_cuda_wrapper.cpp 2025-05-07T19:50:30.3817975Z gen_embedding_backward_split_rowwise_adagrad_with_counter_pt2_cuda_wrapper.cpp 2025-05-07T19:50:30.3818493Z gen_embedding_backward_split_approx_sgd_pt2_cuda_wrapper.cpp 2025-05-07T19:50:30.3819005Z gen_embedding_backward_split_approx_rowwise_adagrad_pt2_cuda_wrapper.cpp 2025-05-07T19:50:30.3819592Z gen_embedding_backward_split_approx_rowwise_adagrad_with_counter_pt2_cuda_wrapper.cpp 2025-05-07T19:50:30.3820262Z gen_embedding_backward_split_approx_rowwise_adagrad_with_weight_decay_pt2_cuda_wrapper.cpp 2025-05-07T19:50:30.3820995Z gen_embedding_backward_split_rowwise_adagrad_with_weight_decay_pt2_cuda_wrapper.cpp 2025-05-07T19:50:30.3821587Z gen_embedding_backward_split_rowwise_weighted_adagrad_pt2_cuda_wrapper.cpp 2025-05-07T19:50:30.3822122Z gen_embedding_backward_ssd_rowwise_adagrad_pt2_cuda_wrapper.cpp 2025-05-07T19:50:30.3822484Z 2025-05-07T19:50:30.3822685Z CUDA_SPECIFIC_SRCS: 2025-05-07T19:50:30.3822816Z 2025-05-07T19:50:30.3822883Z 2025-05-07T19:50:30.3823069Z HIP_SPECIFIC_SRCS: 2025-05-07T19:50:30.3823198Z 2025-05-07T19:50:30.3823289Z 2025-05-07T19:50:30.3823458Z OTHER_SRCS: 2025-05-07T19:50:30.3823576Z 2025-05-07T19:50:30.3823656Z 2025-05-07T19:50:30.3823820Z CC_FLAGS: 2025-05-07T19:50:30.3823926Z 2025-05-07T19:50:30.3824011Z 2025-05-07T19:50:30.3824172Z NVCC_FLAGS: 2025-05-07T19:50:30.3824381Z --expt-relaxed-constexpr 2025-05-07T19:50:30.3824636Z -D__CUDA_NO_HALF_OPERATORS__ 2025-05-07T19:50:30.3824908Z -D__CUDA_NO_BFLOAT16_CONVERSIONS__ 2025-05-07T19:50:30.3825186Z -D__CUDA_NO_HALF2_OPERATORS__ 2025-05-07T19:50:30.3825419Z 2025-05-07T19:50:30.3825599Z HIPCC_FLAGS: 2025-05-07T19:50:30.3825718Z 2025-05-07T19:50:30.3825786Z 2025-05-07T19:50:30.3825957Z INCLUDE_DIRS: 2025-05-07T19:50:30.3826168Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/../include 2025-05-07T19:50:30.3826466Z /__w/FBGEMM/FBGEMM/fbgemm_gpu 2025-05-07T19:50:30.3826723Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include 2025-05-07T19:50:30.3827012Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/../include 2025-05-07T19:50:30.3827481Z /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include 2025-05-07T19:50:30.3828257Z /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include 2025-05-07T19:50:30.3829094Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src 2025-05-07T19:50:30.3829485Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include 2025-05-07T19:50:30.3829901Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include 2025-05-07T19:50:30.3830355Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include 2025-05-07T19:50:30.3830875Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include 2025-05-07T19:50:30.3831311Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include 2025-05-07T19:50:30.3831869Z /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include 2025-05-07T19:50:30.3832349Z 2025-05-07T19:50:30.3832526Z Selected Source Files: 2025-05-07T19:50:30.3832800Z gen_embedding_split_adagrad_pt2_autograd.cpp 2025-05-07T19:50:30.3833157Z gen_embedding_split_rowwise_adagrad_pt2_autograd.cpp 2025-05-07T19:50:30.3833527Z gen_embedding_split_sgd_pt2_autograd.cpp 2025-05-07T19:50:30.3833841Z gen_embedding_split_adam_pt2_autograd.cpp 2025-05-07T19:50:30.3834168Z gen_embedding_split_lamb_pt2_autograd.cpp 2025-05-07T19:50:30.3834491Z gen_embedding_split_lars_sgd_pt2_autograd.cpp 2025-05-07T19:50:30.3834874Z gen_embedding_split_partial_rowwise_adam_pt2_autograd.cpp 2025-05-07T19:50:30.3835290Z gen_embedding_split_partial_rowwise_lamb_pt2_autograd.cpp 2025-05-07T19:50:30.3835672Z gen_embedding_split_none_pt2_autograd.cpp 2025-05-07T19:50:30.3836205Z gen_embedding_split_rowwise_adagrad_with_counter_pt2_autograd.cpp 2025-05-07T19:50:30.3836623Z gen_embedding_split_approx_sgd_pt2_autograd.cpp 2025-05-07T19:50:30.3837026Z gen_embedding_split_approx_rowwise_adagrad_pt2_autograd.cpp 2025-05-07T19:50:30.3837503Z gen_embedding_split_approx_rowwise_adagrad_with_counter_pt2_autograd.cpp 2025-05-07T19:50:30.3838072Z gen_embedding_split_approx_rowwise_adagrad_with_weight_decay_pt2_autograd.cpp 2025-05-07T19:50:30.3838624Z gen_embedding_split_rowwise_adagrad_with_weight_decay_pt2_autograd.cpp 2025-05-07T19:50:30.3839117Z gen_embedding_split_rowwise_weighted_adagrad_pt2_autograd.cpp 2025-05-07T19:50:30.3839534Z gen_embedding_ssd_rowwise_adagrad_pt2_autograd.cpp 2025-05-07T19:50:30.3839924Z gen_embedding_backward_split_adagrad_pt2_cpu_wrapper.cpp 2025-05-07T19:50:30.3840381Z gen_embedding_backward_split_rowwise_adagrad_pt2_cpu_wrapper.cpp 2025-05-07T19:50:30.3840905Z gen_embedding_backward_split_sgd_pt2_cpu_wrapper.cpp 2025-05-07T19:50:30.3841333Z gen_embedding_backward_split_adam_pt2_cpu_wrapper.cpp 2025-05-07T19:50:30.3841725Z gen_embedding_backward_split_lamb_pt2_cpu_wrapper.cpp 2025-05-07T19:50:30.3842149Z gen_embedding_backward_split_lars_sgd_pt2_cpu_wrapper.cpp 2025-05-07T19:50:30.3842654Z gen_embedding_backward_split_partial_rowwise_adam_pt2_cpu_wrapper.cpp 2025-05-07T19:50:30.3843185Z gen_embedding_backward_split_partial_rowwise_lamb_pt2_cpu_wrapper.cpp 2025-05-07T19:50:30.3843665Z gen_embedding_backward_split_none_pt2_cpu_wrapper.cpp 2025-05-07T19:50:30.3844158Z gen_embedding_backward_split_rowwise_adagrad_with_counter_pt2_cpu_wrapper.cpp 2025-05-07T19:50:30.3844704Z gen_embedding_backward_split_approx_sgd_pt2_cpu_wrapper.cpp 2025-05-07T19:50:30.3845199Z gen_embedding_backward_split_approx_rowwise_adagrad_pt2_cpu_wrapper.cpp 2025-05-07T19:50:30.3845812Z gen_embedding_backward_split_approx_rowwise_adagrad_with_counter_pt2_cpu_wrapper.cpp 2025-05-07T19:50:30.3846502Z gen_embedding_backward_split_approx_rowwise_adagrad_with_weight_decay_pt2_cpu_wrapper.cpp 2025-05-07T19:50:30.3847154Z gen_embedding_backward_split_rowwise_adagrad_with_weight_decay_pt2_cpu_wrapper.cpp 2025-05-07T19:50:30.3847762Z gen_embedding_backward_split_rowwise_weighted_adagrad_pt2_cpu_wrapper.cpp 2025-05-07T19:50:30.3848270Z gen_embedding_backward_split_adagrad_pt2_cuda_wrapper.cpp 2025-05-07T19:50:30.3848752Z gen_embedding_backward_split_rowwise_adagrad_pt2_cuda_wrapper.cpp 2025-05-07T19:50:30.3849202Z gen_embedding_backward_split_sgd_pt2_cuda_wrapper.cpp 2025-05-07T19:50:30.3849625Z gen_embedding_backward_split_adam_pt2_cuda_wrapper.cpp 2025-05-07T19:50:30.3850052Z gen_embedding_backward_split_lamb_pt2_cuda_wrapper.cpp 2025-05-07T19:50:30.3850552Z gen_embedding_backward_split_lars_sgd_pt2_cuda_wrapper.cpp 2025-05-07T19:50:30.3851052Z gen_embedding_backward_split_partial_rowwise_adam_pt2_cuda_wrapper.cpp 2025-05-07T19:50:30.3851595Z gen_embedding_backward_split_partial_rowwise_lamb_pt2_cuda_wrapper.cpp 2025-05-07T19:50:30.3852110Z gen_embedding_backward_split_none_pt2_cuda_wrapper.cpp 2025-05-07T19:50:30.3852603Z gen_embedding_backward_split_rowwise_adagrad_with_counter_pt2_cuda_wrapper.cpp 2025-05-07T19:50:30.3853127Z gen_embedding_backward_split_approx_sgd_pt2_cuda_wrapper.cpp 2025-05-07T19:50:30.3853627Z gen_embedding_backward_split_approx_rowwise_adagrad_pt2_cuda_wrapper.cpp 2025-05-07T19:50:30.3854207Z gen_embedding_backward_split_approx_rowwise_adagrad_with_counter_pt2_cuda_wrapper.cpp 2025-05-07T19:50:30.3854877Z gen_embedding_backward_split_approx_rowwise_adagrad_with_weight_decay_pt2_cuda_wrapper.cpp 2025-05-07T19:50:30.3855524Z gen_embedding_backward_split_rowwise_adagrad_with_weight_decay_pt2_cuda_wrapper.cpp 2025-05-07T19:50:30.3856118Z gen_embedding_backward_split_rowwise_weighted_adagrad_pt2_cuda_wrapper.cpp 2025-05-07T19:50:30.3856638Z gen_embedding_backward_ssd_rowwise_adagrad_pt2_cuda_wrapper.cpp 2025-05-07T19:50:30.3857013Z 2025-05-07T19:50:30.3857209Z HIPified Source Files: 2025-05-07T19:50:30.3857360Z 2025-05-07T19:50:30.3857432Z 2025-05-07T19:50:30.3857725Z Library Dependencies: 2025-05-07T19:50:30.3857938Z torch 2025-05-07T19:50:30.3858124Z torch_library 2025-05-07T19:50:30.3858541Z /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/lib/libc10.so 2025-05-07T19:50:30.3859224Z /github/home/miniconda/envs/build_binary/targets/x86_64-linux/lib/libnvrtc.so 2025-05-07T19:50:30.3860109Z /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/lib/libc10_cuda.so 2025-05-07T19:50:30.3860900Z /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/lib/libnccl.so.2 2025-05-07T19:50:30.3861630Z /github/home/miniconda/envs/build_binary/targets/x86_64-linux/lib/stubs/libcuda.so 2025-05-07T19:50:30.3862072Z fbgemm 2025-05-07T19:50:30.3862271Z fbgemm_gpu_config 2025-05-07T19:50:30.3862479Z fbgemm_gpu_tbe_cache 2025-05-07T19:50:30.3862826Z fbgemm_gpu_tbe_common 2025-05-07T19:50:30.3863095Z fbgemm_gpu_tbe_utils 2025-05-07T19:50:30.3863330Z fbgemm_gpu_sparse_async_cumsum 2025-05-07T19:50:30.3863698Z /github/home/miniconda/envs/build_binary/lib/stubs/libnvidia-ml.so 2025-05-07T19:50:30.3864080Z 2025-05-07T19:50:30.3864267Z Output Library: 2025-05-07T19:50:30.3864483Z fbgemm_gpu_tbe_training_backward_pt2 2025-05-07T19:50:30.3864744Z 2025-05-07T19:50:30.3864919Z Destination Directory: 2025-05-07T19:50:30.3865148Z fbgemm_gpu 2025-05-07T19:50:30.3865369Z ================================================================================ 2025-05-07T19:50:30.3865774Z 2025-05-07T19:50:30.3865778Z 2025-05-07T19:50:30.3865782Z 2025-05-07T19:50:30.3865888Z ================================================================================ 2025-05-07T19:50:30.3866299Z GPU CPP Library Target: fbgemm_gpu_tbe_training_backward (SHARED) 2025-05-07T19:50:30.3866667Z 2025-05-07T19:50:30.3866848Z CPU_SRCS: 2025-05-07T19:50:30.3867150Z codegen/training/backward/embedding_backward_dense_host_cpu.cpp 2025-05-07T19:50:30.3867590Z gen_embedding_backward_dense_split_cpu.cpp 2025-05-07T19:50:30.3867923Z gen_embedding_backward_split_adagrad_cpu.cpp 2025-05-07T19:50:30.3868306Z gen_embedding_backward_split_rowwise_adagrad_cpu.cpp 2025-05-07T19:50:30.3868655Z gen_embedding_backward_split_sgd_cpu.cpp 2025-05-07T19:50:30.3868977Z gen_embedding_backward_split_adam_cpu.cpp 2025-05-07T19:50:30.3869292Z gen_embedding_backward_split_lamb_cpu.cpp 2025-05-07T19:50:30.3869632Z gen_embedding_backward_split_lars_sgd_cpu.cpp 2025-05-07T19:50:30.3870018Z gen_embedding_backward_split_partial_rowwise_adam_cpu.cpp 2025-05-07T19:50:30.3870438Z gen_embedding_backward_split_partial_rowwise_lamb_cpu.cpp 2025-05-07T19:50:30.3870818Z gen_embedding_backward_split_none_cpu.cpp 2025-05-07T19:50:30.3871213Z gen_embedding_backward_split_rowwise_adagrad_with_counter_cpu.cpp 2025-05-07T19:50:30.3871643Z gen_embedding_backward_split_approx_sgd_cpu.cpp 2025-05-07T19:50:30.3872031Z gen_embedding_backward_split_approx_rowwise_adagrad_cpu.cpp 2025-05-07T19:50:30.3872531Z gen_embedding_backward_split_approx_rowwise_adagrad_with_counter_cpu.cpp 2025-05-07T19:50:30.3873105Z gen_embedding_backward_split_approx_rowwise_adagrad_with_weight_decay_cpu.cpp 2025-05-07T19:50:30.3873654Z gen_embedding_backward_split_rowwise_adagrad_with_weight_decay_cpu.cpp 2025-05-07T19:50:30.3874156Z gen_embedding_backward_split_rowwise_weighted_adagrad_cpu.cpp 2025-05-07T19:50:30.3874565Z gen_embedding_backward_adagrad_split_cpu.cpp 2025-05-07T19:50:30.3874931Z gen_embedding_backward_rowwise_adagrad_split_cpu.cpp 2025-05-07T19:50:30.3875281Z gen_embedding_backward_sgd_split_cpu.cpp 2025-05-07T19:50:30.3875558Z 2025-05-07T19:50:30.3875725Z GPU_SRCS: 2025-05-07T19:50:30.3875964Z gen_embedding_backward_split_grad_embedding_ops.cu 2025-05-07T19:50:30.3876373Z gen_embedding_backward_split_indice_weights_codegen_cuda.cu 2025-05-07T19:50:30.3876802Z gen_embedding_backward_dense_indice_weights_codegen_cuda.cu 2025-05-07T19:50:30.3877233Z gen_embedding_backward_ssd_indice_weights_codegen_cuda.cu 2025-05-07T19:50:30.3877653Z gen_embedding_backward_rowwise_adagrad_ssd_weighted_cuda.cu 2025-05-07T19:50:30.3878181Z gen_embedding_backward_rowwise_adagrad_ssd_weighted_kernel_cta.cu 2025-05-07T19:50:30.3878698Z gen_embedding_backward_rowwise_adagrad_ssd_weighted_kernel_warp.cu 2025-05-07T19:50:30.3879200Z gen_embedding_backward_rowwise_adagrad_ssd_unweighted_nobag_cuda.cu 2025-05-07T19:50:30.3879722Z gen_embedding_backward_rowwise_adagrad_ssd_unweighted_nobag_kernel_cta.cu 2025-05-07T19:50:30.3880267Z gen_embedding_backward_rowwise_adagrad_ssd_unweighted_nobag_kernel_warp.cu 2025-05-07T19:50:30.3880777Z gen_embedding_backward_rowwise_adagrad_ssd_unweighted_cuda.cu 2025-05-07T19:50:30.3881352Z gen_embedding_backward_rowwise_adagrad_ssd_unweighted_kernel_cta.cu 2025-05-07T19:50:30.3881864Z gen_embedding_backward_rowwise_adagrad_ssd_unweighted_kernel_warp.cu 2025-05-07T19:50:30.3882379Z gen_embedding_backward_adagrad_split_weighted_cuda.cu 2025-05-07T19:50:30.3882863Z gen_embedding_backward_adagrad_split_weighted_kernel_cta.cu 2025-05-07T19:50:30.3884146Z gen_embedding_backward_adagrad_split_weighted_kernel_warp.cu 2025-05-07T19:50:30.3884668Z gen_embedding_backward_adagrad_split_unweighted_nobag_cuda.cu 2025-05-07T19:50:30.3885212Z gen_embedding_backward_adagrad_split_unweighted_nobag_kernel_cta.cu 2025-05-07T19:50:30.3885740Z gen_embedding_backward_adagrad_split_unweighted_nobag_kernel_warp.cu 2025-05-07T19:50:30.3886238Z gen_embedding_backward_adagrad_split_unweighted_cuda.cu 2025-05-07T19:50:30.3886718Z gen_embedding_backward_adagrad_split_unweighted_kernel_cta.cu 2025-05-07T19:50:30.3887187Z gen_embedding_backward_adagrad_split_unweighted_kernel_warp.cu 2025-05-07T19:50:30.3887701Z gen_embedding_backward_rowwise_adagrad_split_weighted_cuda.cu 2025-05-07T19:50:30.3888178Z gen_embedding_backward_rowwise_adagrad_split_weighted_kernel_cta.cu 2025-05-07T19:50:30.3888684Z gen_embedding_backward_rowwise_adagrad_split_weighted_kernel_warp.cu 2025-05-07T19:50:30.3889195Z gen_embedding_backward_rowwise_adagrad_split_unweighted_nobag_cuda.cu 2025-05-07T19:50:30.3889737Z gen_embedding_backward_rowwise_adagrad_split_unweighted_nobag_kernel_cta.cu 2025-05-07T19:50:30.3890369Z gen_embedding_backward_rowwise_adagrad_split_unweighted_nobag_kernel_warp.cu 2025-05-07T19:50:30.3890895Z gen_embedding_backward_rowwise_adagrad_split_unweighted_cuda.cu 2025-05-07T19:50:30.3891383Z gen_embedding_backward_rowwise_adagrad_split_unweighted_kernel_cta.cu 2025-05-07T19:50:30.3908224Z gen_embedding_backward_rowwise_adagrad_split_unweighted_kernel_warp.cu 2025-05-07T19:50:30.3908812Z gen_embedding_backward_sgd_split_weighted_cuda.cu 2025-05-07T19:50:30.3909202Z gen_embedding_backward_sgd_split_weighted_kernel_cta.cu 2025-05-07T19:50:30.3909623Z gen_embedding_backward_sgd_split_weighted_kernel_warp.cu 2025-05-07T19:50:30.3910038Z gen_embedding_backward_sgd_split_unweighted_nobag_cuda.cu 2025-05-07T19:50:30.3910494Z gen_embedding_backward_sgd_split_unweighted_nobag_kernel_cta.cu 2025-05-07T19:50:30.3910963Z gen_embedding_backward_sgd_split_unweighted_nobag_kernel_warp.cu 2025-05-07T19:50:30.3911406Z gen_embedding_backward_sgd_split_unweighted_cuda.cu 2025-05-07T19:50:30.3911806Z gen_embedding_backward_sgd_split_unweighted_kernel_cta.cu 2025-05-07T19:50:30.3912240Z gen_embedding_backward_sgd_split_unweighted_kernel_warp.cu 2025-05-07T19:50:30.3912647Z gen_embedding_backward_adam_split_weighted_cuda.cu 2025-05-07T19:50:30.3913034Z gen_embedding_backward_adam_split_weighted_kernel_cta.cu 2025-05-07T19:50:30.3913447Z gen_embedding_backward_adam_split_weighted_kernel_warp.cu 2025-05-07T19:50:30.3913865Z gen_embedding_backward_adam_split_unweighted_nobag_cuda.cu 2025-05-07T19:50:30.3914325Z gen_embedding_backward_adam_split_unweighted_nobag_kernel_cta.cu 2025-05-07T19:50:30.3914798Z gen_embedding_backward_adam_split_unweighted_nobag_kernel_warp.cu 2025-05-07T19:50:30.3915239Z gen_embedding_backward_adam_split_unweighted_cuda.cu 2025-05-07T19:50:30.3915648Z gen_embedding_backward_adam_split_unweighted_kernel_cta.cu 2025-05-07T19:50:30.3916078Z gen_embedding_backward_adam_split_unweighted_kernel_warp.cu 2025-05-07T19:50:30.3916489Z gen_embedding_backward_lamb_split_weighted_cuda.cu 2025-05-07T19:50:30.3917005Z gen_embedding_backward_lamb_split_weighted_kernel_cta.cu 2025-05-07T19:50:30.3917440Z gen_embedding_backward_lamb_split_weighted_kernel_warp.cu 2025-05-07T19:50:30.3917857Z gen_embedding_backward_lamb_split_unweighted_nobag_cuda.cu 2025-05-07T19:50:30.3918313Z gen_embedding_backward_lamb_split_unweighted_nobag_kernel_cta.cu 2025-05-07T19:50:30.3918811Z gen_embedding_backward_lamb_split_unweighted_nobag_kernel_warp.cu 2025-05-07T19:50:30.3919246Z gen_embedding_backward_lamb_split_unweighted_cuda.cu 2025-05-07T19:50:30.3919653Z gen_embedding_backward_lamb_split_unweighted_kernel_cta.cu 2025-05-07T19:50:30.3920189Z gen_embedding_backward_lamb_split_unweighted_kernel_warp.cu 2025-05-07T19:50:30.3920606Z gen_embedding_backward_lars_sgd_split_weighted_cuda.cu 2025-05-07T19:50:30.3921008Z gen_embedding_backward_lars_sgd_split_weighted_kernel_cta.cu 2025-05-07T19:50:30.3921518Z gen_embedding_backward_lars_sgd_split_weighted_kernel_warp.cu 2025-05-07T19:50:30.3921964Z gen_embedding_backward_lars_sgd_split_unweighted_nobag_cuda.cu 2025-05-07T19:50:30.3922443Z gen_embedding_backward_lars_sgd_split_unweighted_nobag_kernel_cta.cu 2025-05-07T19:50:30.3922951Z gen_embedding_backward_lars_sgd_split_unweighted_nobag_kernel_warp.cu 2025-05-07T19:50:30.3923405Z gen_embedding_backward_lars_sgd_split_unweighted_cuda.cu 2025-05-07T19:50:30.3923835Z gen_embedding_backward_lars_sgd_split_unweighted_kernel_cta.cu 2025-05-07T19:50:30.3924286Z gen_embedding_backward_lars_sgd_split_unweighted_kernel_warp.cu 2025-05-07T19:50:30.3924762Z gen_embedding_backward_partial_rowwise_adam_split_weighted_cuda.cu 2025-05-07T19:50:30.3925266Z gen_embedding_backward_partial_rowwise_adam_split_weighted_kernel_cta.cu 2025-05-07T19:50:30.3925807Z gen_embedding_backward_partial_rowwise_adam_split_weighted_kernel_warp.cu 2025-05-07T19:50:30.3926346Z gen_embedding_backward_partial_rowwise_adam_split_unweighted_nobag_cuda.cu 2025-05-07T19:50:30.3926907Z gen_embedding_backward_partial_rowwise_adam_split_unweighted_nobag_kernel_cta.cu 2025-05-07T19:50:30.3927507Z gen_embedding_backward_partial_rowwise_adam_split_unweighted_nobag_kernel_warp.cu 2025-05-07T19:50:30.3928048Z gen_embedding_backward_partial_rowwise_adam_split_unweighted_cuda.cu 2025-05-07T19:50:30.3928956Z gen_embedding_backward_partial_rowwise_adam_split_unweighted_kernel_cta.cu 2025-05-07T19:50:30.3929511Z gen_embedding_backward_partial_rowwise_adam_split_unweighted_kernel_warp.cu 2025-05-07T19:50:30.3930047Z gen_embedding_backward_partial_rowwise_lamb_split_weighted_cuda.cu 2025-05-07T19:50:30.3930663Z gen_embedding_backward_partial_rowwise_lamb_split_weighted_kernel_cta.cu 2025-05-07T19:50:30.3931217Z gen_embedding_backward_partial_rowwise_lamb_split_weighted_kernel_warp.cu 2025-05-07T19:50:30.3931763Z gen_embedding_backward_partial_rowwise_lamb_split_unweighted_nobag_cuda.cu 2025-05-07T19:50:30.3932337Z gen_embedding_backward_partial_rowwise_lamb_split_unweighted_nobag_kernel_cta.cu 2025-05-07T19:50:30.3932952Z gen_embedding_backward_partial_rowwise_lamb_split_unweighted_nobag_kernel_warp.cu 2025-05-07T19:50:30.3933512Z gen_embedding_backward_partial_rowwise_lamb_split_unweighted_cuda.cu 2025-05-07T19:50:30.3934028Z gen_embedding_backward_partial_rowwise_lamb_split_unweighted_kernel_cta.cu 2025-05-07T19:50:30.3934579Z gen_embedding_backward_partial_rowwise_lamb_split_unweighted_kernel_warp.cu 2025-05-07T19:50:30.3935039Z gen_embedding_backward_none_split_weighted_cuda.cu 2025-05-07T19:50:30.3935430Z gen_embedding_backward_none_split_weighted_kernel_cta.cu 2025-05-07T19:50:30.3935841Z gen_embedding_backward_none_split_weighted_kernel_warp.cu 2025-05-07T19:50:30.3936270Z gen_embedding_backward_none_split_unweighted_nobag_cuda.cu 2025-05-07T19:50:30.3936719Z gen_embedding_backward_none_split_unweighted_nobag_kernel_cta.cu 2025-05-07T19:50:30.3937194Z gen_embedding_backward_none_split_unweighted_nobag_kernel_warp.cu 2025-05-07T19:50:30.3937634Z gen_embedding_backward_none_split_unweighted_cuda.cu 2025-05-07T19:50:30.3938038Z gen_embedding_backward_none_split_unweighted_kernel_cta.cu 2025-05-07T19:50:30.3938618Z gen_embedding_backward_none_split_unweighted_kernel_warp.cu 2025-05-07T19:50:30.3939115Z gen_embedding_backward_rowwise_adagrad_with_counter_split_weighted_cuda.cu 2025-05-07T19:50:30.3939694Z gen_embedding_backward_rowwise_adagrad_with_counter_split_weighted_kernel_cta.cu 2025-05-07T19:50:30.3940300Z gen_embedding_backward_rowwise_adagrad_with_counter_split_weighted_kernel_warp.cu 2025-05-07T19:50:30.3940902Z gen_embedding_backward_rowwise_adagrad_with_counter_split_unweighted_nobag_cuda.cu 2025-05-07T19:50:30.3941536Z gen_embedding_backward_rowwise_adagrad_with_counter_split_unweighted_nobag_kernel_cta.cu 2025-05-07T19:50:30.3942191Z gen_embedding_backward_rowwise_adagrad_with_counter_split_unweighted_nobag_kernel_warp.cu 2025-05-07T19:50:30.3942913Z gen_embedding_backward_rowwise_adagrad_with_counter_split_unweighted_cuda.cu 2025-05-07T19:50:30.3943531Z gen_embedding_backward_rowwise_adagrad_with_counter_split_unweighted_kernel_cta.cu 2025-05-07T19:50:30.3944117Z gen_embedding_backward_rowwise_adagrad_with_counter_split_unweighted_kernel_warp.cu 2025-05-07T19:50:30.3944530Z 2025-05-07T19:50:30.3944693Z CUDA_SPECIFIC_SRCS: 2025-05-07T19:50:30.3944818Z 2025-05-07T19:50:30.3944891Z 2025-05-07T19:50:30.3945046Z HIP_SPECIFIC_SRCS: 2025-05-07T19:50:30.3945351Z gen_embedding_backward_split_unweighted_nobag_device_kernel_hip.hip 2025-05-07T19:50:30.3945784Z gen_embedding_backward_split_weighted_device_kernel_hip.hip 2025-05-07T19:50:30.3946205Z gen_embedding_backward_split_unweighted_device_kernel_hip.hip 2025-05-07T19:50:30.3946519Z 2025-05-07T19:50:30.3946679Z OTHER_SRCS: 2025-05-07T19:50:30.3946784Z 2025-05-07T19:50:30.3946853Z 2025-05-07T19:50:30.3947004Z CC_FLAGS: 2025-05-07T19:50:30.3947101Z 2025-05-07T19:50:30.3947171Z 2025-05-07T19:50:30.3947318Z NVCC_FLAGS: 2025-05-07T19:50:30.3947505Z --expt-relaxed-constexpr 2025-05-07T19:50:30.3947756Z -D__CUDA_NO_HALF_OPERATORS__ 2025-05-07T19:50:30.3947994Z -D__CUDA_NO_BFLOAT16_CONVERSIONS__ 2025-05-07T19:50:30.3948257Z -D__CUDA_NO_HALF2_OPERATORS__ 2025-05-07T19:50:30.3948470Z 2025-05-07T19:50:30.3948635Z HIPCC_FLAGS: 2025-05-07T19:50:30.3948741Z 2025-05-07T19:50:30.3948814Z 2025-05-07T19:50:30.3948964Z INCLUDE_DIRS: 2025-05-07T19:50:30.3949174Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/../include 2025-05-07T19:50:30.3949443Z /__w/FBGEMM/FBGEMM/fbgemm_gpu 2025-05-07T19:50:30.3949690Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include 2025-05-07T19:50:30.3949954Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/../include 2025-05-07T19:50:30.3950408Z /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include 2025-05-07T19:50:30.3951115Z /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include 2025-05-07T19:50:30.3951699Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src 2025-05-07T19:50:30.3952072Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include 2025-05-07T19:50:30.3952454Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include 2025-05-07T19:50:30.3952878Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include 2025-05-07T19:50:30.3953332Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include 2025-05-07T19:50:30.3953741Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include 2025-05-07T19:50:30.3954019Z /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include 2025-05-07T19:50:30.3954088Z 2025-05-07T19:50:30.3954177Z Selected Source Files: 2025-05-07T19:50:30.3954360Z codegen/training/backward/embedding_backward_dense_host_cpu.cpp 2025-05-07T19:50:30.3954462Z gen_embedding_backward_dense_split_cpu.cpp 2025-05-07T19:50:30.3954576Z gen_embedding_backward_split_adagrad_cpu.cpp 2025-05-07T19:50:30.3954707Z gen_embedding_backward_split_rowwise_adagrad_cpu.cpp 2025-05-07T19:50:30.3954808Z gen_embedding_backward_split_sgd_cpu.cpp 2025-05-07T19:50:30.3954919Z gen_embedding_backward_split_adam_cpu.cpp 2025-05-07T19:50:30.3955019Z gen_embedding_backward_split_lamb_cpu.cpp 2025-05-07T19:50:30.3955187Z gen_embedding_backward_split_lars_sgd_cpu.cpp 2025-05-07T19:50:30.3955337Z gen_embedding_backward_split_partial_rowwise_adam_cpu.cpp 2025-05-07T19:50:30.3955489Z gen_embedding_backward_split_partial_rowwise_lamb_cpu.cpp 2025-05-07T19:50:30.3955588Z gen_embedding_backward_split_none_cpu.cpp 2025-05-07T19:50:30.3955762Z gen_embedding_backward_split_rowwise_adagrad_with_counter_cpu.cpp 2025-05-07T19:50:30.3955887Z gen_embedding_backward_split_approx_sgd_cpu.cpp 2025-05-07T19:50:30.3956039Z gen_embedding_backward_split_approx_rowwise_adagrad_cpu.cpp 2025-05-07T19:50:30.3956236Z gen_embedding_backward_split_approx_rowwise_adagrad_with_counter_cpu.cpp 2025-05-07T19:50:30.3956450Z gen_embedding_backward_split_approx_rowwise_adagrad_with_weight_decay_cpu.cpp 2025-05-07T19:50:30.3956650Z gen_embedding_backward_split_rowwise_adagrad_with_weight_decay_cpu.cpp 2025-05-07T19:50:30.3956812Z gen_embedding_backward_split_rowwise_weighted_adagrad_cpu.cpp 2025-05-07T19:50:30.3957009Z gen_embedding_backward_adagrad_split_cpu.cpp 2025-05-07T19:50:30.3957151Z gen_embedding_backward_rowwise_adagrad_split_cpu.cpp 2025-05-07T19:50:30.3957251Z gen_embedding_backward_sgd_split_cpu.cpp 2025-05-07T19:50:30.3957373Z gen_embedding_backward_split_grad_embedding_ops.cu 2025-05-07T19:50:30.3957533Z gen_embedding_backward_split_indice_weights_codegen_cuda.cu 2025-05-07T19:50:30.3957684Z gen_embedding_backward_dense_indice_weights_codegen_cuda.cu 2025-05-07T19:50:30.3957827Z gen_embedding_backward_ssd_indice_weights_codegen_cuda.cu 2025-05-07T19:50:30.3957974Z gen_embedding_backward_rowwise_adagrad_ssd_weighted_cuda.cu 2025-05-07T19:50:30.3958154Z gen_embedding_backward_rowwise_adagrad_ssd_weighted_kernel_cta.cu 2025-05-07T19:50:30.3958330Z gen_embedding_backward_rowwise_adagrad_ssd_weighted_kernel_warp.cu 2025-05-07T19:50:30.3958511Z gen_embedding_backward_rowwise_adagrad_ssd_unweighted_nobag_cuda.cu 2025-05-07T19:50:30.3958719Z gen_embedding_backward_rowwise_adagrad_ssd_unweighted_nobag_kernel_cta.cu 2025-05-07T19:50:30.3958929Z gen_embedding_backward_rowwise_adagrad_ssd_unweighted_nobag_kernel_warp.cu 2025-05-07T19:50:30.3959088Z gen_embedding_backward_rowwise_adagrad_ssd_unweighted_cuda.cu 2025-05-07T19:50:30.3959272Z gen_embedding_backward_rowwise_adagrad_ssd_unweighted_kernel_cta.cu 2025-05-07T19:50:30.3959454Z gen_embedding_backward_rowwise_adagrad_ssd_unweighted_kernel_warp.cu 2025-05-07T19:50:30.3959586Z gen_embedding_backward_adagrad_split_weighted_cuda.cu 2025-05-07T19:50:30.3959746Z gen_embedding_backward_adagrad_split_weighted_kernel_cta.cu 2025-05-07T19:50:30.3959903Z gen_embedding_backward_adagrad_split_weighted_kernel_warp.cu 2025-05-07T19:50:30.3960066Z gen_embedding_backward_adagrad_split_unweighted_nobag_cuda.cu 2025-05-07T19:50:30.3960246Z gen_embedding_backward_adagrad_split_unweighted_nobag_kernel_cta.cu 2025-05-07T19:50:30.3960437Z gen_embedding_backward_adagrad_split_unweighted_nobag_kernel_warp.cu 2025-05-07T19:50:30.3960579Z gen_embedding_backward_adagrad_split_unweighted_cuda.cu 2025-05-07T19:50:30.3960743Z gen_embedding_backward_adagrad_split_unweighted_kernel_cta.cu 2025-05-07T19:50:30.3960920Z gen_embedding_backward_adagrad_split_unweighted_kernel_warp.cu 2025-05-07T19:50:30.3961076Z gen_embedding_backward_rowwise_adagrad_split_weighted_cuda.cu 2025-05-07T19:50:30.3961257Z gen_embedding_backward_rowwise_adagrad_split_weighted_kernel_cta.cu 2025-05-07T19:50:30.3961450Z gen_embedding_backward_rowwise_adagrad_split_weighted_kernel_warp.cu 2025-05-07T19:50:30.3961632Z gen_embedding_backward_rowwise_adagrad_split_unweighted_nobag_cuda.cu 2025-05-07T19:50:30.3961837Z gen_embedding_backward_rowwise_adagrad_split_unweighted_nobag_kernel_cta.cu 2025-05-07T19:50:30.3962046Z gen_embedding_backward_rowwise_adagrad_split_unweighted_nobag_kernel_warp.cu 2025-05-07T19:50:30.3962223Z gen_embedding_backward_rowwise_adagrad_split_unweighted_cuda.cu 2025-05-07T19:50:30.3962408Z gen_embedding_backward_rowwise_adagrad_split_unweighted_kernel_cta.cu 2025-05-07T19:50:30.3962600Z gen_embedding_backward_rowwise_adagrad_split_unweighted_kernel_warp.cu 2025-05-07T19:50:30.3962778Z gen_embedding_backward_sgd_split_weighted_cuda.cu 2025-05-07T19:50:30.3962918Z gen_embedding_backward_sgd_split_weighted_kernel_cta.cu 2025-05-07T19:50:30.3963222Z gen_embedding_backward_sgd_split_weighted_kernel_warp.cu 2025-05-07T19:50:30.3963376Z gen_embedding_backward_sgd_split_unweighted_nobag_cuda.cu 2025-05-07T19:50:30.3963542Z gen_embedding_backward_sgd_split_unweighted_nobag_kernel_cta.cu 2025-05-07T19:50:30.3963712Z gen_embedding_backward_sgd_split_unweighted_nobag_kernel_warp.cu 2025-05-07T19:50:30.3963835Z gen_embedding_backward_sgd_split_unweighted_cuda.cu 2025-05-07T19:50:30.3963984Z gen_embedding_backward_sgd_split_unweighted_kernel_cta.cu 2025-05-07T19:50:30.3964131Z gen_embedding_backward_sgd_split_unweighted_kernel_warp.cu 2025-05-07T19:50:30.3964249Z gen_embedding_backward_adam_split_weighted_cuda.cu 2025-05-07T19:50:30.3964397Z gen_embedding_backward_adam_split_weighted_kernel_cta.cu 2025-05-07T19:50:30.3964608Z gen_embedding_backward_adam_split_weighted_kernel_warp.cu 2025-05-07T19:50:30.3964757Z gen_embedding_backward_adam_split_unweighted_nobag_cuda.cu 2025-05-07T19:50:30.3964931Z gen_embedding_backward_adam_split_unweighted_nobag_kernel_cta.cu 2025-05-07T19:50:30.3965100Z gen_embedding_backward_adam_split_unweighted_nobag_kernel_warp.cu 2025-05-07T19:50:30.3965229Z gen_embedding_backward_adam_split_unweighted_cuda.cu 2025-05-07T19:50:30.3965375Z gen_embedding_backward_adam_split_unweighted_kernel_cta.cu 2025-05-07T19:50:30.3965530Z gen_embedding_backward_adam_split_unweighted_kernel_warp.cu 2025-05-07T19:50:30.3965652Z gen_embedding_backward_lamb_split_weighted_cuda.cu 2025-05-07T19:50:30.3965790Z gen_embedding_backward_lamb_split_weighted_kernel_cta.cu 2025-05-07T19:50:30.3965941Z gen_embedding_backward_lamb_split_weighted_kernel_warp.cu 2025-05-07T19:50:30.3966086Z gen_embedding_backward_lamb_split_unweighted_nobag_cuda.cu 2025-05-07T19:50:30.3966259Z gen_embedding_backward_lamb_split_unweighted_nobag_kernel_cta.cu 2025-05-07T19:50:30.3966449Z gen_embedding_backward_lamb_split_unweighted_nobag_kernel_warp.cu 2025-05-07T19:50:30.3966579Z gen_embedding_backward_lamb_split_unweighted_cuda.cu 2025-05-07T19:50:30.3966727Z gen_embedding_backward_lamb_split_unweighted_kernel_cta.cu 2025-05-07T19:50:30.3966877Z gen_embedding_backward_lamb_split_unweighted_kernel_warp.cu 2025-05-07T19:50:30.3967016Z gen_embedding_backward_lars_sgd_split_weighted_cuda.cu 2025-05-07T19:50:30.3967168Z gen_embedding_backward_lars_sgd_split_weighted_kernel_cta.cu 2025-05-07T19:50:30.3967328Z gen_embedding_backward_lars_sgd_split_weighted_kernel_warp.cu 2025-05-07T19:50:30.3967497Z gen_embedding_backward_lars_sgd_split_unweighted_nobag_cuda.cu 2025-05-07T19:50:30.3967677Z gen_embedding_backward_lars_sgd_split_unweighted_nobag_kernel_cta.cu 2025-05-07T19:50:30.3967863Z gen_embedding_backward_lars_sgd_split_unweighted_nobag_kernel_warp.cu 2025-05-07T19:50:30.3968008Z gen_embedding_backward_lars_sgd_split_unweighted_cuda.cu 2025-05-07T19:50:30.3968172Z gen_embedding_backward_lars_sgd_split_unweighted_kernel_cta.cu 2025-05-07T19:50:30.3968342Z gen_embedding_backward_lars_sgd_split_unweighted_kernel_warp.cu 2025-05-07T19:50:30.3968521Z gen_embedding_backward_partial_rowwise_adam_split_weighted_cuda.cu 2025-05-07T19:50:30.3968719Z gen_embedding_backward_partial_rowwise_adam_split_weighted_kernel_cta.cu 2025-05-07T19:50:30.3968917Z gen_embedding_backward_partial_rowwise_adam_split_weighted_kernel_warp.cu 2025-05-07T19:50:30.3969119Z gen_embedding_backward_partial_rowwise_adam_split_unweighted_nobag_cuda.cu 2025-05-07T19:50:30.3969355Z gen_embedding_backward_partial_rowwise_adam_split_unweighted_nobag_kernel_cta.cu 2025-05-07T19:50:30.3969581Z gen_embedding_backward_partial_rowwise_adam_split_unweighted_nobag_kernel_warp.cu 2025-05-07T19:50:30.3969765Z gen_embedding_backward_partial_rowwise_adam_split_unweighted_cuda.cu 2025-05-07T19:50:30.3969971Z gen_embedding_backward_partial_rowwise_adam_split_unweighted_kernel_cta.cu 2025-05-07T19:50:30.3970262Z gen_embedding_backward_partial_rowwise_adam_split_unweighted_kernel_warp.cu 2025-05-07T19:50:30.3970494Z gen_embedding_backward_partial_rowwise_lamb_split_weighted_cuda.cu 2025-05-07T19:50:30.3970883Z gen_embedding_backward_partial_rowwise_lamb_split_weighted_kernel_cta.cu 2025-05-07T19:50:30.3971098Z gen_embedding_backward_partial_rowwise_lamb_split_weighted_kernel_warp.cu 2025-05-07T19:50:30.3971317Z gen_embedding_backward_partial_rowwise_lamb_split_unweighted_nobag_cuda.cu 2025-05-07T19:50:30.3971568Z gen_embedding_backward_partial_rowwise_lamb_split_unweighted_nobag_kernel_cta.cu 2025-05-07T19:50:30.3971810Z gen_embedding_backward_partial_rowwise_lamb_split_unweighted_nobag_kernel_warp.cu 2025-05-07T19:50:30.3972008Z gen_embedding_backward_partial_rowwise_lamb_split_unweighted_cuda.cu 2025-05-07T19:50:30.3972227Z gen_embedding_backward_partial_rowwise_lamb_split_unweighted_kernel_cta.cu 2025-05-07T19:50:30.3972455Z gen_embedding_backward_partial_rowwise_lamb_split_unweighted_kernel_warp.cu 2025-05-07T19:50:30.3972647Z gen_embedding_backward_none_split_weighted_cuda.cu 2025-05-07T19:50:30.3972806Z gen_embedding_backward_none_split_weighted_kernel_cta.cu 2025-05-07T19:50:30.3972967Z gen_embedding_backward_none_split_weighted_kernel_warp.cu 2025-05-07T19:50:30.3973125Z gen_embedding_backward_none_split_unweighted_nobag_cuda.cu 2025-05-07T19:50:30.3973308Z gen_embedding_backward_none_split_unweighted_nobag_kernel_cta.cu 2025-05-07T19:50:30.3973502Z gen_embedding_backward_none_split_unweighted_nobag_kernel_warp.cu 2025-05-07T19:50:30.3973641Z gen_embedding_backward_none_split_unweighted_cuda.cu 2025-05-07T19:50:30.3973799Z gen_embedding_backward_none_split_unweighted_kernel_cta.cu 2025-05-07T19:50:30.3973964Z gen_embedding_backward_none_split_unweighted_kernel_warp.cu 2025-05-07T19:50:30.3974189Z gen_embedding_backward_rowwise_adagrad_with_counter_split_weighted_cuda.cu 2025-05-07T19:50:30.3974429Z gen_embedding_backward_rowwise_adagrad_with_counter_split_weighted_kernel_cta.cu 2025-05-07T19:50:30.3974675Z gen_embedding_backward_rowwise_adagrad_with_counter_split_weighted_kernel_warp.cu 2025-05-07T19:50:30.3974937Z gen_embedding_backward_rowwise_adagrad_with_counter_split_unweighted_nobag_cuda.cu 2025-05-07T19:50:30.3975207Z gen_embedding_backward_rowwise_adagrad_with_counter_split_unweighted_nobag_kernel_cta.cu 2025-05-07T19:50:30.3975481Z gen_embedding_backward_rowwise_adagrad_with_counter_split_unweighted_nobag_kernel_warp.cu 2025-05-07T19:50:30.3975715Z gen_embedding_backward_rowwise_adagrad_with_counter_split_unweighted_cuda.cu 2025-05-07T19:50:30.3975959Z gen_embedding_backward_rowwise_adagrad_with_counter_split_unweighted_kernel_cta.cu 2025-05-07T19:50:30.3976206Z gen_embedding_backward_rowwise_adagrad_with_counter_split_unweighted_kernel_warp.cu 2025-05-07T19:50:30.3976282Z 2025-05-07T19:50:30.3976367Z HIPified Source Files: 2025-05-07T19:50:30.3976372Z 2025-05-07T19:50:30.3976439Z 2025-05-07T19:50:30.3976524Z Library Dependencies: 2025-05-07T19:50:30.3976605Z torch 2025-05-07T19:50:30.3976680Z torch_library 2025-05-07T19:50:30.3976987Z /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/lib/libc10.so 2025-05-07T19:50:30.3977241Z /github/home/miniconda/envs/build_binary/targets/x86_64-linux/lib/libnvrtc.so 2025-05-07T19:50:30.3977563Z /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/lib/libc10_cuda.so 2025-05-07T19:50:30.3977906Z /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/lib/libnccl.so.2 2025-05-07T19:50:30.3978175Z /github/home/miniconda/envs/build_binary/targets/x86_64-linux/lib/stubs/libcuda.so 2025-05-07T19:50:30.3978242Z fbgemm 2025-05-07T19:50:30.3978321Z fbgemm_gpu_config 2025-05-07T19:50:30.3978402Z fbgemm_gpu_tbe_cache 2025-05-07T19:50:30.3978488Z fbgemm_gpu_tbe_common 2025-05-07T19:50:30.3978567Z fbgemm_gpu_tbe_utils 2025-05-07T19:50:30.3978660Z fbgemm_gpu_sparse_async_cumsum 2025-05-07T19:50:30.3978872Z /github/home/miniconda/envs/build_binary/lib/stubs/libnvidia-ml.so 2025-05-07T19:50:30.3978943Z 2025-05-07T19:50:30.3979022Z Output Library: 2025-05-07T19:50:30.3979182Z fbgemm_gpu_tbe_training_backward 2025-05-07T19:50:30.3979260Z 2025-05-07T19:50:30.3979347Z Destination Directory: 2025-05-07T19:50:30.3979420Z fbgemm_gpu 2025-05-07T19:50:30.3979529Z ================================================================================ 2025-05-07T19:50:30.3979534Z 2025-05-07T19:50:30.3979538Z 2025-05-07T19:50:30.3979542Z 2025-05-07T19:50:30.3979641Z ================================================================================ 2025-05-07T19:50:30.3979837Z GPU CPP Library Target: fbgemm_gpu_tbe_training_backward_gwd (SHARED) 2025-05-07T19:50:30.3979912Z 2025-05-07T19:50:30.3979983Z CPU_SRCS: 2025-05-07T19:50:30.3979987Z 2025-05-07T19:50:30.3980052Z 2025-05-07T19:50:30.3980127Z GPU_SRCS: 2025-05-07T19:50:30.3980325Z gen_embedding_backward_rowwise_adagrad_split_weighted_gwd_cuda.cu 2025-05-07T19:50:30.3980535Z gen_embedding_backward_rowwise_adagrad_split_weighted_gwd_kernel_cta.cu 2025-05-07T19:50:30.3980804Z gen_embedding_backward_rowwise_adagrad_split_weighted_gwd_kernel_warp.cu 2025-05-07T19:50:30.3981010Z gen_embedding_backward_rowwise_adagrad_split_unweighted_gwd_cuda.cu 2025-05-07T19:50:30.3981228Z gen_embedding_backward_rowwise_adagrad_split_unweighted_gwd_kernel_cta.cu 2025-05-07T19:50:30.3981445Z gen_embedding_backward_rowwise_adagrad_split_unweighted_gwd_kernel_warp.cu 2025-05-07T19:50:30.3981647Z gen_embedding_backward_rowwise_adagrad_split_weighted_vbe_gwd_cuda.cu 2025-05-07T19:50:30.3981865Z gen_embedding_backward_rowwise_adagrad_split_weighted_vbe_gwd_kernel_cta.cu 2025-05-07T19:50:30.3982087Z gen_embedding_backward_rowwise_adagrad_split_weighted_vbe_gwd_kernel_warp.cu 2025-05-07T19:50:30.3982296Z gen_embedding_backward_rowwise_adagrad_split_unweighted_vbe_gwd_cuda.cu 2025-05-07T19:50:30.3982531Z gen_embedding_backward_rowwise_adagrad_split_unweighted_vbe_gwd_kernel_cta.cu 2025-05-07T19:50:30.3982763Z gen_embedding_backward_rowwise_adagrad_split_unweighted_vbe_gwd_kernel_warp.cu 2025-05-07T19:50:30.3982831Z 2025-05-07T19:50:30.3983033Z CUDA_SPECIFIC_SRCS: 2025-05-07T19:50:30.3983038Z 2025-05-07T19:50:30.3983102Z 2025-05-07T19:50:30.3983175Z HIP_SPECIFIC_SRCS: 2025-05-07T19:50:30.3983179Z 2025-05-07T19:50:30.3983247Z 2025-05-07T19:50:30.3983313Z OTHER_SRCS: 2025-05-07T19:50:30.3983317Z 2025-05-07T19:50:30.3983378Z 2025-05-07T19:50:30.3983446Z CC_FLAGS: 2025-05-07T19:50:30.3983450Z 2025-05-07T19:50:30.3983521Z 2025-05-07T19:50:30.3983592Z NVCC_FLAGS: 2025-05-07T19:50:30.3983681Z --expt-relaxed-constexpr 2025-05-07T19:50:30.3983773Z -D__CUDA_NO_HALF_OPERATORS__ 2025-05-07T19:50:30.3983863Z -D__CUDA_NO_BFLOAT16_CONVERSIONS__ 2025-05-07T19:50:30.3983950Z -D__CUDA_NO_HALF2_OPERATORS__ 2025-05-07T19:50:30.3984015Z 2025-05-07T19:50:30.3984091Z HIPCC_FLAGS: 2025-05-07T19:50:30.3984095Z 2025-05-07T19:50:30.3984155Z 2025-05-07T19:50:30.3984222Z INCLUDE_DIRS: 2025-05-07T19:50:30.3984325Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/../include 2025-05-07T19:50:30.3984408Z /__w/FBGEMM/FBGEMM/fbgemm_gpu 2025-05-07T19:50:30.3984496Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include 2025-05-07T19:50:30.3984592Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/../include 2025-05-07T19:50:30.3984849Z /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include 2025-05-07T19:50:30.3985206Z /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include 2025-05-07T19:50:30.3985334Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src 2025-05-07T19:50:30.3985483Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include 2025-05-07T19:50:30.3985619Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include 2025-05-07T19:50:30.3985804Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include 2025-05-07T19:50:30.3985991Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include 2025-05-07T19:50:30.3986119Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include 2025-05-07T19:50:30.3986398Z /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include 2025-05-07T19:50:30.3986469Z 2025-05-07T19:50:30.3986546Z Selected Source Files: 2025-05-07T19:50:30.3986767Z gen_embedding_backward_rowwise_adagrad_split_weighted_gwd_cuda.cu 2025-05-07T19:50:30.3986963Z gen_embedding_backward_rowwise_adagrad_split_weighted_gwd_kernel_cta.cu 2025-05-07T19:50:30.3987167Z gen_embedding_backward_rowwise_adagrad_split_weighted_gwd_kernel_warp.cu 2025-05-07T19:50:30.3987345Z gen_embedding_backward_rowwise_adagrad_split_unweighted_gwd_cuda.cu 2025-05-07T19:50:30.3987542Z gen_embedding_backward_rowwise_adagrad_split_unweighted_gwd_kernel_cta.cu 2025-05-07T19:50:30.3987748Z gen_embedding_backward_rowwise_adagrad_split_unweighted_gwd_kernel_warp.cu 2025-05-07T19:50:30.3987933Z gen_embedding_backward_rowwise_adagrad_split_weighted_vbe_gwd_cuda.cu 2025-05-07T19:50:30.3988134Z gen_embedding_backward_rowwise_adagrad_split_weighted_vbe_gwd_kernel_cta.cu 2025-05-07T19:50:30.3988351Z gen_embedding_backward_rowwise_adagrad_split_weighted_vbe_gwd_kernel_warp.cu 2025-05-07T19:50:30.3988596Z gen_embedding_backward_rowwise_adagrad_split_unweighted_vbe_gwd_cuda.cu 2025-05-07T19:50:30.3988813Z gen_embedding_backward_rowwise_adagrad_split_unweighted_vbe_gwd_kernel_cta.cu 2025-05-07T19:50:30.3989030Z gen_embedding_backward_rowwise_adagrad_split_unweighted_vbe_gwd_kernel_warp.cu 2025-05-07T19:50:30.3989105Z 2025-05-07T19:50:30.3989182Z HIPified Source Files: 2025-05-07T19:50:30.3989186Z 2025-05-07T19:50:30.3989247Z 2025-05-07T19:50:30.3989337Z Library Dependencies: 2025-05-07T19:50:30.3989401Z torch 2025-05-07T19:50:30.3989469Z torch_library 2025-05-07T19:50:30.3989747Z /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/lib/libc10.so 2025-05-07T19:50:30.3989978Z /github/home/miniconda/envs/build_binary/targets/x86_64-linux/lib/libnvrtc.so 2025-05-07T19:50:30.3990274Z /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/lib/libc10_cuda.so 2025-05-07T19:50:30.3990589Z /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/lib/libnccl.so.2 2025-05-07T19:50:30.3990845Z /github/home/miniconda/envs/build_binary/targets/x86_64-linux/lib/stubs/libcuda.so 2025-05-07T19:50:30.3990935Z fbgemm_gpu_tbe_training_backward 2025-05-07T19:50:30.3991125Z /github/home/miniconda/envs/build_binary/lib/stubs/libnvidia-ml.so 2025-05-07T19:50:30.3991193Z 2025-05-07T19:50:30.3991263Z Output Library: 2025-05-07T19:50:30.3991355Z fbgemm_gpu_tbe_training_backward_gwd 2025-05-07T19:50:30.3991416Z 2025-05-07T19:50:30.3991504Z Destination Directory: 2025-05-07T19:50:30.3991572Z fbgemm_gpu 2025-05-07T19:50:30.3991668Z ================================================================================ 2025-05-07T19:50:30.3991672Z 2025-05-07T19:50:30.3991676Z 2025-05-07T19:50:30.3991679Z 2025-05-07T19:50:30.3991776Z ================================================================================ 2025-05-07T19:50:30.3991956Z GPU CPP Library Target: fbgemm_gpu_tbe_training_backward_vbe (SHARED) 2025-05-07T19:50:30.3992023Z 2025-05-07T19:50:30.3992096Z CPU_SRCS: 2025-05-07T19:50:30.3992100Z 2025-05-07T19:50:30.3992162Z 2025-05-07T19:50:30.3992232Z GPU_SRCS: 2025-05-07T19:50:30.3992415Z gen_embedding_backward_rowwise_adagrad_split_weighted_vbe_meta.cpp 2025-05-07T19:50:30.3992584Z gen_embedding_backward_rowwise_adagrad_ssd_weighted_vbe_meta.cpp 2025-05-07T19:50:30.3992767Z gen_embedding_backward_rowwise_adagrad_split_unweighted_vbe_meta.cpp 2025-05-07T19:50:30.3992938Z gen_embedding_backward_rowwise_adagrad_ssd_unweighted_vbe_meta.cpp 2025-05-07T19:50:30.3993164Z gen_embedding_backward_rowwise_adagrad_with_counter_split_weighted_vbe_meta.cpp 2025-05-07T19:50:30.3993386Z gen_embedding_backward_rowwise_adagrad_with_counter_split_unweighted_vbe_meta.cpp 2025-05-07T19:50:30.3993525Z gen_embedding_backward_sgd_split_weighted_vbe_meta.cpp 2025-05-07T19:50:30.3993676Z gen_embedding_backward_sgd_split_unweighted_vbe_meta.cpp 2025-05-07T19:50:30.3993814Z gen_embedding_backward_dense_split_weighted_vbe_meta.cpp 2025-05-07T19:50:30.3993963Z gen_embedding_backward_dense_split_unweighted_vbe_meta.cpp 2025-05-07T19:50:30.3994187Z gen_embedding_backward_adam_split_weighted_vbe_meta.cpp 2025-05-07T19:50:30.3994334Z gen_embedding_backward_adam_split_unweighted_vbe_meta.cpp 2025-05-07T19:50:30.3994505Z gen_embedding_backward_rowwise_adagrad_split_weighted_vbe_cuda.cu 2025-05-07T19:50:30.3994701Z gen_embedding_backward_rowwise_adagrad_split_weighted_vbe_kernel_cta.cu 2025-05-07T19:50:30.3994902Z gen_embedding_backward_rowwise_adagrad_split_weighted_vbe_kernel_warp.cu 2025-05-07T19:50:30.3995067Z gen_embedding_backward_rowwise_adagrad_ssd_weighted_vbe_cuda.cu 2025-05-07T19:50:30.3995250Z gen_embedding_backward_rowwise_adagrad_ssd_weighted_vbe_kernel_cta.cu 2025-05-07T19:50:30.3995446Z gen_embedding_backward_rowwise_adagrad_ssd_weighted_vbe_kernel_warp.cu 2025-05-07T19:50:30.3995622Z gen_embedding_backward_rowwise_adagrad_split_unweighted_vbe_cuda.cu 2025-05-07T19:50:30.3995822Z gen_embedding_backward_rowwise_adagrad_split_unweighted_vbe_kernel_cta.cu 2025-05-07T19:50:30.3996087Z gen_embedding_backward_rowwise_adagrad_split_unweighted_vbe_kernel_warp.cu 2025-05-07T19:50:30.3996262Z gen_embedding_backward_rowwise_adagrad_ssd_unweighted_vbe_cuda.cu 2025-05-07T19:50:30.3996454Z gen_embedding_backward_rowwise_adagrad_ssd_unweighted_vbe_kernel_cta.cu 2025-05-07T19:50:30.3996656Z gen_embedding_backward_rowwise_adagrad_ssd_unweighted_vbe_kernel_warp.cu 2025-05-07T19:50:30.3996869Z gen_embedding_backward_rowwise_adagrad_with_counter_split_weighted_vbe_cuda.cu 2025-05-07T19:50:30.3997104Z gen_embedding_backward_rowwise_adagrad_with_counter_split_weighted_vbe_kernel_cta.cu 2025-05-07T19:50:30.3997341Z gen_embedding_backward_rowwise_adagrad_with_counter_split_weighted_vbe_kernel_warp.cu 2025-05-07T19:50:30.3997570Z gen_embedding_backward_rowwise_adagrad_with_counter_split_unweighted_vbe_cuda.cu 2025-05-07T19:50:30.3997808Z gen_embedding_backward_rowwise_adagrad_with_counter_split_unweighted_vbe_kernel_cta.cu 2025-05-07T19:50:30.3998051Z gen_embedding_backward_rowwise_adagrad_with_counter_split_unweighted_vbe_kernel_warp.cu 2025-05-07T19:50:30.3998198Z gen_embedding_backward_sgd_split_weighted_vbe_cuda.cu 2025-05-07T19:50:30.3998348Z gen_embedding_backward_sgd_split_weighted_vbe_kernel_cta.cu 2025-05-07T19:50:30.3998503Z gen_embedding_backward_sgd_split_weighted_vbe_kernel_warp.cu 2025-05-07T19:50:30.3998647Z gen_embedding_backward_sgd_split_unweighted_vbe_cuda.cu 2025-05-07T19:50:30.3998807Z gen_embedding_backward_sgd_split_unweighted_vbe_kernel_cta.cu 2025-05-07T19:50:30.3998969Z gen_embedding_backward_sgd_split_unweighted_vbe_kernel_warp.cu 2025-05-07T19:50:30.3999103Z gen_embedding_backward_dense_split_weighted_vbe_cuda.cu 2025-05-07T19:50:30.3999269Z gen_embedding_backward_dense_split_weighted_vbe_kernel_cta.cu 2025-05-07T19:50:30.3999428Z gen_embedding_backward_dense_split_weighted_vbe_kernel_warp.cu 2025-05-07T19:50:30.3999570Z gen_embedding_backward_dense_split_unweighted_vbe_cuda.cu 2025-05-07T19:50:30.3999744Z gen_embedding_backward_dense_split_unweighted_vbe_kernel_cta.cu 2025-05-07T19:50:30.3999917Z gen_embedding_backward_dense_split_unweighted_vbe_kernel_warp.cu 2025-05-07T19:50:30.4000054Z gen_embedding_backward_adam_split_weighted_vbe_cuda.cu 2025-05-07T19:50:30.4000216Z gen_embedding_backward_adam_split_weighted_vbe_kernel_cta.cu 2025-05-07T19:50:30.4000374Z gen_embedding_backward_adam_split_weighted_vbe_kernel_warp.cu 2025-05-07T19:50:30.4000516Z gen_embedding_backward_adam_split_unweighted_vbe_cuda.cu 2025-05-07T19:50:30.4000686Z gen_embedding_backward_adam_split_unweighted_vbe_kernel_cta.cu 2025-05-07T19:50:30.4000852Z gen_embedding_backward_adam_split_unweighted_vbe_kernel_warp.cu 2025-05-07T19:50:30.4000916Z 2025-05-07T19:50:30.4000993Z CUDA_SPECIFIC_SRCS: 2025-05-07T19:50:30.4000997Z 2025-05-07T19:50:30.4001068Z 2025-05-07T19:50:30.4001139Z HIP_SPECIFIC_SRCS: 2025-05-07T19:50:30.4001143Z 2025-05-07T19:50:30.4001204Z 2025-05-07T19:50:30.4001281Z OTHER_SRCS: 2025-05-07T19:50:30.4001284Z 2025-05-07T19:50:30.4001346Z 2025-05-07T19:50:30.4001414Z CC_FLAGS: 2025-05-07T19:50:30.4001418Z 2025-05-07T19:50:30.4001481Z 2025-05-07T19:50:30.4001557Z NVCC_FLAGS: 2025-05-07T19:50:30.4001691Z --expt-relaxed-constexpr 2025-05-07T19:50:30.4001776Z -D__CUDA_NO_HALF_OPERATORS__ 2025-05-07T19:50:30.4001876Z -D__CUDA_NO_BFLOAT16_CONVERSIONS__ 2025-05-07T19:50:30.4001958Z -D__CUDA_NO_HALF2_OPERATORS__ 2025-05-07T19:50:30.4002022Z 2025-05-07T19:50:30.4002093Z HIPCC_FLAGS: 2025-05-07T19:50:30.4002097Z 2025-05-07T19:50:30.4002167Z 2025-05-07T19:50:30.4002237Z INCLUDE_DIRS: 2025-05-07T19:50:30.4002333Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/../include 2025-05-07T19:50:30.4002427Z /__w/FBGEMM/FBGEMM/fbgemm_gpu 2025-05-07T19:50:30.4002518Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include 2025-05-07T19:50:30.4002609Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/../include 2025-05-07T19:50:30.4002865Z /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include 2025-05-07T19:50:30.4003234Z /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include 2025-05-07T19:50:30.4003413Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src 2025-05-07T19:50:30.4003558Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include 2025-05-07T19:50:30.4003708Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include 2025-05-07T19:50:30.4003889Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include 2025-05-07T19:50:30.4004065Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include 2025-05-07T19:50:30.4004202Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include 2025-05-07T19:50:30.4004482Z /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include 2025-05-07T19:50:30.4004546Z 2025-05-07T19:50:30.4004629Z Selected Source Files: 2025-05-07T19:50:30.4004807Z gen_embedding_backward_rowwise_adagrad_split_weighted_vbe_meta.cpp 2025-05-07T19:50:30.4004978Z gen_embedding_backward_rowwise_adagrad_ssd_weighted_vbe_meta.cpp 2025-05-07T19:50:30.4005160Z gen_embedding_backward_rowwise_adagrad_split_unweighted_vbe_meta.cpp 2025-05-07T19:50:30.4005342Z gen_embedding_backward_rowwise_adagrad_ssd_unweighted_vbe_meta.cpp 2025-05-07T19:50:30.4005571Z gen_embedding_backward_rowwise_adagrad_with_counter_split_weighted_vbe_meta.cpp 2025-05-07T19:50:30.4005807Z gen_embedding_backward_rowwise_adagrad_with_counter_split_unweighted_vbe_meta.cpp 2025-05-07T19:50:30.4005944Z gen_embedding_backward_sgd_split_weighted_vbe_meta.cpp 2025-05-07T19:50:30.4006085Z gen_embedding_backward_sgd_split_unweighted_vbe_meta.cpp 2025-05-07T19:50:30.4006231Z gen_embedding_backward_dense_split_weighted_vbe_meta.cpp 2025-05-07T19:50:30.4006377Z gen_embedding_backward_dense_split_unweighted_vbe_meta.cpp 2025-05-07T19:50:30.4006512Z gen_embedding_backward_adam_split_weighted_vbe_meta.cpp 2025-05-07T19:50:30.4006663Z gen_embedding_backward_adam_split_unweighted_vbe_meta.cpp 2025-05-07T19:50:30.4006834Z gen_embedding_backward_rowwise_adagrad_split_weighted_vbe_cuda.cu 2025-05-07T19:50:30.4007033Z gen_embedding_backward_rowwise_adagrad_split_weighted_vbe_kernel_cta.cu 2025-05-07T19:50:30.4007236Z gen_embedding_backward_rowwise_adagrad_split_weighted_vbe_kernel_warp.cu 2025-05-07T19:50:30.4007412Z gen_embedding_backward_rowwise_adagrad_ssd_weighted_vbe_cuda.cu 2025-05-07T19:50:30.4007599Z gen_embedding_backward_rowwise_adagrad_ssd_weighted_vbe_kernel_cta.cu 2025-05-07T19:50:30.4007788Z gen_embedding_backward_rowwise_adagrad_ssd_weighted_vbe_kernel_warp.cu 2025-05-07T19:50:30.4007973Z gen_embedding_backward_rowwise_adagrad_split_unweighted_vbe_cuda.cu 2025-05-07T19:50:30.4008174Z gen_embedding_backward_rowwise_adagrad_split_unweighted_vbe_kernel_cta.cu 2025-05-07T19:50:30.4008380Z gen_embedding_backward_rowwise_adagrad_split_unweighted_vbe_kernel_warp.cu 2025-05-07T19:50:30.4008556Z gen_embedding_backward_rowwise_adagrad_ssd_unweighted_vbe_cuda.cu 2025-05-07T19:50:30.4008747Z gen_embedding_backward_rowwise_adagrad_ssd_unweighted_vbe_kernel_cta.cu 2025-05-07T19:50:30.4008943Z gen_embedding_backward_rowwise_adagrad_ssd_unweighted_vbe_kernel_warp.cu 2025-05-07T19:50:30.4009171Z gen_embedding_backward_rowwise_adagrad_with_counter_split_weighted_vbe_cuda.cu 2025-05-07T19:50:30.4009453Z gen_embedding_backward_rowwise_adagrad_with_counter_split_weighted_vbe_kernel_cta.cu 2025-05-07T19:50:30.4009690Z gen_embedding_backward_rowwise_adagrad_with_counter_split_weighted_vbe_kernel_warp.cu 2025-05-07T19:50:30.4009913Z gen_embedding_backward_rowwise_adagrad_with_counter_split_unweighted_vbe_cuda.cu 2025-05-07T19:50:30.4010242Z gen_embedding_backward_rowwise_adagrad_with_counter_split_unweighted_vbe_kernel_cta.cu 2025-05-07T19:50:30.4010491Z gen_embedding_backward_rowwise_adagrad_with_counter_split_unweighted_vbe_kernel_warp.cu 2025-05-07T19:50:30.4010804Z gen_embedding_backward_sgd_split_weighted_vbe_cuda.cu 2025-05-07T19:50:30.4010976Z gen_embedding_backward_sgd_split_weighted_vbe_kernel_cta.cu 2025-05-07T19:50:30.4011143Z gen_embedding_backward_sgd_split_weighted_vbe_kernel_warp.cu 2025-05-07T19:50:30.4011289Z gen_embedding_backward_sgd_split_unweighted_vbe_cuda.cu 2025-05-07T19:50:30.4011536Z gen_embedding_backward_sgd_split_unweighted_vbe_kernel_cta.cu 2025-05-07T19:50:30.4011717Z gen_embedding_backward_sgd_split_unweighted_vbe_kernel_warp.cu 2025-05-07T19:50:30.4011861Z gen_embedding_backward_dense_split_weighted_vbe_cuda.cu 2025-05-07T19:50:30.4012043Z gen_embedding_backward_dense_split_weighted_vbe_kernel_cta.cu 2025-05-07T19:50:30.4012217Z gen_embedding_backward_dense_split_weighted_vbe_kernel_warp.cu 2025-05-07T19:50:30.4012371Z gen_embedding_backward_dense_split_unweighted_vbe_cuda.cu 2025-05-07T19:50:30.4012550Z gen_embedding_backward_dense_split_unweighted_vbe_kernel_cta.cu 2025-05-07T19:50:30.4012738Z gen_embedding_backward_dense_split_unweighted_vbe_kernel_warp.cu 2025-05-07T19:50:30.4012883Z gen_embedding_backward_adam_split_weighted_vbe_cuda.cu 2025-05-07T19:50:30.4013050Z gen_embedding_backward_adam_split_weighted_vbe_kernel_cta.cu 2025-05-07T19:50:30.4013226Z gen_embedding_backward_adam_split_weighted_vbe_kernel_warp.cu 2025-05-07T19:50:30.4013379Z gen_embedding_backward_adam_split_unweighted_vbe_cuda.cu 2025-05-07T19:50:30.4013560Z gen_embedding_backward_adam_split_unweighted_vbe_kernel_cta.cu 2025-05-07T19:50:30.4013750Z gen_embedding_backward_adam_split_unweighted_vbe_kernel_warp.cu 2025-05-07T19:50:30.4013824Z 2025-05-07T19:50:30.4013909Z HIPified Source Files: 2025-05-07T19:50:30.4013914Z 2025-05-07T19:50:30.4013985Z 2025-05-07T19:50:30.4014079Z Library Dependencies: 2025-05-07T19:50:30.4014146Z torch 2025-05-07T19:50:30.4014224Z torch_library 2025-05-07T19:50:30.4014537Z /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/lib/libc10.so 2025-05-07T19:50:30.4014781Z /github/home/miniconda/envs/build_binary/targets/x86_64-linux/lib/libnvrtc.so 2025-05-07T19:50:30.4015109Z /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/lib/libc10_cuda.so 2025-05-07T19:50:30.4015461Z /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/lib/libnccl.so.2 2025-05-07T19:50:30.4015727Z /github/home/miniconda/envs/build_binary/targets/x86_64-linux/lib/stubs/libcuda.so 2025-05-07T19:50:30.4015827Z fbgemm_gpu_tbe_training_backward 2025-05-07T19:50:30.4016034Z /github/home/miniconda/envs/build_binary/lib/stubs/libnvidia-ml.so 2025-05-07T19:50:30.4016109Z 2025-05-07T19:50:30.4016187Z Output Library: 2025-05-07T19:50:30.4016285Z fbgemm_gpu_tbe_training_backward_vbe 2025-05-07T19:50:30.4016362Z 2025-05-07T19:50:30.4016447Z Destination Directory: 2025-05-07T19:50:30.4016520Z fbgemm_gpu 2025-05-07T19:50:30.4016620Z ================================================================================ 2025-05-07T19:50:30.4016625Z 2025-05-07T19:50:30.4016634Z 2025-05-07T19:50:30.4016638Z 2025-05-07T19:50:30.4016737Z ================================================================================ 2025-05-07T19:50:30.4016940Z GPU CPP Library Target: fbgemm_gpu_tbe_training_backward_dense (SHARED) 2025-05-07T19:50:30.4017007Z 2025-05-07T19:50:30.4017087Z CPU_SRCS: 2025-05-07T19:50:30.4017095Z 2025-05-07T19:50:30.4017162Z 2025-05-07T19:50:30.4017232Z GPU_SRCS: 2025-05-07T19:50:30.4017428Z gen_embedding_backward_dense_split_weighted_cuda.cu 2025-05-07T19:50:30.4017573Z gen_embedding_backward_dense_split_weighted_meta.cpp 2025-05-07T19:50:30.4017730Z gen_embedding_backward_dense_split_weighted_kernel_cta.cu 2025-05-07T19:50:30.4017890Z gen_embedding_backward_dense_split_weighted_kernel_warp.cu 2025-05-07T19:50:30.4018066Z gen_embedding_backward_dense_split_unweighted_nobag_cuda.cu 2025-05-07T19:50:30.4018236Z gen_embedding_backward_dense_split_unweighted_nobag_meta.cpp 2025-05-07T19:50:30.4018422Z gen_embedding_backward_dense_split_unweighted_nobag_kernel_cta.cu 2025-05-07T19:50:30.4018620Z gen_embedding_backward_dense_split_unweighted_nobag_kernel_warp.cu 2025-05-07T19:50:30.4018762Z gen_embedding_backward_dense_split_unweighted_cuda.cu 2025-05-07T19:50:30.4018905Z gen_embedding_backward_dense_split_unweighted_meta.cpp 2025-05-07T19:50:30.4019079Z gen_embedding_backward_dense_split_unweighted_kernel_cta.cu 2025-05-07T19:50:30.4019302Z gen_embedding_backward_dense_split_unweighted_kernel_warp.cu 2025-05-07T19:50:30.4019413Z gen_embedding_backward_split_dense.cpp 2025-05-07T19:50:30.4019480Z 2025-05-07T19:50:30.4019572Z CUDA_SPECIFIC_SRCS: 2025-05-07T19:50:30.4019577Z 2025-05-07T19:50:30.4019645Z 2025-05-07T19:50:30.4019726Z HIP_SPECIFIC_SRCS: 2025-05-07T19:50:30.4019730Z 2025-05-07T19:50:30.4019802Z 2025-05-07T19:50:30.4019874Z OTHER_SRCS: 2025-05-07T19:50:30.4019878Z 2025-05-07T19:50:30.4019945Z 2025-05-07T19:50:30.4020029Z CC_FLAGS: 2025-05-07T19:50:30.4020166Z 2025-05-07T19:50:30.4020233Z 2025-05-07T19:50:30.4020305Z NVCC_FLAGS: 2025-05-07T19:50:30.4020398Z --expt-relaxed-constexpr 2025-05-07T19:50:30.4020497Z -D__CUDA_NO_HALF_OPERATORS__ 2025-05-07T19:50:30.4020592Z -D__CUDA_NO_BFLOAT16_CONVERSIONS__ 2025-05-07T19:50:30.4020680Z -D__CUDA_NO_HALF2_OPERATORS__ 2025-05-07T19:50:30.4020755Z 2025-05-07T19:50:30.4020830Z HIPCC_FLAGS: 2025-05-07T19:50:30.4020834Z 2025-05-07T19:50:30.4020904Z 2025-05-07T19:50:30.4020978Z INCLUDE_DIRS: 2025-05-07T19:50:30.4021090Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/../include 2025-05-07T19:50:30.4021184Z /__w/FBGEMM/FBGEMM/fbgemm_gpu 2025-05-07T19:50:30.4021276Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include 2025-05-07T19:50:30.4021380Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/../include 2025-05-07T19:50:30.4021651Z /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include 2025-05-07T19:50:30.4022029Z /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include 2025-05-07T19:50:30.4022173Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src 2025-05-07T19:50:30.4022324Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include 2025-05-07T19:50:30.4022471Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include 2025-05-07T19:50:30.4022661Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include 2025-05-07T19:50:30.4022973Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include 2025-05-07T19:50:30.4023099Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include 2025-05-07T19:50:30.4023379Z /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include 2025-05-07T19:50:30.4023454Z 2025-05-07T19:50:30.4023530Z Selected Source Files: 2025-05-07T19:50:30.4023659Z gen_embedding_backward_dense_split_weighted_meta.cpp 2025-05-07T19:50:30.4023823Z gen_embedding_backward_dense_split_unweighted_nobag_meta.cpp 2025-05-07T19:50:30.4023956Z gen_embedding_backward_dense_split_unweighted_meta.cpp 2025-05-07T19:50:30.4024053Z gen_embedding_backward_split_dense.cpp 2025-05-07T19:50:30.4024173Z gen_embedding_backward_dense_split_weighted_cuda.cu 2025-05-07T19:50:30.4024321Z gen_embedding_backward_dense_split_weighted_kernel_cta.cu 2025-05-07T19:50:30.4024469Z gen_embedding_backward_dense_split_weighted_kernel_warp.cu 2025-05-07T19:50:30.4024623Z gen_embedding_backward_dense_split_unweighted_nobag_cuda.cu 2025-05-07T19:50:30.4024800Z gen_embedding_backward_dense_split_unweighted_nobag_kernel_cta.cu 2025-05-07T19:50:30.4024978Z gen_embedding_backward_dense_split_unweighted_nobag_kernel_warp.cu 2025-05-07T19:50:30.4025178Z gen_embedding_backward_dense_split_unweighted_cuda.cu 2025-05-07T19:50:30.4025341Z gen_embedding_backward_dense_split_unweighted_kernel_cta.cu 2025-05-07T19:50:30.4025494Z gen_embedding_backward_dense_split_unweighted_kernel_warp.cu 2025-05-07T19:50:30.4025555Z 2025-05-07T19:50:30.4025634Z HIPified Source Files: 2025-05-07T19:50:30.4025638Z 2025-05-07T19:50:30.4025704Z 2025-05-07T19:50:30.4025783Z Library Dependencies: 2025-05-07T19:50:30.4025845Z torch 2025-05-07T19:50:30.4025923Z torch_library 2025-05-07T19:50:30.4026199Z /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/lib/libc10.so 2025-05-07T19:50:30.4026423Z /github/home/miniconda/envs/build_binary/targets/x86_64-linux/lib/libnvrtc.so 2025-05-07T19:50:30.4026719Z /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/lib/libc10_cuda.so 2025-05-07T19:50:30.4027095Z /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/lib/libnccl.so.2 2025-05-07T19:50:30.4027341Z /github/home/miniconda/envs/build_binary/targets/x86_64-linux/lib/stubs/libcuda.so 2025-05-07T19:50:30.4027431Z fbgemm_gpu_tbe_training_backward 2025-05-07T19:50:30.4027624Z /github/home/miniconda/envs/build_binary/lib/stubs/libnvidia-ml.so 2025-05-07T19:50:30.4027684Z 2025-05-07T19:50:30.4027756Z Output Library: 2025-05-07T19:50:30.4027860Z fbgemm_gpu_tbe_training_backward_dense 2025-05-07T19:50:30.4027920Z 2025-05-07T19:50:30.4027999Z Destination Directory: 2025-05-07T19:50:30.4028067Z fbgemm_gpu 2025-05-07T19:50:30.4028170Z ================================================================================ 2025-05-07T19:50:30.4028175Z 2025-05-07T19:50:30.4028179Z 2025-05-07T19:50:30.4028182Z 2025-05-07T19:50:30.4028272Z ================================================================================ 2025-05-07T19:50:30.4028652Z GPU CPP Library Target: fbgemm_gpu_tbe_training_backward_split_host (SHARED) 2025-05-07T19:50:30.4028732Z 2025-05-07T19:50:30.4028976Z CPU_SRCS: 2025-05-07T19:50:30.4028980Z 2025-05-07T19:50:30.4029050Z 2025-05-07T19:50:30.4029131Z GPU_SRCS: 2025-05-07T19:50:30.4029240Z gen_embedding_backward_split_adagrad.cpp 2025-05-07T19:50:30.4029367Z gen_embedding_backward_split_rowwise_adagrad.cpp 2025-05-07T19:50:30.4029467Z gen_embedding_backward_split_sgd.cpp 2025-05-07T19:50:30.4029580Z gen_embedding_backward_split_adam.cpp 2025-05-07T19:50:30.4029682Z gen_embedding_backward_split_lamb.cpp 2025-05-07T19:50:30.4029789Z gen_embedding_backward_split_lars_sgd.cpp 2025-05-07T19:50:30.4029943Z gen_embedding_backward_split_partial_rowwise_adam.cpp 2025-05-07T19:50:30.4030083Z gen_embedding_backward_split_partial_rowwise_lamb.cpp 2025-05-07T19:50:30.4030183Z gen_embedding_backward_split_none.cpp 2025-05-07T19:50:30.4030358Z gen_embedding_backward_split_rowwise_adagrad_with_counter.cpp 2025-05-07T19:50:30.4030487Z gen_embedding_backward_split_approx_sgd.cpp 2025-05-07T19:50:30.4030636Z gen_embedding_backward_split_approx_rowwise_adagrad.cpp 2025-05-07T19:50:30.4030841Z gen_embedding_backward_split_approx_rowwise_adagrad_with_counter.cpp 2025-05-07T19:50:30.4031067Z gen_embedding_backward_split_approx_rowwise_adagrad_with_weight_decay.cpp 2025-05-07T19:50:30.4031257Z gen_embedding_backward_split_rowwise_adagrad_with_weight_decay.cpp 2025-05-07T19:50:30.4031410Z gen_embedding_backward_split_rowwise_weighted_adagrad.cpp 2025-05-07T19:50:30.4031534Z gen_embedding_backward_ssd_rowwise_adagrad.cpp 2025-05-07T19:50:30.4031680Z gen_embedding_backward_adagrad_split_weighted_meta.cpp 2025-05-07T19:50:30.4031833Z gen_embedding_backward_adagrad_split_unweighted_meta.cpp 2025-05-07T19:50:30.4032016Z gen_embedding_backward_rowwise_adagrad_split_weighted_meta.cpp 2025-05-07T19:50:30.4032201Z gen_embedding_backward_rowwise_adagrad_split_unweighted_meta.cpp 2025-05-07T19:50:30.4032334Z gen_embedding_backward_sgd_split_weighted_meta.cpp 2025-05-07T19:50:30.4032471Z gen_embedding_backward_sgd_split_unweighted_meta.cpp 2025-05-07T19:50:30.4032617Z gen_embedding_backward_adam_split_weighted_meta.cpp 2025-05-07T19:50:30.4033262Z gen_embedding_backward_adam_split_unweighted_meta.cpp 2025-05-07T19:50:30.4033399Z gen_embedding_backward_lamb_split_weighted_meta.cpp 2025-05-07T19:50:30.4033553Z gen_embedding_backward_lamb_split_unweighted_meta.cpp 2025-05-07T19:50:30.4033700Z gen_embedding_backward_lars_sgd_split_weighted_meta.cpp 2025-05-07T19:50:30.4033857Z gen_embedding_backward_lars_sgd_split_unweighted_meta.cpp 2025-05-07T19:50:30.4034053Z gen_embedding_backward_partial_rowwise_adam_split_weighted_meta.cpp 2025-05-07T19:50:30.4034268Z gen_embedding_backward_partial_rowwise_adam_split_unweighted_meta.cpp 2025-05-07T19:50:30.4034462Z gen_embedding_backward_partial_rowwise_lamb_split_weighted_meta.cpp 2025-05-07T19:50:30.4034663Z gen_embedding_backward_partial_rowwise_lamb_split_unweighted_meta.cpp 2025-05-07T19:50:30.4034807Z gen_embedding_backward_none_split_weighted_meta.cpp 2025-05-07T19:50:30.4035025Z gen_embedding_backward_none_split_unweighted_meta.cpp 2025-05-07T19:50:30.4035256Z gen_embedding_backward_rowwise_adagrad_with_counter_split_weighted_meta.cpp 2025-05-07T19:50:30.4035502Z gen_embedding_backward_rowwise_adagrad_with_counter_split_unweighted_meta.cpp 2025-05-07T19:50:30.4035567Z 2025-05-07T19:50:30.4035651Z CUDA_SPECIFIC_SRCS: 2025-05-07T19:50:30.4035655Z 2025-05-07T19:50:30.4035731Z 2025-05-07T19:50:30.4035809Z HIP_SPECIFIC_SRCS: 2025-05-07T19:50:30.4035814Z 2025-05-07T19:50:30.4035879Z 2025-05-07T19:50:30.4035952Z OTHER_SRCS: 2025-05-07T19:50:30.4035956Z 2025-05-07T19:50:30.4036028Z 2025-05-07T19:50:30.4036100Z CC_FLAGS: 2025-05-07T19:50:30.4036104Z 2025-05-07T19:50:30.4036172Z 2025-05-07T19:50:30.4036254Z NVCC_FLAGS: 2025-05-07T19:50:30.4036347Z --expt-relaxed-constexpr 2025-05-07T19:50:30.4036441Z -D__CUDA_NO_HALF_OPERATORS__ 2025-05-07T19:50:30.4036537Z -D__CUDA_NO_BFLOAT16_CONVERSIONS__ 2025-05-07T19:50:30.4036632Z -D__CUDA_NO_HALF2_OPERATORS__ 2025-05-07T19:50:30.4036700Z 2025-05-07T19:50:30.4036773Z HIPCC_FLAGS: 2025-05-07T19:50:30.4036777Z 2025-05-07T19:50:30.4036848Z 2025-05-07T19:50:30.4036925Z INCLUDE_DIRS: 2025-05-07T19:50:30.4037024Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/../include 2025-05-07T19:50:30.4037111Z /__w/FBGEMM/FBGEMM/fbgemm_gpu 2025-05-07T19:50:30.4037215Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include 2025-05-07T19:50:30.4037313Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/../include 2025-05-07T19:50:30.4037585Z /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include 2025-05-07T19:50:30.4037983Z /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include 2025-05-07T19:50:30.4038122Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src 2025-05-07T19:50:30.4038273Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include 2025-05-07T19:50:30.4038439Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include 2025-05-07T19:50:30.4038637Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include 2025-05-07T19:50:30.4038838Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include 2025-05-07T19:50:30.4038979Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include 2025-05-07T19:50:30.4039276Z /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include 2025-05-07T19:50:30.4039343Z 2025-05-07T19:50:30.4039431Z Selected Source Files: 2025-05-07T19:50:30.4039536Z gen_embedding_backward_split_adagrad.cpp 2025-05-07T19:50:30.4039661Z gen_embedding_backward_split_rowwise_adagrad.cpp 2025-05-07T19:50:30.4039769Z gen_embedding_backward_split_sgd.cpp 2025-05-07T19:50:30.4039866Z gen_embedding_backward_split_adam.cpp 2025-05-07T19:50:30.4039962Z gen_embedding_backward_split_lamb.cpp 2025-05-07T19:50:30.4040066Z gen_embedding_backward_split_lars_sgd.cpp 2025-05-07T19:50:30.4040215Z gen_embedding_backward_split_partial_rowwise_adam.cpp 2025-05-07T19:50:30.4040356Z gen_embedding_backward_split_partial_rowwise_lamb.cpp 2025-05-07T19:50:30.4040455Z gen_embedding_backward_split_none.cpp 2025-05-07T19:50:30.4040641Z gen_embedding_backward_split_rowwise_adagrad_with_counter.cpp 2025-05-07T19:50:30.4040804Z gen_embedding_backward_split_approx_sgd.cpp 2025-05-07T19:50:30.4040953Z gen_embedding_backward_split_approx_rowwise_adagrad.cpp 2025-05-07T19:50:30.4041158Z gen_embedding_backward_split_approx_rowwise_adagrad_with_counter.cpp 2025-05-07T19:50:30.4041373Z gen_embedding_backward_split_approx_rowwise_adagrad_with_weight_decay.cpp 2025-05-07T19:50:30.4041562Z gen_embedding_backward_split_rowwise_adagrad_with_weight_decay.cpp 2025-05-07T19:50:30.4041720Z gen_embedding_backward_split_rowwise_weighted_adagrad.cpp 2025-05-07T19:50:30.4041849Z gen_embedding_backward_ssd_rowwise_adagrad.cpp 2025-05-07T19:50:30.4042105Z gen_embedding_backward_adagrad_split_weighted_meta.cpp 2025-05-07T19:50:30.4042255Z gen_embedding_backward_adagrad_split_unweighted_meta.cpp 2025-05-07T19:50:30.4042433Z gen_embedding_backward_rowwise_adagrad_split_weighted_meta.cpp 2025-05-07T19:50:30.4042614Z gen_embedding_backward_rowwise_adagrad_split_unweighted_meta.cpp 2025-05-07T19:50:30.4042793Z gen_embedding_backward_sgd_split_weighted_meta.cpp 2025-05-07T19:50:30.4042938Z gen_embedding_backward_sgd_split_unweighted_meta.cpp 2025-05-07T19:50:30.4043068Z gen_embedding_backward_adam_split_weighted_meta.cpp 2025-05-07T19:50:30.4043204Z gen_embedding_backward_adam_split_unweighted_meta.cpp 2025-05-07T19:50:30.4043331Z gen_embedding_backward_lamb_split_weighted_meta.cpp 2025-05-07T19:50:30.4043474Z gen_embedding_backward_lamb_split_unweighted_meta.cpp 2025-05-07T19:50:30.4043616Z gen_embedding_backward_lars_sgd_split_weighted_meta.cpp 2025-05-07T19:50:30.4043765Z gen_embedding_backward_lars_sgd_split_unweighted_meta.cpp 2025-05-07T19:50:30.4043963Z gen_embedding_backward_partial_rowwise_adam_split_weighted_meta.cpp 2025-05-07T19:50:30.4044159Z gen_embedding_backward_partial_rowwise_adam_split_unweighted_meta.cpp 2025-05-07T19:50:30.4044346Z gen_embedding_backward_partial_rowwise_lamb_split_weighted_meta.cpp 2025-05-07T19:50:30.4044553Z gen_embedding_backward_partial_rowwise_lamb_split_unweighted_meta.cpp 2025-05-07T19:50:30.4044683Z gen_embedding_backward_none_split_weighted_meta.cpp 2025-05-07T19:50:30.4044821Z gen_embedding_backward_none_split_unweighted_meta.cpp 2025-05-07T19:50:30.4045037Z gen_embedding_backward_rowwise_adagrad_with_counter_split_weighted_meta.cpp 2025-05-07T19:50:30.4045269Z gen_embedding_backward_rowwise_adagrad_with_counter_split_unweighted_meta.cpp 2025-05-07T19:50:30.4045336Z 2025-05-07T19:50:30.4045419Z HIPified Source Files: 2025-05-07T19:50:30.4045424Z 2025-05-07T19:50:30.4045497Z 2025-05-07T19:50:30.4045577Z Library Dependencies: 2025-05-07T19:50:30.4045644Z torch 2025-05-07T19:50:30.4045718Z torch_library 2025-05-07T19:50:30.4046020Z /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/lib/libc10.so 2025-05-07T19:50:30.4046258Z /github/home/miniconda/envs/build_binary/targets/x86_64-linux/lib/libnvrtc.so 2025-05-07T19:50:30.4046573Z /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/lib/libc10_cuda.so 2025-05-07T19:50:30.4046924Z /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/lib/libnccl.so.2 2025-05-07T19:50:30.4047297Z /github/home/miniconda/envs/build_binary/targets/x86_64-linux/lib/stubs/libcuda.so 2025-05-07T19:50:30.4047370Z fbgemm_gpu_config 2025-05-07T19:50:30.4047452Z fbgemm_gpu_tbe_utils 2025-05-07T19:50:30.4047639Z /github/home/miniconda/envs/build_binary/lib/stubs/libnvidia-ml.so 2025-05-07T19:50:30.4047699Z 2025-05-07T19:50:30.4047771Z Output Library: 2025-05-07T19:50:30.4047880Z fbgemm_gpu_tbe_training_backward_split_host 2025-05-07T19:50:30.4047941Z 2025-05-07T19:50:30.4048020Z Destination Directory: 2025-05-07T19:50:30.4048097Z fbgemm_gpu 2025-05-07T19:50:30.4048190Z ================================================================================ 2025-05-07T19:50:30.4048194Z 2025-05-07T19:50:30.4048198Z 2025-05-07T19:50:30.4048202Z 2025-05-07T19:50:30.4048290Z ================================================================================ 2025-05-07T19:50:30.4048502Z GPU CPP Library Target: fbgemm_gpu_tbe_index_select (SHARED) 2025-05-07T19:50:30.4048564Z 2025-05-07T19:50:30.4048631Z CPU_SRCS: 2025-05-07T19:50:30.4048818Z codegen/training/index_select/batch_index_select_dim0_cpu_host.cpp 2025-05-07T19:50:30.4048994Z codegen/training/index_select/batch_index_select_dim0_ops.cpp 2025-05-07T19:50:30.4049055Z 2025-05-07T19:50:30.4049122Z GPU_SRCS: 2025-05-07T19:50:30.4049303Z codegen/training/index_select/batch_index_select_dim0_host.cpp 2025-05-07T19:50:30.4049422Z gen_batch_index_select_dim0_forward_codegen_cuda.cu 2025-05-07T19:50:30.4049527Z gen_batch_index_select_dim0_forward_kernel.cu 2025-05-07T19:50:30.4049650Z gen_batch_index_select_dim0_forward_kernel_small.cu 2025-05-07T19:50:30.4049773Z gen_batch_index_select_dim0_backward_codegen_cuda.cu 2025-05-07T19:50:30.4049890Z gen_batch_index_select_dim0_backward_kernel_cta.cu 2025-05-07T19:50:30.4050003Z gen_batch_index_select_dim0_backward_kernel_warp.cu 2025-05-07T19:50:30.4050278Z gen_embedding_backward_split_grad_index_select.cu 2025-05-07T19:50:30.4050340Z 2025-05-07T19:50:30.4050419Z CUDA_SPECIFIC_SRCS: 2025-05-07T19:50:30.4050423Z 2025-05-07T19:50:30.4050493Z 2025-05-07T19:50:30.4050741Z HIP_SPECIFIC_SRCS: 2025-05-07T19:50:30.4050746Z 2025-05-07T19:50:30.4050810Z 2025-05-07T19:50:30.4050882Z OTHER_SRCS: 2025-05-07T19:50:30.4050893Z 2025-05-07T19:50:30.4050960Z 2025-05-07T19:50:30.4051032Z CC_FLAGS: 2025-05-07T19:50:30.4051036Z 2025-05-07T19:50:30.4051101Z 2025-05-07T19:50:30.4051182Z NVCC_FLAGS: 2025-05-07T19:50:30.4051277Z --expt-relaxed-constexpr 2025-05-07T19:50:30.4051365Z -D__CUDA_NO_HALF_OPERATORS__ 2025-05-07T19:50:30.4051481Z -D__CUDA_NO_BFLOAT16_CONVERSIONS__ 2025-05-07T19:50:30.4051569Z -D__CUDA_NO_HALF2_OPERATORS__ 2025-05-07T19:50:30.4051634Z 2025-05-07T19:50:30.4051706Z HIPCC_FLAGS: 2025-05-07T19:50:30.4051710Z 2025-05-07T19:50:30.4051784Z 2025-05-07T19:50:30.4051856Z INCLUDE_DIRS: 2025-05-07T19:50:30.4051959Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/../include 2025-05-07T19:50:30.4052057Z /__w/FBGEMM/FBGEMM/fbgemm_gpu 2025-05-07T19:50:30.4052156Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include 2025-05-07T19:50:30.4052253Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/../include 2025-05-07T19:50:30.4052526Z /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include 2025-05-07T19:50:30.4052916Z /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include 2025-05-07T19:50:30.4053048Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src 2025-05-07T19:50:30.4053204Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include 2025-05-07T19:50:30.4053361Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include 2025-05-07T19:50:30.4053557Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include 2025-05-07T19:50:30.4053748Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include 2025-05-07T19:50:30.4053892Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include 2025-05-07T19:50:30.4054196Z /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include 2025-05-07T19:50:30.4054268Z 2025-05-07T19:50:30.4054351Z Selected Source Files: 2025-05-07T19:50:30.4054571Z codegen/training/index_select/batch_index_select_dim0_cpu_host.cpp 2025-05-07T19:50:30.4054752Z codegen/training/index_select/batch_index_select_dim0_ops.cpp 2025-05-07T19:50:30.4054933Z codegen/training/index_select/batch_index_select_dim0_host.cpp 2025-05-07T19:50:30.4055068Z gen_batch_index_select_dim0_forward_codegen_cuda.cu 2025-05-07T19:50:30.4055182Z gen_batch_index_select_dim0_forward_kernel.cu 2025-05-07T19:50:30.4055307Z gen_batch_index_select_dim0_forward_kernel_small.cu 2025-05-07T19:50:30.4055446Z gen_batch_index_select_dim0_backward_codegen_cuda.cu 2025-05-07T19:50:30.4055571Z gen_batch_index_select_dim0_backward_kernel_cta.cu 2025-05-07T19:50:30.4055698Z gen_batch_index_select_dim0_backward_kernel_warp.cu 2025-05-07T19:50:30.4055820Z gen_embedding_backward_split_grad_index_select.cu 2025-05-07T19:50:30.4055897Z 2025-05-07T19:50:30.4055982Z HIPified Source Files: 2025-05-07T19:50:30.4055987Z 2025-05-07T19:50:30.4056108Z 2025-05-07T19:50:30.4056197Z Library Dependencies: 2025-05-07T19:50:30.4056264Z torch 2025-05-07T19:50:30.4056335Z torch_library 2025-05-07T19:50:30.4056637Z /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/lib/libc10.so 2025-05-07T19:50:30.4056889Z /github/home/miniconda/envs/build_binary/targets/x86_64-linux/lib/libnvrtc.so 2025-05-07T19:50:30.4057208Z /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/lib/libc10_cuda.so 2025-05-07T19:50:30.4057549Z /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/lib/libnccl.so.2 2025-05-07T19:50:30.4057817Z /github/home/miniconda/envs/build_binary/targets/x86_64-linux/lib/stubs/libcuda.so 2025-05-07T19:50:30.4057910Z fbgemm_gpu_sparse_async_cumsum 2025-05-07T19:50:30.4058006Z fbgemm_gpu_tbe_utils 2025-05-07T19:50:30.4058214Z /github/home/miniconda/envs/build_binary/lib/stubs/libnvidia-ml.so 2025-05-07T19:50:30.4058340Z 2025-05-07T19:50:30.4058418Z Output Library: 2025-05-07T19:50:30.4058507Z fbgemm_gpu_tbe_index_select 2025-05-07T19:50:30.4058578Z 2025-05-07T19:50:30.4058659Z Destination Directory: 2025-05-07T19:50:30.4058731Z fbgemm_gpu 2025-05-07T19:50:30.4058838Z ================================================================================ 2025-05-07T19:50:30.4058843Z 2025-05-07T19:50:30.4058847Z 2025-05-07T19:50:30.4058850Z 2025-05-07T19:50:30.4058948Z ================================================================================ 2025-05-07T19:50:30.4059131Z GPU CPP Library Target: fbgemm_gpu_embedding_inplace_ops (SHARED) 2025-05-07T19:50:30.4059203Z 2025-05-07T19:50:30.4059276Z CPU_SRCS: 2025-05-07T19:50:30.4059443Z src/embedding_inplace_ops/embedding_inplace_update_cpu.cpp 2025-05-07T19:50:30.4059507Z 2025-05-07T19:50:30.4059585Z GPU_SRCS: 2025-05-07T19:50:30.4059746Z src/embedding_inplace_ops/embedding_inplace_update_gpu.cpp 2025-05-07T19:50:30.4059895Z src/embedding_inplace_ops/embedding_inplace_update.cu 2025-05-07T19:50:30.4059966Z 2025-05-07T19:50:30.4060049Z CUDA_SPECIFIC_SRCS: 2025-05-07T19:50:30.4060053Z 2025-05-07T19:50:30.4060116Z 2025-05-07T19:50:30.4060195Z HIP_SPECIFIC_SRCS: 2025-05-07T19:50:30.4060200Z 2025-05-07T19:50:30.4060272Z 2025-05-07T19:50:30.4060342Z OTHER_SRCS: 2025-05-07T19:50:30.4060346Z 2025-05-07T19:50:30.4060411Z 2025-05-07T19:50:30.4060490Z CC_FLAGS: 2025-05-07T19:50:30.4060494Z 2025-05-07T19:50:30.4060559Z 2025-05-07T19:50:30.4060630Z NVCC_FLAGS: 2025-05-07T19:50:30.4060720Z --expt-relaxed-constexpr 2025-05-07T19:50:30.4060814Z -D__CUDA_NO_HALF_OPERATORS__ 2025-05-07T19:50:30.4060909Z -D__CUDA_NO_BFLOAT16_CONVERSIONS__ 2025-05-07T19:50:30.4060994Z -D__CUDA_NO_HALF2_OPERATORS__ 2025-05-07T19:50:30.4061066Z 2025-05-07T19:50:30.4061138Z HIPCC_FLAGS: 2025-05-07T19:50:30.4061143Z 2025-05-07T19:50:30.4061208Z 2025-05-07T19:50:30.4061279Z INCLUDE_DIRS: 2025-05-07T19:50:30.4061388Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/../include 2025-05-07T19:50:30.4061477Z /__w/FBGEMM/FBGEMM/fbgemm_gpu 2025-05-07T19:50:30.4061576Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include 2025-05-07T19:50:30.4061680Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/../include 2025-05-07T19:50:30.4061951Z /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include 2025-05-07T19:50:30.4062334Z /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include 2025-05-07T19:50:30.4062473Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src 2025-05-07T19:50:30.4062627Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include 2025-05-07T19:50:30.4062774Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include 2025-05-07T19:50:30.4063078Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include 2025-05-07T19:50:30.4063261Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include 2025-05-07T19:50:30.4063386Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include 2025-05-07T19:50:30.4063818Z /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include 2025-05-07T19:50:30.4063941Z 2025-05-07T19:50:30.4064019Z Selected Source Files: 2025-05-07T19:50:30.4064178Z src/embedding_inplace_ops/embedding_inplace_update_cpu.cpp 2025-05-07T19:50:30.4064339Z src/embedding_inplace_ops/embedding_inplace_update_gpu.cpp 2025-05-07T19:50:30.4064477Z src/embedding_inplace_ops/embedding_inplace_update.cu 2025-05-07T19:50:30.4064540Z 2025-05-07T19:50:30.4064619Z HIPified Source Files: 2025-05-07T19:50:30.4064623Z 2025-05-07T19:50:30.4064692Z 2025-05-07T19:50:30.4064771Z Library Dependencies: 2025-05-07T19:50:30.4064834Z torch 2025-05-07T19:50:30.4064911Z torch_library 2025-05-07T19:50:30.4065185Z /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/lib/libc10.so 2025-05-07T19:50:30.4065410Z /github/home/miniconda/envs/build_binary/targets/x86_64-linux/lib/libnvrtc.so 2025-05-07T19:50:30.4065713Z /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/lib/libc10_cuda.so 2025-05-07T19:50:30.4066086Z /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/lib/libnccl.so.2 2025-05-07T19:50:30.4066328Z /github/home/miniconda/envs/build_binary/targets/x86_64-linux/lib/stubs/libcuda.so 2025-05-07T19:50:30.4066517Z /github/home/miniconda/envs/build_binary/lib/stubs/libnvidia-ml.so 2025-05-07T19:50:30.4066594Z 2025-05-07T19:50:30.4066669Z Output Library: 2025-05-07T19:50:30.4066761Z fbgemm_gpu_embedding_inplace_ops 2025-05-07T19:50:30.4066829Z 2025-05-07T19:50:30.4066911Z Destination Directory: 2025-05-07T19:50:30.4066981Z fbgemm_gpu 2025-05-07T19:50:30.4067078Z ================================================================================ 2025-05-07T19:50:30.4067083Z 2025-05-07T19:50:30.4067101Z 2025-05-07T19:50:30.4067105Z 2025-05-07T19:50:30.4067197Z ================================================================================ 2025-05-07T19:50:30.4067310Z GPU CPP Library Target: fbgemm_gpu_py (SHARED) 2025-05-07T19:50:30.4067378Z 2025-05-07T19:50:30.4067456Z CPU_SRCS: 2025-05-07T19:50:30.4067552Z src/memory_utils/memory_utils.cpp 2025-05-07T19:50:30.4067646Z src/memory_utils/memory_utils_ops.cpp 2025-05-07T19:50:30.4067837Z src/merge_pooled_embedding_ops/merge_pooled_embedding_ops_cpu.cpp 2025-05-07T19:50:30.4068033Z src/permute_multi_embedding_ops/permute_multi_embedding_function.cpp 2025-05-07T19:50:30.4068221Z src/permute_multi_embedding_ops/permute_multi_embedding_ops_cpu.cpp 2025-05-07T19:50:30.4068423Z src/permute_pooled_embedding_ops/permute_pooled_embedding_function.cpp 2025-05-07T19:50:30.4068616Z src/permute_pooled_embedding_ops/permute_pooled_embedding_ops_cpu.cpp 2025-05-07T19:50:30.4068830Z src/permute_pooled_embedding_ops/permute_pooled_embedding_ops_split_cpu.cpp 2025-05-07T19:50:30.4068966Z src/jagged_tensor_ops/jagged_tensor_ops_autograd.cpp 2025-05-07T19:50:30.4069098Z src/jagged_tensor_ops/jagged_tensor_ops_meta.cpp 2025-05-07T19:50:30.4069212Z src/jagged_tensor_ops/jagged_tensor_ops_cpu.cpp 2025-05-07T19:50:30.4069317Z src/input_combine_ops/input_combine_cpu.cpp 2025-05-07T19:50:30.4069463Z src/layout_transform_ops/layout_transform_ops_cpu.cpp 2025-05-07T19:50:30.4069559Z src/quantize_ops/quantize_ops_cpu.cpp 2025-05-07T19:50:30.4069654Z src/quantize_ops/quantize_ops_meta.cpp 2025-05-07T19:50:30.4069769Z src/sparse_ops/sparse_async_batched_cumsum.cpp 2025-05-07T19:50:30.4069866Z src/sparse_ops/sparse_ops_cpu.cpp 2025-05-07T19:50:30.4069955Z src/sparse_ops/sparse_ops_meta.cpp 2025-05-07T19:50:30.4070038Z src/tbe/eeg/eeg_models.cpp 2025-05-07T19:50:30.4070125Z src/tbe/eeg/eeg_utils.cpp 2025-05-07T19:50:30.4070218Z src/tbe/eeg/indices_estimator_ops.cpp 2025-05-07T19:50:30.4070304Z src/tbe/eeg/indices_estimator.cpp 2025-05-07T19:50:30.4070405Z src/tbe/eeg/indices_generator_ops.cpp 2025-05-07T19:50:30.4070493Z src/tbe/eeg/indices_generator.cpp 2025-05-07T19:50:30.4070708Z src/intraining_embedding_pruning_ops/intraining_embedding_pruning_gpu.cpp 2025-05-07T19:50:30.4070846Z src/layout_transform_ops/layout_transform_ops_gpu.cpp 2025-05-07T19:50:30.4071092Z src/permute_pooled_embedding_ops/permute_pooled_embedding_ops_gpu.cpp 2025-05-07T19:50:30.4071305Z src/permute_pooled_embedding_ops/permute_pooled_embedding_ops_split_gpu.cpp 2025-05-07T19:50:30.4071397Z src/quantize_ops/quantize_ops_gpu.cpp 2025-05-07T19:50:30.4071492Z src/sparse_ops/sparse_ops_gpu.cpp 2025-05-07T19:50:30.4071579Z src/metric_ops/metric_ops_host.cpp 2025-05-07T19:50:30.4071687Z src/input_combine_ops/input_combine_gpu.cpp 2025-05-07T19:50:30.4071865Z src/merge_pooled_embedding_ops/merge_pooled_embedding_ops_gpu.cpp 2025-05-07T19:50:30.4071958Z src/topology_utils.cpp 2025-05-07T19:50:30.4072027Z 2025-05-07T19:50:30.4072093Z GPU_SRCS: 2025-05-07T19:50:30.4072207Z src/histogram_binning_calibration_ops.cu 2025-05-07T19:50:30.4072301Z src/input_combine_ops/input_combine.cu 2025-05-07T19:50:30.4072491Z src/intraining_embedding_pruning_ops/intraining_embedding_pruning.cu 2025-05-07T19:50:30.4072579Z src/memory_utils/memory_utils.cu 2025-05-07T19:50:30.4072726Z src/memory_utils/memory_utils_ops.cu 2025-05-07T19:50:30.4072900Z src/jagged_tensor_ops/batched_dense_vec_jagged_2d_mul_backward.cu 2025-05-07T19:50:30.4073067Z src/jagged_tensor_ops/batched_dense_vec_jagged_2d_mul_forward.cu 2025-05-07T19:50:30.4073198Z src/jagged_tensor_ops/dense_to_jagged_forward.cu 2025-05-07T19:50:30.4073326Z src/jagged_tensor_ops/jagged_dense_bmm_forward.cu 2025-05-07T19:50:30.4073559Z src/jagged_tensor_ops/jagged_dense_dense_elementwise_add_jagged_output_forward.cu 2025-05-07T19:50:30.4073731Z src/jagged_tensor_ops/jagged_dense_elementwise_mul_backward.cu 2025-05-07T19:50:30.4073891Z src/jagged_tensor_ops/jagged_dense_elementwise_mul_forward.cu 2025-05-07T19:50:30.4074018Z src/jagged_tensor_ops/jagged_index_add_2d_forward.cu 2025-05-07T19:50:30.4074155Z src/jagged_tensor_ops/jagged_index_select_2d_forward.cu 2025-05-07T19:50:30.4074283Z src/jagged_tensor_ops/jagged_jagged_bmm_forward.cu 2025-05-07T19:50:30.4074398Z src/jagged_tensor_ops/jagged_softmax_backward.cu 2025-05-07T19:50:30.4074518Z src/jagged_tensor_ops/jagged_softmax_forward.cu 2025-05-07T19:50:30.4074629Z src/jagged_tensor_ops/jagged_tensor_ops.cu 2025-05-07T19:50:30.4074776Z src/jagged_tensor_ops/jagged_to_padded_dense_backward.cu 2025-05-07T19:50:30.4074917Z src/jagged_tensor_ops/jagged_to_padded_dense_forward.cu 2025-05-07T19:50:30.4075040Z src/jagged_tensor_ops/jagged_unique_indices.cu 2025-05-07T19:50:30.4075178Z src/jagged_tensor_ops/keyed_jagged_index_select_dim1.cu 2025-05-07T19:50:30.4075296Z src/layout_transform_ops/layout_transform_ops.cu 2025-05-07T19:50:30.4075386Z src/metric_ops/metric_ops.cu 2025-05-07T19:50:30.4075593Z src/permute_pooled_embedding_ops/permute_pooled_embedding_ops_split.cu 2025-05-07T19:50:30.4075770Z src/permute_pooled_embedding_ops/permute_pooled_embedding_ops.cu 2025-05-07T19:50:30.4075938Z src/permute_multi_embedding_ops/permute_multi_embedding_ops.cu 2025-05-07T19:50:30.4076047Z src/quantize_ops/quantize_bfloat16.cu 2025-05-07T19:50:30.4076151Z src/quantize_ops/quantize_fp8_rowwise.cu 2025-05-07T19:50:30.4076268Z src/quantize_ops/quantize_fused_8bit_rowwise.cu 2025-05-07T19:50:30.4076396Z src/quantize_ops/quantize_fused_nbit_rowwise.cu 2025-05-07T19:50:30.4076487Z src/quantize_ops/quantize_hfp8.cu 2025-05-07T19:50:30.4076579Z src/quantize_ops/quantize_msfp.cu 2025-05-07T19:50:30.4076694Z src/quantize_ops/quantize_padded_fp8_rowwise.cu 2025-05-07T19:50:30.4076793Z src/quantize_ops/quantize_mx.cu 2025-05-07T19:50:30.4076905Z src/sparse_ops/sparse_async_batched_cumsum.cu 2025-05-07T19:50:30.4077026Z src/sparse_ops/sparse_block_bucketize_features.cu 2025-05-07T19:50:30.4077143Z src/sparse_ops/sparse_bucketize_features.cu 2025-05-07T19:50:30.4077264Z src/sparse_ops/sparse_batched_unary_embeddings.cu 2025-05-07T19:50:30.4077397Z src/sparse_ops/sparse_compute_frequency_sequence.cu 2025-05-07T19:50:30.4077525Z src/sparse_ops/sparse_expand_into_jagged_permute.cu 2025-05-07T19:50:30.4077626Z src/sparse_ops/sparse_group_index.cu 2025-05-07T19:50:30.4077719Z src/sparse_ops/sparse_index_add.cu 2025-05-07T19:50:30.4077813Z src/sparse_ops/sparse_index_select.cu 2025-05-07T19:50:30.4077974Z src/sparse_ops/sparse_invert_permute.cu 2025-05-07T19:50:30.4078092Z src/sparse_ops/sparse_pack_segments_backward.cu 2025-05-07T19:50:30.4078206Z src/sparse_ops/sparse_pack_segments_forward.cu 2025-05-07T19:50:30.4078302Z src/sparse_ops/sparse_permute_1d.cu 2025-05-07T19:50:30.4078402Z src/sparse_ops/sparse_permute_2d.cu 2025-05-07T19:50:30.4078495Z src/sparse_ops/sparse_permute102.cu 2025-05-07T19:50:30.4078604Z src/sparse_ops/sparse_permute_embeddings.cu 2025-05-07T19:50:30.4078704Z src/sparse_ops/sparse_range.cu 2025-05-07T19:50:30.4078806Z src/sparse_ops/sparse_reorder_batched_ad.cu 2025-05-07T19:50:30.4078904Z src/sparse_ops/sparse_segment_sum_csr.cu 2025-05-07T19:50:30.4079005Z src/sparse_ops/sparse_zipf.cu 2025-05-07T19:50:30.4079067Z 2025-05-07T19:50:30.4079147Z CUDA_SPECIFIC_SRCS: 2025-05-07T19:50:30.4079151Z 2025-05-07T19:50:30.4079217Z 2025-05-07T19:50:30.4079376Z HIP_SPECIFIC_SRCS: 2025-05-07T19:50:30.4079380Z 2025-05-07T19:50:30.4079446Z 2025-05-07T19:50:30.4079516Z OTHER_SRCS: 2025-05-07T19:50:30.4079523Z 2025-05-07T19:50:30.4079601Z 2025-05-07T19:50:30.4079669Z CC_FLAGS: 2025-05-07T19:50:30.4079674Z 2025-05-07T19:50:30.4079740Z 2025-05-07T19:50:30.4079812Z NVCC_FLAGS: 2025-05-07T19:50:30.4079911Z --expt-relaxed-constexpr 2025-05-07T19:50:30.4079999Z -D__CUDA_NO_HALF_OPERATORS__ 2025-05-07T19:50:30.4080092Z -D__CUDA_NO_BFLOAT16_CONVERSIONS__ 2025-05-07T19:50:30.4080189Z -D__CUDA_NO_HALF2_OPERATORS__ 2025-05-07T19:50:30.4080254Z 2025-05-07T19:50:30.4080326Z HIPCC_FLAGS: 2025-05-07T19:50:30.4080330Z 2025-05-07T19:50:30.4080393Z 2025-05-07T19:50:30.4080476Z INCLUDE_DIRS: 2025-05-07T19:50:30.4080567Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/../include 2025-05-07T19:50:30.4080655Z /__w/FBGEMM/FBGEMM/fbgemm_gpu 2025-05-07T19:50:30.4080755Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include 2025-05-07T19:50:30.4080846Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/../include 2025-05-07T19:50:30.4081105Z /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include 2025-05-07T19:50:30.4081473Z /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include 2025-05-07T19:50:30.4081599Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src 2025-05-07T19:50:30.4081745Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include 2025-05-07T19:50:30.4081888Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include 2025-05-07T19:50:30.4082081Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include 2025-05-07T19:50:30.4082263Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include 2025-05-07T19:50:30.4082396Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include 2025-05-07T19:50:30.4082687Z /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include 2025-05-07T19:50:30.4082752Z 2025-05-07T19:50:30.4082834Z Selected Source Files: 2025-05-07T19:50:30.4082924Z src/memory_utils/memory_utils.cpp 2025-05-07T19:50:30.4083031Z src/memory_utils/memory_utils_ops.cpp 2025-05-07T19:50:30.4083216Z src/merge_pooled_embedding_ops/merge_pooled_embedding_ops_cpu.cpp 2025-05-07T19:50:30.4083410Z src/permute_multi_embedding_ops/permute_multi_embedding_function.cpp 2025-05-07T19:50:30.4083606Z src/permute_multi_embedding_ops/permute_multi_embedding_ops_cpu.cpp 2025-05-07T19:50:30.4083806Z src/permute_pooled_embedding_ops/permute_pooled_embedding_function.cpp 2025-05-07T19:50:30.4083996Z src/permute_pooled_embedding_ops/permute_pooled_embedding_ops_cpu.cpp 2025-05-07T19:50:30.4084220Z src/permute_pooled_embedding_ops/permute_pooled_embedding_ops_split_cpu.cpp 2025-05-07T19:50:30.4084354Z src/jagged_tensor_ops/jagged_tensor_ops_autograd.cpp 2025-05-07T19:50:30.4084471Z src/jagged_tensor_ops/jagged_tensor_ops_meta.cpp 2025-05-07T19:50:30.4084583Z src/jagged_tensor_ops/jagged_tensor_ops_cpu.cpp 2025-05-07T19:50:30.4084701Z src/input_combine_ops/input_combine_cpu.cpp 2025-05-07T19:50:30.4084836Z src/layout_transform_ops/layout_transform_ops_cpu.cpp 2025-05-07T19:50:30.4084936Z src/quantize_ops/quantize_ops_cpu.cpp 2025-05-07T19:50:30.4085089Z src/quantize_ops/quantize_ops_meta.cpp 2025-05-07T19:50:30.4085205Z src/sparse_ops/sparse_async_batched_cumsum.cpp 2025-05-07T19:50:30.4085293Z src/sparse_ops/sparse_ops_cpu.cpp 2025-05-07T19:50:30.4085393Z src/sparse_ops/sparse_ops_meta.cpp 2025-05-07T19:50:30.4085476Z src/tbe/eeg/eeg_models.cpp 2025-05-07T19:50:30.4085559Z src/tbe/eeg/eeg_utils.cpp 2025-05-07T19:50:30.4085651Z src/tbe/eeg/indices_estimator_ops.cpp 2025-05-07T19:50:30.4085749Z src/tbe/eeg/indices_estimator.cpp 2025-05-07T19:50:30.4085850Z src/tbe/eeg/indices_generator_ops.cpp 2025-05-07T19:50:30.4085937Z src/tbe/eeg/indices_generator.cpp 2025-05-07T19:50:30.4086166Z src/intraining_embedding_pruning_ops/intraining_embedding_pruning_gpu.cpp 2025-05-07T19:50:30.4086303Z src/layout_transform_ops/layout_transform_ops_gpu.cpp 2025-05-07T19:50:30.4086499Z src/permute_pooled_embedding_ops/permute_pooled_embedding_ops_gpu.cpp 2025-05-07T19:50:30.4086770Z src/permute_pooled_embedding_ops/permute_pooled_embedding_ops_split_gpu.cpp 2025-05-07T19:50:30.4086878Z src/quantize_ops/quantize_ops_gpu.cpp 2025-05-07T19:50:30.4086967Z src/sparse_ops/sparse_ops_gpu.cpp 2025-05-07T19:50:30.4087055Z src/metric_ops/metric_ops_host.cpp 2025-05-07T19:50:30.4087171Z src/input_combine_ops/input_combine_gpu.cpp 2025-05-07T19:50:30.4087347Z src/merge_pooled_embedding_ops/merge_pooled_embedding_ops_gpu.cpp 2025-05-07T19:50:30.4087428Z src/topology_utils.cpp 2025-05-07T19:50:30.4087542Z src/histogram_binning_calibration_ops.cu 2025-05-07T19:50:30.4087635Z src/input_combine_ops/input_combine.cu 2025-05-07T19:50:30.4087828Z src/intraining_embedding_pruning_ops/intraining_embedding_pruning.cu 2025-05-07T19:50:30.4087917Z src/memory_utils/memory_utils.cu 2025-05-07T19:50:30.4088018Z src/memory_utils/memory_utils_ops.cu 2025-05-07T19:50:30.4088187Z src/jagged_tensor_ops/batched_dense_vec_jagged_2d_mul_backward.cu 2025-05-07T19:50:30.4088355Z src/jagged_tensor_ops/batched_dense_vec_jagged_2d_mul_forward.cu 2025-05-07T19:50:30.4088480Z src/jagged_tensor_ops/dense_to_jagged_forward.cu 2025-05-07T19:50:30.4088604Z src/jagged_tensor_ops/jagged_dense_bmm_forward.cu 2025-05-07T19:50:30.4088833Z src/jagged_tensor_ops/jagged_dense_dense_elementwise_add_jagged_output_forward.cu 2025-05-07T19:50:30.4089008Z src/jagged_tensor_ops/jagged_dense_elementwise_mul_backward.cu 2025-05-07T19:50:30.4089168Z src/jagged_tensor_ops/jagged_dense_elementwise_mul_forward.cu 2025-05-07T19:50:30.4089297Z src/jagged_tensor_ops/jagged_index_add_2d_forward.cu 2025-05-07T19:50:30.4089434Z src/jagged_tensor_ops/jagged_index_select_2d_forward.cu 2025-05-07T19:50:30.4089565Z src/jagged_tensor_ops/jagged_jagged_bmm_forward.cu 2025-05-07T19:50:30.4089680Z src/jagged_tensor_ops/jagged_softmax_backward.cu 2025-05-07T19:50:30.4089794Z src/jagged_tensor_ops/jagged_softmax_forward.cu 2025-05-07T19:50:30.4089908Z src/jagged_tensor_ops/jagged_tensor_ops.cu 2025-05-07T19:50:30.4090051Z src/jagged_tensor_ops/jagged_to_padded_dense_backward.cu 2025-05-07T19:50:30.4090274Z src/jagged_tensor_ops/jagged_to_padded_dense_forward.cu 2025-05-07T19:50:30.4090395Z src/jagged_tensor_ops/jagged_unique_indices.cu 2025-05-07T19:50:30.4090542Z src/jagged_tensor_ops/keyed_jagged_index_select_dim1.cu 2025-05-07T19:50:30.4090845Z src/layout_transform_ops/layout_transform_ops.cu 2025-05-07T19:50:30.4090944Z src/metric_ops/metric_ops.cu 2025-05-07T19:50:30.4091168Z src/permute_pooled_embedding_ops/permute_pooled_embedding_ops_split.cu 2025-05-07T19:50:30.4091356Z src/permute_pooled_embedding_ops/permute_pooled_embedding_ops.cu 2025-05-07T19:50:30.4091534Z src/permute_multi_embedding_ops/permute_multi_embedding_ops.cu 2025-05-07T19:50:30.4091647Z src/quantize_ops/quantize_bfloat16.cu 2025-05-07T19:50:30.4091750Z src/quantize_ops/quantize_fp8_rowwise.cu 2025-05-07T19:50:30.4091869Z src/quantize_ops/quantize_fused_8bit_rowwise.cu 2025-05-07T19:50:30.4091986Z src/quantize_ops/quantize_fused_nbit_rowwise.cu 2025-05-07T19:50:30.4092092Z src/quantize_ops/quantize_hfp8.cu 2025-05-07T19:50:30.4092184Z src/quantize_ops/quantize_msfp.cu 2025-05-07T19:50:30.4092365Z src/quantize_ops/quantize_padded_fp8_rowwise.cu 2025-05-07T19:50:30.4092467Z src/quantize_ops/quantize_mx.cu 2025-05-07T19:50:30.4092583Z src/sparse_ops/sparse_async_batched_cumsum.cu 2025-05-07T19:50:30.4092710Z src/sparse_ops/sparse_block_bucketize_features.cu 2025-05-07T19:50:30.4092822Z src/sparse_ops/sparse_bucketize_features.cu 2025-05-07T19:50:30.4092960Z src/sparse_ops/sparse_batched_unary_embeddings.cu 2025-05-07T19:50:30.4093095Z src/sparse_ops/sparse_compute_frequency_sequence.cu 2025-05-07T19:50:30.4093230Z src/sparse_ops/sparse_expand_into_jagged_permute.cu 2025-05-07T19:50:30.4093338Z src/sparse_ops/sparse_group_index.cu 2025-05-07T19:50:30.4093433Z src/sparse_ops/sparse_index_add.cu 2025-05-07T19:50:30.4093532Z src/sparse_ops/sparse_index_select.cu 2025-05-07T19:50:30.4093643Z src/sparse_ops/sparse_invert_permute.cu 2025-05-07T19:50:30.4093764Z src/sparse_ops/sparse_pack_segments_backward.cu 2025-05-07T19:50:30.4093941Z src/sparse_ops/sparse_pack_segments_forward.cu 2025-05-07T19:50:30.4094042Z src/sparse_ops/sparse_permute_1d.cu 2025-05-07T19:50:30.4094145Z src/sparse_ops/sparse_permute_2d.cu 2025-05-07T19:50:30.4094243Z src/sparse_ops/sparse_permute102.cu 2025-05-07T19:50:30.4094355Z src/sparse_ops/sparse_permute_embeddings.cu 2025-05-07T19:50:30.4094456Z src/sparse_ops/sparse_range.cu 2025-05-07T19:50:30.4094563Z src/sparse_ops/sparse_reorder_batched_ad.cu 2025-05-07T19:50:30.4094666Z src/sparse_ops/sparse_segment_sum_csr.cu 2025-05-07T19:50:30.4094756Z src/sparse_ops/sparse_zipf.cu 2025-05-07T19:50:30.4094831Z 2025-05-07T19:50:30.4094913Z HIPified Source Files: 2025-05-07T19:50:30.4094918Z 2025-05-07T19:50:30.4094984Z 2025-05-07T19:50:30.4095073Z Library Dependencies: 2025-05-07T19:50:30.4095140Z torch 2025-05-07T19:50:30.4095214Z torch_library 2025-05-07T19:50:30.4095519Z /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/lib/libc10.so 2025-05-07T19:50:30.4095772Z /github/home/miniconda/envs/build_binary/targets/x86_64-linux/lib/libnvrtc.so 2025-05-07T19:50:30.4096096Z /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/lib/libc10_cuda.so 2025-05-07T19:50:30.4096438Z /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/lib/libnccl.so.2 2025-05-07T19:50:30.4096709Z /github/home/miniconda/envs/build_binary/targets/x86_64-linux/lib/stubs/libcuda.so 2025-05-07T19:50:30.4096779Z fbgemm 2025-05-07T19:50:30.4096873Z fbgemm_gpu_sparse_async_cumsum 2025-05-07T19:50:30.4096975Z fbgemm_gpu_embedding_inplace_ops 2025-05-07T19:50:30.4097061Z fbgemm_gpu_tbe_index_select 2025-05-07T19:50:30.4097139Z fbgemm_gpu_tbe_cache 2025-05-07T19:50:30.4097223Z fbgemm_gpu_tbe_optimizers 2025-05-07T19:50:30.4097307Z fbgemm_gpu_tbe_utils 2025-05-07T19:50:30.4097512Z /github/home/miniconda/envs/build_binary/lib/stubs/libnvidia-ml.so 2025-05-07T19:50:30.4097577Z 2025-05-07T19:50:30.4097661Z Output Library: 2025-05-07T19:50:30.4097737Z fbgemm_gpu_py 2025-05-07T19:50:30.4097801Z 2025-05-07T19:50:30.4097883Z Destination Directory: 2025-05-07T19:50:30.4097963Z fbgemm_gpu 2025-05-07T19:50:30.4098063Z ================================================================================ 2025-05-07T19:50:30.4098068Z 2025-05-07T19:50:30.4098154Z -- Configuring done (8.1s) 2025-05-07T19:50:30.5293341Z -- Generating done (0.1s) 2025-05-07T19:50:30.5311663Z -- Build files have been written to: /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build 2025-05-07T19:50:30.5447615Z Change Dir: '/__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build' 2025-05-07T19:50:30.5448016Z 2025-05-07T19:50:30.5448765Z Run Build Command(s): /github/home/miniconda/envs/build_binary/bin/ninja -v -j 48 install 2025-05-07T19:50:30.6855457Z [1/608] /github/home/miniconda/envs/build_binary/bin/x86_64-conda-linux-gnu-c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dasmjit_EXPORTS -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/lib -O3 -DNDEBUG -std=c++20 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -MD -MT CMakeFiles/asmjit.dir/__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/core/errorhandler.cpp.o -MF CMakeFiles/asmjit.dir/__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/core/errorhandler.cpp.o.d -o CMakeFiles/asmjit.dir/__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/core/errorhandler.cpp.o -c /__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/core/errorhandler.cpp 2025-05-07T19:50:30.7095351Z [2/608] /github/home/miniconda/envs/build_binary/bin/x86_64-conda-linux-gnu-c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dasmjit_EXPORTS -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/lib -O3 -DNDEBUG -std=c++20 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -MD -MT CMakeFiles/asmjit.dir/__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/core/zonetree.cpp.o -MF CMakeFiles/asmjit.dir/__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/core/zonetree.cpp.o.d -o CMakeFiles/asmjit.dir/__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/core/zonetree.cpp.o -c /__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/core/zonetree.cpp 2025-05-07T19:50:30.7115579Z [3/608] /github/home/miniconda/envs/build_binary/bin/x86_64-conda-linux-gnu-c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dasmjit_EXPORTS -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/lib -O3 -DNDEBUG -std=c++20 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -MD -MT CMakeFiles/asmjit.dir/__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/core/support.cpp.o -MF CMakeFiles/asmjit.dir/__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/core/support.cpp.o.d -o CMakeFiles/asmjit.dir/__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/core/support.cpp.o -c /__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/core/support.cpp 2025-05-07T19:50:30.7198976Z [4/608] /github/home/miniconda/envs/build_binary/bin/x86_64-conda-linux-gnu-c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dasmjit_EXPORTS -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/lib -O3 -DNDEBUG -std=c++20 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -MD -MT CMakeFiles/asmjit.dir/__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/core/zonelist.cpp.o -MF CMakeFiles/asmjit.dir/__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/core/zonelist.cpp.o.d -o CMakeFiles/asmjit.dir/__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/core/zonelist.cpp.o -c /__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/core/zonelist.cpp 2025-05-07T19:50:30.7263515Z [5/608] /github/home/miniconda/envs/build_binary/bin/x86_64-conda-linux-gnu-c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dasmjit_EXPORTS -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/lib -O3 -DNDEBUG -std=c++20 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -MD -MT CMakeFiles/asmjit.dir/__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/core/osutils.cpp.o -MF CMakeFiles/asmjit.dir/__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/core/osutils.cpp.o.d -o CMakeFiles/asmjit.dir/__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/core/osutils.cpp.o -c /__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/core/osutils.cpp 2025-05-07T19:50:30.7282921Z [6/608] /github/home/miniconda/envs/build_binary/bin/x86_64-conda-linux-gnu-c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dasmjit_EXPORTS -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/lib -O3 -DNDEBUG -std=c++20 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -MD -MT CMakeFiles/asmjit.dir/__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/core/zonestack.cpp.o -MF CMakeFiles/asmjit.dir/__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/core/zonestack.cpp.o.d -o CMakeFiles/asmjit.dir/__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/core/zonestack.cpp.o -c /__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/core/zonestack.cpp 2025-05-07T19:50:30.7340018Z [7/608] /github/home/miniconda/envs/build_binary/bin/x86_64-conda-linux-gnu-c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dasmjit_EXPORTS -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/lib -O3 -DNDEBUG -std=c++20 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -MD -MT CMakeFiles/asmjit.dir/__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/core/globals.cpp.o -MF CMakeFiles/asmjit.dir/__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/core/globals.cpp.o.d -o CMakeFiles/asmjit.dir/__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/core/globals.cpp.o -c /__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/core/globals.cpp 2025-05-07T19:50:30.7459737Z [8/608] /github/home/miniconda/envs/build_binary/bin/x86_64-conda-linux-gnu-c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dasmjit_EXPORTS -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/lib -O3 -DNDEBUG -std=c++20 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -MD -MT CMakeFiles/asmjit.dir/__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/core/zonehash.cpp.o -MF CMakeFiles/asmjit.dir/__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/core/zonehash.cpp.o.d -o CMakeFiles/asmjit.dir/__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/core/zonehash.cpp.o -c /__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/core/zonehash.cpp 2025-05-07T19:50:30.7478985Z [9/608] /github/home/miniconda/envs/build_binary/bin/x86_64-conda-linux-gnu-c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dasmjit_EXPORTS -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/lib -O3 -DNDEBUG -std=c++20 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -MD -MT CMakeFiles/asmjit.dir/__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/core/operand.cpp.o -MF CMakeFiles/asmjit.dir/__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/core/operand.cpp.o.d -o CMakeFiles/asmjit.dir/__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/core/operand.cpp.o -c /__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/core/operand.cpp 2025-05-07T19:50:30.7617144Z [10/608] /github/home/miniconda/envs/build_binary/bin/x86_64-conda-linux-gnu-c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dasmjit_EXPORTS -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/lib -O3 -DNDEBUG -std=c++20 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -MD -MT CMakeFiles/asmjit.dir/__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/core/environment.cpp.o -MF CMakeFiles/asmjit.dir/__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/core/environment.cpp.o.d -o CMakeFiles/asmjit.dir/__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/core/environment.cpp.o -c /__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/core/environment.cpp 2025-05-07T19:50:30.7871860Z [11/608] /github/home/miniconda/envs/build_binary/bin/x86_64-conda-linux-gnu-c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dasmjit_EXPORTS -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/lib -O3 -DNDEBUG -std=c++20 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -MD -MT CMakeFiles/asmjit.dir/__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/core/zone.cpp.o -MF CMakeFiles/asmjit.dir/__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/core/zone.cpp.o.d -o CMakeFiles/asmjit.dir/__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/core/zone.cpp.o -c /__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/core/zone.cpp 2025-05-07T19:50:30.7949223Z [12/608] /github/home/miniconda/envs/build_binary/bin/x86_64-conda-linux-gnu-c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dasmjit_EXPORTS -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/lib -O3 -DNDEBUG -std=c++20 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -MD -MT CMakeFiles/asmjit.dir/__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/core/target.cpp.o -MF CMakeFiles/asmjit.dir/__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/core/target.cpp.o.d -o CMakeFiles/asmjit.dir/__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/core/target.cpp.o -c /__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/core/target.cpp 2025-05-07T19:50:30.7995809Z [13/608] /github/home/miniconda/envs/build_binary/bin/x86_64-conda-linux-gnu-c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dasmjit_EXPORTS -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/lib -O3 -DNDEBUG -std=c++20 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -MD -MT CMakeFiles/asmjit.dir/__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/core/inst.cpp.o -MF CMakeFiles/asmjit.dir/__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/core/inst.cpp.o.d -o CMakeFiles/asmjit.dir/__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/core/inst.cpp.o -c /__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/core/inst.cpp 2025-05-07T19:50:30.8031665Z [14/608] /github/home/miniconda/envs/build_binary/bin/x86_64-conda-linux-gnu-c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dasmjit_EXPORTS -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/lib -O3 -DNDEBUG -std=c++20 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -MD -MT CMakeFiles/asmjit.dir/__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/core/type.cpp.o -MF CMakeFiles/asmjit.dir/__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/core/type.cpp.o.d -o CMakeFiles/asmjit.dir/__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/core/type.cpp.o -c /__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/core/type.cpp 2025-05-07T19:50:30.8081512Z [15/608] /github/home/miniconda/envs/build_binary/bin/x86_64-conda-linux-gnu-c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dasmjit_EXPORTS -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/lib -O3 -DNDEBUG -std=c++20 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -MD -MT CMakeFiles/asmjit.dir/__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/core/string.cpp.o -MF CMakeFiles/asmjit.dir/__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/core/string.cpp.o.d -o CMakeFiles/asmjit.dir/__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/core/string.cpp.o -c /__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/core/string.cpp 2025-05-07T19:50:30.8115088Z [16/608] /github/home/miniconda/envs/build_binary/bin/x86_64-conda-linux-gnu-c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dasmjit_EXPORTS -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/lib -O3 -DNDEBUG -std=c++20 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -MD -MT CMakeFiles/asmjit.dir/__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/arm/a64operand.cpp.o -MF CMakeFiles/asmjit.dir/__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/arm/a64operand.cpp.o.d -o CMakeFiles/asmjit.dir/__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/arm/a64operand.cpp.o -c /__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/arm/a64operand.cpp 2025-05-07T19:50:30.8125783Z In file included from /__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/arm/a64operand.cpp:10: 2025-05-07T19:50:30.8127586Z /__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/arm/../arm/a64operand.h: In member function 'constexpr bool asmjit::_abi_1_13::a64::Vec::isVecB8() const': 2025-05-07T19:50:30.8131108Z /__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/arm/../arm/a64operand.h:132:111: warning: bitwise operation between different enumeration types 'asmjit::_abi_1_13::BaseReg::' and 'asmjit::_abi_1_13::arm::BaseVec::AdditionalBits' is deprecated [-Wdeprecated-enum-enum-conversion] 2025-05-07T19:50:30.8134797Z 132 | ASMJIT_INLINE_NODEBUG constexpr bool isVecB8() const noexcept { return _signature.subset(kBaseSignatureMask | kSignatureRegElementTypeMask) == (RegTraits::kSignature | kSignatureElementB); } 2025-05-07T19:50:30.8136710Z | ~~~~~~~~~~~~~~~~~~~^~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 2025-05-07T19:50:30.8138223Z /__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/arm/../arm/a64operand.h: In member function 'constexpr bool asmjit::_abi_1_13::a64::Vec::isVecH4() const': 2025-05-07T19:50:30.8141464Z /__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/arm/../arm/a64operand.h:133:111: warning: bitwise operation between different enumeration types 'asmjit::_abi_1_13::BaseReg::' and 'asmjit::_abi_1_13::arm::BaseVec::AdditionalBits' is deprecated [-Wdeprecated-enum-enum-conversion] 2025-05-07T19:50:30.8145207Z 133 | ASMJIT_INLINE_NODEBUG constexpr bool isVecH4() const noexcept { return _signature.subset(kBaseSignatureMask | kSignatureRegElementTypeMask) == (RegTraits::kSignature | kSignatureElementH); } 2025-05-07T19:50:30.8147097Z | ~~~~~~~~~~~~~~~~~~~^~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 2025-05-07T19:50:30.8148569Z /__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/arm/../arm/a64operand.h: In member function 'constexpr bool asmjit::_abi_1_13::a64::Vec::isVecS2() const': 2025-05-07T19:50:30.8152017Z /__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/arm/../arm/a64operand.h:134:111: warning: bitwise operation between different enumeration types 'asmjit::_abi_1_13::BaseReg::' and 'asmjit::_abi_1_13::arm::BaseVec::AdditionalBits' is deprecated [-Wdeprecated-enum-enum-conversion] 2025-05-07T19:50:30.8155841Z 134 | ASMJIT_INLINE_NODEBUG constexpr bool isVecS2() const noexcept { return _signature.subset(kBaseSignatureMask | kSignatureRegElementTypeMask) == (RegTraits::kSignature | kSignatureElementS); } 2025-05-07T19:50:30.8157729Z | ~~~~~~~~~~~~~~~~~~~^~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 2025-05-07T19:50:30.8159356Z /__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/arm/../arm/a64operand.h: In member function 'constexpr bool asmjit::_abi_1_13::a64::Vec::isVecD1() const': 2025-05-07T19:50:30.8162699Z /__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/arm/../arm/a64operand.h:135:111: warning: bitwise operation between different enumeration types 'asmjit::_abi_1_13::BaseReg::' and 'asmjit::_abi_1_13::arm::BaseVec::AdditionalBits' is deprecated [-Wdeprecated-enum-enum-conversion] 2025-05-07T19:50:30.8166423Z 135 | ASMJIT_INLINE_NODEBUG constexpr bool isVecD1() const noexcept { return _signature.subset(kBaseSignatureMask | kSignatureRegElementTypeMask) == (RegTraits::kSignature); } 2025-05-07T19:50:30.8168312Z | ~~~~~~~~~~~~~~~~~~~^~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 2025-05-07T19:50:30.8169822Z /__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/arm/../arm/a64operand.h: In member function 'constexpr bool asmjit::_abi_1_13::a64::Vec::isVecB16() const': 2025-05-07T19:50:30.8173135Z /__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/arm/../arm/a64operand.h:137:112: warning: bitwise operation between different enumeration types 'asmjit::_abi_1_13::BaseReg::' and 'asmjit::_abi_1_13::arm::BaseVec::AdditionalBits' is deprecated [-Wdeprecated-enum-enum-conversion] 2025-05-07T19:50:30.8177002Z 137 | ASMJIT_INLINE_NODEBUG constexpr bool isVecB16() const noexcept { return _signature.subset(kBaseSignatureMask | kSignatureRegElementTypeMask) == (RegTraits::kSignature | kSignatureElementB); } 2025-05-07T19:50:30.8178961Z | ~~~~~~~~~~~~~~~~~~~^~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 2025-05-07T19:50:30.8180448Z /__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/arm/../arm/a64operand.h: In member function 'constexpr bool asmjit::_abi_1_13::a64::Vec::isVecH8() const': 2025-05-07T19:50:30.8183697Z /__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/arm/../arm/a64operand.h:138:111: warning: bitwise operation between different enumeration types 'asmjit::_abi_1_13::BaseReg::' and 'asmjit::_abi_1_13::arm::BaseVec::AdditionalBits' is deprecated [-Wdeprecated-enum-enum-conversion] 2025-05-07T19:50:30.8187466Z 138 | ASMJIT_INLINE_NODEBUG constexpr bool isVecH8() const noexcept { return _signature.subset(kBaseSignatureMask | kSignatureRegElementTypeMask) == (RegTraits::kSignature | kSignatureElementH); } 2025-05-07T19:50:30.8189376Z | ~~~~~~~~~~~~~~~~~~~^~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 2025-05-07T19:50:30.8190865Z /__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/arm/../arm/a64operand.h: In member function 'constexpr bool asmjit::_abi_1_13::a64::Vec::isVecS4() const': 2025-05-07T19:50:30.8194068Z /__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/arm/../arm/a64operand.h:139:111: warning: bitwise operation between different enumeration types 'asmjit::_abi_1_13::BaseReg::' and 'asmjit::_abi_1_13::arm::BaseVec::AdditionalBits' is deprecated [-Wdeprecated-enum-enum-conversion] 2025-05-07T19:50:30.8197734Z 139 | ASMJIT_INLINE_NODEBUG constexpr bool isVecS4() const noexcept { return _signature.subset(kBaseSignatureMask | kSignatureRegElementTypeMask) == (RegTraits::kSignature | kSignatureElementS); } 2025-05-07T19:50:30.8199883Z | ~~~~~~~~~~~~~~~~~~~^~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 2025-05-07T19:50:30.8201353Z /__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/arm/../arm/a64operand.h: In member function 'constexpr bool asmjit::_abi_1_13::a64::Vec::isVecD2() const': 2025-05-07T19:50:30.8204618Z /__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/arm/../arm/a64operand.h:140:111: warning: bitwise operation between different enumeration types 'asmjit::_abi_1_13::BaseReg::' and 'asmjit::_abi_1_13::arm::BaseVec::AdditionalBits' is deprecated [-Wdeprecated-enum-enum-conversion] 2025-05-07T19:50:30.8208342Z 140 | ASMJIT_INLINE_NODEBUG constexpr bool isVecD2() const noexcept { return _signature.subset(kBaseSignatureMask | kSignatureRegElementTypeMask) == (RegTraits::kSignature | kSignatureElementD); } 2025-05-07T19:50:30.8210586Z | ~~~~~~~~~~~~~~~~~~~^~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 2025-05-07T19:50:30.8212090Z /__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/arm/../arm/a64operand.h: In member function 'constexpr bool asmjit::_abi_1_13::a64::Vec::isVecB4x4() const': 2025-05-07T19:50:30.8215256Z /__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/arm/../arm/a64operand.h:141:113: warning: bitwise operation between different enumeration types 'asmjit::_abi_1_13::BaseReg::' and 'asmjit::_abi_1_13::arm::BaseVec::AdditionalBits' is deprecated [-Wdeprecated-enum-enum-conversion] 2025-05-07T19:50:30.8218997Z 141 | ASMJIT_INLINE_NODEBUG constexpr bool isVecB4x4() const noexcept { return _signature.subset(kBaseSignatureMask | kSignatureRegElementTypeMask) == (RegTraits::kSignature | kSignatureElementB4); } 2025-05-07T19:50:30.8220959Z | ~~~~~~~~~~~~~~~~~~~^~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 2025-05-07T19:50:30.8222511Z /__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/arm/../arm/a64operand.h: In member function 'constexpr bool asmjit::_abi_1_13::a64::Vec::isVecH2x4() const': 2025-05-07T19:50:30.8225750Z /__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/arm/../arm/a64operand.h:142:113: warning: bitwise operation between different enumeration types 'asmjit::_abi_1_13::BaseReg::' and 'asmjit::_abi_1_13::arm::BaseVec::AdditionalBits' is deprecated [-Wdeprecated-enum-enum-conversion] 2025-05-07T19:50:30.8229663Z 142 | ASMJIT_INLINE_NODEBUG constexpr bool isVecH2x4() const noexcept { return _signature.subset(kBaseSignatureMask | kSignatureRegElementTypeMask) == (RegTraits::kSignature | kSignatureElementH2); } 2025-05-07T19:50:30.8231626Z | ~~~~~~~~~~~~~~~~~~~^~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 2025-05-07T19:50:30.8232285Z At global scope: 2025-05-07T19:50:30.8233601Z cc1plus: note: unrecognized command-line option '-Wno-deprecated-anon-enum-enum-conversion' may have been intended to silence earlier diagnostics 2025-05-07T19:50:30.8307892Z [17/608] /github/home/miniconda/envs/build_binary/bin/x86_64-conda-linux-gnu-c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dasmjit_EXPORTS -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/lib -O3 -DNDEBUG -std=c++20 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -MD -MT CMakeFiles/asmjit.dir/__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/core/instdb.cpp.o -MF CMakeFiles/asmjit.dir/__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/core/instdb.cpp.o.d -o CMakeFiles/asmjit.dir/__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/core/instdb.cpp.o -c /__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/core/instdb.cpp 2025-05-07T19:50:30.8327151Z [18/608] /github/home/miniconda/envs/build_binary/bin/x86_64-conda-linux-gnu-c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dasmjit_EXPORTS -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/lib -O3 -DNDEBUG -std=c++20 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -MD -MT CMakeFiles/asmjit.dir/__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/core/logger.cpp.o -MF CMakeFiles/asmjit.dir/__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/core/logger.cpp.o.d -o CMakeFiles/asmjit.dir/__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/core/logger.cpp.o -c /__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/core/logger.cpp 2025-05-07T19:50:30.8585210Z [19/608] /github/home/miniconda/envs/build_binary/bin/x86_64-conda-linux-gnu-c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dasmjit_EXPORTS -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/lib -O3 -DNDEBUG -std=c++20 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -MD -MT CMakeFiles/asmjit.dir/__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/core/constpool.cpp.o -MF CMakeFiles/asmjit.dir/__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/core/constpool.cpp.o.d -o CMakeFiles/asmjit.dir/__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/core/constpool.cpp.o -c /__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/core/constpool.cpp 2025-05-07T19:50:30.8660468Z [20/608] /github/home/miniconda/envs/build_binary/bin/x86_64-conda-linux-gnu-c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dasmjit_EXPORTS -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/lib -O3 -DNDEBUG -std=c++20 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -MD -MT CMakeFiles/asmjit.dir/__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/core/jitruntime.cpp.o -MF CMakeFiles/asmjit.dir/__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/core/jitruntime.cpp.o.d -o CMakeFiles/asmjit.dir/__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/core/jitruntime.cpp.o -c /__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/core/jitruntime.cpp 2025-05-07T19:50:30.8680995Z [21/608] /github/home/miniconda/envs/build_binary/bin/x86_64-conda-linux-gnu-c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dasmjit_EXPORTS -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/lib -O3 -DNDEBUG -std=c++20 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -MD -MT CMakeFiles/asmjit.dir/__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/core/zonevector.cpp.o -MF CMakeFiles/asmjit.dir/__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/core/zonevector.cpp.o.d -o CMakeFiles/asmjit.dir/__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/core/zonevector.cpp.o -c /__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/core/zonevector.cpp 2025-05-07T19:50:30.8776284Z [22/608] /github/home/miniconda/envs/build_binary/bin/x86_64-conda-linux-gnu-c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dasmjit_EXPORTS -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/lib -O3 -DNDEBUG -std=c++20 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -MD -MT CMakeFiles/asmjit.dir/__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/arm/a64func.cpp.o -MF CMakeFiles/asmjit.dir/__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/arm/a64func.cpp.o.d -o CMakeFiles/asmjit.dir/__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/arm/a64func.cpp.o -c /__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/arm/a64func.cpp 2025-05-07T19:50:30.8787188Z In file included from /__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/arm/a64func.cpp:10: 2025-05-07T19:50:30.8788983Z /__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/arm/../arm/a64operand.h: In member function 'constexpr bool asmjit::_abi_1_13::a64::Vec::isVecB8() const': 2025-05-07T19:50:30.8791926Z /__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/arm/../arm/a64operand.h:132:111: warning: bitwise operation between different enumeration types 'asmjit::_abi_1_13::BaseReg::' and 'asmjit::_abi_1_13::arm::BaseVec::AdditionalBits' is deprecated [-Wdeprecated-enum-enum-conversion] 2025-05-07T19:50:30.8795431Z 132 | ASMJIT_INLINE_NODEBUG constexpr bool isVecB8() const noexcept { return _signature.subset(kBaseSignatureMask | kSignatureRegElementTypeMask) == (RegTraits::kSignature | kSignatureElementB); } 2025-05-07T19:50:30.8797397Z | ~~~~~~~~~~~~~~~~~~~^~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 2025-05-07T19:50:30.8798970Z /__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/arm/../arm/a64operand.h: In member function 'constexpr bool asmjit::_abi_1_13::a64::Vec::isVecH4() const': 2025-05-07T19:50:30.8802151Z /__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/arm/../arm/a64operand.h:133:111: warning: bitwise operation between different enumeration types 'asmjit::_abi_1_13::BaseReg::' and 'asmjit::_abi_1_13::arm::BaseVec::AdditionalBits' is deprecated [-Wdeprecated-enum-enum-conversion] 2025-05-07T19:50:30.8805905Z 133 | ASMJIT_INLINE_NODEBUG constexpr bool isVecH4() const noexcept { return _signature.subset(kBaseSignatureMask | kSignatureRegElementTypeMask) == (RegTraits::kSignature | kSignatureElementH); } 2025-05-07T19:50:30.8807910Z | ~~~~~~~~~~~~~~~~~~~^~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 2025-05-07T19:50:30.8809685Z /__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/arm/../arm/a64operand.h: In member function 'constexpr bool asmjit::_abi_1_13::a64::Vec::isVecS2() const': 2025-05-07T19:50:30.8813115Z /__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/arm/../arm/a64operand.h:134:111: warning: bitwise operation between different enumeration types 'asmjit::_abi_1_13::BaseReg::' and 'asmjit::_abi_1_13::arm::BaseVec::AdditionalBits' is deprecated [-Wdeprecated-enum-enum-conversion] 2025-05-07T19:50:30.8816875Z 134 | ASMJIT_INLINE_NODEBUG constexpr bool isVecS2() const noexcept { return _signature.subset(kBaseSignatureMask | kSignatureRegElementTypeMask) == (RegTraits::kSignature | kSignatureElementS); } 2025-05-07T19:50:30.8818850Z | ~~~~~~~~~~~~~~~~~~~^~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 2025-05-07T19:50:30.8820383Z /__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/arm/../arm/a64operand.h: In member function 'constexpr bool asmjit::_abi_1_13::a64::Vec::isVecD1() const': 2025-05-07T19:50:30.8823901Z /__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/arm/../arm/a64operand.h:135:111: warning: bitwise operation between different enumeration types 'asmjit::_abi_1_13::BaseReg::' and 'asmjit::_abi_1_13::arm::BaseVec::AdditionalBits' is deprecated [-Wdeprecated-enum-enum-conversion] 2025-05-07T19:50:30.8827489Z 135 | ASMJIT_INLINE_NODEBUG constexpr bool isVecD1() const noexcept { return _signature.subset(kBaseSignatureMask | kSignatureRegElementTypeMask) == (RegTraits::kSignature); } 2025-05-07T19:50:30.8829597Z | ~~~~~~~~~~~~~~~~~~~^~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 2025-05-07T19:50:30.8831153Z /__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/arm/../arm/a64operand.h: In member function 'constexpr bool asmjit::_abi_1_13::a64::Vec::isVecB16() const': 2025-05-07T19:50:30.8834445Z /__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/arm/../arm/a64operand.h:137:112: warning: bitwise operation between different enumeration types 'asmjit::_abi_1_13::BaseReg::' and 'asmjit::_abi_1_13::arm::BaseVec::AdditionalBits' is deprecated [-Wdeprecated-enum-enum-conversion] 2025-05-07T19:50:30.8838275Z 137 | ASMJIT_INLINE_NODEBUG constexpr bool isVecB16() const noexcept { return _signature.subset(kBaseSignatureMask | kSignatureRegElementTypeMask) == (RegTraits::kSignature | kSignatureElementB); } 2025-05-07T19:50:30.8840277Z | ~~~~~~~~~~~~~~~~~~~^~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 2025-05-07T19:50:30.8841819Z /__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/arm/../arm/a64operand.h: In member function 'constexpr bool asmjit::_abi_1_13::a64::Vec::isVecH8() const': 2025-05-07T19:50:30.8845139Z /__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/arm/../arm/a64operand.h:138:111: warning: bitwise operation between different enumeration types 'asmjit::_abi_1_13::BaseReg::' and 'asmjit::_abi_1_13::arm::BaseVec::AdditionalBits' is deprecated [-Wdeprecated-enum-enum-conversion] 2025-05-07T19:50:30.8848895Z 138 | ASMJIT_INLINE_NODEBUG constexpr bool isVecH8() const noexcept { return _signature.subset(kBaseSignatureMask | kSignatureRegElementTypeMask) == (RegTraits::kSignature | kSignatureElementH); } 2025-05-07T19:50:30.8851046Z | ~~~~~~~~~~~~~~~~~~~^~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 2025-05-07T19:50:30.8852550Z /__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/arm/../arm/a64operand.h: In member function 'constexpr bool asmjit::_abi_1_13::a64::Vec::isVecS4() const': 2025-05-07T19:50:30.8855800Z /__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/arm/../arm/a64operand.h:139:111: warning: bitwise operation between different enumeration types 'asmjit::_abi_1_13::BaseReg::' and 'asmjit::_abi_1_13::arm::BaseVec::AdditionalBits' is deprecated [-Wdeprecated-enum-enum-conversion] 2025-05-07T19:50:30.8859786Z 139 | ASMJIT_INLINE_NODEBUG constexpr bool isVecS4() const noexcept { return _signature.subset(kBaseSignatureMask | kSignatureRegElementTypeMask) == (RegTraits::kSignature | kSignatureElementS); } 2025-05-07T19:50:30.8861763Z | ~~~~~~~~~~~~~~~~~~~^~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 2025-05-07T19:50:30.8863288Z /__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/arm/../arm/a64operand.h: In member function 'constexpr bool asmjit::_abi_1_13::a64::Vec::isVecD2() const': 2025-05-07T19:50:30.8866605Z /__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/arm/../arm/a64operand.h:140:111: warning: bitwise operation between different enumeration types 'asmjit::_abi_1_13::BaseReg::' and 'asmjit::_abi_1_13::arm::BaseVec::AdditionalBits' is deprecated [-Wdeprecated-enum-enum-conversion] 2025-05-07T19:50:30.8870586Z 140 | ASMJIT_INLINE_NODEBUG constexpr bool isVecD2() const noexcept { return _signature.subset(kBaseSignatureMask | kSignatureRegElementTypeMask) == (RegTraits::kSignature | kSignatureElementD); } 2025-05-07T19:50:30.8872582Z | ~~~~~~~~~~~~~~~~~~~^~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 2025-05-07T19:50:30.8874151Z /__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/arm/../arm/a64operand.h: In member function 'constexpr bool asmjit::_abi_1_13::a64::Vec::isVecB4x4() const': 2025-05-07T19:50:30.8877398Z /__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/arm/../arm/a64operand.h:141:113: warning: bitwise operation between different enumeration types 'asmjit::_abi_1_13::BaseReg::' and 'asmjit::_abi_1_13::arm::BaseVec::AdditionalBits' is deprecated [-Wdeprecated-enum-enum-conversion] 2025-05-07T19:50:30.8881213Z 141 | ASMJIT_INLINE_NODEBUG constexpr bool isVecB4x4() const noexcept { return _signature.subset(kBaseSignatureMask | kSignatureRegElementTypeMask) == (RegTraits::kSignature | kSignatureElementB4); } 2025-05-07T19:50:30.8883264Z | ~~~~~~~~~~~~~~~~~~~^~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 2025-05-07T19:50:30.8884792Z /__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/arm/../arm/a64operand.h: In member function 'constexpr bool asmjit::_abi_1_13::a64::Vec::isVecH2x4() const': 2025-05-07T19:50:30.8888108Z /__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/arm/../arm/a64operand.h:142:113: warning: bitwise operation between different enumeration types 'asmjit::_abi_1_13::BaseReg::' and 'asmjit::_abi_1_13::arm::BaseVec::AdditionalBits' is deprecated [-Wdeprecated-enum-enum-conversion] 2025-05-07T19:50:30.8892072Z 142 | ASMJIT_INLINE_NODEBUG constexpr bool isVecH2x4() const noexcept { return _signature.subset(kBaseSignatureMask | kSignatureRegElementTypeMask) == (RegTraits::kSignature | kSignatureElementH2); } 2025-05-07T19:50:30.8894147Z | ~~~~~~~~~~~~~~~~~~~^~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 2025-05-07T19:50:30.8894765Z At global scope: 2025-05-07T19:50:30.8896038Z cc1plus: note: unrecognized command-line option '-Wno-deprecated-anon-enum-enum-conversion' may have been intended to silence earlier diagnostics 2025-05-07T19:50:30.8982436Z [23/608] /github/home/miniconda/envs/build_binary/bin/x86_64-conda-linux-gnu-c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dasmjit_EXPORTS -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/lib -O3 -DNDEBUG -std=c++20 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -MD -MT CMakeFiles/asmjit.dir/__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/core/rastack.cpp.o -MF CMakeFiles/asmjit.dir/__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/core/rastack.cpp.o.d -o CMakeFiles/asmjit.dir/__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/core/rastack.cpp.o -c /__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/core/rastack.cpp 2025-05-07T19:50:30.9001868Z [24/608] /github/home/miniconda/envs/build_binary/bin/x86_64-conda-linux-gnu-c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dasmjit_EXPORTS -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/lib -O3 -DNDEBUG -std=c++20 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -MD -MT CMakeFiles/asmjit.dir/__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/core/emitterutils.cpp.o -MF CMakeFiles/asmjit.dir/__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/core/emitterutils.cpp.o.d -o CMakeFiles/asmjit.dir/__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/core/emitterutils.cpp.o -c /__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/core/emitterutils.cpp 2025-05-07T19:50:30.9161412Z [25/608] /github/home/miniconda/envs/build_binary/bin/x86_64-conda-linux-gnu-c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dasmjit_EXPORTS -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/lib -O3 -DNDEBUG -std=c++20 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -MD -MT CMakeFiles/asmjit.dir/__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/core/codewriter.cpp.o -MF CMakeFiles/asmjit.dir/__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/core/codewriter.cpp.o.d -o CMakeFiles/asmjit.dir/__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/core/codewriter.cpp.o -c /__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/core/codewriter.cpp 2025-05-07T19:50:30.9242936Z [26/608] /github/home/miniconda/envs/build_binary/bin/x86_64-conda-linux-gnu-c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dasmjit_EXPORTS -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/lib -O3 -DNDEBUG -std=c++20 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -MD -MT CMakeFiles/asmjit.dir/__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/core/func.cpp.o -MF CMakeFiles/asmjit.dir/__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/core/func.cpp.o.d -o CMakeFiles/asmjit.dir/__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/core/func.cpp.o -c /__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/core/func.cpp 2025-05-07T19:50:30.9278283Z [27/608] /github/home/miniconda/envs/build_binary/bin/x86_64-conda-linux-gnu-c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dasmjit_EXPORTS -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/lib -O3 -DNDEBUG -std=c++20 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -MD -MT CMakeFiles/asmjit.dir/__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/arm/a64instdb.cpp.o -MF CMakeFiles/asmjit.dir/__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/arm/a64instdb.cpp.o.d -o CMakeFiles/asmjit.dir/__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/arm/a64instdb.cpp.o -c /__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/arm/a64instdb.cpp 2025-05-07T19:50:30.9288842Z In file included from /__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/arm/../arm/a64instdb_p.h:12, 2025-05-07T19:50:30.9290028Z from /__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/arm/a64instdb.cpp:11: 2025-05-07T19:50:30.9291826Z /__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/arm/../arm/../arm/a64operand.h: In member function 'constexpr bool asmjit::_abi_1_13::a64::Vec::isVecB8() const': 2025-05-07T19:50:30.9295211Z /__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/arm/../arm/../arm/a64operand.h:132:111: warning: bitwise operation between different enumeration types 'asmjit::_abi_1_13::BaseReg::' and 'asmjit::_abi_1_13::arm::BaseVec::AdditionalBits' is deprecated [-Wdeprecated-enum-enum-conversion] 2025-05-07T19:50:30.9298985Z 132 | ASMJIT_INLINE_NODEBUG constexpr bool isVecB8() const noexcept { return _signature.subset(kBaseSignatureMask | kSignatureRegElementTypeMask) == (RegTraits::kSignature | kSignatureElementB); } 2025-05-07T19:50:30.9301038Z | ~~~~~~~~~~~~~~~~~~~^~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 2025-05-07T19:50:30.9302608Z /__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/arm/../arm/../arm/a64operand.h: In member function 'constexpr bool asmjit::_abi_1_13::a64::Vec::isVecH4() const': 2025-05-07T19:50:30.9306065Z /__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/arm/../arm/../arm/a64operand.h:133:111: warning: bitwise operation between different enumeration types 'asmjit::_abi_1_13::BaseReg::' and 'asmjit::_abi_1_13::arm::BaseVec::AdditionalBits' is deprecated [-Wdeprecated-enum-enum-conversion] 2025-05-07T19:50:30.9309889Z 133 | ASMJIT_INLINE_NODEBUG constexpr bool isVecH4() const noexcept { return _signature.subset(kBaseSignatureMask | kSignatureRegElementTypeMask) == (RegTraits::kSignature | kSignatureElementH); } 2025-05-07T19:50:30.9311876Z | ~~~~~~~~~~~~~~~~~~~^~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 2025-05-07T19:50:30.9313526Z /__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/arm/../arm/../arm/a64operand.h: In member function 'constexpr bool asmjit::_abi_1_13::a64::Vec::isVecS2() const': 2025-05-07T19:50:30.9316747Z /__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/arm/../arm/../arm/a64operand.h:134:111: warning: bitwise operation between different enumeration types 'asmjit::_abi_1_13::BaseReg::' and 'asmjit::_abi_1_13::arm::BaseVec::AdditionalBits' is deprecated [-Wdeprecated-enum-enum-conversion] 2025-05-07T19:50:30.9320395Z 134 | ASMJIT_INLINE_NODEBUG constexpr bool isVecS2() const noexcept { return _signature.subset(kBaseSignatureMask | kSignatureRegElementTypeMask) == (RegTraits::kSignature | kSignatureElementS); } 2025-05-07T19:50:30.9322342Z | ~~~~~~~~~~~~~~~~~~~^~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 2025-05-07T19:50:30.9324154Z /__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/arm/../arm/../arm/a64operand.h: In member function 'constexpr bool asmjit::_abi_1_13::a64::Vec::isVecD1() const': 2025-05-07T19:50:30.9327384Z /__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/arm/../arm/../arm/a64operand.h:135:111: warning: bitwise operation between different enumeration types 'asmjit::_abi_1_13::BaseReg::' and 'asmjit::_abi_1_13::arm::BaseVec::AdditionalBits' is deprecated [-Wdeprecated-enum-enum-conversion] 2025-05-07T19:50:30.9331254Z 135 | ASMJIT_INLINE_NODEBUG constexpr bool isVecD1() const noexcept { return _signature.subset(kBaseSignatureMask | kSignatureRegElementTypeMask) == (RegTraits::kSignature); } 2025-05-07T19:50:30.9333112Z | ~~~~~~~~~~~~~~~~~~~^~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 2025-05-07T19:50:30.9334856Z /__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/arm/../arm/../arm/a64operand.h: In member function 'constexpr bool asmjit::_abi_1_13::a64::Vec::isVecB16() const': 2025-05-07T19:50:30.9338043Z /__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/arm/../arm/../arm/a64operand.h:137:112: warning: bitwise operation between different enumeration types 'asmjit::_abi_1_13::BaseReg::' and 'asmjit::_abi_1_13::arm::BaseVec::AdditionalBits' is deprecated [-Wdeprecated-enum-enum-conversion] 2025-05-07T19:50:30.9341816Z 137 | ASMJIT_INLINE_NODEBUG constexpr bool isVecB16() const noexcept { return _signature.subset(kBaseSignatureMask | kSignatureRegElementTypeMask) == (RegTraits::kSignature | kSignatureElementB); } 2025-05-07T19:50:30.9343772Z | ~~~~~~~~~~~~~~~~~~~^~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 2025-05-07T19:50:30.9345232Z /__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/arm/../arm/../arm/a64operand.h: In member function 'constexpr bool asmjit::_abi_1_13::a64::Vec::isVecH8() const': 2025-05-07T19:50:30.9348500Z /__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/arm/../arm/../arm/a64operand.h:138:111: warning: bitwise operation between different enumeration types 'asmjit::_abi_1_13::BaseReg::' and 'asmjit::_abi_1_13::arm::BaseVec::AdditionalBits' is deprecated [-Wdeprecated-enum-enum-conversion] 2025-05-07T19:50:30.9352083Z 138 | ASMJIT_INLINE_NODEBUG constexpr bool isVecH8() const noexcept { return _signature.subset(kBaseSignatureMask | kSignatureRegElementTypeMask) == (RegTraits::kSignature | kSignatureElementH); } 2025-05-07T19:50:30.9354082Z | ~~~~~~~~~~~~~~~~~~~^~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 2025-05-07T19:50:30.9355671Z /__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/arm/../arm/../arm/a64operand.h: In member function 'constexpr bool asmjit::_abi_1_13::a64::Vec::isVecS4() const': 2025-05-07T19:50:30.9358889Z /__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/arm/../arm/../arm/a64operand.h:139:111: warning: bitwise operation between different enumeration types 'asmjit::_abi_1_13::BaseReg::' and 'asmjit::_abi_1_13::arm::BaseVec::AdditionalBits' is deprecated [-Wdeprecated-enum-enum-conversion] 2025-05-07T19:50:30.9362620Z 139 | ASMJIT_INLINE_NODEBUG constexpr bool isVecS4() const noexcept { return _signature.subset(kBaseSignatureMask | kSignatureRegElementTypeMask) == (RegTraits::kSignature | kSignatureElementS); } 2025-05-07T19:50:30.9364562Z | ~~~~~~~~~~~~~~~~~~~^~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 2025-05-07T19:50:30.9366153Z /__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/arm/../arm/../arm/a64operand.h: In member function 'constexpr bool asmjit::_abi_1_13::a64::Vec::isVecD2() const': 2025-05-07T19:50:30.9369762Z /__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/arm/../arm/../arm/a64operand.h:140:111: warning: bitwise operation between different enumeration types 'asmjit::_abi_1_13::BaseReg::' and 'asmjit::_abi_1_13::arm::BaseVec::AdditionalBits' is deprecated [-Wdeprecated-enum-enum-conversion] 2025-05-07T19:50:30.9373647Z 140 | ASMJIT_INLINE_NODEBUG constexpr bool isVecD2() const noexcept { return _signature.subset(kBaseSignatureMask | kSignatureRegElementTypeMask) == (RegTraits::kSignature | kSignatureElementD); } 2025-05-07T19:50:30.9375647Z | ~~~~~~~~~~~~~~~~~~~^~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 2025-05-07T19:50:30.9377220Z /__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/arm/../arm/../arm/a64operand.h: In member function 'constexpr bool asmjit::_abi_1_13::a64::Vec::isVecB4x4() const': 2025-05-07T19:50:30.9380707Z /__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/arm/../arm/../arm/a64operand.h:141:113: warning: bitwise operation between different enumeration types 'asmjit::_abi_1_13::BaseReg::' and 'asmjit::_abi_1_13::arm::BaseVec::AdditionalBits' is deprecated [-Wdeprecated-enum-enum-conversion] 2025-05-07T19:50:30.9384767Z 141 | ASMJIT_INLINE_NODEBUG constexpr bool isVecB4x4() const noexcept { return _signature.subset(kBaseSignatureMask | kSignatureRegElementTypeMask) == (RegTraits::kSignature | kSignatureElementB4); } 2025-05-07T19:50:30.9386780Z | ~~~~~~~~~~~~~~~~~~~^~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 2025-05-07T19:50:30.9388384Z /__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/arm/../arm/../arm/a64operand.h: In member function 'constexpr bool asmjit::_abi_1_13::a64::Vec::isVecH2x4() const': 2025-05-07T19:50:30.9391853Z /__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/arm/../arm/../arm/a64operand.h:142:113: warning: bitwise operation between different enumeration types 'asmjit::_abi_1_13::BaseReg::' and 'asmjit::_abi_1_13::arm::BaseVec::AdditionalBits' is deprecated [-Wdeprecated-enum-enum-conversion] 2025-05-07T19:50:30.9395588Z 142 | ASMJIT_INLINE_NODEBUG constexpr bool isVecH2x4() const noexcept { return _signature.subset(kBaseSignatureMask | kSignatureRegElementTypeMask) == (RegTraits::kSignature | kSignatureElementH2); } 2025-05-07T19:50:30.9397521Z | ~~~~~~~~~~~~~~~~~~~^~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 2025-05-07T19:50:30.9398108Z At global scope: 2025-05-07T19:50:30.9399283Z cc1plus: note: unrecognized command-line option '-Wno-deprecated-anon-enum-enum-conversion' may have been intended to silence earlier diagnostics 2025-05-07T19:50:30.9409480Z [28/608] /github/home/miniconda/envs/build_binary/bin/x86_64-conda-linux-gnu-c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dasmjit_EXPORTS -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/lib -O3 -DNDEBUG -std=c++20 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -MD -MT CMakeFiles/asmjit.dir/__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/arm/a64instapi.cpp.o -MF CMakeFiles/asmjit.dir/__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/arm/a64instapi.cpp.o.d -o CMakeFiles/asmjit.dir/__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/arm/a64instapi.cpp.o -c /__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/arm/a64instapi.cpp 2025-05-07T19:50:30.9419696Z In file included from /__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/arm/../arm/a64instdb_p.h:12, 2025-05-07T19:50:30.9421141Z from /__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/arm/a64instapi.cpp:13: 2025-05-07T19:50:30.9422778Z /__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/arm/../arm/../arm/a64operand.h: In member function 'constexpr bool asmjit::_abi_1_13::a64::Vec::isVecB8() const': 2025-05-07T19:50:30.9426013Z /__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/arm/../arm/../arm/a64operand.h:132:111: warning: bitwise operation between different enumeration types 'asmjit::_abi_1_13::BaseReg::' and 'asmjit::_abi_1_13::arm::BaseVec::AdditionalBits' is deprecated [-Wdeprecated-enum-enum-conversion] 2025-05-07T19:50:30.9430084Z 132 | ASMJIT_INLINE_NODEBUG constexpr bool isVecB8() const noexcept { return _signature.subset(kBaseSignatureMask | kSignatureRegElementTypeMask) == (RegTraits::kSignature | kSignatureElementB); } 2025-05-07T19:50:30.9432041Z | ~~~~~~~~~~~~~~~~~~~^~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 2025-05-07T19:50:30.9433783Z /__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/arm/../arm/../arm/a64operand.h: In member function 'constexpr bool asmjit::_abi_1_13::a64::Vec::isVecH4() const': 2025-05-07T19:50:30.9437042Z /__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/arm/../arm/../arm/a64operand.h:133:111: warning: bitwise operation between different enumeration types 'asmjit::_abi_1_13::BaseReg::' and 'asmjit::_abi_1_13::arm::BaseVec::AdditionalBits' is deprecated [-Wdeprecated-enum-enum-conversion] 2025-05-07T19:50:30.9440720Z 133 | ASMJIT_INLINE_NODEBUG constexpr bool isVecH4() const noexcept { return _signature.subset(kBaseSignatureMask | kSignatureRegElementTypeMask) == (RegTraits::kSignature | kSignatureElementH); } 2025-05-07T19:50:30.9442727Z | ~~~~~~~~~~~~~~~~~~~^~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 2025-05-07T19:50:30.9444318Z /__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/arm/../arm/../arm/a64operand.h: In member function 'constexpr bool asmjit::_abi_1_13::a64::Vec::isVecS2() const': 2025-05-07T19:50:30.9447562Z /__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/arm/../arm/../arm/a64operand.h:134:111: warning: bitwise operation between different enumeration types 'asmjit::_abi_1_13::BaseReg::' and 'asmjit::_abi_1_13::arm::BaseVec::AdditionalBits' is deprecated [-Wdeprecated-enum-enum-conversion] 2025-05-07T19:50:30.9451435Z 134 | ASMJIT_INLINE_NODEBUG constexpr bool isVecS2() const noexcept { return _signature.subset(kBaseSignatureMask | kSignatureRegElementTypeMask) == (RegTraits::kSignature | kSignatureElementS); } 2025-05-07T19:50:30.9453388Z | ~~~~~~~~~~~~~~~~~~~^~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 2025-05-07T19:50:30.9454896Z /__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/arm/../arm/../arm/a64operand.h: In member function 'constexpr bool asmjit::_abi_1_13::a64::Vec::isVecD1() const': 2025-05-07T19:50:30.9458184Z /__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/arm/../arm/../arm/a64operand.h:135:111: warning: bitwise operation between different enumeration types 'asmjit::_abi_1_13::BaseReg::' and 'asmjit::_abi_1_13::arm::BaseVec::AdditionalBits' is deprecated [-Wdeprecated-enum-enum-conversion] 2025-05-07T19:50:30.9461693Z 135 | ASMJIT_INLINE_NODEBUG constexpr bool isVecD1() const noexcept { return _signature.subset(kBaseSignatureMask | kSignatureRegElementTypeMask) == (RegTraits::kSignature); } 2025-05-07T19:50:30.9463516Z | ~~~~~~~~~~~~~~~~~~~^~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 2025-05-07T19:50:30.9464977Z /__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/arm/../arm/../arm/a64operand.h: In member function 'constexpr bool asmjit::_abi_1_13::a64::Vec::isVecB16() const': 2025-05-07T19:50:30.9468507Z /__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/arm/../arm/../arm/a64operand.h:137:112: warning: bitwise operation between different enumeration types 'asmjit::_abi_1_13::BaseReg::' and 'asmjit::_abi_1_13::arm::BaseVec::AdditionalBits' is deprecated [-Wdeprecated-enum-enum-conversion] 2025-05-07T19:50:30.9472308Z 137 | ASMJIT_INLINE_NODEBUG constexpr bool isVecB16() const noexcept { return _signature.subset(kBaseSignatureMask | kSignatureRegElementTypeMask) == (RegTraits::kSignature | kSignatureElementB); } 2025-05-07T19:50:30.9474288Z | ~~~~~~~~~~~~~~~~~~~^~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 2025-05-07T19:50:30.9475830Z /__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/arm/../arm/../arm/a64operand.h: In member function 'constexpr bool asmjit::_abi_1_13::a64::Vec::isVecH8() const': 2025-05-07T19:50:30.9479135Z /__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/arm/../arm/../arm/a64operand.h:138:111: warning: bitwise operation between different enumeration types 'asmjit::_abi_1_13::BaseReg::' and 'asmjit::_abi_1_13::arm::BaseVec::AdditionalBits' is deprecated [-Wdeprecated-enum-enum-conversion] 2025-05-07T19:50:30.9483088Z 138 | ASMJIT_INLINE_NODEBUG constexpr bool isVecH8() const noexcept { return _signature.subset(kBaseSignatureMask | kSignatureRegElementTypeMask) == (RegTraits::kSignature | kSignatureElementH); } 2025-05-07T19:50:30.9485110Z | ~~~~~~~~~~~~~~~~~~~^~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 2025-05-07T19:50:30.9486654Z /__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/arm/../arm/../arm/a64operand.h: In member function 'constexpr bool asmjit::_abi_1_13::a64::Vec::isVecS4() const': 2025-05-07T19:50:30.9489951Z /__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/arm/../arm/../arm/a64operand.h:139:111: warning: bitwise operation between different enumeration types 'asmjit::_abi_1_13::BaseReg::' and 'asmjit::_abi_1_13::arm::BaseVec::AdditionalBits' is deprecated [-Wdeprecated-enum-enum-conversion] 2025-05-07T19:50:30.9493815Z 139 | ASMJIT_INLINE_NODEBUG constexpr bool isVecS4() const noexcept { return _signature.subset(kBaseSignatureMask | kSignatureRegElementTypeMask) == (RegTraits::kSignature | kSignatureElementS); } 2025-05-07T19:50:30.9495789Z | ~~~~~~~~~~~~~~~~~~~^~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 2025-05-07T19:50:30.9497349Z /__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/arm/../arm/../arm/a64operand.h: In member function 'constexpr bool asmjit::_abi_1_13::a64::Vec::isVecD2() const': 2025-05-07T19:50:30.9500744Z /__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/arm/../arm/../arm/a64operand.h:140:111: warning: bitwise operation between different enumeration types 'asmjit::_abi_1_13::BaseReg::' and 'asmjit::_abi_1_13::arm::BaseVec::AdditionalBits' is deprecated [-Wdeprecated-enum-enum-conversion] 2025-05-07T19:50:30.9504581Z 140 | ASMJIT_INLINE_NODEBUG constexpr bool isVecD2() const noexcept { return _signature.subset(kBaseSignatureMask | kSignatureRegElementTypeMask) == (RegTraits::kSignature | kSignatureElementD); } 2025-05-07T19:50:30.9506508Z | ~~~~~~~~~~~~~~~~~~~^~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 2025-05-07T19:50:30.9508124Z /__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/arm/../arm/../arm/a64operand.h: In member function 'constexpr bool asmjit::_abi_1_13::a64::Vec::isVecB4x4() const': 2025-05-07T19:50:30.9511565Z /__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/arm/../arm/../arm/a64operand.h:141:113: warning: bitwise operation between different enumeration types 'asmjit::_abi_1_13::BaseReg::' and 'asmjit::_abi_1_13::arm::BaseVec::AdditionalBits' is deprecated [-Wdeprecated-enum-enum-conversion] 2025-05-07T19:50:30.9515648Z 141 | ASMJIT_INLINE_NODEBUG constexpr bool isVecB4x4() const noexcept { return _signature.subset(kBaseSignatureMask | kSignatureRegElementTypeMask) == (RegTraits::kSignature | kSignatureElementB4); } 2025-05-07T19:50:30.9517693Z | ~~~~~~~~~~~~~~~~~~~^~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 2025-05-07T19:50:30.9519299Z /__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/arm/../arm/../arm/a64operand.h: In member function 'constexpr bool asmjit::_abi_1_13::a64::Vec::isVecH2x4() const': 2025-05-07T19:50:30.9522557Z /__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/arm/../arm/../arm/a64operand.h:142:113: warning: bitwise operation between different enumeration types 'asmjit::_abi_1_13::BaseReg::' and 'asmjit::_abi_1_13::arm::BaseVec::AdditionalBits' is deprecated [-Wdeprecated-enum-enum-conversion] 2025-05-07T19:50:30.9526291Z 142 | ASMJIT_INLINE_NODEBUG constexpr bool isVecH2x4() const noexcept { return _signature.subset(kBaseSignatureMask | kSignatureRegElementTypeMask) == (RegTraits::kSignature | kSignatureElementH2); } 2025-05-07T19:50:30.9528708Z | ~~~~~~~~~~~~~~~~~~~^~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 2025-05-07T19:50:30.9529331Z At global scope: 2025-05-07T19:50:30.9530617Z cc1plus: note: unrecognized command-line option '-Wno-deprecated-anon-enum-enum-conversion' may have been intended to silence earlier diagnostics 2025-05-07T19:50:30.9541696Z [29/608] /github/home/miniconda/envs/build_binary/bin/x86_64-conda-linux-gnu-c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dasmjit_EXPORTS -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/lib -O3 -DNDEBUG -std=c++20 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -MD -MT CMakeFiles/asmjit.dir/__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/core/funcargscontext.cpp.o -MF CMakeFiles/asmjit.dir/__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/core/funcargscontext.cpp.o.d -o CMakeFiles/asmjit.dir/__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/core/funcargscontext.cpp.o -c /__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/core/funcargscontext.cpp 2025-05-07T19:50:30.9561337Z [30/608] /github/home/miniconda/envs/build_binary/bin/x86_64-conda-linux-gnu-c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dasmjit_EXPORTS -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/lib -O3 -DNDEBUG -std=c++20 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -MD -MT CMakeFiles/asmjit.dir/__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/core/emitter.cpp.o -MF CMakeFiles/asmjit.dir/__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/core/emitter.cpp.o.d -o CMakeFiles/asmjit.dir/__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/core/emitter.cpp.o -c /__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/core/emitter.cpp 2025-05-07T19:50:30.9584416Z [31/608] /github/home/miniconda/envs/build_binary/bin/x86_64-conda-linux-gnu-c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dasmjit_EXPORTS -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/lib -O3 -DNDEBUG -std=c++20 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -MD -MT CMakeFiles/asmjit.dir/__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/arm/a64formatter.cpp.o -MF CMakeFiles/asmjit.dir/__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/arm/a64formatter.cpp.o.d -o CMakeFiles/asmjit.dir/__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/arm/a64formatter.cpp.o -c /__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/arm/a64formatter.cpp 2025-05-07T19:50:30.9594988Z In file included from /__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/arm/../arm/a64instdb_p.h:12, 2025-05-07T19:50:30.9596273Z from /__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/arm/a64formatter.cpp:13: 2025-05-07T19:50:30.9597980Z /__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/arm/../arm/../arm/a64operand.h: In member function 'constexpr bool asmjit::_abi_1_13::a64::Vec::isVecB8() const': 2025-05-07T19:50:30.9601320Z /__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/arm/../arm/../arm/a64operand.h:132:111: warning: bitwise operation between different enumeration types 'asmjit::_abi_1_13::BaseReg::' and 'asmjit::_abi_1_13::arm::BaseVec::AdditionalBits' is deprecated [-Wdeprecated-enum-enum-conversion] 2025-05-07T19:50:30.9605150Z 132 | ASMJIT_INLINE_NODEBUG constexpr bool isVecB8() const noexcept { return _signature.subset(kBaseSignatureMask | kSignatureRegElementTypeMask) == (RegTraits::kSignature | kSignatureElementB); } 2025-05-07T19:50:30.9607136Z | ~~~~~~~~~~~~~~~~~~~^~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 2025-05-07T19:50:30.9608731Z /__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/arm/../arm/../arm/a64operand.h: In member function 'constexpr bool asmjit::_abi_1_13::a64::Vec::isVecH4() const': 2025-05-07T19:50:30.9612197Z /__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/arm/../arm/../arm/a64operand.h:133:111: warning: bitwise operation between different enumeration types 'asmjit::_abi_1_13::BaseReg::' and 'asmjit::_abi_1_13::arm::BaseVec::AdditionalBits' is deprecated [-Wdeprecated-enum-enum-conversion] 2025-05-07T19:50:30.9616039Z 133 | ASMJIT_INLINE_NODEBUG constexpr bool isVecH4() const noexcept { return _signature.subset(kBaseSignatureMask | kSignatureRegElementTypeMask) == (RegTraits::kSignature | kSignatureElementH); } 2025-05-07T19:50:30.9618049Z | ~~~~~~~~~~~~~~~~~~~^~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 2025-05-07T19:50:30.9641914Z /__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/arm/../arm/../arm/a64operand.h: In member function 'constexpr bool asmjit::_abi_1_13::a64::Vec::isVecS2() const': 2025-05-07T19:50:30.9645328Z /__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/arm/../arm/../arm/a64operand.h:134:111: warning: bitwise operation between different enumeration types 'asmjit::_abi_1_13::BaseReg::' and 'asmjit::_abi_1_13::arm::BaseVec::AdditionalBits' is deprecated [-Wdeprecated-enum-enum-conversion] 2025-05-07T19:50:30.9649134Z 134 | ASMJIT_INLINE_NODEBUG constexpr bool isVecS2() const noexcept { return _signature.subset(kBaseSignatureMask | kSignatureRegElementTypeMask) == (RegTraits::kSignature | kSignatureElementS); } 2025-05-07T19:50:30.9651237Z | ~~~~~~~~~~~~~~~~~~~^~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 2025-05-07T19:50:30.9653114Z /__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/arm/../arm/../arm/a64operand.h: In member function 'constexpr bool asmjit::_abi_1_13::a64::Vec::isVecD1() const': 2025-05-07T19:50:30.9656631Z /__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/arm/../arm/../arm/a64operand.h:135:111: warning: bitwise operation between different enumeration types 'asmjit::_abi_1_13::BaseReg::' and 'asmjit::_abi_1_13::arm::BaseVec::AdditionalBits' is deprecated [-Wdeprecated-enum-enum-conversion] 2025-05-07T19:50:30.9660295Z 135 | ASMJIT_INLINE_NODEBUG constexpr bool isVecD1() const noexcept { return _signature.subset(kBaseSignatureMask | kSignatureRegElementTypeMask) == (RegTraits::kSignature); } 2025-05-07T19:50:30.9662115Z | ~~~~~~~~~~~~~~~~~~~^~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 2025-05-07T19:50:30.9663720Z /__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/arm/../arm/../arm/a64operand.h: In member function 'constexpr bool asmjit::_abi_1_13::a64::Vec::isVecB16() const': 2025-05-07T19:50:30.9667353Z /__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/arm/../arm/../arm/a64operand.h:137:112: warning: bitwise operation between different enumeration types 'asmjit::_abi_1_13::BaseReg::' and 'asmjit::_abi_1_13::arm::BaseVec::AdditionalBits' is deprecated [-Wdeprecated-enum-enum-conversion] 2025-05-07T19:50:30.9671242Z 137 | ASMJIT_INLINE_NODEBUG constexpr bool isVecB16() const noexcept { return _signature.subset(kBaseSignatureMask | kSignatureRegElementTypeMask) == (RegTraits::kSignature | kSignatureElementB); } 2025-05-07T19:50:30.9673312Z | ~~~~~~~~~~~~~~~~~~~^~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 2025-05-07T19:50:30.9674889Z /__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/arm/../arm/../arm/a64operand.h: In member function 'constexpr bool asmjit::_abi_1_13::a64::Vec::isVecH8() const': 2025-05-07T19:50:30.9678379Z /__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/arm/../arm/../arm/a64operand.h:138:111: warning: bitwise operation between different enumeration types 'asmjit::_abi_1_13::BaseReg::' and 'asmjit::_abi_1_13::arm::BaseVec::AdditionalBits' is deprecated [-Wdeprecated-enum-enum-conversion] 2025-05-07T19:50:30.9682306Z 138 | ASMJIT_INLINE_NODEBUG constexpr bool isVecH8() const noexcept { return _signature.subset(kBaseSignatureMask | kSignatureRegElementTypeMask) == (RegTraits::kSignature | kSignatureElementH); } 2025-05-07T19:50:30.9684237Z | ~~~~~~~~~~~~~~~~~~~^~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 2025-05-07T19:50:30.9685764Z /__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/arm/../arm/../arm/a64operand.h: In member function 'constexpr bool asmjit::_abi_1_13::a64::Vec::isVecS4() const': 2025-05-07T19:50:30.9689208Z /__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/arm/../arm/../arm/a64operand.h:139:111: warning: bitwise operation between different enumeration types 'asmjit::_abi_1_13::BaseReg::' and 'asmjit::_abi_1_13::arm::BaseVec::AdditionalBits' is deprecated [-Wdeprecated-enum-enum-conversion] 2025-05-07T19:50:30.9693187Z 139 | ASMJIT_INLINE_NODEBUG constexpr bool isVecS4() const noexcept { return _signature.subset(kBaseSignatureMask | kSignatureRegElementTypeMask) == (RegTraits::kSignature | kSignatureElementS); } 2025-05-07T19:50:30.9695234Z | ~~~~~~~~~~~~~~~~~~~^~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 2025-05-07T19:50:30.9696807Z /__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/arm/../arm/../arm/a64operand.h: In member function 'constexpr bool asmjit::_abi_1_13::a64::Vec::isVecD2() const': 2025-05-07T19:50:30.9700493Z /__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/arm/../arm/../arm/a64operand.h:140:111: warning: bitwise operation between different enumeration types 'asmjit::_abi_1_13::BaseReg::' and 'asmjit::_abi_1_13::arm::BaseVec::AdditionalBits' is deprecated [-Wdeprecated-enum-enum-conversion] 2025-05-07T19:50:30.9704382Z 140 | ASMJIT_INLINE_NODEBUG constexpr bool isVecD2() const noexcept { return _signature.subset(kBaseSignatureMask | kSignatureRegElementTypeMask) == (RegTraits::kSignature | kSignatureElementD); } 2025-05-07T19:50:30.9706431Z | ~~~~~~~~~~~~~~~~~~~^~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 2025-05-07T19:50:30.9708029Z /__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/arm/../arm/../arm/a64operand.h: In member function 'constexpr bool asmjit::_abi_1_13::a64::Vec::isVecB4x4() const': 2025-05-07T19:50:30.9711318Z /__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/arm/../arm/../arm/a64operand.h:141:113: warning: bitwise operation between different enumeration types 'asmjit::_abi_1_13::BaseReg::' and 'asmjit::_abi_1_13::arm::BaseVec::AdditionalBits' is deprecated [-Wdeprecated-enum-enum-conversion] 2025-05-07T19:50:30.9715271Z 141 | ASMJIT_INLINE_NODEBUG constexpr bool isVecB4x4() const noexcept { return _signature.subset(kBaseSignatureMask | kSignatureRegElementTypeMask) == (RegTraits::kSignature | kSignatureElementB4); } 2025-05-07T19:50:30.9717261Z | ~~~~~~~~~~~~~~~~~~~^~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 2025-05-07T19:50:30.9718877Z /__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/arm/../arm/../arm/a64operand.h: In member function 'constexpr bool asmjit::_abi_1_13::a64::Vec::isVecH2x4() const': 2025-05-07T19:50:30.9722197Z /__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/arm/../arm/../arm/a64operand.h:142:113: warning: bitwise operation between different enumeration types 'asmjit::_abi_1_13::BaseReg::' and 'asmjit::_abi_1_13::arm::BaseVec::AdditionalBits' is deprecated [-Wdeprecated-enum-enum-conversion] 2025-05-07T19:50:30.9726021Z 142 | ASMJIT_INLINE_NODEBUG constexpr bool isVecH2x4() const noexcept { return _signature.subset(kBaseSignatureMask | kSignatureRegElementTypeMask) == (RegTraits::kSignature | kSignatureElementH2); } 2025-05-07T19:50:30.9728017Z | ~~~~~~~~~~~~~~~~~~~^~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 2025-05-07T19:50:30.9728903Z At global scope: 2025-05-07T19:50:30.9730220Z cc1plus: note: unrecognized command-line option '-Wno-deprecated-anon-enum-enum-conversion' may have been intended to silence earlier diagnostics 2025-05-07T19:50:30.9960706Z [32/608] /github/home/miniconda/envs/build_binary/bin/x86_64-conda-linux-gnu-c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dasmjit_EXPORTS -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/lib -O3 -DNDEBUG -std=c++20 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -MD -MT CMakeFiles/asmjit.dir/__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/core/archtraits.cpp.o -MF CMakeFiles/asmjit.dir/__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/core/archtraits.cpp.o.d -o CMakeFiles/asmjit.dir/__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/core/archtraits.cpp.o -c /__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/core/archtraits.cpp 2025-05-07T19:50:30.9971192Z In file included from /__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/core/../arm/a64archtraits_p.h:13, 2025-05-07T19:50:30.9972444Z from /__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/core/archtraits.cpp:16: 2025-05-07T19:50:30.9974459Z /__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/core/../arm/../arm/a64operand.h: In member function 'constexpr bool asmjit::_abi_1_13::a64::Vec::isVecB8() const': 2025-05-07T19:50:30.9977631Z /__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/core/../arm/../arm/a64operand.h:132:111: warning: bitwise operation between different enumeration types 'asmjit::_abi_1_13::BaseReg::' and 'asmjit::_abi_1_13::arm::BaseVec::AdditionalBits' is deprecated [-Wdeprecated-enum-enum-conversion] 2025-05-07T19:50:30.9981238Z 132 | ASMJIT_INLINE_NODEBUG constexpr bool isVecB8() const noexcept { return _signature.subset(kBaseSignatureMask | kSignatureRegElementTypeMask) == (RegTraits::kSignature | kSignatureElementB); } 2025-05-07T19:50:30.9983111Z | ~~~~~~~~~~~~~~~~~~~^~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 2025-05-07T19:50:30.9984764Z /__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/core/../arm/../arm/a64operand.h: In member function 'constexpr bool asmjit::_abi_1_13::a64::Vec::isVecH4() const': 2025-05-07T19:50:30.9987930Z /__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/core/../arm/../arm/a64operand.h:133:111: warning: bitwise operation between different enumeration types 'asmjit::_abi_1_13::BaseReg::' and 'asmjit::_abi_1_13::arm::BaseVec::AdditionalBits' is deprecated [-Wdeprecated-enum-enum-conversion] 2025-05-07T19:50:30.9991662Z 133 | ASMJIT_INLINE_NODEBUG constexpr bool isVecH4() const noexcept { return _signature.subset(kBaseSignatureMask | kSignatureRegElementTypeMask) == (RegTraits::kSignature | kSignatureElementH); } 2025-05-07T19:50:30.9993545Z | ~~~~~~~~~~~~~~~~~~~^~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 2025-05-07T19:50:30.9995110Z /__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/core/../arm/../arm/a64operand.h: In member function 'constexpr bool asmjit::_abi_1_13::a64::Vec::isVecS2() const': 2025-05-07T19:50:30.9998364Z /__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/core/../arm/../arm/a64operand.h:134:111: warning: bitwise operation between different enumeration types 'asmjit::_abi_1_13::BaseReg::' and 'asmjit::_abi_1_13::arm::BaseVec::AdditionalBits' is deprecated [-Wdeprecated-enum-enum-conversion] 2025-05-07T19:50:31.0002124Z 134 | ASMJIT_INLINE_NODEBUG constexpr bool isVecS2() const noexcept { return _signature.subset(kBaseSignatureMask | kSignatureRegElementTypeMask) == (RegTraits::kSignature | kSignatureElementS); } 2025-05-07T19:50:31.0004068Z | ~~~~~~~~~~~~~~~~~~~^~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 2025-05-07T19:50:31.0005621Z /__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/core/../arm/../arm/a64operand.h: In member function 'constexpr bool asmjit::_abi_1_13::a64::Vec::isVecD1() const': 2025-05-07T19:50:31.0009005Z /__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/core/../arm/../arm/a64operand.h:135:111: warning: bitwise operation between different enumeration types 'asmjit::_abi_1_13::BaseReg::' and 'asmjit::_abi_1_13::arm::BaseVec::AdditionalBits' is deprecated [-Wdeprecated-enum-enum-conversion] 2025-05-07T19:50:31.0012735Z 135 | ASMJIT_INLINE_NODEBUG constexpr bool isVecD1() const noexcept { return _signature.subset(kBaseSignatureMask | kSignatureRegElementTypeMask) == (RegTraits::kSignature); } 2025-05-07T19:50:31.0014542Z | ~~~~~~~~~~~~~~~~~~~^~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 2025-05-07T19:50:31.0016062Z /__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/core/../arm/../arm/a64operand.h: In member function 'constexpr bool asmjit::_abi_1_13::a64::Vec::isVecB16() const': 2025-05-07T19:50:31.0019620Z /__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/core/../arm/../arm/a64operand.h:137:112: warning: bitwise operation between different enumeration types 'asmjit::_abi_1_13::BaseReg::' and 'asmjit::_abi_1_13::arm::BaseVec::AdditionalBits' is deprecated [-Wdeprecated-enum-enum-conversion] 2025-05-07T19:50:31.0023378Z 137 | ASMJIT_INLINE_NODEBUG constexpr bool isVecB16() const noexcept { return _signature.subset(kBaseSignatureMask | kSignatureRegElementTypeMask) == (RegTraits::kSignature | kSignatureElementB); } 2025-05-07T19:50:31.0025319Z | ~~~~~~~~~~~~~~~~~~~^~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 2025-05-07T19:50:31.0026893Z /__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/core/../arm/../arm/a64operand.h: In member function 'constexpr bool asmjit::_abi_1_13::a64::Vec::isVecH8() const': 2025-05-07T19:50:31.0030465Z /__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/core/../arm/../arm/a64operand.h:138:111: warning: bitwise operation between different enumeration types 'asmjit::_abi_1_13::BaseReg::' and 'asmjit::_abi_1_13::arm::BaseVec::AdditionalBits' is deprecated [-Wdeprecated-enum-enum-conversion] 2025-05-07T19:50:31.0034512Z 138 | ASMJIT_INLINE_NODEBUG constexpr bool isVecH8() const noexcept { return _signature.subset(kBaseSignatureMask | kSignatureRegElementTypeMask) == (RegTraits::kSignature | kSignatureElementH); } 2025-05-07T19:50:31.0036455Z | ~~~~~~~~~~~~~~~~~~~^~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 2025-05-07T19:50:31.0037979Z /__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/core/../arm/../arm/a64operand.h: In member function 'constexpr bool asmjit::_abi_1_13::a64::Vec::isVecS4() const': 2025-05-07T19:50:31.0041287Z /__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/core/../arm/../arm/a64operand.h:139:111: warning: bitwise operation between different enumeration types 'asmjit::_abi_1_13::BaseReg::' and 'asmjit::_abi_1_13::arm::BaseVec::AdditionalBits' is deprecated [-Wdeprecated-enum-enum-conversion] 2025-05-07T19:50:31.0045023Z 139 | ASMJIT_INLINE_NODEBUG constexpr bool isVecS4() const noexcept { return _signature.subset(kBaseSignatureMask | kSignatureRegElementTypeMask) == (RegTraits::kSignature | kSignatureElementS); } 2025-05-07T19:50:31.0046938Z | ~~~~~~~~~~~~~~~~~~~^~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 2025-05-07T19:50:31.0048480Z /__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/core/../arm/../arm/a64operand.h: In member function 'constexpr bool asmjit::_abi_1_13::a64::Vec::isVecD2() const': 2025-05-07T19:50:31.0051768Z /__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/core/../arm/../arm/a64operand.h:140:111: warning: bitwise operation between different enumeration types 'asmjit::_abi_1_13::BaseReg::' and 'asmjit::_abi_1_13::arm::BaseVec::AdditionalBits' is deprecated [-Wdeprecated-enum-enum-conversion] 2025-05-07T19:50:31.0055396Z 140 | ASMJIT_INLINE_NODEBUG constexpr bool isVecD2() const noexcept { return _signature.subset(kBaseSignatureMask | kSignatureRegElementTypeMask) == (RegTraits::kSignature | kSignatureElementD); } 2025-05-07T19:50:31.0057360Z | ~~~~~~~~~~~~~~~~~~~^~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 2025-05-07T19:50:31.0058912Z /__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/core/../arm/../arm/a64operand.h: In member function 'constexpr bool asmjit::_abi_1_13::a64::Vec::isVecB4x4() const': 2025-05-07T19:50:31.0062239Z /__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/core/../arm/../arm/a64operand.h:141:113: warning: bitwise operation between different enumeration types 'asmjit::_abi_1_13::BaseReg::' and 'asmjit::_abi_1_13::arm::BaseVec::AdditionalBits' is deprecated [-Wdeprecated-enum-enum-conversion] 2025-05-07T19:50:31.0066136Z 141 | ASMJIT_INLINE_NODEBUG constexpr bool isVecB4x4() const noexcept { return _signature.subset(kBaseSignatureMask | kSignatureRegElementTypeMask) == (RegTraits::kSignature | kSignatureElementB4); } 2025-05-07T19:50:31.0068061Z | ~~~~~~~~~~~~~~~~~~~^~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 2025-05-07T19:50:31.0069585Z /__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/core/../arm/../arm/a64operand.h: In member function 'constexpr bool asmjit::_abi_1_13::a64::Vec::isVecH2x4() const': 2025-05-07T19:50:31.0072970Z /__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/core/../arm/../arm/a64operand.h:142:113: warning: bitwise operation between different enumeration types 'asmjit::_abi_1_13::BaseReg::' and 'asmjit::_abi_1_13::arm::BaseVec::AdditionalBits' is deprecated [-Wdeprecated-enum-enum-conversion] 2025-05-07T19:50:31.0076771Z 142 | ASMJIT_INLINE_NODEBUG constexpr bool isVecH2x4() const noexcept { return _signature.subset(kBaseSignatureMask | kSignatureRegElementTypeMask) == (RegTraits::kSignature | kSignatureElementH2); } 2025-05-07T19:50:31.0078988Z | ~~~~~~~~~~~~~~~~~~~^~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 2025-05-07T19:50:31.0079572Z At global scope: 2025-05-07T19:50:31.0080727Z cc1plus: note: unrecognized command-line option '-Wno-deprecated-anon-enum-enum-conversion' may have been intended to silence earlier diagnostics 2025-05-07T19:50:31.0091300Z [33/608] /github/home/miniconda/envs/build_binary/bin/x86_64-conda-linux-gnu-c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dasmjit_EXPORTS -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/lib -O3 -DNDEBUG -std=c++20 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -MD -MT CMakeFiles/asmjit.dir/__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/core/emithelper.cpp.o -MF CMakeFiles/asmjit.dir/__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/core/emithelper.cpp.o.d -o CMakeFiles/asmjit.dir/__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/core/emithelper.cpp.o -c /__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/core/emithelper.cpp 2025-05-07T19:50:31.0110523Z [34/608] /github/home/miniconda/envs/build_binary/bin/x86_64-conda-linux-gnu-c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dasmjit_EXPORTS -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/lib -O3 -DNDEBUG -std=c++20 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -MD -MT CMakeFiles/asmjit.dir/__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/core/assembler.cpp.o -MF CMakeFiles/asmjit.dir/__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/core/assembler.cpp.o.d -o CMakeFiles/asmjit.dir/__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/core/assembler.cpp.o -c /__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/core/assembler.cpp 2025-05-07T19:50:31.0280974Z [35/608] /github/home/miniconda/envs/build_binary/bin/x86_64-conda-linux-gnu-c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dasmjit_EXPORTS -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/lib -O3 -DNDEBUG -std=c++20 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -MD -MT CMakeFiles/asmjit.dir/__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/x86/x86operand.cpp.o -MF CMakeFiles/asmjit.dir/__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/x86/x86operand.cpp.o.d -o CMakeFiles/asmjit.dir/__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/x86/x86operand.cpp.o -c /__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/x86/x86operand.cpp 2025-05-07T19:50:31.0300592Z [36/608] /github/home/miniconda/envs/build_binary/bin/x86_64-conda-linux-gnu-c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dasmjit_EXPORTS -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/lib -O3 -DNDEBUG -std=c++20 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -MD -MT CMakeFiles/asmjit.dir/__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/arm/armformatter.cpp.o -MF CMakeFiles/asmjit.dir/__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/arm/armformatter.cpp.o.d -o CMakeFiles/asmjit.dir/__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/arm/armformatter.cpp.o -c /__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/arm/armformatter.cpp 2025-05-07T19:50:31.0311518Z In file included from /__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/arm/armformatter.cpp:12: 2025-05-07T19:50:31.0313314Z /__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/arm/../arm/a64operand.h: In member function 'constexpr bool asmjit::_abi_1_13::a64::Vec::isVecB8() const': 2025-05-07T19:50:31.0316470Z /__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/arm/../arm/a64operand.h:132:111: warning: bitwise operation between different enumeration types 'asmjit::_abi_1_13::BaseReg::' and 'asmjit::_abi_1_13::arm::BaseVec::AdditionalBits' is deprecated [-Wdeprecated-enum-enum-conversion] 2025-05-07T19:50:31.0320225Z 132 | ASMJIT_INLINE_NODEBUG constexpr bool isVecB8() const noexcept { return _signature.subset(kBaseSignatureMask | kSignatureRegElementTypeMask) == (RegTraits::kSignature | kSignatureElementB); } 2025-05-07T19:50:31.0322094Z | ~~~~~~~~~~~~~~~~~~~^~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 2025-05-07T19:50:31.0323610Z /__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/arm/../arm/a64operand.h: In member function 'constexpr bool asmjit::_abi_1_13::a64::Vec::isVecH4() const': 2025-05-07T19:50:31.0326912Z /__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/arm/../arm/a64operand.h:133:111: warning: bitwise operation between different enumeration types 'asmjit::_abi_1_13::BaseReg::' and 'asmjit::_abi_1_13::arm::BaseVec::AdditionalBits' is deprecated [-Wdeprecated-enum-enum-conversion] 2025-05-07T19:50:31.0330882Z 133 | ASMJIT_INLINE_NODEBUG constexpr bool isVecH4() const noexcept { return _signature.subset(kBaseSignatureMask | kSignatureRegElementTypeMask) == (RegTraits::kSignature | kSignatureElementH); } 2025-05-07T19:50:31.0332804Z | ~~~~~~~~~~~~~~~~~~~^~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 2025-05-07T19:50:31.0334598Z /__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/arm/../arm/a64operand.h: In member function 'constexpr bool asmjit::_abi_1_13::a64::Vec::isVecS2() const': 2025-05-07T19:50:31.0337866Z /__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/arm/../arm/a64operand.h:134:111: warning: bitwise operation between different enumeration types 'asmjit::_abi_1_13::BaseReg::' and 'asmjit::_abi_1_13::arm::BaseVec::AdditionalBits' is deprecated [-Wdeprecated-enum-enum-conversion] 2025-05-07T19:50:31.0341358Z 134 | ASMJIT_INLINE_NODEBUG constexpr bool isVecS2() const noexcept { return _signature.subset(kBaseSignatureMask | kSignatureRegElementTypeMask) == (RegTraits::kSignature | kSignatureElementS); } 2025-05-07T19:50:31.0343362Z | ~~~~~~~~~~~~~~~~~~~^~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 2025-05-07T19:50:31.0344815Z /__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/arm/../arm/a64operand.h: In member function 'constexpr bool asmjit::_abi_1_13::a64::Vec::isVecD1() const': 2025-05-07T19:50:31.0348220Z /__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/arm/../arm/a64operand.h:135:111: warning: bitwise operation between different enumeration types 'asmjit::_abi_1_13::BaseReg::' and 'asmjit::_abi_1_13::arm::BaseVec::AdditionalBits' is deprecated [-Wdeprecated-enum-enum-conversion] 2025-05-07T19:50:31.0351805Z 135 | ASMJIT_INLINE_NODEBUG constexpr bool isVecD1() const noexcept { return _signature.subset(kBaseSignatureMask | kSignatureRegElementTypeMask) == (RegTraits::kSignature); } 2025-05-07T19:50:31.0353660Z | ~~~~~~~~~~~~~~~~~~~^~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 2025-05-07T19:50:31.0355218Z /__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/arm/../arm/a64operand.h: In member function 'constexpr bool asmjit::_abi_1_13::a64::Vec::isVecB16() const': 2025-05-07T19:50:31.0358511Z /__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/arm/../arm/a64operand.h:137:112: warning: bitwise operation between different enumeration types 'asmjit::_abi_1_13::BaseReg::' and 'asmjit::_abi_1_13::arm::BaseVec::AdditionalBits' is deprecated [-Wdeprecated-enum-enum-conversion] 2025-05-07T19:50:31.0362068Z 137 | ASMJIT_INLINE_NODEBUG constexpr bool isVecB16() const noexcept { return _signature.subset(kBaseSignatureMask | kSignatureRegElementTypeMask) == (RegTraits::kSignature | kSignatureElementB); } 2025-05-07T19:50:31.0363937Z | ~~~~~~~~~~~~~~~~~~~^~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 2025-05-07T19:50:31.0365405Z /__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/arm/../arm/a64operand.h: In member function 'constexpr bool asmjit::_abi_1_13::a64::Vec::isVecH8() const': 2025-05-07T19:50:31.0368566Z /__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/arm/../arm/a64operand.h:138:111: warning: bitwise operation between different enumeration types 'asmjit::_abi_1_13::BaseReg::' and 'asmjit::_abi_1_13::arm::BaseVec::AdditionalBits' is deprecated [-Wdeprecated-enum-enum-conversion] 2025-05-07T19:50:31.0372278Z 138 | ASMJIT_INLINE_NODEBUG constexpr bool isVecH8() const noexcept { return _signature.subset(kBaseSignatureMask | kSignatureRegElementTypeMask) == (RegTraits::kSignature | kSignatureElementH); } 2025-05-07T19:50:31.0374238Z | ~~~~~~~~~~~~~~~~~~~^~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 2025-05-07T19:50:31.0375719Z /__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/arm/../arm/a64operand.h: In member function 'constexpr bool asmjit::_abi_1_13::a64::Vec::isVecS4() const': 2025-05-07T19:50:31.0378858Z /__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/arm/../arm/a64operand.h:139:111: warning: bitwise operation between different enumeration types 'asmjit::_abi_1_13::BaseReg::' and 'asmjit::_abi_1_13::arm::BaseVec::AdditionalBits' is deprecated [-Wdeprecated-enum-enum-conversion] 2025-05-07T19:50:31.0382767Z 139 | ASMJIT_INLINE_NODEBUG constexpr bool isVecS4() const noexcept { return _signature.subset(kBaseSignatureMask | kSignatureRegElementTypeMask) == (RegTraits::kSignature | kSignatureElementS); } 2025-05-07T19:50:31.0384743Z | ~~~~~~~~~~~~~~~~~~~^~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 2025-05-07T19:50:31.0386228Z /__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/arm/../arm/a64operand.h: In member function 'constexpr bool asmjit::_abi_1_13::a64::Vec::isVecD2() const': 2025-05-07T19:50:31.0389410Z /__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/arm/../arm/a64operand.h:140:111: warning: bitwise operation between different enumeration types 'asmjit::_abi_1_13::BaseReg::' and 'asmjit::_abi_1_13::arm::BaseVec::AdditionalBits' is deprecated [-Wdeprecated-enum-enum-conversion] 2025-05-07T19:50:31.0393289Z 140 | ASMJIT_INLINE_NODEBUG constexpr bool isVecD2() const noexcept { return _signature.subset(kBaseSignatureMask | kSignatureRegElementTypeMask) == (RegTraits::kSignature | kSignatureElementD); } 2025-05-07T19:50:31.0395237Z | ~~~~~~~~~~~~~~~~~~~^~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 2025-05-07T19:50:31.0396732Z /__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/arm/../arm/a64operand.h: In member function 'constexpr bool asmjit::_abi_1_13::a64::Vec::isVecB4x4() const': 2025-05-07T19:50:31.0399930Z /__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/arm/../arm/a64operand.h:141:113: warning: bitwise operation between different enumeration types 'asmjit::_abi_1_13::BaseReg::' and 'asmjit::_abi_1_13::arm::BaseVec::AdditionalBits' is deprecated [-Wdeprecated-enum-enum-conversion] 2025-05-07T19:50:31.0403553Z 141 | ASMJIT_INLINE_NODEBUG constexpr bool isVecB4x4() const noexcept { return _signature.subset(kBaseSignatureMask | kSignatureRegElementTypeMask) == (RegTraits::kSignature | kSignatureElementB4); } 2025-05-07T19:50:31.0405468Z | ~~~~~~~~~~~~~~~~~~~^~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 2025-05-07T19:50:31.0406986Z /__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/arm/../arm/a64operand.h: In member function 'constexpr bool asmjit::_abi_1_13::a64::Vec::isVecH2x4() const': 2025-05-07T19:50:31.0410290Z /__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/arm/../arm/a64operand.h:142:113: warning: bitwise operation between different enumeration types 'asmjit::_abi_1_13::BaseReg::' and 'asmjit::_abi_1_13::arm::BaseVec::AdditionalBits' is deprecated [-Wdeprecated-enum-enum-conversion] 2025-05-07T19:50:31.0413819Z 142 | ASMJIT_INLINE_NODEBUG constexpr bool isVecH2x4() const noexcept { return _signature.subset(kBaseSignatureMask | kSignatureRegElementTypeMask) == (RegTraits::kSignature | kSignatureElementH2); } 2025-05-07T19:50:31.0415825Z | ~~~~~~~~~~~~~~~~~~~^~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 2025-05-07T19:50:31.0416440Z At global scope: 2025-05-07T19:50:31.0417655Z cc1plus: note: unrecognized command-line option '-Wno-deprecated-anon-enum-enum-conversion' may have been intended to silence earlier diagnostics 2025-05-07T19:50:31.0428298Z [37/608] /github/home/miniconda/envs/build_binary/bin/x86_64-conda-linux-gnu-c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dasmjit_EXPORTS -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/lib -O3 -DNDEBUG -std=c++20 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -MD -MT CMakeFiles/asmjit.dir/__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/core/formatter.cpp.o -MF CMakeFiles/asmjit.dir/__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/core/formatter.cpp.o.d -o CMakeFiles/asmjit.dir/__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/core/formatter.cpp.o -c /__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/core/formatter.cpp 2025-05-07T19:50:31.0472975Z [38/608] /github/home/miniconda/envs/build_binary/bin/x86_64-conda-linux-gnu-c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dasmjit_EXPORTS -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/lib -O3 -DNDEBUG -std=c++20 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -MD -MT CMakeFiles/asmjit.dir/__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/arm/a64builder.cpp.o -MF CMakeFiles/asmjit.dir/__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/arm/a64builder.cpp.o.d -o CMakeFiles/asmjit.dir/__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/arm/a64builder.cpp.o -c /__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/arm/a64builder.cpp 2025-05-07T19:50:31.0483027Z In file included from /__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/arm/../arm/../arm/a64emitter.h:12, 2025-05-07T19:50:31.0484254Z from /__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/arm/../arm/a64assembler.h:10, 2025-05-07T19:50:31.0485315Z from /__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/arm/a64builder.cpp:9: 2025-05-07T19:50:31.0486997Z /__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/arm/../arm/../arm/../arm/a64operand.h: In member function 'constexpr bool asmjit::_abi_1_13::a64::Vec::isVecB8() const': 2025-05-07T19:50:31.0490060Z /__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/arm/../arm/../arm/../arm/a64operand.h:132:111: warning: bitwise operation between different enumeration types 'asmjit::_abi_1_13::BaseReg::' and 'asmjit::_abi_1_13::arm::BaseVec::AdditionalBits' is deprecated [-Wdeprecated-enum-enum-conversion] 2025-05-07T19:50:31.0493698Z 132 | ASMJIT_INLINE_NODEBUG constexpr bool isVecB8() const noexcept { return _signature.subset(kBaseSignatureMask | kSignatureRegElementTypeMask) == (RegTraits::kSignature | kSignatureElementB); } 2025-05-07T19:50:31.0495560Z | ~~~~~~~~~~~~~~~~~~~^~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 2025-05-07T19:50:31.0497072Z /__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/arm/../arm/../arm/../arm/a64operand.h: In member function 'constexpr bool asmjit::_abi_1_13::a64::Vec::isVecH4() const': 2025-05-07T19:50:31.0500305Z /__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/arm/../arm/../arm/../arm/a64operand.h:133:111: warning: bitwise operation between different enumeration types 'asmjit::_abi_1_13::BaseReg::' and 'asmjit::_abi_1_13::arm::BaseVec::AdditionalBits' is deprecated [-Wdeprecated-enum-enum-conversion] 2025-05-07T19:50:31.0503748Z 133 | ASMJIT_INLINE_NODEBUG constexpr bool isVecH4() const noexcept { return _signature.subset(kBaseSignatureMask | kSignatureRegElementTypeMask) == (RegTraits::kSignature | kSignatureElementH); } 2025-05-07T19:50:31.0505487Z | ~~~~~~~~~~~~~~~~~~~^~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 2025-05-07T19:50:31.0507166Z /__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/arm/../arm/../arm/../arm/a64operand.h: In member function 'constexpr bool asmjit::_abi_1_13::a64::Vec::isVecS2() const': 2025-05-07T19:50:31.0509996Z /__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/arm/../arm/../arm/../arm/a64operand.h:134:111: warning: bitwise operation between different enumeration types 'asmjit::_abi_1_13::BaseReg::' and 'asmjit::_abi_1_13::arm::BaseVec::AdditionalBits' is deprecated [-Wdeprecated-enum-enum-conversion] 2025-05-07T19:50:31.0513527Z 134 | ASMJIT_INLINE_NODEBUG constexpr bool isVecS2() const noexcept { return _signature.subset(kBaseSignatureMask | kSignatureRegElementTypeMask) == (RegTraits::kSignature | kSignatureElementS); } 2025-05-07T19:50:31.0515416Z | ~~~~~~~~~~~~~~~~~~~^~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 2025-05-07T19:50:31.0516910Z /__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/arm/../arm/../arm/../arm/a64operand.h: In member function 'constexpr bool asmjit::_abi_1_13::a64::Vec::isVecD1() const': 2025-05-07T19:50:31.0520394Z /__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/arm/../arm/../arm/../arm/a64operand.h:135:111: warning: bitwise operation between different enumeration types 'asmjit::_abi_1_13::BaseReg::' and 'asmjit::_abi_1_13::arm::BaseVec::AdditionalBits' is deprecated [-Wdeprecated-enum-enum-conversion] 2025-05-07T19:50:31.0523918Z 135 | ASMJIT_INLINE_NODEBUG constexpr bool isVecD1() const noexcept { return _signature.subset(kBaseSignatureMask | kSignatureRegElementTypeMask) == (RegTraits::kSignature); } 2025-05-07T19:50:31.0525648Z | ~~~~~~~~~~~~~~~~~~~^~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 2025-05-07T19:50:31.0527223Z /__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/arm/../arm/../arm/../arm/a64operand.h: In member function 'constexpr bool asmjit::_abi_1_13::a64::Vec::isVecB16() const': 2025-05-07T19:50:31.0530853Z /__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/arm/../arm/../arm/../arm/a64operand.h:137:112: warning: bitwise operation between different enumeration types 'asmjit::_abi_1_13::BaseReg::' and 'asmjit::_abi_1_13::arm::BaseVec::AdditionalBits' is deprecated [-Wdeprecated-enum-enum-conversion] 2025-05-07T19:50:31.0534543Z 137 | ASMJIT_INLINE_NODEBUG constexpr bool isVecB16() const noexcept { return _signature.subset(kBaseSignatureMask | kSignatureRegElementTypeMask) == (RegTraits::kSignature | kSignatureElementB); } 2025-05-07T19:50:31.0536411Z | ~~~~~~~~~~~~~~~~~~~^~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 2025-05-07T19:50:31.0537906Z /__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/arm/../arm/../arm/../arm/a64operand.h: In member function 'constexpr bool asmjit::_abi_1_13::a64::Vec::isVecH8() const': 2025-05-07T19:50:31.0541201Z /__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/arm/../arm/../arm/../arm/a64operand.h:138:111: warning: bitwise operation between different enumeration types 'asmjit::_abi_1_13::BaseReg::' and 'asmjit::_abi_1_13::arm::BaseVec::AdditionalBits' is deprecated [-Wdeprecated-enum-enum-conversion] 2025-05-07T19:50:31.0544825Z 138 | ASMJIT_INLINE_NODEBUG constexpr bool isVecH8() const noexcept { return _signature.subset(kBaseSignatureMask | kSignatureRegElementTypeMask) == (RegTraits::kSignature | kSignatureElementH); } 2025-05-07T19:50:31.0546717Z | ~~~~~~~~~~~~~~~~~~~^~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 2025-05-07T19:50:31.0548284Z /__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/arm/../arm/../arm/../arm/a64operand.h: In member function 'constexpr bool asmjit::_abi_1_13::a64::Vec::isVecS4() const': 2025-05-07T19:50:31.0551835Z /__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/arm/../arm/../arm/../arm/a64operand.h:139:111: warning: bitwise operation between different enumeration types 'asmjit::_abi_1_13::BaseReg::' and 'asmjit::_abi_1_13::arm::BaseVec::AdditionalBits' is deprecated [-Wdeprecated-enum-enum-conversion] 2025-05-07T19:50:31.0555553Z 139 | ASMJIT_INLINE_NODEBUG constexpr bool isVecS4() const noexcept { return _signature.subset(kBaseSignatureMask | kSignatureRegElementTypeMask) == (RegTraits::kSignature | kSignatureElementS); } 2025-05-07T19:50:31.0557459Z | ~~~~~~~~~~~~~~~~~~~^~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 2025-05-07T19:50:31.0558752Z /__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/arm/../arm/../arm/../arm/a64operand.h: In member function 'constexpr bool asmjit::_abi_1_13::a64::Vec::isVecD2() const': 2025-05-07T19:50:31.0561521Z /__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/arm/../arm/../arm/../arm/a64operand.h:140:111: warning: bitwise operation between different enumeration types 'asmjit::_abi_1_13::BaseReg::' and 'asmjit::_abi_1_13::arm::BaseVec::AdditionalBits' is deprecated [-Wdeprecated-enum-enum-conversion] 2025-05-07T19:50:31.0564949Z 140 | ASMJIT_INLINE_NODEBUG constexpr bool isVecD2() const noexcept { return _signature.subset(kBaseSignatureMask | kSignatureRegElementTypeMask) == (RegTraits::kSignature | kSignatureElementD); } 2025-05-07T19:50:31.0566645Z | ~~~~~~~~~~~~~~~~~~~^~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 2025-05-07T19:50:31.0568111Z /__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/arm/../arm/../arm/../arm/a64operand.h: In member function 'constexpr bool asmjit::_abi_1_13::a64::Vec::isVecB4x4() const': 2025-05-07T19:50:31.0571486Z /__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/arm/../arm/../arm/../arm/a64operand.h:141:113: warning: bitwise operation between different enumeration types 'asmjit::_abi_1_13::BaseReg::' and 'asmjit::_abi_1_13::arm::BaseVec::AdditionalBits' is deprecated [-Wdeprecated-enum-enum-conversion] 2025-05-07T19:50:31.0575257Z 141 | ASMJIT_INLINE_NODEBUG constexpr bool isVecB4x4() const noexcept { return _signature.subset(kBaseSignatureMask | kSignatureRegElementTypeMask) == (RegTraits::kSignature | kSignatureElementB4); } 2025-05-07T19:50:31.0577252Z | ~~~~~~~~~~~~~~~~~~~^~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 2025-05-07T19:50:31.0578875Z /__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/arm/../arm/../arm/../arm/a64operand.h: In member function 'constexpr bool asmjit::_abi_1_13::a64::Vec::isVecH2x4() const': 2025-05-07T19:50:31.0582290Z /__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/arm/../arm/../arm/../arm/a64operand.h:142:113: warning: bitwise operation between different enumeration types 'asmjit::_abi_1_13::BaseReg::' and 'asmjit::_abi_1_13::arm::BaseVec::AdditionalBits' is deprecated [-Wdeprecated-enum-enum-conversion] 2025-05-07T19:50:31.0586138Z 142 | ASMJIT_INLINE_NODEBUG constexpr bool isVecH2x4() const noexcept { return _signature.subset(kBaseSignatureMask | kSignatureRegElementTypeMask) == (RegTraits::kSignature | kSignatureElementH2); } 2025-05-07T19:50:31.0587903Z | ~~~~~~~~~~~~~~~~~~~^~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 2025-05-07T19:50:31.0588448Z At global scope: 2025-05-07T19:50:31.0589568Z cc1plus: note: unrecognized command-line option '-Wno-deprecated-anon-enum-enum-conversion' may have been intended to silence earlier diagnostics 2025-05-07T19:50:31.1026351Z [39/608] /github/home/miniconda/envs/build_binary/bin/x86_64-conda-linux-gnu-c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dasmjit_EXPORTS -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/lib -O3 -DNDEBUG -std=c++20 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -MD -MT CMakeFiles/asmjit.dir/__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/arm/a64compiler.cpp.o -MF CMakeFiles/asmjit.dir/__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/arm/a64compiler.cpp.o.d -o CMakeFiles/asmjit.dir/__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/arm/a64compiler.cpp.o -c /__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/arm/a64compiler.cpp 2025-05-07T19:50:31.1037152Z In file included from /__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/arm/../arm/../arm/a64emitter.h:12, 2025-05-07T19:50:31.1038455Z from /__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/arm/../arm/a64assembler.h:10, 2025-05-07T19:50:31.1039900Z from /__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/arm/a64compiler.cpp:9: 2025-05-07T19:50:31.1041727Z /__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/arm/../arm/../arm/../arm/a64operand.h: In member function 'constexpr bool asmjit::_abi_1_13::a64::Vec::isVecB8() const': 2025-05-07T19:50:31.1045111Z /__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/arm/../arm/../arm/../arm/a64operand.h:132:111: warning: bitwise operation between different enumeration types 'asmjit::_abi_1_13::BaseReg::' and 'asmjit::_abi_1_13::arm::BaseVec::AdditionalBits' is deprecated [-Wdeprecated-enum-enum-conversion] 2025-05-07T19:50:31.1048600Z 132 | ASMJIT_INLINE_NODEBUG constexpr bool isVecB8() const noexcept { return _signature.subset(kBaseSignatureMask | kSignatureRegElementTypeMask) == (RegTraits::kSignature | kSignatureElementB); } 2025-05-07T19:50:31.1050581Z | ~~~~~~~~~~~~~~~~~~~^~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 2025-05-07T19:50:31.1052160Z /__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/arm/../arm/../arm/../arm/a64operand.h: In member function 'constexpr bool asmjit::_abi_1_13::a64::Vec::isVecH4() const': 2025-05-07T19:50:31.1055566Z /__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/arm/../arm/../arm/../arm/a64operand.h:133:111: warning: bitwise operation between different enumeration types 'asmjit::_abi_1_13::BaseReg::' and 'asmjit::_abi_1_13::arm::BaseVec::AdditionalBits' is deprecated [-Wdeprecated-enum-enum-conversion] 2025-05-07T19:50:31.1059404Z 133 | ASMJIT_INLINE_NODEBUG constexpr bool isVecH4() const noexcept { return _signature.subset(kBaseSignatureMask | kSignatureRegElementTypeMask) == (RegTraits::kSignature | kSignatureElementH); } 2025-05-07T19:50:31.1061368Z | ~~~~~~~~~~~~~~~~~~~^~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 2025-05-07T19:50:31.1063036Z /__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/arm/../arm/../arm/../arm/a64operand.h: In member function 'constexpr bool asmjit::_abi_1_13::a64::Vec::isVecS2() const': 2025-05-07T19:50:31.1066461Z /__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/arm/../arm/../arm/../arm/a64operand.h:134:111: warning: bitwise operation between different enumeration types 'asmjit::_abi_1_13::BaseReg::' and 'asmjit::_abi_1_13::arm::BaseVec::AdditionalBits' is deprecated [-Wdeprecated-enum-enum-conversion] 2025-05-07T19:50:31.1070401Z 134 | ASMJIT_INLINE_NODEBUG constexpr bool isVecS2() const noexcept { return _signature.subset(kBaseSignatureMask | kSignatureRegElementTypeMask) == (RegTraits::kSignature | kSignatureElementS); } 2025-05-07T19:50:31.1072431Z | ~~~~~~~~~~~~~~~~~~~^~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 2025-05-07T19:50:31.1074343Z /__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/arm/../arm/../arm/../arm/a64operand.h: In member function 'constexpr bool asmjit::_abi_1_13::a64::Vec::isVecD1() const': 2025-05-07T19:50:31.1077712Z /__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/arm/../arm/../arm/../arm/a64operand.h:135:111: warning: bitwise operation between different enumeration types 'asmjit::_abi_1_13::BaseReg::' and 'asmjit::_abi_1_13::arm::BaseVec::AdditionalBits' is deprecated [-Wdeprecated-enum-enum-conversion] 2025-05-07T19:50:31.1081351Z 135 | ASMJIT_INLINE_NODEBUG constexpr bool isVecD1() const noexcept { return _signature.subset(kBaseSignatureMask | kSignatureRegElementTypeMask) == (RegTraits::kSignature); } 2025-05-07T19:50:31.1083141Z | ~~~~~~~~~~~~~~~~~~~^~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 2025-05-07T19:50:31.1084704Z /__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/arm/../arm/../arm/../arm/a64operand.h: In member function 'constexpr bool asmjit::_abi_1_13::a64::Vec::isVecB16() const': 2025-05-07T19:50:31.1088332Z /__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/arm/../arm/../arm/../arm/a64operand.h:137:112: warning: bitwise operation between different enumeration types 'asmjit::_abi_1_13::BaseReg::' and 'asmjit::_abi_1_13::arm::BaseVec::AdditionalBits' is deprecated [-Wdeprecated-enum-enum-conversion] 2025-05-07T19:50:31.1092118Z 137 | ASMJIT_INLINE_NODEBUG constexpr bool isVecB16() const noexcept { return _signature.subset(kBaseSignatureMask | kSignatureRegElementTypeMask) == (RegTraits::kSignature | kSignatureElementB); } 2025-05-07T19:50:31.1094103Z | ~~~~~~~~~~~~~~~~~~~^~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 2025-05-07T19:50:31.1095752Z /__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/arm/../arm/../arm/../arm/a64operand.h: In member function 'constexpr bool asmjit::_abi_1_13::a64::Vec::isVecH8() const': 2025-05-07T19:50:31.1099114Z /__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/arm/../arm/../arm/../arm/a64operand.h:138:111: warning: bitwise operation between different enumeration types 'asmjit::_abi_1_13::BaseReg::' and 'asmjit::_abi_1_13::arm::BaseVec::AdditionalBits' is deprecated [-Wdeprecated-enum-enum-conversion] 2025-05-07T19:50:31.1102922Z 138 | ASMJIT_INLINE_NODEBUG constexpr bool isVecH8() const noexcept { return _signature.subset(kBaseSignatureMask | kSignatureRegElementTypeMask) == (RegTraits::kSignature | kSignatureElementH); } 2025-05-07T19:50:31.1104816Z | ~~~~~~~~~~~~~~~~~~~^~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 2025-05-07T19:50:31.1106342Z /__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/arm/../arm/../arm/../arm/a64operand.h: In member function 'constexpr bool asmjit::_abi_1_13::a64::Vec::isVecS4() const': 2025-05-07T19:50:31.1109863Z /__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/arm/../arm/../arm/../arm/a64operand.h:139:111: warning: bitwise operation between different enumeration types 'asmjit::_abi_1_13::BaseReg::' and 'asmjit::_abi_1_13::arm::BaseVec::AdditionalBits' is deprecated [-Wdeprecated-enum-enum-conversion] 2025-05-07T19:50:31.1113891Z 139 | ASMJIT_INLINE_NODEBUG constexpr bool isVecS4() const noexcept { return _signature.subset(kBaseSignatureMask | kSignatureRegElementTypeMask) == (RegTraits::kSignature | kSignatureElementS); } 2025-05-07T19:50:31.1115876Z | ~~~~~~~~~~~~~~~~~~~^~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 2025-05-07T19:50:31.1117354Z /__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/arm/../arm/../arm/../arm/a64operand.h: In member function 'constexpr bool asmjit::_abi_1_13::a64::Vec::isVecD2() const': 2025-05-07T19:50:31.1120578Z /__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/arm/../arm/../arm/../arm/a64operand.h:140:111: warning: bitwise operation between different enumeration types 'asmjit::_abi_1_13::BaseReg::' and 'asmjit::_abi_1_13::arm::BaseVec::AdditionalBits' is deprecated [-Wdeprecated-enum-enum-conversion] 2025-05-07T19:50:31.1124203Z 140 | ASMJIT_INLINE_NODEBUG constexpr bool isVecD2() const noexcept { return _signature.subset(kBaseSignatureMask | kSignatureRegElementTypeMask) == (RegTraits::kSignature | kSignatureElementD); } 2025-05-07T19:50:31.1126064Z | ~~~~~~~~~~~~~~~~~~~^~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 2025-05-07T19:50:31.1127549Z /__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/arm/../arm/../arm/../arm/a64operand.h: In member function 'constexpr bool asmjit::_abi_1_13::a64::Vec::isVecB4x4() const': 2025-05-07T19:50:31.1131021Z /__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/arm/../arm/../arm/../arm/a64operand.h:141:113: warning: bitwise operation between different enumeration types 'asmjit::_abi_1_13::BaseReg::' and 'asmjit::_abi_1_13::arm::BaseVec::AdditionalBits' is deprecated [-Wdeprecated-enum-enum-conversion] 2025-05-07T19:50:31.1134811Z 141 | ASMJIT_INLINE_NODEBUG constexpr bool isVecB4x4() const noexcept { return _signature.subset(kBaseSignatureMask | kSignatureRegElementTypeMask) == (RegTraits::kSignature | kSignatureElementB4); } 2025-05-07T19:50:31.1136721Z | ~~~~~~~~~~~~~~~~~~~^~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 2025-05-07T19:50:31.1138277Z /__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/arm/../arm/../arm/../arm/a64operand.h: In member function 'constexpr bool asmjit::_abi_1_13::a64::Vec::isVecH2x4() const': 2025-05-07T19:50:31.1141569Z /__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/arm/../arm/../arm/../arm/a64operand.h:142:113: warning: bitwise operation between different enumeration types 'asmjit::_abi_1_13::BaseReg::' and 'asmjit::_abi_1_13::arm::BaseVec::AdditionalBits' is deprecated [-Wdeprecated-enum-enum-conversion] 2025-05-07T19:50:31.1145343Z 142 | ASMJIT_INLINE_NODEBUG constexpr bool isVecH2x4() const noexcept { return _signature.subset(kBaseSignatureMask | kSignatureRegElementTypeMask) == (RegTraits::kSignature | kSignatureElementH2); } 2025-05-07T19:50:31.1147409Z | ~~~~~~~~~~~~~~~~~~~^~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 2025-05-07T19:50:31.1148029Z At global scope: 2025-05-07T19:50:31.1149271Z cc1plus: note: unrecognized command-line option '-Wno-deprecated-anon-enum-enum-conversion' may have been intended to silence earlier diagnostics 2025-05-07T19:50:31.1160332Z [40/608] /github/home/miniconda/envs/build_binary/bin/x86_64-conda-linux-gnu-c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dasmjit_EXPORTS -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/lib -O3 -DNDEBUG -std=c++20 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -MD -MT CMakeFiles/asmjit.dir/__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/core/jitallocator.cpp.o -MF CMakeFiles/asmjit.dir/__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/core/jitallocator.cpp.o.d -o CMakeFiles/asmjit.dir/__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/core/jitallocator.cpp.o -c /__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/core/jitallocator.cpp 2025-05-07T19:50:31.1620617Z [41/608] /github/home/miniconda/envs/build_binary/bin/x86_64-conda-linux-gnu-c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dasmjit_EXPORTS -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/lib -O3 -DNDEBUG -std=c++20 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -MD -MT CMakeFiles/asmjit.dir/__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/arm/a64emithelper.cpp.o -MF CMakeFiles/asmjit.dir/__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/arm/a64emithelper.cpp.o.d -o CMakeFiles/asmjit.dir/__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/arm/a64emithelper.cpp.o -c /__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/arm/a64emithelper.cpp 2025-05-07T19:50:31.1632072Z In file included from /__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/arm/../arm/../arm/a64emitter.h:12, 2025-05-07T19:50:31.1633241Z from /__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/arm/../arm/a64emithelper_p.h:13, 2025-05-07T19:50:31.1634335Z from /__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/arm/a64emithelper.cpp:14: 2025-05-07T19:50:31.1636129Z /__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/arm/../arm/../arm/../arm/a64operand.h: In member function 'constexpr bool asmjit::_abi_1_13::a64::Vec::isVecB8() const': 2025-05-07T19:50:31.1639468Z /__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/arm/../arm/../arm/../arm/a64operand.h:132:111: warning: bitwise operation between different enumeration types 'asmjit::_abi_1_13::BaseReg::' and 'asmjit::_abi_1_13::arm::BaseVec::AdditionalBits' is deprecated [-Wdeprecated-enum-enum-conversion] 2025-05-07T19:50:31.1643287Z 132 | ASMJIT_INLINE_NODEBUG constexpr bool isVecB8() const noexcept { return _signature.subset(kBaseSignatureMask | kSignatureRegElementTypeMask) == (RegTraits::kSignature | kSignatureElementB); } 2025-05-07T19:50:31.1645346Z | ~~~~~~~~~~~~~~~~~~~^~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 2025-05-07T19:50:31.1647016Z /__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/arm/../arm/../arm/../arm/a64operand.h: In member function 'constexpr bool asmjit::_abi_1_13::a64::Vec::isVecH4() const': 2025-05-07T19:50:31.1650569Z /__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/arm/../arm/../arm/../arm/a64operand.h:133:111: warning: bitwise operation between different enumeration types 'asmjit::_abi_1_13::BaseReg::' and 'asmjit::_abi_1_13::arm::BaseVec::AdditionalBits' is deprecated [-Wdeprecated-enum-enum-conversion] 2025-05-07T19:50:31.1654427Z 133 | ASMJIT_INLINE_NODEBUG constexpr bool isVecH4() const noexcept { return _signature.subset(kBaseSignatureMask | kSignatureRegElementTypeMask) == (RegTraits::kSignature | kSignatureElementH); } 2025-05-07T19:50:31.1656458Z | ~~~~~~~~~~~~~~~~~~~^~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 2025-05-07T19:50:31.1658103Z /__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/arm/../arm/../arm/../arm/a64operand.h: In member function 'constexpr bool asmjit::_abi_1_13::a64::Vec::isVecS2() const': 2025-05-07T19:50:31.1661603Z /__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/arm/../arm/../arm/../arm/a64operand.h:134:111: warning: bitwise operation between different enumeration types 'asmjit::_abi_1_13::BaseReg::' and 'asmjit::_abi_1_13::arm::BaseVec::AdditionalBits' is deprecated [-Wdeprecated-enum-enum-conversion] 2025-05-07T19:50:31.1665505Z 134 | ASMJIT_INLINE_NODEBUG constexpr bool isVecS2() const noexcept { return _signature.subset(kBaseSignatureMask | kSignatureRegElementTypeMask) == (RegTraits::kSignature | kSignatureElementS); } 2025-05-07T19:50:31.1667834Z | ~~~~~~~~~~~~~~~~~~~^~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 2025-05-07T19:50:31.1669479Z /__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/arm/../arm/../arm/../arm/a64operand.h: In member function 'constexpr bool asmjit::_abi_1_13::a64::Vec::isVecD1() const': 2025-05-07T19:50:31.1672977Z /__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/arm/../arm/../arm/../arm/a64operand.h:135:111: warning: bitwise operation between different enumeration types 'asmjit::_abi_1_13::BaseReg::' and 'asmjit::_abi_1_13::arm::BaseVec::AdditionalBits' is deprecated [-Wdeprecated-enum-enum-conversion] 2025-05-07T19:50:31.1676745Z 135 | ASMJIT_INLINE_NODEBUG constexpr bool isVecD1() const noexcept { return _signature.subset(kBaseSignatureMask | kSignatureRegElementTypeMask) == (RegTraits::kSignature); } 2025-05-07T19:50:31.1678860Z | ~~~~~~~~~~~~~~~~~~~^~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 2025-05-07T19:50:31.1680490Z /__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/arm/../arm/../arm/../arm/a64operand.h: In member function 'constexpr bool asmjit::_abi_1_13::a64::Vec::isVecB16() const': 2025-05-07T19:50:31.1684013Z /__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/arm/../arm/../arm/../arm/a64operand.h:137:112: warning: bitwise operation between different enumeration types 'asmjit::_abi_1_13::BaseReg::' and 'asmjit::_abi_1_13::arm::BaseVec::AdditionalBits' is deprecated [-Wdeprecated-enum-enum-conversion] 2025-05-07T19:50:31.1687820Z 137 | ASMJIT_INLINE_NODEBUG constexpr bool isVecB16() const noexcept { return _signature.subset(kBaseSignatureMask | kSignatureRegElementTypeMask) == (RegTraits::kSignature | kSignatureElementB); } 2025-05-07T19:50:31.1689847Z | ~~~~~~~~~~~~~~~~~~~^~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 2025-05-07T19:50:31.1691554Z /__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/arm/../arm/../arm/../arm/a64operand.h: In member function 'constexpr bool asmjit::_abi_1_13::a64::Vec::isVecH8() const': 2025-05-07T19:50:31.1695028Z /__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/arm/../arm/../arm/../arm/a64operand.h:138:111: warning: bitwise operation between different enumeration types 'asmjit::_abi_1_13::BaseReg::' and 'asmjit::_abi_1_13::arm::BaseVec::AdditionalBits' is deprecated [-Wdeprecated-enum-enum-conversion] 2025-05-07T19:50:31.1698661Z 138 | ASMJIT_INLINE_NODEBUG constexpr bool isVecH8() const noexcept { return _signature.subset(kBaseSignatureMask | kSignatureRegElementTypeMask) == (RegTraits::kSignature | kSignatureElementH); } 2025-05-07T19:50:31.1700377Z | ~~~~~~~~~~~~~~~~~~~^~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 2025-05-07T19:50:31.1701930Z /__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/arm/../arm/../arm/../arm/a64operand.h: In member function 'constexpr bool asmjit::_abi_1_13::a64::Vec::isVecS4() const': 2025-05-07T19:50:31.1705356Z /__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/arm/../arm/../arm/../arm/a64operand.h:139:111: warning: bitwise operation between different enumeration types 'asmjit::_abi_1_13::BaseReg::' and 'asmjit::_abi_1_13::arm::BaseVec::AdditionalBits' is deprecated [-Wdeprecated-enum-enum-conversion] 2025-05-07T19:50:31.1709127Z 139 | ASMJIT_INLINE_NODEBUG constexpr bool isVecS4() const noexcept { return _signature.subset(kBaseSignatureMask | kSignatureRegElementTypeMask) == (RegTraits::kSignature | kSignatureElementS); } 2025-05-07T19:50:31.1711126Z | ~~~~~~~~~~~~~~~~~~~^~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 2025-05-07T19:50:31.1712943Z /__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/arm/../arm/../arm/../arm/a64operand.h: In member function 'constexpr bool asmjit::_abi_1_13::a64::Vec::isVecD2() const': 2025-05-07T19:50:31.1716474Z /__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/arm/../arm/../arm/../arm/a64operand.h:140:111: warning: bitwise operation between different enumeration types 'asmjit::_abi_1_13::BaseReg::' and 'asmjit::_abi_1_13::arm::BaseVec::AdditionalBits' is deprecated [-Wdeprecated-enum-enum-conversion] 2025-05-07T19:50:31.1720470Z 140 | ASMJIT_INLINE_NODEBUG constexpr bool isVecD2() const noexcept { return _signature.subset(kBaseSignatureMask | kSignatureRegElementTypeMask) == (RegTraits::kSignature | kSignatureElementD); } 2025-05-07T19:50:31.1722520Z | ~~~~~~~~~~~~~~~~~~~^~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 2025-05-07T19:50:31.1724185Z /__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/arm/../arm/../arm/../arm/a64operand.h: In member function 'constexpr bool asmjit::_abi_1_13::a64::Vec::isVecB4x4() const': 2025-05-07T19:50:31.1727889Z /__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/arm/../arm/../arm/../arm/a64operand.h:141:113: warning: bitwise operation between different enumeration types 'asmjit::_abi_1_13::BaseReg::' and 'asmjit::_abi_1_13::arm::BaseVec::AdditionalBits' is deprecated [-Wdeprecated-enum-enum-conversion] 2025-05-07T19:50:31.1732212Z 141 | ASMJIT_INLINE_NODEBUG constexpr bool isVecB4x4() const noexcept { return _signature.subset(kBaseSignatureMask | kSignatureRegElementTypeMask) == (RegTraits::kSignature | kSignatureElementB4); } 2025-05-07T19:50:31.1734300Z | ~~~~~~~~~~~~~~~~~~~^~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 2025-05-07T19:50:31.1735992Z /__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/arm/../arm/../arm/../arm/a64operand.h: In member function 'constexpr bool asmjit::_abi_1_13::a64::Vec::isVecH2x4() const': 2025-05-07T19:50:31.1739546Z /__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/arm/../arm/../arm/../arm/a64operand.h:142:113: warning: bitwise operation between different enumeration types 'asmjit::_abi_1_13::BaseReg::' and 'asmjit::_abi_1_13::arm::BaseVec::AdditionalBits' is deprecated [-Wdeprecated-enum-enum-conversion] 2025-05-07T19:50:31.1743357Z 142 | ASMJIT_INLINE_NODEBUG constexpr bool isVecH2x4() const noexcept { return _signature.subset(kBaseSignatureMask | kSignatureRegElementTypeMask) == (RegTraits::kSignature | kSignatureElementH2); } 2025-05-07T19:50:31.1745424Z | ~~~~~~~~~~~~~~~~~~~^~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 2025-05-07T19:50:31.1746049Z At global scope: 2025-05-07T19:50:31.1747257Z cc1plus: note: unrecognized command-line option '-Wno-deprecated-anon-enum-enum-conversion' may have been intended to silence earlier diagnostics 2025-05-07T19:50:31.1758211Z [42/608] /github/home/miniconda/envs/build_binary/bin/x86_64-conda-linux-gnu-c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dasmjit_EXPORTS -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/lib -O3 -DNDEBUG -std=c++20 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -MD -MT CMakeFiles/asmjit.dir/__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/core/compiler.cpp.o -MF CMakeFiles/asmjit.dir/__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/core/compiler.cpp.o.d -o CMakeFiles/asmjit.dir/__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/core/compiler.cpp.o -c /__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/core/compiler.cpp 2025-05-07T19:50:31.1942324Z [43/608] /github/home/miniconda/envs/build_binary/bin/x86_64-conda-linux-gnu-c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dasmjit_EXPORTS -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/lib -O3 -DNDEBUG -std=c++20 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -MD -MT CMakeFiles/asmjit.dir/__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/core/cpuinfo.cpp.o -MF CMakeFiles/asmjit.dir/__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/core/cpuinfo.cpp.o.d -o CMakeFiles/asmjit.dir/__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/core/cpuinfo.cpp.o -c /__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/core/cpuinfo.cpp 2025-05-07T19:50:31.1963819Z [44/608] /github/home/miniconda/envs/build_binary/bin/x86_64-conda-linux-gnu-c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dasmjit_EXPORTS -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/lib -O3 -DNDEBUG -std=c++20 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -MD -MT CMakeFiles/asmjit.dir/__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/core/builder.cpp.o -MF CMakeFiles/asmjit.dir/__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/core/builder.cpp.o.d -o CMakeFiles/asmjit.dir/__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/core/builder.cpp.o -c /__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/core/builder.cpp 2025-05-07T19:50:31.2084092Z [45/608] /github/home/miniconda/envs/build_binary/bin/x86_64-conda-linux-gnu-c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dasmjit_EXPORTS -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/lib -O3 -DNDEBUG -std=c++20 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -MD -MT CMakeFiles/asmjit.dir/__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/core/virtmem.cpp.o -MF CMakeFiles/asmjit.dir/__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/core/virtmem.cpp.o.d -o CMakeFiles/asmjit.dir/__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/core/virtmem.cpp.o -c /__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/core/virtmem.cpp 2025-05-07T19:50:31.2682500Z [46/608] /github/home/miniconda/envs/build_binary/bin/x86_64-conda-linux-gnu-c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dasmjit_EXPORTS -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/lib -O3 -DNDEBUG -std=c++20 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -MD -MT CMakeFiles/asmjit.dir/__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/x86/x86formatter.cpp.o -MF CMakeFiles/asmjit.dir/__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/x86/x86formatter.cpp.o.d -o CMakeFiles/asmjit.dir/__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/x86/x86formatter.cpp.o -c /__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/x86/x86formatter.cpp 2025-05-07T19:50:31.3702110Z [47/608] /github/home/miniconda/envs/build_binary/bin/x86_64-conda-linux-gnu-c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dasmjit_EXPORTS -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/lib -O3 -DNDEBUG -std=c++20 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -MD -MT CMakeFiles/asmjit.dir/__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/x86/x86instdb.cpp.o -MF CMakeFiles/asmjit.dir/__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/x86/x86instdb.cpp.o.d -o CMakeFiles/asmjit.dir/__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/x86/x86instdb.cpp.o -c /__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/x86/x86instdb.cpp 2025-05-07T19:50:31.4072163Z [48/608] /github/home/miniconda/envs/build_binary/bin/x86_64-conda-linux-gnu-c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dasmjit_EXPORTS -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/lib -O3 -DNDEBUG -std=c++20 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -MD -MT CMakeFiles/asmjit.dir/__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/core/ralocal.cpp.o -MF CMakeFiles/asmjit.dir/__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/core/ralocal.cpp.o.d -o CMakeFiles/asmjit.dir/__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/core/ralocal.cpp.o -c /__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/core/ralocal.cpp 2025-05-07T19:50:31.5434989Z [49/608] /github/home/miniconda/envs/build_binary/bin/x86_64-conda-linux-gnu-c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dasmjit_EXPORTS -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/lib -O3 -DNDEBUG -std=c++20 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -MD -MT CMakeFiles/asmjit.dir/__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/x86/x86instapi.cpp.o -MF CMakeFiles/asmjit.dir/__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/x86/x86instapi.cpp.o.d -o CMakeFiles/asmjit.dir/__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/x86/x86instapi.cpp.o -c /__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/x86/x86instapi.cpp 2025-05-07T19:50:31.6334166Z [50/608] /github/home/miniconda/envs/build_binary/bin/x86_64-conda-linux-gnu-c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dasmjit_EXPORTS -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/lib -O3 -DNDEBUG -std=c++20 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -MD -MT CMakeFiles/asmjit.dir/__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/core/codeholder.cpp.o -MF CMakeFiles/asmjit.dir/__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/core/codeholder.cpp.o.d -o CMakeFiles/asmjit.dir/__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/core/codeholder.cpp.o -c /__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/core/codeholder.cpp 2025-05-07T19:50:31.6355504Z [51/608] /github/home/miniconda/envs/build_binary/bin/x86_64-conda-linux-gnu-c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dasmjit_EXPORTS -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/lib -O3 -DNDEBUG -std=c++20 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -MD -MT CMakeFiles/asmjit.dir/__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/arm/a64rapass.cpp.o -MF CMakeFiles/asmjit.dir/__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/arm/a64rapass.cpp.o.d -o CMakeFiles/asmjit.dir/__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/arm/a64rapass.cpp.o -c /__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/arm/a64rapass.cpp 2025-05-07T19:50:31.6365685Z In file included from /__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/arm/../arm/../arm/a64emitter.h:12, 2025-05-07T19:50:31.6366968Z from /__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/arm/../arm/a64assembler.h:10, 2025-05-07T19:50:31.6368028Z from /__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/arm/a64rapass.cpp:12: 2025-05-07T19:50:31.6369762Z /__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/arm/../arm/../arm/../arm/a64operand.h: In member function 'constexpr bool asmjit::_abi_1_13::a64::Vec::isVecB8() const': 2025-05-07T19:50:31.6373111Z /__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/arm/../arm/../arm/../arm/a64operand.h:132:111: warning: bitwise operation between different enumeration types 'asmjit::_abi_1_13::BaseReg::' and 'asmjit::_abi_1_13::arm::BaseVec::AdditionalBits' is deprecated [-Wdeprecated-enum-enum-conversion] 2025-05-07T19:50:31.6376601Z 132 | ASMJIT_INLINE_NODEBUG constexpr bool isVecB8() const noexcept { return _signature.subset(kBaseSignatureMask | kSignatureRegElementTypeMask) == (RegTraits::kSignature | kSignatureElementB); } 2025-05-07T19:50:31.6378510Z | ~~~~~~~~~~~~~~~~~~~^~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 2025-05-07T19:50:31.6380053Z /__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/arm/../arm/../arm/../arm/a64operand.h: In member function 'constexpr bool asmjit::_abi_1_13::a64::Vec::isVecH4() const': 2025-05-07T19:50:31.6383603Z /__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/arm/../arm/../arm/../arm/a64operand.h:133:111: warning: bitwise operation between different enumeration types 'asmjit::_abi_1_13::BaseReg::' and 'asmjit::_abi_1_13::arm::BaseVec::AdditionalBits' is deprecated [-Wdeprecated-enum-enum-conversion] 2025-05-07T19:50:31.6387261Z 133 | ASMJIT_INLINE_NODEBUG constexpr bool isVecH4() const noexcept { return _signature.subset(kBaseSignatureMask | kSignatureRegElementTypeMask) == (RegTraits::kSignature | kSignatureElementH); } 2025-05-07T19:50:31.6389178Z | ~~~~~~~~~~~~~~~~~~~^~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 2025-05-07T19:50:31.6390746Z /__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/arm/../arm/../arm/../arm/a64operand.h: In member function 'constexpr bool asmjit::_abi_1_13::a64::Vec::isVecS2() const': 2025-05-07T19:50:31.6394057Z /__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/arm/../arm/../arm/../arm/a64operand.h:134:111: warning: bitwise operation between different enumeration types 'asmjit::_abi_1_13::BaseReg::' and 'asmjit::_abi_1_13::arm::BaseVec::AdditionalBits' is deprecated [-Wdeprecated-enum-enum-conversion] 2025-05-07T19:50:31.6397888Z 134 | ASMJIT_INLINE_NODEBUG constexpr bool isVecS2() const noexcept { return _signature.subset(kBaseSignatureMask | kSignatureRegElementTypeMask) == (RegTraits::kSignature | kSignatureElementS); } 2025-05-07T19:50:31.6399845Z | ~~~~~~~~~~~~~~~~~~~^~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 2025-05-07T19:50:31.6401445Z /__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/arm/../arm/../arm/../arm/a64operand.h: In member function 'constexpr bool asmjit::_abi_1_13::a64::Vec::isVecD1() const': 2025-05-07T19:50:31.6404623Z /__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/arm/../arm/../arm/../arm/a64operand.h:135:111: warning: bitwise operation between different enumeration types 'asmjit::_abi_1_13::BaseReg::' and 'asmjit::_abi_1_13::arm::BaseVec::AdditionalBits' is deprecated [-Wdeprecated-enum-enum-conversion] 2025-05-07T19:50:31.6408161Z 135 | ASMJIT_INLINE_NODEBUG constexpr bool isVecD1() const noexcept { return _signature.subset(kBaseSignatureMask | kSignatureRegElementTypeMask) == (RegTraits::kSignature); } 2025-05-07T19:50:31.6409957Z | ~~~~~~~~~~~~~~~~~~~^~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 2025-05-07T19:50:31.6411690Z /__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/arm/../arm/../arm/../arm/a64operand.h: In member function 'constexpr bool asmjit::_abi_1_13::a64::Vec::isVecB16() const': 2025-05-07T19:50:31.6414798Z /__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/arm/../arm/../arm/../arm/a64operand.h:137:112: warning: bitwise operation between different enumeration types 'asmjit::_abi_1_13::BaseReg::' and 'asmjit::_abi_1_13::arm::BaseVec::AdditionalBits' is deprecated [-Wdeprecated-enum-enum-conversion] 2025-05-07T19:50:31.6418166Z 137 | ASMJIT_INLINE_NODEBUG constexpr bool isVecB16() const noexcept { return _signature.subset(kBaseSignatureMask | kSignatureRegElementTypeMask) == (RegTraits::kSignature | kSignatureElementB); } 2025-05-07T19:50:31.6419982Z | ~~~~~~~~~~~~~~~~~~~^~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 2025-05-07T19:50:31.6421429Z /__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/arm/../arm/../arm/../arm/a64operand.h: In member function 'constexpr bool asmjit::_abi_1_13::a64::Vec::isVecH8() const': 2025-05-07T19:50:31.6424463Z /__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/arm/../arm/../arm/../arm/a64operand.h:138:111: warning: bitwise operation between different enumeration types 'asmjit::_abi_1_13::BaseReg::' and 'asmjit::_abi_1_13::arm::BaseVec::AdditionalBits' is deprecated [-Wdeprecated-enum-enum-conversion] 2025-05-07T19:50:31.6427848Z 138 | ASMJIT_INLINE_NODEBUG constexpr bool isVecH8() const noexcept { return _signature.subset(kBaseSignatureMask | kSignatureRegElementTypeMask) == (RegTraits::kSignature | kSignatureElementH); } 2025-05-07T19:50:31.6430222Z | ~~~~~~~~~~~~~~~~~~~^~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 2025-05-07T19:50:31.6431669Z /__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/arm/../arm/../arm/../arm/a64operand.h: In member function 'constexpr bool asmjit::_abi_1_13::a64::Vec::isVecS4() const': 2025-05-07T19:50:31.6434692Z /__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/arm/../arm/../arm/../arm/a64operand.h:139:111: warning: bitwise operation between different enumeration types 'asmjit::_abi_1_13::BaseReg::' and 'asmjit::_abi_1_13::arm::BaseVec::AdditionalBits' is deprecated [-Wdeprecated-enum-enum-conversion] 2025-05-07T19:50:31.6438121Z 139 | ASMJIT_INLINE_NODEBUG constexpr bool isVecS4() const noexcept { return _signature.subset(kBaseSignatureMask | kSignatureRegElementTypeMask) == (RegTraits::kSignature | kSignatureElementS); } 2025-05-07T19:50:31.6440182Z | ~~~~~~~~~~~~~~~~~~~^~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 2025-05-07T19:50:31.6441641Z /__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/arm/../arm/../arm/../arm/a64operand.h: In member function 'constexpr bool asmjit::_abi_1_13::a64::Vec::isVecD2() const': 2025-05-07T19:50:31.6444767Z /__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/arm/../arm/../arm/../arm/a64operand.h:140:111: warning: bitwise operation between different enumeration types 'asmjit::_abi_1_13::BaseReg::' and 'asmjit::_abi_1_13::arm::BaseVec::AdditionalBits' is deprecated [-Wdeprecated-enum-enum-conversion] 2025-05-07T19:50:31.6448140Z 140 | ASMJIT_INLINE_NODEBUG constexpr bool isVecD2() const noexcept { return _signature.subset(kBaseSignatureMask | kSignatureRegElementTypeMask) == (RegTraits::kSignature | kSignatureElementD); } 2025-05-07T19:50:31.6449930Z | ~~~~~~~~~~~~~~~~~~~^~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 2025-05-07T19:50:31.6451518Z /__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/arm/../arm/../arm/../arm/a64operand.h: In member function 'constexpr bool asmjit::_abi_1_13::a64::Vec::isVecB4x4() const': 2025-05-07T19:50:31.6454535Z /__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/arm/../arm/../arm/../arm/a64operand.h:141:113: warning: bitwise operation between different enumeration types 'asmjit::_abi_1_13::BaseReg::' and 'asmjit::_abi_1_13::arm::BaseVec::AdditionalBits' is deprecated [-Wdeprecated-enum-enum-conversion] 2025-05-07T19:50:31.6457887Z 141 | ASMJIT_INLINE_NODEBUG constexpr bool isVecB4x4() const noexcept { return _signature.subset(kBaseSignatureMask | kSignatureRegElementTypeMask) == (RegTraits::kSignature | kSignatureElementB4); } 2025-05-07T19:50:31.6459683Z | ~~~~~~~~~~~~~~~~~~~^~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 2025-05-07T19:50:31.6461181Z /__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/arm/../arm/../arm/../arm/a64operand.h: In member function 'constexpr bool asmjit::_abi_1_13::a64::Vec::isVecH2x4() const': 2025-05-07T19:50:31.6464191Z /__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/arm/../arm/../arm/../arm/a64operand.h:142:113: warning: bitwise operation between different enumeration types 'asmjit::_abi_1_13::BaseReg::' and 'asmjit::_abi_1_13::arm::BaseVec::AdditionalBits' is deprecated [-Wdeprecated-enum-enum-conversion] 2025-05-07T19:50:31.6467566Z 142 | ASMJIT_INLINE_NODEBUG constexpr bool isVecH2x4() const noexcept { return _signature.subset(kBaseSignatureMask | kSignatureRegElementTypeMask) == (RegTraits::kSignature | kSignatureElementH2); } 2025-05-07T19:50:31.6469387Z | ~~~~~~~~~~~~~~~~~~~^~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 2025-05-07T19:50:31.6469991Z At global scope: 2025-05-07T19:50:31.6471358Z cc1plus: note: unrecognized command-line option '-Wno-deprecated-anon-enum-enum-conversion' may have been intended to silence earlier diagnostics 2025-05-07T19:50:31.7241968Z [52/608] /github/home/miniconda/envs/build_binary/bin/x86_64-conda-linux-gnu-c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/lib -O3 -DNDEBUG -std=c++20 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -mavx2 -mf16c -mfma -fopenmp -MD -MT CMakeFiles/fbgemm.dir/__w/FBGEMM/FBGEMM/src/EmbeddingSpMDMAvx2.cc.o -MF CMakeFiles/fbgemm.dir/__w/FBGEMM/FBGEMM/src/EmbeddingSpMDMAvx2.cc.o.d -o CMakeFiles/fbgemm.dir/__w/FBGEMM/FBGEMM/src/EmbeddingSpMDMAvx2.cc.o -c /__w/FBGEMM/FBGEMM/src/EmbeddingSpMDMAvx2.cc 2025-05-07T19:50:31.9211140Z [53/608] /github/home/miniconda/envs/build_binary/bin/x86_64-conda-linux-gnu-c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dasmjit_EXPORTS -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/lib -O3 -DNDEBUG -std=c++20 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -MD -MT CMakeFiles/asmjit.dir/__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/core/rapass.cpp.o -MF CMakeFiles/asmjit.dir/__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/core/rapass.cpp.o.d -o CMakeFiles/asmjit.dir/__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/core/rapass.cpp.o -c /__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/core/rapass.cpp 2025-05-07T19:50:32.1008929Z [54/608] /github/home/miniconda/envs/build_binary/bin/x86_64-conda-linux-gnu-c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/lib -O3 -DNDEBUG -std=c++20 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -mavx2 -mf16c -mfma -mavx512f -mavx512bw -mavx512dq -mavx512vl -fopenmp -MD -MT CMakeFiles/fbgemm.dir/__w/FBGEMM/FBGEMM/src/EmbeddingSpMDMAvx512.cc.o -MF CMakeFiles/fbgemm.dir/__w/FBGEMM/FBGEMM/src/EmbeddingSpMDMAvx512.cc.o.d -o CMakeFiles/fbgemm.dir/__w/FBGEMM/FBGEMM/src/EmbeddingSpMDMAvx512.cc.o -c /__w/FBGEMM/FBGEMM/src/EmbeddingSpMDMAvx512.cc 2025-05-07T19:50:32.1369194Z [55/608] /github/home/miniconda/envs/build_binary/bin/x86_64-conda-linux-gnu-c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dasmjit_EXPORTS -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/lib -O3 -DNDEBUG -std=c++20 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -MD -MT CMakeFiles/asmjit.dir/__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/x86/x86builder.cpp.o -MF CMakeFiles/asmjit.dir/__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/x86/x86builder.cpp.o.d -o CMakeFiles/asmjit.dir/__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/x86/x86builder.cpp.o -c /__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/x86/x86builder.cpp 2025-05-07T19:50:32.1462638Z [56/608] /github/home/miniconda/envs/build_binary/bin/x86_64-conda-linux-gnu-c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dasmjit_EXPORTS -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/lib -O3 -DNDEBUG -std=c++20 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -MD -MT CMakeFiles/asmjit.dir/__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/x86/x86func.cpp.o -MF CMakeFiles/asmjit.dir/__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/x86/x86func.cpp.o.d -o CMakeFiles/asmjit.dir/__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/x86/x86func.cpp.o -c /__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/x86/x86func.cpp 2025-05-07T19:50:32.1764594Z [57/608] /github/home/miniconda/envs/build_binary/bin/x86_64-conda-linux-gnu-c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dasmjit_EXPORTS -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/lib -O3 -DNDEBUG -std=c++20 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -MD -MT CMakeFiles/asmjit.dir/__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/x86/x86emithelper.cpp.o -MF CMakeFiles/asmjit.dir/__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/x86/x86emithelper.cpp.o.d -o CMakeFiles/asmjit.dir/__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/x86/x86emithelper.cpp.o -c /__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/x86/x86emithelper.cpp 2025-05-07T19:50:32.2653450Z [58/608] /github/home/miniconda/envs/build_binary/bin/x86_64-conda-linux-gnu-c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/lib -O3 -DNDEBUG -std=c++20 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -MD -MT CMakeFiles/fbgemm.dir/__w/FBGEMM/FBGEMM/src/QuantUtils.cc.o -MF CMakeFiles/fbgemm.dir/__w/FBGEMM/FBGEMM/src/QuantUtils.cc.o.d -o CMakeFiles/fbgemm.dir/__w/FBGEMM/FBGEMM/src/QuantUtils.cc.o -c /__w/FBGEMM/FBGEMM/src/QuantUtils.cc 2025-05-07T19:50:32.5401809Z [59/608] /github/home/miniconda/envs/build_binary/bin/x86_64-conda-linux-gnu-c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dasmjit_EXPORTS -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/lib -O3 -DNDEBUG -std=c++20 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -MD -MT CMakeFiles/asmjit.dir/__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/x86/x86compiler.cpp.o -MF CMakeFiles/asmjit.dir/__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/x86/x86compiler.cpp.o.d -o CMakeFiles/asmjit.dir/__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/x86/x86compiler.cpp.o -c /__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/x86/x86compiler.cpp 2025-05-07T19:50:32.7968491Z [60/608] /github/home/miniconda/envs/build_binary/bin/x86_64-conda-linux-gnu-c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dasmjit_EXPORTS -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/lib -O3 -DNDEBUG -std=c++20 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -MD -MT CMakeFiles/asmjit.dir/__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/arm/a64assembler.cpp.o -MF CMakeFiles/asmjit.dir/__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/arm/a64assembler.cpp.o.d -o CMakeFiles/asmjit.dir/__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/arm/a64assembler.cpp.o -c /__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/arm/a64assembler.cpp 2025-05-07T19:50:32.7979034Z In file included from /__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/arm/../arm/../arm/a64emitter.h:12, 2025-05-07T19:50:32.7980399Z from /__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/arm/../arm/a64assembler.h:10, 2025-05-07T19:50:32.7981513Z from /__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/arm/a64assembler.cpp:18: 2025-05-07T19:50:32.7983639Z /__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/arm/../arm/../arm/../arm/a64operand.h: In member function 'constexpr bool asmjit::_abi_1_13::a64::Vec::isVecB8() const': 2025-05-07T19:50:32.7986992Z /__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/arm/../arm/../arm/../arm/a64operand.h:132:111: warning: bitwise operation between different enumeration types 'asmjit::_abi_1_13::BaseReg::' and 'asmjit::_abi_1_13::arm::BaseVec::AdditionalBits' is deprecated [-Wdeprecated-enum-enum-conversion] 2025-05-07T19:50:32.7990751Z 132 | ASMJIT_INLINE_NODEBUG constexpr bool isVecB8() const noexcept { return _signature.subset(kBaseSignatureMask | kSignatureRegElementTypeMask) == (RegTraits::kSignature | kSignatureElementB); } 2025-05-07T19:50:32.7992640Z | ~~~~~~~~~~~~~~~~~~~^~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 2025-05-07T19:50:32.7994200Z /__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/arm/../arm/../arm/../arm/a64operand.h: In member function 'constexpr bool asmjit::_abi_1_13::a64::Vec::isVecH4() const': 2025-05-07T19:50:32.7997602Z /__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/arm/../arm/../arm/../arm/a64operand.h:133:111: warning: bitwise operation between different enumeration types 'asmjit::_abi_1_13::BaseReg::' and 'asmjit::_abi_1_13::arm::BaseVec::AdditionalBits' is deprecated [-Wdeprecated-enum-enum-conversion] 2025-05-07T19:50:32.8001294Z 133 | ASMJIT_INLINE_NODEBUG constexpr bool isVecH4() const noexcept { return _signature.subset(kBaseSignatureMask | kSignatureRegElementTypeMask) == (RegTraits::kSignature | kSignatureElementH); } 2025-05-07T19:50:32.8003260Z | ~~~~~~~~~~~~~~~~~~~^~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 2025-05-07T19:50:32.8004844Z /__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/arm/../arm/../arm/../arm/a64operand.h: In member function 'constexpr bool asmjit::_abi_1_13::a64::Vec::isVecS2() const': 2025-05-07T19:50:32.8008097Z /__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/arm/../arm/../arm/../arm/a64operand.h:134:111: warning: bitwise operation between different enumeration types 'asmjit::_abi_1_13::BaseReg::' and 'asmjit::_abi_1_13::arm::BaseVec::AdditionalBits' is deprecated [-Wdeprecated-enum-enum-conversion] 2025-05-07T19:50:32.8012100Z 134 | ASMJIT_INLINE_NODEBUG constexpr bool isVecS2() const noexcept { return _signature.subset(kBaseSignatureMask | kSignatureRegElementTypeMask) == (RegTraits::kSignature | kSignatureElementS); } 2025-05-07T19:50:32.8014085Z | ~~~~~~~~~~~~~~~~~~~^~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 2025-05-07T19:50:32.8015628Z /__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/arm/../arm/../arm/../arm/a64operand.h: In member function 'constexpr bool asmjit::_abi_1_13::a64::Vec::isVecD1() const': 2025-05-07T19:50:32.8021037Z /__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/arm/../arm/../arm/../arm/a64operand.h:135:111: warning: bitwise operation between different enumeration types 'asmjit::_abi_1_13::BaseReg::' and 'asmjit::_abi_1_13::arm::BaseVec::AdditionalBits' is deprecated [-Wdeprecated-enum-enum-conversion] 2025-05-07T19:50:32.8024757Z 135 | ASMJIT_INLINE_NODEBUG constexpr bool isVecD1() const noexcept { return _signature.subset(kBaseSignatureMask | kSignatureRegElementTypeMask) == (RegTraits::kSignature); } 2025-05-07T19:50:32.8026557Z | ~~~~~~~~~~~~~~~~~~~^~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 2025-05-07T19:50:32.8028054Z /__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/arm/../arm/../arm/../arm/a64operand.h: In member function 'constexpr bool asmjit::_abi_1_13::a64::Vec::isVecB16() const': 2025-05-07T19:50:32.8031545Z /__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/arm/../arm/../arm/../arm/a64operand.h:137:112: warning: bitwise operation between different enumeration types 'asmjit::_abi_1_13::BaseReg::' and 'asmjit::_abi_1_13::arm::BaseVec::AdditionalBits' is deprecated [-Wdeprecated-enum-enum-conversion] 2025-05-07T19:50:32.8035522Z 137 | ASMJIT_INLINE_NODEBUG constexpr bool isVecB16() const noexcept { return _signature.subset(kBaseSignatureMask | kSignatureRegElementTypeMask) == (RegTraits::kSignature | kSignatureElementB); } 2025-05-07T19:50:32.8037413Z | ~~~~~~~~~~~~~~~~~~~^~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 2025-05-07T19:50:32.8038934Z /__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/arm/../arm/../arm/../arm/a64operand.h: In member function 'constexpr bool asmjit::_abi_1_13::a64::Vec::isVecH8() const': 2025-05-07T19:50:32.8042172Z /__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/arm/../arm/../arm/../arm/a64operand.h:138:111: warning: bitwise operation between different enumeration types 'asmjit::_abi_1_13::BaseReg::' and 'asmjit::_abi_1_13::arm::BaseVec::AdditionalBits' is deprecated [-Wdeprecated-enum-enum-conversion] 2025-05-07T19:50:32.8045834Z 138 | ASMJIT_INLINE_NODEBUG constexpr bool isVecH8() const noexcept { return _signature.subset(kBaseSignatureMask | kSignatureRegElementTypeMask) == (RegTraits::kSignature | kSignatureElementH); } 2025-05-07T19:50:32.8047719Z | ~~~~~~~~~~~~~~~~~~~^~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 2025-05-07T19:50:32.8049258Z /__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/arm/../arm/../arm/../arm/a64operand.h: In member function 'constexpr bool asmjit::_abi_1_13::a64::Vec::isVecS4() const': 2025-05-07T19:50:32.8052662Z /__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/arm/../arm/../arm/../arm/a64operand.h:139:111: warning: bitwise operation between different enumeration types 'asmjit::_abi_1_13::BaseReg::' and 'asmjit::_abi_1_13::arm::BaseVec::AdditionalBits' is deprecated [-Wdeprecated-enum-enum-conversion] 2025-05-07T19:50:32.8056322Z 139 | ASMJIT_INLINE_NODEBUG constexpr bool isVecS4() const noexcept { return _signature.subset(kBaseSignatureMask | kSignatureRegElementTypeMask) == (RegTraits::kSignature | kSignatureElementS); } 2025-05-07T19:50:32.8058218Z | ~~~~~~~~~~~~~~~~~~~^~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 2025-05-07T19:50:32.8059740Z /__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/arm/../arm/../arm/../arm/a64operand.h: In member function 'constexpr bool asmjit::_abi_1_13::a64::Vec::isVecD2() const': 2025-05-07T19:50:32.8063042Z /__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/arm/../arm/../arm/../arm/a64operand.h:140:111: warning: bitwise operation between different enumeration types 'asmjit::_abi_1_13::BaseReg::' and 'asmjit::_abi_1_13::arm::BaseVec::AdditionalBits' is deprecated [-Wdeprecated-enum-enum-conversion] 2025-05-07T19:50:32.8066927Z 140 | ASMJIT_INLINE_NODEBUG constexpr bool isVecD2() const noexcept { return _signature.subset(kBaseSignatureMask | kSignatureRegElementTypeMask) == (RegTraits::kSignature | kSignatureElementD); } 2025-05-07T19:50:32.8068753Z | ~~~~~~~~~~~~~~~~~~~^~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 2025-05-07T19:50:32.8070312Z /__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/arm/../arm/../arm/../arm/a64operand.h: In member function 'constexpr bool asmjit::_abi_1_13::a64::Vec::isVecB4x4() const': 2025-05-07T19:50:32.8073649Z /__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/arm/../arm/../arm/../arm/a64operand.h:141:113: warning: bitwise operation between different enumeration types 'asmjit::_abi_1_13::BaseReg::' and 'asmjit::_abi_1_13::arm::BaseVec::AdditionalBits' is deprecated [-Wdeprecated-enum-enum-conversion] 2025-05-07T19:50:32.8077520Z 141 | ASMJIT_INLINE_NODEBUG constexpr bool isVecB4x4() const noexcept { return _signature.subset(kBaseSignatureMask | kSignatureRegElementTypeMask) == (RegTraits::kSignature | kSignatureElementB4); } 2025-05-07T19:50:32.8079455Z | ~~~~~~~~~~~~~~~~~~~^~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 2025-05-07T19:50:32.8081066Z /__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/arm/../arm/../arm/../arm/a64operand.h: In member function 'constexpr bool asmjit::_abi_1_13::a64::Vec::isVecH2x4() const': 2025-05-07T19:50:32.8084405Z /__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/arm/../arm/../arm/../arm/a64operand.h:142:113: warning: bitwise operation between different enumeration types 'asmjit::_abi_1_13::BaseReg::' and 'asmjit::_abi_1_13::arm::BaseVec::AdditionalBits' is deprecated [-Wdeprecated-enum-enum-conversion] 2025-05-07T19:50:32.8088042Z 142 | ASMJIT_INLINE_NODEBUG constexpr bool isVecH2x4() const noexcept { return _signature.subset(kBaseSignatureMask | kSignatureRegElementTypeMask) == (RegTraits::kSignature | kSignatureElementH2); } 2025-05-07T19:50:32.8090014Z | ~~~~~~~~~~~~~~~~~~~^~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 2025-05-07T19:50:32.8090797Z At global scope: 2025-05-07T19:50:32.8092005Z cc1plus: note: unrecognized command-line option '-Wno-deprecated-anon-enum-enum-conversion' may have been intended to silence earlier diagnostics 2025-05-07T19:50:33.1376605Z [61/608] /github/home/miniconda/envs/build_binary/bin/x86_64-conda-linux-gnu-c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dasmjit_EXPORTS -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/lib -O3 -DNDEBUG -std=c++20 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -MD -MT CMakeFiles/asmjit.dir/__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/x86/x86rapass.cpp.o -MF CMakeFiles/asmjit.dir/__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/x86/x86rapass.cpp.o.d -o CMakeFiles/asmjit.dir/__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/x86/x86rapass.cpp.o -c /__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/x86/x86rapass.cpp 2025-05-07T19:50:33.5546275Z [62/608] /github/home/miniconda/envs/build_binary/bin/x86_64-conda-linux-gnu-c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/lib -O3 -DNDEBUG -std=c++20 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -MD -MT CMakeFiles/fbgemm.dir/__w/FBGEMM/FBGEMM/src/Utils.cc.o -MF CMakeFiles/fbgemm.dir/__w/FBGEMM/FBGEMM/src/Utils.cc.o.d -o CMakeFiles/fbgemm.dir/__w/FBGEMM/FBGEMM/src/Utils.cc.o -c /__w/FBGEMM/FBGEMM/src/Utils.cc 2025-05-07T19:50:34.0352012Z [63/608] /github/home/miniconda/envs/build_binary/bin/x86_64-conda-linux-gnu-c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dasmjit_EXPORTS -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/lib -O3 -DNDEBUG -std=c++20 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -MD -MT CMakeFiles/asmjit.dir/__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/x86/x86assembler.cpp.o -MF CMakeFiles/asmjit.dir/__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/x86/x86assembler.cpp.o.d -o CMakeFiles/asmjit.dir/__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/x86/x86assembler.cpp.o -c /__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/x86/x86assembler.cpp 2025-05-07T19:50:34.6260308Z [64/608] : && /github/home/miniconda/envs/build_binary/bin/x86_64-conda-linux-gnu-c++ -fPIC -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/lib -O3 -DNDEBUG -Wl,-O2 -Wl,--sort-common -Wl,--as-needed -Wl,-z,relro -Wl,-z,now -Wl,--disable-new-dtags -Wl,--gc-sections -Wl,--allow-shlib-undefined -Wl,-rpath,/github/home/miniconda/envs/build_binary/lib -Wl,-rpath-link,/github/home/miniconda/envs/build_binary/lib -L/github/home/miniconda/envs/build_binary/lib -L/github/home/miniconda/envs/build_binary/targets/x86_64-linux/lib -L/github/home/miniconda/envs/build_binary/targets/x86_64-linux/lib/stubs -s -shared -Wl,-soname,asmjit.so -o asmjit.so CMakeFiles/asmjit.dir/__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/arm/a64assembler.cpp.o CMakeFiles/asmjit.dir/__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/arm/a64builder.cpp.o CMakeFiles/asmjit.dir/__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/arm/a64compiler.cpp.o CMakeFiles/asmjit.dir/__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/arm/a64emithelper.cpp.o CMakeFiles/asmjit.dir/__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/arm/a64formatter.cpp.o CMakeFiles/asmjit.dir/__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/arm/a64func.cpp.o CMakeFiles/asmjit.dir/__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/arm/a64instapi.cpp.o CMakeFiles/asmjit.dir/__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/arm/a64instdb.cpp.o CMakeFiles/asmjit.dir/__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/arm/a64operand.cpp.o CMakeFiles/asmjit.dir/__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/arm/a64rapass.cpp.o CMakeFiles/asmjit.dir/__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/arm/armformatter.cpp.o CMakeFiles/asmjit.dir/__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/core/archtraits.cpp.o CMakeFiles/asmjit.dir/__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/core/assembler.cpp.o CMakeFiles/asmjit.dir/__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/core/builder.cpp.o CMakeFiles/asmjit.dir/__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/core/codeholder.cpp.o CMakeFiles/asmjit.dir/__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/core/codewriter.cpp.o CMakeFiles/asmjit.dir/__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/core/compiler.cpp.o CMakeFiles/asmjit.dir/__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/core/constpool.cpp.o CMakeFiles/asmjit.dir/__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/core/cpuinfo.cpp.o CMakeFiles/asmjit.dir/__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/core/emithelper.cpp.o CMakeFiles/asmjit.dir/__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/core/emitter.cpp.o CMakeFiles/asmjit.dir/__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/core/emitterutils.cpp.o CMakeFiles/asmjit.dir/__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/core/environment.cpp.o CMakeFiles/asmjit.dir/__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/core/errorhandler.cpp.o CMakeFiles/asmjit.dir/__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/core/formatter.cpp.o CMakeFiles/asmjit.dir/__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/core/func.cpp.o CMakeFiles/asmjit.dir/__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/core/funcargscontext.cpp.o CMakeFiles/asmjit.dir/__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/core/globals.cpp.o CMakeFiles/asmjit.dir/__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/core/inst.cpp.o CMakeFiles/asmjit.dir/__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/core/instdb.cpp.o CMakeFiles/asmjit.dir/__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/core/jitallocator.cpp.o CMakeFiles/asmjit.dir/__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/core/jitruntime.cpp.o CMakeFiles/asmjit.dir/__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/core/logger.cpp.o CMakeFiles/asmjit.dir/__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/core/operand.cpp.o CMakeFiles/asmjit.dir/__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/core/osutils.cpp.o CMakeFiles/asmjit.dir/__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/core/ralocal.cpp.o CMakeFiles/asmjit.dir/__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/core/rapass.cpp.o CMakeFiles/asmjit.dir/__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/core/rastack.cpp.o CMakeFiles/asmjit.dir/__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/core/string.cpp.o CMakeFiles/asmjit.dir/__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/core/support.cpp.o CMakeFiles/asmjit.dir/__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/core/target.cpp.o CMakeFiles/asmjit.dir/__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/core/type.cpp.o CMakeFiles/asmjit.dir/__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/core/virtmem.cpp.o CMakeFiles/asmjit.dir/__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/core/zone.cpp.o CMakeFiles/asmjit.dir/__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/core/zonehash.cpp.o CMakeFiles/asmjit.dir/__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/core/zonelist.cpp.o CMakeFiles/asmjit.dir/__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/core/zonestack.cpp.o CMakeFiles/asmjit.dir/__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/core/zonetree.cpp.o CMakeFiles/asmjit.dir/__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/core/zonevector.cpp.o CMakeFiles/asmjit.dir/__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/x86/x86assembler.cpp.o CMakeFiles/asmjit.dir/__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/x86/x86builder.cpp.o CMakeFiles/asmjit.dir/__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/x86/x86compiler.cpp.o CMakeFiles/asmjit.dir/__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/x86/x86emithelper.cpp.o CMakeFiles/asmjit.dir/__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/x86/x86formatter.cpp.o CMakeFiles/asmjit.dir/__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/x86/x86func.cpp.o CMakeFiles/asmjit.dir/__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/x86/x86instapi.cpp.o CMakeFiles/asmjit.dir/__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/x86/x86instdb.cpp.o CMakeFiles/asmjit.dir/__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/x86/x86operand.cpp.o CMakeFiles/asmjit.dir/__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/x86/x86rapass.cpp.o -L/lib/intel64 -L/lib/intel64_win -L/lib/win-x64 -Wl,-rpath,/lib/intel64:/lib/intel64_win:/lib/win-x64:/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/lib:/github/home/miniconda/envs/build_binary/lib/stubs: /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/lib/libtorch.so /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/lib/libc10.so /github/home/miniconda/envs/build_binary/targets/x86_64-linux/lib/libnvrtc.so /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/lib/libc10_cuda.so /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/lib/libnccl.so.2 /github/home/miniconda/envs/build_binary/targets/x86_64-linux/lib/stubs/libcuda.so /github/home/miniconda/envs/build_binary/lib/stubs/libnvidia-ml.so -Wl,--no-as-needed,"/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/lib/libtorch_cpu.so" -Wl,--as-needed -Wl,--no-as-needed,"/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/lib/libtorch_cuda.so" -Wl,--as-needed /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/lib/libc10_cuda.so /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/lib/libc10.so /github/home/miniconda/envs/build_binary/targets/x86_64-linux/lib/libcudart.so -Wl,--no-as-needed,"/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/lib/libtorch.so" -Wl,--as-needed && : 2025-05-07T19:50:34.7080530Z [65/608] /github/home/miniconda/envs/build_binary/bin/x86_64-conda-linux-gnu-c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/lib -O3 -DNDEBUG -std=c++20 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -MD -MT CMakeFiles/fbgemm.dir/__w/FBGEMM/FBGEMM/src/SparseAdagrad.cc.o -MF CMakeFiles/fbgemm.dir/__w/FBGEMM/FBGEMM/src/SparseAdagrad.cc.o.d -o CMakeFiles/fbgemm.dir/__w/FBGEMM/FBGEMM/src/SparseAdagrad.cc.o -c /__w/FBGEMM/FBGEMM/src/SparseAdagrad.cc 2025-05-07T19:50:37.0861301Z [66/608] /github/home/miniconda/envs/build_binary/bin/x86_64-conda-linux-gnu-c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/lib -O3 -DNDEBUG -std=c++20 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -MD -MT CMakeFiles/fbgemm.dir/__w/FBGEMM/FBGEMM/src/RefImplementations.cc.o -MF CMakeFiles/fbgemm.dir/__w/FBGEMM/FBGEMM/src/RefImplementations.cc.o.d -o CMakeFiles/fbgemm.dir/__w/FBGEMM/FBGEMM/src/RefImplementations.cc.o -c /__w/FBGEMM/FBGEMM/src/RefImplementations.cc 2025-05-07T19:50:39.2881677Z [67/608] /github/home/miniconda/envs/build_binary/bin/x86_64-conda-linux-gnu-c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/lib -O3 -DNDEBUG -std=c++20 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -MD -MT CMakeFiles/fbgemm.dir/__w/FBGEMM/FBGEMM/src/RowWiseSparseAdagradFused.cc.o -MF CMakeFiles/fbgemm.dir/__w/FBGEMM/FBGEMM/src/RowWiseSparseAdagradFused.cc.o.d -o CMakeFiles/fbgemm.dir/__w/FBGEMM/FBGEMM/src/RowWiseSparseAdagradFused.cc.o -c /__w/FBGEMM/FBGEMM/src/RowWiseSparseAdagradFused.cc 2025-05-07T19:50:39.3262799Z [68/608] /github/home/miniconda/envs/build_binary/bin/x86_64-conda-linux-gnu-c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_cache_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/lib -O3 -DNDEBUG -std=c++20 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -mavx2 -mf16c -mfma -fopenmp -MD -MT CMakeFiles/fbgemm_gpu_tbe_cache.dir/src/split_embeddings_cache/lfu_cache_populate_byte.cpp.o -MF CMakeFiles/fbgemm_gpu_tbe_cache.dir/src/split_embeddings_cache/lfu_cache_populate_byte.cpp.o.d -o CMakeFiles/fbgemm_gpu_tbe_cache.dir/src/split_embeddings_cache/lfu_cache_populate_byte.cpp.o -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/src/split_embeddings_cache/lfu_cache_populate_byte.cpp 2025-05-07T19:50:39.3491441Z [69/608] /github/home/miniconda/envs/build_binary/bin/x86_64-conda-linux-gnu-c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_cache_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/lib -O3 -DNDEBUG -std=c++20 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -mavx2 -mf16c -mfma -fopenmp -MD -MT CMakeFiles/fbgemm_gpu_tbe_cache.dir/src/split_embeddings_cache/linearize_cache_indices.cpp.o -MF CMakeFiles/fbgemm_gpu_tbe_cache.dir/src/split_embeddings_cache/linearize_cache_indices.cpp.o.d -o CMakeFiles/fbgemm_gpu_tbe_cache.dir/src/split_embeddings_cache/linearize_cache_indices.cpp.o -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/src/split_embeddings_cache/linearize_cache_indices.cpp 2025-05-07T19:50:39.3668665Z [70/608] /github/home/miniconda/envs/build_binary/bin/x86_64-conda-linux-gnu-c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_cache_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/lib -O3 -DNDEBUG -std=c++20 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -mavx2 -mf16c -mfma -fopenmp -MD -MT CMakeFiles/fbgemm_gpu_tbe_cache.dir/src/split_embeddings_cache/lru_cache_populate_byte.cpp.o -MF CMakeFiles/fbgemm_gpu_tbe_cache.dir/src/split_embeddings_cache/lru_cache_populate_byte.cpp.o.d -o CMakeFiles/fbgemm_gpu_tbe_cache.dir/src/split_embeddings_cache/lru_cache_populate_byte.cpp.o -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/src/split_embeddings_cache/lru_cache_populate_byte.cpp 2025-05-07T19:50:39.5684529Z [71/608] /github/home/miniconda/envs/build_binary/bin/x86_64-conda-linux-gnu-c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_cache_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/lib -O3 -DNDEBUG -std=c++20 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -mavx2 -mf16c -mfma -fopenmp -MD -MT CMakeFiles/fbgemm_gpu_tbe_cache.dir/src/split_embeddings_cache/lxu_cache.cpp.o -MF CMakeFiles/fbgemm_gpu_tbe_cache.dir/src/split_embeddings_cache/lxu_cache.cpp.o.d -o CMakeFiles/fbgemm_gpu_tbe_cache.dir/src/split_embeddings_cache/lxu_cache.cpp.o -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/src/split_embeddings_cache/lxu_cache.cpp 2025-05-07T19:50:40.8166152Z [72/608] /github/home/miniconda/envs/build_binary/bin/x86_64-conda-linux-gnu-c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/lib -O3 -DNDEBUG -std=c++20 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -MD -MT CMakeFiles/fbgemm.dir/__w/FBGEMM/FBGEMM/src/EmbeddingSpMDMAutovec.cc.o -MF CMakeFiles/fbgemm.dir/__w/FBGEMM/FBGEMM/src/EmbeddingSpMDMAutovec.cc.o.d -o CMakeFiles/fbgemm.dir/__w/FBGEMM/FBGEMM/src/EmbeddingSpMDMAutovec.cc.o -c /__w/FBGEMM/FBGEMM/src/EmbeddingSpMDMAutovec.cc 2025-05-07T19:50:41.0659396Z [73/608] /github/home/miniconda/envs/build_binary/bin/x86_64-conda-linux-gnu-c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_cache_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/lib -O3 -DNDEBUG -std=c++20 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -mavx2 -mf16c -mfma -fopenmp -MD -MT CMakeFiles/fbgemm_gpu_tbe_cache.dir/src/split_embeddings_cache/split_embeddings_cache_ops.cpp.o -MF CMakeFiles/fbgemm_gpu_tbe_cache.dir/src/split_embeddings_cache/split_embeddings_cache_ops.cpp.o.d -o CMakeFiles/fbgemm_gpu_tbe_cache.dir/src/split_embeddings_cache/split_embeddings_cache_ops.cpp.o -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/src/split_embeddings_cache/split_embeddings_cache_ops.cpp 2025-05-07T19:50:41.4860190Z [74/608] /github/home/miniconda/envs/build_binary/bin/x86_64-conda-linux-gnu-c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_inference_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/lib -O3 -DNDEBUG -std=c++20 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -mavx2 -mf16c -mfma -fopenmp -MD -MT CMakeFiles/fbgemm_gpu_tbe_inference.dir/gen_embedding_forward_quantized_weighted_codegen_cpu.cpp.o -MF CMakeFiles/fbgemm_gpu_tbe_inference.dir/gen_embedding_forward_quantized_weighted_codegen_cpu.cpp.o.d -o CMakeFiles/fbgemm_gpu_tbe_inference.dir/gen_embedding_forward_quantized_weighted_codegen_cpu.cpp.o -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_forward_quantized_weighted_codegen_cpu.cpp 2025-05-07T19:50:41.7653949Z [75/608] /github/home/miniconda/envs/build_binary/bin/x86_64-conda-linux-gnu-c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_inference_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/lib -O3 -DNDEBUG -std=c++20 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -mavx2 -mf16c -mfma -fopenmp -MD -MT CMakeFiles/fbgemm_gpu_tbe_inference.dir/codegen/inference/embedding_forward_quantized_host.cpp.o -MF CMakeFiles/fbgemm_gpu_tbe_inference.dir/codegen/inference/embedding_forward_quantized_host.cpp.o.d -o CMakeFiles/fbgemm_gpu_tbe_inference.dir/codegen/inference/embedding_forward_quantized_host.cpp.o -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/codegen/inference/embedding_forward_quantized_host.cpp 2025-05-07T19:50:43.4425803Z [76/608] /github/home/miniconda/envs/build_binary/bin/x86_64-conda-linux-gnu-c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_inference_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/lib -O3 -DNDEBUG -std=c++20 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -mavx2 -mf16c -mfma -fopenmp -MD -MT CMakeFiles/fbgemm_gpu_tbe_inference.dir/gen_embedding_forward_quantized_unweighted_codegen_cpu.cpp.o -MF CMakeFiles/fbgemm_gpu_tbe_inference.dir/gen_embedding_forward_quantized_unweighted_codegen_cpu.cpp.o.d -o CMakeFiles/fbgemm_gpu_tbe_inference.dir/gen_embedding_forward_quantized_unweighted_codegen_cpu.cpp.o -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_forward_quantized_unweighted_codegen_cpu.cpp 2025-05-07T19:50:44.1502255Z [77/608] /github/home/miniconda/envs/build_binary/bin/x86_64-conda-linux-gnu-c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_config_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/lib -O3 -DNDEBUG -std=c++20 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -mavx2 -mf16c -mfma -fopenmp -MD -MT CMakeFiles/fbgemm_gpu_config.dir/src/config/feature_gates.cpp.o -MF CMakeFiles/fbgemm_gpu_config.dir/src/config/feature_gates.cpp.o.d -o CMakeFiles/fbgemm_gpu_config.dir/src/config/feature_gates.cpp.o -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/src/config/feature_gates.cpp 2025-05-07T19:50:44.6984805Z [78/608] : && /github/home/miniconda/envs/build_binary/bin/x86_64-conda-linux-gnu-c++ -fPIC -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/lib -O3 -DNDEBUG -Wl,-O2 -Wl,--sort-common -Wl,--as-needed -Wl,-z,relro -Wl,-z,now -Wl,--disable-new-dtags -Wl,--gc-sections -Wl,--allow-shlib-undefined -Wl,-rpath,/github/home/miniconda/envs/build_binary/lib -Wl,-rpath-link,/github/home/miniconda/envs/build_binary/lib -L/github/home/miniconda/envs/build_binary/lib -L/github/home/miniconda/envs/build_binary/targets/x86_64-linux/lib -L/github/home/miniconda/envs/build_binary/targets/x86_64-linux/lib/stubs -s -shared -Wl,-soname,fbgemm_gpu_config.so -o fbgemm_gpu_config.so CMakeFiles/fbgemm_gpu_config.dir/src/config/feature_gates.cpp.o -L/lib/intel64 -L/lib/intel64_win -L/lib/win-x64 -Wl,-rpath,/lib/intel64:/lib/intel64_win:/lib/win-x64:/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/lib:/github/home/miniconda/envs/build_binary/lib/stubs: /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/lib/libtorch.so /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/lib/libc10.so /github/home/miniconda/envs/build_binary/targets/x86_64-linux/lib/libnvrtc.so /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/lib/libc10_cuda.so /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/lib/libnccl.so.2 /github/home/miniconda/envs/build_binary/targets/x86_64-linux/lib/stubs/libcuda.so /github/home/miniconda/envs/build_binary/lib/stubs/libnvidia-ml.so -Wl,--no-as-needed,"/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/lib/libtorch_cpu.so" -Wl,--as-needed -Wl,--no-as-needed,"/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/lib/libtorch_cuda.so" -Wl,--as-needed /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/lib/libc10_cuda.so /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/lib/libc10.so /github/home/miniconda/envs/build_binary/targets/x86_64-linux/lib/libcudart.so -Wl,--no-as-needed,"/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/lib/libtorch.so" -Wl,--as-needed && : 2025-05-07T19:50:47.1489887Z [79/608] /github/home/miniconda/envs/build_binary/bin/x86_64-conda-linux-gnu-c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_utils_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/lib -O3 -DNDEBUG -std=c++20 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -mavx2 -mf16c -mfma -fopenmp -MD -MT CMakeFiles/fbgemm_gpu_tbe_utils.dir/src/split_embeddings_utils/split_embeddings_utils_cpu.cpp.o -MF CMakeFiles/fbgemm_gpu_tbe_utils.dir/src/split_embeddings_utils/split_embeddings_utils_cpu.cpp.o.d -o CMakeFiles/fbgemm_gpu_tbe_utils.dir/src/split_embeddings_utils/split_embeddings_utils_cpu.cpp.o -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/src/split_embeddings_utils/split_embeddings_utils_cpu.cpp 2025-05-07T19:50:47.6685581Z [80/608] /github/home/miniconda/envs/build_binary/bin/x86_64-conda-linux-gnu-c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/lib -O3 -DNDEBUG -std=c++20 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -mavx2 -mf16c -mfma -fopenmp -MD -MT CMakeFiles/fbgemm.dir/__w/FBGEMM/FBGEMM/src/QuantUtilsAvx2.cc.o -MF CMakeFiles/fbgemm.dir/__w/FBGEMM/FBGEMM/src/QuantUtilsAvx2.cc.o.d -o CMakeFiles/fbgemm.dir/__w/FBGEMM/FBGEMM/src/QuantUtilsAvx2.cc.o -c /__w/FBGEMM/FBGEMM/src/QuantUtilsAvx2.cc 2025-05-07T19:50:49.0439558Z [81/608] /github/home/miniconda/envs/build_binary/bin/x86_64-conda-linux-gnu-c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_utils_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/lib -O3 -DNDEBUG -std=c++20 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -mavx2 -mf16c -mfma -fopenmp -MD -MT CMakeFiles/fbgemm_gpu_tbe_utils.dir/src/split_embeddings_utils/split_embeddings_utils.cpp.o -MF CMakeFiles/fbgemm_gpu_tbe_utils.dir/src/split_embeddings_utils/split_embeddings_utils.cpp.o.d -o CMakeFiles/fbgemm_gpu_tbe_utils.dir/src/split_embeddings_utils/split_embeddings_utils.cpp.o -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/src/split_embeddings_utils/split_embeddings_utils.cpp 2025-05-07T19:50:49.5920317Z [82/608] /github/home/miniconda/envs/build_binary/bin/x86_64-conda-linux-gnu-c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_utils_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/lib -O3 -DNDEBUG -std=c++20 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -mavx2 -mf16c -mfma -fopenmp -MD -MT CMakeFiles/fbgemm_gpu_tbe_utils.dir/src/split_embeddings_utils/split_embeddings_utils_meta.cpp.o -MF CMakeFiles/fbgemm_gpu_tbe_utils.dir/src/split_embeddings_utils/split_embeddings_utils_meta.cpp.o.d -o CMakeFiles/fbgemm_gpu_tbe_utils.dir/src/split_embeddings_utils/split_embeddings_utils_meta.cpp.o -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/src/split_embeddings_utils/split_embeddings_utils_meta.cpp 2025-05-07T19:50:51.5609720Z [83/608] /github/home/miniconda/envs/build_binary/bin/x86_64-conda-linux-gnu-c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_inference_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/lib -O3 -DNDEBUG -std=c++20 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -mavx2 -mf16c -mfma -fopenmp -MD -MT CMakeFiles/fbgemm_gpu_tbe_inference.dir/codegen/inference/embedding_forward_quantized_host_cpu.cpp.o -MF CMakeFiles/fbgemm_gpu_tbe_inference.dir/codegen/inference/embedding_forward_quantized_host_cpu.cpp.o.d -o CMakeFiles/fbgemm_gpu_tbe_inference.dir/codegen/inference/embedding_forward_quantized_host_cpu.cpp.o -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/codegen/inference/embedding_forward_quantized_host_cpu.cpp 2025-05-07T19:50:53.3500885Z [84/608] /github/home/miniconda/envs/build_binary/bin/x86_64-conda-linux-gnu-c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_sparse_async_cumsum_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/lib -O3 -DNDEBUG -std=c++20 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -mavx2 -mf16c -mfma -fopenmp -MD -MT CMakeFiles/fbgemm_gpu_sparse_async_cumsum.dir/src/sparse_ops/sparse_async_cumsum.cpp.o -MF CMakeFiles/fbgemm_gpu_sparse_async_cumsum.dir/src/sparse_ops/sparse_async_cumsum.cpp.o.d -o CMakeFiles/fbgemm_gpu_sparse_async_cumsum.dir/src/sparse_ops/sparse_async_cumsum.cpp.o -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/src/sparse_ops/sparse_async_cumsum.cpp 2025-05-07T19:50:54.0490042Z [85/608] /github/home/miniconda/envs/build_binary/bin/x86_64-conda-linux-gnu-c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_common_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/lib -O3 -DNDEBUG -std=c++20 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -mavx2 -mf16c -mfma -fopenmp -MD -MT CMakeFiles/fbgemm_gpu_tbe_common.dir/codegen/training/pt2/pt2_autograd_utils.cpp.o -MF CMakeFiles/fbgemm_gpu_tbe_common.dir/codegen/training/pt2/pt2_autograd_utils.cpp.o.d -o CMakeFiles/fbgemm_gpu_tbe_common.dir/codegen/training/pt2/pt2_autograd_utils.cpp.o -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/codegen/training/pt2/pt2_autograd_utils.cpp 2025-05-07T19:50:56.4075257Z [86/608] /github/home/miniconda/envs/build_binary/bin/x86_64-conda-linux-gnu-c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_common_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/lib -O3 -DNDEBUG -std=c++20 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -mavx2 -mf16c -mfma -fopenmp -MD -MT CMakeFiles/fbgemm_gpu_tbe_common.dir/codegen/utils/embedding_bounds_check_host_cpu.cpp.o -MF CMakeFiles/fbgemm_gpu_tbe_common.dir/codegen/utils/embedding_bounds_check_host_cpu.cpp.o.d -o CMakeFiles/fbgemm_gpu_tbe_common.dir/codegen/utils/embedding_bounds_check_host_cpu.cpp.o -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/codegen/utils/embedding_bounds_check_host_cpu.cpp 2025-05-07T19:50:57.5896224Z [87/608] /github/home/miniconda/envs/build_binary/bin/x86_64-conda-linux-gnu-c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_common_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/lib -O3 -DNDEBUG -std=c++20 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -mavx2 -mf16c -mfma -fopenmp -MD -MT CMakeFiles/fbgemm_gpu_tbe_common.dir/codegen/utils/embedding_bounds_check_host.cpp.o -MF CMakeFiles/fbgemm_gpu_tbe_common.dir/codegen/utils/embedding_bounds_check_host.cpp.o.d -o CMakeFiles/fbgemm_gpu_tbe_common.dir/codegen/utils/embedding_bounds_check_host.cpp.o -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/codegen/utils/embedding_bounds_check_host.cpp 2025-05-07T19:50:58.8590254Z [88/608] /github/home/miniconda/envs/build_binary/bin/x86_64-conda-linux-gnu-c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/lib -O3 -DNDEBUG -std=c++20 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -MD -MT CMakeFiles/fbgemm.dir/__w/FBGEMM/FBGEMM/src/EmbeddingSpMDMNBit.cc.o -MF CMakeFiles/fbgemm.dir/__w/FBGEMM/FBGEMM/src/EmbeddingSpMDMNBit.cc.o.d -o CMakeFiles/fbgemm.dir/__w/FBGEMM/FBGEMM/src/EmbeddingSpMDMNBit.cc.o -c /__w/FBGEMM/FBGEMM/src/EmbeddingSpMDMNBit.cc 2025-05-07T19:51:03.3803180Z [89/608] /github/home/miniconda/envs/build_binary/bin/x86_64-conda-linux-gnu-c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_common_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/lib -O3 -DNDEBUG -std=c++20 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -mavx2 -mf16c -mfma -fopenmp -MD -MT CMakeFiles/fbgemm_gpu_tbe_common.dir/codegen/training/forward/embedding_forward_split_cpu.cpp.o -MF CMakeFiles/fbgemm_gpu_tbe_common.dir/codegen/training/forward/embedding_forward_split_cpu.cpp.o.d -o CMakeFiles/fbgemm_gpu_tbe_common.dir/codegen/training/forward/embedding_forward_split_cpu.cpp.o -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/codegen/training/forward/embedding_forward_split_cpu.cpp 2025-05-07T19:51:03.6082091Z [90/608] /github/home/miniconda/envs/build_binary/bin/x86_64-conda-linux-gnu-c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_optimizers_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/lib -O3 -DNDEBUG -std=c++20 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -mavx2 -mf16c -mfma -fopenmp -MD -MT CMakeFiles/fbgemm_gpu_tbe_optimizers.dir/gen_embedding_optimizer_rowwise_adagrad_split.cpp.o -MF CMakeFiles/fbgemm_gpu_tbe_optimizers.dir/gen_embedding_optimizer_rowwise_adagrad_split.cpp.o.d -o CMakeFiles/fbgemm_gpu_tbe_optimizers.dir/gen_embedding_optimizer_rowwise_adagrad_split.cpp.o -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_optimizer_rowwise_adagrad_split.cpp 2025-05-07T19:51:06.4950553Z [91/608] /github/home/miniconda/envs/build_binary/bin/x86_64-conda-linux-gnu-c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_forward_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/lib -O3 -DNDEBUG -std=c++20 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -mavx2 -mf16c -mfma -fopenmp -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_forward.dir/gen_embedding_forward_split_weighted_codegen_meta.cpp.o -MF CMakeFiles/fbgemm_gpu_tbe_training_forward.dir/gen_embedding_forward_split_weighted_codegen_meta.cpp.o.d -o CMakeFiles/fbgemm_gpu_tbe_training_forward.dir/gen_embedding_forward_split_weighted_codegen_meta.cpp.o -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_forward_split_weighted_codegen_meta.cpp 2025-05-07T19:51:07.7786863Z [92/608] /github/home/miniconda/envs/build_binary/bin/x86_64-conda-linux-gnu-c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_forward_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/lib -O3 -DNDEBUG -std=c++20 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -mavx2 -mf16c -mfma -fopenmp -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_forward.dir/gen_embedding_forward_split_unweighted_codegen_meta.cpp.o -MF CMakeFiles/fbgemm_gpu_tbe_training_forward.dir/gen_embedding_forward_split_unweighted_codegen_meta.cpp.o.d -o CMakeFiles/fbgemm_gpu_tbe_training_forward.dir/gen_embedding_forward_split_unweighted_codegen_meta.cpp.o -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_forward_split_unweighted_codegen_meta.cpp 2025-05-07T19:51:08.6608978Z [93/608] /github/home/miniconda/envs/build_binary/bin/x86_64-conda-linux-gnu-c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_forward_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/lib -O3 -DNDEBUG -std=c++20 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -mavx2 -mf16c -mfma -fopenmp -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_forward.dir/gen_embedding_forward_dense_weighted_codegen_meta.cpp.o -MF CMakeFiles/fbgemm_gpu_tbe_training_forward.dir/gen_embedding_forward_dense_weighted_codegen_meta.cpp.o.d -o CMakeFiles/fbgemm_gpu_tbe_training_forward.dir/gen_embedding_forward_dense_weighted_codegen_meta.cpp.o -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_forward_dense_weighted_codegen_meta.cpp 2025-05-07T19:51:10.0635668Z [94/608] /github/home/miniconda/envs/build_binary/bin/x86_64-conda-linux-gnu-c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_forward_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/lib -O3 -DNDEBUG -std=c++20 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -mavx2 -mf16c -mfma -fopenmp -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_forward.dir/gen_embedding_forward_split_pt2_cpu_wrapper.cpp.o -MF CMakeFiles/fbgemm_gpu_tbe_training_forward.dir/gen_embedding_forward_split_pt2_cpu_wrapper.cpp.o.d -o CMakeFiles/fbgemm_gpu_tbe_training_forward.dir/gen_embedding_forward_split_pt2_cpu_wrapper.cpp.o -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_forward_split_pt2_cpu_wrapper.cpp 2025-05-07T19:51:13.4399219Z [95/608] /github/home/miniconda/envs/build_binary/bin/x86_64-conda-linux-gnu-c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_forward_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/lib -O3 -DNDEBUG -std=c++20 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -mavx2 -mf16c -mfma -fopenmp -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_forward.dir/gen_embedding_forward_dense_unweighted_codegen_meta.cpp.o -MF CMakeFiles/fbgemm_gpu_tbe_training_forward.dir/gen_embedding_forward_dense_unweighted_codegen_meta.cpp.o.d -o CMakeFiles/fbgemm_gpu_tbe_training_forward.dir/gen_embedding_forward_dense_unweighted_codegen_meta.cpp.o -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_forward_dense_unweighted_codegen_meta.cpp 2025-05-07T19:51:13.7586253Z [96/608] /github/home/miniconda/envs/build_binary/bin/x86_64-conda-linux-gnu-c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_forward_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/lib -O3 -DNDEBUG -std=c++20 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -mavx2 -mf16c -mfma -fopenmp -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_forward.dir/gen_embedding_forward_ssd_weighted_codegen_meta.cpp.o -MF CMakeFiles/fbgemm_gpu_tbe_training_forward.dir/gen_embedding_forward_ssd_weighted_codegen_meta.cpp.o.d -o CMakeFiles/fbgemm_gpu_tbe_training_forward.dir/gen_embedding_forward_ssd_weighted_codegen_meta.cpp.o -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_forward_ssd_weighted_codegen_meta.cpp 2025-05-07T19:51:16.7302641Z [97/608] /github/home/miniconda/envs/build_binary/bin/x86_64-conda-linux-gnu-c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_forward_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/lib -O3 -DNDEBUG -std=c++20 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -mavx2 -mf16c -mfma -fopenmp -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_forward.dir/gen_embedding_forward_ssd_unweighted_codegen_meta.cpp.o -MF CMakeFiles/fbgemm_gpu_tbe_training_forward.dir/gen_embedding_forward_ssd_unweighted_codegen_meta.cpp.o.d -o CMakeFiles/fbgemm_gpu_tbe_training_forward.dir/gen_embedding_forward_ssd_unweighted_codegen_meta.cpp.o -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_forward_ssd_unweighted_codegen_meta.cpp 2025-05-07T19:51:17.8935939Z [98/608] /github/home/miniconda/envs/build_binary/bin/x86_64-conda-linux-gnu-c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_forward_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/lib -O3 -DNDEBUG -std=c++20 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -mavx2 -mf16c -mfma -fopenmp -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_forward.dir/gen_embedding_forward_split_weighted_vbe_codegen_meta.cpp.o -MF CMakeFiles/fbgemm_gpu_tbe_training_forward.dir/gen_embedding_forward_split_weighted_vbe_codegen_meta.cpp.o.d -o CMakeFiles/fbgemm_gpu_tbe_training_forward.dir/gen_embedding_forward_split_weighted_vbe_codegen_meta.cpp.o -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_forward_split_weighted_vbe_codegen_meta.cpp 2025-05-07T19:51:19.0199213Z [99/608] /github/home/miniconda/envs/build_binary/bin/x86_64-conda-linux-gnu-c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_forward_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/lib -O3 -DNDEBUG -std=c++20 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -mavx2 -mf16c -mfma -fopenmp -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_forward.dir/gen_embedding_forward_ssd_weighted_vbe_codegen_meta.cpp.o -MF CMakeFiles/fbgemm_gpu_tbe_training_forward.dir/gen_embedding_forward_ssd_weighted_vbe_codegen_meta.cpp.o.d -o CMakeFiles/fbgemm_gpu_tbe_training_forward.dir/gen_embedding_forward_ssd_weighted_vbe_codegen_meta.cpp.o -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_forward_ssd_weighted_vbe_codegen_meta.cpp 2025-05-07T19:51:20.2383040Z [100/608] /github/home/miniconda/envs/build_binary/bin/x86_64-conda-linux-gnu-c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_forward_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/lib -O3 -DNDEBUG -std=c++20 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -mavx2 -mf16c -mfma -fopenmp -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_forward.dir/gen_embedding_forward_split_unweighted_vbe_codegen_meta.cpp.o -MF CMakeFiles/fbgemm_gpu_tbe_training_forward.dir/gen_embedding_forward_split_unweighted_vbe_codegen_meta.cpp.o.d -o CMakeFiles/fbgemm_gpu_tbe_training_forward.dir/gen_embedding_forward_split_unweighted_vbe_codegen_meta.cpp.o -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_forward_split_unweighted_vbe_codegen_meta.cpp 2025-05-07T19:51:23.7565115Z [101/608] /github/home/miniconda/envs/build_binary/bin/x86_64-conda-linux-gnu-c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_forward_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/lib -O3 -DNDEBUG -std=c++20 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -mavx2 -mf16c -mfma -fopenmp -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_forward.dir/gen_embedding_forward_ssd_unweighted_vbe_codegen_meta.cpp.o -MF CMakeFiles/fbgemm_gpu_tbe_training_forward.dir/gen_embedding_forward_ssd_unweighted_vbe_codegen_meta.cpp.o.d -o CMakeFiles/fbgemm_gpu_tbe_training_forward.dir/gen_embedding_forward_ssd_unweighted_vbe_codegen_meta.cpp.o -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_forward_ssd_unweighted_vbe_codegen_meta.cpp 2025-05-07T19:51:33.8262278Z [102/608] /github/home/miniconda/envs/build_binary/bin/x86_64-conda-linux-gnu-c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_forward_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/lib -O3 -DNDEBUG -std=c++20 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -mavx2 -mf16c -mfma -fopenmp -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_forward.dir/gen_embedding_forward_split_pt2_cuda_wrapper.cpp.o -MF CMakeFiles/fbgemm_gpu_tbe_training_forward.dir/gen_embedding_forward_split_pt2_cuda_wrapper.cpp.o.d -o CMakeFiles/fbgemm_gpu_tbe_training_forward.dir/gen_embedding_forward_split_pt2_cuda_wrapper.cpp.o -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_forward_split_pt2_cuda_wrapper.cpp 2025-05-07T19:51:34.0779792Z [103/608] /github/home/miniconda/envs/build_binary/bin/x86_64-conda-linux-gnu-c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_forward_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/lib -O3 -DNDEBUG -std=c++20 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -mavx2 -mf16c -mfma -fopenmp -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_forward.dir/gen_embedding_forward_ssd_pt2_cuda_wrapper.cpp.o -MF CMakeFiles/fbgemm_gpu_tbe_training_forward.dir/gen_embedding_forward_ssd_pt2_cuda_wrapper.cpp.o.d -o CMakeFiles/fbgemm_gpu_tbe_training_forward.dir/gen_embedding_forward_ssd_pt2_cuda_wrapper.cpp.o -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_forward_ssd_pt2_cuda_wrapper.cpp 2025-05-07T19:51:38.1976252Z [104/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_inference_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -gencode arch=compute_90,code=sm_90 -gencode arch=compute_90a,code=sm_90a -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++20 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_tbe_inference.dir/codegen/inference/embedding_forward_quantized_split_lookup.cu.o -MF CMakeFiles/fbgemm_gpu_tbe_inference.dir/codegen/inference/embedding_forward_quantized_split_lookup.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/codegen/inference/embedding_forward_quantized_split_lookup.cu -o CMakeFiles/fbgemm_gpu_tbe_inference.dir/codegen/inference/embedding_forward_quantized_split_lookup.cu.o 2025-05-07T19:51:39.0727485Z [105/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_cache_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -gencode arch=compute_90,code=sm_90 -gencode arch=compute_90a,code=sm_90a -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++20 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_tbe_cache.dir/src/split_embeddings_cache/lfu_cache_populate_byte.cu.o -MF CMakeFiles/fbgemm_gpu_tbe_cache.dir/src/split_embeddings_cache/lfu_cache_populate_byte.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/src/split_embeddings_cache/lfu_cache_populate_byte.cu -o CMakeFiles/fbgemm_gpu_tbe_cache.dir/src/split_embeddings_cache/lfu_cache_populate_byte.cu.o 2025-05-07T19:51:40.1914241Z [106/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_cache_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -gencode arch=compute_90,code=sm_90 -gencode arch=compute_90a,code=sm_90a -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++20 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_tbe_cache.dir/src/split_embeddings_cache/lfu_cache_populate.cu.o -MF CMakeFiles/fbgemm_gpu_tbe_cache.dir/src/split_embeddings_cache/lfu_cache_populate.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/src/split_embeddings_cache/lfu_cache_populate.cu -o CMakeFiles/fbgemm_gpu_tbe_cache.dir/src/split_embeddings_cache/lfu_cache_populate.cu.o 2025-05-07T19:51:40.3681731Z [107/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_cache_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -gencode arch=compute_90,code=sm_90 -gencode arch=compute_90a,code=sm_90a -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++20 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_tbe_cache.dir/src/split_embeddings_cache/lxu_cache.cu.o -MF CMakeFiles/fbgemm_gpu_tbe_cache.dir/src/split_embeddings_cache/lxu_cache.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/src/split_embeddings_cache/lxu_cache.cu -o CMakeFiles/fbgemm_gpu_tbe_cache.dir/src/split_embeddings_cache/lxu_cache.cu.o 2025-05-07T19:51:40.3871802Z [108/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_cache_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -gencode arch=compute_90,code=sm_90 -gencode arch=compute_90a,code=sm_90a -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++20 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_tbe_cache.dir/src/split_embeddings_cache/split_embeddings_cache_ops.cu.o -MF CMakeFiles/fbgemm_gpu_tbe_cache.dir/src/split_embeddings_cache/split_embeddings_cache_ops.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/src/split_embeddings_cache/split_embeddings_cache_ops.cu -o CMakeFiles/fbgemm_gpu_tbe_cache.dir/src/split_embeddings_cache/split_embeddings_cache_ops.cu.o 2025-05-07T19:51:40.4052392Z [109/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_utils_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -gencode arch=compute_90,code=sm_90 -gencode arch=compute_90a,code=sm_90a -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++20 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -MD -MT CMakeFiles/fbgemm_gpu_tbe_utils.dir/src/split_embeddings_utils/generate_vbe_metadata.cu.o -MF CMakeFiles/fbgemm_gpu_tbe_utils.dir/src/split_embeddings_utils/generate_vbe_metadata.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/src/split_embeddings_utils/generate_vbe_metadata.cu -o CMakeFiles/fbgemm_gpu_tbe_utils.dir/src/split_embeddings_utils/generate_vbe_metadata.cu.o 2025-05-07T19:51:40.7998080Z [110/608] /github/home/miniconda/envs/build_binary/bin/x86_64-conda-linux-gnu-c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/lib -O3 -DNDEBUG -std=c++20 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -MD -MT CMakeFiles/fbgemm.dir/__w/FBGEMM/FBGEMM/src/EmbeddingSpMDM.cc.o -MF CMakeFiles/fbgemm.dir/__w/FBGEMM/FBGEMM/src/EmbeddingSpMDM.cc.o.d -o CMakeFiles/fbgemm.dir/__w/FBGEMM/FBGEMM/src/EmbeddingSpMDM.cc.o -c /__w/FBGEMM/FBGEMM/src/EmbeddingSpMDM.cc 2025-05-07T19:51:40.8455497Z [111/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_cache_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -gencode arch=compute_90,code=sm_90 -gencode arch=compute_90a,code=sm_90a -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++20 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_tbe_cache.dir/src/split_embeddings_cache/reset_weight_momentum.cu.o -MF CMakeFiles/fbgemm_gpu_tbe_cache.dir/src/split_embeddings_cache/reset_weight_momentum.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/src/split_embeddings_cache/reset_weight_momentum.cu -o CMakeFiles/fbgemm_gpu_tbe_cache.dir/src/split_embeddings_cache/reset_weight_momentum.cu.o 2025-05-07T19:51:41.1430871Z [112/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_cache_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -gencode arch=compute_90,code=sm_90 -gencode arch=compute_90a,code=sm_90a -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++20 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_tbe_cache.dir/src/split_embeddings_cache/lru_cache_populate_byte.cu.o -MF CMakeFiles/fbgemm_gpu_tbe_cache.dir/src/split_embeddings_cache/lru_cache_populate_byte.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/src/split_embeddings_cache/lru_cache_populate_byte.cu -o CMakeFiles/fbgemm_gpu_tbe_cache.dir/src/split_embeddings_cache/lru_cache_populate_byte.cu.o 2025-05-07T19:51:41.2461057Z [113/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_cache_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -gencode arch=compute_90,code=sm_90 -gencode arch=compute_90a,code=sm_90a -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++20 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_tbe_cache.dir/src/split_embeddings_cache/lru_cache_populate.cu.o -MF CMakeFiles/fbgemm_gpu_tbe_cache.dir/src/split_embeddings_cache/lru_cache_populate.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/src/split_embeddings_cache/lru_cache_populate.cu -o CMakeFiles/fbgemm_gpu_tbe_cache.dir/src/split_embeddings_cache/lru_cache_populate.cu.o 2025-05-07T19:51:41.5212195Z [114/608] : && /github/home/miniconda/envs/build_binary/bin/x86_64-conda-linux-gnu-c++ -fPIC -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/lib -O3 -DNDEBUG -Wl,-O2 -Wl,--sort-common -Wl,--as-needed -Wl,-z,relro -Wl,-z,now -Wl,--disable-new-dtags -Wl,--gc-sections -Wl,--allow-shlib-undefined -Wl,-rpath,/github/home/miniconda/envs/build_binary/lib -Wl,-rpath-link,/github/home/miniconda/envs/build_binary/lib -L/github/home/miniconda/envs/build_binary/lib -L/github/home/miniconda/envs/build_binary/targets/x86_64-linux/lib -L/github/home/miniconda/envs/build_binary/targets/x86_64-linux/lib/stubs -s -shared -Wl,-soname,fbgemm.so -o fbgemm.so CMakeFiles/fbgemm.dir/__w/FBGEMM/FBGEMM/src/EmbeddingSpMDM.cc.o CMakeFiles/fbgemm.dir/__w/FBGEMM/FBGEMM/src/EmbeddingSpMDMAutovec.cc.o CMakeFiles/fbgemm.dir/__w/FBGEMM/FBGEMM/src/EmbeddingSpMDMNBit.cc.o CMakeFiles/fbgemm.dir/__w/FBGEMM/FBGEMM/src/QuantUtils.cc.o CMakeFiles/fbgemm.dir/__w/FBGEMM/FBGEMM/src/RefImplementations.cc.o CMakeFiles/fbgemm.dir/__w/FBGEMM/FBGEMM/src/RowWiseSparseAdagradFused.cc.o CMakeFiles/fbgemm.dir/__w/FBGEMM/FBGEMM/src/SparseAdagrad.cc.o CMakeFiles/fbgemm.dir/__w/FBGEMM/FBGEMM/src/Utils.cc.o CMakeFiles/fbgemm.dir/__w/FBGEMM/FBGEMM/src/EmbeddingSpMDMAvx2.cc.o CMakeFiles/fbgemm.dir/__w/FBGEMM/FBGEMM/src/QuantUtilsAvx2.cc.o CMakeFiles/fbgemm.dir/__w/FBGEMM/FBGEMM/src/EmbeddingSpMDMAvx512.cc.o -L/lib/intel64 -L/lib/intel64_win -L/lib/win-x64 -Wl,-rpath,"\$ORIGIN" /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/lib/libc10.so /github/home/miniconda/envs/build_binary/targets/x86_64-linux/lib/libnvrtc.so /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/lib/libc10_cuda.so /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/lib/libnccl.so.2 /github/home/miniconda/envs/build_binary/targets/x86_64-linux/lib/stubs/libcuda.so asmjit.so /github/home/miniconda/envs/build_binary/lib/stubs/libnvidia-ml.so -Wl,--no-as-needed,"/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/lib/libtorch.so" -Wl,--as-needed /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/lib/libtorch.so -Wl,--no-as-needed,"/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/lib/libtorch_cpu.so" -Wl,--as-needed -Wl,--no-as-needed,"/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/lib/libtorch_cuda.so" -Wl,--as-needed /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/lib/libc10_cuda.so /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/lib/libc10.so /github/home/miniconda/envs/build_binary/targets/x86_64-linux/lib/libcudart.so && : 2025-05-07T19:51:42.4732144Z [115/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_utils_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -gencode arch=compute_90,code=sm_90 -gencode arch=compute_90a,code=sm_90a -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++20 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -MD -MT CMakeFiles/fbgemm_gpu_tbe_utils.dir/src/split_embeddings_utils/get_infos_metadata.cu.o -MF CMakeFiles/fbgemm_gpu_tbe_utils.dir/src/split_embeddings_utils/get_infos_metadata.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/src/split_embeddings_utils/get_infos_metadata.cu -o CMakeFiles/fbgemm_gpu_tbe_utils.dir/src/split_embeddings_utils/get_infos_metadata.cu.o 2025-05-07T19:51:51.9398855Z [116/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_common_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -gencode arch=compute_90,code=sm_90 -gencode arch=compute_90a,code=sm_90a -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++20 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_tbe_common.dir/codegen/utils/embedding_bounds_check_v1.cu.o -MF CMakeFiles/fbgemm_gpu_tbe_common.dir/codegen/utils/embedding_bounds_check_v1.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/codegen/utils/embedding_bounds_check_v1.cu -o CMakeFiles/fbgemm_gpu_tbe_common.dir/codegen/utils/embedding_bounds_check_v1.cu.o 2025-05-07T19:51:54.8609968Z [117/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_common_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -gencode arch=compute_90,code=sm_90 -gencode arch=compute_90a,code=sm_90a -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++20 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_tbe_common.dir/codegen/utils/embedding_bounds_check_v2.cu.o -MF CMakeFiles/fbgemm_gpu_tbe_common.dir/codegen/utils/embedding_bounds_check_v2.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/codegen/utils/embedding_bounds_check_v2.cu -o CMakeFiles/fbgemm_gpu_tbe_common.dir/codegen/utils/embedding_bounds_check_v2.cu.o 2025-05-07T19:51:55.4850479Z [118/608] : && /github/home/miniconda/envs/build_binary/bin/x86_64-conda-linux-gnu-c++ -fPIC -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/lib -O3 -DNDEBUG -Wl,-O2 -Wl,--sort-common -Wl,--as-needed -Wl,-z,relro -Wl,-z,now -Wl,--disable-new-dtags -Wl,--gc-sections -Wl,--allow-shlib-undefined -Wl,-rpath,/github/home/miniconda/envs/build_binary/lib -Wl,-rpath-link,/github/home/miniconda/envs/build_binary/lib -L/github/home/miniconda/envs/build_binary/lib -L/github/home/miniconda/envs/build_binary/targets/x86_64-linux/lib -L/github/home/miniconda/envs/build_binary/targets/x86_64-linux/lib/stubs -s -shared -Wl,-soname,fbgemm_gpu_tbe_common.so -o fbgemm_gpu_tbe_common.so CMakeFiles/fbgemm_gpu_tbe_common.dir/codegen/utils/embedding_bounds_check_host_cpu.cpp.o CMakeFiles/fbgemm_gpu_tbe_common.dir/codegen/training/forward/embedding_forward_split_cpu.cpp.o CMakeFiles/fbgemm_gpu_tbe_common.dir/codegen/training/pt2/pt2_autograd_utils.cpp.o CMakeFiles/fbgemm_gpu_tbe_common.dir/codegen/utils/embedding_bounds_check_host.cpp.o CMakeFiles/fbgemm_gpu_tbe_common.dir/codegen/utils/embedding_bounds_check_v1.cu.o CMakeFiles/fbgemm_gpu_tbe_common.dir/codegen/utils/embedding_bounds_check_v2.cu.o -L/lib/intel64 -L/lib/intel64_win -L/lib/win-x64 -Wl,-rpath,"\$ORIGIN" /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/lib/libc10.so /github/home/miniconda/envs/build_binary/targets/x86_64-linux/lib/libnvrtc.so /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/lib/libc10_cuda.so /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/lib/libnccl.so.2 /github/home/miniconda/envs/build_binary/targets/x86_64-linux/lib/stubs/libcuda.so fbgemm.so fbgemm_gpu_config.so /github/home/miniconda/envs/build_binary/lib/stubs/libnvidia-ml.so -Wl,--no-as-needed,"/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/lib/libtorch.so" -Wl,--as-needed /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/lib/libtorch.so -Wl,--no-as-needed,"/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/lib/libtorch_cpu.so" -Wl,--as-needed -Wl,--no-as-needed,"/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/lib/libtorch_cuda.so" -Wl,--as-needed /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/lib/libc10_cuda.so /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/lib/libc10.so /github/home/miniconda/envs/build_binary/targets/x86_64-linux/lib/libcudart.so -lcudadevrt -lcudart_static -lrt -lpthread -ldl -Wl,-rpath-link,/__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build && : 2025-05-07T19:51:57.6838595Z [119/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_optimizers_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -gencode arch=compute_90,code=sm_90 -gencode arch=compute_90a,code=sm_90a -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++20 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_tbe_optimizers.dir/gen_embedding_optimizer_rowwise_adagrad_split_cuda.cu.o -MF CMakeFiles/fbgemm_gpu_tbe_optimizers.dir/gen_embedding_optimizer_rowwise_adagrad_split_cuda.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_optimizer_rowwise_adagrad_split_cuda.cu -o CMakeFiles/fbgemm_gpu_tbe_optimizers.dir/gen_embedding_optimizer_rowwise_adagrad_split_cuda.cu.o 2025-05-07T19:52:00.6183722Z [120/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_cache_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -gencode arch=compute_90,code=sm_90 -gencode arch=compute_90a,code=sm_90a -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++20 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_tbe_cache.dir/src/split_embeddings_cache/lfu_cache_find.cu.o -MF CMakeFiles/fbgemm_gpu_tbe_cache.dir/src/split_embeddings_cache/lfu_cache_find.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/src/split_embeddings_cache/lfu_cache_find.cu -o CMakeFiles/fbgemm_gpu_tbe_cache.dir/src/split_embeddings_cache/lfu_cache_find.cu.o 2025-05-07T19:52:07.2136948Z [121/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_cache_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -gencode arch=compute_90,code=sm_90 -gencode arch=compute_90a,code=sm_90a -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++20 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_tbe_cache.dir/src/split_embeddings_cache/lru_cache_find.cu.o -MF CMakeFiles/fbgemm_gpu_tbe_cache.dir/src/split_embeddings_cache/lru_cache_find.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/src/split_embeddings_cache/lru_cache_find.cu -o CMakeFiles/fbgemm_gpu_tbe_cache.dir/src/split_embeddings_cache/lru_cache_find.cu.o 2025-05-07T19:52:07.7821048Z [122/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_sparse_async_cumsum_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -gencode arch=compute_90,code=sm_90 -gencode arch=compute_90a,code=sm_90a -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++20 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -MD -MT CMakeFiles/fbgemm_gpu_sparse_async_cumsum.dir/src/sparse_ops/sparse_async_cumsum.cu.o -MF CMakeFiles/fbgemm_gpu_sparse_async_cumsum.dir/src/sparse_ops/sparse_async_cumsum.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/src/sparse_ops/sparse_async_cumsum.cu -o CMakeFiles/fbgemm_gpu_sparse_async_cumsum.dir/src/sparse_ops/sparse_async_cumsum.cu.o 2025-05-07T19:52:25.0073682Z [123/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_inference_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -gencode arch=compute_90,code=sm_90 -gencode arch=compute_90a,code=sm_90a -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++20 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_tbe_inference.dir/gen_embedding_forward_quantized_split_nbit_host_unweighted_nobag_codegen_cuda.cu.o -MF CMakeFiles/fbgemm_gpu_tbe_inference.dir/gen_embedding_forward_quantized_split_nbit_host_unweighted_nobag_codegen_cuda.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_forward_quantized_split_nbit_host_unweighted_nobag_codegen_cuda.cu -o CMakeFiles/fbgemm_gpu_tbe_inference.dir/gen_embedding_forward_quantized_split_nbit_host_unweighted_nobag_codegen_cuda.cu.o 2025-05-07T19:52:31.9080984Z [124/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_inference_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -gencode arch=compute_90,code=sm_90 -gencode arch=compute_90a,code=sm_90a -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++20 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_tbe_inference.dir/gen_embedding_forward_quantized_split_nbit_host_unweighted_codegen_cuda.cu.o -MF CMakeFiles/fbgemm_gpu_tbe_inference.dir/gen_embedding_forward_quantized_split_nbit_host_unweighted_codegen_cuda.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_forward_quantized_split_nbit_host_unweighted_codegen_cuda.cu -o CMakeFiles/fbgemm_gpu_tbe_inference.dir/gen_embedding_forward_quantized_split_nbit_host_unweighted_codegen_cuda.cu.o 2025-05-07T19:52:34.5607833Z [125/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_inference_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -gencode arch=compute_90,code=sm_90 -gencode arch=compute_90a,code=sm_90a -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++20 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_tbe_inference.dir/gen_embedding_forward_quantized_split_nbit_host_weighted_codegen_cuda.cu.o -MF CMakeFiles/fbgemm_gpu_tbe_inference.dir/gen_embedding_forward_quantized_split_nbit_host_weighted_codegen_cuda.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_forward_quantized_split_nbit_host_weighted_codegen_cuda.cu -o CMakeFiles/fbgemm_gpu_tbe_inference.dir/gen_embedding_forward_quantized_split_nbit_host_weighted_codegen_cuda.cu.o 2025-05-07T19:52:40.5766506Z [126/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_optimizers_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -gencode arch=compute_90,code=sm_90 -gencode arch=compute_90a,code=sm_90a -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++20 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_tbe_optimizers.dir/gen_embedding_optimizer_rowwise_adagrad_split_kernel.cu.o -MF CMakeFiles/fbgemm_gpu_tbe_optimizers.dir/gen_embedding_optimizer_rowwise_adagrad_split_kernel.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_optimizer_rowwise_adagrad_split_kernel.cu -o CMakeFiles/fbgemm_gpu_tbe_optimizers.dir/gen_embedding_optimizer_rowwise_adagrad_split_kernel.cu.o 2025-05-07T19:52:40.7582242Z [127/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_cache_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -gencode arch=compute_90,code=sm_90 -gencode arch=compute_90a,code=sm_90a -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++20 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_tbe_cache.dir/src/split_embeddings_cache/linearize_cache_indices.cu.o -MF CMakeFiles/fbgemm_gpu_tbe_cache.dir/src/split_embeddings_cache/linearize_cache_indices.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/src/split_embeddings_cache/linearize_cache_indices.cu -o CMakeFiles/fbgemm_gpu_tbe_cache.dir/src/split_embeddings_cache/linearize_cache_indices.cu.o 2025-05-07T19:52:41.1718620Z [128/608] : && /github/home/miniconda/envs/build_binary/bin/x86_64-conda-linux-gnu-c++ -fPIC -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/lib -O3 -DNDEBUG -Wl,-O2 -Wl,--sort-common -Wl,--as-needed -Wl,-z,relro -Wl,-z,now -Wl,--disable-new-dtags -Wl,--gc-sections -Wl,--allow-shlib-undefined -Wl,-rpath,/github/home/miniconda/envs/build_binary/lib -Wl,-rpath-link,/github/home/miniconda/envs/build_binary/lib -L/github/home/miniconda/envs/build_binary/lib -L/github/home/miniconda/envs/build_binary/targets/x86_64-linux/lib -L/github/home/miniconda/envs/build_binary/targets/x86_64-linux/lib/stubs -s -shared -Wl,-soname,fbgemm_gpu_tbe_optimizers.so -o fbgemm_gpu_tbe_optimizers.so CMakeFiles/fbgemm_gpu_tbe_optimizers.dir/gen_embedding_optimizer_rowwise_adagrad_split.cpp.o CMakeFiles/fbgemm_gpu_tbe_optimizers.dir/gen_embedding_optimizer_rowwise_adagrad_split_cuda.cu.o CMakeFiles/fbgemm_gpu_tbe_optimizers.dir/gen_embedding_optimizer_rowwise_adagrad_split_kernel.cu.o -L/lib/intel64 -L/lib/intel64_win -L/lib/win-x64 -Wl,-rpath,/lib/intel64:/lib/intel64_win:/lib/win-x64:/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/lib:/github/home/miniconda/envs/build_binary/lib/stubs: /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/lib/libtorch.so /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/lib/libc10.so /github/home/miniconda/envs/build_binary/targets/x86_64-linux/lib/libnvrtc.so /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/lib/libc10_cuda.so /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/lib/libnccl.so.2 /github/home/miniconda/envs/build_binary/targets/x86_64-linux/lib/stubs/libcuda.so /github/home/miniconda/envs/build_binary/lib/stubs/libnvidia-ml.so -Wl,--no-as-needed,"/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/lib/libtorch_cpu.so" -Wl,--as-needed -Wl,--no-as-needed,"/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/lib/libtorch_cuda.so" -Wl,--as-needed /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/lib/libc10_cuda.so /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/lib/libc10.so /github/home/miniconda/envs/build_binary/targets/x86_64-linux/lib/libcudart.so -Wl,--no-as-needed,"/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/lib/libtorch.so" -Wl,--as-needed -lcudadevrt -lcudart_static -lrt -lpthread -ldl && : 2025-05-07T19:52:41.4504883Z [129/608] : && /github/home/miniconda/envs/build_binary/bin/x86_64-conda-linux-gnu-c++ -fPIC -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/lib -O3 -DNDEBUG -Wl,-O2 -Wl,--sort-common -Wl,--as-needed -Wl,-z,relro -Wl,-z,now -Wl,--disable-new-dtags -Wl,--gc-sections -Wl,--allow-shlib-undefined -Wl,-rpath,/github/home/miniconda/envs/build_binary/lib -Wl,-rpath-link,/github/home/miniconda/envs/build_binary/lib -L/github/home/miniconda/envs/build_binary/lib -L/github/home/miniconda/envs/build_binary/targets/x86_64-linux/lib -L/github/home/miniconda/envs/build_binary/targets/x86_64-linux/lib/stubs -s -shared -Wl,-soname,fbgemm_gpu_tbe_cache.so -o fbgemm_gpu_tbe_cache.so CMakeFiles/fbgemm_gpu_tbe_cache.dir/src/split_embeddings_cache/lfu_cache_populate_byte.cpp.o CMakeFiles/fbgemm_gpu_tbe_cache.dir/src/split_embeddings_cache/linearize_cache_indices.cpp.o CMakeFiles/fbgemm_gpu_tbe_cache.dir/src/split_embeddings_cache/lru_cache_populate_byte.cpp.o CMakeFiles/fbgemm_gpu_tbe_cache.dir/src/split_embeddings_cache/lxu_cache.cpp.o CMakeFiles/fbgemm_gpu_tbe_cache.dir/src/split_embeddings_cache/split_embeddings_cache_ops.cpp.o CMakeFiles/fbgemm_gpu_tbe_cache.dir/src/split_embeddings_cache/lfu_cache_find.cu.o CMakeFiles/fbgemm_gpu_tbe_cache.dir/src/split_embeddings_cache/lfu_cache_populate.cu.o CMakeFiles/fbgemm_gpu_tbe_cache.dir/src/split_embeddings_cache/lfu_cache_populate_byte.cu.o CMakeFiles/fbgemm_gpu_tbe_cache.dir/src/split_embeddings_cache/linearize_cache_indices.cu.o CMakeFiles/fbgemm_gpu_tbe_cache.dir/src/split_embeddings_cache/lru_cache_find.cu.o CMakeFiles/fbgemm_gpu_tbe_cache.dir/src/split_embeddings_cache/lru_cache_populate.cu.o CMakeFiles/fbgemm_gpu_tbe_cache.dir/src/split_embeddings_cache/lru_cache_populate_byte.cu.o CMakeFiles/fbgemm_gpu_tbe_cache.dir/src/split_embeddings_cache/lxu_cache.cu.o CMakeFiles/fbgemm_gpu_tbe_cache.dir/src/split_embeddings_cache/reset_weight_momentum.cu.o CMakeFiles/fbgemm_gpu_tbe_cache.dir/src/split_embeddings_cache/split_embeddings_cache_ops.cu.o -L/lib/intel64 -L/lib/intel64_win -L/lib/win-x64 -Wl,-rpath,/lib/intel64:/lib/intel64_win:/lib/win-x64:/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/lib:/github/home/miniconda/envs/build_binary/lib/stubs: /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/lib/libtorch.so /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/lib/libc10.so /github/home/miniconda/envs/build_binary/targets/x86_64-linux/lib/libnvrtc.so /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/lib/libc10_cuda.so /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/lib/libnccl.so.2 /github/home/miniconda/envs/build_binary/targets/x86_64-linux/lib/stubs/libcuda.so /github/home/miniconda/envs/build_binary/lib/stubs/libnvidia-ml.so -Wl,--no-as-needed,"/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/lib/libtorch_cpu.so" -Wl,--as-needed -Wl,--no-as-needed,"/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/lib/libtorch_cuda.so" -Wl,--as-needed /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/lib/libc10_cuda.so /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/lib/libc10.so /github/home/miniconda/envs/build_binary/targets/x86_64-linux/lib/libcudart.so -Wl,--no-as-needed,"/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/lib/libtorch.so" -Wl,--as-needed -lcudadevrt -lcudart_static -lrt -lpthread -ldl && : 2025-05-07T19:52:45.3767644Z [130/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_forward_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -gencode arch=compute_90,code=sm_90 -gencode arch=compute_90a,code=sm_90a -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++20 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_forward.dir/gen_embedding_forward_dense_unweighted_nobag_kernel.cu.o -MF CMakeFiles/fbgemm_gpu_tbe_training_forward.dir/gen_embedding_forward_dense_unweighted_nobag_kernel.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_forward_dense_unweighted_nobag_kernel.cu -o CMakeFiles/fbgemm_gpu_tbe_training_forward.dir/gen_embedding_forward_dense_unweighted_nobag_kernel.cu.o 2025-05-07T19:52:54.0960135Z [131/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_utils_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -gencode arch=compute_90,code=sm_90 -gencode arch=compute_90a,code=sm_90a -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++20 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -MD -MT CMakeFiles/fbgemm_gpu_tbe_utils.dir/src/split_embeddings_utils/transpose_embedding_input.cu.o -MF CMakeFiles/fbgemm_gpu_tbe_utils.dir/src/split_embeddings_utils/transpose_embedding_input.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/src/split_embeddings_utils/transpose_embedding_input.cu -o CMakeFiles/fbgemm_gpu_tbe_utils.dir/src/split_embeddings_utils/transpose_embedding_input.cu.o 2025-05-07T19:52:59.1527667Z [132/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_forward_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -gencode arch=compute_90,code=sm_90 -gencode arch=compute_90a,code=sm_90a -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++20 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_forward.dir/gen_embedding_forward_split_unweighted_nobag_kernel.cu.o -MF CMakeFiles/fbgemm_gpu_tbe_training_forward.dir/gen_embedding_forward_split_unweighted_nobag_kernel.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_forward_split_unweighted_nobag_kernel.cu -o CMakeFiles/fbgemm_gpu_tbe_training_forward.dir/gen_embedding_forward_split_unweighted_nobag_kernel.cu.o 2025-05-07T19:52:59.9554463Z [133/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_forward_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -gencode arch=compute_90,code=sm_90 -gencode arch=compute_90a,code=sm_90a -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++20 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_forward.dir/gen_embedding_forward_dense_weighted_codegen_cuda.cu.o -MF CMakeFiles/fbgemm_gpu_tbe_training_forward.dir/gen_embedding_forward_dense_weighted_codegen_cuda.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_forward_dense_weighted_codegen_cuda.cu -o CMakeFiles/fbgemm_gpu_tbe_training_forward.dir/gen_embedding_forward_dense_weighted_codegen_cuda.cu.o 2025-05-07T19:53:02.5066179Z [134/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_forward_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -gencode arch=compute_90,code=sm_90 -gencode arch=compute_90a,code=sm_90a -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++20 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_forward.dir/gen_embedding_forward_dense_unweighted_codegen_cuda.cu.o -MF CMakeFiles/fbgemm_gpu_tbe_training_forward.dir/gen_embedding_forward_dense_unweighted_codegen_cuda.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_forward_dense_unweighted_codegen_cuda.cu -o CMakeFiles/fbgemm_gpu_tbe_training_forward.dir/gen_embedding_forward_dense_unweighted_codegen_cuda.cu.o 2025-05-07T19:53:05.2472376Z [135/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_forward_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -gencode arch=compute_90,code=sm_90 -gencode arch=compute_90a,code=sm_90a -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++20 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_forward.dir/gen_embedding_forward_ssd_unweighted_nobag_kernel.cu.o -MF CMakeFiles/fbgemm_gpu_tbe_training_forward.dir/gen_embedding_forward_ssd_unweighted_nobag_kernel.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_forward_ssd_unweighted_nobag_kernel.cu -o CMakeFiles/fbgemm_gpu_tbe_training_forward.dir/gen_embedding_forward_ssd_unweighted_nobag_kernel.cu.o 2025-05-07T19:53:11.1131288Z [136/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_inference_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -gencode arch=compute_90,code=sm_90 -gencode arch=compute_90a,code=sm_90a -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++20 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_tbe_inference.dir/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_nobag_fp32_codegen_cuda.cu.o -MF CMakeFiles/fbgemm_gpu_tbe_inference.dir/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_nobag_fp32_codegen_cuda.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_nobag_fp32_codegen_cuda.cu -o CMakeFiles/fbgemm_gpu_tbe_inference.dir/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_nobag_fp32_codegen_cuda.cu.o 2025-05-07T19:53:11.1155036Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_nobag_fp32_codegen_cuda.cu(216): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:53:11.1157028Z if (output_d >= 0 && output_d < D) { 2025-05-07T19:53:11.1157595Z ^ 2025-05-07T19:53:11.1157887Z 2025-05-07T19:53:11.1158362Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:53:11.1159071Z 2025-05-07T19:53:11.1160750Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_nobag_fp32_codegen_cuda.cu(233): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:53:11.1162765Z if (output_d >= 0 && output_d < D) { 2025-05-07T19:53:11.1163363Z ^ 2025-05-07T19:53:11.1163645Z 2025-05-07T19:53:11.1165143Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_nobag_fp32_codegen_cuda.cu(243): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:53:11.1166795Z if (output_d >= 0 && output_d < D) { 2025-05-07T19:53:11.1167285Z ^ 2025-05-07T19:53:11.1167588Z 2025-05-07T19:53:11.1169416Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_nobag_fp32_codegen_cuda.cu(216): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:53:11.1171463Z if (output_d >= 0 && output_d < D) { 2025-05-07T19:53:11.1172006Z ^ 2025-05-07T19:53:11.1172322Z 2025-05-07T19:53:11.1172757Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:53:11.1173417Z 2025-05-07T19:53:11.1174983Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_nobag_fp32_codegen_cuda.cu(233): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:53:11.1176870Z if (output_d >= 0 && output_d < D) { 2025-05-07T19:53:11.1177435Z ^ 2025-05-07T19:53:11.1177731Z 2025-05-07T19:53:11.1179323Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_nobag_fp32_codegen_cuda.cu(243): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:53:11.1181556Z if (output_d >= 0 && output_d < D) { 2025-05-07T19:53:11.1182151Z ^ 2025-05-07T19:53:11.1182459Z 2025-05-07T19:53:11.1183976Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_nobag_fp32_codegen_cuda.cu(216): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:53:11.1185995Z if (output_d >= 0 && output_d < D) { 2025-05-07T19:53:11.1186534Z ^ 2025-05-07T19:53:11.1186855Z 2025-05-07T19:53:11.1187298Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:53:11.1187929Z 2025-05-07T19:53:11.1189554Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_nobag_fp32_codegen_cuda.cu(233): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:53:11.1191645Z if (output_d >= 0 && output_d < D) { 2025-05-07T19:53:11.1192240Z ^ 2025-05-07T19:53:11.1192545Z 2025-05-07T19:53:11.1194170Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_nobag_fp32_codegen_cuda.cu(243): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:53:11.1196201Z if (output_d >= 0 && output_d < D) { 2025-05-07T19:53:11.1196795Z ^ 2025-05-07T19:53:11.1197101Z 2025-05-07T19:53:11.1198741Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_nobag_fp32_codegen_cuda.cu(216): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:53:11.1200813Z if (output_d >= 0 && output_d < D) { 2025-05-07T19:53:11.1201403Z ^ 2025-05-07T19:53:11.1201705Z 2025-05-07T19:53:11.1202150Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:53:11.1202800Z 2025-05-07T19:53:11.1204434Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_nobag_fp32_codegen_cuda.cu(233): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:53:11.1206472Z if (output_d >= 0 && output_d < D) { 2025-05-07T19:53:11.1207071Z ^ 2025-05-07T19:53:11.1207380Z 2025-05-07T19:53:11.1209043Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_nobag_fp32_codegen_cuda.cu(243): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:53:11.1211268Z if (output_d >= 0 && output_d < D) { 2025-05-07T19:53:11.1211857Z ^ 2025-05-07T19:53:11.1212152Z 2025-05-07T19:53:11.1213815Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_nobag_fp32_codegen_cuda.cu(216): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:53:11.1215823Z if (output_d >= 0 && output_d < D) { 2025-05-07T19:53:11.1216340Z ^ 2025-05-07T19:53:11.1216624Z 2025-05-07T19:53:11.1217053Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:53:11.1217742Z 2025-05-07T19:53:11.1219305Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_nobag_fp32_codegen_cuda.cu(233): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:53:11.1221089Z if (output_d >= 0 && output_d < D) { 2025-05-07T19:53:11.1221685Z ^ 2025-05-07T19:53:11.1222149Z 2025-05-07T19:53:11.1223807Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_nobag_fp32_codegen_cuda.cu(243): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:53:11.1225807Z if (output_d >= 0 && output_d < D) { 2025-05-07T19:53:11.1226346Z ^ 2025-05-07T19:53:11.1226641Z 2025-05-07T19:53:18.7510600Z [137/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_inference_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -gencode arch=compute_90,code=sm_90 -gencode arch=compute_90a,code=sm_90a -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++20 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_tbe_inference.dir/gen_embedding_forward_quantized_split_nbit_kernel_weighted_fp32_codegen_cuda.cu.o -MF CMakeFiles/fbgemm_gpu_tbe_inference.dir/gen_embedding_forward_quantized_split_nbit_kernel_weighted_fp32_codegen_cuda.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_weighted_fp32_codegen_cuda.cu -o CMakeFiles/fbgemm_gpu_tbe_inference.dir/gen_embedding_forward_quantized_split_nbit_kernel_weighted_fp32_codegen_cuda.cu.o 2025-05-07T19:53:18.7530878Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_weighted_fp32_codegen_cuda.cu(266): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:53:18.7532834Z if (output_d >= 0 && output_d < D && packed_bag_store_idx < num_packed_bags) { 2025-05-07T19:53:18.7533519Z ^ 2025-05-07T19:53:18.7533824Z 2025-05-07T19:53:18.7534244Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:53:18.7534839Z 2025-05-07T19:53:18.7536296Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_weighted_fp32_codegen_cuda.cu(271): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:53:18.7538187Z if (output_d >= 0 && output_d < D) { 2025-05-07T19:53:18.7538980Z ^ 2025-05-07T19:53:18.7539227Z 2025-05-07T19:53:18.7540672Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_weighted_fp32_codegen_cuda.cu(288): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:53:18.7542439Z if (output_d >= 0 && output_d < D) { 2025-05-07T19:53:18.7543006Z ^ 2025-05-07T19:53:18.7543276Z 2025-05-07T19:53:18.7544783Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_weighted_fp32_codegen_cuda.cu(300): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:53:18.7546569Z if (output_d >= 0 && output_d < D) { 2025-05-07T19:53:18.7547070Z ^ 2025-05-07T19:53:18.7547360Z 2025-05-07T19:53:18.7548751Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_weighted_fp32_codegen_cuda.cu(266): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:53:18.7564338Z if (output_d >= 0 && output_d < D && packed_bag_store_idx < num_packed_bags) { 2025-05-07T19:53:18.7565116Z ^ 2025-05-07T19:53:18.7565436Z 2025-05-07T19:53:18.7565902Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:53:18.7566588Z 2025-05-07T19:53:18.7568179Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_weighted_fp32_codegen_cuda.cu(271): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:53:18.7570135Z if (output_d >= 0 && output_d < D) { 2025-05-07T19:53:18.7570868Z ^ 2025-05-07T19:53:18.7571146Z 2025-05-07T19:53:18.7572679Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_weighted_fp32_codegen_cuda.cu(288): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:53:18.7574693Z if (output_d >= 0 && output_d < D) { 2025-05-07T19:53:18.7575257Z ^ 2025-05-07T19:53:18.7575529Z 2025-05-07T19:53:18.7577094Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_weighted_fp32_codegen_cuda.cu(300): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:53:18.7579117Z if (output_d >= 0 && output_d < D) { 2025-05-07T19:53:18.7579675Z ^ 2025-05-07T19:53:18.7579990Z 2025-05-07T19:53:18.7581536Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_weighted_fp32_codegen_cuda.cu(266): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:53:18.7583689Z if (output_d >= 0 && output_d < D && packed_bag_store_idx < num_packed_bags) { 2025-05-07T19:53:18.7584417Z ^ 2025-05-07T19:53:18.7584732Z 2025-05-07T19:53:18.7585410Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:53:18.7586084Z 2025-05-07T19:53:18.7587660Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_weighted_fp32_codegen_cuda.cu(271): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:53:18.7589593Z if (output_d >= 0 && output_d < D) { 2025-05-07T19:53:18.7590182Z ^ 2025-05-07T19:53:18.7590472Z 2025-05-07T19:53:18.7591968Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_weighted_fp32_codegen_cuda.cu(288): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:53:18.7593908Z if (output_d >= 0 && output_d < D) { 2025-05-07T19:53:18.7594484Z ^ 2025-05-07T19:53:18.7594939Z 2025-05-07T19:53:18.7596530Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_weighted_fp32_codegen_cuda.cu(300): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:53:18.7598497Z if (output_d >= 0 && output_d < D) { 2025-05-07T19:53:18.7599045Z ^ 2025-05-07T19:53:18.7599342Z 2025-05-07T19:53:18.7600842Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_weighted_fp32_codegen_cuda.cu(266): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:53:18.7602945Z if (output_d >= 0 && output_d < D && packed_bag_store_idx < num_packed_bags) { 2025-05-07T19:53:18.7603684Z ^ 2025-05-07T19:53:18.7603997Z 2025-05-07T19:53:18.7604455Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:53:18.7605152Z 2025-05-07T19:53:18.7606684Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_weighted_fp32_codegen_cuda.cu(271): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:53:18.7608637Z if (output_d >= 0 && output_d < D) { 2025-05-07T19:53:18.7609177Z ^ 2025-05-07T19:53:18.7609468Z 2025-05-07T19:53:18.7611163Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_weighted_fp32_codegen_cuda.cu(288): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:53:18.7613133Z if (output_d >= 0 && output_d < D) { 2025-05-07T19:53:18.7613718Z ^ 2025-05-07T19:53:18.7613942Z 2025-05-07T19:53:18.7615440Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_weighted_fp32_codegen_cuda.cu(300): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:53:18.7617424Z if (output_d >= 0 && output_d < D) { 2025-05-07T19:53:18.7617964Z ^ 2025-05-07T19:53:18.7618273Z 2025-05-07T19:53:18.7619787Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_weighted_fp32_codegen_cuda.cu(266): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:53:18.7621916Z if (output_d >= 0 && output_d < D && packed_bag_store_idx < num_packed_bags) { 2025-05-07T19:53:18.7622687Z ^ 2025-05-07T19:53:18.7623003Z 2025-05-07T19:53:18.7623461Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:53:18.7624147Z 2025-05-07T19:53:18.7625694Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_weighted_fp32_codegen_cuda.cu(271): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:53:18.7627906Z if (output_d >= 0 && output_d < D) { 2025-05-07T19:53:18.7628730Z ^ 2025-05-07T19:53:18.7629026Z 2025-05-07T19:53:18.7630527Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_weighted_fp32_codegen_cuda.cu(288): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:53:18.7632403Z if (output_d >= 0 && output_d < D) { 2025-05-07T19:53:18.7632926Z ^ 2025-05-07T19:53:18.7633226Z 2025-05-07T19:53:18.7634696Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_weighted_fp32_codegen_cuda.cu(300): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:53:18.7636640Z if (output_d >= 0 && output_d < D) { 2025-05-07T19:53:18.7637184Z ^ 2025-05-07T19:53:18.7637827Z 2025-05-07T19:53:20.3579775Z [138/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_forward_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -gencode arch=compute_90,code=sm_90 -gencode arch=compute_90a,code=sm_90a -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++20 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_forward.dir/gen_embedding_forward_split_weighted_codegen_cuda.cu.o -MF CMakeFiles/fbgemm_gpu_tbe_training_forward.dir/gen_embedding_forward_split_weighted_codegen_cuda.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_forward_split_weighted_codegen_cuda.cu -o CMakeFiles/fbgemm_gpu_tbe_training_forward.dir/gen_embedding_forward_split_weighted_codegen_cuda.cu.o 2025-05-07T19:53:24.3132675Z [139/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_utils_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -gencode arch=compute_90,code=sm_90 -gencode arch=compute_90a,code=sm_90a -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++20 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -MD -MT CMakeFiles/fbgemm_gpu_tbe_utils.dir/src/split_embeddings_utils/radix_sort_pairs.cu.o -MF CMakeFiles/fbgemm_gpu_tbe_utils.dir/src/split_embeddings_utils/radix_sort_pairs.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/src/split_embeddings_utils/radix_sort_pairs.cu -o CMakeFiles/fbgemm_gpu_tbe_utils.dir/src/split_embeddings_utils/radix_sort_pairs.cu.o 2025-05-07T19:53:24.9888198Z [140/608] : && /github/home/miniconda/envs/build_binary/bin/x86_64-conda-linux-gnu-c++ -fPIC -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/lib -O3 -DNDEBUG -Wl,-O2 -Wl,--sort-common -Wl,--as-needed -Wl,-z,relro -Wl,-z,now -Wl,--disable-new-dtags -Wl,--gc-sections -Wl,--allow-shlib-undefined -Wl,-rpath,/github/home/miniconda/envs/build_binary/lib -Wl,-rpath-link,/github/home/miniconda/envs/build_binary/lib -L/github/home/miniconda/envs/build_binary/lib -L/github/home/miniconda/envs/build_binary/targets/x86_64-linux/lib -L/github/home/miniconda/envs/build_binary/targets/x86_64-linux/lib/stubs -s -shared -Wl,-soname,fbgemm_gpu_tbe_utils.so -o fbgemm_gpu_tbe_utils.so CMakeFiles/fbgemm_gpu_tbe_utils.dir/src/split_embeddings_utils/split_embeddings_utils_cpu.cpp.o CMakeFiles/fbgemm_gpu_tbe_utils.dir/src/split_embeddings_utils/split_embeddings_utils_meta.cpp.o CMakeFiles/fbgemm_gpu_tbe_utils.dir/src/split_embeddings_utils/split_embeddings_utils.cpp.o CMakeFiles/fbgemm_gpu_tbe_utils.dir/src/split_embeddings_utils/generate_vbe_metadata.cu.o CMakeFiles/fbgemm_gpu_tbe_utils.dir/src/split_embeddings_utils/get_infos_metadata.cu.o CMakeFiles/fbgemm_gpu_tbe_utils.dir/src/split_embeddings_utils/radix_sort_pairs.cu.o CMakeFiles/fbgemm_gpu_tbe_utils.dir/src/split_embeddings_utils/transpose_embedding_input.cu.o -L/lib/intel64 -L/lib/intel64_win -L/lib/win-x64 -Wl,-rpath,/lib/intel64:/lib/intel64_win:/lib/win-x64:/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/lib:/github/home/miniconda/envs/build_binary/lib/stubs: /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/lib/libtorch.so /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/lib/libc10.so /github/home/miniconda/envs/build_binary/targets/x86_64-linux/lib/libnvrtc.so /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/lib/libc10_cuda.so /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/lib/libnccl.so.2 /github/home/miniconda/envs/build_binary/targets/x86_64-linux/lib/stubs/libcuda.so /github/home/miniconda/envs/build_binary/lib/stubs/libnvidia-ml.so -Wl,--no-as-needed,"/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/lib/libtorch_cpu.so" -Wl,--as-needed -Wl,--no-as-needed,"/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/lib/libtorch_cuda.so" -Wl,--as-needed /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/lib/libc10_cuda.so /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/lib/libc10.so /github/home/miniconda/envs/build_binary/targets/x86_64-linux/lib/libcudart.so -Wl,--no-as-needed,"/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/lib/libtorch.so" -Wl,--as-needed -lcudadevrt -lcudart_static -lrt -lpthread -ldl && : 2025-05-07T19:53:25.5823946Z [141/608] : && /github/home/miniconda/envs/build_binary/bin/x86_64-conda-linux-gnu-c++ -fPIC -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/lib -O3 -DNDEBUG -Wl,-O2 -Wl,--sort-common -Wl,--as-needed -Wl,-z,relro -Wl,-z,now -Wl,--disable-new-dtags -Wl,--gc-sections -Wl,--allow-shlib-undefined -Wl,-rpath,/github/home/miniconda/envs/build_binary/lib -Wl,-rpath-link,/github/home/miniconda/envs/build_binary/lib -L/github/home/miniconda/envs/build_binary/lib -L/github/home/miniconda/envs/build_binary/targets/x86_64-linux/lib -L/github/home/miniconda/envs/build_binary/targets/x86_64-linux/lib/stubs -s -shared -Wl,-soname,fbgemm_gpu_sparse_async_cumsum.so -o fbgemm_gpu_sparse_async_cumsum.so CMakeFiles/fbgemm_gpu_sparse_async_cumsum.dir/src/sparse_ops/sparse_async_cumsum.cpp.o CMakeFiles/fbgemm_gpu_sparse_async_cumsum.dir/src/sparse_ops/sparse_async_cumsum.cu.o -L/lib/intel64 -L/lib/intel64_win -L/lib/win-x64 -Wl,-rpath,"\$ORIGIN" /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/lib/libc10.so /github/home/miniconda/envs/build_binary/targets/x86_64-linux/lib/libnvrtc.so /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/lib/libc10_cuda.so /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/lib/libnccl.so.2 /github/home/miniconda/envs/build_binary/targets/x86_64-linux/lib/stubs/libcuda.so fbgemm_gpu_tbe_utils.so /github/home/miniconda/envs/build_binary/lib/stubs/libnvidia-ml.so -Wl,--no-as-needed,"/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/lib/libtorch.so" -Wl,--as-needed /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/lib/libtorch.so -Wl,--no-as-needed,"/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/lib/libtorch_cpu.so" -Wl,--as-needed -Wl,--no-as-needed,"/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/lib/libtorch_cuda.so" -Wl,--as-needed /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/lib/libc10_cuda.so /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/lib/libc10.so /github/home/miniconda/envs/build_binary/targets/x86_64-linux/lib/libcudart.so -lcudadevrt -lcudart_static -lrt -lpthread -ldl && : 2025-05-07T19:53:26.0745843Z [142/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_inference_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -gencode arch=compute_90,code=sm_90 -gencode arch=compute_90a,code=sm_90a -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++20 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_tbe_inference.dir/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_fp32_codegen_cuda.cu.o -MF CMakeFiles/fbgemm_gpu_tbe_inference.dir/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_fp32_codegen_cuda.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_fp32_codegen_cuda.cu -o CMakeFiles/fbgemm_gpu_tbe_inference.dir/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_fp32_codegen_cuda.cu.o 2025-05-07T19:53:26.0769498Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_fp32_codegen_cuda.cu(250): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:53:26.0771888Z if (output_d >= 0 && output_d < D && packed_bag_store_idx < num_packed_bags) { 2025-05-07T19:53:26.0772785Z ^ 2025-05-07T19:53:26.0773069Z 2025-05-07T19:53:26.0773553Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:53:26.0774236Z 2025-05-07T19:53:26.0775795Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_fp32_codegen_cuda.cu(255): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:53:26.0777792Z if (output_d >= 0 && output_d < D) { 2025-05-07T19:53:26.0778363Z ^ 2025-05-07T19:53:26.0778658Z 2025-05-07T19:53:26.0780178Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_fp32_codegen_cuda.cu(272): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:53:26.0782137Z if (output_d >= 0 && output_d < D) { 2025-05-07T19:53:26.0782674Z ^ 2025-05-07T19:53:26.0782977Z 2025-05-07T19:53:26.0784488Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_fp32_codegen_cuda.cu(284): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:53:26.0786400Z if (output_d >= 0 && output_d < D) { 2025-05-07T19:53:26.0786934Z ^ 2025-05-07T19:53:26.0787207Z 2025-05-07T19:53:26.0788713Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_fp32_codegen_cuda.cu(250): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:53:26.0790910Z if (output_d >= 0 && output_d < D && packed_bag_store_idx < num_packed_bags) { 2025-05-07T19:53:26.0791655Z ^ 2025-05-07T19:53:26.0791992Z 2025-05-07T19:53:26.0792443Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:53:26.0793321Z 2025-05-07T19:53:26.0794863Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_fp32_codegen_cuda.cu(255): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:53:26.0796761Z if (output_d >= 0 && output_d < D) { 2025-05-07T19:53:26.0797312Z ^ 2025-05-07T19:53:26.0797598Z 2025-05-07T19:53:26.0799164Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_fp32_codegen_cuda.cu(272): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:53:26.0801155Z if (output_d >= 0 && output_d < D) { 2025-05-07T19:53:26.0801701Z ^ 2025-05-07T19:53:26.0801970Z 2025-05-07T19:53:26.0803740Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_fp32_codegen_cuda.cu(284): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:53:26.0805715Z if (output_d >= 0 && output_d < D) { 2025-05-07T19:53:26.0806284Z ^ 2025-05-07T19:53:26.0806561Z 2025-05-07T19:53:26.0808115Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_fp32_codegen_cuda.cu(250): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:53:26.0810212Z if (output_d >= 0 && output_d < D && packed_bag_store_idx < num_packed_bags) { 2025-05-07T19:53:26.0811101Z ^ 2025-05-07T19:53:26.0811419Z 2025-05-07T19:53:26.0811873Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:53:26.0812550Z 2025-05-07T19:53:26.0814179Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_fp32_codegen_cuda.cu(255): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:53:26.0816131Z if (output_d >= 0 && output_d < D) { 2025-05-07T19:53:26.0816718Z ^ 2025-05-07T19:53:26.0817008Z 2025-05-07T19:53:26.0818588Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_fp32_codegen_cuda.cu(272): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:53:26.0820577Z if (output_d >= 0 && output_d < D) { 2025-05-07T19:53:26.0821136Z ^ 2025-05-07T19:53:26.0821418Z 2025-05-07T19:53:26.0822997Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_fp32_codegen_cuda.cu(284): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:53:26.0825005Z if (output_d >= 0 && output_d < D) { 2025-05-07T19:53:26.0825595Z ^ 2025-05-07T19:53:26.0825872Z 2025-05-07T19:53:26.0827346Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_fp32_codegen_cuda.cu(250): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:53:26.0829786Z if (output_d >= 0 && output_d < D && packed_bag_store_idx < num_packed_bags) { 2025-05-07T19:53:26.0830524Z ^ 2025-05-07T19:53:26.0830824Z 2025-05-07T19:53:26.0831251Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:53:26.0831924Z 2025-05-07T19:53:26.0833535Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_fp32_codegen_cuda.cu(255): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:53:26.0835511Z if (output_d >= 0 && output_d < D) { 2025-05-07T19:53:26.0836099Z ^ 2025-05-07T19:53:26.0838714Z 2025-05-07T19:53:26.0840376Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_fp32_codegen_cuda.cu(272): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:53:26.0842362Z if (output_d >= 0 && output_d < D) { 2025-05-07T19:53:26.0842937Z ^ 2025-05-07T19:53:26.0843211Z 2025-05-07T19:53:26.0844754Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_fp32_codegen_cuda.cu(284): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:53:26.0846734Z if (output_d >= 0 && output_d < D) { 2025-05-07T19:53:26.0847267Z ^ 2025-05-07T19:53:26.0847558Z 2025-05-07T19:53:26.0849389Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_fp32_codegen_cuda.cu(250): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:53:26.0851734Z if (output_d >= 0 && output_d < D && packed_bag_store_idx < num_packed_bags) { 2025-05-07T19:53:26.0852500Z ^ 2025-05-07T19:53:26.0852819Z 2025-05-07T19:53:26.0853286Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:53:26.0853975Z 2025-05-07T19:53:26.0855553Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_fp32_codegen_cuda.cu(255): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:53:26.0857518Z if (output_d >= 0 && output_d < D) { 2025-05-07T19:53:26.0858092Z ^ 2025-05-07T19:53:26.0858376Z 2025-05-07T19:53:26.0859888Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_fp32_codegen_cuda.cu(272): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:53:26.0861815Z if (output_d >= 0 && output_d < D) { 2025-05-07T19:53:26.0862351Z ^ 2025-05-07T19:53:26.0862621Z 2025-05-07T19:53:26.0864114Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_fp32_codegen_cuda.cu(284): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:53:26.0866048Z if (output_d >= 0 && output_d < D) { 2025-05-07T19:53:26.0866562Z ^ 2025-05-07T19:53:26.0866871Z 2025-05-07T19:53:27.4154566Z [143/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_forward_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -gencode arch=compute_90,code=sm_90 -gencode arch=compute_90a,code=sm_90a -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++20 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_forward.dir/gen_embedding_forward_split_unweighted_codegen_cuda.cu.o -MF CMakeFiles/fbgemm_gpu_tbe_training_forward.dir/gen_embedding_forward_split_unweighted_codegen_cuda.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_forward_split_unweighted_codegen_cuda.cu -o CMakeFiles/fbgemm_gpu_tbe_training_forward.dir/gen_embedding_forward_split_unweighted_codegen_cuda.cu.o 2025-05-07T19:53:29.3352491Z [144/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_forward_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -gencode arch=compute_90,code=sm_90 -gencode arch=compute_90a,code=sm_90a -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++20 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_forward.dir/gen_embedding_forward_dense_weighted_vbe_codegen_cuda.cu.o -MF CMakeFiles/fbgemm_gpu_tbe_training_forward.dir/gen_embedding_forward_dense_weighted_vbe_codegen_cuda.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_forward_dense_weighted_vbe_codegen_cuda.cu -o CMakeFiles/fbgemm_gpu_tbe_training_forward.dir/gen_embedding_forward_dense_weighted_vbe_codegen_cuda.cu.o 2025-05-07T19:53:42.5560039Z [145/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_forward_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -gencode arch=compute_90,code=sm_90 -gencode arch=compute_90a,code=sm_90a -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++20 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_forward.dir/gen_embedding_forward_split_weighted_gwd_codegen_cuda.cu.o -MF CMakeFiles/fbgemm_gpu_tbe_training_forward.dir/gen_embedding_forward_split_weighted_gwd_codegen_cuda.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_forward_split_weighted_gwd_codegen_cuda.cu -o CMakeFiles/fbgemm_gpu_tbe_training_forward.dir/gen_embedding_forward_split_weighted_gwd_codegen_cuda.cu.o 2025-05-07T19:53:43.8033701Z [146/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_forward_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -gencode arch=compute_90,code=sm_90 -gencode arch=compute_90a,code=sm_90a -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++20 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_forward.dir/gen_embedding_forward_ssd_weighted_codegen_cuda.cu.o -MF CMakeFiles/fbgemm_gpu_tbe_training_forward.dir/gen_embedding_forward_ssd_weighted_codegen_cuda.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_forward_ssd_weighted_codegen_cuda.cu -o CMakeFiles/fbgemm_gpu_tbe_training_forward.dir/gen_embedding_forward_ssd_weighted_codegen_cuda.cu.o 2025-05-07T19:53:48.5387940Z [147/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_forward_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -gencode arch=compute_90,code=sm_90 -gencode arch=compute_90a,code=sm_90a -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++20 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_forward.dir/gen_embedding_forward_ssd_unweighted_codegen_cuda.cu.o -MF CMakeFiles/fbgemm_gpu_tbe_training_forward.dir/gen_embedding_forward_ssd_unweighted_codegen_cuda.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_forward_ssd_unweighted_codegen_cuda.cu -o CMakeFiles/fbgemm_gpu_tbe_training_forward.dir/gen_embedding_forward_ssd_unweighted_codegen_cuda.cu.o 2025-05-07T19:53:49.4793752Z [148/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_forward_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -gencode arch=compute_90,code=sm_90 -gencode arch=compute_90a,code=sm_90a -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++20 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_forward.dir/gen_embedding_forward_split_weighted_vbe_codegen_cuda.cu.o -MF CMakeFiles/fbgemm_gpu_tbe_training_forward.dir/gen_embedding_forward_split_weighted_vbe_codegen_cuda.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_forward_split_weighted_vbe_codegen_cuda.cu -o CMakeFiles/fbgemm_gpu_tbe_training_forward.dir/gen_embedding_forward_split_weighted_vbe_codegen_cuda.cu.o 2025-05-07T19:53:52.5475318Z [149/608] /github/home/miniconda/envs/build_binary/bin/x86_64-conda-linux-gnu-c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_pt2_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/lib -O3 -DNDEBUG -std=c++20 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -mavx2 -mf16c -mfma -fopenmp -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward_pt2.dir/gen_embedding_split_adagrad_pt2_autograd.cpp.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward_pt2.dir/gen_embedding_split_adagrad_pt2_autograd.cpp.o.d -o CMakeFiles/fbgemm_gpu_tbe_training_backward_pt2.dir/gen_embedding_split_adagrad_pt2_autograd.cpp.o -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_split_adagrad_pt2_autograd.cpp 2025-05-07T19:54:10.5978940Z [150/608] /github/home/miniconda/envs/build_binary/bin/x86_64-conda-linux-gnu-c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_pt2_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/lib -O3 -DNDEBUG -std=c++20 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -mavx2 -mf16c -mfma -fopenmp -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward_pt2.dir/gen_embedding_split_sgd_pt2_autograd.cpp.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward_pt2.dir/gen_embedding_split_sgd_pt2_autograd.cpp.o.d -o CMakeFiles/fbgemm_gpu_tbe_training_backward_pt2.dir/gen_embedding_split_sgd_pt2_autograd.cpp.o -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_split_sgd_pt2_autograd.cpp 2025-05-07T19:54:15.5485432Z [151/608] /github/home/miniconda/envs/build_binary/bin/x86_64-conda-linux-gnu-c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_pt2_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/lib -O3 -DNDEBUG -std=c++20 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -mavx2 -mf16c -mfma -fopenmp -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward_pt2.dir/gen_embedding_split_lamb_pt2_autograd.cpp.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward_pt2.dir/gen_embedding_split_lamb_pt2_autograd.cpp.o.d -o CMakeFiles/fbgemm_gpu_tbe_training_backward_pt2.dir/gen_embedding_split_lamb_pt2_autograd.cpp.o -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_split_lamb_pt2_autograd.cpp 2025-05-07T19:54:15.8594497Z [152/608] /github/home/miniconda/envs/build_binary/bin/x86_64-conda-linux-gnu-c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_pt2_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/lib -O3 -DNDEBUG -std=c++20 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -mavx2 -mf16c -mfma -fopenmp -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward_pt2.dir/gen_embedding_split_lars_sgd_pt2_autograd.cpp.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward_pt2.dir/gen_embedding_split_lars_sgd_pt2_autograd.cpp.o.d -o CMakeFiles/fbgemm_gpu_tbe_training_backward_pt2.dir/gen_embedding_split_lars_sgd_pt2_autograd.cpp.o -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_split_lars_sgd_pt2_autograd.cpp 2025-05-07T19:54:15.9858950Z [153/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_forward_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -gencode arch=compute_90,code=sm_90 -gencode arch=compute_90a,code=sm_90a -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++20 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_forward.dir/gen_embedding_forward_split_weighted_vbe_gwd_codegen_cuda.cu.o -MF CMakeFiles/fbgemm_gpu_tbe_training_forward.dir/gen_embedding_forward_split_weighted_vbe_gwd_codegen_cuda.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_forward_split_weighted_vbe_gwd_codegen_cuda.cu -o CMakeFiles/fbgemm_gpu_tbe_training_forward.dir/gen_embedding_forward_split_weighted_vbe_gwd_codegen_cuda.cu.o 2025-05-07T19:54:18.9312463Z [154/608] /github/home/miniconda/envs/build_binary/bin/x86_64-conda-linux-gnu-c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_pt2_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/lib -O3 -DNDEBUG -std=c++20 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -mavx2 -mf16c -mfma -fopenmp -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward_pt2.dir/gen_embedding_split_rowwise_adagrad_pt2_autograd.cpp.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward_pt2.dir/gen_embedding_split_rowwise_adagrad_pt2_autograd.cpp.o.d -o CMakeFiles/fbgemm_gpu_tbe_training_backward_pt2.dir/gen_embedding_split_rowwise_adagrad_pt2_autograd.cpp.o -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_split_rowwise_adagrad_pt2_autograd.cpp 2025-05-07T19:54:21.2456236Z [155/608] /github/home/miniconda/envs/build_binary/bin/x86_64-conda-linux-gnu-c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_pt2_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/lib -O3 -DNDEBUG -std=c++20 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -mavx2 -mf16c -mfma -fopenmp -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward_pt2.dir/gen_embedding_split_adam_pt2_autograd.cpp.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward_pt2.dir/gen_embedding_split_adam_pt2_autograd.cpp.o.d -o CMakeFiles/fbgemm_gpu_tbe_training_backward_pt2.dir/gen_embedding_split_adam_pt2_autograd.cpp.o -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_split_adam_pt2_autograd.cpp 2025-05-07T19:54:24.5566721Z [156/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_forward_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -gencode arch=compute_90,code=sm_90 -gencode arch=compute_90a,code=sm_90a -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++20 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_forward.dir/gen_embedding_forward_dense_unweighted_vbe_codegen_cuda.cu.o -MF CMakeFiles/fbgemm_gpu_tbe_training_forward.dir/gen_embedding_forward_dense_unweighted_vbe_codegen_cuda.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_forward_dense_unweighted_vbe_codegen_cuda.cu -o CMakeFiles/fbgemm_gpu_tbe_training_forward.dir/gen_embedding_forward_dense_unweighted_vbe_codegen_cuda.cu.o 2025-05-07T19:54:28.9660526Z [157/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_forward_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -gencode arch=compute_90,code=sm_90 -gencode arch=compute_90a,code=sm_90a -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++20 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_forward.dir/gen_embedding_forward_split_weighted_v2_kernel.cu.o -MF CMakeFiles/fbgemm_gpu_tbe_training_forward.dir/gen_embedding_forward_split_weighted_v2_kernel.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_forward_split_weighted_v2_kernel.cu -o CMakeFiles/fbgemm_gpu_tbe_training_forward.dir/gen_embedding_forward_split_weighted_v2_kernel.cu.o 2025-05-07T19:54:28.9683702Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_forward_split_weighted_v2_kernel.cu(895): warning #68-D: integer conversion resulted in a change of sign 2025-05-07T19:54:28.9685749Z reinterpret_cast(&lxu_cache_weights[cache_idx * max_D_cache]); 2025-05-07T19:54:28.9686674Z ^ 2025-05-07T19:54:28.9690300Z detected during instantiation of "void split_embedding_codegen_forward_weighted_v2_kernel(const emb_t *, const emb_t *, const cache_t *, const int32_t *, uint32_t, uint32_t, bool, uint32_t, fbgemm_gpu::FixedDivisor, const index_t *, const float *, const index_t *, const uint32_t *, const int64_t *, const int32_t *, output_t *) [with emb_t=float, cache_t=float, output_t=float, index_t=int32_t, USE_LXU_CACHE=false]" at line 955 2025-05-07T19:54:28.9693276Z 2025-05-07T19:54:28.9693752Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:54:28.9694417Z 2025-05-07T19:54:28.9695707Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_forward_split_weighted_v2_kernel.cu(895): warning #68-D: integer conversion resulted in a change of sign 2025-05-07T19:54:28.9697637Z reinterpret_cast(&lxu_cache_weights[cache_idx * max_D_cache]); 2025-05-07T19:54:28.9698538Z ^ 2025-05-07T19:54:28.9701966Z detected during instantiation of "void split_embedding_codegen_forward_weighted_v2_kernel(const emb_t *, const emb_t *, const cache_t *, const int32_t *, uint32_t, uint32_t, bool, uint32_t, fbgemm_gpu::FixedDivisor, const index_t *, const float *, const index_t *, const uint32_t *, const int64_t *, const int32_t *, output_t *) [with emb_t=float, cache_t=c10::Half, output_t=float, index_t=int32_t, USE_LXU_CACHE=false]" at line 1007 2025-05-07T19:54:28.9705091Z 2025-05-07T19:54:28.9706623Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_forward_split_weighted_v2_kernel.cu(895): warning #68-D: integer conversion resulted in a change of sign 2025-05-07T19:54:28.9708645Z reinterpret_cast(&lxu_cache_weights[cache_idx * max_D_cache]); 2025-05-07T19:54:28.9709519Z ^ 2025-05-07T19:54:28.9713006Z detected during instantiation of "void split_embedding_codegen_forward_weighted_v2_kernel(const emb_t *, const emb_t *, const cache_t *, const int32_t *, uint32_t, uint32_t, bool, uint32_t, fbgemm_gpu::FixedDivisor, const index_t *, const float *, const index_t *, const uint32_t *, const int64_t *, const int32_t *, output_t *) [with emb_t=c10::Half, cache_t=float, output_t=float, index_t=int32_t, USE_LXU_CACHE=false]" at line 1059 2025-05-07T19:54:28.9716500Z 2025-05-07T19:54:28.9717790Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_forward_split_weighted_v2_kernel.cu(895): warning #68-D: integer conversion resulted in a change of sign 2025-05-07T19:54:28.9719636Z reinterpret_cast(&lxu_cache_weights[cache_idx * max_D_cache]); 2025-05-07T19:54:28.9720501Z ^ 2025-05-07T19:54:28.9723841Z detected during instantiation of "void split_embedding_codegen_forward_weighted_v2_kernel(const emb_t *, const emb_t *, const cache_t *, const int32_t *, uint32_t, uint32_t, bool, uint32_t, fbgemm_gpu::FixedDivisor, const index_t *, const float *, const index_t *, const uint32_t *, const int64_t *, const int32_t *, output_t *) [with emb_t=c10::Half, cache_t=c10::Half, output_t=float, index_t=int32_t, USE_LXU_CACHE=false]" at line 1111 2025-05-07T19:54:28.9727047Z 2025-05-07T19:54:28.9728371Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_forward_split_weighted_v2_kernel.cu(895): warning #68-D: integer conversion resulted in a change of sign 2025-05-07T19:54:28.9730685Z reinterpret_cast(&lxu_cache_weights[cache_idx * max_D_cache]); 2025-05-07T19:54:28.9731588Z ^ 2025-05-07T19:54:28.9734266Z detected during instantiation of "void split_embedding_codegen_forward_weighted_v2_kernel(const emb_t *, const emb_t *, const cache_t *, const int32_t *, uint32_t, uint32_t, bool, uint32_t, fbgemm_gpu::FixedDivisor, const index_t *, const float *, const index_t *, const uint32_t *, const int64_t *, const int32_t *, output_t *) [with emb_t=float, cache_t=float, output_t=float, index_t=int64_t, USE_LXU_CACHE=false]" at line 1163 2025-05-07T19:54:28.9736707Z 2025-05-07T19:54:28.9737898Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_forward_split_weighted_v2_kernel.cu(895): warning #68-D: integer conversion resulted in a change of sign 2025-05-07T19:54:28.9739743Z reinterpret_cast(&lxu_cache_weights[cache_idx * max_D_cache]); 2025-05-07T19:54:28.9740607Z ^ 2025-05-07T19:54:28.9743929Z detected during instantiation of "void split_embedding_codegen_forward_weighted_v2_kernel(const emb_t *, const emb_t *, const cache_t *, const int32_t *, uint32_t, uint32_t, bool, uint32_t, fbgemm_gpu::FixedDivisor, const index_t *, const float *, const index_t *, const uint32_t *, const int64_t *, const int32_t *, output_t *) [with emb_t=float, cache_t=c10::Half, output_t=float, index_t=int64_t, USE_LXU_CACHE=false]" at line 1215 2025-05-07T19:54:28.9746962Z 2025-05-07T19:54:28.9748579Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_forward_split_weighted_v2_kernel.cu(895): warning #68-D: integer conversion resulted in a change of sign 2025-05-07T19:54:28.9750495Z reinterpret_cast(&lxu_cache_weights[cache_idx * max_D_cache]); 2025-05-07T19:54:28.9751374Z ^ 2025-05-07T19:54:28.9754770Z detected during instantiation of "void split_embedding_codegen_forward_weighted_v2_kernel(const emb_t *, const emb_t *, const cache_t *, const int32_t *, uint32_t, uint32_t, bool, uint32_t, fbgemm_gpu::FixedDivisor, const index_t *, const float *, const index_t *, const uint32_t *, const int64_t *, const int32_t *, output_t *) [with emb_t=c10::Half, cache_t=float, output_t=float, index_t=int64_t, USE_LXU_CACHE=false]" at line 1267 2025-05-07T19:54:28.9757988Z 2025-05-07T19:54:28.9759219Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_forward_split_weighted_v2_kernel.cu(895): warning #68-D: integer conversion resulted in a change of sign 2025-05-07T19:54:28.9761304Z reinterpret_cast(&lxu_cache_weights[cache_idx * max_D_cache]); 2025-05-07T19:54:28.9762206Z ^ 2025-05-07T19:54:28.9765612Z detected during instantiation of "void split_embedding_codegen_forward_weighted_v2_kernel(const emb_t *, const emb_t *, const cache_t *, const int32_t *, uint32_t, uint32_t, bool, uint32_t, fbgemm_gpu::FixedDivisor, const index_t *, const float *, const index_t *, const uint32_t *, const int64_t *, const int32_t *, output_t *) [with emb_t=c10::Half, cache_t=c10::Half, output_t=float, index_t=int64_t, USE_LXU_CACHE=false]" at line 1319 2025-05-07T19:54:28.9768711Z 2025-05-07T19:54:28.9769991Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_forward_split_weighted_v2_kernel.cu(895): warning #68-D: integer conversion resulted in a change of sign 2025-05-07T19:54:28.9772084Z reinterpret_cast(&lxu_cache_weights[cache_idx * max_D_cache]); 2025-05-07T19:54:28.9773009Z ^ 2025-05-07T19:54:28.9776477Z detected during instantiation of "void split_embedding_codegen_forward_weighted_v2_kernel(const emb_t *, const emb_t *, const cache_t *, const int32_t *, uint32_t, uint32_t, bool, uint32_t, fbgemm_gpu::FixedDivisor, const index_t *, const float *, const index_t *, const uint32_t *, const int64_t *, const int32_t *, output_t *) [with emb_t=float, cache_t=float, output_t=c10::Half, index_t=int32_t, USE_LXU_CACHE=false]" at line 1371 2025-05-07T19:54:28.9779682Z 2025-05-07T19:54:28.9781136Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_forward_split_weighted_v2_kernel.cu(895): warning #68-D: integer conversion resulted in a change of sign 2025-05-07T19:54:28.9783125Z reinterpret_cast(&lxu_cache_weights[cache_idx * max_D_cache]); 2025-05-07T19:54:28.9783973Z ^ 2025-05-07T19:54:28.9787273Z detected during instantiation of "void split_embedding_codegen_forward_weighted_v2_kernel(const emb_t *, const emb_t *, const cache_t *, const int32_t *, uint32_t, uint32_t, bool, uint32_t, fbgemm_gpu::FixedDivisor, const index_t *, const float *, const index_t *, const uint32_t *, const int64_t *, const int32_t *, output_t *) [with emb_t=float, cache_t=c10::Half, output_t=c10::Half, index_t=int32_t, USE_LXU_CACHE=false]" at line 1423 2025-05-07T19:54:28.9790364Z 2025-05-07T19:54:28.9791565Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_forward_split_weighted_v2_kernel.cu(895): warning #68-D: integer conversion resulted in a change of sign 2025-05-07T19:54:28.9793756Z reinterpret_cast(&lxu_cache_weights[cache_idx * max_D_cache]); 2025-05-07T19:54:28.9794655Z ^ 2025-05-07T19:54:28.9798131Z detected during instantiation of "void split_embedding_codegen_forward_weighted_v2_kernel(const emb_t *, const emb_t *, const cache_t *, const int32_t *, uint32_t, uint32_t, bool, uint32_t, fbgemm_gpu::FixedDivisor, const index_t *, const float *, const index_t *, const uint32_t *, const int64_t *, const int32_t *, output_t *) [with emb_t=c10::Half, cache_t=float, output_t=c10::Half, index_t=int32_t, USE_LXU_CACHE=false]" at line 1475 2025-05-07T19:54:28.9801368Z 2025-05-07T19:54:28.9802677Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_forward_split_weighted_v2_kernel.cu(895): warning #68-D: integer conversion resulted in a change of sign 2025-05-07T19:54:28.9804812Z reinterpret_cast(&lxu_cache_weights[cache_idx * max_D_cache]); 2025-05-07T19:54:28.9805728Z ^ 2025-05-07T19:54:28.9809196Z detected during instantiation of "void split_embedding_codegen_forward_weighted_v2_kernel(const emb_t *, const emb_t *, const cache_t *, const int32_t *, uint32_t, uint32_t, bool, uint32_t, fbgemm_gpu::FixedDivisor, const index_t *, const float *, const index_t *, const uint32_t *, const int64_t *, const int32_t *, output_t *) [with emb_t=c10::Half, cache_t=c10::Half, output_t=c10::Half, index_t=int32_t, USE_LXU_CACHE=false]" at line 1527 2025-05-07T19:54:28.9812552Z 2025-05-07T19:54:28.9813838Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_forward_split_weighted_v2_kernel.cu(895): warning #68-D: integer conversion resulted in a change of sign 2025-05-07T19:54:28.9815783Z reinterpret_cast(&lxu_cache_weights[cache_idx * max_D_cache]); 2025-05-07T19:54:28.9816681Z ^ 2025-05-07T19:54:28.9819987Z detected during instantiation of "void split_embedding_codegen_forward_weighted_v2_kernel(const emb_t *, const emb_t *, const cache_t *, const int32_t *, uint32_t, uint32_t, bool, uint32_t, fbgemm_gpu::FixedDivisor, const index_t *, const float *, const index_t *, const uint32_t *, const int64_t *, const int32_t *, output_t *) [with emb_t=float, cache_t=float, output_t=c10::Half, index_t=int64_t, USE_LXU_CACHE=false]" at line 1579 2025-05-07T19:54:28.9823203Z 2025-05-07T19:54:28.9824447Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_forward_split_weighted_v2_kernel.cu(895): warning #68-D: integer conversion resulted in a change of sign 2025-05-07T19:54:28.9826413Z reinterpret_cast(&lxu_cache_weights[cache_idx * max_D_cache]); 2025-05-07T19:54:28.9827281Z ^ 2025-05-07T19:54:28.9830897Z detected during instantiation of "void split_embedding_codegen_forward_weighted_v2_kernel(const emb_t *, const emb_t *, const cache_t *, const int32_t *, uint32_t, uint32_t, bool, uint32_t, fbgemm_gpu::FixedDivisor, const index_t *, const float *, const index_t *, const uint32_t *, const int64_t *, const int32_t *, output_t *) [with emb_t=float, cache_t=c10::Half, output_t=c10::Half, index_t=int64_t, USE_LXU_CACHE=false]" at line 1631 2025-05-07T19:54:28.9834064Z 2025-05-07T19:54:28.9835274Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_forward_split_weighted_v2_kernel.cu(895): warning #68-D: integer conversion resulted in a change of sign 2025-05-07T19:54:28.9837132Z reinterpret_cast(&lxu_cache_weights[cache_idx * max_D_cache]); 2025-05-07T19:54:28.9838344Z ^ 2025-05-07T19:54:28.9841764Z detected during instantiation of "void split_embedding_codegen_forward_weighted_v2_kernel(const emb_t *, const emb_t *, const cache_t *, const int32_t *, uint32_t, uint32_t, bool, uint32_t, fbgemm_gpu::FixedDivisor, const index_t *, const float *, const index_t *, const uint32_t *, const int64_t *, const int32_t *, output_t *) [with emb_t=c10::Half, cache_t=float, output_t=c10::Half, index_t=int64_t, USE_LXU_CACHE=false]" at line 1683 2025-05-07T19:54:28.9845025Z 2025-05-07T19:54:28.9846313Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_forward_split_weighted_v2_kernel.cu(895): warning #68-D: integer conversion resulted in a change of sign 2025-05-07T19:54:28.9848307Z reinterpret_cast(&lxu_cache_weights[cache_idx * max_D_cache]); 2025-05-07T19:54:28.9849351Z ^ 2025-05-07T19:54:28.9852825Z detected during instantiation of "void split_embedding_codegen_forward_weighted_v2_kernel(const emb_t *, const emb_t *, const cache_t *, const int32_t *, uint32_t, uint32_t, bool, uint32_t, fbgemm_gpu::FixedDivisor, const index_t *, const float *, const index_t *, const uint32_t *, const int64_t *, const int32_t *, output_t *) [with emb_t=c10::Half, cache_t=c10::Half, output_t=c10::Half, index_t=int64_t, USE_LXU_CACHE=false]" at line 1735 2025-05-07T19:54:28.9856020Z 2025-05-07T19:54:28.9857313Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_forward_split_weighted_v2_kernel.cu(895): warning #68-D: integer conversion resulted in a change of sign 2025-05-07T19:54:28.9859332Z reinterpret_cast(&lxu_cache_weights[cache_idx * max_D_cache]); 2025-05-07T19:54:28.9860247Z ^ 2025-05-07T19:54:28.9863814Z detected during instantiation of "void split_embedding_codegen_forward_weighted_v2_kernel(const emb_t *, const emb_t *, const cache_t *, const int32_t *, uint32_t, uint32_t, bool, uint32_t, fbgemm_gpu::FixedDivisor, const index_t *, const float *, const index_t *, const uint32_t *, const int64_t *, const int32_t *, output_t *) [with emb_t=float, cache_t=float, output_t=c10::BFloat16, index_t=int32_t, USE_LXU_CACHE=false]" at line 1787 2025-05-07T19:54:28.9866718Z 2025-05-07T19:54:28.9867686Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_forward_split_weighted_v2_kernel.cu(895): warning #68-D: integer conversion resulted in a change of sign 2025-05-07T19:54:28.9869138Z reinterpret_cast(&lxu_cache_weights[cache_idx * max_D_cache]); 2025-05-07T19:54:28.9869916Z ^ 2025-05-07T19:54:28.9873069Z detected during instantiation of "void split_embedding_codegen_forward_weighted_v2_kernel(const emb_t *, const emb_t *, const cache_t *, const int32_t *, uint32_t, uint32_t, bool, uint32_t, fbgemm_gpu::FixedDivisor, const index_t *, const float *, const index_t *, const uint32_t *, const int64_t *, const int32_t *, output_t *) [with emb_t=float, cache_t=c10::Half, output_t=c10::BFloat16, index_t=int32_t, USE_LXU_CACHE=false]" at line 1839 2025-05-07T19:54:28.9876244Z 2025-05-07T19:54:28.9877533Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_forward_split_weighted_v2_kernel.cu(895): warning #68-D: integer conversion resulted in a change of sign 2025-05-07T19:54:28.9879434Z reinterpret_cast(&lxu_cache_weights[cache_idx * max_D_cache]); 2025-05-07T19:54:28.9880327Z ^ 2025-05-07T19:54:28.9883952Z detected during instantiation of "void split_embedding_codegen_forward_weighted_v2_kernel(const emb_t *, const emb_t *, const cache_t *, const int32_t *, uint32_t, uint32_t, bool, uint32_t, fbgemm_gpu::FixedDivisor, const index_t *, const float *, const index_t *, const uint32_t *, const int64_t *, const int32_t *, output_t *) [with emb_t=c10::Half, cache_t=float, output_t=c10::BFloat16, index_t=int32_t, USE_LXU_CACHE=false]" at line 1891 2025-05-07T19:54:28.9887239Z 2025-05-07T19:54:28.9888465Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_forward_split_weighted_v2_kernel.cu(895): warning #68-D: integer conversion resulted in a change of sign 2025-05-07T19:54:28.9890461Z reinterpret_cast(&lxu_cache_weights[cache_idx * max_D_cache]); 2025-05-07T19:54:28.9891490Z ^ 2025-05-07T19:54:28.9894940Z detected during instantiation of "void split_embedding_codegen_forward_weighted_v2_kernel(const emb_t *, const emb_t *, const cache_t *, const int32_t *, uint32_t, uint32_t, bool, uint32_t, fbgemm_gpu::FixedDivisor, const index_t *, const float *, const index_t *, const uint32_t *, const int64_t *, const int32_t *, output_t *) [with emb_t=c10::Half, cache_t=c10::Half, output_t=c10::BFloat16, index_t=int32_t, USE_LXU_CACHE=false]" at line 1943 2025-05-07T19:54:28.9898058Z 2025-05-07T19:54:28.9899323Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_forward_split_weighted_v2_kernel.cu(895): warning #68-D: integer conversion resulted in a change of sign 2025-05-07T19:54:28.9901212Z reinterpret_cast(&lxu_cache_weights[cache_idx * max_D_cache]); 2025-05-07T19:54:28.9902123Z ^ 2025-05-07T19:54:28.9905572Z detected during instantiation of "void split_embedding_codegen_forward_weighted_v2_kernel(const emb_t *, const emb_t *, const cache_t *, const int32_t *, uint32_t, uint32_t, bool, uint32_t, fbgemm_gpu::FixedDivisor, const index_t *, const float *, const index_t *, const uint32_t *, const int64_t *, const int32_t *, output_t *) [with emb_t=float, cache_t=float, output_t=c10::BFloat16, index_t=int64_t, USE_LXU_CACHE=false]" at line 1995 2025-05-07T19:54:28.9908795Z 2025-05-07T19:54:28.9910089Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_forward_split_weighted_v2_kernel.cu(895): warning #68-D: integer conversion resulted in a change of sign 2025-05-07T19:54:28.9912020Z reinterpret_cast(&lxu_cache_weights[cache_idx * max_D_cache]); 2025-05-07T19:54:28.9912943Z ^ 2025-05-07T19:54:28.9916415Z detected during instantiation of "void split_embedding_codegen_forward_weighted_v2_kernel(const emb_t *, const emb_t *, const cache_t *, const int32_t *, uint32_t, uint32_t, bool, uint32_t, fbgemm_gpu::FixedDivisor, const index_t *, const float *, const index_t *, const uint32_t *, const int64_t *, const int32_t *, output_t *) [with emb_t=float, cache_t=c10::Half, output_t=c10::BFloat16, index_t=int64_t, USE_LXU_CACHE=false]" at line 2047 2025-05-07T19:54:28.9919735Z 2025-05-07T19:54:28.9920955Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_forward_split_weighted_v2_kernel.cu(895): warning #68-D: integer conversion resulted in a change of sign 2025-05-07T19:54:28.9922914Z reinterpret_cast(&lxu_cache_weights[cache_idx * max_D_cache]); 2025-05-07T19:54:28.9923805Z ^ 2025-05-07T19:54:28.9927469Z detected during instantiation of "void split_embedding_codegen_forward_weighted_v2_kernel(const emb_t *, const emb_t *, const cache_t *, const int32_t *, uint32_t, uint32_t, bool, uint32_t, fbgemm_gpu::FixedDivisor, const index_t *, const float *, const index_t *, const uint32_t *, const int64_t *, const int32_t *, output_t *) [with emb_t=c10::Half, cache_t=float, output_t=c10::BFloat16, index_t=int64_t, USE_LXU_CACHE=false]" at line 2099 2025-05-07T19:54:28.9931003Z 2025-05-07T19:54:28.9932279Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_forward_split_weighted_v2_kernel.cu(895): warning #68-D: integer conversion resulted in a change of sign 2025-05-07T19:54:28.9934205Z reinterpret_cast(&lxu_cache_weights[cache_idx * max_D_cache]); 2025-05-07T19:54:28.9935064Z ^ 2025-05-07T19:54:28.9938925Z detected during instantiation of "void split_embedding_codegen_forward_weighted_v2_kernel(const emb_t *, const emb_t *, const cache_t *, const int32_t *, uint32_t, uint32_t, bool, uint32_t, fbgemm_gpu::FixedDivisor, const index_t *, const float *, const index_t *, const uint32_t *, const int64_t *, const int32_t *, output_t *) [with emb_t=c10::Half, cache_t=c10::Half, output_t=c10::BFloat16, index_t=int64_t, USE_LXU_CACHE=false]" at line 2151 2025-05-07T19:54:28.9942389Z 2025-05-07T19:54:28.9943666Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_forward_split_weighted_v2_kernel.cu(895): warning #68-D: integer conversion resulted in a change of sign 2025-05-07T19:54:28.9945644Z reinterpret_cast(&lxu_cache_weights[cache_idx * max_D_cache]); 2025-05-07T19:54:28.9946538Z ^ 2025-05-07T19:54:28.9949987Z detected during instantiation of "void split_embedding_codegen_forward_weighted_v2_kernel(const emb_t *, const emb_t *, const cache_t *, const int32_t *, uint32_t, uint32_t, __nv_bool, uint32_t, fbgemm_gpu::FixedDivisor, const index_t *, const float *, const index_t *, const uint32_t *, const int64_t *, const int32_t *, output_t *) [with emb_t=float, cache_t=float, output_t=float, index_t=int32_t, USE_LXU_CACHE=false]" at line 955 2025-05-07T19:54:28.9953244Z 2025-05-07T19:54:28.9953685Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:54:28.9954361Z 2025-05-07T19:54:28.9955606Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_forward_split_weighted_v2_kernel.cu(895): warning #68-D: integer conversion resulted in a change of sign 2025-05-07T19:54:28.9957597Z reinterpret_cast(&lxu_cache_weights[cache_idx * max_D_cache]); 2025-05-07T19:54:28.9958525Z ^ 2025-05-07T19:54:28.9961906Z detected during instantiation of "void split_embedding_codegen_forward_weighted_v2_kernel(const emb_t *, const emb_t *, const cache_t *, const int32_t *, uint32_t, uint32_t, __nv_bool, uint32_t, fbgemm_gpu::FixedDivisor, const index_t *, const float *, const index_t *, const uint32_t *, const int64_t *, const int32_t *, output_t *) [with emb_t=float, cache_t=c10::Half, output_t=float, index_t=int32_t, USE_LXU_CACHE=false]" at line 1007 2025-05-07T19:54:28.9964923Z 2025-05-07T19:54:28.9966172Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_forward_split_weighted_v2_kernel.cu(895): warning #68-D: integer conversion resulted in a change of sign 2025-05-07T19:54:28.9968152Z reinterpret_cast(&lxu_cache_weights[cache_idx * max_D_cache]); 2025-05-07T19:54:28.9969054Z ^ 2025-05-07T19:54:28.9972955Z detected during instantiation of "void split_embedding_codegen_forward_weighted_v2_kernel(const emb_t *, const emb_t *, const cache_t *, const int32_t *, uint32_t, uint32_t, __nv_bool, uint32_t, fbgemm_gpu::FixedDivisor, const index_t *, const float *, const index_t *, const uint32_t *, const int64_t *, const int32_t *, output_t *) [with emb_t=c10::Half, cache_t=float, output_t=float, index_t=int32_t, USE_LXU_CACHE=false]" at line 1059 2025-05-07T19:54:28.9976107Z 2025-05-07T19:54:28.9977308Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_forward_split_weighted_v2_kernel.cu(895): warning #68-D: integer conversion resulted in a change of sign 2025-05-07T19:54:28.9979306Z reinterpret_cast(&lxu_cache_weights[cache_idx * max_D_cache]); 2025-05-07T19:54:28.9980350Z ^ 2025-05-07T19:54:28.9983871Z detected during instantiation of "void split_embedding_codegen_forward_weighted_v2_kernel(const emb_t *, const emb_t *, const cache_t *, const int32_t *, uint32_t, uint32_t, __nv_bool, uint32_t, fbgemm_gpu::FixedDivisor, const index_t *, const float *, const index_t *, const uint32_t *, const int64_t *, const int32_t *, output_t *) [with emb_t=c10::Half, cache_t=c10::Half, output_t=float, index_t=int32_t, USE_LXU_CACHE=false]" at line 1111 2025-05-07T19:54:28.9986969Z 2025-05-07T19:54:28.9988198Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_forward_split_weighted_v2_kernel.cu(895): warning #68-D: integer conversion resulted in a change of sign 2025-05-07T19:54:28.9990184Z reinterpret_cast(&lxu_cache_weights[cache_idx * max_D_cache]); 2025-05-07T19:54:28.9991081Z ^ 2025-05-07T19:54:28.9994597Z detected during instantiation of "void split_embedding_codegen_forward_weighted_v2_kernel(const emb_t *, const emb_t *, const cache_t *, const int32_t *, uint32_t, uint32_t, __nv_bool, uint32_t, fbgemm_gpu::FixedDivisor, const index_t *, const float *, const index_t *, const uint32_t *, const int64_t *, const int32_t *, output_t *) [with emb_t=float, cache_t=float, output_t=float, index_t=int64_t, USE_LXU_CACHE=false]" at line 1163 2025-05-07T19:54:28.9997767Z 2025-05-07T19:54:28.9998911Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_forward_split_weighted_v2_kernel.cu(895): warning #68-D: integer conversion resulted in a change of sign 2025-05-07T19:54:29.0000446Z reinterpret_cast(&lxu_cache_weights[cache_idx * max_D_cache]); 2025-05-07T19:54:29.0001110Z ^ 2025-05-07T19:54:29.0003787Z detected during instantiation of "void split_embedding_codegen_forward_weighted_v2_kernel(const emb_t *, const emb_t *, const cache_t *, const int32_t *, uint32_t, uint32_t, __nv_bool, uint32_t, fbgemm_gpu::FixedDivisor, const index_t *, const float *, const index_t *, const uint32_t *, const int64_t *, const int32_t *, output_t *) [with emb_t=float, cache_t=c10::Half, output_t=float, index_t=int64_t, USE_LXU_CACHE=false]" at line 1215 2025-05-07T19:54:29.0006757Z 2025-05-07T19:54:29.0008051Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_forward_split_weighted_v2_kernel.cu(895): warning #68-D: integer conversion resulted in a change of sign 2025-05-07T19:54:29.0009935Z reinterpret_cast(&lxu_cache_weights[cache_idx * max_D_cache]); 2025-05-07T19:54:29.0010985Z ^ 2025-05-07T19:54:29.0014593Z detected during instantiation of "void split_embedding_codegen_forward_weighted_v2_kernel(const emb_t *, const emb_t *, const cache_t *, const int32_t *, uint32_t, uint32_t, __nv_bool, uint32_t, fbgemm_gpu::FixedDivisor, const index_t *, const float *, const index_t *, const uint32_t *, const int64_t *, const int32_t *, output_t *) [with emb_t=c10::Half, cache_t=float, output_t=float, index_t=int64_t, USE_LXU_CACHE=false]" at line 1267 2025-05-07T19:54:29.0017611Z 2025-05-07T19:54:29.0018866Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_forward_split_weighted_v2_kernel.cu(895): warning #68-D: integer conversion resulted in a change of sign 2025-05-07T19:54:29.0020956Z reinterpret_cast(&lxu_cache_weights[cache_idx * max_D_cache]); 2025-05-07T19:54:29.0021862Z ^ 2025-05-07T19:54:29.0028990Z detected during instantiation of "void split_embedding_codegen_forward_weighted_v2_kernel(const emb_t *, const emb_t *, const cache_t *, const int32_t *, uint32_t, uint32_t, __nv_bool, uint32_t, fbgemm_gpu::FixedDivisor, const index_t *, const float *, const index_t *, const uint32_t *, const int64_t *, const int32_t *, output_t *) [with emb_t=c10::Half, cache_t=c10::Half, output_t=float, index_t=int64_t, USE_LXU_CACHE=false]" at line 1319 2025-05-07T19:54:29.0032155Z 2025-05-07T19:54:29.0033392Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_forward_split_weighted_v2_kernel.cu(895): warning #68-D: integer conversion resulted in a change of sign 2025-05-07T19:54:29.0035336Z reinterpret_cast(&lxu_cache_weights[cache_idx * max_D_cache]); 2025-05-07T19:54:29.0036239Z ^ 2025-05-07T19:54:29.0039729Z detected during instantiation of "void split_embedding_codegen_forward_weighted_v2_kernel(const emb_t *, const emb_t *, const cache_t *, const int32_t *, uint32_t, uint32_t, __nv_bool, uint32_t, fbgemm_gpu::FixedDivisor, const index_t *, const float *, const index_t *, const uint32_t *, const int64_t *, const int32_t *, output_t *) [with emb_t=float, cache_t=float, output_t=c10::Half, index_t=int32_t, USE_LXU_CACHE=false]" at line 1371 2025-05-07T19:54:29.0042873Z 2025-05-07T19:54:29.0044094Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_forward_split_weighted_v2_kernel.cu(895): warning #68-D: integer conversion resulted in a change of sign 2025-05-07T19:54:29.0046044Z reinterpret_cast(&lxu_cache_weights[cache_idx * max_D_cache]); 2025-05-07T19:54:29.0046950Z ^ 2025-05-07T19:54:29.0050538Z detected during instantiation of "void split_embedding_codegen_forward_weighted_v2_kernel(const emb_t *, const emb_t *, const cache_t *, const int32_t *, uint32_t, uint32_t, __nv_bool, uint32_t, fbgemm_gpu::FixedDivisor, const index_t *, const float *, const index_t *, const uint32_t *, const int64_t *, const int32_t *, output_t *) [with emb_t=float, cache_t=c10::Half, output_t=c10::Half, index_t=int32_t, USE_LXU_CACHE=false]" at line 1423 2025-05-07T19:54:29.0053739Z 2025-05-07T19:54:29.0055176Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_forward_split_weighted_v2_kernel.cu(895): warning #68-D: integer conversion resulted in a change of sign 2025-05-07T19:54:29.0057022Z reinterpret_cast(&lxu_cache_weights[cache_idx * max_D_cache]); 2025-05-07T19:54:29.0057915Z ^ 2025-05-07T19:54:29.0061348Z detected during instantiation of "void split_embedding_codegen_forward_weighted_v2_kernel(const emb_t *, const emb_t *, const cache_t *, const int32_t *, uint32_t, uint32_t, __nv_bool, uint32_t, fbgemm_gpu::FixedDivisor, const index_t *, const float *, const index_t *, const uint32_t *, const int64_t *, const int32_t *, output_t *) [with emb_t=c10::Half, cache_t=float, output_t=c10::Half, index_t=int32_t, USE_LXU_CACHE=false]" at line 1475 2025-05-07T19:54:29.0064391Z 2025-05-07T19:54:29.0065575Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_forward_split_weighted_v2_kernel.cu(895): warning #68-D: integer conversion resulted in a change of sign 2025-05-07T19:54:29.0067508Z reinterpret_cast(&lxu_cache_weights[cache_idx * max_D_cache]); 2025-05-07T19:54:29.0068558Z ^ 2025-05-07T19:54:29.0071869Z detected during instantiation of "void split_embedding_codegen_forward_weighted_v2_kernel(const emb_t *, const emb_t *, const cache_t *, const int32_t *, uint32_t, uint32_t, __nv_bool, uint32_t, fbgemm_gpu::FixedDivisor, const index_t *, const float *, const index_t *, const uint32_t *, const int64_t *, const int32_t *, output_t *) [with emb_t=c10::Half, cache_t=c10::Half, output_t=c10::Half, index_t=int32_t, USE_LXU_CACHE=false]" at line 1527 2025-05-07T19:54:29.0075192Z 2025-05-07T19:54:29.0076450Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_forward_split_weighted_v2_kernel.cu(895): warning #68-D: integer conversion resulted in a change of sign 2025-05-07T19:54:29.0078332Z reinterpret_cast(&lxu_cache_weights[cache_idx * max_D_cache]); 2025-05-07T19:54:29.0079221Z ^ 2025-05-07T19:54:29.0082837Z detected during instantiation of "void split_embedding_codegen_forward_weighted_v2_kernel(const emb_t *, const emb_t *, const cache_t *, const int32_t *, uint32_t, uint32_t, __nv_bool, uint32_t, fbgemm_gpu::FixedDivisor, const index_t *, const float *, const index_t *, const uint32_t *, const int64_t *, const int32_t *, output_t *) [with emb_t=float, cache_t=float, output_t=c10::Half, index_t=int64_t, USE_LXU_CACHE=false]" at line 1579 2025-05-07T19:54:29.0085975Z 2025-05-07T19:54:29.0087239Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_forward_split_weighted_v2_kernel.cu(895): warning #68-D: integer conversion resulted in a change of sign 2025-05-07T19:54:29.0089241Z reinterpret_cast(&lxu_cache_weights[cache_idx * max_D_cache]); 2025-05-07T19:54:29.0090302Z ^ 2025-05-07T19:54:29.0093698Z detected during instantiation of "void split_embedding_codegen_forward_weighted_v2_kernel(const emb_t *, const emb_t *, const cache_t *, const int32_t *, uint32_t, uint32_t, __nv_bool, uint32_t, fbgemm_gpu::FixedDivisor, const index_t *, const float *, const index_t *, const uint32_t *, const int64_t *, const int32_t *, output_t *) [with emb_t=float, cache_t=c10::Half, output_t=c10::Half, index_t=int64_t, USE_LXU_CACHE=false]" at line 1631 2025-05-07T19:54:29.0097097Z 2025-05-07T19:54:29.0098300Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_forward_split_weighted_v2_kernel.cu(895): warning #68-D: integer conversion resulted in a change of sign 2025-05-07T19:54:29.0100281Z reinterpret_cast(&lxu_cache_weights[cache_idx * max_D_cache]); 2025-05-07T19:54:29.0101176Z ^ 2025-05-07T19:54:29.0104874Z detected during instantiation of "void split_embedding_codegen_forward_weighted_v2_kernel(const emb_t *, const emb_t *, const cache_t *, const int32_t *, uint32_t, uint32_t, __nv_bool, uint32_t, fbgemm_gpu::FixedDivisor, const index_t *, const float *, const index_t *, const uint32_t *, const int64_t *, const int32_t *, output_t *) [with emb_t=c10::Half, cache_t=float, output_t=c10::Half, index_t=int64_t, USE_LXU_CACHE=false]" at line 1683 2025-05-07T19:54:29.0108036Z 2025-05-07T19:54:29.0109311Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_forward_split_weighted_v2_kernel.cu(895): warning #68-D: integer conversion resulted in a change of sign 2025-05-07T19:54:29.0111315Z reinterpret_cast(&lxu_cache_weights[cache_idx * max_D_cache]); 2025-05-07T19:54:29.0112184Z ^ 2025-05-07T19:54:29.0115485Z detected during instantiation of "void split_embedding_codegen_forward_weighted_v2_kernel(const emb_t *, const emb_t *, const cache_t *, const int32_t *, uint32_t, uint32_t, __nv_bool, uint32_t, fbgemm_gpu::FixedDivisor, const index_t *, const float *, const index_t *, const uint32_t *, const int64_t *, const int32_t *, output_t *) [with emb_t=c10::Half, cache_t=c10::Half, output_t=c10::Half, index_t=int64_t, USE_LXU_CACHE=false]" at line 1735 2025-05-07T19:54:29.0118337Z 2025-05-07T19:54:29.0119563Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_forward_split_weighted_v2_kernel.cu(895): warning #68-D: integer conversion resulted in a change of sign 2025-05-07T19:54:29.0121388Z reinterpret_cast(&lxu_cache_weights[cache_idx * max_D_cache]); 2025-05-07T19:54:29.0122241Z ^ 2025-05-07T19:54:29.0125628Z detected during instantiation of "void split_embedding_codegen_forward_weighted_v2_kernel(const emb_t *, const emb_t *, const cache_t *, const int32_t *, uint32_t, uint32_t, __nv_bool, uint32_t, fbgemm_gpu::FixedDivisor, const index_t *, const float *, const index_t *, const uint32_t *, const int64_t *, const int32_t *, output_t *) [with emb_t=float, cache_t=float, output_t=c10::BFloat16, index_t=int32_t, USE_LXU_CACHE=false]" at line 1787 2025-05-07T19:54:29.0129009Z 2025-05-07T19:54:29.0130293Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_forward_split_weighted_v2_kernel.cu(895): warning #68-D: integer conversion resulted in a change of sign 2025-05-07T19:54:29.0132230Z reinterpret_cast(&lxu_cache_weights[cache_idx * max_D_cache]); 2025-05-07T19:54:29.0133058Z ^ 2025-05-07T19:54:29.0136495Z detected during instantiation of "void split_embedding_codegen_forward_weighted_v2_kernel(const emb_t *, const emb_t *, const cache_t *, const int32_t *, uint32_t, uint32_t, __nv_bool, uint32_t, fbgemm_gpu::FixedDivisor, const index_t *, const float *, const index_t *, const uint32_t *, const int64_t *, const int32_t *, output_t *) [with emb_t=float, cache_t=c10::Half, output_t=c10::BFloat16, index_t=int32_t, USE_LXU_CACHE=false]" at line 1839 2025-05-07T19:54:29.0139717Z 2025-05-07T19:54:29.0140947Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_forward_split_weighted_v2_kernel.cu(895): warning #68-D: integer conversion resulted in a change of sign 2025-05-07T19:54:29.0142928Z reinterpret_cast(&lxu_cache_weights[cache_idx * max_D_cache]); 2025-05-07T19:54:29.0143808Z ^ 2025-05-07T19:54:29.0147973Z detected during instantiation of "void split_embedding_codegen_forward_weighted_v2_kernel(const emb_t *, const emb_t *, const cache_t *, const int32_t *, uint32_t, uint32_t, __nv_bool, uint32_t, fbgemm_gpu::FixedDivisor, const index_t *, const float *, const index_t *, const uint32_t *, const int64_t *, const int32_t *, output_t *) [with emb_t=c10::Half, cache_t=float, output_t=c10::BFloat16, index_t=int32_t, USE_LXU_CACHE=false]" at line 1891 2025-05-07T19:54:29.0151146Z 2025-05-07T19:54:29.0152443Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_forward_split_weighted_v2_kernel.cu(895): warning #68-D: integer conversion resulted in a change of sign 2025-05-07T19:54:29.0154417Z reinterpret_cast(&lxu_cache_weights[cache_idx * max_D_cache]); 2025-05-07T19:54:29.0155318Z ^ 2025-05-07T19:54:29.0158694Z detected during instantiation of "void split_embedding_codegen_forward_weighted_v2_kernel(const emb_t *, const emb_t *, const cache_t *, const int32_t *, uint32_t, uint32_t, __nv_bool, uint32_t, fbgemm_gpu::FixedDivisor, const index_t *, const float *, const index_t *, const uint32_t *, const int64_t *, const int32_t *, output_t *) [with emb_t=c10::Half, cache_t=c10::Half, output_t=c10::BFloat16, index_t=int32_t, USE_LXU_CACHE=false]" at line 1943 2025-05-07T19:54:29.0162171Z 2025-05-07T19:54:29.0163455Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_forward_split_weighted_v2_kernel.cu(895): warning #68-D: integer conversion resulted in a change of sign 2025-05-07T19:54:29.0165578Z reinterpret_cast(&lxu_cache_weights[cache_idx * max_D_cache]); 2025-05-07T19:54:29.0166494Z ^ 2025-05-07T19:54:29.0169932Z detected during instantiation of "void split_embedding_codegen_forward_weighted_v2_kernel(const emb_t *, const emb_t *, const cache_t *, const int32_t *, uint32_t, uint32_t, __nv_bool, uint32_t, fbgemm_gpu::FixedDivisor, const index_t *, const float *, const index_t *, const uint32_t *, const int64_t *, const int32_t *, output_t *) [with emb_t=float, cache_t=float, output_t=c10::BFloat16, index_t=int64_t, USE_LXU_CACHE=false]" at line 1995 2025-05-07T19:54:29.0172933Z 2025-05-07T19:54:29.0174103Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_forward_split_weighted_v2_kernel.cu(895): warning #68-D: integer conversion resulted in a change of sign 2025-05-07T19:54:29.0175977Z reinterpret_cast(&lxu_cache_weights[cache_idx * max_D_cache]); 2025-05-07T19:54:29.0176864Z ^ 2025-05-07T19:54:29.0180142Z detected during instantiation of "void split_embedding_codegen_forward_weighted_v2_kernel(const emb_t *, const emb_t *, const cache_t *, const int32_t *, uint32_t, uint32_t, __nv_bool, uint32_t, fbgemm_gpu::FixedDivisor, const index_t *, const float *, const index_t *, const uint32_t *, const int64_t *, const int32_t *, output_t *) [with emb_t=float, cache_t=c10::Half, output_t=c10::BFloat16, index_t=int64_t, USE_LXU_CACHE=false]" at line 2047 2025-05-07T19:54:29.0183265Z 2025-05-07T19:54:29.0184438Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_forward_split_weighted_v2_kernel.cu(895): warning #68-D: integer conversion resulted in a change of sign 2025-05-07T19:54:29.0186346Z reinterpret_cast(&lxu_cache_weights[cache_idx * max_D_cache]); 2025-05-07T19:54:29.0187397Z ^ 2025-05-07T19:54:29.0191123Z detected during instantiation of "void split_embedding_codegen_forward_weighted_v2_kernel(const emb_t *, const emb_t *, const cache_t *, const int32_t *, uint32_t, uint32_t, __nv_bool, uint32_t, fbgemm_gpu::FixedDivisor, const index_t *, const float *, const index_t *, const uint32_t *, const int64_t *, const int32_t *, output_t *) [with emb_t=c10::Half, cache_t=float, output_t=c10::BFloat16, index_t=int64_t, USE_LXU_CACHE=false]" at line 2099 2025-05-07T19:54:29.0194415Z 2025-05-07T19:54:29.0195675Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_forward_split_weighted_v2_kernel.cu(895): warning #68-D: integer conversion resulted in a change of sign 2025-05-07T19:54:29.0197589Z reinterpret_cast(&lxu_cache_weights[cache_idx * max_D_cache]); 2025-05-07T19:54:29.0198507Z ^ 2025-05-07T19:54:29.0202041Z detected during instantiation of "void split_embedding_codegen_forward_weighted_v2_kernel(const emb_t *, const emb_t *, const cache_t *, const int32_t *, uint32_t, uint32_t, __nv_bool, uint32_t, fbgemm_gpu::FixedDivisor, const index_t *, const float *, const index_t *, const uint32_t *, const int64_t *, const int32_t *, output_t *) [with emb_t=c10::Half, cache_t=c10::Half, output_t=c10::BFloat16, index_t=int64_t, USE_LXU_CACHE=false]" at line 2151 2025-05-07T19:54:29.0205505Z 2025-05-07T19:54:29.0206763Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_forward_split_weighted_v2_kernel.cu(895): warning #68-D: integer conversion resulted in a change of sign 2025-05-07T19:54:29.0208740Z reinterpret_cast(&lxu_cache_weights[cache_idx * max_D_cache]); 2025-05-07T19:54:29.0209644Z ^ 2025-05-07T19:54:29.0213204Z detected during instantiation of "void split_embedding_codegen_forward_weighted_v2_kernel(const emb_t *, const emb_t *, const cache_t *, const int32_t *, uint32_t, uint32_t, __nv_bool, uint32_t, fbgemm_gpu::FixedDivisor, const index_t *, const float *, const index_t *, const uint32_t *, const int64_t *, const int32_t *, output_t *) [with emb_t=float, cache_t=float, output_t=float, index_t=int32_t, USE_LXU_CACHE=false]" at line 955 2025-05-07T19:54:29.0216308Z 2025-05-07T19:54:29.0216765Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:54:29.0217445Z 2025-05-07T19:54:29.0218758Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_forward_split_weighted_v2_kernel.cu(895): warning #68-D: integer conversion resulted in a change of sign 2025-05-07T19:54:29.0220705Z reinterpret_cast(&lxu_cache_weights[cache_idx * max_D_cache]); 2025-05-07T19:54:29.0221621Z ^ 2025-05-07T19:54:29.0224964Z detected during instantiation of "void split_embedding_codegen_forward_weighted_v2_kernel(const emb_t *, const emb_t *, const cache_t *, const int32_t *, uint32_t, uint32_t, __nv_bool, uint32_t, fbgemm_gpu::FixedDivisor, const index_t *, const float *, const index_t *, const uint32_t *, const int64_t *, const int32_t *, output_t *) [with emb_t=float, cache_t=c10::Half, output_t=float, index_t=int32_t, USE_LXU_CACHE=false]" at line 1007 2025-05-07T19:54:29.0228243Z 2025-05-07T19:54:29.0229557Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_forward_split_weighted_v2_kernel.cu(895): warning #68-D: integer conversion resulted in a change of sign 2025-05-07T19:54:29.0231256Z reinterpret_cast(&lxu_cache_weights[cache_idx * max_D_cache]); 2025-05-07T19:54:29.0232130Z ^ 2025-05-07T19:54:29.0235881Z detected during instantiation of "void split_embedding_codegen_forward_weighted_v2_kernel(const emb_t *, const emb_t *, const cache_t *, const int32_t *, uint32_t, uint32_t, __nv_bool, uint32_t, fbgemm_gpu::FixedDivisor, const index_t *, const float *, const index_t *, const uint32_t *, const int64_t *, const int32_t *, output_t *) [with emb_t=c10::Half, cache_t=float, output_t=float, index_t=int32_t, USE_LXU_CACHE=false]" at line 1059 2025-05-07T19:54:29.0238987Z 2025-05-07T19:54:29.0240192Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_forward_split_weighted_v2_kernel.cu(895): warning #68-D: integer conversion resulted in a change of sign 2025-05-07T19:54:29.0242285Z reinterpret_cast(&lxu_cache_weights[cache_idx * max_D_cache]); 2025-05-07T19:54:29.0243185Z ^ 2025-05-07T19:54:29.0246614Z detected during instantiation of "void split_embedding_codegen_forward_weighted_v2_kernel(const emb_t *, const emb_t *, const cache_t *, const int32_t *, uint32_t, uint32_t, __nv_bool, uint32_t, fbgemm_gpu::FixedDivisor, const index_t *, const float *, const index_t *, const uint32_t *, const int64_t *, const int32_t *, output_t *) [with emb_t=c10::Half, cache_t=c10::Half, output_t=float, index_t=int32_t, USE_LXU_CACHE=false]" at line 1111 2025-05-07T19:54:29.0250027Z 2025-05-07T19:54:29.0251508Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_forward_split_weighted_v2_kernel.cu(895): warning #68-D: integer conversion resulted in a change of sign 2025-05-07T19:54:29.0253500Z reinterpret_cast(&lxu_cache_weights[cache_idx * max_D_cache]); 2025-05-07T19:54:29.0254629Z ^ 2025-05-07T19:54:29.0258152Z detected during instantiation of "void split_embedding_codegen_forward_weighted_v2_kernel(const emb_t *, const emb_t *, const cache_t *, const int32_t *, uint32_t, uint32_t, __nv_bool, uint32_t, fbgemm_gpu::FixedDivisor, const index_t *, const float *, const index_t *, const uint32_t *, const int64_t *, const int32_t *, output_t *) [with emb_t=float, cache_t=float, output_t=float, index_t=int64_t, USE_LXU_CACHE=false]" at line 1163 2025-05-07T19:54:29.0261392Z 2025-05-07T19:54:29.0262696Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_forward_split_weighted_v2_kernel.cu(895): warning #68-D: integer conversion resulted in a change of sign 2025-05-07T19:54:29.0264705Z reinterpret_cast(&lxu_cache_weights[cache_idx * max_D_cache]); 2025-05-07T19:54:29.0265643Z ^ 2025-05-07T19:54:29.0269137Z detected during instantiation of "void split_embedding_codegen_forward_weighted_v2_kernel(const emb_t *, const emb_t *, const cache_t *, const int32_t *, uint32_t, uint32_t, __nv_bool, uint32_t, fbgemm_gpu::FixedDivisor, const index_t *, const float *, const index_t *, const uint32_t *, const int64_t *, const int32_t *, output_t *) [with emb_t=float, cache_t=c10::Half, output_t=float, index_t=int64_t, USE_LXU_CACHE=false]" at line 1215 2025-05-07T19:54:29.0272410Z 2025-05-07T19:54:29.0273637Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_forward_split_weighted_v2_kernel.cu(895): warning #68-D: integer conversion resulted in a change of sign 2025-05-07T19:54:29.0275784Z reinterpret_cast(&lxu_cache_weights[cache_idx * max_D_cache]); 2025-05-07T19:54:29.0276683Z ^ 2025-05-07T19:54:29.0280447Z detected during instantiation of "void split_embedding_codegen_forward_weighted_v2_kernel(const emb_t *, const emb_t *, const cache_t *, const int32_t *, uint32_t, uint32_t, __nv_bool, uint32_t, fbgemm_gpu::FixedDivisor, const index_t *, const float *, const index_t *, const uint32_t *, const int64_t *, const int32_t *, output_t *) [with emb_t=c10::Half, cache_t=float, output_t=float, index_t=int64_t, USE_LXU_CACHE=false]" at line 1267 2025-05-07T19:54:29.0283504Z 2025-05-07T19:54:29.0284780Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_forward_split_weighted_v2_kernel.cu(895): warning #68-D: integer conversion resulted in a change of sign 2025-05-07T19:54:29.0286441Z reinterpret_cast(&lxu_cache_weights[cache_idx * max_D_cache]); 2025-05-07T19:54:29.0287226Z ^ 2025-05-07T19:54:29.0290675Z detected during instantiation of "void split_embedding_codegen_forward_weighted_v2_kernel(const emb_t *, const emb_t *, const cache_t *, const int32_t *, uint32_t, uint32_t, __nv_bool, uint32_t, fbgemm_gpu::FixedDivisor, const index_t *, const float *, const index_t *, const uint32_t *, const int64_t *, const int32_t *, output_t *) [with emb_t=c10::Half, cache_t=c10::Half, output_t=float, index_t=int64_t, USE_LXU_CACHE=false]" at line 1319 2025-05-07T19:54:29.0294053Z 2025-05-07T19:54:29.0295297Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_forward_split_weighted_v2_kernel.cu(895): warning #68-D: integer conversion resulted in a change of sign 2025-05-07T19:54:29.0297178Z reinterpret_cast(&lxu_cache_weights[cache_idx * max_D_cache]); 2025-05-07T19:54:29.0298051Z ^ 2025-05-07T19:54:29.0301384Z detected during instantiation of "void split_embedding_codegen_forward_weighted_v2_kernel(const emb_t *, const emb_t *, const cache_t *, const int32_t *, uint32_t, uint32_t, __nv_bool, uint32_t, fbgemm_gpu::FixedDivisor, const index_t *, const float *, const index_t *, const uint32_t *, const int64_t *, const int32_t *, output_t *) [with emb_t=float, cache_t=float, output_t=c10::Half, index_t=int32_t, USE_LXU_CACHE=false]" at line 1371 2025-05-07T19:54:29.0304584Z 2025-05-07T19:54:29.0305767Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_forward_split_weighted_v2_kernel.cu(895): warning #68-D: integer conversion resulted in a change of sign 2025-05-07T19:54:29.0307722Z reinterpret_cast(&lxu_cache_weights[cache_idx * max_D_cache]); 2025-05-07T19:54:29.0308592Z ^ 2025-05-07T19:54:29.0312004Z detected during instantiation of "void split_embedding_codegen_forward_weighted_v2_kernel(const emb_t *, const emb_t *, const cache_t *, const int32_t *, uint32_t, uint32_t, __nv_bool, uint32_t, fbgemm_gpu::FixedDivisor, const index_t *, const float *, const index_t *, const uint32_t *, const int64_t *, const int32_t *, output_t *) [with emb_t=float, cache_t=c10::Half, output_t=c10::Half, index_t=int32_t, USE_LXU_CACHE=false]" at line 1423 2025-05-07T19:54:29.0315253Z 2025-05-07T19:54:29.0316544Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_forward_split_weighted_v2_kernel.cu(895): warning #68-D: integer conversion resulted in a change of sign 2025-05-07T19:54:29.0318500Z reinterpret_cast(&lxu_cache_weights[cache_idx * max_D_cache]); 2025-05-07T19:54:29.0319408Z ^ 2025-05-07T19:54:29.0323133Z detected during instantiation of "void split_embedding_codegen_forward_weighted_v2_kernel(const emb_t *, const emb_t *, const cache_t *, const int32_t *, uint32_t, uint32_t, __nv_bool, uint32_t, fbgemm_gpu::FixedDivisor, const index_t *, const float *, const index_t *, const uint32_t *, const int64_t *, const int32_t *, output_t *) [with emb_t=c10::Half, cache_t=float, output_t=c10::Half, index_t=int32_t, USE_LXU_CACHE=false]" at line 1475 2025-05-07T19:54:29.0326486Z 2025-05-07T19:54:29.0327784Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_forward_split_weighted_v2_kernel.cu(895): warning #68-D: integer conversion resulted in a change of sign 2025-05-07T19:54:29.0330020Z reinterpret_cast(&lxu_cache_weights[cache_idx * max_D_cache]); 2025-05-07T19:54:29.0331001Z ^ 2025-05-07T19:54:29.0334395Z detected during instantiation of "void split_embedding_codegen_forward_weighted_v2_kernel(const emb_t *, const emb_t *, const cache_t *, const int32_t *, uint32_t, uint32_t, __nv_bool, uint32_t, fbgemm_gpu::FixedDivisor, const index_t *, const float *, const index_t *, const uint32_t *, const int64_t *, const int32_t *, output_t *) [with emb_t=c10::Half, cache_t=c10::Half, output_t=c10::Half, index_t=int32_t, USE_LXU_CACHE=false]" at line 1527 2025-05-07T19:54:29.0337891Z 2025-05-07T19:54:29.0339104Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_forward_split_weighted_v2_kernel.cu(895): warning #68-D: integer conversion resulted in a change of sign 2025-05-07T19:54:29.0341030Z reinterpret_cast(&lxu_cache_weights[cache_idx * max_D_cache]); 2025-05-07T19:54:29.0341921Z ^ 2025-05-07T19:54:29.0344918Z detected during instantiation of "void split_embedding_codegen_forward_weighted_v2_kernel(const emb_t *, const emb_t *, const cache_t *, const int32_t *, uint32_t, uint32_t, __nv_bool, uint32_t, fbgemm_gpu::FixedDivisor, const index_t *, const float *, const index_t *, const uint32_t *, const int64_t *, const int32_t *, output_t *) [with emb_t=float, cache_t=float, output_t=c10::Half, index_t=int64_t, USE_LXU_CACHE=false]" at line 1579 2025-05-07T19:54:29.0347966Z 2025-05-07T19:54:29.0349203Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_forward_split_weighted_v2_kernel.cu(895): warning #68-D: integer conversion resulted in a change of sign 2025-05-07T19:54:29.0351306Z reinterpret_cast(&lxu_cache_weights[cache_idx * max_D_cache]); 2025-05-07T19:54:29.0352147Z ^ 2025-05-07T19:54:29.0355509Z detected during instantiation of "void split_embedding_codegen_forward_weighted_v2_kernel(const emb_t *, const emb_t *, const cache_t *, const int32_t *, uint32_t, uint32_t, __nv_bool, uint32_t, fbgemm_gpu::FixedDivisor, const index_t *, const float *, const index_t *, const uint32_t *, const int64_t *, const int32_t *, output_t *) [with emb_t=float, cache_t=c10::Half, output_t=c10::Half, index_t=int64_t, USE_LXU_CACHE=false]" at line 1631 2025-05-07T19:54:29.0358680Z 2025-05-07T19:54:29.0359880Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_forward_split_weighted_v2_kernel.cu(895): warning #68-D: integer conversion resulted in a change of sign 2025-05-07T19:54:29.0361866Z reinterpret_cast(&lxu_cache_weights[cache_idx * max_D_cache]); 2025-05-07T19:54:29.0362943Z ^ 2025-05-07T19:54:29.0366283Z detected during instantiation of "void split_embedding_codegen_forward_weighted_v2_kernel(const emb_t *, const emb_t *, const cache_t *, const int32_t *, uint32_t, uint32_t, __nv_bool, uint32_t, fbgemm_gpu::FixedDivisor, const index_t *, const float *, const index_t *, const uint32_t *, const int64_t *, const int32_t *, output_t *) [with emb_t=c10::Half, cache_t=float, output_t=c10::Half, index_t=int64_t, USE_LXU_CACHE=false]" at line 1683 2025-05-07T19:54:29.0369990Z 2025-05-07T19:54:29.0371403Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_forward_split_weighted_v2_kernel.cu(895): warning #68-D: integer conversion resulted in a change of sign 2025-05-07T19:54:29.0373382Z reinterpret_cast(&lxu_cache_weights[cache_idx * max_D_cache]); 2025-05-07T19:54:29.0374270Z ^ 2025-05-07T19:54:29.0377771Z detected during instantiation of "void split_embedding_codegen_forward_weighted_v2_kernel(const emb_t *, const emb_t *, const cache_t *, const int32_t *, uint32_t, uint32_t, __nv_bool, uint32_t, fbgemm_gpu::FixedDivisor, const index_t *, const float *, const index_t *, const uint32_t *, const int64_t *, const int32_t *, output_t *) [with emb_t=c10::Half, cache_t=c10::Half, output_t=c10::Half, index_t=int64_t, USE_LXU_CACHE=false]" at line 1735 2025-05-07T19:54:29.0381211Z 2025-05-07T19:54:29.0382441Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_forward_split_weighted_v2_kernel.cu(895): warning #68-D: integer conversion resulted in a change of sign 2025-05-07T19:54:29.0384569Z reinterpret_cast(&lxu_cache_weights[cache_idx * max_D_cache]); 2025-05-07T19:54:29.0385449Z ^ 2025-05-07T19:54:29.0388838Z detected during instantiation of "void split_embedding_codegen_forward_weighted_v2_kernel(const emb_t *, const emb_t *, const cache_t *, const int32_t *, uint32_t, uint32_t, __nv_bool, uint32_t, fbgemm_gpu::FixedDivisor, const index_t *, const float *, const index_t *, const uint32_t *, const int64_t *, const int32_t *, output_t *) [with emb_t=float, cache_t=float, output_t=c10::BFloat16, index_t=int32_t, USE_LXU_CACHE=false]" at line 1787 2025-05-07T19:54:29.0392019Z 2025-05-07T19:54:29.0393308Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_forward_split_weighted_v2_kernel.cu(895): warning #68-D: integer conversion resulted in a change of sign 2025-05-07T19:54:29.0395207Z reinterpret_cast(&lxu_cache_weights[cache_idx * max_D_cache]); 2025-05-07T19:54:29.0396045Z ^ 2025-05-07T19:54:29.0399200Z detected during instantiation of "void split_embedding_codegen_forward_weighted_v2_kernel(const emb_t *, const emb_t *, const cache_t *, const int32_t *, uint32_t, uint32_t, __nv_bool, uint32_t, fbgemm_gpu::FixedDivisor, const index_t *, const float *, const index_t *, const uint32_t *, const int64_t *, const int32_t *, output_t *) [with emb_t=float, cache_t=c10::Half, output_t=c10::BFloat16, index_t=int32_t, USE_LXU_CACHE=false]" at line 1839 2025-05-07T19:54:29.0402184Z 2025-05-07T19:54:29.0403414Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_forward_split_weighted_v2_kernel.cu(895): warning #68-D: integer conversion resulted in a change of sign 2025-05-07T19:54:29.0405373Z reinterpret_cast(&lxu_cache_weights[cache_idx * max_D_cache]); 2025-05-07T19:54:29.0406243Z ^ 2025-05-07T19:54:29.0409605Z detected during instantiation of "void split_embedding_codegen_forward_weighted_v2_kernel(const emb_t *, const emb_t *, const cache_t *, const int32_t *, uint32_t, uint32_t, __nv_bool, uint32_t, fbgemm_gpu::FixedDivisor, const index_t *, const float *, const index_t *, const uint32_t *, const int64_t *, const int32_t *, output_t *) [with emb_t=c10::Half, cache_t=float, output_t=c10::BFloat16, index_t=int32_t, USE_LXU_CACHE=false]" at line 1891 2025-05-07T19:54:29.0412828Z 2025-05-07T19:54:29.0414383Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_forward_split_weighted_v2_kernel.cu(895): warning #68-D: integer conversion resulted in a change of sign 2025-05-07T19:54:29.0416331Z reinterpret_cast(&lxu_cache_weights[cache_idx * max_D_cache]); 2025-05-07T19:54:29.0417246Z ^ 2025-05-07T19:54:29.0420440Z detected during instantiation of "void split_embedding_codegen_forward_weighted_v2_kernel(const emb_t *, const emb_t *, const cache_t *, const int32_t *, uint32_t, uint32_t, __nv_bool, uint32_t, fbgemm_gpu::FixedDivisor, const index_t *, const float *, const index_t *, const uint32_t *, const int64_t *, const int32_t *, output_t *) [with emb_t=c10::Half, cache_t=c10::Half, output_t=c10::BFloat16, index_t=int32_t, USE_LXU_CACHE=false]" at line 1943 2025-05-07T19:54:29.0423779Z 2025-05-07T19:54:29.0425097Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_forward_split_weighted_v2_kernel.cu(895): warning #68-D: integer conversion resulted in a change of sign 2025-05-07T19:54:29.0427048Z reinterpret_cast(&lxu_cache_weights[cache_idx * max_D_cache]); 2025-05-07T19:54:29.0427967Z ^ 2025-05-07T19:54:29.0431733Z detected during instantiation of "void split_embedding_codegen_forward_weighted_v2_kernel(const emb_t *, const emb_t *, const cache_t *, const int32_t *, uint32_t, uint32_t, __nv_bool, uint32_t, fbgemm_gpu::FixedDivisor, const index_t *, const float *, const index_t *, const uint32_t *, const int64_t *, const int32_t *, output_t *) [with emb_t=float, cache_t=float, output_t=c10::BFloat16, index_t=int64_t, USE_LXU_CACHE=false]" at line 1995 2025-05-07T19:54:29.0434873Z 2025-05-07T19:54:29.0436129Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_forward_split_weighted_v2_kernel.cu(895): warning #68-D: integer conversion resulted in a change of sign 2025-05-07T19:54:29.0438073Z reinterpret_cast(&lxu_cache_weights[cache_idx * max_D_cache]); 2025-05-07T19:54:29.0438978Z ^ 2025-05-07T19:54:29.0442418Z detected during instantiation of "void split_embedding_codegen_forward_weighted_v2_kernel(const emb_t *, const emb_t *, const cache_t *, const int32_t *, uint32_t, uint32_t, __nv_bool, uint32_t, fbgemm_gpu::FixedDivisor, const index_t *, const float *, const index_t *, const uint32_t *, const int64_t *, const int32_t *, output_t *) [with emb_t=float, cache_t=c10::Half, output_t=c10::BFloat16, index_t=int64_t, USE_LXU_CACHE=false]" at line 2047 2025-05-07T19:54:29.0445624Z 2025-05-07T19:54:29.0446839Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_forward_split_weighted_v2_kernel.cu(895): warning #68-D: integer conversion resulted in a change of sign 2025-05-07T19:54:29.0448831Z reinterpret_cast(&lxu_cache_weights[cache_idx * max_D_cache]); 2025-05-07T19:54:29.0449747Z ^ 2025-05-07T19:54:29.0453274Z detected during instantiation of "void split_embedding_codegen_forward_weighted_v2_kernel(const emb_t *, const emb_t *, const cache_t *, const int32_t *, uint32_t, uint32_t, __nv_bool, uint32_t, fbgemm_gpu::FixedDivisor, const index_t *, const float *, const index_t *, const uint32_t *, const int64_t *, const int32_t *, output_t *) [with emb_t=c10::Half, cache_t=float, output_t=c10::BFloat16, index_t=int64_t, USE_LXU_CACHE=false]" at line 2099 2025-05-07T19:54:29.0456030Z 2025-05-07T19:54:29.0457645Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_forward_split_weighted_v2_kernel.cu(895): warning #68-D: integer conversion resulted in a change of sign 2025-05-07T19:54:29.0459477Z reinterpret_cast(&lxu_cache_weights[cache_idx * max_D_cache]); 2025-05-07T19:54:29.0460377Z ^ 2025-05-07T19:54:29.0463730Z detected during instantiation of "void split_embedding_codegen_forward_weighted_v2_kernel(const emb_t *, const emb_t *, const cache_t *, const int32_t *, uint32_t, uint32_t, __nv_bool, uint32_t, fbgemm_gpu::FixedDivisor, const index_t *, const float *, const index_t *, const uint32_t *, const int64_t *, const int32_t *, output_t *) [with emb_t=c10::Half, cache_t=c10::Half, output_t=c10::BFloat16, index_t=int64_t, USE_LXU_CACHE=false]" at line 2151 2025-05-07T19:54:29.0467095Z 2025-05-07T19:54:29.0468282Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_forward_split_weighted_v2_kernel.cu(895): warning #68-D: integer conversion resulted in a change of sign 2025-05-07T19:54:29.0470220Z reinterpret_cast(&lxu_cache_weights[cache_idx * max_D_cache]); 2025-05-07T19:54:29.0471091Z ^ 2025-05-07T19:54:29.0474464Z detected during instantiation of "void split_embedding_codegen_forward_weighted_v2_kernel(const emb_t *, const emb_t *, const cache_t *, const int32_t *, uint32_t, uint32_t, __nv_bool, uint32_t, fbgemm_gpu::FixedDivisor, const index_t *, const float *, const index_t *, const uint32_t *, const int64_t *, const int32_t *, output_t *) [with emb_t=float, cache_t=float, output_t=float, index_t=int32_t, USE_LXU_CACHE=false]" at line 955 2025-05-07T19:54:29.0477534Z 2025-05-07T19:54:29.0477988Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:54:29.0478664Z 2025-05-07T19:54:29.0479909Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_forward_split_weighted_v2_kernel.cu(895): warning #68-D: integer conversion resulted in a change of sign 2025-05-07T19:54:29.0481845Z reinterpret_cast(&lxu_cache_weights[cache_idx * max_D_cache]); 2025-05-07T19:54:29.0482734Z ^ 2025-05-07T19:54:29.0486173Z detected during instantiation of "void split_embedding_codegen_forward_weighted_v2_kernel(const emb_t *, const emb_t *, const cache_t *, const int32_t *, uint32_t, uint32_t, __nv_bool, uint32_t, fbgemm_gpu::FixedDivisor, const index_t *, const float *, const index_t *, const uint32_t *, const int64_t *, const int32_t *, output_t *) [with emb_t=float, cache_t=c10::Half, output_t=float, index_t=int32_t, USE_LXU_CACHE=false]" at line 1007 2025-05-07T19:54:29.0489403Z 2025-05-07T19:54:29.0490886Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_forward_split_weighted_v2_kernel.cu(895): warning #68-D: integer conversion resulted in a change of sign 2025-05-07T19:54:29.0492843Z reinterpret_cast(&lxu_cache_weights[cache_idx * max_D_cache]); 2025-05-07T19:54:29.0493716Z ^ 2025-05-07T19:54:29.0497353Z detected during instantiation of "void split_embedding_codegen_forward_weighted_v2_kernel(const emb_t *, const emb_t *, const cache_t *, const int32_t *, uint32_t, uint32_t, __nv_bool, uint32_t, fbgemm_gpu::FixedDivisor, const index_t *, const float *, const index_t *, const uint32_t *, const int64_t *, const int32_t *, output_t *) [with emb_t=c10::Half, cache_t=float, output_t=float, index_t=int32_t, USE_LXU_CACHE=false]" at line 1059 2025-05-07T19:54:29.0500570Z 2025-05-07T19:54:29.0502145Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_forward_split_weighted_v2_kernel.cu(895): warning #68-D: integer conversion resulted in a change of sign 2025-05-07T19:54:29.0504104Z reinterpret_cast(&lxu_cache_weights[cache_idx * max_D_cache]); 2025-05-07T19:54:29.0505003Z ^ 2025-05-07T19:54:29.0508325Z detected during instantiation of "void split_embedding_codegen_forward_weighted_v2_kernel(const emb_t *, const emb_t *, const cache_t *, const int32_t *, uint32_t, uint32_t, __nv_bool, uint32_t, fbgemm_gpu::FixedDivisor, const index_t *, const float *, const index_t *, const uint32_t *, const int64_t *, const int32_t *, output_t *) [with emb_t=c10::Half, cache_t=c10::Half, output_t=float, index_t=int32_t, USE_LXU_CACHE=false]" at line 1111 2025-05-07T19:54:29.0511159Z 2025-05-07T19:54:29.0512369Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_forward_split_weighted_v2_kernel.cu(895): warning #68-D: integer conversion resulted in a change of sign 2025-05-07T19:54:29.0514255Z reinterpret_cast(&lxu_cache_weights[cache_idx * max_D_cache]); 2025-05-07T19:54:29.0515122Z ^ 2025-05-07T19:54:29.0518486Z detected during instantiation of "void split_embedding_codegen_forward_weighted_v2_kernel(const emb_t *, const emb_t *, const cache_t *, const int32_t *, uint32_t, uint32_t, __nv_bool, uint32_t, fbgemm_gpu::FixedDivisor, const index_t *, const float *, const index_t *, const uint32_t *, const int64_t *, const int32_t *, output_t *) [with emb_t=float, cache_t=float, output_t=float, index_t=int64_t, USE_LXU_CACHE=false]" at line 1163 2025-05-07T19:54:29.0521480Z 2025-05-07T19:54:29.0522758Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_forward_split_weighted_v2_kernel.cu(895): warning #68-D: integer conversion resulted in a change of sign 2025-05-07T19:54:29.0524701Z reinterpret_cast(&lxu_cache_weights[cache_idx * max_D_cache]); 2025-05-07T19:54:29.0525584Z ^ 2025-05-07T19:54:29.0529283Z detected during instantiation of "void split_embedding_codegen_forward_weighted_v2_kernel(const emb_t *, const emb_t *, const cache_t *, const int32_t *, uint32_t, uint32_t, __nv_bool, uint32_t, fbgemm_gpu::FixedDivisor, const index_t *, const float *, const index_t *, const uint32_t *, const int64_t *, const int32_t *, output_t *) [with emb_t=float, cache_t=c10::Half, output_t=float, index_t=int64_t, USE_LXU_CACHE=false]" at line 1215 2025-05-07T19:54:29.0532439Z 2025-05-07T19:54:29.0533633Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_forward_split_weighted_v2_kernel.cu(895): warning #68-D: integer conversion resulted in a change of sign 2025-05-07T19:54:29.0535606Z reinterpret_cast(&lxu_cache_weights[cache_idx * max_D_cache]); 2025-05-07T19:54:29.0536482Z ^ 2025-05-07T19:54:29.0539952Z detected during instantiation of "void split_embedding_codegen_forward_weighted_v2_kernel(const emb_t *, const emb_t *, const cache_t *, const int32_t *, uint32_t, uint32_t, __nv_bool, uint32_t, fbgemm_gpu::FixedDivisor, const index_t *, const float *, const index_t *, const uint32_t *, const int64_t *, const int32_t *, output_t *) [with emb_t=c10::Half, cache_t=float, output_t=float, index_t=int64_t, USE_LXU_CACHE=false]" at line 1267 2025-05-07T19:54:29.0543168Z 2025-05-07T19:54:29.0544849Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_forward_split_weighted_v2_kernel.cu(895): warning #68-D: integer conversion resulted in a change of sign 2025-05-07T19:54:29.0546866Z reinterpret_cast(&lxu_cache_weights[cache_idx * max_D_cache]); 2025-05-07T19:54:29.0547767Z ^ 2025-05-07T19:54:29.0551235Z detected during instantiation of "void split_embedding_codegen_forward_weighted_v2_kernel(const emb_t *, const emb_t *, const cache_t *, const int32_t *, uint32_t, uint32_t, __nv_bool, uint32_t, fbgemm_gpu::FixedDivisor, const index_t *, const float *, const index_t *, const uint32_t *, const int64_t *, const int32_t *, output_t *) [with emb_t=c10::Half, cache_t=c10::Half, output_t=float, index_t=int64_t, USE_LXU_CACHE=false]" at line 1319 2025-05-07T19:54:29.0554582Z 2025-05-07T19:54:29.0555848Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_forward_split_weighted_v2_kernel.cu(895): warning #68-D: integer conversion resulted in a change of sign 2025-05-07T19:54:29.0557720Z reinterpret_cast(&lxu_cache_weights[cache_idx * max_D_cache]); 2025-05-07T19:54:29.0558648Z ^ 2025-05-07T19:54:29.0561971Z detected during instantiation of "void split_embedding_codegen_forward_weighted_v2_kernel(const emb_t *, const emb_t *, const cache_t *, const int32_t *, uint32_t, uint32_t, __nv_bool, uint32_t, fbgemm_gpu::FixedDivisor, const index_t *, const float *, const index_t *, const uint32_t *, const int64_t *, const int32_t *, output_t *) [with emb_t=float, cache_t=float, output_t=c10::Half, index_t=int32_t, USE_LXU_CACHE=false]" at line 1371 2025-05-07T19:54:29.0565118Z 2025-05-07T19:54:29.0566088Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_forward_split_weighted_v2_kernel.cu(895): warning #68-D: integer conversion resulted in a change of sign 2025-05-07T19:54:29.0567859Z reinterpret_cast(&lxu_cache_weights[cache_idx * max_D_cache]); 2025-05-07T19:54:29.0568729Z ^ 2025-05-07T19:54:29.0572237Z detected during instantiation of "void split_embedding_codegen_forward_weighted_v2_kernel(const emb_t *, const emb_t *, const cache_t *, const int32_t *, uint32_t, uint32_t, __nv_bool, uint32_t, fbgemm_gpu::FixedDivisor, const index_t *, const float *, const index_t *, const uint32_t *, const int64_t *, const int32_t *, output_t *) [with emb_t=float, cache_t=c10::Half, output_t=c10::Half, index_t=int32_t, USE_LXU_CACHE=false]" at line 1423 2025-05-07T19:54:29.0575376Z 2025-05-07T19:54:29.0576625Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_forward_split_weighted_v2_kernel.cu(895): warning #68-D: integer conversion resulted in a change of sign 2025-05-07T19:54:29.0578584Z reinterpret_cast(&lxu_cache_weights[cache_idx * max_D_cache]); 2025-05-07T19:54:29.0579454Z ^ 2025-05-07T19:54:29.0582822Z detected during instantiation of "void split_embedding_codegen_forward_weighted_v2_kernel(const emb_t *, const emb_t *, const cache_t *, const int32_t *, uint32_t, uint32_t, __nv_bool, uint32_t, fbgemm_gpu::FixedDivisor, const index_t *, const float *, const index_t *, const uint32_t *, const int64_t *, const int32_t *, output_t *) [with emb_t=c10::Half, cache_t=float, output_t=c10::Half, index_t=int32_t, USE_LXU_CACHE=false]" at line 1475 2025-05-07T19:54:29.0586025Z 2025-05-07T19:54:29.0589706Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_forward_split_weighted_v2_kernel.cu(895): warning #68-D: integer conversion resulted in a change of sign 2025-05-07T19:54:29.0591784Z reinterpret_cast(&lxu_cache_weights[cache_idx * max_D_cache]); 2025-05-07T19:54:29.0592675Z ^ 2025-05-07T19:54:29.0596266Z detected during instantiation of "void split_embedding_codegen_forward_weighted_v2_kernel(const emb_t *, const emb_t *, const cache_t *, const int32_t *, uint32_t, uint32_t, __nv_bool, uint32_t, fbgemm_gpu::FixedDivisor, const index_t *, const float *, const index_t *, const uint32_t *, const int64_t *, const int32_t *, output_t *) [with emb_t=c10::Half, cache_t=c10::Half, output_t=c10::Half, index_t=int32_t, USE_LXU_CACHE=false]" at line 1527 2025-05-07T19:54:29.0599556Z 2025-05-07T19:54:29.0600857Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_forward_split_weighted_v2_kernel.cu(895): warning #68-D: integer conversion resulted in a change of sign 2025-05-07T19:54:29.0602984Z reinterpret_cast(&lxu_cache_weights[cache_idx * max_D_cache]); 2025-05-07T19:54:29.0603903Z ^ 2025-05-07T19:54:29.0607174Z detected during instantiation of "void split_embedding_codegen_forward_weighted_v2_kernel(const emb_t *, const emb_t *, const cache_t *, const int32_t *, uint32_t, uint32_t, __nv_bool, uint32_t, fbgemm_gpu::FixedDivisor, const index_t *, const float *, const index_t *, const uint32_t *, const int64_t *, const int32_t *, output_t *) [with emb_t=float, cache_t=float, output_t=c10::Half, index_t=int64_t, USE_LXU_CACHE=false]" at line 1579 2025-05-07T19:54:29.0610516Z 2025-05-07T19:54:29.0611824Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_forward_split_weighted_v2_kernel.cu(895): warning #68-D: integer conversion resulted in a change of sign 2025-05-07T19:54:29.0613773Z reinterpret_cast(&lxu_cache_weights[cache_idx * max_D_cache]); 2025-05-07T19:54:29.0614701Z ^ 2025-05-07T19:54:29.0618043Z detected during instantiation of "void split_embedding_codegen_forward_weighted_v2_kernel(const emb_t *, const emb_t *, const cache_t *, const int32_t *, uint32_t, uint32_t, __nv_bool, uint32_t, fbgemm_gpu::FixedDivisor, const index_t *, const float *, const index_t *, const uint32_t *, const int64_t *, const int32_t *, output_t *) [with emb_t=float, cache_t=c10::Half, output_t=c10::Half, index_t=int64_t, USE_LXU_CACHE=false]" at line 1631 2025-05-07T19:54:29.0620958Z 2025-05-07T19:54:29.0622036Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_forward_split_weighted_v2_kernel.cu(895): warning #68-D: integer conversion resulted in a change of sign 2025-05-07T19:54:29.0623911Z reinterpret_cast(&lxu_cache_weights[cache_idx * max_D_cache]); 2025-05-07T19:54:29.0624760Z ^ 2025-05-07T19:54:29.0628164Z detected during instantiation of "void split_embedding_codegen_forward_weighted_v2_kernel(const emb_t *, const emb_t *, const cache_t *, const int32_t *, uint32_t, uint32_t, __nv_bool, uint32_t, fbgemm_gpu::FixedDivisor, const index_t *, const float *, const index_t *, const uint32_t *, const int64_t *, const int32_t *, output_t *) [with emb_t=c10::Half, cache_t=float, output_t=c10::Half, index_t=int64_t, USE_LXU_CACHE=false]" at line 1683 2025-05-07T19:54:29.0631461Z 2025-05-07T19:54:29.0632680Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_forward_split_weighted_v2_kernel.cu(895): warning #68-D: integer conversion resulted in a change of sign 2025-05-07T19:54:29.0635001Z reinterpret_cast(&lxu_cache_weights[cache_idx * max_D_cache]); 2025-05-07T19:54:29.0635905Z ^ 2025-05-07T19:54:29.0639288Z detected during instantiation of "void split_embedding_codegen_forward_weighted_v2_kernel(const emb_t *, const emb_t *, const cache_t *, const int32_t *, uint32_t, uint32_t, __nv_bool, uint32_t, fbgemm_gpu::FixedDivisor, const index_t *, const float *, const index_t *, const uint32_t *, const int64_t *, const int32_t *, output_t *) [with emb_t=c10::Half, cache_t=c10::Half, output_t=c10::Half, index_t=int64_t, USE_LXU_CACHE=false]" at line 1735 2025-05-07T19:54:29.0642491Z 2025-05-07T19:54:29.0643696Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_forward_split_weighted_v2_kernel.cu(895): warning #68-D: integer conversion resulted in a change of sign 2025-05-07T19:54:29.0645804Z reinterpret_cast(&lxu_cache_weights[cache_idx * max_D_cache]); 2025-05-07T19:54:29.0646722Z ^ 2025-05-07T19:54:29.0650247Z detected during instantiation of "void split_embedding_codegen_forward_weighted_v2_kernel(const emb_t *, const emb_t *, const cache_t *, const int32_t *, uint32_t, uint32_t, __nv_bool, uint32_t, fbgemm_gpu::FixedDivisor, const index_t *, const float *, const index_t *, const uint32_t *, const int64_t *, const int32_t *, output_t *) [with emb_t=float, cache_t=float, output_t=c10::BFloat16, index_t=int32_t, USE_LXU_CACHE=false]" at line 1787 2025-05-07T19:54:29.0653524Z 2025-05-07T19:54:29.0654816Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_forward_split_weighted_v2_kernel.cu(895): warning #68-D: integer conversion resulted in a change of sign 2025-05-07T19:54:29.0656749Z reinterpret_cast(&lxu_cache_weights[cache_idx * max_D_cache]); 2025-05-07T19:54:29.0657643Z ^ 2025-05-07T19:54:29.0661145Z detected during instantiation of "void split_embedding_codegen_forward_weighted_v2_kernel(const emb_t *, const emb_t *, const cache_t *, const int32_t *, uint32_t, uint32_t, __nv_bool, uint32_t, fbgemm_gpu::FixedDivisor, const index_t *, const float *, const index_t *, const uint32_t *, const int64_t *, const int32_t *, output_t *) [with emb_t=float, cache_t=c10::Half, output_t=c10::BFloat16, index_t=int32_t, USE_LXU_CACHE=false]" at line 1839 2025-05-07T19:54:29.0664304Z 2025-05-07T19:54:29.0665597Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_forward_split_weighted_v2_kernel.cu(895): warning #68-D: integer conversion resulted in a change of sign 2025-05-07T19:54:29.0667616Z reinterpret_cast(&lxu_cache_weights[cache_idx * max_D_cache]); 2025-05-07T19:54:29.0668501Z ^ 2025-05-07T19:54:29.0671999Z detected during instantiation of "void split_embedding_codegen_forward_weighted_v2_kernel(const emb_t *, const emb_t *, const cache_t *, const int32_t *, uint32_t, uint32_t, __nv_bool, uint32_t, fbgemm_gpu::FixedDivisor, const index_t *, const float *, const index_t *, const uint32_t *, const int64_t *, const int32_t *, output_t *) [with emb_t=c10::Half, cache_t=float, output_t=c10::BFloat16, index_t=int32_t, USE_LXU_CACHE=false]" at line 1891 2025-05-07T19:54:29.0675134Z 2025-05-07T19:54:29.0676358Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_forward_split_weighted_v2_kernel.cu(895): warning #68-D: integer conversion resulted in a change of sign 2025-05-07T19:54:29.0678208Z reinterpret_cast(&lxu_cache_weights[cache_idx * max_D_cache]); 2025-05-07T19:54:29.0679045Z ^ 2025-05-07T19:54:29.0682411Z detected during instantiation of "void split_embedding_codegen_forward_weighted_v2_kernel(const emb_t *, const emb_t *, const cache_t *, const int32_t *, uint32_t, uint32_t, __nv_bool, uint32_t, fbgemm_gpu::FixedDivisor, const index_t *, const float *, const index_t *, const uint32_t *, const int64_t *, const int32_t *, output_t *) [with emb_t=c10::Half, cache_t=c10::Half, output_t=c10::BFloat16, index_t=int32_t, USE_LXU_CACHE=false]" at line 1943 2025-05-07T19:54:29.0685547Z 2025-05-07T19:54:29.0686780Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_forward_split_weighted_v2_kernel.cu(895): warning #68-D: integer conversion resulted in a change of sign 2025-05-07T19:54:29.0688845Z reinterpret_cast(&lxu_cache_weights[cache_idx * max_D_cache]); 2025-05-07T19:54:29.0689727Z ^ 2025-05-07T19:54:29.0693239Z detected during instantiation of "void split_embedding_codegen_forward_weighted_v2_kernel(const emb_t *, const emb_t *, const cache_t *, const int32_t *, uint32_t, uint32_t, __nv_bool, uint32_t, fbgemm_gpu::FixedDivisor, const index_t *, const float *, const index_t *, const uint32_t *, const int64_t *, const int32_t *, output_t *) [with emb_t=float, cache_t=float, output_t=c10::BFloat16, index_t=int64_t, USE_LXU_CACHE=false]" at line 1995 2025-05-07T19:54:29.0696470Z 2025-05-07T19:54:29.0697674Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_forward_split_weighted_v2_kernel.cu(895): warning #68-D: integer conversion resulted in a change of sign 2025-05-07T19:54:29.0699598Z reinterpret_cast(&lxu_cache_weights[cache_idx * max_D_cache]); 2025-05-07T19:54:29.0700460Z ^ 2025-05-07T19:54:29.0703981Z detected during instantiation of "void split_embedding_codegen_forward_weighted_v2_kernel(const emb_t *, const emb_t *, const cache_t *, const int32_t *, uint32_t, uint32_t, __nv_bool, uint32_t, fbgemm_gpu::FixedDivisor, const index_t *, const float *, const index_t *, const uint32_t *, const int64_t *, const int32_t *, output_t *) [with emb_t=float, cache_t=c10::Half, output_t=c10::BFloat16, index_t=int64_t, USE_LXU_CACHE=false]" at line 2047 2025-05-07T19:54:29.0707084Z 2025-05-07T19:54:29.0708331Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_forward_split_weighted_v2_kernel.cu(895): warning #68-D: integer conversion resulted in a change of sign 2025-05-07T19:54:29.0710317Z reinterpret_cast(&lxu_cache_weights[cache_idx * max_D_cache]); 2025-05-07T19:54:29.0711234Z ^ 2025-05-07T19:54:29.0714784Z detected during instantiation of "void split_embedding_codegen_forward_weighted_v2_kernel(const emb_t *, const emb_t *, const cache_t *, const int32_t *, uint32_t, uint32_t, __nv_bool, uint32_t, fbgemm_gpu::FixedDivisor, const index_t *, const float *, const index_t *, const uint32_t *, const int64_t *, const int32_t *, output_t *) [with emb_t=c10::Half, cache_t=float, output_t=c10::BFloat16, index_t=int64_t, USE_LXU_CACHE=false]" at line 2099 2025-05-07T19:54:29.0718034Z 2025-05-07T19:54:29.0719337Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_forward_split_weighted_v2_kernel.cu(895): warning #68-D: integer conversion resulted in a change of sign 2025-05-07T19:54:29.0721293Z reinterpret_cast(&lxu_cache_weights[cache_idx * max_D_cache]); 2025-05-07T19:54:29.0722454Z ^ 2025-05-07T19:54:29.0725983Z detected during instantiation of "void split_embedding_codegen_forward_weighted_v2_kernel(const emb_t *, const emb_t *, const cache_t *, const int32_t *, uint32_t, uint32_t, __nv_bool, uint32_t, fbgemm_gpu::FixedDivisor, const index_t *, const float *, const index_t *, const uint32_t *, const int64_t *, const int32_t *, output_t *) [with emb_t=c10::Half, cache_t=c10::Half, output_t=c10::BFloat16, index_t=int64_t, USE_LXU_CACHE=false]" at line 2151 2025-05-07T19:54:29.0729523Z 2025-05-07T19:54:29.0730886Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_forward_split_weighted_v2_kernel.cu(895): warning #68-D: integer conversion resulted in a change of sign 2025-05-07T19:54:29.0733162Z reinterpret_cast(&lxu_cache_weights[cache_idx * max_D_cache]); 2025-05-07T19:54:29.0734057Z ^ 2025-05-07T19:54:29.0737368Z detected during instantiation of "void split_embedding_codegen_forward_weighted_v2_kernel(const emb_t *, const emb_t *, const cache_t *, const int32_t *, uint32_t, uint32_t, __nv_bool, uint32_t, fbgemm_gpu::FixedDivisor, const index_t *, const float *, const index_t *, const uint32_t *, const int64_t *, const int32_t *, output_t *) [with emb_t=float, cache_t=float, output_t=float, index_t=int32_t, USE_LXU_CACHE=false]" at line 955 2025-05-07T19:54:29.0740390Z 2025-05-07T19:54:29.0740737Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:54:29.0741266Z 2025-05-07T19:54:29.0742353Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_forward_split_weighted_v2_kernel.cu(895): warning #68-D: integer conversion resulted in a change of sign 2025-05-07T19:54:29.0744289Z reinterpret_cast(&lxu_cache_weights[cache_idx * max_D_cache]); 2025-05-07T19:54:29.0745114Z ^ 2025-05-07T19:54:29.0748423Z detected during instantiation of "void split_embedding_codegen_forward_weighted_v2_kernel(const emb_t *, const emb_t *, const cache_t *, const int32_t *, uint32_t, uint32_t, __nv_bool, uint32_t, fbgemm_gpu::FixedDivisor, const index_t *, const float *, const index_t *, const uint32_t *, const int64_t *, const int32_t *, output_t *) [with emb_t=float, cache_t=c10::Half, output_t=float, index_t=int32_t, USE_LXU_CACHE=false]" at line 1007 2025-05-07T19:54:29.0751556Z 2025-05-07T19:54:29.0752793Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_forward_split_weighted_v2_kernel.cu(895): warning #68-D: integer conversion resulted in a change of sign 2025-05-07T19:54:29.0754704Z reinterpret_cast(&lxu_cache_weights[cache_idx * max_D_cache]); 2025-05-07T19:54:29.0755601Z ^ 2025-05-07T19:54:29.0758820Z detected during instantiation of "void split_embedding_codegen_forward_weighted_v2_kernel(const emb_t *, const emb_t *, const cache_t *, const int32_t *, uint32_t, uint32_t, __nv_bool, uint32_t, fbgemm_gpu::FixedDivisor, const index_t *, const float *, const index_t *, const uint32_t *, const int64_t *, const int32_t *, output_t *) [with emb_t=c10::Half, cache_t=float, output_t=float, index_t=int32_t, USE_LXU_CACHE=false]" at line 1059 2025-05-07T19:54:29.0762022Z 2025-05-07T19:54:29.0763304Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_forward_split_weighted_v2_kernel.cu(895): warning #68-D: integer conversion resulted in a change of sign 2025-05-07T19:54:29.0765529Z reinterpret_cast(&lxu_cache_weights[cache_idx * max_D_cache]); 2025-05-07T19:54:29.0766450Z ^ 2025-05-07T19:54:29.0769852Z detected during instantiation of "void split_embedding_codegen_forward_weighted_v2_kernel(const emb_t *, const emb_t *, const cache_t *, const int32_t *, uint32_t, uint32_t, __nv_bool, uint32_t, fbgemm_gpu::FixedDivisor, const index_t *, const float *, const index_t *, const uint32_t *, const int64_t *, const int32_t *, output_t *) [with emb_t=c10::Half, cache_t=c10::Half, output_t=float, index_t=int32_t, USE_LXU_CACHE=false]" at line 1111 2025-05-07T19:54:29.0773257Z 2025-05-07T19:54:29.0774570Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_forward_split_weighted_v2_kernel.cu(895): warning #68-D: integer conversion resulted in a change of sign 2025-05-07T19:54:29.0776680Z reinterpret_cast(&lxu_cache_weights[cache_idx * max_D_cache]); 2025-05-07T19:54:29.0777579Z ^ 2025-05-07T19:54:29.0781049Z detected during instantiation of "void split_embedding_codegen_forward_weighted_v2_kernel(const emb_t *, const emb_t *, const cache_t *, const int32_t *, uint32_t, uint32_t, __nv_bool, uint32_t, fbgemm_gpu::FixedDivisor, const index_t *, const float *, const index_t *, const uint32_t *, const int64_t *, const int32_t *, output_t *) [with emb_t=float, cache_t=float, output_t=float, index_t=int64_t, USE_LXU_CACHE=false]" at line 1163 2025-05-07T19:54:29.0784130Z 2025-05-07T19:54:29.0785408Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_forward_split_weighted_v2_kernel.cu(895): warning #68-D: integer conversion resulted in a change of sign 2025-05-07T19:54:29.0787407Z reinterpret_cast(&lxu_cache_weights[cache_idx * max_D_cache]); 2025-05-07T19:54:29.0788310Z ^ 2025-05-07T19:54:29.0791747Z detected during instantiation of "void split_embedding_codegen_forward_weighted_v2_kernel(const emb_t *, const emb_t *, const cache_t *, const int32_t *, uint32_t, uint32_t, __nv_bool, uint32_t, fbgemm_gpu::FixedDivisor, const index_t *, const float *, const index_t *, const uint32_t *, const int64_t *, const int32_t *, output_t *) [with emb_t=float, cache_t=c10::Half, output_t=float, index_t=int64_t, USE_LXU_CACHE=false]" at line 1215 2025-05-07T19:54:29.0794944Z 2025-05-07T19:54:29.0796199Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_forward_split_weighted_v2_kernel.cu(895): warning #68-D: integer conversion resulted in a change of sign 2025-05-07T19:54:29.0798155Z reinterpret_cast(&lxu_cache_weights[cache_idx * max_D_cache]); 2025-05-07T19:54:29.0799045Z ^ 2025-05-07T19:54:29.0802389Z detected during instantiation of "void split_embedding_codegen_forward_weighted_v2_kernel(const emb_t *, const emb_t *, const cache_t *, const int32_t *, uint32_t, uint32_t, __nv_bool, uint32_t, fbgemm_gpu::FixedDivisor, const index_t *, const float *, const index_t *, const uint32_t *, const int64_t *, const int32_t *, output_t *) [with emb_t=c10::Half, cache_t=float, output_t=float, index_t=int64_t, USE_LXU_CACHE=false]" at line 1267 2025-05-07T19:54:29.0805071Z 2025-05-07T19:54:29.0806213Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_forward_split_weighted_v2_kernel.cu(895): warning #68-D: integer conversion resulted in a change of sign 2025-05-07T19:54:29.0808095Z reinterpret_cast(&lxu_cache_weights[cache_idx * max_D_cache]); 2025-05-07T19:54:29.0809191Z ^ 2025-05-07T19:54:29.0812659Z detected during instantiation of "void split_embedding_codegen_forward_weighted_v2_kernel(const emb_t *, const emb_t *, const cache_t *, const int32_t *, uint32_t, uint32_t, __nv_bool, uint32_t, fbgemm_gpu::FixedDivisor, const index_t *, const float *, const index_t *, const uint32_t *, const int64_t *, const int32_t *, output_t *) [with emb_t=c10::Half, cache_t=c10::Half, output_t=float, index_t=int64_t, USE_LXU_CACHE=false]" at line 1319 2025-05-07T19:54:29.0815742Z 2025-05-07T19:54:29.0816945Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_forward_split_weighted_v2_kernel.cu(895): warning #68-D: integer conversion resulted in a change of sign 2025-05-07T19:54:29.0818851Z reinterpret_cast(&lxu_cache_weights[cache_idx * max_D_cache]); 2025-05-07T19:54:29.0819874Z ^ 2025-05-07T19:54:29.0823341Z detected during instantiation of "void split_embedding_codegen_forward_weighted_v2_kernel(const emb_t *, const emb_t *, const cache_t *, const int32_t *, uint32_t, uint32_t, __nv_bool, uint32_t, fbgemm_gpu::FixedDivisor, const index_t *, const float *, const index_t *, const uint32_t *, const int64_t *, const int32_t *, output_t *) [with emb_t=float, cache_t=float, output_t=c10::Half, index_t=int32_t, USE_LXU_CACHE=false]" at line 1371 2025-05-07T19:54:29.0826556Z 2025-05-07T19:54:29.0827746Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_forward_split_weighted_v2_kernel.cu(895): warning #68-D: integer conversion resulted in a change of sign 2025-05-07T19:54:29.0829988Z reinterpret_cast(&lxu_cache_weights[cache_idx * max_D_cache]); 2025-05-07T19:54:29.0830892Z ^ 2025-05-07T19:54:29.0834258Z detected during instantiation of "void split_embedding_codegen_forward_weighted_v2_kernel(const emb_t *, const emb_t *, const cache_t *, const int32_t *, uint32_t, uint32_t, __nv_bool, uint32_t, fbgemm_gpu::FixedDivisor, const index_t *, const float *, const index_t *, const uint32_t *, const int64_t *, const int32_t *, output_t *) [with emb_t=float, cache_t=c10::Half, output_t=c10::Half, index_t=int32_t, USE_LXU_CACHE=false]" at line 1423 2025-05-07T19:54:29.0837532Z 2025-05-07T19:54:29.0838807Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_forward_split_weighted_v2_kernel.cu(895): warning #68-D: integer conversion resulted in a change of sign 2025-05-07T19:54:29.0840796Z reinterpret_cast(&lxu_cache_weights[cache_idx * max_D_cache]); 2025-05-07T19:54:29.0841734Z ^ 2025-05-07T19:54:29.0845225Z detected during instantiation of "void split_embedding_codegen_forward_weighted_v2_kernel(const emb_t *, const emb_t *, const cache_t *, const int32_t *, uint32_t, uint32_t, __nv_bool, uint32_t, fbgemm_gpu::FixedDivisor, const index_t *, const float *, const index_t *, const uint32_t *, const int64_t *, const int32_t *, output_t *) [with emb_t=c10::Half, cache_t=float, output_t=c10::Half, index_t=int32_t, USE_LXU_CACHE=false]" at line 1475 2025-05-07T19:54:29.0848419Z 2025-05-07T19:54:29.0849734Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_forward_split_weighted_v2_kernel.cu(895): warning #68-D: integer conversion resulted in a change of sign 2025-05-07T19:54:29.0851808Z reinterpret_cast(&lxu_cache_weights[cache_idx * max_D_cache]); 2025-05-07T19:54:29.0852736Z ^ 2025-05-07T19:54:29.0856444Z detected during instantiation of "void split_embedding_codegen_forward_weighted_v2_kernel(const emb_t *, const emb_t *, const cache_t *, const int32_t *, uint32_t, uint32_t, __nv_bool, uint32_t, fbgemm_gpu::FixedDivisor, const index_t *, const float *, const index_t *, const uint32_t *, const int64_t *, const int32_t *, output_t *) [with emb_t=c10::Half, cache_t=c10::Half, output_t=c10::Half, index_t=int32_t, USE_LXU_CACHE=false]" at line 1527 2025-05-07T19:54:29.0859567Z 2025-05-07T19:54:29.0860865Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_forward_split_weighted_v2_kernel.cu(895): warning #68-D: integer conversion resulted in a change of sign 2025-05-07T19:54:29.0862760Z reinterpret_cast(&lxu_cache_weights[cache_idx * max_D_cache]); 2025-05-07T19:54:29.0863887Z ^ 2025-05-07T19:54:29.0867185Z detected during instantiation of "void split_embedding_codegen_forward_weighted_v2_kernel(const emb_t *, const emb_t *, const cache_t *, const int32_t *, uint32_t, uint32_t, __nv_bool, uint32_t, fbgemm_gpu::FixedDivisor, const index_t *, const float *, const index_t *, const uint32_t *, const int64_t *, const int32_t *, output_t *) [with emb_t=float, cache_t=float, output_t=c10::Half, index_t=int64_t, USE_LXU_CACHE=false]" at line 1579 2025-05-07T19:54:29.0869834Z 2025-05-07T19:54:29.0871031Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_forward_split_weighted_v2_kernel.cu(895): warning #68-D: integer conversion resulted in a change of sign 2025-05-07T19:54:29.0872835Z reinterpret_cast(&lxu_cache_weights[cache_idx * max_D_cache]); 2025-05-07T19:54:29.0873703Z ^ 2025-05-07T19:54:29.0877134Z detected during instantiation of "void split_embedding_codegen_forward_weighted_v2_kernel(const emb_t *, const emb_t *, const cache_t *, const int32_t *, uint32_t, uint32_t, __nv_bool, uint32_t, fbgemm_gpu::FixedDivisor, const index_t *, const float *, const index_t *, const uint32_t *, const int64_t *, const int32_t *, output_t *) [with emb_t=float, cache_t=c10::Half, output_t=c10::Half, index_t=int64_t, USE_LXU_CACHE=false]" at line 1631 2025-05-07T19:54:29.0880216Z 2025-05-07T19:54:29.0881452Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_forward_split_weighted_v2_kernel.cu(895): warning #68-D: integer conversion resulted in a change of sign 2025-05-07T19:54:29.0883323Z reinterpret_cast(&lxu_cache_weights[cache_idx * max_D_cache]); 2025-05-07T19:54:29.0884202Z ^ 2025-05-07T19:54:29.0887661Z detected during instantiation of "void split_embedding_codegen_forward_weighted_v2_kernel(const emb_t *, const emb_t *, const cache_t *, const int32_t *, uint32_t, uint32_t, __nv_bool, uint32_t, fbgemm_gpu::FixedDivisor, const index_t *, const float *, const index_t *, const uint32_t *, const int64_t *, const int32_t *, output_t *) [with emb_t=c10::Half, cache_t=float, output_t=c10::Half, index_t=int64_t, USE_LXU_CACHE=false]" at line 1683 2025-05-07T19:54:29.0890847Z 2025-05-07T19:54:29.0892094Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_forward_split_weighted_v2_kernel.cu(895): warning #68-D: integer conversion resulted in a change of sign 2025-05-07T19:54:29.0894043Z reinterpret_cast(&lxu_cache_weights[cache_idx * max_D_cache]); 2025-05-07T19:54:29.0894936Z ^ 2025-05-07T19:54:29.0898622Z detected during instantiation of "void split_embedding_codegen_forward_weighted_v2_kernel(const emb_t *, const emb_t *, const cache_t *, const int32_t *, uint32_t, uint32_t, __nv_bool, uint32_t, fbgemm_gpu::FixedDivisor, const index_t *, const float *, const index_t *, const uint32_t *, const int64_t *, const int32_t *, output_t *) [with emb_t=c10::Half, cache_t=c10::Half, output_t=c10::Half, index_t=int64_t, USE_LXU_CACHE=false]" at line 1735 2025-05-07T19:54:29.0901911Z 2025-05-07T19:54:29.0903197Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_forward_split_weighted_v2_kernel.cu(895): warning #68-D: integer conversion resulted in a change of sign 2025-05-07T19:54:29.0905197Z reinterpret_cast(&lxu_cache_weights[cache_idx * max_D_cache]); 2025-05-07T19:54:29.0906038Z ^ 2025-05-07T19:54:29.0909683Z detected during instantiation of "void split_embedding_codegen_forward_weighted_v2_kernel(const emb_t *, const emb_t *, const cache_t *, const int32_t *, uint32_t, uint32_t, __nv_bool, uint32_t, fbgemm_gpu::FixedDivisor, const index_t *, const float *, const index_t *, const uint32_t *, const int64_t *, const int32_t *, output_t *) [with emb_t=float, cache_t=float, output_t=c10::BFloat16, index_t=int32_t, USE_LXU_CACHE=false]" at line 1787 2025-05-07T19:54:29.0912786Z 2025-05-07T19:54:29.0914061Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_forward_split_weighted_v2_kernel.cu(895): warning #68-D: integer conversion resulted in a change of sign 2025-05-07T19:54:29.0916069Z reinterpret_cast(&lxu_cache_weights[cache_idx * max_D_cache]); 2025-05-07T19:54:29.0916950Z ^ 2025-05-07T19:54:29.0920360Z detected during instantiation of "void split_embedding_codegen_forward_weighted_v2_kernel(const emb_t *, const emb_t *, const cache_t *, const int32_t *, uint32_t, uint32_t, __nv_bool, uint32_t, fbgemm_gpu::FixedDivisor, const index_t *, const float *, const index_t *, const uint32_t *, const int64_t *, const int32_t *, output_t *) [with emb_t=float, cache_t=c10::Half, output_t=c10::BFloat16, index_t=int32_t, USE_LXU_CACHE=false]" at line 1839 2025-05-07T19:54:29.0923635Z 2025-05-07T19:54:29.0924923Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_forward_split_weighted_v2_kernel.cu(895): warning #68-D: integer conversion resulted in a change of sign 2025-05-07T19:54:29.0926872Z reinterpret_cast(&lxu_cache_weights[cache_idx * max_D_cache]); 2025-05-07T19:54:29.0927729Z ^ 2025-05-07T19:54:29.0931300Z detected during instantiation of "void split_embedding_codegen_forward_weighted_v2_kernel(const emb_t *, const emb_t *, const cache_t *, const int32_t *, uint32_t, uint32_t, __nv_bool, uint32_t, fbgemm_gpu::FixedDivisor, const index_t *, const float *, const index_t *, const uint32_t *, const int64_t *, const int32_t *, output_t *) [with emb_t=c10::Half, cache_t=float, output_t=c10::BFloat16, index_t=int32_t, USE_LXU_CACHE=false]" at line 1891 2025-05-07T19:54:29.0934129Z 2025-05-07T19:54:29.0935341Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_forward_split_weighted_v2_kernel.cu(895): warning #68-D: integer conversion resulted in a change of sign 2025-05-07T19:54:29.0937213Z reinterpret_cast(&lxu_cache_weights[cache_idx * max_D_cache]); 2025-05-07T19:54:29.0938097Z ^ 2025-05-07T19:54:29.0941906Z detected during instantiation of "void split_embedding_codegen_forward_weighted_v2_kernel(const emb_t *, const emb_t *, const cache_t *, const int32_t *, uint32_t, uint32_t, __nv_bool, uint32_t, fbgemm_gpu::FixedDivisor, const index_t *, const float *, const index_t *, const uint32_t *, const int64_t *, const int32_t *, output_t *) [with emb_t=c10::Half, cache_t=c10::Half, output_t=c10::BFloat16, index_t=int32_t, USE_LXU_CACHE=false]" at line 1943 2025-05-07T19:54:29.0945088Z 2025-05-07T19:54:29.0946347Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_forward_split_weighted_v2_kernel.cu(895): warning #68-D: integer conversion resulted in a change of sign 2025-05-07T19:54:29.0948219Z reinterpret_cast(&lxu_cache_weights[cache_idx * max_D_cache]); 2025-05-07T19:54:29.0949102Z ^ 2025-05-07T19:54:29.0952682Z detected during instantiation of "void split_embedding_codegen_forward_weighted_v2_kernel(const emb_t *, const emb_t *, const cache_t *, const int32_t *, uint32_t, uint32_t, __nv_bool, uint32_t, fbgemm_gpu::FixedDivisor, const index_t *, const float *, const index_t *, const uint32_t *, const int64_t *, const int32_t *, output_t *) [with emb_t=float, cache_t=float, output_t=c10::BFloat16, index_t=int64_t, USE_LXU_CACHE=false]" at line 1995 2025-05-07T19:54:29.0955841Z 2025-05-07T19:54:29.0957026Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_forward_split_weighted_v2_kernel.cu(895): warning #68-D: integer conversion resulted in a change of sign 2025-05-07T19:54:29.0958974Z reinterpret_cast(&lxu_cache_weights[cache_idx * max_D_cache]); 2025-05-07T19:54:29.0959853Z ^ 2025-05-07T19:54:29.0963383Z detected during instantiation of "void split_embedding_codegen_forward_weighted_v2_kernel(const emb_t *, const emb_t *, const cache_t *, const int32_t *, uint32_t, uint32_t, __nv_bool, uint32_t, fbgemm_gpu::FixedDivisor, const index_t *, const float *, const index_t *, const uint32_t *, const int64_t *, const int32_t *, output_t *) [with emb_t=float, cache_t=c10::Half, output_t=c10::BFloat16, index_t=int64_t, USE_LXU_CACHE=false]" at line 2047 2025-05-07T19:54:29.0966641Z 2025-05-07T19:54:29.0967943Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_forward_split_weighted_v2_kernel.cu(895): warning #68-D: integer conversion resulted in a change of sign 2025-05-07T19:54:29.0969888Z reinterpret_cast(&lxu_cache_weights[cache_idx * max_D_cache]); 2025-05-07T19:54:29.0970928Z ^ 2025-05-07T19:54:29.0974346Z detected during instantiation of "void split_embedding_codegen_forward_weighted_v2_kernel(const emb_t *, const emb_t *, const cache_t *, const int32_t *, uint32_t, uint32_t, __nv_bool, uint32_t, fbgemm_gpu::FixedDivisor, const index_t *, const float *, const index_t *, const uint32_t *, const int64_t *, const int32_t *, output_t *) [with emb_t=c10::Half, cache_t=float, output_t=c10::BFloat16, index_t=int64_t, USE_LXU_CACHE=false]" at line 2099 2025-05-07T19:54:29.0977593Z 2025-05-07T19:54:29.0978868Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_forward_split_weighted_v2_kernel.cu(895): warning #68-D: integer conversion resulted in a change of sign 2025-05-07T19:54:29.0980759Z reinterpret_cast(&lxu_cache_weights[cache_idx * max_D_cache]); 2025-05-07T19:54:29.0981658Z ^ 2025-05-07T19:54:29.0985409Z detected during instantiation of "void split_embedding_codegen_forward_weighted_v2_kernel(const emb_t *, const emb_t *, const cache_t *, const int32_t *, uint32_t, uint32_t, __nv_bool, uint32_t, fbgemm_gpu::FixedDivisor, const index_t *, const float *, const index_t *, const uint32_t *, const int64_t *, const int32_t *, output_t *) [with emb_t=c10::Half, cache_t=c10::Half, output_t=c10::BFloat16, index_t=int64_t, USE_LXU_CACHE=false]" at line 2151 2025-05-07T19:54:29.0988658Z 2025-05-07T19:54:31.8573037Z [158/608] /github/home/miniconda/envs/build_binary/bin/x86_64-conda-linux-gnu-c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_pt2_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/lib -O3 -DNDEBUG -std=c++20 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -mavx2 -mf16c -mfma -fopenmp -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward_pt2.dir/gen_embedding_split_approx_sgd_pt2_autograd.cpp.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward_pt2.dir/gen_embedding_split_approx_sgd_pt2_autograd.cpp.o.d -o CMakeFiles/fbgemm_gpu_tbe_training_backward_pt2.dir/gen_embedding_split_approx_sgd_pt2_autograd.cpp.o -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_split_approx_sgd_pt2_autograd.cpp 2025-05-07T19:54:33.8278851Z [159/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_forward_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -gencode arch=compute_90,code=sm_90 -gencode arch=compute_90a,code=sm_90a -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++20 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_forward.dir/gen_embedding_forward_dense_weighted_kernel.cu.o -MF CMakeFiles/fbgemm_gpu_tbe_training_forward.dir/gen_embedding_forward_dense_weighted_kernel.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_forward_dense_weighted_kernel.cu -o CMakeFiles/fbgemm_gpu_tbe_training_forward.dir/gen_embedding_forward_dense_weighted_kernel.cu.o 2025-05-07T19:54:34.3749234Z [160/608] /github/home/miniconda/envs/build_binary/bin/x86_64-conda-linux-gnu-c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_pt2_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/lib -O3 -DNDEBUG -std=c++20 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -mavx2 -mf16c -mfma -fopenmp -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward_pt2.dir/gen_embedding_split_approx_rowwise_adagrad_pt2_autograd.cpp.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward_pt2.dir/gen_embedding_split_approx_rowwise_adagrad_pt2_autograd.cpp.o.d -o CMakeFiles/fbgemm_gpu_tbe_training_backward_pt2.dir/gen_embedding_split_approx_rowwise_adagrad_pt2_autograd.cpp.o -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_split_approx_rowwise_adagrad_pt2_autograd.cpp 2025-05-07T19:54:36.5600992Z [161/608] /github/home/miniconda/envs/build_binary/bin/x86_64-conda-linux-gnu-c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_pt2_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/lib -O3 -DNDEBUG -std=c++20 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -mavx2 -mf16c -mfma -fopenmp -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward_pt2.dir/gen_embedding_split_partial_rowwise_adam_pt2_autograd.cpp.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward_pt2.dir/gen_embedding_split_partial_rowwise_adam_pt2_autograd.cpp.o.d -o CMakeFiles/fbgemm_gpu_tbe_training_backward_pt2.dir/gen_embedding_split_partial_rowwise_adam_pt2_autograd.cpp.o -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_split_partial_rowwise_adam_pt2_autograd.cpp 2025-05-07T19:54:37.5087837Z [162/608] /github/home/miniconda/envs/build_binary/bin/x86_64-conda-linux-gnu-c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_pt2_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/lib -O3 -DNDEBUG -std=c++20 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -mavx2 -mf16c -mfma -fopenmp -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward_pt2.dir/gen_embedding_split_approx_rowwise_adagrad_with_counter_pt2_autograd.cpp.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward_pt2.dir/gen_embedding_split_approx_rowwise_adagrad_with_counter_pt2_autograd.cpp.o.d -o CMakeFiles/fbgemm_gpu_tbe_training_backward_pt2.dir/gen_embedding_split_approx_rowwise_adagrad_with_counter_pt2_autograd.cpp.o -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_split_approx_rowwise_adagrad_with_counter_pt2_autograd.cpp 2025-05-07T19:54:38.2761895Z [163/608] /github/home/miniconda/envs/build_binary/bin/x86_64-conda-linux-gnu-c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_pt2_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/lib -O3 -DNDEBUG -std=c++20 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -mavx2 -mf16c -mfma -fopenmp -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward_pt2.dir/gen_embedding_split_none_pt2_autograd.cpp.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward_pt2.dir/gen_embedding_split_none_pt2_autograd.cpp.o.d -o CMakeFiles/fbgemm_gpu_tbe_training_backward_pt2.dir/gen_embedding_split_none_pt2_autograd.cpp.o -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_split_none_pt2_autograd.cpp 2025-05-07T19:54:38.9177086Z [164/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_forward_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -gencode arch=compute_90,code=sm_90 -gencode arch=compute_90a,code=sm_90a -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++20 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_forward.dir/gen_embedding_forward_split_unweighted_gwd_codegen_cuda.cu.o -MF CMakeFiles/fbgemm_gpu_tbe_training_forward.dir/gen_embedding_forward_split_unweighted_gwd_codegen_cuda.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_forward_split_unweighted_gwd_codegen_cuda.cu -o CMakeFiles/fbgemm_gpu_tbe_training_forward.dir/gen_embedding_forward_split_unweighted_gwd_codegen_cuda.cu.o 2025-05-07T19:54:38.9313853Z [165/608] /github/home/miniconda/envs/build_binary/bin/x86_64-conda-linux-gnu-c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_pt2_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/lib -O3 -DNDEBUG -std=c++20 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -mavx2 -mf16c -mfma -fopenmp -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward_pt2.dir/gen_embedding_backward_split_adam_pt2_cpu_wrapper.cpp.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward_pt2.dir/gen_embedding_backward_split_adam_pt2_cpu_wrapper.cpp.o.d -o CMakeFiles/fbgemm_gpu_tbe_training_backward_pt2.dir/gen_embedding_backward_split_adam_pt2_cpu_wrapper.cpp.o -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_backward_split_adam_pt2_cpu_wrapper.cpp 2025-05-07T19:54:38.9449233Z [166/608] /github/home/miniconda/envs/build_binary/bin/x86_64-conda-linux-gnu-c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_pt2_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/lib -O3 -DNDEBUG -std=c++20 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -mavx2 -mf16c -mfma -fopenmp -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward_pt2.dir/gen_embedding_backward_split_lamb_pt2_cpu_wrapper.cpp.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward_pt2.dir/gen_embedding_backward_split_lamb_pt2_cpu_wrapper.cpp.o.d -o CMakeFiles/fbgemm_gpu_tbe_training_backward_pt2.dir/gen_embedding_backward_split_lamb_pt2_cpu_wrapper.cpp.o -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_backward_split_lamb_pt2_cpu_wrapper.cpp 2025-05-07T19:54:38.9583742Z [167/608] /github/home/miniconda/envs/build_binary/bin/x86_64-conda-linux-gnu-c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_pt2_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/lib -O3 -DNDEBUG -std=c++20 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -mavx2 -mf16c -mfma -fopenmp -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward_pt2.dir/gen_embedding_backward_split_lars_sgd_pt2_cpu_wrapper.cpp.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward_pt2.dir/gen_embedding_backward_split_lars_sgd_pt2_cpu_wrapper.cpp.o.d -o CMakeFiles/fbgemm_gpu_tbe_training_backward_pt2.dir/gen_embedding_backward_split_lars_sgd_pt2_cpu_wrapper.cpp.o -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_backward_split_lars_sgd_pt2_cpu_wrapper.cpp 2025-05-07T19:54:38.9718667Z [168/608] /github/home/miniconda/envs/build_binary/bin/x86_64-conda-linux-gnu-c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_pt2_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/lib -O3 -DNDEBUG -std=c++20 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -mavx2 -mf16c -mfma -fopenmp -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward_pt2.dir/gen_embedding_backward_split_partial_rowwise_adam_pt2_cpu_wrapper.cpp.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward_pt2.dir/gen_embedding_backward_split_partial_rowwise_adam_pt2_cpu_wrapper.cpp.o.d -o CMakeFiles/fbgemm_gpu_tbe_training_backward_pt2.dir/gen_embedding_backward_split_partial_rowwise_adam_pt2_cpu_wrapper.cpp.o -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_backward_split_partial_rowwise_adam_pt2_cpu_wrapper.cpp 2025-05-07T19:54:39.0025396Z [169/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_forward_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -gencode arch=compute_90,code=sm_90 -gencode arch=compute_90a,code=sm_90a -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++20 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_forward.dir/gen_embedding_forward_ssd_weighted_vbe_codegen_cuda.cu.o -MF CMakeFiles/fbgemm_gpu_tbe_training_forward.dir/gen_embedding_forward_ssd_weighted_vbe_codegen_cuda.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_forward_ssd_weighted_vbe_codegen_cuda.cu -o CMakeFiles/fbgemm_gpu_tbe_training_forward.dir/gen_embedding_forward_ssd_weighted_vbe_codegen_cuda.cu.o 2025-05-07T19:54:39.0065069Z [170/608] /github/home/miniconda/envs/build_binary/bin/x86_64-conda-linux-gnu-c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_pt2_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/lib -O3 -DNDEBUG -std=c++20 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -mavx2 -mf16c -mfma -fopenmp -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward_pt2.dir/gen_embedding_backward_split_partial_rowwise_lamb_pt2_cpu_wrapper.cpp.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward_pt2.dir/gen_embedding_backward_split_partial_rowwise_lamb_pt2_cpu_wrapper.cpp.o.d -o CMakeFiles/fbgemm_gpu_tbe_training_backward_pt2.dir/gen_embedding_backward_split_partial_rowwise_lamb_pt2_cpu_wrapper.cpp.o -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_backward_split_partial_rowwise_lamb_pt2_cpu_wrapper.cpp 2025-05-07T19:54:39.0162858Z [171/608] /github/home/miniconda/envs/build_binary/bin/x86_64-conda-linux-gnu-c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_pt2_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/lib -O3 -DNDEBUG -std=c++20 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -mavx2 -mf16c -mfma -fopenmp -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward_pt2.dir/gen_embedding_backward_split_none_pt2_cpu_wrapper.cpp.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward_pt2.dir/gen_embedding_backward_split_none_pt2_cpu_wrapper.cpp.o.d -o CMakeFiles/fbgemm_gpu_tbe_training_backward_pt2.dir/gen_embedding_backward_split_none_pt2_cpu_wrapper.cpp.o -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_backward_split_none_pt2_cpu_wrapper.cpp 2025-05-07T19:54:39.0200429Z [172/608] /github/home/miniconda/envs/build_binary/bin/x86_64-conda-linux-gnu-c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_pt2_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/lib -O3 -DNDEBUG -std=c++20 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -mavx2 -mf16c -mfma -fopenmp -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward_pt2.dir/gen_embedding_backward_split_rowwise_adagrad_with_counter_pt2_cpu_wrapper.cpp.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward_pt2.dir/gen_embedding_backward_split_rowwise_adagrad_with_counter_pt2_cpu_wrapper.cpp.o.d -o CMakeFiles/fbgemm_gpu_tbe_training_backward_pt2.dir/gen_embedding_backward_split_rowwise_adagrad_with_counter_pt2_cpu_wrapper.cpp.o -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_backward_split_rowwise_adagrad_with_counter_pt2_cpu_wrapper.cpp 2025-05-07T19:54:39.0299590Z [173/608] /github/home/miniconda/envs/build_binary/bin/x86_64-conda-linux-gnu-c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_pt2_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/lib -O3 -DNDEBUG -std=c++20 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -mavx2 -mf16c -mfma -fopenmp -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward_pt2.dir/gen_embedding_backward_split_approx_rowwise_adagrad_pt2_cpu_wrapper.cpp.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward_pt2.dir/gen_embedding_backward_split_approx_rowwise_adagrad_pt2_cpu_wrapper.cpp.o.d -o CMakeFiles/fbgemm_gpu_tbe_training_backward_pt2.dir/gen_embedding_backward_split_approx_rowwise_adagrad_pt2_cpu_wrapper.cpp.o -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_backward_split_approx_rowwise_adagrad_pt2_cpu_wrapper.cpp 2025-05-07T19:54:39.0336487Z [174/608] /github/home/miniconda/envs/build_binary/bin/x86_64-conda-linux-gnu-c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_pt2_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/lib -O3 -DNDEBUG -std=c++20 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -mavx2 -mf16c -mfma -fopenmp -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward_pt2.dir/gen_embedding_backward_split_approx_sgd_pt2_cpu_wrapper.cpp.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward_pt2.dir/gen_embedding_backward_split_approx_sgd_pt2_cpu_wrapper.cpp.o.d -o CMakeFiles/fbgemm_gpu_tbe_training_backward_pt2.dir/gen_embedding_backward_split_approx_sgd_pt2_cpu_wrapper.cpp.o -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_backward_split_approx_sgd_pt2_cpu_wrapper.cpp 2025-05-07T19:54:39.0435268Z [175/608] /github/home/miniconda/envs/build_binary/bin/x86_64-conda-linux-gnu-c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_pt2_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/lib -O3 -DNDEBUG -std=c++20 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -mavx2 -mf16c -mfma -fopenmp -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward_pt2.dir/gen_embedding_backward_split_approx_rowwise_adagrad_with_counter_pt2_cpu_wrapper.cpp.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward_pt2.dir/gen_embedding_backward_split_approx_rowwise_adagrad_with_counter_pt2_cpu_wrapper.cpp.o.d -o CMakeFiles/fbgemm_gpu_tbe_training_backward_pt2.dir/gen_embedding_backward_split_approx_rowwise_adagrad_with_counter_pt2_cpu_wrapper.cpp.o -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_backward_split_approx_rowwise_adagrad_with_counter_pt2_cpu_wrapper.cpp 2025-05-07T19:54:39.0473612Z [176/608] /github/home/miniconda/envs/build_binary/bin/x86_64-conda-linux-gnu-c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_pt2_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/lib -O3 -DNDEBUG -std=c++20 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -mavx2 -mf16c -mfma -fopenmp -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward_pt2.dir/gen_embedding_backward_split_approx_rowwise_adagrad_with_weight_decay_pt2_cpu_wrapper.cpp.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward_pt2.dir/gen_embedding_backward_split_approx_rowwise_adagrad_with_weight_decay_pt2_cpu_wrapper.cpp.o.d -o CMakeFiles/fbgemm_gpu_tbe_training_backward_pt2.dir/gen_embedding_backward_split_approx_rowwise_adagrad_with_weight_decay_pt2_cpu_wrapper.cpp.o -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_backward_split_approx_rowwise_adagrad_with_weight_decay_pt2_cpu_wrapper.cpp 2025-05-07T19:54:39.0570722Z [177/608] /github/home/miniconda/envs/build_binary/bin/x86_64-conda-linux-gnu-c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_pt2_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/lib -O3 -DNDEBUG -std=c++20 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -mavx2 -mf16c -mfma -fopenmp -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward_pt2.dir/gen_embedding_backward_split_rowwise_weighted_adagrad_pt2_cpu_wrapper.cpp.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward_pt2.dir/gen_embedding_backward_split_rowwise_weighted_adagrad_pt2_cpu_wrapper.cpp.o.d -o CMakeFiles/fbgemm_gpu_tbe_training_backward_pt2.dir/gen_embedding_backward_split_rowwise_weighted_adagrad_pt2_cpu_wrapper.cpp.o -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_backward_split_rowwise_weighted_adagrad_pt2_cpu_wrapper.cpp 2025-05-07T19:54:39.0608238Z [178/608] /github/home/miniconda/envs/build_binary/bin/x86_64-conda-linux-gnu-c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_pt2_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/lib -O3 -DNDEBUG -std=c++20 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -mavx2 -mf16c -mfma -fopenmp -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward_pt2.dir/gen_embedding_backward_split_rowwise_adagrad_with_weight_decay_pt2_cpu_wrapper.cpp.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward_pt2.dir/gen_embedding_backward_split_rowwise_adagrad_with_weight_decay_pt2_cpu_wrapper.cpp.o.d -o CMakeFiles/fbgemm_gpu_tbe_training_backward_pt2.dir/gen_embedding_backward_split_rowwise_adagrad_with_weight_decay_pt2_cpu_wrapper.cpp.o -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_backward_split_rowwise_adagrad_with_weight_decay_pt2_cpu_wrapper.cpp 2025-05-07T19:54:41.4637858Z [179/608] /github/home/miniconda/envs/build_binary/bin/x86_64-conda-linux-gnu-c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_pt2_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/lib -O3 -DNDEBUG -std=c++20 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -mavx2 -mf16c -mfma -fopenmp -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward_pt2.dir/gen_embedding_split_partial_rowwise_lamb_pt2_autograd.cpp.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward_pt2.dir/gen_embedding_split_partial_rowwise_lamb_pt2_autograd.cpp.o.d -o CMakeFiles/fbgemm_gpu_tbe_training_backward_pt2.dir/gen_embedding_split_partial_rowwise_lamb_pt2_autograd.cpp.o -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_split_partial_rowwise_lamb_pt2_autograd.cpp 2025-05-07T19:54:41.8862616Z [180/608] /github/home/miniconda/envs/build_binary/bin/x86_64-conda-linux-gnu-c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_pt2_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/lib -O3 -DNDEBUG -std=c++20 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -mavx2 -mf16c -mfma -fopenmp -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward_pt2.dir/gen_embedding_split_approx_rowwise_adagrad_with_weight_decay_pt2_autograd.cpp.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward_pt2.dir/gen_embedding_split_approx_rowwise_adagrad_with_weight_decay_pt2_autograd.cpp.o.d -o CMakeFiles/fbgemm_gpu_tbe_training_backward_pt2.dir/gen_embedding_split_approx_rowwise_adagrad_with_weight_decay_pt2_autograd.cpp.o -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_split_approx_rowwise_adagrad_with_weight_decay_pt2_autograd.cpp 2025-05-07T19:54:44.7590774Z [181/608] /github/home/miniconda/envs/build_binary/bin/x86_64-conda-linux-gnu-c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_pt2_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/lib -O3 -DNDEBUG -std=c++20 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -mavx2 -mf16c -mfma -fopenmp -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward_pt2.dir/gen_embedding_split_rowwise_adagrad_with_weight_decay_pt2_autograd.cpp.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward_pt2.dir/gen_embedding_split_rowwise_adagrad_with_weight_decay_pt2_autograd.cpp.o.d -o CMakeFiles/fbgemm_gpu_tbe_training_backward_pt2.dir/gen_embedding_split_rowwise_adagrad_with_weight_decay_pt2_autograd.cpp.o -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_split_rowwise_adagrad_with_weight_decay_pt2_autograd.cpp 2025-05-07T19:54:45.8150620Z [182/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_forward_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -gencode arch=compute_90,code=sm_90 -gencode arch=compute_90a,code=sm_90a -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++20 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_forward.dir/gen_embedding_forward_dense_unweighted_kernel.cu.o -MF CMakeFiles/fbgemm_gpu_tbe_training_forward.dir/gen_embedding_forward_dense_unweighted_kernel.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_forward_dense_unweighted_kernel.cu -o CMakeFiles/fbgemm_gpu_tbe_training_forward.dir/gen_embedding_forward_dense_unweighted_kernel.cu.o 2025-05-07T19:54:46.3664667Z [183/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_forward_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -gencode arch=compute_90,code=sm_90 -gencode arch=compute_90a,code=sm_90a -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++20 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_forward.dir/gen_embedding_forward_split_unweighted_vbe_codegen_cuda.cu.o -MF CMakeFiles/fbgemm_gpu_tbe_training_forward.dir/gen_embedding_forward_split_unweighted_vbe_codegen_cuda.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_forward_split_unweighted_vbe_codegen_cuda.cu -o CMakeFiles/fbgemm_gpu_tbe_training_forward.dir/gen_embedding_forward_split_unweighted_vbe_codegen_cuda.cu.o 2025-05-07T19:54:46.3839396Z [184/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_inference_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -gencode arch=compute_90,code=sm_90 -gencode arch=compute_90a,code=sm_90a -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++20 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_tbe_inference.dir/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_nobag_int2_codegen_cuda.cu.o -MF CMakeFiles/fbgemm_gpu_tbe_inference.dir/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_nobag_int2_codegen_cuda.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_nobag_int2_codegen_cuda.cu -o CMakeFiles/fbgemm_gpu_tbe_inference.dir/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_nobag_int2_codegen_cuda.cu.o 2025-05-07T19:54:46.3863819Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_nobag_int2_codegen_cuda.cu(220): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:54:46.3865863Z if (output_d >= 0 && output_d < D) { 2025-05-07T19:54:46.3866453Z ^ 2025-05-07T19:54:46.3866751Z 2025-05-07T19:54:46.3867571Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:54:46.3868247Z 2025-05-07T19:54:46.3869859Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_nobag_int2_codegen_cuda.cu(237): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:54:46.3871884Z if (output_d >= 0 && output_d < D) { 2025-05-07T19:54:46.3872412Z ^ 2025-05-07T19:54:46.3872726Z 2025-05-07T19:54:46.3874242Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_nobag_int2_codegen_cuda.cu(247): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:54:46.3876094Z if (output_d >= 0 && output_d < D) { 2025-05-07T19:54:46.3876652Z ^ 2025-05-07T19:54:46.3876971Z 2025-05-07T19:54:46.3878530Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_nobag_int2_codegen_cuda.cu(220): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:54:46.3880370Z if (output_d >= 0 && output_d < D) { 2025-05-07T19:54:46.3880913Z ^ 2025-05-07T19:54:46.3881217Z 2025-05-07T19:54:46.3895491Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:54:46.3896322Z 2025-05-07T19:54:46.3897840Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_nobag_int2_codegen_cuda.cu(237): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:54:46.3899850Z if (output_d >= 0 && output_d < D) { 2025-05-07T19:54:46.3900409Z ^ 2025-05-07T19:54:46.3900739Z 2025-05-07T19:54:46.3902379Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_nobag_int2_codegen_cuda.cu(247): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:54:46.3904424Z if (output_d >= 0 && output_d < D) { 2025-05-07T19:54:46.3904988Z ^ 2025-05-07T19:54:46.3905323Z 2025-05-07T19:54:46.3906933Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_nobag_int2_codegen_cuda.cu(220): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:54:46.3908926Z if (output_d >= 0 && output_d < D) { 2025-05-07T19:54:46.3909532Z ^ 2025-05-07T19:54:46.3909835Z 2025-05-07T19:54:46.3910319Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:54:46.3910863Z 2025-05-07T19:54:46.3912218Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_nobag_int2_codegen_cuda.cu(237): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:54:46.3914393Z if (output_d >= 0 && output_d < D) { 2025-05-07T19:54:46.3914954Z ^ 2025-05-07T19:54:46.3915273Z 2025-05-07T19:54:46.3916782Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_nobag_int2_codegen_cuda.cu(247): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:54:46.3918741Z if (output_d >= 0 && output_d < D) { 2025-05-07T19:54:46.3919315Z ^ 2025-05-07T19:54:46.3919640Z 2025-05-07T19:54:46.3921273Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_nobag_int2_codegen_cuda.cu(220): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:54:46.3923326Z if (output_d >= 0 && output_d < D) { 2025-05-07T19:54:46.3924066Z ^ 2025-05-07T19:54:46.3924369Z 2025-05-07T19:54:46.3924872Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:54:46.3925521Z 2025-05-07T19:54:46.3926911Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_nobag_int2_codegen_cuda.cu(237): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:54:46.3929008Z if (output_d >= 0 && output_d < D) { 2025-05-07T19:54:46.3929520Z ^ 2025-05-07T19:54:46.3929791Z 2025-05-07T19:54:46.3931333Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_nobag_int2_codegen_cuda.cu(247): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:54:46.3933255Z if (output_d >= 0 && output_d < D) { 2025-05-07T19:54:46.3933830Z ^ 2025-05-07T19:54:46.3934169Z 2025-05-07T19:54:46.3935789Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_nobag_int2_codegen_cuda.cu(220): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:54:46.3937854Z if (output_d >= 0 && output_d < D) { 2025-05-07T19:54:46.3938425Z ^ 2025-05-07T19:54:46.3938759Z 2025-05-07T19:54:46.3939217Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:54:46.3939897Z 2025-05-07T19:54:46.3941553Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_nobag_int2_codegen_cuda.cu(237): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:54:46.3943588Z if (output_d >= 0 && output_d < D) { 2025-05-07T19:54:46.3944190Z ^ 2025-05-07T19:54:46.3944484Z 2025-05-07T19:54:46.3946111Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_nobag_int2_codegen_cuda.cu(247): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:54:46.3948174Z if (output_d >= 0 && output_d < D) { 2025-05-07T19:54:46.3948770Z ^ 2025-05-07T19:54:46.3949078Z 2025-05-07T19:54:46.7084216Z [185/608] /github/home/miniconda/envs/build_binary/bin/x86_64-conda-linux-gnu-c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_pt2_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/lib -O3 -DNDEBUG -std=c++20 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -mavx2 -mf16c -mfma -fopenmp -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward_pt2.dir/gen_embedding_split_rowwise_weighted_adagrad_pt2_autograd.cpp.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward_pt2.dir/gen_embedding_split_rowwise_weighted_adagrad_pt2_autograd.cpp.o.d -o CMakeFiles/fbgemm_gpu_tbe_training_backward_pt2.dir/gen_embedding_split_rowwise_weighted_adagrad_pt2_autograd.cpp.o -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_split_rowwise_weighted_adagrad_pt2_autograd.cpp 2025-05-07T19:54:49.0999249Z [186/608] /github/home/miniconda/envs/build_binary/bin/x86_64-conda-linux-gnu-c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_pt2_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/lib -O3 -DNDEBUG -std=c++20 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -mavx2 -mf16c -mfma -fopenmp -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward_pt2.dir/gen_embedding_split_rowwise_adagrad_with_counter_pt2_autograd.cpp.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward_pt2.dir/gen_embedding_split_rowwise_adagrad_with_counter_pt2_autograd.cpp.o.d -o CMakeFiles/fbgemm_gpu_tbe_training_backward_pt2.dir/gen_embedding_split_rowwise_adagrad_with_counter_pt2_autograd.cpp.o -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_split_rowwise_adagrad_with_counter_pt2_autograd.cpp 2025-05-07T19:54:51.3890286Z [187/608] /github/home/miniconda/envs/build_binary/bin/x86_64-conda-linux-gnu-c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_pt2_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/lib -O3 -DNDEBUG -std=c++20 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -mavx2 -mf16c -mfma -fopenmp -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward_pt2.dir/gen_embedding_backward_split_adagrad_pt2_cpu_wrapper.cpp.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward_pt2.dir/gen_embedding_backward_split_adagrad_pt2_cpu_wrapper.cpp.o.d -o CMakeFiles/fbgemm_gpu_tbe_training_backward_pt2.dir/gen_embedding_backward_split_adagrad_pt2_cpu_wrapper.cpp.o -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_backward_split_adagrad_pt2_cpu_wrapper.cpp 2025-05-07T19:54:51.4041093Z [188/608] /github/home/miniconda/envs/build_binary/bin/x86_64-conda-linux-gnu-c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_pt2_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/lib -O3 -DNDEBUG -std=c++20 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -mavx2 -mf16c -mfma -fopenmp -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward_pt2.dir/gen_embedding_backward_split_approx_sgd_pt2_cuda_wrapper.cpp.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward_pt2.dir/gen_embedding_backward_split_approx_sgd_pt2_cuda_wrapper.cpp.o.d -o CMakeFiles/fbgemm_gpu_tbe_training_backward_pt2.dir/gen_embedding_backward_split_approx_sgd_pt2_cuda_wrapper.cpp.o -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_backward_split_approx_sgd_pt2_cuda_wrapper.cpp 2025-05-07T19:54:51.4197914Z [189/608] /github/home/miniconda/envs/build_binary/bin/x86_64-conda-linux-gnu-c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_pt2_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/lib -O3 -DNDEBUG -std=c++20 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -mavx2 -mf16c -mfma -fopenmp -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward_pt2.dir/gen_embedding_backward_split_approx_rowwise_adagrad_pt2_cuda_wrapper.cpp.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward_pt2.dir/gen_embedding_backward_split_approx_rowwise_adagrad_pt2_cuda_wrapper.cpp.o.d -o CMakeFiles/fbgemm_gpu_tbe_training_backward_pt2.dir/gen_embedding_backward_split_approx_rowwise_adagrad_pt2_cuda_wrapper.cpp.o -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_backward_split_approx_rowwise_adagrad_pt2_cuda_wrapper.cpp 2025-05-07T19:54:51.4332165Z [190/608] /github/home/miniconda/envs/build_binary/bin/x86_64-conda-linux-gnu-c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_pt2_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/lib -O3 -DNDEBUG -std=c++20 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -mavx2 -mf16c -mfma -fopenmp -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward_pt2.dir/gen_embedding_backward_split_approx_rowwise_adagrad_with_counter_pt2_cuda_wrapper.cpp.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward_pt2.dir/gen_embedding_backward_split_approx_rowwise_adagrad_with_counter_pt2_cuda_wrapper.cpp.o.d -o CMakeFiles/fbgemm_gpu_tbe_training_backward_pt2.dir/gen_embedding_backward_split_approx_rowwise_adagrad_with_counter_pt2_cuda_wrapper.cpp.o -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_backward_split_approx_rowwise_adagrad_with_counter_pt2_cuda_wrapper.cpp 2025-05-07T19:54:51.4462066Z [191/608] /github/home/miniconda/envs/build_binary/bin/x86_64-conda-linux-gnu-c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_pt2_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/lib -O3 -DNDEBUG -std=c++20 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -mavx2 -mf16c -mfma -fopenmp -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward_pt2.dir/gen_embedding_backward_split_approx_rowwise_adagrad_with_weight_decay_pt2_cuda_wrapper.cpp.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward_pt2.dir/gen_embedding_backward_split_approx_rowwise_adagrad_with_weight_decay_pt2_cuda_wrapper.cpp.o.d -o CMakeFiles/fbgemm_gpu_tbe_training_backward_pt2.dir/gen_embedding_backward_split_approx_rowwise_adagrad_with_weight_decay_pt2_cuda_wrapper.cpp.o -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_backward_split_approx_rowwise_adagrad_with_weight_decay_pt2_cuda_wrapper.cpp 2025-05-07T19:54:51.4594128Z [192/608] /github/home/miniconda/envs/build_binary/bin/x86_64-conda-linux-gnu-c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_pt2_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/lib -O3 -DNDEBUG -std=c++20 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -mavx2 -mf16c -mfma -fopenmp -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward_pt2.dir/gen_embedding_backward_split_rowwise_adagrad_with_weight_decay_pt2_cuda_wrapper.cpp.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward_pt2.dir/gen_embedding_backward_split_rowwise_adagrad_with_weight_decay_pt2_cuda_wrapper.cpp.o.d -o CMakeFiles/fbgemm_gpu_tbe_training_backward_pt2.dir/gen_embedding_backward_split_rowwise_adagrad_with_weight_decay_pt2_cuda_wrapper.cpp.o -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_backward_split_rowwise_adagrad_with_weight_decay_pt2_cuda_wrapper.cpp 2025-05-07T19:54:51.4725638Z [193/608] /github/home/miniconda/envs/build_binary/bin/x86_64-conda-linux-gnu-c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_pt2_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/lib -O3 -DNDEBUG -std=c++20 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -mavx2 -mf16c -mfma -fopenmp -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward_pt2.dir/gen_embedding_backward_split_rowwise_weighted_adagrad_pt2_cuda_wrapper.cpp.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward_pt2.dir/gen_embedding_backward_split_rowwise_weighted_adagrad_pt2_cuda_wrapper.cpp.o.d -o CMakeFiles/fbgemm_gpu_tbe_training_backward_pt2.dir/gen_embedding_backward_split_rowwise_weighted_adagrad_pt2_cuda_wrapper.cpp.o -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_backward_split_rowwise_weighted_adagrad_pt2_cuda_wrapper.cpp 2025-05-07T19:54:51.5703808Z [194/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_forward_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -gencode arch=compute_90,code=sm_90 -gencode arch=compute_90a,code=sm_90a -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++20 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_forward.dir/gen_embedding_forward_dense_unweighted_nobag_kernel_small.cu.o -MF CMakeFiles/fbgemm_gpu_tbe_training_forward.dir/gen_embedding_forward_dense_unweighted_nobag_kernel_small.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_forward_dense_unweighted_nobag_kernel_small.cu -o CMakeFiles/fbgemm_gpu_tbe_training_forward.dir/gen_embedding_forward_dense_unweighted_nobag_kernel_small.cu.o 2025-05-07T19:54:53.0172577Z [195/608] /github/home/miniconda/envs/build_binary/bin/x86_64-conda-linux-gnu-c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_pt2_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/lib -O3 -DNDEBUG -std=c++20 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -mavx2 -mf16c -mfma -fopenmp -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward_pt2.dir/gen_embedding_backward_split_rowwise_adagrad_pt2_cpu_wrapper.cpp.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward_pt2.dir/gen_embedding_backward_split_rowwise_adagrad_pt2_cpu_wrapper.cpp.o.d -o CMakeFiles/fbgemm_gpu_tbe_training_backward_pt2.dir/gen_embedding_backward_split_rowwise_adagrad_pt2_cpu_wrapper.cpp.o -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_backward_split_rowwise_adagrad_pt2_cpu_wrapper.cpp 2025-05-07T19:54:53.2285669Z [196/608] /github/home/miniconda/envs/build_binary/bin/x86_64-conda-linux-gnu-c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_pt2_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/lib -O3 -DNDEBUG -std=c++20 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -mavx2 -mf16c -mfma -fopenmp -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward_pt2.dir/gen_embedding_backward_split_sgd_pt2_cpu_wrapper.cpp.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward_pt2.dir/gen_embedding_backward_split_sgd_pt2_cpu_wrapper.cpp.o.d -o CMakeFiles/fbgemm_gpu_tbe_training_backward_pt2.dir/gen_embedding_backward_split_sgd_pt2_cpu_wrapper.cpp.o -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_backward_split_sgd_pt2_cpu_wrapper.cpp 2025-05-07T19:54:53.6270605Z [197/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_forward_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -gencode arch=compute_90,code=sm_90 -gencode arch=compute_90a,code=sm_90a -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++20 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_forward.dir/gen_embedding_forward_split_unweighted_nobag_kernel_small.cu.o -MF CMakeFiles/fbgemm_gpu_tbe_training_forward.dir/gen_embedding_forward_split_unweighted_nobag_kernel_small.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_forward_split_unweighted_nobag_kernel_small.cu -o CMakeFiles/fbgemm_gpu_tbe_training_forward.dir/gen_embedding_forward_split_unweighted_nobag_kernel_small.cu.o 2025-05-07T19:54:53.9977941Z [198/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_forward_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -gencode arch=compute_90,code=sm_90 -gencode arch=compute_90a,code=sm_90a -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++20 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_forward.dir/gen_embedding_forward_ssd_unweighted_nobag_kernel_small.cu.o -MF CMakeFiles/fbgemm_gpu_tbe_training_forward.dir/gen_embedding_forward_ssd_unweighted_nobag_kernel_small.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_forward_ssd_unweighted_nobag_kernel_small.cu -o CMakeFiles/fbgemm_gpu_tbe_training_forward.dir/gen_embedding_forward_ssd_unweighted_nobag_kernel_small.cu.o 2025-05-07T19:54:57.4952341Z [199/608] /github/home/miniconda/envs/build_binary/bin/x86_64-conda-linux-gnu-c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_pt2_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/lib -O3 -DNDEBUG -std=c++20 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -mavx2 -mf16c -mfma -fopenmp -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward_pt2.dir/gen_embedding_backward_split_adagrad_pt2_cuda_wrapper.cpp.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward_pt2.dir/gen_embedding_backward_split_adagrad_pt2_cuda_wrapper.cpp.o.d -o CMakeFiles/fbgemm_gpu_tbe_training_backward_pt2.dir/gen_embedding_backward_split_adagrad_pt2_cuda_wrapper.cpp.o -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_backward_split_adagrad_pt2_cuda_wrapper.cpp 2025-05-07T19:55:01.1153978Z [200/608] /github/home/miniconda/envs/build_binary/bin/x86_64-conda-linux-gnu-c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_pt2_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/lib -O3 -DNDEBUG -std=c++20 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -mavx2 -mf16c -mfma -fopenmp -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward_pt2.dir/gen_embedding_backward_split_sgd_pt2_cuda_wrapper.cpp.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward_pt2.dir/gen_embedding_backward_split_sgd_pt2_cuda_wrapper.cpp.o.d -o CMakeFiles/fbgemm_gpu_tbe_training_backward_pt2.dir/gen_embedding_backward_split_sgd_pt2_cuda_wrapper.cpp.o -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_backward_split_sgd_pt2_cuda_wrapper.cpp 2025-05-07T19:55:03.0113472Z [201/608] /github/home/miniconda/envs/build_binary/bin/x86_64-conda-linux-gnu-c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_pt2_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/lib -O3 -DNDEBUG -std=c++20 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -mavx2 -mf16c -mfma -fopenmp -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward_pt2.dir/gen_embedding_backward_split_rowwise_adagrad_pt2_cuda_wrapper.cpp.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward_pt2.dir/gen_embedding_backward_split_rowwise_adagrad_pt2_cuda_wrapper.cpp.o.d -o CMakeFiles/fbgemm_gpu_tbe_training_backward_pt2.dir/gen_embedding_backward_split_rowwise_adagrad_pt2_cuda_wrapper.cpp.o -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_backward_split_rowwise_adagrad_pt2_cuda_wrapper.cpp 2025-05-07T19:55:03.5140475Z [202/608] /github/home/miniconda/envs/build_binary/bin/x86_64-conda-linux-gnu-c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_pt2_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/lib -O3 -DNDEBUG -std=c++20 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -mavx2 -mf16c -mfma -fopenmp -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward_pt2.dir/gen_embedding_ssd_rowwise_adagrad_pt2_autograd.cpp.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward_pt2.dir/gen_embedding_ssd_rowwise_adagrad_pt2_autograd.cpp.o.d -o CMakeFiles/fbgemm_gpu_tbe_training_backward_pt2.dir/gen_embedding_ssd_rowwise_adagrad_pt2_autograd.cpp.o -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_ssd_rowwise_adagrad_pt2_autograd.cpp 2025-05-07T19:55:03.6058005Z [203/608] /github/home/miniconda/envs/build_binary/bin/x86_64-conda-linux-gnu-c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_pt2_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/lib -O3 -DNDEBUG -std=c++20 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -mavx2 -mf16c -mfma -fopenmp -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward_pt2.dir/gen_embedding_backward_split_none_pt2_cuda_wrapper.cpp.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward_pt2.dir/gen_embedding_backward_split_none_pt2_cuda_wrapper.cpp.o.d -o CMakeFiles/fbgemm_gpu_tbe_training_backward_pt2.dir/gen_embedding_backward_split_none_pt2_cuda_wrapper.cpp.o -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_backward_split_none_pt2_cuda_wrapper.cpp 2025-05-07T19:55:04.2891219Z [204/608] /github/home/miniconda/envs/build_binary/bin/x86_64-conda-linux-gnu-c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_pt2_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/lib -O3 -DNDEBUG -std=c++20 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -mavx2 -mf16c -mfma -fopenmp -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward_pt2.dir/gen_embedding_backward_split_lars_sgd_pt2_cuda_wrapper.cpp.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward_pt2.dir/gen_embedding_backward_split_lars_sgd_pt2_cuda_wrapper.cpp.o.d -o CMakeFiles/fbgemm_gpu_tbe_training_backward_pt2.dir/gen_embedding_backward_split_lars_sgd_pt2_cuda_wrapper.cpp.o -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_backward_split_lars_sgd_pt2_cuda_wrapper.cpp 2025-05-07T19:55:04.6003309Z [205/608] /github/home/miniconda/envs/build_binary/bin/x86_64-conda-linux-gnu-c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_pt2_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/lib -O3 -DNDEBUG -std=c++20 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -mavx2 -mf16c -mfma -fopenmp -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward_pt2.dir/gen_embedding_backward_split_lamb_pt2_cuda_wrapper.cpp.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward_pt2.dir/gen_embedding_backward_split_lamb_pt2_cuda_wrapper.cpp.o.d -o CMakeFiles/fbgemm_gpu_tbe_training_backward_pt2.dir/gen_embedding_backward_split_lamb_pt2_cuda_wrapper.cpp.o -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_backward_split_lamb_pt2_cuda_wrapper.cpp 2025-05-07T19:55:06.1650816Z [206/608] /github/home/miniconda/envs/build_binary/bin/x86_64-conda-linux-gnu-c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_pt2_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/lib -O3 -DNDEBUG -std=c++20 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -mavx2 -mf16c -mfma -fopenmp -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward_pt2.dir/gen_embedding_backward_split_partial_rowwise_adam_pt2_cuda_wrapper.cpp.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward_pt2.dir/gen_embedding_backward_split_partial_rowwise_adam_pt2_cuda_wrapper.cpp.o.d -o CMakeFiles/fbgemm_gpu_tbe_training_backward_pt2.dir/gen_embedding_backward_split_partial_rowwise_adam_pt2_cuda_wrapper.cpp.o -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_backward_split_partial_rowwise_adam_pt2_cuda_wrapper.cpp 2025-05-07T19:55:06.1956893Z [207/608] /github/home/miniconda/envs/build_binary/bin/x86_64-conda-linux-gnu-c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_pt2_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/lib -O3 -DNDEBUG -std=c++20 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -mavx2 -mf16c -mfma -fopenmp -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward_pt2.dir/gen_embedding_backward_split_partial_rowwise_lamb_pt2_cuda_wrapper.cpp.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward_pt2.dir/gen_embedding_backward_split_partial_rowwise_lamb_pt2_cuda_wrapper.cpp.o.d -o CMakeFiles/fbgemm_gpu_tbe_training_backward_pt2.dir/gen_embedding_backward_split_partial_rowwise_lamb_pt2_cuda_wrapper.cpp.o -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_backward_split_partial_rowwise_lamb_pt2_cuda_wrapper.cpp 2025-05-07T19:55:07.7446226Z [208/608] /github/home/miniconda/envs/build_binary/bin/x86_64-conda-linux-gnu-c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/lib -O3 -DNDEBUG -std=c++20 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -mavx2 -mf16c -mfma -fopenmp -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_dense_split_cpu.cpp.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_dense_split_cpu.cpp.o.d -o CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_dense_split_cpu.cpp.o -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_backward_dense_split_cpu.cpp 2025-05-07T19:55:09.4762966Z [209/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_inference_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -gencode arch=compute_90,code=sm_90 -gencode arch=compute_90a,code=sm_90a -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++20 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_tbe_inference.dir/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_nobag_fp16_codegen_cuda.cu.o -MF CMakeFiles/fbgemm_gpu_tbe_inference.dir/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_nobag_fp16_codegen_cuda.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_nobag_fp16_codegen_cuda.cu -o CMakeFiles/fbgemm_gpu_tbe_inference.dir/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_nobag_fp16_codegen_cuda.cu.o 2025-05-07T19:55:09.4782852Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_nobag_fp16_codegen_cuda.cu(216): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:55:09.4784612Z if (output_d >= 0 && output_d < D) { 2025-05-07T19:55:09.4785157Z ^ 2025-05-07T19:55:09.4785564Z 2025-05-07T19:55:09.4785993Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:55:09.4786583Z 2025-05-07T19:55:09.4787934Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_nobag_fp16_codegen_cuda.cu(233): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:55:09.4789681Z if (output_d >= 0 && output_d < D) { 2025-05-07T19:55:09.4790236Z ^ 2025-05-07T19:55:09.4790532Z 2025-05-07T19:55:09.4791838Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_nobag_fp16_codegen_cuda.cu(243): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:55:09.4793538Z if (output_d >= 0 && output_d < D) { 2025-05-07T19:55:09.4794103Z ^ 2025-05-07T19:55:09.4794400Z 2025-05-07T19:55:09.4795678Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_nobag_fp16_codegen_cuda.cu(216): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:55:09.4797406Z if (output_d >= 0 && output_d < D) { 2025-05-07T19:55:09.4797939Z ^ 2025-05-07T19:55:09.4798578Z 2025-05-07T19:55:09.4798988Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:55:09.4799555Z 2025-05-07T19:55:09.4800910Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_nobag_fp16_codegen_cuda.cu(233): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:55:09.4802571Z if (output_d >= 0 && output_d < D) { 2025-05-07T19:55:09.4803147Z ^ 2025-05-07T19:55:09.4803440Z 2025-05-07T19:55:09.4804737Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_nobag_fp16_codegen_cuda.cu(243): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:55:09.4806447Z if (output_d >= 0 && output_d < D) { 2025-05-07T19:55:09.4807229Z ^ 2025-05-07T19:55:09.4807516Z 2025-05-07T19:55:09.4808820Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_nobag_fp16_codegen_cuda.cu(216): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:55:09.4810651Z if (output_d >= 0 && output_d < D) { 2025-05-07T19:55:09.4811212Z ^ 2025-05-07T19:55:09.4811534Z 2025-05-07T19:55:09.4811932Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:55:09.4812520Z 2025-05-07T19:55:09.4813855Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_nobag_fp16_codegen_cuda.cu(233): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:55:09.4815534Z if (output_d >= 0 && output_d < D) { 2025-05-07T19:55:09.4816094Z ^ 2025-05-07T19:55:09.4816384Z 2025-05-07T19:55:09.4817716Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_nobag_fp16_codegen_cuda.cu(243): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:55:09.4819374Z if (output_d >= 0 && output_d < D) { 2025-05-07T19:55:09.4819925Z ^ 2025-05-07T19:55:09.4820211Z 2025-05-07T19:55:09.4821512Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_nobag_fp16_codegen_cuda.cu(216): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:55:09.4823172Z if (output_d >= 0 && output_d < D) { 2025-05-07T19:55:09.4823736Z ^ 2025-05-07T19:55:09.4824016Z 2025-05-07T19:55:09.4824424Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:55:09.4825055Z 2025-05-07T19:55:09.4826385Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_nobag_fp16_codegen_cuda.cu(233): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:55:09.4828100Z if (output_d >= 0 && output_d < D) { 2025-05-07T19:55:09.4828875Z ^ 2025-05-07T19:55:09.4829405Z 2025-05-07T19:55:09.4830748Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_nobag_fp16_codegen_cuda.cu(243): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:55:09.4832407Z if (output_d >= 0 && output_d < D) { 2025-05-07T19:55:09.4832962Z ^ 2025-05-07T19:55:09.4833242Z 2025-05-07T19:55:09.4834989Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_nobag_fp16_codegen_cuda.cu(216): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:55:09.4836690Z if (output_d >= 0 && output_d < D) { 2025-05-07T19:55:09.4837239Z ^ 2025-05-07T19:55:09.4837515Z 2025-05-07T19:55:09.4837904Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:55:09.4838518Z 2025-05-07T19:55:09.4839826Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_nobag_fp16_codegen_cuda.cu(233): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:55:09.4841526Z if (output_d >= 0 && output_d < D) { 2025-05-07T19:55:09.4842052Z ^ 2025-05-07T19:55:09.4842377Z 2025-05-07T19:55:09.4843662Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_nobag_fp16_codegen_cuda.cu(243): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:55:09.4845633Z if (output_d >= 0 && output_d < D) { 2025-05-07T19:55:09.4846206Z ^ 2025-05-07T19:55:09.4846496Z 2025-05-07T19:55:09.6725912Z [210/608] /github/home/miniconda/envs/build_binary/bin/x86_64-conda-linux-gnu-c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/lib -O3 -DNDEBUG -std=c++20 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -mavx2 -mf16c -mfma -fopenmp -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/codegen/training/backward/embedding_backward_dense_host_cpu.cpp.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/codegen/training/backward/embedding_backward_dense_host_cpu.cpp.o.d -o CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/codegen/training/backward/embedding_backward_dense_host_cpu.cpp.o -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/codegen/training/backward/embedding_backward_dense_host_cpu.cpp 2025-05-07T19:55:09.7301357Z [211/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_inference_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -gencode arch=compute_90,code=sm_90 -gencode arch=compute_90a,code=sm_90a -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++20 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_tbe_inference.dir/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_fp16_codegen_cuda.cu.o -MF CMakeFiles/fbgemm_gpu_tbe_inference.dir/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_fp16_codegen_cuda.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_fp16_codegen_cuda.cu -o CMakeFiles/fbgemm_gpu_tbe_inference.dir/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_fp16_codegen_cuda.cu.o 2025-05-07T19:55:09.7325515Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_fp16_codegen_cuda.cu(250): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:55:09.7327687Z if (output_d >= 0 && output_d < D && packed_bag_store_idx < num_packed_bags) { 2025-05-07T19:55:09.7328779Z ^ 2025-05-07T19:55:09.7329095Z 2025-05-07T19:55:09.7329571Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:55:09.7330430Z 2025-05-07T19:55:09.7332053Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_fp16_codegen_cuda.cu(255): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:55:09.7334115Z if (output_d >= 0 && output_d < D) { 2025-05-07T19:55:09.7334691Z ^ 2025-05-07T19:55:09.7334987Z 2025-05-07T19:55:09.7336667Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_fp16_codegen_cuda.cu(272): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:55:09.7338582Z if (output_d >= 0 && output_d < D) { 2025-05-07T19:55:09.7339090Z ^ 2025-05-07T19:55:09.7339369Z 2025-05-07T19:55:09.7340962Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_fp16_codegen_cuda.cu(284): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:55:09.7342984Z if (output_d >= 0 && output_d < D) { 2025-05-07T19:55:09.7343499Z ^ 2025-05-07T19:55:09.7343794Z 2025-05-07T19:55:09.7345302Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_fp16_codegen_cuda.cu(250): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:55:09.7347452Z if (output_d >= 0 && output_d < D && packed_bag_store_idx < num_packed_bags) { 2025-05-07T19:55:09.7348543Z ^ 2025-05-07T19:55:09.7348847Z 2025-05-07T19:55:09.7349319Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:55:09.7350008Z 2025-05-07T19:55:09.7351561Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_fp16_codegen_cuda.cu(255): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:55:09.7353547Z if (output_d >= 0 && output_d < D) { 2025-05-07T19:55:09.7354119Z ^ 2025-05-07T19:55:09.7354408Z 2025-05-07T19:55:09.7355974Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_fp16_codegen_cuda.cu(272): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:55:09.7358191Z if (output_d >= 0 && output_d < D) { 2025-05-07T19:55:09.7358739Z ^ 2025-05-07T19:55:09.7359026Z 2025-05-07T19:55:09.7360585Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_fp16_codegen_cuda.cu(284): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:55:09.7362604Z if (output_d >= 0 && output_d < D) { 2025-05-07T19:55:09.7363156Z ^ 2025-05-07T19:55:09.7363458Z 2025-05-07T19:55:09.7364972Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_fp16_codegen_cuda.cu(250): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:55:09.7367255Z if (output_d >= 0 && output_d < D && packed_bag_store_idx < num_packed_bags) { 2025-05-07T19:55:09.7368044Z ^ 2025-05-07T19:55:09.7368344Z 2025-05-07T19:55:09.7368822Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:55:09.7369516Z 2025-05-07T19:55:09.7371285Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_fp16_codegen_cuda.cu(255): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:55:09.7373316Z if (output_d >= 0 && output_d < D) { 2025-05-07T19:55:09.7373896Z ^ 2025-05-07T19:55:09.7374195Z 2025-05-07T19:55:09.7375779Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_fp16_codegen_cuda.cu(272): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:55:09.7377717Z if (output_d >= 0 && output_d < D) { 2025-05-07T19:55:09.7378282Z ^ 2025-05-07T19:55:09.7378590Z 2025-05-07T19:55:09.7380310Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_fp16_codegen_cuda.cu(284): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:55:09.7382316Z if (output_d >= 0 && output_d < D) { 2025-05-07T19:55:09.7382851Z ^ 2025-05-07T19:55:09.7383133Z 2025-05-07T19:55:09.7384767Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_fp16_codegen_cuda.cu(250): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:55:09.7386962Z if (output_d >= 0 && output_d < D && packed_bag_store_idx < num_packed_bags) { 2025-05-07T19:55:09.7387763Z ^ 2025-05-07T19:55:09.7388058Z 2025-05-07T19:55:09.7388544Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:55:09.7389237Z 2025-05-07T19:55:09.7391058Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_fp16_codegen_cuda.cu(255): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:55:09.7393113Z if (output_d >= 0 && output_d < D) { 2025-05-07T19:55:09.7393685Z ^ 2025-05-07T19:55:09.7394012Z 2025-05-07T19:55:09.7395618Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_fp16_codegen_cuda.cu(272): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:55:09.7397578Z if (output_d >= 0 && output_d < D) { 2025-05-07T19:55:09.7398127Z ^ 2025-05-07T19:55:09.7398421Z 2025-05-07T19:55:09.7399961Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_fp16_codegen_cuda.cu(284): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:55:09.7402189Z if (output_d >= 0 && output_d < D) { 2025-05-07T19:55:09.7402759Z ^ 2025-05-07T19:55:09.7403054Z 2025-05-07T19:55:09.7404694Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_fp16_codegen_cuda.cu(250): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:55:09.7406750Z if (output_d >= 0 && output_d < D && packed_bag_store_idx < num_packed_bags) { 2025-05-07T19:55:09.7407440Z ^ 2025-05-07T19:55:09.7407703Z 2025-05-07T19:55:09.7408176Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:55:09.7408744Z 2025-05-07T19:55:09.7410429Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_fp16_codegen_cuda.cu(255): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:55:09.7412430Z if (output_d >= 0 && output_d < D) { 2025-05-07T19:55:09.7412991Z ^ 2025-05-07T19:55:09.7413315Z 2025-05-07T19:55:09.7414876Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_fp16_codegen_cuda.cu(272): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:55:09.7416854Z if (output_d >= 0 && output_d < D) { 2025-05-07T19:55:09.7417394Z ^ 2025-05-07T19:55:09.7417690Z 2025-05-07T19:55:09.7419233Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_fp16_codegen_cuda.cu(284): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:55:09.7421200Z if (output_d >= 0 && output_d < D) { 2025-05-07T19:55:09.7421749Z ^ 2025-05-07T19:55:09.7422009Z 2025-05-07T19:55:10.0626379Z [212/608] /github/home/miniconda/envs/build_binary/bin/x86_64-conda-linux-gnu-c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_pt2_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/lib -O3 -DNDEBUG -std=c++20 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -mavx2 -mf16c -mfma -fopenmp -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward_pt2.dir/gen_embedding_backward_ssd_rowwise_adagrad_pt2_cuda_wrapper.cpp.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward_pt2.dir/gen_embedding_backward_ssd_rowwise_adagrad_pt2_cuda_wrapper.cpp.o.d -o CMakeFiles/fbgemm_gpu_tbe_training_backward_pt2.dir/gen_embedding_backward_ssd_rowwise_adagrad_pt2_cuda_wrapper.cpp.o -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_backward_ssd_rowwise_adagrad_pt2_cuda_wrapper.cpp 2025-05-07T19:55:11.7942047Z [213/608] /github/home/miniconda/envs/build_binary/bin/x86_64-conda-linux-gnu-c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/lib -O3 -DNDEBUG -std=c++20 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -mavx2 -mf16c -mfma -fopenmp -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_split_adam_cpu.cpp.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_split_adam_cpu.cpp.o.d -o CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_split_adam_cpu.cpp.o -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_backward_split_adam_cpu.cpp 2025-05-07T19:55:12.0867724Z [214/608] /github/home/miniconda/envs/build_binary/bin/x86_64-conda-linux-gnu-c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_pt2_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/lib -O3 -DNDEBUG -std=c++20 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -mavx2 -mf16c -mfma -fopenmp -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward_pt2.dir/gen_embedding_backward_split_adam_pt2_cuda_wrapper.cpp.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward_pt2.dir/gen_embedding_backward_split_adam_pt2_cuda_wrapper.cpp.o.d -o CMakeFiles/fbgemm_gpu_tbe_training_backward_pt2.dir/gen_embedding_backward_split_adam_pt2_cuda_wrapper.cpp.o -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_backward_split_adam_pt2_cuda_wrapper.cpp 2025-05-07T19:55:12.7852037Z [215/608] /github/home/miniconda/envs/build_binary/bin/x86_64-conda-linux-gnu-c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/lib -O3 -DNDEBUG -std=c++20 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -mavx2 -mf16c -mfma -fopenmp -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_split_adagrad_cpu.cpp.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_split_adagrad_cpu.cpp.o.d -o CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_split_adagrad_cpu.cpp.o -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_backward_split_adagrad_cpu.cpp 2025-05-07T19:55:13.2924009Z [216/608] /github/home/miniconda/envs/build_binary/bin/x86_64-conda-linux-gnu-c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/lib -O3 -DNDEBUG -std=c++20 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -mavx2 -mf16c -mfma -fopenmp -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_split_rowwise_adagrad_cpu.cpp.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_split_rowwise_adagrad_cpu.cpp.o.d -o CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_split_rowwise_adagrad_cpu.cpp.o -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_backward_split_rowwise_adagrad_cpu.cpp 2025-05-07T19:55:13.4708377Z [217/608] /github/home/miniconda/envs/build_binary/bin/x86_64-conda-linux-gnu-c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/lib -O3 -DNDEBUG -std=c++20 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -mavx2 -mf16c -mfma -fopenmp -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_split_sgd_cpu.cpp.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_split_sgd_cpu.cpp.o.d -o CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_split_sgd_cpu.cpp.o -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_backward_split_sgd_cpu.cpp 2025-05-07T19:55:13.5348625Z [218/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_inference_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -gencode arch=compute_90,code=sm_90 -gencode arch=compute_90a,code=sm_90a -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++20 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_tbe_inference.dir/gen_embedding_forward_quantized_split_nbit_kernel_weighted_fp16_codegen_cuda.cu.o -MF CMakeFiles/fbgemm_gpu_tbe_inference.dir/gen_embedding_forward_quantized_split_nbit_kernel_weighted_fp16_codegen_cuda.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_weighted_fp16_codegen_cuda.cu -o CMakeFiles/fbgemm_gpu_tbe_inference.dir/gen_embedding_forward_quantized_split_nbit_kernel_weighted_fp16_codegen_cuda.cu.o 2025-05-07T19:55:13.5368547Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_weighted_fp16_codegen_cuda.cu(266): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:55:13.5370549Z if (output_d >= 0 && output_d < D && packed_bag_store_idx < num_packed_bags) { 2025-05-07T19:55:13.5371179Z ^ 2025-05-07T19:55:13.5371455Z 2025-05-07T19:55:13.5371839Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:55:13.5372395Z 2025-05-07T19:55:13.5373667Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_weighted_fp16_codegen_cuda.cu(271): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:55:13.5375282Z if (output_d >= 0 && output_d < D) { 2025-05-07T19:55:13.5375780Z ^ 2025-05-07T19:55:13.5376015Z 2025-05-07T19:55:13.5377388Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_weighted_fp16_codegen_cuda.cu(288): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:55:13.5379041Z if (output_d >= 0 && output_d < D) { 2025-05-07T19:55:13.5379529Z ^ 2025-05-07T19:55:13.5379759Z 2025-05-07T19:55:13.5381024Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_weighted_fp16_codegen_cuda.cu(300): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:55:13.5382598Z if (output_d >= 0 && output_d < D) { 2025-05-07T19:55:13.5383094Z ^ 2025-05-07T19:55:13.5383326Z 2025-05-07T19:55:13.5384562Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_weighted_fp16_codegen_cuda.cu(266): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:55:13.5386325Z if (output_d >= 0 && output_d < D && packed_bag_store_idx < num_packed_bags) { 2025-05-07T19:55:13.5387007Z ^ 2025-05-07T19:55:13.5387251Z 2025-05-07T19:55:13.5387888Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:55:13.5388476Z 2025-05-07T19:55:13.5389741Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_weighted_fp16_codegen_cuda.cu(271): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:55:13.5391397Z if (output_d >= 0 && output_d < D) { 2025-05-07T19:55:13.5391861Z ^ 2025-05-07T19:55:13.5392099Z 2025-05-07T19:55:13.5393383Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_weighted_fp16_codegen_cuda.cu(288): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:55:13.5394959Z if (output_d >= 0 && output_d < D) { 2025-05-07T19:55:13.5395639Z ^ 2025-05-07T19:55:13.5395881Z 2025-05-07T19:55:13.5397150Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_weighted_fp16_codegen_cuda.cu(300): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:55:13.5398799Z if (output_d >= 0 && output_d < D) { 2025-05-07T19:55:13.5399279Z ^ 2025-05-07T19:55:13.5399511Z 2025-05-07T19:55:13.5400773Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_weighted_fp16_codegen_cuda.cu(266): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:55:13.5402569Z if (output_d >= 0 && output_d < D && packed_bag_store_idx < num_packed_bags) { 2025-05-07T19:55:13.5403221Z ^ 2025-05-07T19:55:13.5403458Z 2025-05-07T19:55:13.5403836Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:55:13.5404428Z 2025-05-07T19:55:13.5405710Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_weighted_fp16_codegen_cuda.cu(271): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:55:13.5407348Z if (output_d >= 0 && output_d < D) { 2025-05-07T19:55:13.5407816Z ^ 2025-05-07T19:55:13.5408053Z 2025-05-07T19:55:13.5409362Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_weighted_fp16_codegen_cuda.cu(288): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:55:13.5411148Z if (output_d >= 0 && output_d < D) { 2025-05-07T19:55:13.5411657Z ^ 2025-05-07T19:55:13.5411895Z 2025-05-07T19:55:13.5413200Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_weighted_fp16_codegen_cuda.cu(300): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:55:13.5414821Z if (output_d >= 0 && output_d < D) { 2025-05-07T19:55:13.5415308Z ^ 2025-05-07T19:55:13.5415550Z 2025-05-07T19:55:13.5416833Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_weighted_fp16_codegen_cuda.cu(266): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:55:13.5418617Z if (output_d >= 0 && output_d < D && packed_bag_store_idx < num_packed_bags) { 2025-05-07T19:55:13.5419266Z ^ 2025-05-07T19:55:13.5419516Z 2025-05-07T19:55:13.5419898Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:55:13.5420494Z 2025-05-07T19:55:13.5421835Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_weighted_fp16_codegen_cuda.cu(271): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:55:13.5423648Z if (output_d >= 0 && output_d < D) { 2025-05-07T19:55:13.5424172Z ^ 2025-05-07T19:55:13.5424418Z 2025-05-07T19:55:13.5425722Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_weighted_fp16_codegen_cuda.cu(288): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:55:13.5427317Z if (output_d >= 0 && output_d < D) { 2025-05-07T19:55:13.5427823Z ^ 2025-05-07T19:55:13.5428071Z 2025-05-07T19:55:13.5429610Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_weighted_fp16_codegen_cuda.cu(300): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:55:13.5431261Z if (output_d >= 0 && output_d < D) { 2025-05-07T19:55:13.5431759Z ^ 2025-05-07T19:55:13.5432242Z 2025-05-07T19:55:13.5433533Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_weighted_fp16_codegen_cuda.cu(266): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:55:13.5435296Z if (output_d >= 0 && output_d < D && packed_bag_store_idx < num_packed_bags) { 2025-05-07T19:55:13.5435933Z ^ 2025-05-07T19:55:13.5436178Z 2025-05-07T19:55:13.5436563Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:55:13.5437116Z 2025-05-07T19:55:13.5438456Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_weighted_fp16_codegen_cuda.cu(271): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:55:13.5440059Z if (output_d >= 0 && output_d < D) { 2025-05-07T19:55:13.5440556Z ^ 2025-05-07T19:55:13.5440809Z 2025-05-07T19:55:13.5442160Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_weighted_fp16_codegen_cuda.cu(288): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:55:13.5443760Z if (output_d >= 0 && output_d < D) { 2025-05-07T19:55:13.5444252Z ^ 2025-05-07T19:55:13.5444491Z 2025-05-07T19:55:13.5445812Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_weighted_fp16_codegen_cuda.cu(300): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:55:13.5447406Z if (output_d >= 0 && output_d < D) { 2025-05-07T19:55:13.5447905Z ^ 2025-05-07T19:55:13.5448137Z 2025-05-07T19:55:14.2974036Z [219/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_forward_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -gencode arch=compute_90,code=sm_90 -gencode arch=compute_90a,code=sm_90a -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++20 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_forward.dir/gen_embedding_forward_split_unweighted_vbe_gwd_codegen_cuda.cu.o -MF CMakeFiles/fbgemm_gpu_tbe_training_forward.dir/gen_embedding_forward_split_unweighted_vbe_gwd_codegen_cuda.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_forward_split_unweighted_vbe_gwd_codegen_cuda.cu -o CMakeFiles/fbgemm_gpu_tbe_training_forward.dir/gen_embedding_forward_split_unweighted_vbe_gwd_codegen_cuda.cu.o 2025-05-07T19:55:15.4184002Z [220/608] /github/home/miniconda/envs/build_binary/bin/x86_64-conda-linux-gnu-c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/lib -O3 -DNDEBUG -std=c++20 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -mavx2 -mf16c -mfma -fopenmp -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_split_lamb_cpu.cpp.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_split_lamb_cpu.cpp.o.d -o CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_split_lamb_cpu.cpp.o -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_backward_split_lamb_cpu.cpp 2025-05-07T19:55:17.2265948Z [221/608] /github/home/miniconda/envs/build_binary/bin/x86_64-conda-linux-gnu-c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/lib -O3 -DNDEBUG -std=c++20 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -mavx2 -mf16c -mfma -fopenmp -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_split_lars_sgd_cpu.cpp.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_split_lars_sgd_cpu.cpp.o.d -o CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_split_lars_sgd_cpu.cpp.o -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_backward_split_lars_sgd_cpu.cpp 2025-05-07T19:55:17.8056316Z [222/608] /github/home/miniconda/envs/build_binary/bin/x86_64-conda-linux-gnu-c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/lib -O3 -DNDEBUG -std=c++20 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -mavx2 -mf16c -mfma -fopenmp -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_split_partial_rowwise_adam_cpu.cpp.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_split_partial_rowwise_adam_cpu.cpp.o.d -o CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_split_partial_rowwise_adam_cpu.cpp.o -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_backward_split_partial_rowwise_adam_cpu.cpp 2025-05-07T19:55:17.9202339Z [223/608] /github/home/miniconda/envs/build_binary/bin/x86_64-conda-linux-gnu-c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/lib -O3 -DNDEBUG -std=c++20 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -mavx2 -mf16c -mfma -fopenmp -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_split_partial_rowwise_lamb_cpu.cpp.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_split_partial_rowwise_lamb_cpu.cpp.o.d -o CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_split_partial_rowwise_lamb_cpu.cpp.o -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_backward_split_partial_rowwise_lamb_cpu.cpp 2025-05-07T19:55:18.4766104Z [224/608] /github/home/miniconda/envs/build_binary/bin/x86_64-conda-linux-gnu-c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/lib -O3 -DNDEBUG -std=c++20 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -mavx2 -mf16c -mfma -fopenmp -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_split_none_cpu.cpp.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_split_none_cpu.cpp.o.d -o CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_split_none_cpu.cpp.o -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_backward_split_none_cpu.cpp 2025-05-07T19:55:18.7925789Z [225/608] /github/home/miniconda/envs/build_binary/bin/x86_64-conda-linux-gnu-c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/lib -O3 -DNDEBUG -std=c++20 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -mavx2 -mf16c -mfma -fopenmp -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_split_rowwise_adagrad_with_counter_cpu.cpp.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_split_rowwise_adagrad_with_counter_cpu.cpp.o.d -o CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_split_rowwise_adagrad_with_counter_cpu.cpp.o -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_backward_split_rowwise_adagrad_with_counter_cpu.cpp 2025-05-07T19:55:20.2120514Z [226/608] /github/home/miniconda/envs/build_binary/bin/x86_64-conda-linux-gnu-c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/lib -O3 -DNDEBUG -std=c++20 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -mavx2 -mf16c -mfma -fopenmp -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_split_approx_rowwise_adagrad_cpu.cpp.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_split_approx_rowwise_adagrad_cpu.cpp.o.d -o CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_split_approx_rowwise_adagrad_cpu.cpp.o -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_backward_split_approx_rowwise_adagrad_cpu.cpp 2025-05-07T19:55:20.3651244Z [227/608] /github/home/miniconda/envs/build_binary/bin/x86_64-conda-linux-gnu-c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_pt2_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/lib -O3 -DNDEBUG -std=c++20 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -mavx2 -mf16c -mfma -fopenmp -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward_pt2.dir/gen_embedding_backward_split_rowwise_adagrad_with_counter_pt2_cuda_wrapper.cpp.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward_pt2.dir/gen_embedding_backward_split_rowwise_adagrad_with_counter_pt2_cuda_wrapper.cpp.o.d -o CMakeFiles/fbgemm_gpu_tbe_training_backward_pt2.dir/gen_embedding_backward_split_rowwise_adagrad_with_counter_pt2_cuda_wrapper.cpp.o -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_backward_split_rowwise_adagrad_with_counter_pt2_cuda_wrapper.cpp 2025-05-07T19:55:20.4618356Z [228/608] /github/home/miniconda/envs/build_binary/bin/x86_64-conda-linux-gnu-c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/lib -O3 -DNDEBUG -std=c++20 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -mavx2 -mf16c -mfma -fopenmp -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_split_approx_sgd_cpu.cpp.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_split_approx_sgd_cpu.cpp.o.d -o CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_split_approx_sgd_cpu.cpp.o -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_backward_split_approx_sgd_cpu.cpp 2025-05-07T19:55:21.0716516Z [229/608] : && /github/home/miniconda/envs/build_binary/bin/x86_64-conda-linux-gnu-c++ -fPIC -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/lib -O3 -DNDEBUG -Wl,-O2 -Wl,--sort-common -Wl,--as-needed -Wl,-z,relro -Wl,-z,now -Wl,--disable-new-dtags -Wl,--gc-sections -Wl,--allow-shlib-undefined -Wl,-rpath,/github/home/miniconda/envs/build_binary/lib -Wl,-rpath-link,/github/home/miniconda/envs/build_binary/lib -L/github/home/miniconda/envs/build_binary/lib -L/github/home/miniconda/envs/build_binary/targets/x86_64-linux/lib -L/github/home/miniconda/envs/build_binary/targets/x86_64-linux/lib/stubs -s -shared -Wl,-soname,fbgemm_gpu_tbe_training_backward_pt2.so -o fbgemm_gpu_tbe_training_backward_pt2.so CMakeFiles/fbgemm_gpu_tbe_training_backward_pt2.dir/gen_embedding_split_adagrad_pt2_autograd.cpp.o CMakeFiles/fbgemm_gpu_tbe_training_backward_pt2.dir/gen_embedding_split_rowwise_adagrad_pt2_autograd.cpp.o CMakeFiles/fbgemm_gpu_tbe_training_backward_pt2.dir/gen_embedding_split_sgd_pt2_autograd.cpp.o CMakeFiles/fbgemm_gpu_tbe_training_backward_pt2.dir/gen_embedding_split_adam_pt2_autograd.cpp.o CMakeFiles/fbgemm_gpu_tbe_training_backward_pt2.dir/gen_embedding_split_lamb_pt2_autograd.cpp.o CMakeFiles/fbgemm_gpu_tbe_training_backward_pt2.dir/gen_embedding_split_lars_sgd_pt2_autograd.cpp.o CMakeFiles/fbgemm_gpu_tbe_training_backward_pt2.dir/gen_embedding_split_partial_rowwise_adam_pt2_autograd.cpp.o CMakeFiles/fbgemm_gpu_tbe_training_backward_pt2.dir/gen_embedding_split_partial_rowwise_lamb_pt2_autograd.cpp.o CMakeFiles/fbgemm_gpu_tbe_training_backward_pt2.dir/gen_embedding_split_none_pt2_autograd.cpp.o CMakeFiles/fbgemm_gpu_tbe_training_backward_pt2.dir/gen_embedding_split_rowwise_adagrad_with_counter_pt2_autograd.cpp.o CMakeFiles/fbgemm_gpu_tbe_training_backward_pt2.dir/gen_embedding_split_approx_sgd_pt2_autograd.cpp.o CMakeFiles/fbgemm_gpu_tbe_training_backward_pt2.dir/gen_embedding_split_approx_rowwise_adagrad_pt2_autograd.cpp.o CMakeFiles/fbgemm_gpu_tbe_training_backward_pt2.dir/gen_embedding_split_approx_rowwise_adagrad_with_counter_pt2_autograd.cpp.o CMakeFiles/fbgemm_gpu_tbe_training_backward_pt2.dir/gen_embedding_split_approx_rowwise_adagrad_with_weight_decay_pt2_autograd.cpp.o CMakeFiles/fbgemm_gpu_tbe_training_backward_pt2.dir/gen_embedding_split_rowwise_adagrad_with_weight_decay_pt2_autograd.cpp.o CMakeFiles/fbgemm_gpu_tbe_training_backward_pt2.dir/gen_embedding_split_rowwise_weighted_adagrad_pt2_autograd.cpp.o CMakeFiles/fbgemm_gpu_tbe_training_backward_pt2.dir/gen_embedding_ssd_rowwise_adagrad_pt2_autograd.cpp.o CMakeFiles/fbgemm_gpu_tbe_training_backward_pt2.dir/gen_embedding_backward_split_adagrad_pt2_cpu_wrapper.cpp.o CMakeFiles/fbgemm_gpu_tbe_training_backward_pt2.dir/gen_embedding_backward_split_rowwise_adagrad_pt2_cpu_wrapper.cpp.o CMakeFiles/fbgemm_gpu_tbe_training_backward_pt2.dir/gen_embedding_backward_split_sgd_pt2_cpu_wrapper.cpp.o CMakeFiles/fbgemm_gpu_tbe_training_backward_pt2.dir/gen_embedding_backward_split_adam_pt2_cpu_wrapper.cpp.o CMakeFiles/fbgemm_gpu_tbe_training_backward_pt2.dir/gen_embedding_backward_split_lamb_pt2_cpu_wrapper.cpp.o CMakeFiles/fbgemm_gpu_tbe_training_backward_pt2.dir/gen_embedding_backward_split_lars_sgd_pt2_cpu_wrapper.cpp.o CMakeFiles/fbgemm_gpu_tbe_training_backward_pt2.dir/gen_embedding_backward_split_partial_rowwise_adam_pt2_cpu_wrapper.cpp.o CMakeFiles/fbgemm_gpu_tbe_training_backward_pt2.dir/gen_embedding_backward_split_partial_rowwise_lamb_pt2_cpu_wrapper.cpp.o CMakeFiles/fbgemm_gpu_tbe_training_backward_pt2.dir/gen_embedding_backward_split_none_pt2_cpu_wrapper.cpp.o CMakeFiles/fbgemm_gpu_tbe_training_backward_pt2.dir/gen_embedding_backward_split_rowwise_adagrad_with_counter_pt2_cpu_wrapper.cpp.o CMakeFiles/fbgemm_gpu_tbe_training_backward_pt2.dir/gen_embedding_backward_split_approx_sgd_pt2_cpu_wrapper.cpp.o CMakeFiles/fbgemm_gpu_tbe_training_backward_pt2.dir/gen_embedding_backward_split_approx_rowwise_adagrad_pt2_cpu_wrapper.cpp.o CMakeFiles/fbgemm_gpu_tbe_training_backward_pt2.dir/gen_embedding_backward_split_approx_rowwise_adagrad_with_counter_pt2_cpu_wrapper.cpp.o CMakeFiles/fbgemm_gpu_tbe_training_backward_pt2.dir/gen_embedding_backward_split_approx_rowwise_adagrad_with_weight_decay_pt2_cpu_wrapper.cpp.o CMakeFiles/fbgemm_gpu_tbe_training_backward_pt2.dir/gen_embedding_backward_split_rowwise_adagrad_with_weight_decay_pt2_cpu_wrapper.cpp.o CMakeFiles/fbgemm_gpu_tbe_training_backward_pt2.dir/gen_embedding_backward_split_rowwise_weighted_adagrad_pt2_cpu_wrapper.cpp.o CMakeFiles/fbgemm_gpu_tbe_training_backward_pt2.dir/gen_embedding_backward_split_adagrad_pt2_cuda_wrapper.cpp.o CMakeFiles/fbgemm_gpu_tbe_training_backward_pt2.dir/gen_embedding_backward_split_rowwise_adagrad_pt2_cuda_wrapper.cpp.o CMakeFiles/fbgemm_gpu_tbe_training_backward_pt2.dir/gen_embedding_backward_split_sgd_pt2_cuda_wrapper.cpp.o CMakeFiles/fbgemm_gpu_tbe_training_backward_pt2.dir/gen_embedding_backward_split_adam_pt2_cuda_wrapper.cpp.o CMakeFiles/fbgemm_gpu_tbe_training_backward_pt2.dir/gen_embedding_backward_split_lamb_pt2_cuda_wrapper.cpp.o CMakeFiles/fbgemm_gpu_tbe_training_backward_pt2.dir/gen_embedding_backward_split_lars_sgd_pt2_cuda_wrapper.cpp.o CMakeFiles/fbgemm_gpu_tbe_training_backward_pt2.dir/gen_embedding_backward_split_partial_rowwise_adam_pt2_cuda_wrapper.cpp.o CMakeFiles/fbgemm_gpu_tbe_training_backward_pt2.dir/gen_embedding_backward_split_partial_rowwise_lamb_pt2_cuda_wrapper.cpp.o CMakeFiles/fbgemm_gpu_tbe_training_backward_pt2.dir/gen_embedding_backward_split_none_pt2_cuda_wrapper.cpp.o CMakeFiles/fbgemm_gpu_tbe_training_backward_pt2.dir/gen_embedding_backward_split_rowwise_adagrad_with_counter_pt2_cuda_wrapper.cpp.o CMakeFiles/fbgemm_gpu_tbe_training_backward_pt2.dir/gen_embedding_backward_split_approx_sgd_pt2_cuda_wrapper.cpp.o CMakeFiles/fbgemm_gpu_tbe_training_backward_pt2.dir/gen_embedding_backward_split_approx_rowwise_adagrad_pt2_cuda_wrapper.cpp.o CMakeFiles/fbgemm_gpu_tbe_training_backward_pt2.dir/gen_embedding_backward_split_approx_rowwise_adagrad_with_counter_pt2_cuda_wrapper.cpp.o CMakeFiles/fbgemm_gpu_tbe_training_backward_pt2.dir/gen_embedding_backward_split_approx_rowwise_adagrad_with_weight_decay_pt2_cuda_wrapper.cpp.o CMakeFiles/fbgemm_gpu_tbe_training_backward_pt2.dir/gen_embedding_backward_split_rowwise_adagrad_with_weight_decay_pt2_cuda_wrapper.cpp.o CMakeFiles/fbgemm_gpu_tbe_training_backward_pt2.dir/gen_embedding_backward_split_rowwise_weighted_adagrad_pt2_cuda_wrapper.cpp.o CMakeFiles/fbgemm_gpu_tbe_training_backward_pt2.dir/gen_embedding_backward_ssd_rowwise_adagrad_pt2_cuda_wrapper.cpp.o -L/lib/intel64 -L/lib/intel64_win -L/lib/win-x64 -Wl,-rpath,"\$ORIGIN" /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/lib/libc10.so /github/home/miniconda/envs/build_binary/targets/x86_64-linux/lib/libnvrtc.so /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/lib/libc10_cuda.so /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/lib/libnccl.so.2 /github/home/miniconda/envs/build_binary/targets/x86_64-linux/lib/stubs/libcuda.so fbgemm_gpu_tbe_cache.so fbgemm_gpu_tbe_common.so fbgemm_gpu_sparse_async_cumsum.so /github/home/miniconda/envs/build_binary/lib/stubs/libnvidia-ml.so -Wl,--no-as-needed,"/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/lib/libtorch.so" -Wl,--as-needed fbgemm.so fbgemm_gpu_config.so fbgemm_gpu_tbe_utils.so /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/lib/libtorch.so -Wl,--no-as-needed,"/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/lib/libtorch_cpu.so" -Wl,--as-needed -Wl,--no-as-needed,"/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/lib/libtorch_cuda.so" -Wl,--as-needed /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/lib/libc10_cuda.so /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/lib/libc10.so /github/home/miniconda/envs/build_binary/targets/x86_64-linux/lib/libcudart.so -Wl,-rpath-link,/__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build && : 2025-05-07T19:55:21.6162966Z [230/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_forward_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -gencode arch=compute_90,code=sm_90 -gencode arch=compute_90a,code=sm_90a -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++20 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_forward.dir/gen_embedding_forward_dense_weighted_vbe_kernel.cu.o -MF CMakeFiles/fbgemm_gpu_tbe_training_forward.dir/gen_embedding_forward_dense_weighted_vbe_kernel.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_forward_dense_weighted_vbe_kernel.cu -o CMakeFiles/fbgemm_gpu_tbe_training_forward.dir/gen_embedding_forward_dense_weighted_vbe_kernel.cu.o 2025-05-07T19:55:21.9442160Z [231/608] /github/home/miniconda/envs/build_binary/bin/x86_64-conda-linux-gnu-c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/lib -O3 -DNDEBUG -std=c++20 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -mavx2 -mf16c -mfma -fopenmp -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_split_approx_rowwise_adagrad_with_counter_cpu.cpp.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_split_approx_rowwise_adagrad_with_counter_cpu.cpp.o.d -o CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_split_approx_rowwise_adagrad_with_counter_cpu.cpp.o -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_backward_split_approx_rowwise_adagrad_with_counter_cpu.cpp 2025-05-07T19:55:22.1164364Z [232/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_forward_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -gencode arch=compute_90,code=sm_90 -gencode arch=compute_90a,code=sm_90a -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++20 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_forward.dir/gen_embedding_forward_ssd_unweighted_vbe_codegen_cuda.cu.o -MF CMakeFiles/fbgemm_gpu_tbe_training_forward.dir/gen_embedding_forward_ssd_unweighted_vbe_codegen_cuda.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_forward_ssd_unweighted_vbe_codegen_cuda.cu -o CMakeFiles/fbgemm_gpu_tbe_training_forward.dir/gen_embedding_forward_ssd_unweighted_vbe_codegen_cuda.cu.o 2025-05-07T19:55:23.5347908Z [233/608] /github/home/miniconda/envs/build_binary/bin/x86_64-conda-linux-gnu-c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/lib -O3 -DNDEBUG -std=c++20 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -mavx2 -mf16c -mfma -fopenmp -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_split_approx_rowwise_adagrad_with_weight_decay_cpu.cpp.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_split_approx_rowwise_adagrad_with_weight_decay_cpu.cpp.o.d -o CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_split_approx_rowwise_adagrad_with_weight_decay_cpu.cpp.o -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_backward_split_approx_rowwise_adagrad_with_weight_decay_cpu.cpp 2025-05-07T19:55:23.8166235Z [234/608] /github/home/miniconda/envs/build_binary/bin/x86_64-conda-linux-gnu-c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/lib -O3 -DNDEBUG -std=c++20 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -mavx2 -mf16c -mfma -fopenmp -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_split_rowwise_adagrad_with_weight_decay_cpu.cpp.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_split_rowwise_adagrad_with_weight_decay_cpu.cpp.o.d -o CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_split_rowwise_adagrad_with_weight_decay_cpu.cpp.o -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_backward_split_rowwise_adagrad_with_weight_decay_cpu.cpp 2025-05-07T19:55:23.9693281Z [235/608] /github/home/miniconda/envs/build_binary/bin/x86_64-conda-linux-gnu-c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/lib -O3 -DNDEBUG -std=c++20 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -mavx2 -mf16c -mfma -fopenmp -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_split_rowwise_weighted_adagrad_cpu.cpp.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_split_rowwise_weighted_adagrad_cpu.cpp.o.d -o CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_split_rowwise_weighted_adagrad_cpu.cpp.o -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_backward_split_rowwise_weighted_adagrad_cpu.cpp 2025-05-07T19:55:25.2341444Z [236/608] /github/home/miniconda/envs/build_binary/bin/x86_64-conda-linux-gnu-c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/lib -O3 -DNDEBUG -std=c++20 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -mavx2 -mf16c -mfma -fopenmp -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_adagrad_split_cpu.cpp.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_adagrad_split_cpu.cpp.o.d -o CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_adagrad_split_cpu.cpp.o -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_backward_adagrad_split_cpu.cpp 2025-05-07T19:55:26.2219611Z [237/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_inference_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -gencode arch=compute_90,code=sm_90 -gencode arch=compute_90a,code=sm_90a -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++20 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_tbe_inference.dir/gen_embedding_forward_quantized_split_nbit_kernel_weighted_int2_codegen_cuda.cu.o -MF CMakeFiles/fbgemm_gpu_tbe_inference.dir/gen_embedding_forward_quantized_split_nbit_kernel_weighted_int2_codegen_cuda.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_weighted_int2_codegen_cuda.cu -o CMakeFiles/fbgemm_gpu_tbe_inference.dir/gen_embedding_forward_quantized_split_nbit_kernel_weighted_int2_codegen_cuda.cu.o 2025-05-07T19:55:26.2242995Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_weighted_int2_codegen_cuda.cu(270): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:55:26.2245112Z if (output_d >= 0 && output_d < D && packed_bag_store_idx < num_packed_bags) { 2025-05-07T19:55:26.2245834Z ^ 2025-05-07T19:55:26.2246114Z 2025-05-07T19:55:26.2246558Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:55:26.2247131Z 2025-05-07T19:55:26.2248686Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_weighted_int2_codegen_cuda.cu(275): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:55:26.2250773Z if (output_d >= 0 && output_d < D) { 2025-05-07T19:55:26.2251272Z ^ 2025-05-07T19:55:26.2251547Z 2025-05-07T19:55:26.2253109Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_weighted_int2_codegen_cuda.cu(292): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:55:26.2255044Z if (output_d >= 0 && output_d < D) { 2025-05-07T19:55:26.2255613Z ^ 2025-05-07T19:55:26.2255889Z 2025-05-07T19:55:26.2257400Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_weighted_int2_codegen_cuda.cu(304): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:55:26.2259266Z if (output_d >= 0 && output_d < D) { 2025-05-07T19:55:26.2260151Z ^ 2025-05-07T19:55:26.2260442Z 2025-05-07T19:55:26.2261995Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_weighted_int2_codegen_cuda.cu(270): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:55:26.2264104Z if (output_d >= 0 && output_d < D && packed_bag_store_idx < num_packed_bags) { 2025-05-07T19:55:26.2264807Z ^ 2025-05-07T19:55:26.2265119Z 2025-05-07T19:55:26.2265570Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:55:26.2266242Z 2025-05-07T19:55:26.2267553Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_weighted_int2_codegen_cuda.cu(275): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:55:26.2269328Z if (output_d >= 0 && output_d < D) { 2025-05-07T19:55:26.2269801Z ^ 2025-05-07T19:55:26.2270035Z 2025-05-07T19:55:26.2271236Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_weighted_int2_codegen_cuda.cu(292): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:55:26.2272953Z if (output_d >= 0 && output_d < D) { 2025-05-07T19:55:26.2273498Z ^ 2025-05-07T19:55:26.2273743Z 2025-05-07T19:55:26.2275194Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_weighted_int2_codegen_cuda.cu(304): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:55:26.2277166Z if (output_d >= 0 && output_d < D) { 2025-05-07T19:55:26.2277736Z ^ 2025-05-07T19:55:26.2278017Z 2025-05-07T19:55:26.2279600Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_weighted_int2_codegen_cuda.cu(270): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:55:26.2281808Z if (output_d >= 0 && output_d < D && packed_bag_store_idx < num_packed_bags) { 2025-05-07T19:55:26.2282573Z ^ 2025-05-07T19:55:26.2282883Z 2025-05-07T19:55:26.2283337Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:55:26.2284021Z 2025-05-07T19:55:26.2285612Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_weighted_int2_codegen_cuda.cu(275): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:55:26.2287573Z if (output_d >= 0 && output_d < D) { 2025-05-07T19:55:26.2288149Z ^ 2025-05-07T19:55:26.2288443Z 2025-05-07T19:55:26.2290167Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_weighted_int2_codegen_cuda.cu(292): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:55:26.2292083Z if (output_d >= 0 && output_d < D) { 2025-05-07T19:55:26.2292617Z ^ 2025-05-07T19:55:26.2292876Z 2025-05-07T19:55:26.2294329Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_weighted_int2_codegen_cuda.cu(304): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:55:26.2296161Z if (output_d >= 0 && output_d < D) { 2025-05-07T19:55:26.2296667Z ^ 2025-05-07T19:55:26.2296966Z 2025-05-07T19:55:26.2298411Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_weighted_int2_codegen_cuda.cu(270): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:55:26.2300410Z if (output_d >= 0 && output_d < D && packed_bag_store_idx < num_packed_bags) { 2025-05-07T19:55:26.2301312Z ^ 2025-05-07T19:55:26.2301639Z 2025-05-07T19:55:26.2302071Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:55:26.2302710Z 2025-05-07T19:55:26.2304172Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_weighted_int2_codegen_cuda.cu(275): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:55:26.2305987Z if (output_d >= 0 && output_d < D) { 2025-05-07T19:55:26.2306553Z ^ 2025-05-07T19:55:26.2306815Z 2025-05-07T19:55:26.2308179Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_weighted_int2_codegen_cuda.cu(292): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:55:26.2310237Z if (output_d >= 0 && output_d < D) { 2025-05-07T19:55:26.2310751Z ^ 2025-05-07T19:55:26.2311003Z 2025-05-07T19:55:26.2312354Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_weighted_int2_codegen_cuda.cu(304): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:55:26.2314131Z if (output_d >= 0 && output_d < D) { 2025-05-07T19:55:26.2314604Z ^ 2025-05-07T19:55:26.2314906Z 2025-05-07T19:55:26.2316347Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_weighted_int2_codegen_cuda.cu(270): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:55:26.2318453Z if (output_d >= 0 && output_d < D && packed_bag_store_idx < num_packed_bags) { 2025-05-07T19:55:26.2319183Z ^ 2025-05-07T19:55:26.2319496Z 2025-05-07T19:55:26.2319948Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:55:26.2320600Z 2025-05-07T19:55:26.2322146Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_weighted_int2_codegen_cuda.cu(275): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:55:26.2323860Z if (output_d >= 0 && output_d < D) { 2025-05-07T19:55:26.2324382Z ^ 2025-05-07T19:55:26.2324639Z 2025-05-07T19:55:26.2326004Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_weighted_int2_codegen_cuda.cu(292): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:55:26.2327847Z if (output_d >= 0 && output_d < D) { 2025-05-07T19:55:26.2328377Z ^ 2025-05-07T19:55:26.2328917Z 2025-05-07T19:55:26.2330454Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_weighted_int2_codegen_cuda.cu(304): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:55:26.2332249Z if (output_d >= 0 && output_d < D) { 2025-05-07T19:55:26.2332762Z ^ 2025-05-07T19:55:26.2333036Z 2025-05-07T19:55:27.1245209Z [238/608] /github/home/miniconda/envs/build_binary/bin/x86_64-conda-linux-gnu-c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/lib -O3 -DNDEBUG -std=c++20 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -mavx2 -mf16c -mfma -fopenmp -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_sgd_split_cpu.cpp.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_sgd_split_cpu.cpp.o.d -o CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_sgd_split_cpu.cpp.o -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_backward_sgd_split_cpu.cpp 2025-05-07T19:55:27.6693088Z [239/608] /github/home/miniconda/envs/build_binary/bin/x86_64-conda-linux-gnu-c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/lib -O3 -DNDEBUG -std=c++20 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -mavx2 -mf16c -mfma -fopenmp -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_rowwise_adagrad_split_cpu.cpp.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_rowwise_adagrad_split_cpu.cpp.o.d -o CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_rowwise_adagrad_split_cpu.cpp.o -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_backward_rowwise_adagrad_split_cpu.cpp 2025-05-07T19:55:33.8684150Z [240/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_inference_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -gencode arch=compute_90,code=sm_90 -gencode arch=compute_90a,code=sm_90a -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++20 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_tbe_inference.dir/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_int2_codegen_cuda.cu.o -MF CMakeFiles/fbgemm_gpu_tbe_inference.dir/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_int2_codegen_cuda.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_int2_codegen_cuda.cu -o CMakeFiles/fbgemm_gpu_tbe_inference.dir/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_int2_codegen_cuda.cu.o 2025-05-07T19:55:33.8707591Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_int2_codegen_cuda.cu(254): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:55:33.8709607Z if (output_d >= 0 && output_d < D && packed_bag_store_idx < num_packed_bags) { 2025-05-07T19:55:33.8710306Z ^ 2025-05-07T19:55:33.8710615Z 2025-05-07T19:55:33.8711045Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:55:33.8711677Z 2025-05-07T19:55:33.8713209Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_int2_codegen_cuda.cu(259): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:55:33.8715135Z if (output_d >= 0 && output_d < D) { 2025-05-07T19:55:33.8715705Z ^ 2025-05-07T19:55:33.8716002Z 2025-05-07T19:55:33.8717698Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_int2_codegen_cuda.cu(276): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:55:33.8719642Z if (output_d >= 0 && output_d < D) { 2025-05-07T19:55:33.8720205Z ^ 2025-05-07T19:55:33.8720485Z 2025-05-07T19:55:33.8722023Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_int2_codegen_cuda.cu(288): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:55:33.8724005Z if (output_d >= 0 && output_d < D) { 2025-05-07T19:55:33.8724571Z ^ 2025-05-07T19:55:33.8724840Z 2025-05-07T19:55:33.8726220Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_int2_codegen_cuda.cu(254): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:55:33.8728895Z if (output_d >= 0 && output_d < D && packed_bag_store_idx < num_packed_bags) { 2025-05-07T19:55:33.8729569Z ^ 2025-05-07T19:55:33.8729897Z 2025-05-07T19:55:33.8730265Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:55:33.8730772Z 2025-05-07T19:55:33.8732209Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_int2_codegen_cuda.cu(259): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:55:33.8734075Z if (output_d >= 0 && output_d < D) { 2025-05-07T19:55:33.8734623Z ^ 2025-05-07T19:55:33.8734867Z 2025-05-07T19:55:33.8736180Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_int2_codegen_cuda.cu(276): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:55:33.8738205Z if (output_d >= 0 && output_d < D) { 2025-05-07T19:55:33.8738742Z ^ 2025-05-07T19:55:33.8739000Z 2025-05-07T19:55:33.8740342Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_int2_codegen_cuda.cu(288): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:55:33.8742225Z if (output_d >= 0 && output_d < D) { 2025-05-07T19:55:33.8742747Z ^ 2025-05-07T19:55:33.8742997Z 2025-05-07T19:55:33.8744458Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_int2_codegen_cuda.cu(254): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:55:33.8746587Z if (output_d >= 0 && output_d < D && packed_bag_store_idx < num_packed_bags) { 2025-05-07T19:55:33.8747360Z ^ 2025-05-07T19:55:33.8747672Z 2025-05-07T19:55:33.8748114Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:55:33.8748775Z 2025-05-07T19:55:33.8750357Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_int2_codegen_cuda.cu(259): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:55:33.8752329Z if (output_d >= 0 && output_d < D) { 2025-05-07T19:55:33.8752903Z ^ 2025-05-07T19:55:33.8753169Z 2025-05-07T19:55:33.8754754Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_int2_codegen_cuda.cu(276): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:55:33.8756534Z if (output_d >= 0 && output_d < D) { 2025-05-07T19:55:33.8757113Z ^ 2025-05-07T19:55:33.8757397Z 2025-05-07T19:55:33.8758945Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_int2_codegen_cuda.cu(288): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:55:33.8760767Z if (output_d >= 0 && output_d < D) { 2025-05-07T19:55:33.8761316Z ^ 2025-05-07T19:55:33.8761576Z 2025-05-07T19:55:33.8763102Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_int2_codegen_cuda.cu(254): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:55:33.8765202Z if (output_d >= 0 && output_d < D && packed_bag_store_idx < num_packed_bags) { 2025-05-07T19:55:33.8765905Z ^ 2025-05-07T19:55:33.8766209Z 2025-05-07T19:55:33.8766630Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:55:33.8767246Z 2025-05-07T19:55:33.8771600Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_int2_codegen_cuda.cu(259): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:55:33.8773711Z if (output_d >= 0 && output_d < D) { 2025-05-07T19:55:33.8774302Z ^ 2025-05-07T19:55:33.8774591Z 2025-05-07T19:55:33.8776148Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_int2_codegen_cuda.cu(276): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:55:33.8778030Z if (output_d >= 0 && output_d < D) { 2025-05-07T19:55:33.8778585Z ^ 2025-05-07T19:55:33.8778845Z 2025-05-07T19:55:33.8780384Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_int2_codegen_cuda.cu(288): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:55:33.8782471Z if (output_d >= 0 && output_d < D) { 2025-05-07T19:55:33.8782998Z ^ 2025-05-07T19:55:33.8783241Z 2025-05-07T19:55:33.8784734Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_int2_codegen_cuda.cu(254): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:55:33.8786832Z if (output_d >= 0 && output_d < D && packed_bag_store_idx < num_packed_bags) { 2025-05-07T19:55:33.8787532Z ^ 2025-05-07T19:55:33.8787833Z 2025-05-07T19:55:33.8788285Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:55:33.8788919Z 2025-05-07T19:55:33.8790497Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_int2_codegen_cuda.cu(259): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:55:33.8792401Z if (output_d >= 0 && output_d < D) { 2025-05-07T19:55:33.8792971Z ^ 2025-05-07T19:55:33.8793246Z 2025-05-07T19:55:33.8794780Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_int2_codegen_cuda.cu(276): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:55:33.8796718Z if (output_d >= 0 && output_d < D) { 2025-05-07T19:55:33.8797273Z ^ 2025-05-07T19:55:33.8797543Z 2025-05-07T19:55:33.8799056Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_int2_codegen_cuda.cu(288): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:55:33.8800973Z if (output_d >= 0 && output_d < D) { 2025-05-07T19:55:33.8801510Z ^ 2025-05-07T19:55:33.8801797Z 2025-05-07T19:55:43.9850590Z [241/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_inference_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -gencode arch=compute_90,code=sm_90 -gencode arch=compute_90a,code=sm_90a -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++20 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_tbe_inference.dir/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_nobag_fp8_codegen_cuda.cu.o -MF CMakeFiles/fbgemm_gpu_tbe_inference.dir/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_nobag_fp8_codegen_cuda.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_nobag_fp8_codegen_cuda.cu -o CMakeFiles/fbgemm_gpu_tbe_inference.dir/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_nobag_fp8_codegen_cuda.cu.o 2025-05-07T19:55:43.9874593Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_nobag_fp8_codegen_cuda.cu(219): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:55:43.9876532Z if (output_d >= 0 && output_d < D) { 2025-05-07T19:55:43.9877125Z ^ 2025-05-07T19:55:43.9877422Z 2025-05-07T19:55:43.9877992Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:55:43.9878638Z 2025-05-07T19:55:43.9880185Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_nobag_fp8_codegen_cuda.cu(236): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:55:43.9882204Z if (output_d >= 0 && output_d < D) { 2025-05-07T19:55:43.9882748Z ^ 2025-05-07T19:55:43.9883066Z 2025-05-07T19:55:43.9884539Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_nobag_fp8_codegen_cuda.cu(246): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:55:43.9886506Z if (output_d >= 0 && output_d < D) { 2025-05-07T19:55:43.9887058Z ^ 2025-05-07T19:55:43.9887389Z 2025-05-07T19:55:43.9888995Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_nobag_fp8_codegen_cuda.cu(219): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:55:43.9891081Z if (output_d >= 0 && output_d < D) { 2025-05-07T19:55:43.9891630Z ^ 2025-05-07T19:55:43.9891931Z 2025-05-07T19:55:43.9892403Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:55:43.9893068Z 2025-05-07T19:55:43.9894650Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_nobag_fp8_codegen_cuda.cu(236): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:55:43.9896632Z if (output_d >= 0 && output_d < D) { 2025-05-07T19:55:43.9897163Z ^ 2025-05-07T19:55:43.9897453Z 2025-05-07T19:55:43.9899354Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_nobag_fp8_codegen_cuda.cu(246): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:55:43.9901411Z if (output_d >= 0 && output_d < D) { 2025-05-07T19:55:43.9901977Z ^ 2025-05-07T19:55:43.9902300Z 2025-05-07T19:55:43.9903913Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_nobag_fp8_codegen_cuda.cu(219): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:55:43.9905947Z if (output_d >= 0 && output_d < D) { 2025-05-07T19:55:43.9906501Z ^ 2025-05-07T19:55:43.9906797Z 2025-05-07T19:55:43.9907254Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:55:43.9907898Z 2025-05-07T19:55:43.9909497Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_nobag_fp8_codegen_cuda.cu(236): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:55:43.9911578Z if (output_d >= 0 && output_d < D) { 2025-05-07T19:55:43.9912152Z ^ 2025-05-07T19:55:43.9912444Z 2025-05-07T19:55:43.9914045Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_nobag_fp8_codegen_cuda.cu(246): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:55:43.9915855Z if (output_d >= 0 && output_d < D) { 2025-05-07T19:55:43.9916334Z ^ 2025-05-07T19:55:43.9916642Z 2025-05-07T19:55:43.9918051Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_nobag_fp8_codegen_cuda.cu(219): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:55:43.9919998Z if (output_d >= 0 && output_d < D) { 2025-05-07T19:55:43.9920555Z ^ 2025-05-07T19:55:43.9920845Z 2025-05-07T19:55:43.9921273Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:55:43.9921921Z 2025-05-07T19:55:43.9923488Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_nobag_fp8_codegen_cuda.cu(236): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:55:43.9925485Z if (output_d >= 0 && output_d < D) { 2025-05-07T19:55:43.9926049Z ^ 2025-05-07T19:55:43.9926339Z 2025-05-07T19:55:43.9927843Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_nobag_fp8_codegen_cuda.cu(246): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:55:43.9930287Z if (output_d >= 0 && output_d < D) { 2025-05-07T19:55:43.9930874Z ^ 2025-05-07T19:55:43.9931175Z 2025-05-07T19:55:43.9932650Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_nobag_fp8_codegen_cuda.cu(219): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:55:43.9934554Z if (output_d >= 0 && output_d < D) { 2025-05-07T19:55:43.9935104Z ^ 2025-05-07T19:55:43.9935432Z 2025-05-07T19:55:43.9935874Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:55:43.9936552Z 2025-05-07T19:55:43.9938094Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_nobag_fp8_codegen_cuda.cu(236): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:55:43.9940061Z if (output_d >= 0 && output_d < D) { 2025-05-07T19:55:43.9940637Z ^ 2025-05-07T19:55:43.9941225Z 2025-05-07T19:55:43.9942873Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_nobag_fp8_codegen_cuda.cu(246): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:55:43.9944899Z if (output_d >= 0 && output_d < D) { 2025-05-07T19:55:43.9945479Z ^ 2025-05-07T19:55:43.9945782Z 2025-05-07T19:55:49.3536135Z [242/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_forward_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -gencode arch=compute_90,code=sm_90 -gencode arch=compute_90a,code=sm_90a -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++20 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_forward.dir/gen_embedding_forward_split_weighted_kernel.cu.o -MF CMakeFiles/fbgemm_gpu_tbe_training_forward.dir/gen_embedding_forward_split_weighted_kernel.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_forward_split_weighted_kernel.cu -o CMakeFiles/fbgemm_gpu_tbe_training_forward.dir/gen_embedding_forward_split_weighted_kernel.cu.o 2025-05-07T19:55:58.8900547Z [243/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_forward_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -gencode arch=compute_90,code=sm_90 -gencode arch=compute_90a,code=sm_90a -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++20 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_forward.dir/gen_embedding_forward_split_unweighted_kernel.cu.o -MF CMakeFiles/fbgemm_gpu_tbe_training_forward.dir/gen_embedding_forward_split_unweighted_kernel.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_forward_split_unweighted_kernel.cu -o CMakeFiles/fbgemm_gpu_tbe_training_forward.dir/gen_embedding_forward_split_unweighted_kernel.cu.o 2025-05-07T19:56:06.0385493Z [244/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_forward_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -gencode arch=compute_90,code=sm_90 -gencode arch=compute_90a,code=sm_90a -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++20 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_forward.dir/gen_embedding_forward_split_unweighted_v2_kernel.cu.o -MF CMakeFiles/fbgemm_gpu_tbe_training_forward.dir/gen_embedding_forward_split_unweighted_v2_kernel.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_forward_split_unweighted_v2_kernel.cu -o CMakeFiles/fbgemm_gpu_tbe_training_forward.dir/gen_embedding_forward_split_unweighted_v2_kernel.cu.o 2025-05-07T19:56:06.0412113Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_forward_split_unweighted_v2_kernel.cu(873): warning #68-D: integer conversion resulted in a change of sign 2025-05-07T19:56:06.0413907Z reinterpret_cast(&lxu_cache_weights[cache_idx * max_D_cache]); 2025-05-07T19:56:06.0414584Z ^ 2025-05-07T19:56:06.0417213Z detected during instantiation of "void split_embedding_codegen_forward_unweighted_v2_kernel(const emb_t *, const emb_t *, const cache_t *, const int32_t *, uint32_t, uint32_t, bool, uint32_t, fbgemm_gpu::FixedDivisor, const index_t *, const index_t *, const uint32_t *, const int64_t *, const int32_t *, output_t *) [with emb_t=float, cache_t=float, output_t=float, index_t=int32_t, USE_LXU_CACHE=false]" at line 946 2025-05-07T19:56:06.0420360Z 2025-05-07T19:56:06.0420800Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:56:06.0421428Z 2025-05-07T19:56:06.0422650Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_forward_split_unweighted_v2_kernel.cu(873): warning #68-D: integer conversion resulted in a change of sign 2025-05-07T19:56:06.0424437Z reinterpret_cast(&lxu_cache_weights[cache_idx * max_D_cache]); 2025-05-07T19:56:06.0425268Z ^ 2025-05-07T19:56:06.0428755Z detected during instantiation of "void split_embedding_codegen_forward_unweighted_v2_kernel(const emb_t *, const emb_t *, const cache_t *, const int32_t *, uint32_t, uint32_t, bool, uint32_t, fbgemm_gpu::FixedDivisor, const index_t *, const index_t *, const uint32_t *, const int64_t *, const int32_t *, output_t *) [with emb_t=float, cache_t=c10::Half, output_t=float, index_t=int32_t, USE_LXU_CACHE=false]" at line 996 2025-05-07T19:56:06.0431936Z 2025-05-07T19:56:06.0433240Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_forward_split_unweighted_v2_kernel.cu(873): warning #68-D: integer conversion resulted in a change of sign 2025-05-07T19:56:06.0435228Z reinterpret_cast(&lxu_cache_weights[cache_idx * max_D_cache]); 2025-05-07T19:56:06.0436126Z ^ 2025-05-07T19:56:06.0439532Z detected during instantiation of "void split_embedding_codegen_forward_unweighted_v2_kernel(const emb_t *, const emb_t *, const cache_t *, const int32_t *, uint32_t, uint32_t, bool, uint32_t, fbgemm_gpu::FixedDivisor, const index_t *, const index_t *, const uint32_t *, const int64_t *, const int32_t *, output_t *) [with emb_t=c10::Half, cache_t=float, output_t=float, index_t=int32_t, USE_LXU_CACHE=false]" at line 1046 2025-05-07T19:56:06.0442717Z 2025-05-07T19:56:06.0444044Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_forward_split_unweighted_v2_kernel.cu(873): warning #68-D: integer conversion resulted in a change of sign 2025-05-07T19:56:06.0446032Z reinterpret_cast(&lxu_cache_weights[cache_idx * max_D_cache]); 2025-05-07T19:56:06.0446910Z ^ 2025-05-07T19:56:06.0450221Z detected during instantiation of "void split_embedding_codegen_forward_unweighted_v2_kernel(const emb_t *, const emb_t *, const cache_t *, const int32_t *, uint32_t, uint32_t, bool, uint32_t, fbgemm_gpu::FixedDivisor, const index_t *, const index_t *, const uint32_t *, const int64_t *, const int32_t *, output_t *) [with emb_t=c10::Half, cache_t=c10::Half, output_t=float, index_t=int32_t, USE_LXU_CACHE=false]" at line 1096 2025-05-07T19:56:06.0453513Z 2025-05-07T19:56:06.0455137Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_forward_split_unweighted_v2_kernel.cu(873): warning #68-D: integer conversion resulted in a change of sign 2025-05-07T19:56:06.0457148Z reinterpret_cast(&lxu_cache_weights[cache_idx * max_D_cache]); 2025-05-07T19:56:06.0458034Z ^ 2025-05-07T19:56:06.0461442Z detected during instantiation of "void split_embedding_codegen_forward_unweighted_v2_kernel(const emb_t *, const emb_t *, const cache_t *, const int32_t *, uint32_t, uint32_t, bool, uint32_t, fbgemm_gpu::FixedDivisor, const index_t *, const index_t *, const uint32_t *, const int64_t *, const int32_t *, output_t *) [with emb_t=float, cache_t=float, output_t=float, index_t=int64_t, USE_LXU_CACHE=false]" at line 1146 2025-05-07T19:56:06.0464826Z 2025-05-07T19:56:06.0466160Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_forward_split_unweighted_v2_kernel.cu(873): warning #68-D: integer conversion resulted in a change of sign 2025-05-07T19:56:06.0468208Z reinterpret_cast(&lxu_cache_weights[cache_idx * max_D_cache]); 2025-05-07T19:56:06.0469118Z ^ 2025-05-07T19:56:06.0472538Z detected during instantiation of "void split_embedding_codegen_forward_unweighted_v2_kernel(const emb_t *, const emb_t *, const cache_t *, const int32_t *, uint32_t, uint32_t, bool, uint32_t, fbgemm_gpu::FixedDivisor, const index_t *, const index_t *, const uint32_t *, const int64_t *, const int32_t *, output_t *) [with emb_t=float, cache_t=c10::Half, output_t=float, index_t=int64_t, USE_LXU_CACHE=false]" at line 1196 2025-05-07T19:56:06.0475739Z 2025-05-07T19:56:06.0476901Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_forward_split_unweighted_v2_kernel.cu(873): warning #68-D: integer conversion resulted in a change of sign 2025-05-07T19:56:06.0478822Z reinterpret_cast(&lxu_cache_weights[cache_idx * max_D_cache]); 2025-05-07T19:56:06.0479683Z ^ 2025-05-07T19:56:06.0482998Z detected during instantiation of "void split_embedding_codegen_forward_unweighted_v2_kernel(const emb_t *, const emb_t *, const cache_t *, const int32_t *, uint32_t, uint32_t, bool, uint32_t, fbgemm_gpu::FixedDivisor, const index_t *, const index_t *, const uint32_t *, const int64_t *, const int32_t *, output_t *) [with emb_t=c10::Half, cache_t=float, output_t=float, index_t=int64_t, USE_LXU_CACHE=false]" at line 1246 2025-05-07T19:56:06.0486114Z 2025-05-07T19:56:06.0487423Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_forward_split_unweighted_v2_kernel.cu(873): warning #68-D: integer conversion resulted in a change of sign 2025-05-07T19:56:06.0489389Z reinterpret_cast(&lxu_cache_weights[cache_idx * max_D_cache]); 2025-05-07T19:56:06.0490262Z ^ 2025-05-07T19:56:06.0492977Z detected during instantiation of "void split_embedding_codegen_forward_unweighted_v2_kernel(const emb_t *, const emb_t *, const cache_t *, const int32_t *, uint32_t, uint32_t, bool, uint32_t, fbgemm_gpu::FixedDivisor, const index_t *, const index_t *, const uint32_t *, const int64_t *, const int32_t *, output_t *) [with emb_t=c10::Half, cache_t=c10::Half, output_t=float, index_t=int64_t, USE_LXU_CACHE=false]" at line 1296 2025-05-07T19:56:06.0495574Z 2025-05-07T19:56:06.0497049Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_forward_split_unweighted_v2_kernel.cu(873): warning #68-D: integer conversion resulted in a change of sign 2025-05-07T19:56:06.0498897Z reinterpret_cast(&lxu_cache_weights[cache_idx * max_D_cache]); 2025-05-07T19:56:06.0499749Z ^ 2025-05-07T19:56:06.0503004Z detected during instantiation of "void split_embedding_codegen_forward_unweighted_v2_kernel(const emb_t *, const emb_t *, const cache_t *, const int32_t *, uint32_t, uint32_t, bool, uint32_t, fbgemm_gpu::FixedDivisor, const index_t *, const index_t *, const uint32_t *, const int64_t *, const int32_t *, output_t *) [with emb_t=float, cache_t=float, output_t=c10::Half, index_t=int32_t, USE_LXU_CACHE=false]" at line 1346 2025-05-07T19:56:06.0506101Z 2025-05-07T19:56:06.0507427Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_forward_split_unweighted_v2_kernel.cu(873): warning #68-D: integer conversion resulted in a change of sign 2025-05-07T19:56:06.0509556Z reinterpret_cast(&lxu_cache_weights[cache_idx * max_D_cache]); 2025-05-07T19:56:06.0510420Z ^ 2025-05-07T19:56:06.0513402Z detected during instantiation of "void split_embedding_codegen_forward_unweighted_v2_kernel(const emb_t *, const emb_t *, const cache_t *, const int32_t *, uint32_t, uint32_t, bool, uint32_t, fbgemm_gpu::FixedDivisor, const index_t *, const index_t *, const uint32_t *, const int64_t *, const int32_t *, output_t *) [with emb_t=float, cache_t=c10::Half, output_t=c10::Half, index_t=int32_t, USE_LXU_CACHE=false]" at line 1396 2025-05-07T19:56:06.0516376Z 2025-05-07T19:56:06.0517688Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_forward_split_unweighted_v2_kernel.cu(873): warning #68-D: integer conversion resulted in a change of sign 2025-05-07T19:56:06.0519728Z reinterpret_cast(&lxu_cache_weights[cache_idx * max_D_cache]); 2025-05-07T19:56:06.0520663Z ^ 2025-05-07T19:56:06.0524091Z detected during instantiation of "void split_embedding_codegen_forward_unweighted_v2_kernel(const emb_t *, const emb_t *, const cache_t *, const int32_t *, uint32_t, uint32_t, bool, uint32_t, fbgemm_gpu::FixedDivisor, const index_t *, const index_t *, const uint32_t *, const int64_t *, const int32_t *, output_t *) [with emb_t=c10::Half, cache_t=float, output_t=c10::Half, index_t=int32_t, USE_LXU_CACHE=false]" at line 1446 2025-05-07T19:56:06.0527308Z 2025-05-07T19:56:06.0528835Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_forward_split_unweighted_v2_kernel.cu(873): warning #68-D: integer conversion resulted in a change of sign 2025-05-07T19:56:06.0531012Z reinterpret_cast(&lxu_cache_weights[cache_idx * max_D_cache]); 2025-05-07T19:56:06.0531908Z ^ 2025-05-07T19:56:06.0535328Z detected during instantiation of "void split_embedding_codegen_forward_unweighted_v2_kernel(const emb_t *, const emb_t *, const cache_t *, const int32_t *, uint32_t, uint32_t, bool, uint32_t, fbgemm_gpu::FixedDivisor, const index_t *, const index_t *, const uint32_t *, const int64_t *, const int32_t *, output_t *) [with emb_t=c10::Half, cache_t=c10::Half, output_t=c10::Half, index_t=int32_t, USE_LXU_CACHE=false]" at line 1496 2025-05-07T19:56:06.0538542Z 2025-05-07T19:56:06.0539835Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_forward_split_unweighted_v2_kernel.cu(873): warning #68-D: integer conversion resulted in a change of sign 2025-05-07T19:56:06.0542140Z reinterpret_cast(&lxu_cache_weights[cache_idx * max_D_cache]); 2025-05-07T19:56:06.0543060Z ^ 2025-05-07T19:56:06.0546374Z detected during instantiation of "void split_embedding_codegen_forward_unweighted_v2_kernel(const emb_t *, const emb_t *, const cache_t *, const int32_t *, uint32_t, uint32_t, bool, uint32_t, fbgemm_gpu::FixedDivisor, const index_t *, const index_t *, const uint32_t *, const int64_t *, const int32_t *, output_t *) [with emb_t=float, cache_t=float, output_t=c10::Half, index_t=int64_t, USE_LXU_CACHE=false]" at line 1546 2025-05-07T19:56:06.0549477Z 2025-05-07T19:56:06.0550793Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_forward_split_unweighted_v2_kernel.cu(873): warning #68-D: integer conversion resulted in a change of sign 2025-05-07T19:56:06.0553014Z reinterpret_cast(&lxu_cache_weights[cache_idx * max_D_cache]); 2025-05-07T19:56:06.0553883Z ^ 2025-05-07T19:56:06.0557035Z detected during instantiation of "void split_embedding_codegen_forward_unweighted_v2_kernel(const emb_t *, const emb_t *, const cache_t *, const int32_t *, uint32_t, uint32_t, bool, uint32_t, fbgemm_gpu::FixedDivisor, const index_t *, const index_t *, const uint32_t *, const int64_t *, const int32_t *, output_t *) [with emb_t=float, cache_t=c10::Half, output_t=c10::Half, index_t=int64_t, USE_LXU_CACHE=false]" at line 1596 2025-05-07T19:56:06.0559937Z 2025-05-07T19:56:06.0560917Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_forward_split_unweighted_v2_kernel.cu(873): warning #68-D: integer conversion resulted in a change of sign 2025-05-07T19:56:06.0562630Z reinterpret_cast(&lxu_cache_weights[cache_idx * max_D_cache]); 2025-05-07T19:56:06.0563516Z ^ 2025-05-07T19:56:06.0566876Z detected during instantiation of "void split_embedding_codegen_forward_unweighted_v2_kernel(const emb_t *, const emb_t *, const cache_t *, const int32_t *, uint32_t, uint32_t, bool, uint32_t, fbgemm_gpu::FixedDivisor, const index_t *, const index_t *, const uint32_t *, const int64_t *, const int32_t *, output_t *) [with emb_t=c10::Half, cache_t=float, output_t=c10::Half, index_t=int64_t, USE_LXU_CACHE=false]" at line 1646 2025-05-07T19:56:06.0569963Z 2025-05-07T19:56:06.0571385Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_forward_split_unweighted_v2_kernel.cu(873): warning #68-D: integer conversion resulted in a change of sign 2025-05-07T19:56:06.0573324Z reinterpret_cast(&lxu_cache_weights[cache_idx * max_D_cache]); 2025-05-07T19:56:06.0574203Z ^ 2025-05-07T19:56:06.0577526Z detected during instantiation of "void split_embedding_codegen_forward_unweighted_v2_kernel(const emb_t *, const emb_t *, const cache_t *, const int32_t *, uint32_t, uint32_t, bool, uint32_t, fbgemm_gpu::FixedDivisor, const index_t *, const index_t *, const uint32_t *, const int64_t *, const int32_t *, output_t *) [with emb_t=c10::Half, cache_t=c10::Half, output_t=c10::Half, index_t=int64_t, USE_LXU_CACHE=false]" at line 1696 2025-05-07T19:56:06.0580590Z 2025-05-07T19:56:06.0581865Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_forward_split_unweighted_v2_kernel.cu(873): warning #68-D: integer conversion resulted in a change of sign 2025-05-07T19:56:06.0583746Z reinterpret_cast(&lxu_cache_weights[cache_idx * max_D_cache]); 2025-05-07T19:56:06.0584878Z ^ 2025-05-07T19:56:06.0588150Z detected during instantiation of "void split_embedding_codegen_forward_unweighted_v2_kernel(const emb_t *, const emb_t *, const cache_t *, const int32_t *, uint32_t, uint32_t, bool, uint32_t, fbgemm_gpu::FixedDivisor, const index_t *, const index_t *, const uint32_t *, const int64_t *, const int32_t *, output_t *) [with emb_t=float, cache_t=float, output_t=c10::BFloat16, index_t=int32_t, USE_LXU_CACHE=false]" at line 1746 2025-05-07T19:56:06.0591176Z 2025-05-07T19:56:06.0592323Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_forward_split_unweighted_v2_kernel.cu(873): warning #68-D: integer conversion resulted in a change of sign 2025-05-07T19:56:06.0594033Z reinterpret_cast(&lxu_cache_weights[cache_idx * max_D_cache]); 2025-05-07T19:56:06.0595090Z ^ 2025-05-07T19:56:06.0598464Z detected during instantiation of "void split_embedding_codegen_forward_unweighted_v2_kernel(const emb_t *, const emb_t *, const cache_t *, const int32_t *, uint32_t, uint32_t, bool, uint32_t, fbgemm_gpu::FixedDivisor, const index_t *, const index_t *, const uint32_t *, const int64_t *, const int32_t *, output_t *) [with emb_t=float, cache_t=c10::Half, output_t=c10::BFloat16, index_t=int32_t, USE_LXU_CACHE=false]" at line 1796 2025-05-07T19:56:06.0601629Z 2025-05-07T19:56:06.0602911Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_forward_split_unweighted_v2_kernel.cu(873): warning #68-D: integer conversion resulted in a change of sign 2025-05-07T19:56:06.0604909Z reinterpret_cast(&lxu_cache_weights[cache_idx * max_D_cache]); 2025-05-07T19:56:06.0605797Z ^ 2025-05-07T19:56:06.0609198Z detected during instantiation of "void split_embedding_codegen_forward_unweighted_v2_kernel(const emb_t *, const emb_t *, const cache_t *, const int32_t *, uint32_t, uint32_t, bool, uint32_t, fbgemm_gpu::FixedDivisor, const index_t *, const index_t *, const uint32_t *, const int64_t *, const int32_t *, output_t *) [with emb_t=c10::Half, cache_t=float, output_t=c10::BFloat16, index_t=int32_t, USE_LXU_CACHE=false]" at line 1846 2025-05-07T19:56:06.0612540Z 2025-05-07T19:56:06.0613859Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_forward_split_unweighted_v2_kernel.cu(873): warning #68-D: integer conversion resulted in a change of sign 2025-05-07T19:56:06.0615825Z reinterpret_cast(&lxu_cache_weights[cache_idx * max_D_cache]); 2025-05-07T19:56:06.0616684Z ^ 2025-05-07T19:56:06.0620119Z detected during instantiation of "void split_embedding_codegen_forward_unweighted_v2_kernel(const emb_t *, const emb_t *, const cache_t *, const int32_t *, uint32_t, uint32_t, bool, uint32_t, fbgemm_gpu::FixedDivisor, const index_t *, const index_t *, const uint32_t *, const int64_t *, const int32_t *, output_t *) [with emb_t=c10::Half, cache_t=c10::Half, output_t=c10::BFloat16, index_t=int32_t, USE_LXU_CACHE=false]" at line 1896 2025-05-07T19:56:06.0623288Z 2025-05-07T19:56:06.0624554Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_forward_split_unweighted_v2_kernel.cu(873): warning #68-D: integer conversion resulted in a change of sign 2025-05-07T19:56:06.0626562Z reinterpret_cast(&lxu_cache_weights[cache_idx * max_D_cache]); 2025-05-07T19:56:06.0627414Z ^ 2025-05-07T19:56:06.0631277Z detected during instantiation of "void split_embedding_codegen_forward_unweighted_v2_kernel(const emb_t *, const emb_t *, const cache_t *, const int32_t *, uint32_t, uint32_t, bool, uint32_t, fbgemm_gpu::FixedDivisor, const index_t *, const index_t *, const uint32_t *, const int64_t *, const int32_t *, output_t *) [with emb_t=float, cache_t=float, output_t=c10::BFloat16, index_t=int64_t, USE_LXU_CACHE=false]" at line 1946 2025-05-07T19:56:06.0634420Z 2025-05-07T19:56:06.0635749Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_forward_split_unweighted_v2_kernel.cu(873): warning #68-D: integer conversion resulted in a change of sign 2025-05-07T19:56:06.0637716Z reinterpret_cast(&lxu_cache_weights[cache_idx * max_D_cache]); 2025-05-07T19:56:06.0638584Z ^ 2025-05-07T19:56:06.0641881Z detected during instantiation of "void split_embedding_codegen_forward_unweighted_v2_kernel(const emb_t *, const emb_t *, const cache_t *, const int32_t *, uint32_t, uint32_t, bool, uint32_t, fbgemm_gpu::FixedDivisor, const index_t *, const index_t *, const uint32_t *, const int64_t *, const int32_t *, output_t *) [with emb_t=float, cache_t=c10::Half, output_t=c10::BFloat16, index_t=int64_t, USE_LXU_CACHE=false]" at line 1996 2025-05-07T19:56:06.0644838Z 2025-05-07T19:56:06.0646012Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_forward_split_unweighted_v2_kernel.cu(873): warning #68-D: integer conversion resulted in a change of sign 2025-05-07T19:56:06.0647743Z reinterpret_cast(&lxu_cache_weights[cache_idx * max_D_cache]); 2025-05-07T19:56:06.0648473Z ^ 2025-05-07T19:56:06.0651886Z detected during instantiation of "void split_embedding_codegen_forward_unweighted_v2_kernel(const emb_t *, const emb_t *, const cache_t *, const int32_t *, uint32_t, uint32_t, bool, uint32_t, fbgemm_gpu::FixedDivisor, const index_t *, const index_t *, const uint32_t *, const int64_t *, const int32_t *, output_t *) [with emb_t=c10::Half, cache_t=float, output_t=c10::BFloat16, index_t=int64_t, USE_LXU_CACHE=false]" at line 2046 2025-05-07T19:56:06.0654910Z 2025-05-07T19:56:06.0656097Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_forward_split_unweighted_v2_kernel.cu(873): warning #68-D: integer conversion resulted in a change of sign 2025-05-07T19:56:06.0657829Z reinterpret_cast(&lxu_cache_weights[cache_idx * max_D_cache]); 2025-05-07T19:56:06.0658719Z ^ 2025-05-07T19:56:06.0662013Z detected during instantiation of "void split_embedding_codegen_forward_unweighted_v2_kernel(const emb_t *, const emb_t *, const cache_t *, const int32_t *, uint32_t, uint32_t, bool, uint32_t, fbgemm_gpu::FixedDivisor, const index_t *, const index_t *, const uint32_t *, const int64_t *, const int32_t *, output_t *) [with emb_t=c10::Half, cache_t=c10::Half, output_t=c10::BFloat16, index_t=int64_t, USE_LXU_CACHE=false]" at line 2096 2025-05-07T19:56:06.0665084Z 2025-05-07T19:56:06.0666295Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_forward_split_unweighted_v2_kernel.cu(873): warning #68-D: integer conversion resulted in a change of sign 2025-05-07T19:56:06.0668184Z reinterpret_cast(&lxu_cache_weights[cache_idx * max_D_cache]); 2025-05-07T19:56:06.0669058Z ^ 2025-05-07T19:56:06.0672710Z detected during instantiation of "void split_embedding_codegen_forward_unweighted_v2_kernel(const emb_t *, const emb_t *, const cache_t *, const int32_t *, uint32_t, uint32_t, __nv_bool, uint32_t, fbgemm_gpu::FixedDivisor, const index_t *, const index_t *, const uint32_t *, const int64_t *, const int32_t *, output_t *) [with emb_t=float, cache_t=float, output_t=float, index_t=int32_t, USE_LXU_CACHE=false]" at line 946 2025-05-07T19:56:06.0675288Z 2025-05-07T19:56:06.0675695Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:56:06.0676216Z 2025-05-07T19:56:06.0677200Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_forward_split_unweighted_v2_kernel.cu(873): warning #68-D: integer conversion resulted in a change of sign 2025-05-07T19:56:06.0678721Z reinterpret_cast(&lxu_cache_weights[cache_idx * max_D_cache]); 2025-05-07T19:56:06.0679459Z ^ 2025-05-07T19:56:06.0682330Z detected during instantiation of "void split_embedding_codegen_forward_unweighted_v2_kernel(const emb_t *, const emb_t *, const cache_t *, const int32_t *, uint32_t, uint32_t, __nv_bool, uint32_t, fbgemm_gpu::FixedDivisor, const index_t *, const index_t *, const uint32_t *, const int64_t *, const int32_t *, output_t *) [with emb_t=float, cache_t=c10::Half, output_t=float, index_t=int32_t, USE_LXU_CACHE=false]" at line 996 2025-05-07T19:56:06.0684845Z 2025-05-07T19:56:06.0685873Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_forward_split_unweighted_v2_kernel.cu(873): warning #68-D: integer conversion resulted in a change of sign 2025-05-07T19:56:06.0687559Z reinterpret_cast(&lxu_cache_weights[cache_idx * max_D_cache]); 2025-05-07T19:56:06.0688383Z ^ 2025-05-07T19:56:06.0691594Z detected during instantiation of "void split_embedding_codegen_forward_unweighted_v2_kernel(const emb_t *, const emb_t *, const cache_t *, const int32_t *, uint32_t, uint32_t, __nv_bool, uint32_t, fbgemm_gpu::FixedDivisor, const index_t *, const index_t *, const uint32_t *, const int64_t *, const int32_t *, output_t *) [with emb_t=c10::Half, cache_t=float, output_t=float, index_t=int32_t, USE_LXU_CACHE=false]" at line 1046 2025-05-07T19:56:06.0694391Z 2025-05-07T19:56:06.0695553Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_forward_split_unweighted_v2_kernel.cu(873): warning #68-D: integer conversion resulted in a change of sign 2025-05-07T19:56:06.0697378Z reinterpret_cast(&lxu_cache_weights[cache_idx * max_D_cache]); 2025-05-07T19:56:06.0698217Z ^ 2025-05-07T19:56:06.0701051Z detected during instantiation of "void split_embedding_codegen_forward_unweighted_v2_kernel(const emb_t *, const emb_t *, const cache_t *, const int32_t *, uint32_t, uint32_t, __nv_bool, uint32_t, fbgemm_gpu::FixedDivisor, const index_t *, const index_t *, const uint32_t *, const int64_t *, const int32_t *, output_t *) [with emb_t=c10::Half, cache_t=c10::Half, output_t=float, index_t=int32_t, USE_LXU_CACHE=false]" at line 1096 2025-05-07T19:56:06.0703622Z 2025-05-07T19:56:06.0704675Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_forward_split_unweighted_v2_kernel.cu(873): warning #68-D: integer conversion resulted in a change of sign 2025-05-07T19:56:06.0706282Z reinterpret_cast(&lxu_cache_weights[cache_idx * max_D_cache]); 2025-05-07T19:56:06.0707055Z ^ 2025-05-07T19:56:06.0710446Z detected during instantiation of "void split_embedding_codegen_forward_unweighted_v2_kernel(const emb_t *, const emb_t *, const cache_t *, const int32_t *, uint32_t, uint32_t, __nv_bool, uint32_t, fbgemm_gpu::FixedDivisor, const index_t *, const index_t *, const uint32_t *, const int64_t *, const int32_t *, output_t *) [with emb_t=float, cache_t=float, output_t=float, index_t=int64_t, USE_LXU_CACHE=false]" at line 1146 2025-05-07T19:56:06.0713365Z 2025-05-07T19:56:06.0714569Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_forward_split_unweighted_v2_kernel.cu(873): warning #68-D: integer conversion resulted in a change of sign 2025-05-07T19:56:06.0716438Z reinterpret_cast(&lxu_cache_weights[cache_idx * max_D_cache]); 2025-05-07T19:56:06.0717207Z ^ 2025-05-07T19:56:06.0720321Z detected during instantiation of "void split_embedding_codegen_forward_unweighted_v2_kernel(const emb_t *, const emb_t *, const cache_t *, const int32_t *, uint32_t, uint32_t, __nv_bool, uint32_t, fbgemm_gpu::FixedDivisor, const index_t *, const index_t *, const uint32_t *, const int64_t *, const int32_t *, output_t *) [with emb_t=float, cache_t=c10::Half, output_t=float, index_t=int64_t, USE_LXU_CACHE=false]" at line 1196 2025-05-07T19:56:06.0723441Z 2025-05-07T19:56:06.0724682Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_forward_split_unweighted_v2_kernel.cu(873): warning #68-D: integer conversion resulted in a change of sign 2025-05-07T19:56:06.0726467Z reinterpret_cast(&lxu_cache_weights[cache_idx * max_D_cache]); 2025-05-07T19:56:06.0727243Z ^ 2025-05-07T19:56:06.0730694Z detected during instantiation of "void split_embedding_codegen_forward_unweighted_v2_kernel(const emb_t *, const emb_t *, const cache_t *, const int32_t *, uint32_t, uint32_t, __nv_bool, uint32_t, fbgemm_gpu::FixedDivisor, const index_t *, const index_t *, const uint32_t *, const int64_t *, const int32_t *, output_t *) [with emb_t=c10::Half, cache_t=float, output_t=float, index_t=int64_t, USE_LXU_CACHE=false]" at line 1246 2025-05-07T19:56:06.0733829Z 2025-05-07T19:56:06.0735040Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_forward_split_unweighted_v2_kernel.cu(873): warning #68-D: integer conversion resulted in a change of sign 2025-05-07T19:56:06.0736796Z reinterpret_cast(&lxu_cache_weights[cache_idx * max_D_cache]); 2025-05-07T19:56:06.0737744Z ^ 2025-05-07T19:56:06.0740813Z detected during instantiation of "void split_embedding_codegen_forward_unweighted_v2_kernel(const emb_t *, const emb_t *, const cache_t *, const int32_t *, uint32_t, uint32_t, __nv_bool, uint32_t, fbgemm_gpu::FixedDivisor, const index_t *, const index_t *, const uint32_t *, const int64_t *, const int32_t *, output_t *) [with emb_t=c10::Half, cache_t=c10::Half, output_t=float, index_t=int64_t, USE_LXU_CACHE=false]" at line 1296 2025-05-07T19:56:06.0743524Z 2025-05-07T19:56:06.0744832Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_forward_split_unweighted_v2_kernel.cu(873): warning #68-D: integer conversion resulted in a change of sign 2025-05-07T19:56:06.0746797Z reinterpret_cast(&lxu_cache_weights[cache_idx * max_D_cache]); 2025-05-07T19:56:06.0747701Z ^ 2025-05-07T19:56:06.0751452Z detected during instantiation of "void split_embedding_codegen_forward_unweighted_v2_kernel(const emb_t *, const emb_t *, const cache_t *, const int32_t *, uint32_t, uint32_t, __nv_bool, uint32_t, fbgemm_gpu::FixedDivisor, const index_t *, const index_t *, const uint32_t *, const int64_t *, const int32_t *, output_t *) [with emb_t=float, cache_t=float, output_t=c10::Half, index_t=int32_t, USE_LXU_CACHE=false]" at line 1346 2025-05-07T19:56:06.0754612Z 2025-05-07T19:56:06.0755896Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_forward_split_unweighted_v2_kernel.cu(873): warning #68-D: integer conversion resulted in a change of sign 2025-05-07T19:56:06.0757883Z reinterpret_cast(&lxu_cache_weights[cache_idx * max_D_cache]); 2025-05-07T19:56:06.0758785Z ^ 2025-05-07T19:56:06.0762234Z detected during instantiation of "void split_embedding_codegen_forward_unweighted_v2_kernel(const emb_t *, const emb_t *, const cache_t *, const int32_t *, uint32_t, uint32_t, __nv_bool, uint32_t, fbgemm_gpu::FixedDivisor, const index_t *, const index_t *, const uint32_t *, const int64_t *, const int32_t *, output_t *) [with emb_t=float, cache_t=c10::Half, output_t=c10::Half, index_t=int32_t, USE_LXU_CACHE=false]" at line 1396 2025-05-07T19:56:06.0765650Z 2025-05-07T19:56:06.0766971Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_forward_split_unweighted_v2_kernel.cu(873): warning #68-D: integer conversion resulted in a change of sign 2025-05-07T19:56:06.0768978Z reinterpret_cast(&lxu_cache_weights[cache_idx * max_D_cache]); 2025-05-07T19:56:06.0769888Z ^ 2025-05-07T19:56:06.0772899Z detected during instantiation of "void split_embedding_codegen_forward_unweighted_v2_kernel(const emb_t *, const emb_t *, const cache_t *, const int32_t *, uint32_t, uint32_t, __nv_bool, uint32_t, fbgemm_gpu::FixedDivisor, const index_t *, const index_t *, const uint32_t *, const int64_t *, const int32_t *, output_t *) [with emb_t=c10::Half, cache_t=float, output_t=c10::Half, index_t=int32_t, USE_LXU_CACHE=false]" at line 1446 2025-05-07T19:56:06.0775993Z 2025-05-07T19:56:06.0777325Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_forward_split_unweighted_v2_kernel.cu(873): warning #68-D: integer conversion resulted in a change of sign 2025-05-07T19:56:06.0779382Z reinterpret_cast(&lxu_cache_weights[cache_idx * max_D_cache]); 2025-05-07T19:56:06.0780283Z ^ 2025-05-07T19:56:06.0783675Z detected during instantiation of "void split_embedding_codegen_forward_unweighted_v2_kernel(const emb_t *, const emb_t *, const cache_t *, const int32_t *, uint32_t, uint32_t, __nv_bool, uint32_t, fbgemm_gpu::FixedDivisor, const index_t *, const index_t *, const uint32_t *, const int64_t *, const int32_t *, output_t *) [with emb_t=c10::Half, cache_t=c10::Half, output_t=c10::Half, index_t=int32_t, USE_LXU_CACHE=false]" at line 1496 2025-05-07T19:56:06.0786931Z 2025-05-07T19:56:06.0788223Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_forward_split_unweighted_v2_kernel.cu(873): warning #68-D: integer conversion resulted in a change of sign 2025-05-07T19:56:06.0790259Z reinterpret_cast(&lxu_cache_weights[cache_idx * max_D_cache]); 2025-05-07T19:56:06.0791148Z ^ 2025-05-07T19:56:06.0794749Z detected during instantiation of "void split_embedding_codegen_forward_unweighted_v2_kernel(const emb_t *, const emb_t *, const cache_t *, const int32_t *, uint32_t, uint32_t, __nv_bool, uint32_t, fbgemm_gpu::FixedDivisor, const index_t *, const index_t *, const uint32_t *, const int64_t *, const int32_t *, output_t *) [with emb_t=float, cache_t=float, output_t=c10::Half, index_t=int64_t, USE_LXU_CACHE=false]" at line 1546 2025-05-07T19:56:06.0797910Z 2025-05-07T19:56:06.0799160Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_forward_split_unweighted_v2_kernel.cu(873): warning #68-D: integer conversion resulted in a change of sign 2025-05-07T19:56:06.0801229Z reinterpret_cast(&lxu_cache_weights[cache_idx * max_D_cache]); 2025-05-07T19:56:06.0802140Z ^ 2025-05-07T19:56:06.0805463Z detected during instantiation of "void split_embedding_codegen_forward_unweighted_v2_kernel(const emb_t *, const emb_t *, const cache_t *, const int32_t *, uint32_t, uint32_t, __nv_bool, uint32_t, fbgemm_gpu::FixedDivisor, const index_t *, const index_t *, const uint32_t *, const int64_t *, const int32_t *, output_t *) [with emb_t=float, cache_t=c10::Half, output_t=c10::Half, index_t=int64_t, USE_LXU_CACHE=false]" at line 1596 2025-05-07T19:56:06.0808778Z 2025-05-07T19:56:06.0810025Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_forward_split_unweighted_v2_kernel.cu(873): warning #68-D: integer conversion resulted in a change of sign 2025-05-07T19:56:06.0812174Z reinterpret_cast(&lxu_cache_weights[cache_idx * max_D_cache]); 2025-05-07T19:56:06.0813077Z ^ 2025-05-07T19:56:06.0816464Z detected during instantiation of "void split_embedding_codegen_forward_unweighted_v2_kernel(const emb_t *, const emb_t *, const cache_t *, const int32_t *, uint32_t, uint32_t, __nv_bool, uint32_t, fbgemm_gpu::FixedDivisor, const index_t *, const index_t *, const uint32_t *, const int64_t *, const int32_t *, output_t *) [with emb_t=c10::Half, cache_t=float, output_t=c10::Half, index_t=int64_t, USE_LXU_CACHE=false]" at line 1646 2025-05-07T19:56:06.0819547Z 2025-05-07T19:56:06.0820924Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_forward_split_unweighted_v2_kernel.cu(873): warning #68-D: integer conversion resulted in a change of sign 2025-05-07T19:56:06.0822875Z reinterpret_cast(&lxu_cache_weights[cache_idx * max_D_cache]); 2025-05-07T19:56:06.0823802Z ^ 2025-05-07T19:56:06.0827037Z detected during instantiation of "void split_embedding_codegen_forward_unweighted_v2_kernel(const emb_t *, const emb_t *, const cache_t *, const int32_t *, uint32_t, uint32_t, __nv_bool, uint32_t, fbgemm_gpu::FixedDivisor, const index_t *, const index_t *, const uint32_t *, const int64_t *, const int32_t *, output_t *) [with emb_t=c10::Half, cache_t=c10::Half, output_t=c10::Half, index_t=int64_t, USE_LXU_CACHE=false]" at line 1696 2025-05-07T19:56:06.0830541Z 2025-05-07T19:56:06.0831886Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_forward_split_unweighted_v2_kernel.cu(873): warning #68-D: integer conversion resulted in a change of sign 2025-05-07T19:56:06.0833889Z reinterpret_cast(&lxu_cache_weights[cache_idx * max_D_cache]); 2025-05-07T19:56:06.0834815Z ^ 2025-05-07T19:56:06.0838213Z detected during instantiation of "void split_embedding_codegen_forward_unweighted_v2_kernel(const emb_t *, const emb_t *, const cache_t *, const int32_t *, uint32_t, uint32_t, __nv_bool, uint32_t, fbgemm_gpu::FixedDivisor, const index_t *, const index_t *, const uint32_t *, const int64_t *, const int32_t *, output_t *) [with emb_t=float, cache_t=float, output_t=c10::BFloat16, index_t=int32_t, USE_LXU_CACHE=false]" at line 1746 2025-05-07T19:56:06.0841392Z 2025-05-07T19:56:06.0843023Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_forward_split_unweighted_v2_kernel.cu(873): warning #68-D: integer conversion resulted in a change of sign 2025-05-07T19:56:06.0844937Z reinterpret_cast(&lxu_cache_weights[cache_idx * max_D_cache]); 2025-05-07T19:56:06.0845828Z ^ 2025-05-07T19:56:06.0849219Z detected during instantiation of "void split_embedding_codegen_forward_unweighted_v2_kernel(const emb_t *, const emb_t *, const cache_t *, const int32_t *, uint32_t, uint32_t, __nv_bool, uint32_t, fbgemm_gpu::FixedDivisor, const index_t *, const index_t *, const uint32_t *, const int64_t *, const int32_t *, output_t *) [with emb_t=float, cache_t=c10::Half, output_t=c10::BFloat16, index_t=int32_t, USE_LXU_CACHE=false]" at line 1796 2025-05-07T19:56:06.0852890Z 2025-05-07T19:56:06.0854247Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_forward_split_unweighted_v2_kernel.cu(873): warning #68-D: integer conversion resulted in a change of sign 2025-05-07T19:56:06.0856216Z reinterpret_cast(&lxu_cache_weights[cache_idx * max_D_cache]); 2025-05-07T19:56:06.0857114Z ^ 2025-05-07T19:56:06.0860448Z detected during instantiation of "void split_embedding_codegen_forward_unweighted_v2_kernel(const emb_t *, const emb_t *, const cache_t *, const int32_t *, uint32_t, uint32_t, __nv_bool, uint32_t, fbgemm_gpu::FixedDivisor, const index_t *, const index_t *, const uint32_t *, const int64_t *, const int32_t *, output_t *) [with emb_t=c10::Half, cache_t=float, output_t=c10::BFloat16, index_t=int32_t, USE_LXU_CACHE=false]" at line 1846 2025-05-07T19:56:06.0863625Z 2025-05-07T19:56:06.0864879Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_forward_split_unweighted_v2_kernel.cu(873): warning #68-D: integer conversion resulted in a change of sign 2025-05-07T19:56:06.0866891Z reinterpret_cast(&lxu_cache_weights[cache_idx * max_D_cache]); 2025-05-07T19:56:06.0867816Z ^ 2025-05-07T19:56:06.0871301Z detected during instantiation of "void split_embedding_codegen_forward_unweighted_v2_kernel(const emb_t *, const emb_t *, const cache_t *, const int32_t *, uint32_t, uint32_t, __nv_bool, uint32_t, fbgemm_gpu::FixedDivisor, const index_t *, const index_t *, const uint32_t *, const int64_t *, const int32_t *, output_t *) [with emb_t=c10::Half, cache_t=c10::Half, output_t=c10::BFloat16, index_t=int32_t, USE_LXU_CACHE=false]" at line 1896 2025-05-07T19:56:06.0874359Z 2025-05-07T19:56:06.0875670Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_forward_split_unweighted_v2_kernel.cu(873): warning #68-D: integer conversion resulted in a change of sign 2025-05-07T19:56:06.0877664Z reinterpret_cast(&lxu_cache_weights[cache_idx * max_D_cache]); 2025-05-07T19:56:06.0878554Z ^ 2025-05-07T19:56:06.0881927Z detected during instantiation of "void split_embedding_codegen_forward_unweighted_v2_kernel(const emb_t *, const emb_t *, const cache_t *, const int32_t *, uint32_t, uint32_t, __nv_bool, uint32_t, fbgemm_gpu::FixedDivisor, const index_t *, const index_t *, const uint32_t *, const int64_t *, const int32_t *, output_t *) [with emb_t=float, cache_t=float, output_t=c10::BFloat16, index_t=int64_t, USE_LXU_CACHE=false]" at line 1946 2025-05-07T19:56:06.0885102Z 2025-05-07T19:56:06.0886751Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_forward_split_unweighted_v2_kernel.cu(873): warning #68-D: integer conversion resulted in a change of sign 2025-05-07T19:56:06.0888710Z reinterpret_cast(&lxu_cache_weights[cache_idx * max_D_cache]); 2025-05-07T19:56:06.0889598Z ^ 2025-05-07T19:56:06.0893196Z detected during instantiation of "void split_embedding_codegen_forward_unweighted_v2_kernel(const emb_t *, const emb_t *, const cache_t *, const int32_t *, uint32_t, uint32_t, __nv_bool, uint32_t, fbgemm_gpu::FixedDivisor, const index_t *, const index_t *, const uint32_t *, const int64_t *, const int32_t *, output_t *) [with emb_t=float, cache_t=c10::Half, output_t=c10::BFloat16, index_t=int64_t, USE_LXU_CACHE=false]" at line 1996 2025-05-07T19:56:06.0896566Z 2025-05-07T19:56:06.0897892Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_forward_split_unweighted_v2_kernel.cu(873): warning #68-D: integer conversion resulted in a change of sign 2025-05-07T19:56:06.0899919Z reinterpret_cast(&lxu_cache_weights[cache_idx * max_D_cache]); 2025-05-07T19:56:06.0900764Z ^ 2025-05-07T19:56:06.0904147Z detected during instantiation of "void split_embedding_codegen_forward_unweighted_v2_kernel(const emb_t *, const emb_t *, const cache_t *, const int32_t *, uint32_t, uint32_t, __nv_bool, uint32_t, fbgemm_gpu::FixedDivisor, const index_t *, const index_t *, const uint32_t *, const int64_t *, const int32_t *, output_t *) [with emb_t=c10::Half, cache_t=float, output_t=c10::BFloat16, index_t=int64_t, USE_LXU_CACHE=false]" at line 2046 2025-05-07T19:56:06.0907257Z 2025-05-07T19:56:06.0908627Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_forward_split_unweighted_v2_kernel.cu(873): warning #68-D: integer conversion resulted in a change of sign 2025-05-07T19:56:06.0910672Z reinterpret_cast(&lxu_cache_weights[cache_idx * max_D_cache]); 2025-05-07T19:56:06.0911614Z ^ 2025-05-07T19:56:06.0915125Z detected during instantiation of "void split_embedding_codegen_forward_unweighted_v2_kernel(const emb_t *, const emb_t *, const cache_t *, const int32_t *, uint32_t, uint32_t, __nv_bool, uint32_t, fbgemm_gpu::FixedDivisor, const index_t *, const index_t *, const uint32_t *, const int64_t *, const int32_t *, output_t *) [with emb_t=c10::Half, cache_t=c10::Half, output_t=c10::BFloat16, index_t=int64_t, USE_LXU_CACHE=false]" at line 2096 2025-05-07T19:56:06.0918302Z 2025-05-07T19:56:06.0938406Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_forward_split_unweighted_v2_kernel.cu(873): warning #68-D: integer conversion resulted in a change of sign 2025-05-07T19:56:06.0940557Z reinterpret_cast(&lxu_cache_weights[cache_idx * max_D_cache]); 2025-05-07T19:56:06.0941474Z ^ 2025-05-07T19:56:06.0944928Z detected during instantiation of "void split_embedding_codegen_forward_unweighted_v2_kernel(const emb_t *, const emb_t *, const cache_t *, const int32_t *, uint32_t, uint32_t, __nv_bool, uint32_t, fbgemm_gpu::FixedDivisor, const index_t *, const index_t *, const uint32_t *, const int64_t *, const int32_t *, output_t *) [with emb_t=float, cache_t=float, output_t=float, index_t=int32_t, USE_LXU_CACHE=false]" at line 946 2025-05-07T19:56:06.0947953Z 2025-05-07T19:56:06.0948452Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:56:06.0949161Z 2025-05-07T19:56:06.0950797Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_forward_split_unweighted_v2_kernel.cu(873): warning #68-D: integer conversion resulted in a change of sign 2025-05-07T19:56:06.0952755Z reinterpret_cast(&lxu_cache_weights[cache_idx * max_D_cache]); 2025-05-07T19:56:06.0953615Z ^ 2025-05-07T19:56:06.0956787Z detected during instantiation of "void split_embedding_codegen_forward_unweighted_v2_kernel(const emb_t *, const emb_t *, const cache_t *, const int32_t *, uint32_t, uint32_t, __nv_bool, uint32_t, fbgemm_gpu::FixedDivisor, const index_t *, const index_t *, const uint32_t *, const int64_t *, const int32_t *, output_t *) [with emb_t=float, cache_t=c10::Half, output_t=float, index_t=int32_t, USE_LXU_CACHE=false]" at line 996 2025-05-07T19:56:06.0962560Z 2025-05-07T19:56:06.0963904Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_forward_split_unweighted_v2_kernel.cu(873): warning #68-D: integer conversion resulted in a change of sign 2025-05-07T19:56:06.0965940Z reinterpret_cast(&lxu_cache_weights[cache_idx * max_D_cache]); 2025-05-07T19:56:06.0966820Z ^ 2025-05-07T19:56:06.0970207Z detected during instantiation of "void split_embedding_codegen_forward_unweighted_v2_kernel(const emb_t *, const emb_t *, const cache_t *, const int32_t *, uint32_t, uint32_t, __nv_bool, uint32_t, fbgemm_gpu::FixedDivisor, const index_t *, const index_t *, const uint32_t *, const int64_t *, const int32_t *, output_t *) [with emb_t=c10::Half, cache_t=float, output_t=float, index_t=int32_t, USE_LXU_CACHE=false]" at line 1046 2025-05-07T19:56:06.0973515Z 2025-05-07T19:56:06.0974827Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_forward_split_unweighted_v2_kernel.cu(873): warning #68-D: integer conversion resulted in a change of sign 2025-05-07T19:56:06.0976859Z reinterpret_cast(&lxu_cache_weights[cache_idx * max_D_cache]); 2025-05-07T19:56:06.0977748Z ^ 2025-05-07T19:56:06.0981089Z detected during instantiation of "void split_embedding_codegen_forward_unweighted_v2_kernel(const emb_t *, const emb_t *, const cache_t *, const int32_t *, uint32_t, uint32_t, __nv_bool, uint32_t, fbgemm_gpu::FixedDivisor, const index_t *, const index_t *, const uint32_t *, const int64_t *, const int32_t *, output_t *) [with emb_t=c10::Half, cache_t=c10::Half, output_t=float, index_t=int32_t, USE_LXU_CACHE=false]" at line 1096 2025-05-07T19:56:06.0984096Z 2025-05-07T19:56:06.0985435Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_forward_split_unweighted_v2_kernel.cu(873): warning #68-D: integer conversion resulted in a change of sign 2025-05-07T19:56:06.0987443Z reinterpret_cast(&lxu_cache_weights[cache_idx * max_D_cache]); 2025-05-07T19:56:06.0988373Z ^ 2025-05-07T19:56:06.0991774Z detected during instantiation of "void split_embedding_codegen_forward_unweighted_v2_kernel(const emb_t *, const emb_t *, const cache_t *, const int32_t *, uint32_t, uint32_t, __nv_bool, uint32_t, fbgemm_gpu::FixedDivisor, const index_t *, const index_t *, const uint32_t *, const int64_t *, const int32_t *, output_t *) [with emb_t=float, cache_t=float, output_t=float, index_t=int64_t, USE_LXU_CACHE=false]" at line 1146 2025-05-07T19:56:06.0994903Z 2025-05-07T19:56:06.0996449Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_forward_split_unweighted_v2_kernel.cu(873): warning #68-D: integer conversion resulted in a change of sign 2025-05-07T19:56:06.0998463Z reinterpret_cast(&lxu_cache_weights[cache_idx * max_D_cache]); 2025-05-07T19:56:06.0999381Z ^ 2025-05-07T19:56:06.1002779Z detected during instantiation of "void split_embedding_codegen_forward_unweighted_v2_kernel(const emb_t *, const emb_t *, const cache_t *, const int32_t *, uint32_t, uint32_t, __nv_bool, uint32_t, fbgemm_gpu::FixedDivisor, const index_t *, const index_t *, const uint32_t *, const int64_t *, const int32_t *, output_t *) [with emb_t=float, cache_t=c10::Half, output_t=float, index_t=int64_t, USE_LXU_CACHE=false]" at line 1196 2025-05-07T19:56:06.1005883Z 2025-05-07T19:56:06.1007083Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_forward_split_unweighted_v2_kernel.cu(873): warning #68-D: integer conversion resulted in a change of sign 2025-05-07T19:56:06.1009121Z reinterpret_cast(&lxu_cache_weights[cache_idx * max_D_cache]); 2025-05-07T19:56:06.1010020Z ^ 2025-05-07T19:56:06.1013422Z detected during instantiation of "void split_embedding_codegen_forward_unweighted_v2_kernel(const emb_t *, const emb_t *, const cache_t *, const int32_t *, uint32_t, uint32_t, __nv_bool, uint32_t, fbgemm_gpu::FixedDivisor, const index_t *, const index_t *, const uint32_t *, const int64_t *, const int32_t *, output_t *) [with emb_t=c10::Half, cache_t=float, output_t=float, index_t=int64_t, USE_LXU_CACHE=false]" at line 1246 2025-05-07T19:56:06.1016423Z 2025-05-07T19:56:06.1017673Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_forward_split_unweighted_v2_kernel.cu(873): warning #68-D: integer conversion resulted in a change of sign 2025-05-07T19:56:06.1019605Z reinterpret_cast(&lxu_cache_weights[cache_idx * max_D_cache]); 2025-05-07T19:56:06.1020474Z ^ 2025-05-07T19:56:06.1023710Z detected during instantiation of "void split_embedding_codegen_forward_unweighted_v2_kernel(const emb_t *, const emb_t *, const cache_t *, const int32_t *, uint32_t, uint32_t, __nv_bool, uint32_t, fbgemm_gpu::FixedDivisor, const index_t *, const index_t *, const uint32_t *, const int64_t *, const int32_t *, output_t *) [with emb_t=c10::Half, cache_t=c10::Half, output_t=float, index_t=int64_t, USE_LXU_CACHE=false]" at line 1296 2025-05-07T19:56:06.1026766Z 2025-05-07T19:56:06.1028039Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_forward_split_unweighted_v2_kernel.cu(873): warning #68-D: integer conversion resulted in a change of sign 2025-05-07T19:56:06.1030323Z reinterpret_cast(&lxu_cache_weights[cache_idx * max_D_cache]); 2025-05-07T19:56:06.1031223Z ^ 2025-05-07T19:56:06.1034617Z detected during instantiation of "void split_embedding_codegen_forward_unweighted_v2_kernel(const emb_t *, const emb_t *, const cache_t *, const int32_t *, uint32_t, uint32_t, __nv_bool, uint32_t, fbgemm_gpu::FixedDivisor, const index_t *, const index_t *, const uint32_t *, const int64_t *, const int32_t *, output_t *) [with emb_t=float, cache_t=float, output_t=c10::Half, index_t=int32_t, USE_LXU_CACHE=false]" at line 1346 2025-05-07T19:56:06.1037771Z 2025-05-07T19:56:06.1039024Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_forward_split_unweighted_v2_kernel.cu(873): warning #68-D: integer conversion resulted in a change of sign 2025-05-07T19:56:06.1041267Z reinterpret_cast(&lxu_cache_weights[cache_idx * max_D_cache]); 2025-05-07T19:56:06.1042139Z ^ 2025-05-07T19:56:06.1045409Z detected during instantiation of "void split_embedding_codegen_forward_unweighted_v2_kernel(const emb_t *, const emb_t *, const cache_t *, const int32_t *, uint32_t, uint32_t, __nv_bool, uint32_t, fbgemm_gpu::FixedDivisor, const index_t *, const index_t *, const uint32_t *, const int64_t *, const int32_t *, output_t *) [with emb_t=float, cache_t=c10::Half, output_t=c10::Half, index_t=int32_t, USE_LXU_CACHE=false]" at line 1396 2025-05-07T19:56:06.1048442Z 2025-05-07T19:56:06.1049755Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_forward_split_unweighted_v2_kernel.cu(873): warning #68-D: integer conversion resulted in a change of sign 2025-05-07T19:56:06.1052141Z reinterpret_cast(&lxu_cache_weights[cache_idx * max_D_cache]); 2025-05-07T19:56:06.1053008Z ^ 2025-05-07T19:56:06.1056286Z detected during instantiation of "void split_embedding_codegen_forward_unweighted_v2_kernel(const emb_t *, const emb_t *, const cache_t *, const int32_t *, uint32_t, uint32_t, __nv_bool, uint32_t, fbgemm_gpu::FixedDivisor, const index_t *, const index_t *, const uint32_t *, const int64_t *, const int32_t *, output_t *) [with emb_t=c10::Half, cache_t=float, output_t=c10::Half, index_t=int32_t, USE_LXU_CACHE=false]" at line 1446 2025-05-07T19:56:06.1059367Z 2025-05-07T19:56:06.1060680Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_forward_split_unweighted_v2_kernel.cu(873): warning #68-D: integer conversion resulted in a change of sign 2025-05-07T19:56:06.1062709Z reinterpret_cast(&lxu_cache_weights[cache_idx * max_D_cache]); 2025-05-07T19:56:06.1063597Z ^ 2025-05-07T19:56:06.1066980Z detected during instantiation of "void split_embedding_codegen_forward_unweighted_v2_kernel(const emb_t *, const emb_t *, const cache_t *, const int32_t *, uint32_t, uint32_t, __nv_bool, uint32_t, fbgemm_gpu::FixedDivisor, const index_t *, const index_t *, const uint32_t *, const int64_t *, const int32_t *, output_t *) [with emb_t=c10::Half, cache_t=c10::Half, output_t=c10::Half, index_t=int32_t, USE_LXU_CACHE=false]" at line 1496 2025-05-07T19:56:06.1070165Z 2025-05-07T19:56:06.1071504Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_forward_split_unweighted_v2_kernel.cu(873): warning #68-D: integer conversion resulted in a change of sign 2025-05-07T19:56:06.1073507Z reinterpret_cast(&lxu_cache_weights[cache_idx * max_D_cache]); 2025-05-07T19:56:06.1074421Z ^ 2025-05-07T19:56:06.1077679Z detected during instantiation of "void split_embedding_codegen_forward_unweighted_v2_kernel(const emb_t *, const emb_t *, const cache_t *, const int32_t *, uint32_t, uint32_t, __nv_bool, uint32_t, fbgemm_gpu::FixedDivisor, const index_t *, const index_t *, const uint32_t *, const int64_t *, const int32_t *, output_t *) [with emb_t=float, cache_t=float, output_t=c10::Half, index_t=int64_t, USE_LXU_CACHE=false]" at line 1546 2025-05-07T19:56:06.1080831Z 2025-05-07T19:56:06.1082160Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_forward_split_unweighted_v2_kernel.cu(873): warning #68-D: integer conversion resulted in a change of sign 2025-05-07T19:56:06.1084135Z reinterpret_cast(&lxu_cache_weights[cache_idx * max_D_cache]); 2025-05-07T19:56:06.1085330Z ^ 2025-05-07T19:56:06.1088727Z detected during instantiation of "void split_embedding_codegen_forward_unweighted_v2_kernel(const emb_t *, const emb_t *, const cache_t *, const int32_t *, uint32_t, uint32_t, __nv_bool, uint32_t, fbgemm_gpu::FixedDivisor, const index_t *, const index_t *, const uint32_t *, const int64_t *, const int32_t *, output_t *) [with emb_t=float, cache_t=c10::Half, output_t=c10::Half, index_t=int64_t, USE_LXU_CACHE=false]" at line 1596 2025-05-07T19:56:06.1092062Z 2025-05-07T19:56:06.1093400Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_forward_split_unweighted_v2_kernel.cu(873): warning #68-D: integer conversion resulted in a change of sign 2025-05-07T19:56:06.1095348Z reinterpret_cast(&lxu_cache_weights[cache_idx * max_D_cache]); 2025-05-07T19:56:06.1096531Z ^ 2025-05-07T19:56:06.1099974Z detected during instantiation of "void split_embedding_codegen_forward_unweighted_v2_kernel(const emb_t *, const emb_t *, const cache_t *, const int32_t *, uint32_t, uint32_t, __nv_bool, uint32_t, fbgemm_gpu::FixedDivisor, const index_t *, const index_t *, const uint32_t *, const int64_t *, const int32_t *, output_t *) [with emb_t=c10::Half, cache_t=float, output_t=c10::Half, index_t=int64_t, USE_LXU_CACHE=false]" at line 1646 2025-05-07T19:56:06.1103096Z 2025-05-07T19:56:06.1104424Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_forward_split_unweighted_v2_kernel.cu(873): warning #68-D: integer conversion resulted in a change of sign 2025-05-07T19:56:06.1106463Z reinterpret_cast(&lxu_cache_weights[cache_idx * max_D_cache]); 2025-05-07T19:56:06.1107379Z ^ 2025-05-07T19:56:06.1110651Z detected during instantiation of "void split_embedding_codegen_forward_unweighted_v2_kernel(const emb_t *, const emb_t *, const cache_t *, const int32_t *, uint32_t, uint32_t, __nv_bool, uint32_t, fbgemm_gpu::FixedDivisor, const index_t *, const index_t *, const uint32_t *, const int64_t *, const int32_t *, output_t *) [with emb_t=c10::Half, cache_t=c10::Half, output_t=c10::Half, index_t=int64_t, USE_LXU_CACHE=false]" at line 1696 2025-05-07T19:56:06.1113870Z 2025-05-07T19:56:06.1115180Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_forward_split_unweighted_v2_kernel.cu(873): warning #68-D: integer conversion resulted in a change of sign 2025-05-07T19:56:06.1117160Z reinterpret_cast(&lxu_cache_weights[cache_idx * max_D_cache]); 2025-05-07T19:56:06.1118070Z ^ 2025-05-07T19:56:06.1121485Z detected during instantiation of "void split_embedding_codegen_forward_unweighted_v2_kernel(const emb_t *, const emb_t *, const cache_t *, const int32_t *, uint32_t, uint32_t, __nv_bool, uint32_t, fbgemm_gpu::FixedDivisor, const index_t *, const index_t *, const uint32_t *, const int64_t *, const int32_t *, output_t *) [with emb_t=float, cache_t=float, output_t=c10::BFloat16, index_t=int32_t, USE_LXU_CACHE=false]" at line 1746 2025-05-07T19:56:06.1124629Z 2025-05-07T19:56:06.1125970Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_forward_split_unweighted_v2_kernel.cu(873): warning #68-D: integer conversion resulted in a change of sign 2025-05-07T19:56:06.1128031Z reinterpret_cast(&lxu_cache_weights[cache_idx * max_D_cache]); 2025-05-07T19:56:06.1129199Z ^ 2025-05-07T19:56:06.1133112Z detected during instantiation of "void split_embedding_codegen_forward_unweighted_v2_kernel(const emb_t *, const emb_t *, const cache_t *, const int32_t *, uint32_t, uint32_t, __nv_bool, uint32_t, fbgemm_gpu::FixedDivisor, const index_t *, const index_t *, const uint32_t *, const int64_t *, const int32_t *, output_t *) [with emb_t=float, cache_t=c10::Half, output_t=c10::BFloat16, index_t=int32_t, USE_LXU_CACHE=false]" at line 1796 2025-05-07T19:56:06.1136170Z 2025-05-07T19:56:06.1137462Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_forward_split_unweighted_v2_kernel.cu(873): warning #68-D: integer conversion resulted in a change of sign 2025-05-07T19:56:06.1139405Z reinterpret_cast(&lxu_cache_weights[cache_idx * max_D_cache]); 2025-05-07T19:56:06.1140284Z ^ 2025-05-07T19:56:06.1143914Z detected during instantiation of "void split_embedding_codegen_forward_unweighted_v2_kernel(const emb_t *, const emb_t *, const cache_t *, const int32_t *, uint32_t, uint32_t, __nv_bool, uint32_t, fbgemm_gpu::FixedDivisor, const index_t *, const index_t *, const uint32_t *, const int64_t *, const int32_t *, output_t *) [with emb_t=c10::Half, cache_t=float, output_t=c10::BFloat16, index_t=int32_t, USE_LXU_CACHE=false]" at line 1846 2025-05-07T19:56:06.1147043Z 2025-05-07T19:56:06.1148351Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_forward_split_unweighted_v2_kernel.cu(873): warning #68-D: integer conversion resulted in a change of sign 2025-05-07T19:56:06.1150317Z reinterpret_cast(&lxu_cache_weights[cache_idx * max_D_cache]); 2025-05-07T19:56:06.1151147Z ^ 2025-05-07T19:56:06.1154589Z detected during instantiation of "void split_embedding_codegen_forward_unweighted_v2_kernel(const emb_t *, const emb_t *, const cache_t *, const int32_t *, uint32_t, uint32_t, __nv_bool, uint32_t, fbgemm_gpu::FixedDivisor, const index_t *, const index_t *, const uint32_t *, const int64_t *, const int32_t *, output_t *) [with emb_t=c10::Half, cache_t=c10::Half, output_t=c10::BFloat16, index_t=int32_t, USE_LXU_CACHE=false]" at line 1896 2025-05-07T19:56:06.1157825Z 2025-05-07T19:56:06.1159121Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_forward_split_unweighted_v2_kernel.cu(873): warning #68-D: integer conversion resulted in a change of sign 2025-05-07T19:56:06.1161168Z reinterpret_cast(&lxu_cache_weights[cache_idx * max_D_cache]); 2025-05-07T19:56:06.1162103Z ^ 2025-05-07T19:56:06.1165537Z detected during instantiation of "void split_embedding_codegen_forward_unweighted_v2_kernel(const emb_t *, const emb_t *, const cache_t *, const int32_t *, uint32_t, uint32_t, __nv_bool, uint32_t, fbgemm_gpu::FixedDivisor, const index_t *, const index_t *, const uint32_t *, const int64_t *, const int32_t *, output_t *) [with emb_t=float, cache_t=float, output_t=c10::BFloat16, index_t=int64_t, USE_LXU_CACHE=false]" at line 1946 2025-05-07T19:56:06.1168804Z 2025-05-07T19:56:06.1170129Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_forward_split_unweighted_v2_kernel.cu(873): warning #68-D: integer conversion resulted in a change of sign 2025-05-07T19:56:06.1172299Z reinterpret_cast(&lxu_cache_weights[cache_idx * max_D_cache]); 2025-05-07T19:56:06.1173235Z ^ 2025-05-07T19:56:06.1176890Z detected during instantiation of "void split_embedding_codegen_forward_unweighted_v2_kernel(const emb_t *, const emb_t *, const cache_t *, const int32_t *, uint32_t, uint32_t, __nv_bool, uint32_t, fbgemm_gpu::FixedDivisor, const index_t *, const index_t *, const uint32_t *, const int64_t *, const int32_t *, output_t *) [with emb_t=float, cache_t=c10::Half, output_t=c10::BFloat16, index_t=int64_t, USE_LXU_CACHE=false]" at line 1996 2025-05-07T19:56:06.1180059Z 2025-05-07T19:56:06.1181357Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_forward_split_unweighted_v2_kernel.cu(873): warning #68-D: integer conversion resulted in a change of sign 2025-05-07T19:56:06.1183332Z reinterpret_cast(&lxu_cache_weights[cache_idx * max_D_cache]); 2025-05-07T19:56:06.1184248Z ^ 2025-05-07T19:56:06.1187783Z detected during instantiation of "void split_embedding_codegen_forward_unweighted_v2_kernel(const emb_t *, const emb_t *, const cache_t *, const int32_t *, uint32_t, uint32_t, __nv_bool, uint32_t, fbgemm_gpu::FixedDivisor, const index_t *, const index_t *, const uint32_t *, const int64_t *, const int32_t *, output_t *) [with emb_t=c10::Half, cache_t=float, output_t=c10::BFloat16, index_t=int64_t, USE_LXU_CACHE=false]" at line 2046 2025-05-07T19:56:06.1191251Z 2025-05-07T19:56:06.1192591Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_forward_split_unweighted_v2_kernel.cu(873): warning #68-D: integer conversion resulted in a change of sign 2025-05-07T19:56:06.1194644Z reinterpret_cast(&lxu_cache_weights[cache_idx * max_D_cache]); 2025-05-07T19:56:06.1195553Z ^ 2025-05-07T19:56:06.1199012Z detected during instantiation of "void split_embedding_codegen_forward_unweighted_v2_kernel(const emb_t *, const emb_t *, const cache_t *, const int32_t *, uint32_t, uint32_t, __nv_bool, uint32_t, fbgemm_gpu::FixedDivisor, const index_t *, const index_t *, const uint32_t *, const int64_t *, const int32_t *, output_t *) [with emb_t=c10::Half, cache_t=c10::Half, output_t=c10::BFloat16, index_t=int64_t, USE_LXU_CACHE=false]" at line 2096 2025-05-07T19:56:06.1202047Z 2025-05-07T19:56:06.1203281Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_forward_split_unweighted_v2_kernel.cu(873): warning #68-D: integer conversion resulted in a change of sign 2025-05-07T19:56:06.1205134Z reinterpret_cast(&lxu_cache_weights[cache_idx * max_D_cache]); 2025-05-07T19:56:06.1206033Z ^ 2025-05-07T19:56:06.1209508Z detected during instantiation of "void split_embedding_codegen_forward_unweighted_v2_kernel(const emb_t *, const emb_t *, const cache_t *, const int32_t *, uint32_t, uint32_t, __nv_bool, uint32_t, fbgemm_gpu::FixedDivisor, const index_t *, const index_t *, const uint32_t *, const int64_t *, const int32_t *, output_t *) [with emb_t=float, cache_t=float, output_t=float, index_t=int32_t, USE_LXU_CACHE=false]" at line 946 2025-05-07T19:56:06.1212663Z 2025-05-07T19:56:06.1213110Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:56:06.1213703Z 2025-05-07T19:56:06.1214928Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_forward_split_unweighted_v2_kernel.cu(873): warning #68-D: integer conversion resulted in a change of sign 2025-05-07T19:56:06.1216862Z reinterpret_cast(&lxu_cache_weights[cache_idx * max_D_cache]); 2025-05-07T19:56:06.1217764Z ^ 2025-05-07T19:56:06.1221351Z detected during instantiation of "void split_embedding_codegen_forward_unweighted_v2_kernel(const emb_t *, const emb_t *, const cache_t *, const int32_t *, uint32_t, uint32_t, __nv_bool, uint32_t, fbgemm_gpu::FixedDivisor, const index_t *, const index_t *, const uint32_t *, const int64_t *, const int32_t *, output_t *) [with emb_t=float, cache_t=c10::Half, output_t=float, index_t=int32_t, USE_LXU_CACHE=false]" at line 996 2025-05-07T19:56:06.1224468Z 2025-05-07T19:56:06.1225776Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_forward_split_unweighted_v2_kernel.cu(873): warning #68-D: integer conversion resulted in a change of sign 2025-05-07T19:56:06.1227805Z reinterpret_cast(&lxu_cache_weights[cache_idx * max_D_cache]); 2025-05-07T19:56:06.1228935Z ^ 2025-05-07T19:56:06.1232278Z detected during instantiation of "void split_embedding_codegen_forward_unweighted_v2_kernel(const emb_t *, const emb_t *, const cache_t *, const int32_t *, uint32_t, uint32_t, __nv_bool, uint32_t, fbgemm_gpu::FixedDivisor, const index_t *, const index_t *, const uint32_t *, const int64_t *, const int32_t *, output_t *) [with emb_t=c10::Half, cache_t=float, output_t=float, index_t=int32_t, USE_LXU_CACHE=false]" at line 1046 2025-05-07T19:56:06.1235598Z 2025-05-07T19:56:06.1236858Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_forward_split_unweighted_v2_kernel.cu(873): warning #68-D: integer conversion resulted in a change of sign 2025-05-07T19:56:06.1238836Z reinterpret_cast(&lxu_cache_weights[cache_idx * max_D_cache]); 2025-05-07T19:56:06.1239725Z ^ 2025-05-07T19:56:06.1243166Z detected during instantiation of "void split_embedding_codegen_forward_unweighted_v2_kernel(const emb_t *, const emb_t *, const cache_t *, const int32_t *, uint32_t, uint32_t, __nv_bool, uint32_t, fbgemm_gpu::FixedDivisor, const index_t *, const index_t *, const uint32_t *, const int64_t *, const int32_t *, output_t *) [with emb_t=c10::Half, cache_t=c10::Half, output_t=float, index_t=int32_t, USE_LXU_CACHE=false]" at line 1096 2025-05-07T19:56:06.1246363Z 2025-05-07T19:56:06.1247688Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_forward_split_unweighted_v2_kernel.cu(873): warning #68-D: integer conversion resulted in a change of sign 2025-05-07T19:56:06.1249673Z reinterpret_cast(&lxu_cache_weights[cache_idx * max_D_cache]); 2025-05-07T19:56:06.1250702Z ^ 2025-05-07T19:56:06.1254110Z detected during instantiation of "void split_embedding_codegen_forward_unweighted_v2_kernel(const emb_t *, const emb_t *, const cache_t *, const int32_t *, uint32_t, uint32_t, __nv_bool, uint32_t, fbgemm_gpu::FixedDivisor, const index_t *, const index_t *, const uint32_t *, const int64_t *, const int32_t *, output_t *) [with emb_t=float, cache_t=float, output_t=float, index_t=int64_t, USE_LXU_CACHE=false]" at line 1146 2025-05-07T19:56:06.1257273Z 2025-05-07T19:56:06.1258568Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_forward_split_unweighted_v2_kernel.cu(873): warning #68-D: integer conversion resulted in a change of sign 2025-05-07T19:56:06.1260593Z reinterpret_cast(&lxu_cache_weights[cache_idx * max_D_cache]); 2025-05-07T19:56:06.1261488Z ^ 2025-05-07T19:56:06.1265089Z detected during instantiation of "void split_embedding_codegen_forward_unweighted_v2_kernel(const emb_t *, const emb_t *, const cache_t *, const int32_t *, uint32_t, uint32_t, __nv_bool, uint32_t, fbgemm_gpu::FixedDivisor, const index_t *, const index_t *, const uint32_t *, const int64_t *, const int32_t *, output_t *) [with emb_t=float, cache_t=c10::Half, output_t=float, index_t=int64_t, USE_LXU_CACHE=false]" at line 1196 2025-05-07T19:56:06.1268172Z 2025-05-07T19:56:06.1269505Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_forward_split_unweighted_v2_kernel.cu(873): warning #68-D: integer conversion resulted in a change of sign 2025-05-07T19:56:06.1271524Z reinterpret_cast(&lxu_cache_weights[cache_idx * max_D_cache]); 2025-05-07T19:56:06.1272447Z ^ 2025-05-07T19:56:06.1275918Z detected during instantiation of "void split_embedding_codegen_forward_unweighted_v2_kernel(const emb_t *, const emb_t *, const cache_t *, const int32_t *, uint32_t, uint32_t, __nv_bool, uint32_t, fbgemm_gpu::FixedDivisor, const index_t *, const index_t *, const uint32_t *, const int64_t *, const int32_t *, output_t *) [with emb_t=c10::Half, cache_t=float, output_t=float, index_t=int64_t, USE_LXU_CACHE=false]" at line 1246 2025-05-07T19:56:06.1279226Z 2025-05-07T19:56:06.1280582Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_forward_split_unweighted_v2_kernel.cu(873): warning #68-D: integer conversion resulted in a change of sign 2025-05-07T19:56:06.1282606Z reinterpret_cast(&lxu_cache_weights[cache_idx * max_D_cache]); 2025-05-07T19:56:06.1283539Z ^ 2025-05-07T19:56:06.1286865Z detected during instantiation of "void split_embedding_codegen_forward_unweighted_v2_kernel(const emb_t *, const emb_t *, const cache_t *, const int32_t *, uint32_t, uint32_t, __nv_bool, uint32_t, fbgemm_gpu::FixedDivisor, const index_t *, const index_t *, const uint32_t *, const int64_t *, const int32_t *, output_t *) [with emb_t=c10::Half, cache_t=c10::Half, output_t=float, index_t=int64_t, USE_LXU_CACHE=false]" at line 1296 2025-05-07T19:56:06.1290083Z 2025-05-07T19:56:06.1291502Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_forward_split_unweighted_v2_kernel.cu(873): warning #68-D: integer conversion resulted in a change of sign 2025-05-07T19:56:06.1293493Z reinterpret_cast(&lxu_cache_weights[cache_idx * max_D_cache]); 2025-05-07T19:56:06.1294422Z ^ 2025-05-07T19:56:06.1297859Z detected during instantiation of "void split_embedding_codegen_forward_unweighted_v2_kernel(const emb_t *, const emb_t *, const cache_t *, const int32_t *, uint32_t, uint32_t, __nv_bool, uint32_t, fbgemm_gpu::FixedDivisor, const index_t *, const index_t *, const uint32_t *, const int64_t *, const int32_t *, output_t *) [with emb_t=float, cache_t=float, output_t=c10::Half, index_t=int32_t, USE_LXU_CACHE=false]" at line 1346 2025-05-07T19:56:06.1300996Z 2025-05-07T19:56:06.1302339Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_forward_split_unweighted_v2_kernel.cu(873): warning #68-D: integer conversion resulted in a change of sign 2025-05-07T19:56:06.1304412Z reinterpret_cast(&lxu_cache_weights[cache_idx * max_D_cache]); 2025-05-07T19:56:06.1305347Z ^ 2025-05-07T19:56:06.1309180Z detected during instantiation of "void split_embedding_codegen_forward_unweighted_v2_kernel(const emb_t *, const emb_t *, const cache_t *, const int32_t *, uint32_t, uint32_t, __nv_bool, uint32_t, fbgemm_gpu::FixedDivisor, const index_t *, const index_t *, const uint32_t *, const int64_t *, const int32_t *, output_t *) [with emb_t=float, cache_t=c10::Half, output_t=c10::Half, index_t=int32_t, USE_LXU_CACHE=false]" at line 1396 2025-05-07T19:56:06.1312455Z 2025-05-07T19:56:06.1313791Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_forward_split_unweighted_v2_kernel.cu(873): warning #68-D: integer conversion resulted in a change of sign 2025-05-07T19:56:06.1315853Z reinterpret_cast(&lxu_cache_weights[cache_idx * max_D_cache]); 2025-05-07T19:56:06.1316761Z ^ 2025-05-07T19:56:06.1320239Z detected during instantiation of "void split_embedding_codegen_forward_unweighted_v2_kernel(const emb_t *, const emb_t *, const cache_t *, const int32_t *, uint32_t, uint32_t, __nv_bool, uint32_t, fbgemm_gpu::FixedDivisor, const index_t *, const index_t *, const uint32_t *, const int64_t *, const int32_t *, output_t *) [with emb_t=c10::Half, cache_t=float, output_t=c10::Half, index_t=int32_t, USE_LXU_CACHE=false]" at line 1446 2025-05-07T19:56:06.1323656Z 2025-05-07T19:56:06.1324995Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_forward_split_unweighted_v2_kernel.cu(873): warning #68-D: integer conversion resulted in a change of sign 2025-05-07T19:56:06.1327058Z reinterpret_cast(&lxu_cache_weights[cache_idx * max_D_cache]); 2025-05-07T19:56:06.1327969Z ^ 2025-05-07T19:56:06.1331529Z detected during instantiation of "void split_embedding_codegen_forward_unweighted_v2_kernel(const emb_t *, const emb_t *, const cache_t *, const int32_t *, uint32_t, uint32_t, __nv_bool, uint32_t, fbgemm_gpu::FixedDivisor, const index_t *, const index_t *, const uint32_t *, const int64_t *, const int32_t *, output_t *) [with emb_t=c10::Half, cache_t=c10::Half, output_t=c10::Half, index_t=int32_t, USE_LXU_CACHE=false]" at line 1496 2025-05-07T19:56:06.1334621Z 2025-05-07T19:56:06.1335954Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_forward_split_unweighted_v2_kernel.cu(873): warning #68-D: integer conversion resulted in a change of sign 2025-05-07T19:56:06.1338006Z reinterpret_cast(&lxu_cache_weights[cache_idx * max_D_cache]); 2025-05-07T19:56:06.1338929Z ^ 2025-05-07T19:56:06.1342415Z detected during instantiation of "void split_embedding_codegen_forward_unweighted_v2_kernel(const emb_t *, const emb_t *, const cache_t *, const int32_t *, uint32_t, uint32_t, __nv_bool, uint32_t, fbgemm_gpu::FixedDivisor, const index_t *, const index_t *, const uint32_t *, const int64_t *, const int32_t *, output_t *) [with emb_t=float, cache_t=float, output_t=c10::Half, index_t=int64_t, USE_LXU_CACHE=false]" at line 1546 2025-05-07T19:56:06.1345652Z 2025-05-07T19:56:06.1346770Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_forward_split_unweighted_v2_kernel.cu(873): warning #68-D: integer conversion resulted in a change of sign 2025-05-07T19:56:06.1348494Z reinterpret_cast(&lxu_cache_weights[cache_idx * max_D_cache]); 2025-05-07T19:56:06.1349247Z ^ 2025-05-07T19:56:06.1352418Z detected during instantiation of "void split_embedding_codegen_forward_unweighted_v2_kernel(const emb_t *, const emb_t *, const cache_t *, const int32_t *, uint32_t, uint32_t, __nv_bool, uint32_t, fbgemm_gpu::FixedDivisor, const index_t *, const index_t *, const uint32_t *, const int64_t *, const int32_t *, output_t *) [with emb_t=float, cache_t=c10::Half, output_t=c10::Half, index_t=int64_t, USE_LXU_CACHE=false]" at line 1596 2025-05-07T19:56:06.1355878Z 2025-05-07T19:56:06.1357218Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_forward_split_unweighted_v2_kernel.cu(873): warning #68-D: integer conversion resulted in a change of sign 2025-05-07T19:56:06.1359147Z reinterpret_cast(&lxu_cache_weights[cache_idx * max_D_cache]); 2025-05-07T19:56:06.1360046Z ^ 2025-05-07T19:56:06.1363374Z detected during instantiation of "void split_embedding_codegen_forward_unweighted_v2_kernel(const emb_t *, const emb_t *, const cache_t *, const int32_t *, uint32_t, uint32_t, __nv_bool, uint32_t, fbgemm_gpu::FixedDivisor, const index_t *, const index_t *, const uint32_t *, const int64_t *, const int32_t *, output_t *) [with emb_t=c10::Half, cache_t=float, output_t=c10::Half, index_t=int64_t, USE_LXU_CACHE=false]" at line 1646 2025-05-07T19:56:06.1366672Z 2025-05-07T19:56:06.1367995Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_forward_split_unweighted_v2_kernel.cu(873): warning #68-D: integer conversion resulted in a change of sign 2025-05-07T19:56:06.1369902Z reinterpret_cast(&lxu_cache_weights[cache_idx * max_D_cache]); 2025-05-07T19:56:06.1370961Z ^ 2025-05-07T19:56:06.1374299Z detected during instantiation of "void split_embedding_codegen_forward_unweighted_v2_kernel(const emb_t *, const emb_t *, const cache_t *, const int32_t *, uint32_t, uint32_t, __nv_bool, uint32_t, fbgemm_gpu::FixedDivisor, const index_t *, const index_t *, const uint32_t *, const int64_t *, const int32_t *, output_t *) [with emb_t=c10::Half, cache_t=c10::Half, output_t=c10::Half, index_t=int64_t, USE_LXU_CACHE=false]" at line 1696 2025-05-07T19:56:06.1377362Z 2025-05-07T19:56:06.1378629Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_forward_split_unweighted_v2_kernel.cu(873): warning #68-D: integer conversion resulted in a change of sign 2025-05-07T19:56:06.1380558Z reinterpret_cast(&lxu_cache_weights[cache_idx * max_D_cache]); 2025-05-07T19:56:06.1381446Z ^ 2025-05-07T19:56:06.1384757Z detected during instantiation of "void split_embedding_codegen_forward_unweighted_v2_kernel(const emb_t *, const emb_t *, const cache_t *, const int32_t *, uint32_t, uint32_t, __nv_bool, uint32_t, fbgemm_gpu::FixedDivisor, const index_t *, const index_t *, const uint32_t *, const int64_t *, const int32_t *, output_t *) [with emb_t=float, cache_t=float, output_t=c10::BFloat16, index_t=int32_t, USE_LXU_CACHE=false]" at line 1746 2025-05-07T19:56:06.1387888Z 2025-05-07T19:56:06.1389061Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_forward_split_unweighted_v2_kernel.cu(873): warning #68-D: integer conversion resulted in a change of sign 2025-05-07T19:56:06.1391048Z reinterpret_cast(&lxu_cache_weights[cache_idx * max_D_cache]); 2025-05-07T19:56:06.1391971Z ^ 2025-05-07T19:56:06.1395355Z detected during instantiation of "void split_embedding_codegen_forward_unweighted_v2_kernel(const emb_t *, const emb_t *, const cache_t *, const int32_t *, uint32_t, uint32_t, __nv_bool, uint32_t, fbgemm_gpu::FixedDivisor, const index_t *, const index_t *, const uint32_t *, const int64_t *, const int32_t *, output_t *) [with emb_t=float, cache_t=c10::Half, output_t=c10::BFloat16, index_t=int32_t, USE_LXU_CACHE=false]" at line 1796 2025-05-07T19:56:06.1398533Z 2025-05-07T19:56:06.1400069Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_forward_split_unweighted_v2_kernel.cu(873): warning #68-D: integer conversion resulted in a change of sign 2025-05-07T19:56:06.1402046Z reinterpret_cast(&lxu_cache_weights[cache_idx * max_D_cache]); 2025-05-07T19:56:06.1402923Z ^ 2025-05-07T19:56:06.1406354Z detected during instantiation of "void split_embedding_codegen_forward_unweighted_v2_kernel(const emb_t *, const emb_t *, const cache_t *, const int32_t *, uint32_t, uint32_t, __nv_bool, uint32_t, fbgemm_gpu::FixedDivisor, const index_t *, const index_t *, const uint32_t *, const int64_t *, const int32_t *, output_t *) [with emb_t=c10::Half, cache_t=float, output_t=c10::BFloat16, index_t=int32_t, USE_LXU_CACHE=false]" at line 1846 2025-05-07T19:56:06.1409682Z 2025-05-07T19:56:06.1411117Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_forward_split_unweighted_v2_kernel.cu(873): warning #68-D: integer conversion resulted in a change of sign 2025-05-07T19:56:06.1413119Z reinterpret_cast(&lxu_cache_weights[cache_idx * max_D_cache]); 2025-05-07T19:56:06.1413881Z ^ 2025-05-07T19:56:06.1417106Z detected during instantiation of "void split_embedding_codegen_forward_unweighted_v2_kernel(const emb_t *, const emb_t *, const cache_t *, const int32_t *, uint32_t, uint32_t, __nv_bool, uint32_t, fbgemm_gpu::FixedDivisor, const index_t *, const index_t *, const uint32_t *, const int64_t *, const int32_t *, output_t *) [with emb_t=c10::Half, cache_t=c10::Half, output_t=c10::BFloat16, index_t=int32_t, USE_LXU_CACHE=false]" at line 1896 2025-05-07T19:56:06.1420226Z 2025-05-07T19:56:06.1421432Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_forward_split_unweighted_v2_kernel.cu(873): warning #68-D: integer conversion resulted in a change of sign 2025-05-07T19:56:06.1423381Z reinterpret_cast(&lxu_cache_weights[cache_idx * max_D_cache]); 2025-05-07T19:56:06.1424250Z ^ 2025-05-07T19:56:06.1427493Z detected during instantiation of "void split_embedding_codegen_forward_unweighted_v2_kernel(const emb_t *, const emb_t *, const cache_t *, const int32_t *, uint32_t, uint32_t, __nv_bool, uint32_t, fbgemm_gpu::FixedDivisor, const index_t *, const index_t *, const uint32_t *, const int64_t *, const int32_t *, output_t *) [with emb_t=float, cache_t=float, output_t=c10::BFloat16, index_t=int64_t, USE_LXU_CACHE=false]" at line 1946 2025-05-07T19:56:06.1430918Z 2025-05-07T19:56:06.1432198Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_forward_split_unweighted_v2_kernel.cu(873): warning #68-D: integer conversion resulted in a change of sign 2025-05-07T19:56:06.1434105Z reinterpret_cast(&lxu_cache_weights[cache_idx * max_D_cache]); 2025-05-07T19:56:06.1435043Z ^ 2025-05-07T19:56:06.1438353Z detected during instantiation of "void split_embedding_codegen_forward_unweighted_v2_kernel(const emb_t *, const emb_t *, const cache_t *, const int32_t *, uint32_t, uint32_t, __nv_bool, uint32_t, fbgemm_gpu::FixedDivisor, const index_t *, const index_t *, const uint32_t *, const int64_t *, const int32_t *, output_t *) [with emb_t=float, cache_t=c10::Half, output_t=c10::BFloat16, index_t=int64_t, USE_LXU_CACHE=false]" at line 1996 2025-05-07T19:56:06.1441368Z 2025-05-07T19:56:06.1442952Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_forward_split_unweighted_v2_kernel.cu(873): warning #68-D: integer conversion resulted in a change of sign 2025-05-07T19:56:06.1444987Z reinterpret_cast(&lxu_cache_weights[cache_idx * max_D_cache]); 2025-05-07T19:56:06.1445932Z ^ 2025-05-07T19:56:06.1449463Z detected during instantiation of "void split_embedding_codegen_forward_unweighted_v2_kernel(const emb_t *, const emb_t *, const cache_t *, const int32_t *, uint32_t, uint32_t, __nv_bool, uint32_t, fbgemm_gpu::FixedDivisor, const index_t *, const index_t *, const uint32_t *, const int64_t *, const int32_t *, output_t *) [with emb_t=c10::Half, cache_t=float, output_t=c10::BFloat16, index_t=int64_t, USE_LXU_CACHE=false]" at line 2046 2025-05-07T19:56:06.1452845Z 2025-05-07T19:56:06.1454225Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_forward_split_unweighted_v2_kernel.cu(873): warning #68-D: integer conversion resulted in a change of sign 2025-05-07T19:56:06.1456512Z reinterpret_cast(&lxu_cache_weights[cache_idx * max_D_cache]); 2025-05-07T19:56:06.1457454Z ^ 2025-05-07T19:56:06.1460759Z detected during instantiation of "void split_embedding_codegen_forward_unweighted_v2_kernel(const emb_t *, const emb_t *, const cache_t *, const int32_t *, uint32_t, uint32_t, __nv_bool, uint32_t, fbgemm_gpu::FixedDivisor, const index_t *, const index_t *, const uint32_t *, const int64_t *, const int32_t *, output_t *) [with emb_t=c10::Half, cache_t=c10::Half, output_t=c10::BFloat16, index_t=int64_t, USE_LXU_CACHE=false]" at line 2096 2025-05-07T19:56:06.1463766Z 2025-05-07T19:56:06.1465028Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_forward_split_unweighted_v2_kernel.cu(873): warning #68-D: integer conversion resulted in a change of sign 2025-05-07T19:56:06.1466979Z reinterpret_cast(&lxu_cache_weights[cache_idx * max_D_cache]); 2025-05-07T19:56:06.1467823Z ^ 2025-05-07T19:56:06.1470939Z detected during instantiation of "void split_embedding_codegen_forward_unweighted_v2_kernel(const emb_t *, const emb_t *, const cache_t *, const int32_t *, uint32_t, uint32_t, __nv_bool, uint32_t, fbgemm_gpu::FixedDivisor, const index_t *, const index_t *, const uint32_t *, const int64_t *, const int32_t *, output_t *) [with emb_t=float, cache_t=float, output_t=float, index_t=int32_t, USE_LXU_CACHE=false]" at line 946 2025-05-07T19:56:06.1473786Z 2025-05-07T19:56:06.1474202Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:56:06.1474885Z 2025-05-07T19:56:06.1476139Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_forward_split_unweighted_v2_kernel.cu(873): warning #68-D: integer conversion resulted in a change of sign 2025-05-07T19:56:06.1478093Z reinterpret_cast(&lxu_cache_weights[cache_idx * max_D_cache]); 2025-05-07T19:56:06.1479004Z ^ 2025-05-07T19:56:06.1482460Z detected during instantiation of "void split_embedding_codegen_forward_unweighted_v2_kernel(const emb_t *, const emb_t *, const cache_t *, const int32_t *, uint32_t, uint32_t, __nv_bool, uint32_t, fbgemm_gpu::FixedDivisor, const index_t *, const index_t *, const uint32_t *, const int64_t *, const int32_t *, output_t *) [with emb_t=float, cache_t=c10::Half, output_t=float, index_t=int32_t, USE_LXU_CACHE=false]" at line 996 2025-05-07T19:56:06.1485698Z 2025-05-07T19:56:06.1487277Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_forward_split_unweighted_v2_kernel.cu(873): warning #68-D: integer conversion resulted in a change of sign 2025-05-07T19:56:06.1489334Z reinterpret_cast(&lxu_cache_weights[cache_idx * max_D_cache]); 2025-05-07T19:56:06.1490272Z ^ 2025-05-07T19:56:06.1493718Z detected during instantiation of "void split_embedding_codegen_forward_unweighted_v2_kernel(const emb_t *, const emb_t *, const cache_t *, const int32_t *, uint32_t, uint32_t, __nv_bool, uint32_t, fbgemm_gpu::FixedDivisor, const index_t *, const index_t *, const uint32_t *, const int64_t *, const int32_t *, output_t *) [with emb_t=c10::Half, cache_t=float, output_t=float, index_t=int32_t, USE_LXU_CACHE=false]" at line 1046 2025-05-07T19:56:06.1496743Z 2025-05-07T19:56:06.1498042Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_forward_split_unweighted_v2_kernel.cu(873): warning #68-D: integer conversion resulted in a change of sign 2025-05-07T19:56:06.1500158Z reinterpret_cast(&lxu_cache_weights[cache_idx * max_D_cache]); 2025-05-07T19:56:06.1501047Z ^ 2025-05-07T19:56:06.1504391Z detected during instantiation of "void split_embedding_codegen_forward_unweighted_v2_kernel(const emb_t *, const emb_t *, const cache_t *, const int32_t *, uint32_t, uint32_t, __nv_bool, uint32_t, fbgemm_gpu::FixedDivisor, const index_t *, const index_t *, const uint32_t *, const int64_t *, const int32_t *, output_t *) [with emb_t=c10::Half, cache_t=c10::Half, output_t=float, index_t=int32_t, USE_LXU_CACHE=false]" at line 1096 2025-05-07T19:56:06.1507394Z 2025-05-07T19:56:06.1508654Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_forward_split_unweighted_v2_kernel.cu(873): warning #68-D: integer conversion resulted in a change of sign 2025-05-07T19:56:06.1510225Z reinterpret_cast(&lxu_cache_weights[cache_idx * max_D_cache]); 2025-05-07T19:56:06.1510993Z ^ 2025-05-07T19:56:06.1514033Z detected during instantiation of "void split_embedding_codegen_forward_unweighted_v2_kernel(const emb_t *, const emb_t *, const cache_t *, const int32_t *, uint32_t, uint32_t, __nv_bool, uint32_t, fbgemm_gpu::FixedDivisor, const index_t *, const index_t *, const uint32_t *, const int64_t *, const int32_t *, output_t *) [with emb_t=float, cache_t=float, output_t=float, index_t=int64_t, USE_LXU_CACHE=false]" at line 1146 2025-05-07T19:56:06.1516950Z 2025-05-07T19:56:06.1518184Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_forward_split_unweighted_v2_kernel.cu(873): warning #68-D: integer conversion resulted in a change of sign 2025-05-07T19:56:06.1520108Z reinterpret_cast(&lxu_cache_weights[cache_idx * max_D_cache]); 2025-05-07T19:56:06.1521038Z ^ 2025-05-07T19:56:06.1524456Z detected during instantiation of "void split_embedding_codegen_forward_unweighted_v2_kernel(const emb_t *, const emb_t *, const cache_t *, const int32_t *, uint32_t, uint32_t, __nv_bool, uint32_t, fbgemm_gpu::FixedDivisor, const index_t *, const index_t *, const uint32_t *, const int64_t *, const int32_t *, output_t *) [with emb_t=float, cache_t=c10::Half, output_t=float, index_t=int64_t, USE_LXU_CACHE=false]" at line 1196 2025-05-07T19:56:06.1527663Z 2025-05-07T19:56:06.1529321Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_forward_split_unweighted_v2_kernel.cu(873): warning #68-D: integer conversion resulted in a change of sign 2025-05-07T19:56:06.1532722Z reinterpret_cast(&lxu_cache_weights[cache_idx * max_D_cache]); 2025-05-07T19:56:06.1533264Z ^ 2025-05-07T19:56:06.1535018Z detected during instantiation of "void split_embedding_codegen_forward_unweighted_v2_kernel(const emb_t *, const emb_t *, const cache_t *, const int32_t *, uint32_t, uint32_t, __nv_bool, uint32_t, fbgemm_gpu::FixedDivisor, const index_t *, const index_t *, const uint32_t *, const int64_t *, const int32_t *, output_t *) [with emb_t=c10::Half, cache_t=float, output_t=float, index_t=int64_t, USE_LXU_CACHE=false]" at line 1246 2025-05-07T19:56:06.1536659Z 2025-05-07T19:56:06.1537332Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_forward_split_unweighted_v2_kernel.cu(873): warning #68-D: integer conversion resulted in a change of sign 2025-05-07T19:56:06.1538540Z reinterpret_cast(&lxu_cache_weights[cache_idx * max_D_cache]); 2025-05-07T19:56:06.1539025Z ^ 2025-05-07T19:56:06.1540812Z detected during instantiation of "void split_embedding_codegen_forward_unweighted_v2_kernel(const emb_t *, const emb_t *, const cache_t *, const int32_t *, uint32_t, uint32_t, __nv_bool, uint32_t, fbgemm_gpu::FixedDivisor, const index_t *, const index_t *, const uint32_t *, const int64_t *, const int32_t *, output_t *) [with emb_t=c10::Half, cache_t=c10::Half, output_t=float, index_t=int64_t, USE_LXU_CACHE=false]" at line 1296 2025-05-07T19:56:06.1542455Z 2025-05-07T19:56:06.1543125Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_forward_split_unweighted_v2_kernel.cu(873): warning #68-D: integer conversion resulted in a change of sign 2025-05-07T19:56:06.1544179Z reinterpret_cast(&lxu_cache_weights[cache_idx * max_D_cache]); 2025-05-07T19:56:06.1544659Z ^ 2025-05-07T19:56:06.1546418Z detected during instantiation of "void split_embedding_codegen_forward_unweighted_v2_kernel(const emb_t *, const emb_t *, const cache_t *, const int32_t *, uint32_t, uint32_t, __nv_bool, uint32_t, fbgemm_gpu::FixedDivisor, const index_t *, const index_t *, const uint32_t *, const int64_t *, const int32_t *, output_t *) [with emb_t=float, cache_t=float, output_t=c10::Half, index_t=int32_t, USE_LXU_CACHE=false]" at line 1346 2025-05-07T19:56:06.1548032Z 2025-05-07T19:56:06.1548700Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_forward_split_unweighted_v2_kernel.cu(873): warning #68-D: integer conversion resulted in a change of sign 2025-05-07T19:56:06.1549753Z reinterpret_cast(&lxu_cache_weights[cache_idx * max_D_cache]); 2025-05-07T19:56:06.1550240Z ^ 2025-05-07T19:56:06.1551998Z detected during instantiation of "void split_embedding_codegen_forward_unweighted_v2_kernel(const emb_t *, const emb_t *, const cache_t *, const int32_t *, uint32_t, uint32_t, __nv_bool, uint32_t, fbgemm_gpu::FixedDivisor, const index_t *, const index_t *, const uint32_t *, const int64_t *, const int32_t *, output_t *) [with emb_t=float, cache_t=c10::Half, output_t=c10::Half, index_t=int32_t, USE_LXU_CACHE=false]" at line 1396 2025-05-07T19:56:06.1553594Z 2025-05-07T19:56:06.1554282Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_forward_split_unweighted_v2_kernel.cu(873): warning #68-D: integer conversion resulted in a change of sign 2025-05-07T19:56:06.1555308Z reinterpret_cast(&lxu_cache_weights[cache_idx * max_D_cache]); 2025-05-07T19:56:06.1555878Z ^ 2025-05-07T19:56:06.1557724Z detected during instantiation of "void split_embedding_codegen_forward_unweighted_v2_kernel(const emb_t *, const emb_t *, const cache_t *, const int32_t *, uint32_t, uint32_t, __nv_bool, uint32_t, fbgemm_gpu::FixedDivisor, const index_t *, const index_t *, const uint32_t *, const int64_t *, const int32_t *, output_t *) [with emb_t=c10::Half, cache_t=float, output_t=c10::Half, index_t=int32_t, USE_LXU_CACHE=false]" at line 1446 2025-05-07T19:56:06.1559329Z 2025-05-07T19:56:06.1560017Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_forward_split_unweighted_v2_kernel.cu(873): warning #68-D: integer conversion resulted in a change of sign 2025-05-07T19:56:06.1561041Z reinterpret_cast(&lxu_cache_weights[cache_idx * max_D_cache]); 2025-05-07T19:56:06.1561629Z ^ 2025-05-07T19:56:06.1563419Z detected during instantiation of "void split_embedding_codegen_forward_unweighted_v2_kernel(const emb_t *, const emb_t *, const cache_t *, const int32_t *, uint32_t, uint32_t, __nv_bool, uint32_t, fbgemm_gpu::FixedDivisor, const index_t *, const index_t *, const uint32_t *, const int64_t *, const int32_t *, output_t *) [with emb_t=c10::Half, cache_t=c10::Half, output_t=c10::Half, index_t=int32_t, USE_LXU_CACHE=false]" at line 1496 2025-05-07T19:56:06.1565037Z 2025-05-07T19:56:06.1565736Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_forward_split_unweighted_v2_kernel.cu(873): warning #68-D: integer conversion resulted in a change of sign 2025-05-07T19:56:06.1566759Z reinterpret_cast(&lxu_cache_weights[cache_idx * max_D_cache]); 2025-05-07T19:56:06.1567263Z ^ 2025-05-07T19:56:06.1569029Z detected during instantiation of "void split_embedding_codegen_forward_unweighted_v2_kernel(const emb_t *, const emb_t *, const cache_t *, const int32_t *, uint32_t, uint32_t, __nv_bool, uint32_t, fbgemm_gpu::FixedDivisor, const index_t *, const index_t *, const uint32_t *, const int64_t *, const int32_t *, output_t *) [with emb_t=float, cache_t=float, output_t=c10::Half, index_t=int64_t, USE_LXU_CACHE=false]" at line 1546 2025-05-07T19:56:06.1570804Z 2025-05-07T19:56:06.1571476Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_forward_split_unweighted_v2_kernel.cu(873): warning #68-D: integer conversion resulted in a change of sign 2025-05-07T19:56:06.1572527Z reinterpret_cast(&lxu_cache_weights[cache_idx * max_D_cache]); 2025-05-07T19:56:06.1573030Z ^ 2025-05-07T19:56:06.1574783Z detected during instantiation of "void split_embedding_codegen_forward_unweighted_v2_kernel(const emb_t *, const emb_t *, const cache_t *, const int32_t *, uint32_t, uint32_t, __nv_bool, uint32_t, fbgemm_gpu::FixedDivisor, const index_t *, const index_t *, const uint32_t *, const int64_t *, const int32_t *, output_t *) [with emb_t=float, cache_t=c10::Half, output_t=c10::Half, index_t=int64_t, USE_LXU_CACHE=false]" at line 1596 2025-05-07T19:56:06.1576411Z 2025-05-07T19:56:06.1577082Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_forward_split_unweighted_v2_kernel.cu(873): warning #68-D: integer conversion resulted in a change of sign 2025-05-07T19:56:06.1578136Z reinterpret_cast(&lxu_cache_weights[cache_idx * max_D_cache]); 2025-05-07T19:56:06.1578617Z ^ 2025-05-07T19:56:06.1580452Z detected during instantiation of "void split_embedding_codegen_forward_unweighted_v2_kernel(const emb_t *, const emb_t *, const cache_t *, const int32_t *, uint32_t, uint32_t, __nv_bool, uint32_t, fbgemm_gpu::FixedDivisor, const index_t *, const index_t *, const uint32_t *, const int64_t *, const int32_t *, output_t *) [with emb_t=c10::Half, cache_t=float, output_t=c10::Half, index_t=int64_t, USE_LXU_CACHE=false]" at line 1646 2025-05-07T19:56:06.1582077Z 2025-05-07T19:56:06.1582745Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_forward_split_unweighted_v2_kernel.cu(873): warning #68-D: integer conversion resulted in a change of sign 2025-05-07T19:56:06.1583799Z reinterpret_cast(&lxu_cache_weights[cache_idx * max_D_cache]); 2025-05-07T19:56:06.1584277Z ^ 2025-05-07T19:56:06.1586106Z detected during instantiation of "void split_embedding_codegen_forward_unweighted_v2_kernel(const emb_t *, const emb_t *, const cache_t *, const int32_t *, uint32_t, uint32_t, __nv_bool, uint32_t, fbgemm_gpu::FixedDivisor, const index_t *, const index_t *, const uint32_t *, const int64_t *, const int32_t *, output_t *) [with emb_t=c10::Half, cache_t=c10::Half, output_t=c10::Half, index_t=int64_t, USE_LXU_CACHE=false]" at line 1696 2025-05-07T19:56:06.1587743Z 2025-05-07T19:56:06.1588411Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_forward_split_unweighted_v2_kernel.cu(873): warning #68-D: integer conversion resulted in a change of sign 2025-05-07T19:56:06.1589460Z reinterpret_cast(&lxu_cache_weights[cache_idx * max_D_cache]); 2025-05-07T19:56:06.1589946Z ^ 2025-05-07T19:56:06.1591728Z detected during instantiation of "void split_embedding_codegen_forward_unweighted_v2_kernel(const emb_t *, const emb_t *, const cache_t *, const int32_t *, uint32_t, uint32_t, __nv_bool, uint32_t, fbgemm_gpu::FixedDivisor, const index_t *, const index_t *, const uint32_t *, const int64_t *, const int32_t *, output_t *) [with emb_t=float, cache_t=float, output_t=c10::BFloat16, index_t=int32_t, USE_LXU_CACHE=false]" at line 1746 2025-05-07T19:56:06.1593346Z 2025-05-07T19:56:06.1594034Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_forward_split_unweighted_v2_kernel.cu(873): warning #68-D: integer conversion resulted in a change of sign 2025-05-07T19:56:06.1595056Z reinterpret_cast(&lxu_cache_weights[cache_idx * max_D_cache]); 2025-05-07T19:56:06.1595562Z ^ 2025-05-07T19:56:06.1597360Z detected during instantiation of "void split_embedding_codegen_forward_unweighted_v2_kernel(const emb_t *, const emb_t *, const cache_t *, const int32_t *, uint32_t, uint32_t, __nv_bool, uint32_t, fbgemm_gpu::FixedDivisor, const index_t *, const index_t *, const uint32_t *, const int64_t *, const int32_t *, output_t *) [with emb_t=float, cache_t=c10::Half, output_t=c10::BFloat16, index_t=int32_t, USE_LXU_CACHE=false]" at line 1796 2025-05-07T19:56:06.1598982Z 2025-05-07T19:56:06.1599673Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_forward_split_unweighted_v2_kernel.cu(873): warning #68-D: integer conversion resulted in a change of sign 2025-05-07T19:56:06.1600696Z reinterpret_cast(&lxu_cache_weights[cache_idx * max_D_cache]); 2025-05-07T19:56:06.1601203Z ^ 2025-05-07T19:56:06.1603053Z detected during instantiation of "void split_embedding_codegen_forward_unweighted_v2_kernel(const emb_t *, const emb_t *, const cache_t *, const int32_t *, uint32_t, uint32_t, __nv_bool, uint32_t, fbgemm_gpu::FixedDivisor, const index_t *, const index_t *, const uint32_t *, const int64_t *, const int32_t *, output_t *) [with emb_t=c10::Half, cache_t=float, output_t=c10::BFloat16, index_t=int32_t, USE_LXU_CACHE=false]" at line 1846 2025-05-07T19:56:06.1604678Z 2025-05-07T19:56:06.1605347Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_forward_split_unweighted_v2_kernel.cu(873): warning #68-D: integer conversion resulted in a change of sign 2025-05-07T19:56:06.1606415Z reinterpret_cast(&lxu_cache_weights[cache_idx * max_D_cache]); 2025-05-07T19:56:06.1606918Z ^ 2025-05-07T19:56:06.1608712Z detected during instantiation of "void split_embedding_codegen_forward_unweighted_v2_kernel(const emb_t *, const emb_t *, const cache_t *, const int32_t *, uint32_t, uint32_t, __nv_bool, uint32_t, fbgemm_gpu::FixedDivisor, const index_t *, const index_t *, const uint32_t *, const int64_t *, const int32_t *, output_t *) [with emb_t=c10::Half, cache_t=c10::Half, output_t=c10::BFloat16, index_t=int32_t, USE_LXU_CACHE=false]" at line 1896 2025-05-07T19:56:06.1610389Z 2025-05-07T19:56:06.1611200Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_forward_split_unweighted_v2_kernel.cu(873): warning #68-D: integer conversion resulted in a change of sign 2025-05-07T19:56:06.1612263Z reinterpret_cast(&lxu_cache_weights[cache_idx * max_D_cache]); 2025-05-07T19:56:06.1612773Z ^ 2025-05-07T19:56:06.1614544Z detected during instantiation of "void split_embedding_codegen_forward_unweighted_v2_kernel(const emb_t *, const emb_t *, const cache_t *, const int32_t *, uint32_t, uint32_t, __nv_bool, uint32_t, fbgemm_gpu::FixedDivisor, const index_t *, const index_t *, const uint32_t *, const int64_t *, const int32_t *, output_t *) [with emb_t=float, cache_t=float, output_t=c10::BFloat16, index_t=int64_t, USE_LXU_CACHE=false]" at line 1946 2025-05-07T19:56:06.1616194Z 2025-05-07T19:56:06.1616855Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_forward_split_unweighted_v2_kernel.cu(873): warning #68-D: integer conversion resulted in a change of sign 2025-05-07T19:56:06.1617904Z reinterpret_cast(&lxu_cache_weights[cache_idx * max_D_cache]); 2025-05-07T19:56:06.1618381Z ^ 2025-05-07T19:56:06.1620168Z detected during instantiation of "void split_embedding_codegen_forward_unweighted_v2_kernel(const emb_t *, const emb_t *, const cache_t *, const int32_t *, uint32_t, uint32_t, __nv_bool, uint32_t, fbgemm_gpu::FixedDivisor, const index_t *, const index_t *, const uint32_t *, const int64_t *, const int32_t *, output_t *) [with emb_t=float, cache_t=c10::Half, output_t=c10::BFloat16, index_t=int64_t, USE_LXU_CACHE=false]" at line 1996 2025-05-07T19:56:06.1621815Z 2025-05-07T19:56:06.1622485Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_forward_split_unweighted_v2_kernel.cu(873): warning #68-D: integer conversion resulted in a change of sign 2025-05-07T19:56:06.1623530Z reinterpret_cast(&lxu_cache_weights[cache_idx * max_D_cache]); 2025-05-07T19:56:06.1624007Z ^ 2025-05-07T19:56:06.1625853Z detected during instantiation of "void split_embedding_codegen_forward_unweighted_v2_kernel(const emb_t *, const emb_t *, const cache_t *, const int32_t *, uint32_t, uint32_t, __nv_bool, uint32_t, fbgemm_gpu::FixedDivisor, const index_t *, const index_t *, const uint32_t *, const int64_t *, const int32_t *, output_t *) [with emb_t=c10::Half, cache_t=float, output_t=c10::BFloat16, index_t=int64_t, USE_LXU_CACHE=false]" at line 2046 2025-05-07T19:56:06.1627501Z 2025-05-07T19:56:06.1628173Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_forward_split_unweighted_v2_kernel.cu(873): warning #68-D: integer conversion resulted in a change of sign 2025-05-07T19:56:06.1629507Z reinterpret_cast(&lxu_cache_weights[cache_idx * max_D_cache]); 2025-05-07T19:56:06.1629992Z ^ 2025-05-07T19:56:06.1631793Z detected during instantiation of "void split_embedding_codegen_forward_unweighted_v2_kernel(const emb_t *, const emb_t *, const cache_t *, const int32_t *, uint32_t, uint32_t, __nv_bool, uint32_t, fbgemm_gpu::FixedDivisor, const index_t *, const index_t *, const uint32_t *, const int64_t *, const int32_t *, output_t *) [with emb_t=c10::Half, cache_t=c10::Half, output_t=c10::BFloat16, index_t=int64_t, USE_LXU_CACHE=false]" at line 2096 2025-05-07T19:56:06.1633591Z 2025-05-07T19:56:09.2123181Z [245/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_inference_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -gencode arch=compute_90,code=sm_90 -gencode arch=compute_90a,code=sm_90a -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++20 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_tbe_inference.dir/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_nobag_int4_codegen_cuda.cu.o -MF CMakeFiles/fbgemm_gpu_tbe_inference.dir/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_nobag_int4_codegen_cuda.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_nobag_int4_codegen_cuda.cu -o CMakeFiles/fbgemm_gpu_tbe_inference.dir/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_nobag_int4_codegen_cuda.cu.o 2025-05-07T19:56:09.2147514Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_nobag_int4_codegen_cuda.cu(220): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:56:09.2149630Z if (output_d >= 0 && output_d < D) { 2025-05-07T19:56:09.2150147Z ^ 2025-05-07T19:56:09.2150409Z 2025-05-07T19:56:09.2150846Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:56:09.2151514Z 2025-05-07T19:56:09.2153050Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_nobag_int4_codegen_cuda.cu(237): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:56:09.2155014Z if (output_d >= 0 && output_d < D) { 2025-05-07T19:56:09.2155530Z ^ 2025-05-07T19:56:09.2156184Z 2025-05-07T19:56:09.2157643Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_nobag_int4_codegen_cuda.cu(247): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:56:09.2159431Z if (output_d >= 0 && output_d < D) { 2025-05-07T19:56:09.2159969Z ^ 2025-05-07T19:56:09.2160249Z 2025-05-07T19:56:09.2161898Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_nobag_int4_codegen_cuda.cu(220): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:56:09.2163788Z if (output_d >= 0 && output_d < D) { 2025-05-07T19:56:09.2164344Z ^ 2025-05-07T19:56:09.2164721Z 2025-05-07T19:56:09.2165190Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:56:09.2165841Z 2025-05-07T19:56:09.2167445Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_nobag_int4_codegen_cuda.cu(237): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:56:09.2169433Z if (output_d >= 0 && output_d < D) { 2025-05-07T19:56:09.2170172Z ^ 2025-05-07T19:56:09.2170471Z 2025-05-07T19:56:09.2172039Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_nobag_int4_codegen_cuda.cu(247): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:56:09.2174018Z if (output_d >= 0 && output_d < D) { 2025-05-07T19:56:09.2174567Z ^ 2025-05-07T19:56:09.2174893Z 2025-05-07T19:56:09.2176424Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_nobag_int4_codegen_cuda.cu(220): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:56:09.2178347Z if (output_d >= 0 && output_d < D) { 2025-05-07T19:56:09.2178903Z ^ 2025-05-07T19:56:09.2179189Z 2025-05-07T19:56:09.2179619Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:56:09.2180224Z 2025-05-07T19:56:09.2181725Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_nobag_int4_codegen_cuda.cu(237): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:56:09.2183688Z if (output_d >= 0 && output_d < D) { 2025-05-07T19:56:09.2184265Z ^ 2025-05-07T19:56:09.2184570Z 2025-05-07T19:56:09.2186168Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_nobag_int4_codegen_cuda.cu(247): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:56:09.2188152Z if (output_d >= 0 && output_d < D) { 2025-05-07T19:56:09.2188903Z ^ 2025-05-07T19:56:09.2189236Z 2025-05-07T19:56:09.2190792Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_nobag_int4_codegen_cuda.cu(220): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:56:09.2192844Z if (output_d >= 0 && output_d < D) { 2025-05-07T19:56:09.2193422Z ^ 2025-05-07T19:56:09.2193721Z 2025-05-07T19:56:09.2194173Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:56:09.2194830Z 2025-05-07T19:56:09.2196312Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_nobag_int4_codegen_cuda.cu(237): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:56:09.2198177Z if (output_d >= 0 && output_d < D) { 2025-05-07T19:56:09.2198671Z ^ 2025-05-07T19:56:09.2198947Z 2025-05-07T19:56:09.2200260Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_nobag_int4_codegen_cuda.cu(247): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:56:09.2202042Z if (output_d >= 0 && output_d < D) { 2025-05-07T19:56:09.2202579Z ^ 2025-05-07T19:56:09.2202883Z 2025-05-07T19:56:09.2204515Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_nobag_int4_codegen_cuda.cu(220): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:56:09.2206519Z if (output_d >= 0 && output_d < D) { 2025-05-07T19:56:09.2207090Z ^ 2025-05-07T19:56:09.2207398Z 2025-05-07T19:56:09.2207892Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:56:09.2208569Z 2025-05-07T19:56:09.2210347Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_nobag_int4_codegen_cuda.cu(237): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:56:09.2212403Z if (output_d >= 0 && output_d < D) { 2025-05-07T19:56:09.2212991Z ^ 2025-05-07T19:56:09.2213300Z 2025-05-07T19:56:09.2214921Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_nobag_int4_codegen_cuda.cu(247): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:56:09.2216977Z if (output_d >= 0 && output_d < D) { 2025-05-07T19:56:09.2217549Z ^ 2025-05-07T19:56:09.2217884Z 2025-05-07T19:56:13.3489208Z [246/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_forward_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -gencode arch=compute_90,code=sm_90 -gencode arch=compute_90a,code=sm_90a -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++20 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_forward.dir/gen_embedding_forward_dense_unweighted_vbe_kernel.cu.o -MF CMakeFiles/fbgemm_gpu_tbe_training_forward.dir/gen_embedding_forward_dense_unweighted_vbe_kernel.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_forward_dense_unweighted_vbe_kernel.cu -o CMakeFiles/fbgemm_gpu_tbe_training_forward.dir/gen_embedding_forward_dense_unweighted_vbe_kernel.cu.o 2025-05-07T19:56:16.3164500Z [247/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -gencode arch=compute_90,code=sm_90 -gencode arch=compute_90a,code=sm_90a -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++20 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_split_grad_embedding_ops.cu.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_split_grad_embedding_ops.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_backward_split_grad_embedding_ops.cu -o CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_split_grad_embedding_ops.cu.o 2025-05-07T19:56:31.2205506Z [248/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_inference_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -gencode arch=compute_90,code=sm_90 -gencode arch=compute_90a,code=sm_90a -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++20 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_tbe_inference.dir/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_int4_codegen_cuda.cu.o -MF CMakeFiles/fbgemm_gpu_tbe_inference.dir/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_int4_codegen_cuda.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_int4_codegen_cuda.cu -o CMakeFiles/fbgemm_gpu_tbe_inference.dir/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_int4_codegen_cuda.cu.o 2025-05-07T19:56:31.2222498Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_int4_codegen_cuda.cu(254): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:56:31.2224031Z if (output_d >= 0 && output_d < D && packed_bag_store_idx < num_packed_bags) { 2025-05-07T19:56:31.2224574Z ^ 2025-05-07T19:56:31.2224774Z 2025-05-07T19:56:31.2225125Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:56:31.2225566Z 2025-05-07T19:56:31.2226599Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_int4_codegen_cuda.cu(259): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:56:31.2227876Z if (output_d >= 0 && output_d < D) { 2025-05-07T19:56:31.2228277Z ^ 2025-05-07T19:56:31.2228804Z 2025-05-07T19:56:31.2229819Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_int4_codegen_cuda.cu(276): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:56:31.2231360Z if (output_d >= 0 && output_d < D) { 2025-05-07T19:56:31.2231762Z ^ 2025-05-07T19:56:31.2231976Z 2025-05-07T19:56:31.2233294Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_int4_codegen_cuda.cu(288): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:56:31.2234761Z if (output_d >= 0 && output_d < D) { 2025-05-07T19:56:31.2235173Z ^ 2025-05-07T19:56:31.2235381Z 2025-05-07T19:56:31.2236530Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_int4_codegen_cuda.cu(254): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:56:31.2237972Z if (output_d >= 0 && output_d < D && packed_bag_store_idx < num_packed_bags) { 2025-05-07T19:56:31.2238557Z ^ 2025-05-07T19:56:31.2238816Z 2025-05-07T19:56:31.2239141Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:56:31.2239807Z 2025-05-07T19:56:31.2240923Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_int4_codegen_cuda.cu(259): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:56:31.2242312Z if (output_d >= 0 && output_d < D) { 2025-05-07T19:56:31.2242722Z ^ 2025-05-07T19:56:31.2242915Z 2025-05-07T19:56:31.2243901Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_int4_codegen_cuda.cu(276): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:56:31.2245208Z if (output_d >= 0 && output_d < D) { 2025-05-07T19:56:31.2245613Z ^ 2025-05-07T19:56:31.2245810Z 2025-05-07T19:56:31.2246801Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_int4_codegen_cuda.cu(288): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:56:31.2248275Z if (output_d >= 0 && output_d < D) { 2025-05-07T19:56:31.2248641Z ^ 2025-05-07T19:56:31.2248854Z 2025-05-07T19:56:31.2249839Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_int4_codegen_cuda.cu(254): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:56:31.2251370Z if (output_d >= 0 && output_d < D && packed_bag_store_idx < num_packed_bags) { 2025-05-07T19:56:31.2251872Z ^ 2025-05-07T19:56:31.2252091Z 2025-05-07T19:56:31.2252391Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:56:31.2252845Z 2025-05-07T19:56:31.2254021Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_int4_codegen_cuda.cu(259): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:56:31.2255383Z if (output_d >= 0 && output_d < D) { 2025-05-07T19:56:31.2255840Z ^ 2025-05-07T19:56:31.2256068Z 2025-05-07T19:56:31.2257099Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_int4_codegen_cuda.cu(276): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:56:31.2258374Z if (output_d >= 0 && output_d < D) { 2025-05-07T19:56:31.2258778Z ^ 2025-05-07T19:56:31.2258979Z 2025-05-07T19:56:31.2259975Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_int4_codegen_cuda.cu(288): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:56:31.2261267Z if (output_d >= 0 && output_d < D) { 2025-05-07T19:56:31.2261673Z ^ 2025-05-07T19:56:31.2261926Z 2025-05-07T19:56:31.2263202Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_int4_codegen_cuda.cu(254): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:56:31.2264609Z if (output_d >= 0 && output_d < D && packed_bag_store_idx < num_packed_bags) { 2025-05-07T19:56:31.2265100Z ^ 2025-05-07T19:56:31.2265326Z 2025-05-07T19:56:31.2265628Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:56:31.2266063Z 2025-05-07T19:56:31.2267108Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_int4_codegen_cuda.cu(259): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:56:31.2268502Z if (output_d >= 0 && output_d < D) { 2025-05-07T19:56:31.2268909Z ^ 2025-05-07T19:56:31.2269279Z 2025-05-07T19:56:31.2270277Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_int4_codegen_cuda.cu(276): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:56:31.2271550Z if (output_d >= 0 && output_d < D) { 2025-05-07T19:56:31.2271941Z ^ 2025-05-07T19:56:31.2272133Z 2025-05-07T19:56:31.2273121Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_int4_codegen_cuda.cu(288): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:56:31.2274400Z if (output_d >= 0 && output_d < D) { 2025-05-07T19:56:31.2274777Z ^ 2025-05-07T19:56:31.2274997Z 2025-05-07T19:56:31.2276008Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_int4_codegen_cuda.cu(254): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:56:31.2277613Z if (output_d >= 0 && output_d < D && packed_bag_store_idx < num_packed_bags) { 2025-05-07T19:56:31.2278191Z ^ 2025-05-07T19:56:31.2278448Z 2025-05-07T19:56:31.2278768Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:56:31.2279211Z 2025-05-07T19:56:31.2280306Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_int4_codegen_cuda.cu(259): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:56:31.2281741Z if (output_d >= 0 && output_d < D) { 2025-05-07T19:56:31.2282204Z ^ 2025-05-07T19:56:31.2282422Z 2025-05-07T19:56:31.2283429Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_int4_codegen_cuda.cu(276): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:56:31.2284923Z if (output_d >= 0 && output_d < D) { 2025-05-07T19:56:31.2285372Z ^ 2025-05-07T19:56:31.2285593Z 2025-05-07T19:56:31.2286644Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_int4_codegen_cuda.cu(288): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:56:31.2288057Z if (output_d >= 0 && output_d < D) { 2025-05-07T19:56:31.2288457Z ^ 2025-05-07T19:56:31.2288689Z 2025-05-07T19:56:31.6594180Z [249/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_inference_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -gencode arch=compute_90,code=sm_90 -gencode arch=compute_90a,code=sm_90a -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++20 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_tbe_inference.dir/gen_embedding_forward_quantized_split_nbit_kernel_weighted_int4_codegen_cuda.cu.o -MF CMakeFiles/fbgemm_gpu_tbe_inference.dir/gen_embedding_forward_quantized_split_nbit_kernel_weighted_int4_codegen_cuda.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_weighted_int4_codegen_cuda.cu -o CMakeFiles/fbgemm_gpu_tbe_inference.dir/gen_embedding_forward_quantized_split_nbit_kernel_weighted_int4_codegen_cuda.cu.o 2025-05-07T19:56:31.6621566Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_weighted_int4_codegen_cuda.cu(270): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:56:31.6623757Z if (output_d >= 0 && output_d < D && packed_bag_store_idx < num_packed_bags) { 2025-05-07T19:56:31.6624560Z ^ 2025-05-07T19:56:31.6624874Z 2025-05-07T19:56:31.6625366Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:56:31.6626058Z 2025-05-07T19:56:31.6627631Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_weighted_int4_codegen_cuda.cu(275): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:56:31.6630047Z if (output_d >= 0 && output_d < D) { 2025-05-07T19:56:31.6630639Z ^ 2025-05-07T19:56:31.6630962Z 2025-05-07T19:56:31.6632529Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_weighted_int4_codegen_cuda.cu(292): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:56:31.6634556Z if (output_d >= 0 && output_d < D) { 2025-05-07T19:56:31.6635131Z ^ 2025-05-07T19:56:31.6635430Z 2025-05-07T19:56:31.6637022Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_weighted_int4_codegen_cuda.cu(304): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:56:31.6638992Z if (output_d >= 0 && output_d < D) { 2025-05-07T19:56:31.6639595Z ^ 2025-05-07T19:56:31.6639899Z 2025-05-07T19:56:31.6641795Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_weighted_int4_codegen_cuda.cu(270): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:56:31.6643488Z if (output_d >= 0 && output_d < D && packed_bag_store_idx < num_packed_bags) { 2025-05-07T19:56:31.6644023Z ^ 2025-05-07T19:56:31.6644238Z 2025-05-07T19:56:31.6644554Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:56:31.6645033Z 2025-05-07T19:56:31.6646062Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_weighted_int4_codegen_cuda.cu(275): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:56:31.6647386Z if (output_d >= 0 && output_d < D) { 2025-05-07T19:56:31.6647771Z ^ 2025-05-07T19:56:31.6648218Z 2025-05-07T19:56:31.6649245Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_weighted_int4_codegen_cuda.cu(292): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:56:31.6650681Z if (output_d >= 0 && output_d < D) { 2025-05-07T19:56:31.6651066Z ^ 2025-05-07T19:56:31.6651265Z 2025-05-07T19:56:31.6652310Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_weighted_int4_codegen_cuda.cu(304): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:56:31.6653600Z if (output_d >= 0 && output_d < D) { 2025-05-07T19:56:31.6654005Z ^ 2025-05-07T19:56:31.6654197Z 2025-05-07T19:56:31.6655241Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_weighted_int4_codegen_cuda.cu(270): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:56:31.6656667Z if (output_d >= 0 && output_d < D && packed_bag_store_idx < num_packed_bags) { 2025-05-07T19:56:31.6657209Z ^ 2025-05-07T19:56:31.6657417Z 2025-05-07T19:56:31.6657727Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:56:31.6658222Z 2025-05-07T19:56:31.6659246Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_weighted_int4_codegen_cuda.cu(275): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:56:31.6660570Z if (output_d >= 0 && output_d < D) { 2025-05-07T19:56:31.6660968Z ^ 2025-05-07T19:56:31.6661201Z 2025-05-07T19:56:31.6662229Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_weighted_int4_codegen_cuda.cu(292): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:56:31.6663678Z if (output_d >= 0 && output_d < D) { 2025-05-07T19:56:31.6664189Z ^ 2025-05-07T19:56:31.6664423Z 2025-05-07T19:56:31.6665641Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_weighted_int4_codegen_cuda.cu(304): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:56:31.6667186Z if (output_d >= 0 && output_d < D) { 2025-05-07T19:56:31.6667685Z ^ 2025-05-07T19:56:31.6667923Z 2025-05-07T19:56:31.6669172Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_weighted_int4_codegen_cuda.cu(270): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:56:31.6670860Z if (output_d >= 0 && output_d < D && packed_bag_store_idx < num_packed_bags) { 2025-05-07T19:56:31.6671528Z ^ 2025-05-07T19:56:31.6671771Z 2025-05-07T19:56:31.6672301Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:56:31.6672876Z 2025-05-07T19:56:31.6674089Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_weighted_int4_codegen_cuda.cu(275): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:56:31.6675643Z if (output_d >= 0 && output_d < D) { 2025-05-07T19:56:31.6676105Z ^ 2025-05-07T19:56:31.6676373Z 2025-05-07T19:56:31.6677551Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_weighted_int4_codegen_cuda.cu(292): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:56:31.6679113Z if (output_d >= 0 && output_d < D) { 2025-05-07T19:56:31.6679568Z ^ 2025-05-07T19:56:31.6679945Z 2025-05-07T19:56:31.6681201Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_weighted_int4_codegen_cuda.cu(304): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:56:31.6682744Z if (output_d >= 0 && output_d < D) { 2025-05-07T19:56:31.6683225Z ^ 2025-05-07T19:56:31.6683453Z 2025-05-07T19:56:31.6684683Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_weighted_int4_codegen_cuda.cu(270): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:56:31.6686354Z if (output_d >= 0 && output_d < D && packed_bag_store_idx < num_packed_bags) { 2025-05-07T19:56:31.6686985Z ^ 2025-05-07T19:56:31.6687227Z 2025-05-07T19:56:31.6687593Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:56:31.6688146Z 2025-05-07T19:56:31.6689376Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_weighted_int4_codegen_cuda.cu(275): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:56:31.6691074Z if (output_d >= 0 && output_d < D) { 2025-05-07T19:56:31.6691548Z ^ 2025-05-07T19:56:31.6691784Z 2025-05-07T19:56:31.6692976Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_weighted_int4_codegen_cuda.cu(292): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:56:31.6694303Z if (output_d >= 0 && output_d < D) { 2025-05-07T19:56:31.6694733Z ^ 2025-05-07T19:56:31.6694925Z 2025-05-07T19:56:31.6696018Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_weighted_int4_codegen_cuda.cu(304): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:56:31.6697349Z if (output_d >= 0 && output_d < D) { 2025-05-07T19:56:31.6697773Z ^ 2025-05-07T19:56:31.6697968Z 2025-05-07T19:56:32.1327907Z [250/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_inference_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -gencode arch=compute_90,code=sm_90 -gencode arch=compute_90a,code=sm_90a -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++20 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_tbe_inference.dir/gen_embedding_forward_quantized_split_nbit_kernel_weighted_fp8_codegen_cuda.cu.o -MF CMakeFiles/fbgemm_gpu_tbe_inference.dir/gen_embedding_forward_quantized_split_nbit_kernel_weighted_fp8_codegen_cuda.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_weighted_fp8_codegen_cuda.cu -o CMakeFiles/fbgemm_gpu_tbe_inference.dir/gen_embedding_forward_quantized_split_nbit_kernel_weighted_fp8_codegen_cuda.cu.o 2025-05-07T19:56:32.1355348Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_weighted_fp8_codegen_cuda.cu(269): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:56:32.1357272Z if (output_d >= 0 && output_d < D && packed_bag_store_idx < num_packed_bags) { 2025-05-07T19:56:32.1357953Z ^ 2025-05-07T19:56:32.1358235Z 2025-05-07T19:56:32.1358630Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:56:32.1359244Z 2025-05-07T19:56:32.1360682Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_weighted_fp8_codegen_cuda.cu(274): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:56:32.1362388Z if (output_d >= 0 && output_d < D) { 2025-05-07T19:56:32.1362881Z ^ 2025-05-07T19:56:32.1363131Z 2025-05-07T19:56:32.1364358Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_weighted_fp8_codegen_cuda.cu(291): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:56:32.1366091Z if (output_d >= 0 && output_d < D) { 2025-05-07T19:56:32.1366614Z ^ 2025-05-07T19:56:32.1366858Z 2025-05-07T19:56:32.1368185Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_weighted_fp8_codegen_cuda.cu(303): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:56:32.1369968Z if (output_d >= 0 && output_d < D) { 2025-05-07T19:56:32.1370651Z ^ 2025-05-07T19:56:32.1370911Z 2025-05-07T19:56:32.1372235Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_weighted_fp8_codegen_cuda.cu(269): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:56:32.1374090Z if (output_d >= 0 && output_d < D && packed_bag_store_idx < num_packed_bags) { 2025-05-07T19:56:32.1374774Z ^ 2025-05-07T19:56:32.1375040Z 2025-05-07T19:56:32.1375830Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:56:32.1376442Z 2025-05-07T19:56:32.1377784Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_weighted_fp8_codegen_cuda.cu(274): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:56:32.1379500Z if (output_d >= 0 && output_d < D) { 2025-05-07T19:56:32.1379996Z ^ 2025-05-07T19:56:32.1380276Z 2025-05-07T19:56:32.1381630Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_weighted_fp8_codegen_cuda.cu(291): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:56:32.1383368Z if (output_d >= 0 && output_d < D) { 2025-05-07T19:56:32.1383865Z ^ 2025-05-07T19:56:32.1384138Z 2025-05-07T19:56:32.1385641Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_weighted_fp8_codegen_cuda.cu(303): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:56:32.1387376Z if (output_d >= 0 && output_d < D) { 2025-05-07T19:56:32.1387870Z ^ 2025-05-07T19:56:32.1388120Z 2025-05-07T19:56:32.1389468Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_weighted_fp8_codegen_cuda.cu(269): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:56:32.1391284Z if (output_d >= 0 && output_d < D && packed_bag_store_idx < num_packed_bags) { 2025-05-07T19:56:32.1391988Z ^ 2025-05-07T19:56:32.1392255Z 2025-05-07T19:56:32.1392850Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:56:32.1393415Z 2025-05-07T19:56:32.1394753Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_weighted_fp8_codegen_cuda.cu(274): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:56:32.1396476Z if (output_d >= 0 && output_d < D) { 2025-05-07T19:56:32.1396971Z ^ 2025-05-07T19:56:32.1397249Z 2025-05-07T19:56:32.1398566Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_weighted_fp8_codegen_cuda.cu(291): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:56:32.1400276Z if (output_d >= 0 && output_d < D) { 2025-05-07T19:56:32.1400755Z ^ 2025-05-07T19:56:32.1400994Z 2025-05-07T19:56:32.1402342Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_weighted_fp8_codegen_cuda.cu(303): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:56:32.1404050Z if (output_d >= 0 && output_d < D) { 2025-05-07T19:56:32.1404552Z ^ 2025-05-07T19:56:32.1404792Z 2025-05-07T19:56:32.1406122Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_weighted_fp8_codegen_cuda.cu(269): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:56:32.1407946Z if (output_d >= 0 && output_d < D && packed_bag_store_idx < num_packed_bags) { 2025-05-07T19:56:32.1408625Z ^ 2025-05-07T19:56:32.1408876Z 2025-05-07T19:56:32.1409263Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:56:32.1409871Z 2025-05-07T19:56:32.1411314Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_weighted_fp8_codegen_cuda.cu(274): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:56:32.1413041Z if (output_d >= 0 && output_d < D) { 2025-05-07T19:56:32.1413516Z ^ 2025-05-07T19:56:32.1413975Z 2025-05-07T19:56:32.1415312Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_weighted_fp8_codegen_cuda.cu(291): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:56:32.1417030Z if (output_d >= 0 && output_d < D) { 2025-05-07T19:56:32.1417508Z ^ 2025-05-07T19:56:32.1417754Z 2025-05-07T19:56:32.1419101Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_weighted_fp8_codegen_cuda.cu(303): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:56:32.1420780Z if (output_d >= 0 && output_d < D) { 2025-05-07T19:56:32.1421315Z ^ 2025-05-07T19:56:32.1421567Z 2025-05-07T19:56:32.1422928Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_weighted_fp8_codegen_cuda.cu(269): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:56:32.1424857Z if (output_d >= 0 && output_d < D && packed_bag_store_idx < num_packed_bags) { 2025-05-07T19:56:32.1425562Z ^ 2025-05-07T19:56:32.1425822Z 2025-05-07T19:56:32.1426225Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:56:32.1426846Z 2025-05-07T19:56:32.1428138Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_weighted_fp8_codegen_cuda.cu(274): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:56:32.1430002Z if (output_d >= 0 && output_d < D) { 2025-05-07T19:56:32.1430721Z ^ 2025-05-07T19:56:32.1430997Z 2025-05-07T19:56:32.1432325Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_weighted_fp8_codegen_cuda.cu(291): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:56:32.1434015Z if (output_d >= 0 && output_d < D) { 2025-05-07T19:56:32.1434483Z ^ 2025-05-07T19:56:32.1434728Z 2025-05-07T19:56:32.1436078Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_weighted_fp8_codegen_cuda.cu(303): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:56:32.1437757Z if (output_d >= 0 && output_d < D) { 2025-05-07T19:56:32.1438271Z ^ 2025-05-07T19:56:32.1438511Z 2025-05-07T19:56:41.1458286Z [251/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_inference_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -gencode arch=compute_90,code=sm_90 -gencode arch=compute_90a,code=sm_90a -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++20 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_tbe_inference.dir/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_fp8_codegen_cuda.cu.o -MF CMakeFiles/fbgemm_gpu_tbe_inference.dir/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_fp8_codegen_cuda.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_fp8_codegen_cuda.cu -o CMakeFiles/fbgemm_gpu_tbe_inference.dir/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_fp8_codegen_cuda.cu.o 2025-05-07T19:56:41.1479958Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_fp8_codegen_cuda.cu(253): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:56:41.1481978Z if (output_d >= 0 && output_d < D && packed_bag_store_idx < num_packed_bags) { 2025-05-07T19:56:41.1482618Z ^ 2025-05-07T19:56:41.1482901Z 2025-05-07T19:56:41.1483303Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:56:41.1484254Z 2025-05-07T19:56:41.1485695Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_fp8_codegen_cuda.cu(258): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:56:41.1487591Z if (output_d >= 0 && output_d < D) { 2025-05-07T19:56:41.1488117Z ^ 2025-05-07T19:56:41.1488389Z 2025-05-07T19:56:41.1489787Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_fp8_codegen_cuda.cu(275): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:56:41.1491820Z if (output_d >= 0 && output_d < D) { 2025-05-07T19:56:41.1492262Z ^ 2025-05-07T19:56:41.1492515Z 2025-05-07T19:56:41.1493934Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_fp8_codegen_cuda.cu(287): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:56:41.1495710Z if (output_d >= 0 && output_d < D) { 2025-05-07T19:56:41.1496205Z ^ 2025-05-07T19:56:41.1496466Z 2025-05-07T19:56:41.1497865Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_fp8_codegen_cuda.cu(253): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:56:41.1499839Z if (output_d >= 0 && output_d < D && packed_bag_store_idx < num_packed_bags) { 2025-05-07T19:56:41.1500528Z ^ 2025-05-07T19:56:41.1500790Z 2025-05-07T19:56:41.1501214Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:56:41.1501836Z 2025-05-07T19:56:41.1503218Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_fp8_codegen_cuda.cu(258): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:56:41.1505106Z if (output_d >= 0 && output_d < D) { 2025-05-07T19:56:41.1505583Z ^ 2025-05-07T19:56:41.1505834Z 2025-05-07T19:56:41.1507540Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_fp8_codegen_cuda.cu(275): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:56:41.1509203Z if (output_d >= 0 && output_d < D) { 2025-05-07T19:56:41.1509575Z ^ 2025-05-07T19:56:41.1509832Z 2025-05-07T19:56:41.1511251Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_fp8_codegen_cuda.cu(287): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:56:41.1513064Z if (output_d >= 0 && output_d < D) { 2025-05-07T19:56:41.1513587Z ^ 2025-05-07T19:56:41.1513874Z 2025-05-07T19:56:41.1515347Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_fp8_codegen_cuda.cu(253): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:56:41.1517404Z if (output_d >= 0 && output_d < D && packed_bag_store_idx < num_packed_bags) { 2025-05-07T19:56:41.1518074Z ^ 2025-05-07T19:56:41.1518312Z 2025-05-07T19:56:41.1518676Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:56:41.1519246Z 2025-05-07T19:56:41.1520660Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_fp8_codegen_cuda.cu(258): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:56:41.1522522Z if (output_d >= 0 && output_d < D) { 2025-05-07T19:56:41.1523215Z ^ 2025-05-07T19:56:41.1523491Z 2025-05-07T19:56:41.1525003Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_fp8_codegen_cuda.cu(275): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:56:41.1526652Z if (output_d >= 0 && output_d < D) { 2025-05-07T19:56:41.1527131Z ^ 2025-05-07T19:56:41.1527350Z 2025-05-07T19:56:41.1529096Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_fp8_codegen_cuda.cu(287): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:56:41.1531084Z if (output_d >= 0 && output_d < D) { 2025-05-07T19:56:41.1531547Z ^ 2025-05-07T19:56:41.1531815Z 2025-05-07T19:56:41.1533239Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_fp8_codegen_cuda.cu(253): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:56:41.1535070Z if (output_d >= 0 && output_d < D && packed_bag_store_idx < num_packed_bags) { 2025-05-07T19:56:41.1535741Z ^ 2025-05-07T19:56:41.1535993Z 2025-05-07T19:56:41.1536433Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:56:41.1537031Z 2025-05-07T19:56:41.1538487Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_fp8_codegen_cuda.cu(258): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:56:41.1540346Z if (output_d >= 0 && output_d < D) { 2025-05-07T19:56:41.1540826Z ^ 2025-05-07T19:56:41.1541059Z 2025-05-07T19:56:41.1542444Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_fp8_codegen_cuda.cu(275): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:56:41.1544220Z if (output_d >= 0 && output_d < D) { 2025-05-07T19:56:41.1544690Z ^ 2025-05-07T19:56:41.1544952Z 2025-05-07T19:56:41.1546679Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_fp8_codegen_cuda.cu(287): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:56:41.1548407Z if (output_d >= 0 && output_d < D) { 2025-05-07T19:56:41.1548910Z ^ 2025-05-07T19:56:41.1549198Z 2025-05-07T19:56:41.1550559Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_fp8_codegen_cuda.cu(253): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:56:41.1552564Z if (output_d >= 0 && output_d < D && packed_bag_store_idx < num_packed_bags) { 2025-05-07T19:56:41.1553280Z ^ 2025-05-07T19:56:41.1553569Z 2025-05-07T19:56:41.1553983Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:56:41.1554820Z 2025-05-07T19:56:41.1556161Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_fp8_codegen_cuda.cu(258): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:56:41.1557929Z if (output_d >= 0 && output_d < D) { 2025-05-07T19:56:41.1558451Z ^ 2025-05-07T19:56:41.1558696Z 2025-05-07T19:56:41.1560030Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_fp8_codegen_cuda.cu(275): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:56:41.1561826Z if (output_d >= 0 && output_d < D) { 2025-05-07T19:56:41.1562554Z ^ 2025-05-07T19:56:41.1562822Z 2025-05-07T19:56:41.1564296Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_fp8_codegen_cuda.cu(287): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:56:41.1566045Z if (output_d >= 0 && output_d < D) { 2025-05-07T19:56:41.1566544Z ^ 2025-05-07T19:56:41.1566798Z 2025-05-07T19:57:13.4446413Z [252/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -gencode arch=compute_90,code=sm_90 -gencode arch=compute_90a,code=sm_90a -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++20 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_rowwise_adagrad_ssd_unweighted_nobag_cuda.cu.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_rowwise_adagrad_ssd_unweighted_nobag_cuda.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_backward_rowwise_adagrad_ssd_unweighted_nobag_cuda.cu -o CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_rowwise_adagrad_ssd_unweighted_nobag_cuda.cu.o 2025-05-07T19:57:15.6571052Z [253/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -gencode arch=compute_90,code=sm_90 -gencode arch=compute_90a,code=sm_90a -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++20 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_adagrad_split_unweighted_nobag_cuda.cu.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_adagrad_split_unweighted_nobag_cuda.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_backward_adagrad_split_unweighted_nobag_cuda.cu -o CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_adagrad_split_unweighted_nobag_cuda.cu.o 2025-05-07T19:57:15.8312345Z [254/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -gencode arch=compute_90,code=sm_90 -gencode arch=compute_90a,code=sm_90a -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++20 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_rowwise_adagrad_ssd_weighted_cuda.cu.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_rowwise_adagrad_ssd_weighted_cuda.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_backward_rowwise_adagrad_ssd_weighted_cuda.cu -o CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_rowwise_adagrad_ssd_weighted_cuda.cu.o 2025-05-07T19:57:17.8650195Z [255/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -gencode arch=compute_90,code=sm_90 -gencode arch=compute_90a,code=sm_90a -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++20 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_rowwise_adagrad_ssd_unweighted_cuda.cu.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_rowwise_adagrad_ssd_unweighted_cuda.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_backward_rowwise_adagrad_ssd_unweighted_cuda.cu -o CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_rowwise_adagrad_ssd_unweighted_cuda.cu.o 2025-05-07T19:57:20.2664839Z [256/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -gencode arch=compute_90,code=sm_90 -gencode arch=compute_90a,code=sm_90a -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++20 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_adagrad_split_weighted_cuda.cu.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_adagrad_split_weighted_cuda.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_backward_adagrad_split_weighted_cuda.cu -o CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_adagrad_split_weighted_cuda.cu.o 2025-05-07T19:57:21.1076530Z [257/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -gencode arch=compute_90,code=sm_90 -gencode arch=compute_90a,code=sm_90a -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++20 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_adagrad_split_unweighted_cuda.cu.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_adagrad_split_unweighted_cuda.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_backward_adagrad_split_unweighted_cuda.cu -o CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_adagrad_split_unweighted_cuda.cu.o 2025-05-07T19:57:22.7812012Z [258/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -gencode arch=compute_90,code=sm_90 -gencode arch=compute_90a,code=sm_90a -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++20 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_adagrad_split_weighted_kernel_warp.cu.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_adagrad_split_weighted_kernel_warp.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_backward_adagrad_split_weighted_kernel_warp.cu -o CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_adagrad_split_weighted_kernel_warp.cu.o 2025-05-07T19:57:25.0880339Z [259/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -gencode arch=compute_90,code=sm_90 -gencode arch=compute_90a,code=sm_90a -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++20 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_adagrad_split_unweighted_kernel_warp.cu.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_adagrad_split_unweighted_kernel_warp.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_backward_adagrad_split_unweighted_kernel_warp.cu -o CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_adagrad_split_unweighted_kernel_warp.cu.o 2025-05-07T19:57:26.4596301Z [260/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -gencode arch=compute_90,code=sm_90 -gencode arch=compute_90a,code=sm_90a -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++20 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_adagrad_split_unweighted_nobag_kernel_warp.cu.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_adagrad_split_unweighted_nobag_kernel_warp.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_backward_adagrad_split_unweighted_nobag_kernel_warp.cu -o CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_adagrad_split_unweighted_nobag_kernel_warp.cu.o 2025-05-07T19:57:28.6345838Z [261/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -gencode arch=compute_90,code=sm_90 -gencode arch=compute_90a,code=sm_90a -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++20 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_rowwise_adagrad_split_weighted_cuda.cu.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_rowwise_adagrad_split_weighted_cuda.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_backward_rowwise_adagrad_split_weighted_cuda.cu -o CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_rowwise_adagrad_split_weighted_cuda.cu.o 2025-05-07T19:57:36.7195716Z [262/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -gencode arch=compute_90,code=sm_90 -gencode arch=compute_90a,code=sm_90a -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++20 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_rowwise_adagrad_ssd_unweighted_kernel_warp.cu.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_rowwise_adagrad_ssd_unweighted_kernel_warp.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_backward_rowwise_adagrad_ssd_unweighted_kernel_warp.cu -o CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_rowwise_adagrad_ssd_unweighted_kernel_warp.cu.o 2025-05-07T19:57:37.3278302Z [263/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -gencode arch=compute_90,code=sm_90 -gencode arch=compute_90a,code=sm_90a -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++20 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_rowwise_adagrad_ssd_weighted_kernel_warp.cu.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_rowwise_adagrad_ssd_weighted_kernel_warp.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_backward_rowwise_adagrad_ssd_weighted_kernel_warp.cu -o CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_rowwise_adagrad_ssd_weighted_kernel_warp.cu.o 2025-05-07T19:57:40.5007998Z [264/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -gencode arch=compute_90,code=sm_90 -gencode arch=compute_90a,code=sm_90a -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++20 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_rowwise_adagrad_split_unweighted_nobag_cuda.cu.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_rowwise_adagrad_split_unweighted_nobag_cuda.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_backward_rowwise_adagrad_split_unweighted_nobag_cuda.cu -o CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_rowwise_adagrad_split_unweighted_nobag_cuda.cu.o 2025-05-07T19:57:40.5474955Z [265/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -gencode arch=compute_90,code=sm_90 -gencode arch=compute_90a,code=sm_90a -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++20 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_rowwise_adagrad_ssd_unweighted_nobag_kernel_warp.cu.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_rowwise_adagrad_ssd_unweighted_nobag_kernel_warp.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_backward_rowwise_adagrad_ssd_unweighted_nobag_kernel_warp.cu -o CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_rowwise_adagrad_ssd_unweighted_nobag_kernel_warp.cu.o 2025-05-07T19:57:46.0575387Z [266/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_forward_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -gencode arch=compute_90,code=sm_90 -gencode arch=compute_90a,code=sm_90a -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++20 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_forward.dir/gen_embedding_forward_split_weighted_gwd_kernel.cu.o -MF CMakeFiles/fbgemm_gpu_tbe_training_forward.dir/gen_embedding_forward_split_weighted_gwd_kernel.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_forward_split_weighted_gwd_kernel.cu -o CMakeFiles/fbgemm_gpu_tbe_training_forward.dir/gen_embedding_forward_split_weighted_gwd_kernel.cu.o 2025-05-07T19:57:46.0596564Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_forward_split_weighted_gwd_kernel.cu(240): warning #177-D: variable "offset_idx" was declared but never referenced 2025-05-07T19:57:46.0598114Z const auto offset_idx = idx * D_emb; 2025-05-07T19:57:46.0598648Z ^ 2025-05-07T19:57:46.0598898Z 2025-05-07T19:57:46.0599376Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:57:46.0600002Z 2025-05-07T19:57:46.0601109Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_forward_split_weighted_gwd_kernel.cu(240): warning #177-D: variable "offset_idx" was declared but never referenced 2025-05-07T19:57:46.0602749Z const auto offset_idx = idx * D_emb; 2025-05-07T19:57:46.0603560Z ^ 2025-05-07T19:57:46.0603843Z 2025-05-07T19:57:46.0604228Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:57:46.0604874Z 2025-05-07T19:57:46.0606155Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_forward_split_weighted_gwd_kernel.cu(240): warning #177-D: variable "offset_idx" was declared but never referenced 2025-05-07T19:57:46.0607723Z const auto offset_idx = idx * D_emb; 2025-05-07T19:57:46.0608246Z ^ 2025-05-07T19:57:46.0608491Z 2025-05-07T19:57:46.0608905Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:57:46.0609565Z 2025-05-07T19:57:46.0610960Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_forward_split_weighted_gwd_kernel.cu(240): warning #177-D: variable "offset_idx" was declared but never referenced 2025-05-07T19:57:46.0612557Z const auto offset_idx = idx * D_emb; 2025-05-07T19:57:46.0613085Z ^ 2025-05-07T19:57:46.0613322Z 2025-05-07T19:57:46.0613738Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:57:46.0614367Z 2025-05-07T19:57:53.8284197Z [267/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -gencode arch=compute_90,code=sm_90 -gencode arch=compute_90a,code=sm_90a -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++20 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_rowwise_adagrad_split_weighted_kernel_warp.cu.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_rowwise_adagrad_split_weighted_kernel_warp.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_backward_rowwise_adagrad_split_weighted_kernel_warp.cu -o CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_rowwise_adagrad_split_weighted_kernel_warp.cu.o 2025-05-07T19:57:54.7009640Z [268/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -gencode arch=compute_90,code=sm_90 -gencode arch=compute_90a,code=sm_90a -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++20 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_adagrad_split_weighted_kernel_cta.cu.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_adagrad_split_weighted_kernel_cta.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_backward_adagrad_split_weighted_kernel_cta.cu -o CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_adagrad_split_weighted_kernel_cta.cu.o 2025-05-07T19:57:56.2845987Z [269/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -gencode arch=compute_90,code=sm_90 -gencode arch=compute_90a,code=sm_90a -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++20 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_adagrad_split_unweighted_kernel_cta.cu.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_adagrad_split_unweighted_kernel_cta.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_backward_adagrad_split_unweighted_kernel_cta.cu -o CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_adagrad_split_unweighted_kernel_cta.cu.o 2025-05-07T19:57:58.6713049Z [270/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -gencode arch=compute_90,code=sm_90 -gencode arch=compute_90a,code=sm_90a -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++20 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_adagrad_split_unweighted_nobag_kernel_cta.cu.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_adagrad_split_unweighted_nobag_kernel_cta.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_backward_adagrad_split_unweighted_nobag_kernel_cta.cu -o CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_adagrad_split_unweighted_nobag_kernel_cta.cu.o 2025-05-07T19:58:02.2886538Z [271/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -gencode arch=compute_90,code=sm_90 -gencode arch=compute_90a,code=sm_90a -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++20 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_ssd_indice_weights_codegen_cuda.cu.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_ssd_indice_weights_codegen_cuda.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_backward_ssd_indice_weights_codegen_cuda.cu -o CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_ssd_indice_weights_codegen_cuda.cu.o 2025-05-07T19:58:02.2909495Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_backward_ssd_indice_weights_codegen_cuda.cu(288): warning #177-D: variable "error_code" was declared but never referenced 2025-05-07T19:58:02.2911287Z int error_code = 0; 2025-05-07T19:58:02.2911728Z ^ 2025-05-07T19:58:02.2911938Z 2025-05-07T19:58:02.2912392Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:58:02.2913030Z 2025-05-07T19:58:02.2914377Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_backward_ssd_indice_weights_codegen_cuda.cu(289): warning #177-D: variable "error_value" was declared but never referenced 2025-05-07T19:58:02.2916091Z int64_t error_value; 2025-05-07T19:58:02.2916551Z ^ 2025-05-07T19:58:02.2916780Z 2025-05-07T19:58:02.2918126Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_backward_ssd_indice_weights_codegen_cuda.cu(136): warning #177-D: variable "error_code" was declared but never referenced 2025-05-07T19:58:02.2920025Z int error_code = 0; 2025-05-07T19:58:02.2920478Z ^ 2025-05-07T19:58:02.2920715Z 2025-05-07T19:58:02.2922035Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_backward_ssd_indice_weights_codegen_cuda.cu(137): warning #177-D: variable "error_value" was declared but never referenced 2025-05-07T19:58:02.2923766Z int64_t error_value; 2025-05-07T19:58:02.2924198Z ^ 2025-05-07T19:58:02.2924418Z 2025-05-07T19:58:02.2925749Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_backward_ssd_indice_weights_codegen_cuda.cu(774): warning #177-D: variable "error_code" was declared but never referenced 2025-05-07T19:58:02.2927546Z int error_code = 0; 2025-05-07T19:58:02.2928115Z ^ 2025-05-07T19:58:02.2928313Z 2025-05-07T19:58:02.2929900Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_backward_ssd_indice_weights_codegen_cuda.cu(775): warning #177-D: variable "error_value" was declared but never referenced 2025-05-07T19:58:02.2931675Z int64_t error_value; 2025-05-07T19:58:02.2932142Z ^ 2025-05-07T19:58:02.2932359Z 2025-05-07T19:58:02.2933633Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_backward_ssd_indice_weights_codegen_cuda.cu(627): warning #177-D: variable "error_code" was declared but never referenced 2025-05-07T19:58:02.2935335Z int error_code = 0; 2025-05-07T19:58:02.2935769Z ^ 2025-05-07T19:58:02.2935966Z 2025-05-07T19:58:02.2937284Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_backward_ssd_indice_weights_codegen_cuda.cu(628): warning #177-D: variable "error_value" was declared but never referenced 2025-05-07T19:58:02.2939116Z int64_t error_value; 2025-05-07T19:58:02.2939535Z ^ 2025-05-07T19:58:02.2939785Z 2025-05-07T19:58:02.2941078Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_backward_ssd_indice_weights_codegen_cuda.cu(288): warning #177-D: variable "error_code" was declared but never referenced 2025-05-07T19:58:02.2942809Z int error_code = 0; 2025-05-07T19:58:02.2943217Z ^ 2025-05-07T19:58:02.2943428Z 2025-05-07T19:58:02.2943861Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:58:02.2944499Z 2025-05-07T19:58:02.2945757Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_backward_ssd_indice_weights_codegen_cuda.cu(289): warning #177-D: variable "error_value" was declared but never referenced 2025-05-07T19:58:02.2947465Z int64_t error_value; 2025-05-07T19:58:02.2947901Z ^ 2025-05-07T19:58:02.2948125Z 2025-05-07T19:58:02.2949436Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_backward_ssd_indice_weights_codegen_cuda.cu(136): warning #177-D: variable "error_code" was declared but never referenced 2025-05-07T19:58:02.2951148Z int error_code = 0; 2025-05-07T19:58:02.2951572Z ^ 2025-05-07T19:58:02.2952139Z 2025-05-07T19:58:02.2953472Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_backward_ssd_indice_weights_codegen_cuda.cu(137): warning #177-D: variable "error_value" was declared but never referenced 2025-05-07T19:58:02.2955196Z int64_t error_value; 2025-05-07T19:58:02.2955620Z ^ 2025-05-07T19:58:02.2955847Z 2025-05-07T19:58:02.2957136Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_backward_ssd_indice_weights_codegen_cuda.cu(774): warning #177-D: variable "error_code" was declared but never referenced 2025-05-07T19:58:02.2958861Z int error_code = 0; 2025-05-07T19:58:02.2959321Z ^ 2025-05-07T19:58:02.2959541Z 2025-05-07T19:58:02.2960820Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_backward_ssd_indice_weights_codegen_cuda.cu(775): warning #177-D: variable "error_value" was declared but never referenced 2025-05-07T19:58:02.2962761Z int64_t error_value; 2025-05-07T19:58:02.2963229Z ^ 2025-05-07T19:58:02.2963463Z 2025-05-07T19:58:02.2964745Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_backward_ssd_indice_weights_codegen_cuda.cu(627): warning #177-D: variable "error_code" was declared but never referenced 2025-05-07T19:58:02.2966498Z int error_code = 0; 2025-05-07T19:58:02.2966929Z ^ 2025-05-07T19:58:02.2967175Z 2025-05-07T19:58:02.2968484Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_backward_ssd_indice_weights_codegen_cuda.cu(628): warning #177-D: variable "error_value" was declared but never referenced 2025-05-07T19:58:02.2970358Z int64_t error_value; 2025-05-07T19:58:02.2972107Z ^ 2025-05-07T19:58:02.2972340Z 2025-05-07T19:58:02.2973621Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_backward_ssd_indice_weights_codegen_cuda.cu(288): warning #177-D: variable "error_code" was declared but never referenced 2025-05-07T19:58:02.2975321Z int error_code = 0; 2025-05-07T19:58:02.2975753Z ^ 2025-05-07T19:58:02.2975955Z 2025-05-07T19:58:02.2976415Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:58:02.2977064Z 2025-05-07T19:58:02.2978389Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_backward_ssd_indice_weights_codegen_cuda.cu(289): warning #177-D: variable "error_value" was declared but never referenced 2025-05-07T19:58:02.2980089Z int64_t error_value; 2025-05-07T19:58:02.2980519Z ^ 2025-05-07T19:58:02.2980781Z 2025-05-07T19:58:02.2982076Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_backward_ssd_indice_weights_codegen_cuda.cu(136): warning #177-D: variable "error_code" was declared but never referenced 2025-05-07T19:58:02.2983783Z int error_code = 0; 2025-05-07T19:58:02.2984201Z ^ 2025-05-07T19:58:02.2984443Z 2025-05-07T19:58:02.2985836Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_backward_ssd_indice_weights_codegen_cuda.cu(137): warning #177-D: variable "error_value" was declared but never referenced 2025-05-07T19:58:02.2987487Z int64_t error_value; 2025-05-07T19:58:02.2987923Z ^ 2025-05-07T19:58:02.2988134Z 2025-05-07T19:58:02.2989459Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_backward_ssd_indice_weights_codegen_cuda.cu(774): warning #177-D: variable "error_code" was declared but never referenced 2025-05-07T19:58:02.2991161Z int error_code = 0; 2025-05-07T19:58:02.2991601Z ^ 2025-05-07T19:58:02.2991808Z 2025-05-07T19:58:02.2993190Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_backward_ssd_indice_weights_codegen_cuda.cu(775): warning #177-D: variable "error_value" was declared but never referenced 2025-05-07T19:58:02.2994918Z int64_t error_value; 2025-05-07T19:58:02.2995568Z ^ 2025-05-07T19:58:02.2995798Z 2025-05-07T19:58:02.2997131Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_backward_ssd_indice_weights_codegen_cuda.cu(627): warning #177-D: variable "error_code" was declared but never referenced 2025-05-07T19:58:02.2998840Z int error_code = 0; 2025-05-07T19:58:02.2999255Z ^ 2025-05-07T19:58:02.2999490Z 2025-05-07T19:58:02.3000789Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_backward_ssd_indice_weights_codegen_cuda.cu(628): warning #177-D: variable "error_value" was declared but never referenced 2025-05-07T19:58:02.3002548Z int64_t error_value; 2025-05-07T19:58:02.3002980Z ^ 2025-05-07T19:58:02.3003182Z 2025-05-07T19:58:02.3004474Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_backward_ssd_indice_weights_codegen_cuda.cu(288): warning #177-D: variable "error_code" was declared but never referenced 2025-05-07T19:58:02.3006221Z int error_code = 0; 2025-05-07T19:58:02.3006642Z ^ 2025-05-07T19:58:02.3006844Z 2025-05-07T19:58:02.3007262Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:58:02.3007925Z 2025-05-07T19:58:02.3009212Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_backward_ssd_indice_weights_codegen_cuda.cu(289): warning #177-D: variable "error_value" was declared but never referenced 2025-05-07T19:58:02.3011080Z int64_t error_value; 2025-05-07T19:58:02.3011494Z ^ 2025-05-07T19:58:02.3011731Z 2025-05-07T19:58:02.3013093Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_backward_ssd_indice_weights_codegen_cuda.cu(136): warning #177-D: variable "error_code" was declared but never referenced 2025-05-07T19:58:02.3014917Z int error_code = 0; 2025-05-07T19:58:02.3015343Z ^ 2025-05-07T19:58:02.3015533Z 2025-05-07T19:58:02.3016860Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_backward_ssd_indice_weights_codegen_cuda.cu(137): warning #177-D: variable "error_value" was declared but never referenced 2025-05-07T19:58:02.3018484Z int64_t error_value; 2025-05-07T19:58:02.3018914Z ^ 2025-05-07T19:58:02.3019125Z 2025-05-07T19:58:02.3020421Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_backward_ssd_indice_weights_codegen_cuda.cu(774): warning #177-D: variable "error_code" was declared but never referenced 2025-05-07T19:58:02.3022117Z int error_code = 0; 2025-05-07T19:58:02.3022542Z ^ 2025-05-07T19:58:02.3022727Z 2025-05-07T19:58:02.3024025Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_backward_ssd_indice_weights_codegen_cuda.cu(775): warning #177-D: variable "error_value" was declared but never referenced 2025-05-07T19:58:02.3025812Z int64_t error_value; 2025-05-07T19:58:02.3026242Z ^ 2025-05-07T19:58:02.3026488Z 2025-05-07T19:58:02.3027805Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_backward_ssd_indice_weights_codegen_cuda.cu(627): warning #177-D: variable "error_code" was declared but never referenced 2025-05-07T19:58:02.3029733Z int error_code = 0; 2025-05-07T19:58:02.3030131Z ^ 2025-05-07T19:58:02.3030345Z 2025-05-07T19:58:02.3031642Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_backward_ssd_indice_weights_codegen_cuda.cu(628): warning #177-D: variable "error_value" was declared but never referenced 2025-05-07T19:58:02.3033264Z int64_t error_value; 2025-05-07T19:58:02.3033723Z ^ 2025-05-07T19:58:02.3033945Z 2025-05-07T19:58:04.7272227Z [272/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -gencode arch=compute_90,code=sm_90 -gencode arch=compute_90a,code=sm_90a -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++20 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_dense_indice_weights_codegen_cuda.cu.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_dense_indice_weights_codegen_cuda.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_backward_dense_indice_weights_codegen_cuda.cu -o CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_dense_indice_weights_codegen_cuda.cu.o 2025-05-07T19:58:05.3288452Z [273/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -gencode arch=compute_90,code=sm_90 -gencode arch=compute_90a,code=sm_90a -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++20 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_rowwise_adagrad_split_unweighted_cuda.cu.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_rowwise_adagrad_split_unweighted_cuda.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_backward_rowwise_adagrad_split_unweighted_cuda.cu -o CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_rowwise_adagrad_split_unweighted_cuda.cu.o 2025-05-07T19:58:09.1510378Z [274/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -gencode arch=compute_90,code=sm_90 -gencode arch=compute_90a,code=sm_90a -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++20 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_rowwise_adagrad_ssd_weighted_kernel_cta.cu.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_rowwise_adagrad_ssd_weighted_kernel_cta.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_backward_rowwise_adagrad_ssd_weighted_kernel_cta.cu -o CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_rowwise_adagrad_ssd_weighted_kernel_cta.cu.o 2025-05-07T19:58:10.0477983Z [275/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -gencode arch=compute_90,code=sm_90 -gencode arch=compute_90a,code=sm_90a -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++20 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_rowwise_adagrad_ssd_unweighted_kernel_cta.cu.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_rowwise_adagrad_ssd_unweighted_kernel_cta.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_backward_rowwise_adagrad_ssd_unweighted_kernel_cta.cu -o CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_rowwise_adagrad_ssd_unweighted_kernel_cta.cu.o 2025-05-07T19:58:13.1632329Z [276/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -gencode arch=compute_90,code=sm_90 -gencode arch=compute_90a,code=sm_90a -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++20 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_rowwise_adagrad_ssd_unweighted_nobag_kernel_cta.cu.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_rowwise_adagrad_ssd_unweighted_nobag_kernel_cta.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_backward_rowwise_adagrad_ssd_unweighted_nobag_kernel_cta.cu -o CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_rowwise_adagrad_ssd_unweighted_nobag_kernel_cta.cu.o 2025-05-07T19:58:17.0778087Z [277/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -gencode arch=compute_90,code=sm_90 -gencode arch=compute_90a,code=sm_90a -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++20 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_sgd_split_weighted_cuda.cu.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_sgd_split_weighted_cuda.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_backward_sgd_split_weighted_cuda.cu -o CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_sgd_split_weighted_cuda.cu.o 2025-05-07T19:58:19.9574272Z [278/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -gencode arch=compute_90,code=sm_90 -gencode arch=compute_90a,code=sm_90a -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++20 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_sgd_split_weighted_kernel_warp.cu.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_sgd_split_weighted_kernel_warp.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_backward_sgd_split_weighted_kernel_warp.cu -o CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_sgd_split_weighted_kernel_warp.cu.o 2025-05-07T19:58:21.2745879Z [279/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -gencode arch=compute_90,code=sm_90 -gencode arch=compute_90a,code=sm_90a -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++20 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_rowwise_adagrad_split_unweighted_nobag_kernel_warp.cu.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_rowwise_adagrad_split_unweighted_nobag_kernel_warp.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_backward_rowwise_adagrad_split_unweighted_nobag_kernel_warp.cu -o CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_rowwise_adagrad_split_unweighted_nobag_kernel_warp.cu.o 2025-05-07T19:58:22.7534598Z [280/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -gencode arch=compute_90,code=sm_90 -gencode arch=compute_90a,code=sm_90a -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++20 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_sgd_split_unweighted_nobag_cuda.cu.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_sgd_split_unweighted_nobag_cuda.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_backward_sgd_split_unweighted_nobag_cuda.cu -o CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_sgd_split_unweighted_nobag_cuda.cu.o 2025-05-07T19:58:23.1926916Z [281/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -gencode arch=compute_90,code=sm_90 -gencode arch=compute_90a,code=sm_90a -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++20 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_rowwise_adagrad_split_weighted_kernel_cta.cu.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_rowwise_adagrad_split_weighted_kernel_cta.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_backward_rowwise_adagrad_split_weighted_kernel_cta.cu -o CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_rowwise_adagrad_split_weighted_kernel_cta.cu.o 2025-05-07T19:58:27.8274798Z [282/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_forward_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -gencode arch=compute_90,code=sm_90 -gencode arch=compute_90a,code=sm_90a -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++20 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_forward.dir/gen_embedding_forward_split_unweighted_gwd_kernel.cu.o -MF CMakeFiles/fbgemm_gpu_tbe_training_forward.dir/gen_embedding_forward_split_unweighted_gwd_kernel.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_forward_split_unweighted_gwd_kernel.cu -o CMakeFiles/fbgemm_gpu_tbe_training_forward.dir/gen_embedding_forward_split_unweighted_gwd_kernel.cu.o 2025-05-07T19:58:27.8297246Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_forward_split_unweighted_gwd_kernel.cu(231): warning #177-D: variable "offset_idx" was declared but never referenced 2025-05-07T19:58:27.8299071Z const auto offset_idx = idx * D_emb; 2025-05-07T19:58:27.8299633Z ^ 2025-05-07T19:58:27.8299888Z 2025-05-07T19:58:27.8300350Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:58:27.8301023Z 2025-05-07T19:58:27.8302408Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_forward_split_unweighted_gwd_kernel.cu(231): warning #177-D: variable "offset_idx" was declared but never referenced 2025-05-07T19:58:27.8304326Z const auto offset_idx = idx * D_emb; 2025-05-07T19:58:27.8304897Z ^ 2025-05-07T19:58:27.8305167Z 2025-05-07T19:58:27.8305554Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:58:27.8306152Z 2025-05-07T19:58:27.8307427Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_forward_split_unweighted_gwd_kernel.cu(231): warning #177-D: variable "offset_idx" was declared but never referenced 2025-05-07T19:58:27.8309083Z const auto offset_idx = idx * D_emb; 2025-05-07T19:58:27.8309642Z ^ 2025-05-07T19:58:27.8309893Z 2025-05-07T19:58:27.8310315Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:58:27.8310944Z 2025-05-07T19:58:27.8312302Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_forward_split_unweighted_gwd_kernel.cu(231): warning #177-D: variable "offset_idx" was declared but never referenced 2025-05-07T19:58:27.8314233Z const auto offset_idx = idx * D_emb; 2025-05-07T19:58:27.8314793Z ^ 2025-05-07T19:58:27.8315076Z 2025-05-07T19:58:27.8315523Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:58:27.8316179Z 2025-05-07T19:58:31.2810736Z [283/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -gencode arch=compute_90,code=sm_90 -gencode arch=compute_90a,code=sm_90a -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++20 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_rowwise_adagrad_split_unweighted_kernel_warp.cu.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_rowwise_adagrad_split_unweighted_kernel_warp.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_backward_rowwise_adagrad_split_unweighted_kernel_warp.cu -o CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_rowwise_adagrad_split_unweighted_kernel_warp.cu.o 2025-05-07T19:58:41.4454971Z [284/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -gencode arch=compute_90,code=sm_90 -gencode arch=compute_90a,code=sm_90a -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++20 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_split_indice_weights_codegen_cuda.cu.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_split_indice_weights_codegen_cuda.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_backward_split_indice_weights_codegen_cuda.cu -o CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_split_indice_weights_codegen_cuda.cu.o 2025-05-07T19:58:43.0918956Z [285/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -gencode arch=compute_90,code=sm_90 -gencode arch=compute_90a,code=sm_90a -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++20 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_adam_split_weighted_kernel_warp.cu.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_adam_split_weighted_kernel_warp.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_backward_adam_split_weighted_kernel_warp.cu -o CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_adam_split_weighted_kernel_warp.cu.o 2025-05-07T19:58:48.3325817Z [286/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -gencode arch=compute_90,code=sm_90 -gencode arch=compute_90a,code=sm_90a -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++20 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_rowwise_adagrad_split_unweighted_nobag_kernel_cta.cu.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_rowwise_adagrad_split_unweighted_nobag_kernel_cta.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_backward_rowwise_adagrad_split_unweighted_nobag_kernel_cta.cu -o CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_rowwise_adagrad_split_unweighted_nobag_kernel_cta.cu.o 2025-05-07T19:58:48.7543130Z [287/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -gencode arch=compute_90,code=sm_90 -gencode arch=compute_90a,code=sm_90a -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++20 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_adam_split_unweighted_nobag_kernel_warp.cu.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_adam_split_unweighted_nobag_kernel_warp.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_backward_adam_split_unweighted_nobag_kernel_warp.cu -o CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_adam_split_unweighted_nobag_kernel_warp.cu.o 2025-05-07T19:58:49.3367222Z [288/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -gencode arch=compute_90,code=sm_90 -gencode arch=compute_90a,code=sm_90a -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++20 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_adam_split_weighted_kernel_cta.cu.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_adam_split_weighted_kernel_cta.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_backward_adam_split_weighted_kernel_cta.cu -o CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_adam_split_weighted_kernel_cta.cu.o 2025-05-07T19:58:50.3025766Z [289/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -gencode arch=compute_90,code=sm_90 -gencode arch=compute_90a,code=sm_90a -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++20 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_sgd_split_weighted_kernel_cta.cu.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_sgd_split_weighted_kernel_cta.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_backward_sgd_split_weighted_kernel_cta.cu -o CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_sgd_split_weighted_kernel_cta.cu.o 2025-05-07T19:58:53.6581281Z [290/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -gencode arch=compute_90,code=sm_90 -gencode arch=compute_90a,code=sm_90a -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++20 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_adam_split_unweighted_nobag_kernel_cta.cu.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_adam_split_unweighted_nobag_kernel_cta.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_backward_adam_split_unweighted_nobag_kernel_cta.cu -o CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_adam_split_unweighted_nobag_kernel_cta.cu.o 2025-05-07T19:58:54.5601254Z [291/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_inference_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -gencode arch=compute_90,code=sm_90 -gencode arch=compute_90a,code=sm_90a -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++20 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_tbe_inference.dir/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_nobag_int8_codegen_cuda.cu.o -MF CMakeFiles/fbgemm_gpu_tbe_inference.dir/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_nobag_int8_codegen_cuda.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_nobag_int8_codegen_cuda.cu -o CMakeFiles/fbgemm_gpu_tbe_inference.dir/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_nobag_int8_codegen_cuda.cu.o 2025-05-07T19:58:54.5623949Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_nobag_int8_codegen_cuda.cu(220): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:58:54.5626238Z if (output_d >= 0 && output_d < D) { 2025-05-07T19:58:54.5626771Z ^ 2025-05-07T19:58:54.5627109Z 2025-05-07T19:58:54.5627546Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:58:54.5628163Z 2025-05-07T19:58:54.5629957Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_nobag_int8_codegen_cuda.cu(237): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:58:54.5631942Z if (output_d >= 0 && output_d < D) { 2025-05-07T19:58:54.5632508Z ^ 2025-05-07T19:58:54.5632802Z 2025-05-07T19:58:54.5634419Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_nobag_int8_codegen_cuda.cu(247): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:58:54.5636363Z if (output_d >= 0 && output_d < D) { 2025-05-07T19:58:54.5636930Z ^ 2025-05-07T19:58:54.5637224Z 2025-05-07T19:58:54.5638687Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_nobag_int8_codegen_cuda.cu(220): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:58:54.5640567Z if (output_d >= 0 && output_d < D) { 2025-05-07T19:58:54.5641142Z ^ 2025-05-07T19:58:54.5641439Z 2025-05-07T19:58:54.5641879Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:58:54.5642474Z 2025-05-07T19:58:54.5643923Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_nobag_int8_codegen_cuda.cu(237): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:58:54.5645783Z if (output_d >= 0 && output_d < D) { 2025-05-07T19:58:54.5646349Z ^ 2025-05-07T19:58:54.5646631Z 2025-05-07T19:58:54.5648430Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_nobag_int8_codegen_cuda.cu(247): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:58:54.5650507Z if (output_d >= 0 && output_d < D) { 2025-05-07T19:58:54.5651095Z ^ 2025-05-07T19:58:54.5651399Z 2025-05-07T19:58:54.5652970Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_nobag_int8_codegen_cuda.cu(220): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:58:54.5654918Z if (output_d >= 0 && output_d < D) { 2025-05-07T19:58:54.5655473Z ^ 2025-05-07T19:58:54.5655743Z 2025-05-07T19:58:54.5656185Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:58:54.5656857Z 2025-05-07T19:58:54.5658360Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_nobag_int8_codegen_cuda.cu(237): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:58:54.5660433Z if (output_d >= 0 && output_d < D) { 2025-05-07T19:58:54.5660981Z ^ 2025-05-07T19:58:54.5661279Z 2025-05-07T19:58:54.5662708Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_nobag_int8_codegen_cuda.cu(247): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:58:54.5664521Z if (output_d >= 0 && output_d < D) { 2025-05-07T19:58:54.5665058Z ^ 2025-05-07T19:58:54.5665333Z 2025-05-07T19:58:54.5666826Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_nobag_int8_codegen_cuda.cu(220): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:58:54.5668909Z if (output_d >= 0 && output_d < D) { 2025-05-07T19:58:54.5669466Z ^ 2025-05-07T19:58:54.5669752Z 2025-05-07T19:58:54.5670192Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:58:54.5670816Z 2025-05-07T19:58:54.5672397Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_nobag_int8_codegen_cuda.cu(237): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:58:54.5674340Z if (output_d >= 0 && output_d < D) { 2025-05-07T19:58:54.5674898Z ^ 2025-05-07T19:58:54.5675191Z 2025-05-07T19:58:54.5676722Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_nobag_int8_codegen_cuda.cu(247): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:58:54.5678648Z if (output_d >= 0 && output_d < D) { 2025-05-07T19:58:54.5679211Z ^ 2025-05-07T19:58:54.5679484Z 2025-05-07T19:58:54.5680925Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_nobag_int8_codegen_cuda.cu(220): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:58:54.5682864Z if (output_d >= 0 && output_d < D) { 2025-05-07T19:58:54.5683411Z ^ 2025-05-07T19:58:54.5683677Z 2025-05-07T19:58:54.5684073Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:58:54.5684691Z 2025-05-07T19:58:54.5686149Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_nobag_int8_codegen_cuda.cu(237): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:58:54.5688054Z if (output_d >= 0 && output_d < D) { 2025-05-07T19:58:54.5688628Z ^ 2025-05-07T19:58:54.5688897Z 2025-05-07T19:58:54.5690760Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_nobag_int8_codegen_cuda.cu(247): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:58:54.5692712Z if (output_d >= 0 && output_d < D) { 2025-05-07T19:58:54.5693283Z ^ 2025-05-07T19:58:54.5693582Z 2025-05-07T19:58:58.5984314Z [292/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -gencode arch=compute_90,code=sm_90 -gencode arch=compute_90a,code=sm_90a -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++20 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_adam_split_weighted_cuda.cu.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_adam_split_weighted_cuda.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_backward_adam_split_weighted_cuda.cu -o CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_adam_split_weighted_cuda.cu.o 2025-05-07T19:58:59.0540261Z [293/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -gencode arch=compute_90,code=sm_90 -gencode arch=compute_90a,code=sm_90a -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++20 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_adam_split_unweighted_nobag_cuda.cu.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_adam_split_unweighted_nobag_cuda.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_backward_adam_split_unweighted_nobag_cuda.cu -o CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_adam_split_unweighted_nobag_cuda.cu.o 2025-05-07T19:58:59.2480787Z [294/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -gencode arch=compute_90,code=sm_90 -gencode arch=compute_90a,code=sm_90a -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++20 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_adam_split_unweighted_kernel_warp.cu.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_adam_split_unweighted_kernel_warp.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_backward_adam_split_unweighted_kernel_warp.cu -o CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_adam_split_unweighted_kernel_warp.cu.o 2025-05-07T19:59:02.2982217Z [295/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -gencode arch=compute_90,code=sm_90 -gencode arch=compute_90a,code=sm_90a -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++20 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_rowwise_adagrad_split_unweighted_kernel_cta.cu.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_rowwise_adagrad_split_unweighted_kernel_cta.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_backward_rowwise_adagrad_split_unweighted_kernel_cta.cu -o CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_rowwise_adagrad_split_unweighted_kernel_cta.cu.o 2025-05-07T19:59:03.3910231Z [296/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -gencode arch=compute_90,code=sm_90 -gencode arch=compute_90a,code=sm_90a -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++20 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_sgd_split_unweighted_nobag_kernel_cta.cu.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_sgd_split_unweighted_nobag_kernel_cta.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_backward_sgd_split_unweighted_nobag_kernel_cta.cu -o CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_sgd_split_unweighted_nobag_kernel_cta.cu.o 2025-05-07T19:59:05.3221564Z [297/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -gencode arch=compute_90,code=sm_90 -gencode arch=compute_90a,code=sm_90a -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++20 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_adam_split_unweighted_kernel_cta.cu.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_adam_split_unweighted_kernel_cta.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_backward_adam_split_unweighted_kernel_cta.cu -o CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_adam_split_unweighted_kernel_cta.cu.o 2025-05-07T19:59:06.1130663Z [298/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -gencode arch=compute_90,code=sm_90 -gencode arch=compute_90a,code=sm_90a -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++20 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_sgd_split_unweighted_kernel_warp.cu.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_sgd_split_unweighted_kernel_warp.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_backward_sgd_split_unweighted_kernel_warp.cu -o CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_sgd_split_unweighted_kernel_warp.cu.o 2025-05-07T19:59:07.3638276Z [299/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -gencode arch=compute_90,code=sm_90 -gencode arch=compute_90a,code=sm_90a -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++20 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_sgd_split_unweighted_nobag_kernel_warp.cu.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_sgd_split_unweighted_nobag_kernel_warp.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_backward_sgd_split_unweighted_nobag_kernel_warp.cu -o CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_sgd_split_unweighted_nobag_kernel_warp.cu.o 2025-05-07T19:59:09.6800778Z [300/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -gencode arch=compute_90,code=sm_90 -gencode arch=compute_90a,code=sm_90a -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++20 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_sgd_split_unweighted_cuda.cu.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_sgd_split_unweighted_cuda.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_backward_sgd_split_unweighted_cuda.cu -o CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_sgd_split_unweighted_cuda.cu.o 2025-05-07T19:59:13.7555757Z [301/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -gencode arch=compute_90,code=sm_90 -gencode arch=compute_90a,code=sm_90a -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++20 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_adam_split_unweighted_cuda.cu.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_adam_split_unweighted_cuda.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_backward_adam_split_unweighted_cuda.cu -o CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_adam_split_unweighted_cuda.cu.o 2025-05-07T19:59:15.3863326Z [302/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -gencode arch=compute_90,code=sm_90 -gencode arch=compute_90a,code=sm_90a -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++20 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_lamb_split_weighted_kernel_warp.cu.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_lamb_split_weighted_kernel_warp.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_backward_lamb_split_weighted_kernel_warp.cu -o CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_lamb_split_weighted_kernel_warp.cu.o 2025-05-07T19:59:16.6074802Z [303/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -gencode arch=compute_90,code=sm_90 -gencode arch=compute_90a,code=sm_90a -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++20 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_lamb_split_weighted_cuda.cu.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_lamb_split_weighted_cuda.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_backward_lamb_split_weighted_cuda.cu -o CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_lamb_split_weighted_cuda.cu.o 2025-05-07T19:59:16.6353433Z [304/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -gencode arch=compute_90,code=sm_90 -gencode arch=compute_90a,code=sm_90a -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++20 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_lamb_split_weighted_kernel_cta.cu.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_lamb_split_weighted_kernel_cta.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_backward_lamb_split_weighted_kernel_cta.cu -o CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_lamb_split_weighted_kernel_cta.cu.o 2025-05-07T19:59:21.2075891Z [305/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -gencode arch=compute_90,code=sm_90 -gencode arch=compute_90a,code=sm_90a -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++20 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_lamb_split_unweighted_nobag_kernel_warp.cu.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_lamb_split_unweighted_nobag_kernel_warp.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_backward_lamb_split_unweighted_nobag_kernel_warp.cu -o CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_lamb_split_unweighted_nobag_kernel_warp.cu.o 2025-05-07T19:59:26.8315613Z [306/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -gencode arch=compute_90,code=sm_90 -gencode arch=compute_90a,code=sm_90a -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++20 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_lamb_split_unweighted_nobag_cuda.cu.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_lamb_split_unweighted_nobag_cuda.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_backward_lamb_split_unweighted_nobag_cuda.cu -o CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_lamb_split_unweighted_nobag_cuda.cu.o 2025-05-07T19:59:26.9903977Z [307/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -gencode arch=compute_90,code=sm_90 -gencode arch=compute_90a,code=sm_90a -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++20 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_lamb_split_unweighted_nobag_kernel_cta.cu.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_lamb_split_unweighted_nobag_kernel_cta.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_backward_lamb_split_unweighted_nobag_kernel_cta.cu -o CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_lamb_split_unweighted_nobag_kernel_cta.cu.o 2025-05-07T19:59:27.3187518Z [308/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -gencode arch=compute_90,code=sm_90 -gencode arch=compute_90a,code=sm_90a -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++20 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_lamb_split_unweighted_kernel_warp.cu.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_lamb_split_unweighted_kernel_warp.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_backward_lamb_split_unweighted_kernel_warp.cu -o CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_lamb_split_unweighted_kernel_warp.cu.o 2025-05-07T19:59:28.9473343Z [309/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -gencode arch=compute_90,code=sm_90 -gencode arch=compute_90a,code=sm_90a -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++20 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_lars_sgd_split_weighted_kernel_warp.cu.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_lars_sgd_split_weighted_kernel_warp.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_backward_lars_sgd_split_weighted_kernel_warp.cu -o CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_lars_sgd_split_weighted_kernel_warp.cu.o 2025-05-07T19:59:34.1427210Z [310/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -gencode arch=compute_90,code=sm_90 -gencode arch=compute_90a,code=sm_90a -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++20 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_sgd_split_unweighted_kernel_cta.cu.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_sgd_split_unweighted_kernel_cta.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_backward_sgd_split_unweighted_kernel_cta.cu -o CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_sgd_split_unweighted_kernel_cta.cu.o 2025-05-07T19:59:34.4280805Z [311/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -gencode arch=compute_90,code=sm_90 -gencode arch=compute_90a,code=sm_90a -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++20 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_lars_sgd_split_weighted_kernel_cta.cu.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_lars_sgd_split_weighted_kernel_cta.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_backward_lars_sgd_split_weighted_kernel_cta.cu -o CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_lars_sgd_split_weighted_kernel_cta.cu.o 2025-05-07T19:59:34.6275844Z [312/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -gencode arch=compute_90,code=sm_90 -gencode arch=compute_90a,code=sm_90a -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++20 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_lamb_split_unweighted_kernel_cta.cu.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_lamb_split_unweighted_kernel_cta.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_backward_lamb_split_unweighted_kernel_cta.cu -o CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_lamb_split_unweighted_kernel_cta.cu.o 2025-05-07T19:59:36.8417286Z [313/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -gencode arch=compute_90,code=sm_90 -gencode arch=compute_90a,code=sm_90a -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++20 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_lamb_split_unweighted_cuda.cu.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_lamb_split_unweighted_cuda.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_backward_lamb_split_unweighted_cuda.cu -o CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_lamb_split_unweighted_cuda.cu.o 2025-05-07T19:59:38.0092037Z [314/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -gencode arch=compute_90,code=sm_90 -gencode arch=compute_90a,code=sm_90a -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++20 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_lars_sgd_split_unweighted_nobag_kernel_warp.cu.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_lars_sgd_split_unweighted_nobag_kernel_warp.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_backward_lars_sgd_split_unweighted_nobag_kernel_warp.cu -o CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_lars_sgd_split_unweighted_nobag_kernel_warp.cu.o 2025-05-07T19:59:43.4722177Z [315/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -gencode arch=compute_90,code=sm_90 -gencode arch=compute_90a,code=sm_90a -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++20 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_lars_sgd_split_unweighted_kernel_warp.cu.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_lars_sgd_split_unweighted_kernel_warp.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_backward_lars_sgd_split_unweighted_kernel_warp.cu -o CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_lars_sgd_split_unweighted_kernel_warp.cu.o 2025-05-07T19:59:43.6354519Z [316/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -gencode arch=compute_90,code=sm_90 -gencode arch=compute_90a,code=sm_90a -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++20 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_lars_sgd_split_unweighted_nobag_kernel_cta.cu.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_lars_sgd_split_unweighted_nobag_kernel_cta.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_backward_lars_sgd_split_unweighted_nobag_kernel_cta.cu -o CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_lars_sgd_split_unweighted_nobag_kernel_cta.cu.o 2025-05-07T19:59:43.7967650Z [317/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -gencode arch=compute_90,code=sm_90 -gencode arch=compute_90a,code=sm_90a -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++20 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_lars_sgd_split_weighted_cuda.cu.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_lars_sgd_split_weighted_cuda.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_backward_lars_sgd_split_weighted_cuda.cu -o CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_lars_sgd_split_weighted_cuda.cu.o 2025-05-07T19:59:46.0160096Z [318/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -gencode arch=compute_90,code=sm_90 -gencode arch=compute_90a,code=sm_90a -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++20 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_lars_sgd_split_unweighted_kernel_cta.cu.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_lars_sgd_split_unweighted_kernel_cta.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_backward_lars_sgd_split_unweighted_kernel_cta.cu -o CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_lars_sgd_split_unweighted_kernel_cta.cu.o 2025-05-07T19:59:48.3937278Z [319/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -gencode arch=compute_90,code=sm_90 -gencode arch=compute_90a,code=sm_90a -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++20 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_lars_sgd_split_unweighted_nobag_cuda.cu.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_lars_sgd_split_unweighted_nobag_cuda.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_backward_lars_sgd_split_unweighted_nobag_cuda.cu -o CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_lars_sgd_split_unweighted_nobag_cuda.cu.o 2025-05-07T19:59:51.3969420Z [320/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_inference_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -gencode arch=compute_90,code=sm_90 -gencode arch=compute_90a,code=sm_90a -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++20 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_tbe_inference.dir/gen_embedding_forward_quantized_split_nbit_kernel_weighted_int8_codegen_cuda.cu.o -MF CMakeFiles/fbgemm_gpu_tbe_inference.dir/gen_embedding_forward_quantized_split_nbit_kernel_weighted_int8_codegen_cuda.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_weighted_int8_codegen_cuda.cu -o CMakeFiles/fbgemm_gpu_tbe_inference.dir/gen_embedding_forward_quantized_split_nbit_kernel_weighted_int8_codegen_cuda.cu.o 2025-05-07T19:59:51.3993973Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_weighted_int8_codegen_cuda.cu(270): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:59:51.3996100Z if (output_d >= 0 && output_d < D && packed_bag_store_idx < num_packed_bags) { 2025-05-07T19:59:51.3996852Z ^ 2025-05-07T19:59:51.3997136Z 2025-05-07T19:59:51.3997561Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:59:51.3998227Z 2025-05-07T19:59:51.3999755Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_weighted_int8_codegen_cuda.cu(275): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:59:51.4001689Z if (output_d >= 0 && output_d < D) { 2025-05-07T19:59:51.4002266Z ^ 2025-05-07T19:59:51.4002538Z 2025-05-07T19:59:51.4004352Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_weighted_int8_codegen_cuda.cu(292): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:59:51.4006277Z if (output_d >= 0 && output_d < D) { 2025-05-07T19:59:51.4006839Z ^ 2025-05-07T19:59:51.4007109Z 2025-05-07T19:59:51.4008659Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_weighted_int8_codegen_cuda.cu(304): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:59:51.4010743Z if (output_d >= 0 && output_d < D) { 2025-05-07T19:59:51.4011341Z ^ 2025-05-07T19:59:51.4011625Z 2025-05-07T19:59:51.4013204Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_weighted_int8_codegen_cuda.cu(270): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:59:51.4015431Z if (output_d >= 0 && output_d < D && packed_bag_store_idx < num_packed_bags) { 2025-05-07T19:59:51.4016192Z ^ 2025-05-07T19:59:51.4016513Z 2025-05-07T19:59:51.4016969Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:59:51.4017650Z 2025-05-07T19:59:51.4019406Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_weighted_int8_codegen_cuda.cu(275): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:59:51.4021401Z if (output_d >= 0 && output_d < D) { 2025-05-07T19:59:51.4022041Z ^ 2025-05-07T19:59:51.4022329Z 2025-05-07T19:59:51.4023937Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_weighted_int8_codegen_cuda.cu(292): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:59:51.4025937Z if (output_d >= 0 && output_d < D) { 2025-05-07T19:59:51.4026510Z ^ 2025-05-07T19:59:51.4026788Z 2025-05-07T19:59:51.4028357Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_weighted_int8_codegen_cuda.cu(304): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:59:51.4030589Z if (output_d >= 0 && output_d < D) { 2025-05-07T19:59:51.4031164Z ^ 2025-05-07T19:59:51.4031435Z 2025-05-07T19:59:51.4033030Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_weighted_int8_codegen_cuda.cu(270): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:59:51.4035217Z if (output_d >= 0 && output_d < D && packed_bag_store_idx < num_packed_bags) { 2025-05-07T19:59:51.4035983Z ^ 2025-05-07T19:59:51.4036301Z 2025-05-07T19:59:51.4036753Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:59:51.4037431Z 2025-05-07T19:59:51.4039020Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_weighted_int8_codegen_cuda.cu(275): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:59:51.4040989Z if (output_d >= 0 && output_d < D) { 2025-05-07T19:59:51.4041574Z ^ 2025-05-07T19:59:51.4041864Z 2025-05-07T19:59:51.4043469Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_weighted_int8_codegen_cuda.cu(292): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:59:51.4045470Z if (output_d >= 0 && output_d < D) { 2025-05-07T19:59:51.4046042Z ^ 2025-05-07T19:59:51.4046310Z 2025-05-07T19:59:51.4048095Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_weighted_int8_codegen_cuda.cu(304): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:59:51.4050100Z if (output_d >= 0 && output_d < D) { 2025-05-07T19:59:51.4050764Z ^ 2025-05-07T19:59:51.4051001Z 2025-05-07T19:59:51.4052502Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_weighted_int8_codegen_cuda.cu(270): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:59:51.4054605Z if (output_d >= 0 && output_d < D && packed_bag_store_idx < num_packed_bags) { 2025-05-07T19:59:51.4055367Z ^ 2025-05-07T19:59:51.4055670Z 2025-05-07T19:59:51.4056131Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:59:51.4056899Z 2025-05-07T19:59:51.4058406Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_weighted_int8_codegen_cuda.cu(275): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:59:51.4060415Z if (output_d >= 0 && output_d < D) { 2025-05-07T19:59:51.4061023Z ^ 2025-05-07T19:59:51.4061318Z 2025-05-07T19:59:51.4062855Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_weighted_int8_codegen_cuda.cu(292): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:59:51.4064826Z if (output_d >= 0 && output_d < D) { 2025-05-07T19:59:51.4065370Z ^ 2025-05-07T19:59:51.4065752Z 2025-05-07T19:59:51.4067310Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_weighted_int8_codegen_cuda.cu(304): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:59:51.4069281Z if (output_d >= 0 && output_d < D) { 2025-05-07T19:59:51.4069837Z ^ 2025-05-07T19:59:51.4070132Z 2025-05-07T19:59:51.4071696Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_weighted_int8_codegen_cuda.cu(270): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:59:51.4073941Z if (output_d >= 0 && output_d < D && packed_bag_store_idx < num_packed_bags) { 2025-05-07T19:59:51.4074666Z ^ 2025-05-07T19:59:51.4074982Z 2025-05-07T19:59:51.4075446Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:59:51.4076136Z 2025-05-07T19:59:51.4077638Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_weighted_int8_codegen_cuda.cu(275): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:59:51.4079595Z if (output_d >= 0 && output_d < D) { 2025-05-07T19:59:51.4080163Z ^ 2025-05-07T19:59:51.4080452Z 2025-05-07T19:59:51.4081967Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_weighted_int8_codegen_cuda.cu(292): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:59:51.4083904Z if (output_d >= 0 && output_d < D) { 2025-05-07T19:59:51.4084447Z ^ 2025-05-07T19:59:51.4084711Z 2025-05-07T19:59:51.4086248Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_weighted_int8_codegen_cuda.cu(304): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:59:51.4088217Z if (output_d >= 0 && output_d < D) { 2025-05-07T19:59:51.4088763Z ^ 2025-05-07T19:59:51.4089037Z 2025-05-07T19:59:55.0123275Z [321/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -gencode arch=compute_90,code=sm_90 -gencode arch=compute_90a,code=sm_90a -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++20 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_lars_sgd_split_unweighted_cuda.cu.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_lars_sgd_split_unweighted_cuda.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_backward_lars_sgd_split_unweighted_cuda.cu -o CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_lars_sgd_split_unweighted_cuda.cu.o 2025-05-07T20:00:04.6088965Z [322/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_inference_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -gencode arch=compute_90,code=sm_90 -gencode arch=compute_90a,code=sm_90a -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++20 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_tbe_inference.dir/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_int8_codegen_cuda.cu.o -MF CMakeFiles/fbgemm_gpu_tbe_inference.dir/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_int8_codegen_cuda.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_int8_codegen_cuda.cu -o CMakeFiles/fbgemm_gpu_tbe_inference.dir/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_int8_codegen_cuda.cu.o 2025-05-07T20:00:04.6111940Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_int8_codegen_cuda.cu(254): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T20:00:04.6114104Z if (output_d >= 0 && output_d < D && packed_bag_store_idx < num_packed_bags) { 2025-05-07T20:00:04.6114893Z ^ 2025-05-07T20:00:04.6115181Z 2025-05-07T20:00:04.6115616Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:00:04.6116268Z 2025-05-07T20:00:04.6118130Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_int8_codegen_cuda.cu(259): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T20:00:04.6120067Z if (output_d >= 0 && output_d < D) { 2025-05-07T20:00:04.6120616Z ^ 2025-05-07T20:00:04.6120893Z 2025-05-07T20:00:04.6122397Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_int8_codegen_cuda.cu(276): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T20:00:04.6124315Z if (output_d >= 0 && output_d < D) { 2025-05-07T20:00:04.6124880Z ^ 2025-05-07T20:00:04.6125156Z 2025-05-07T20:00:04.6126706Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_int8_codegen_cuda.cu(288): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T20:00:04.6128987Z if (output_d >= 0 && output_d < D) { 2025-05-07T20:00:04.6129522Z ^ 2025-05-07T20:00:04.6129812Z 2025-05-07T20:00:04.6131566Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_int8_codegen_cuda.cu(254): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T20:00:04.6133664Z if (output_d >= 0 && output_d < D && packed_bag_store_idx < num_packed_bags) { 2025-05-07T20:00:04.6134429Z ^ 2025-05-07T20:00:04.6134716Z 2025-05-07T20:00:04.6135158Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:00:04.6135810Z 2025-05-07T20:00:04.6137338Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_int8_codegen_cuda.cu(259): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T20:00:04.6139306Z if (output_d >= 0 && output_d < D) { 2025-05-07T20:00:04.6139836Z ^ 2025-05-07T20:00:04.6140149Z 2025-05-07T20:00:04.6142037Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_int8_codegen_cuda.cu(276): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T20:00:04.6143993Z if (output_d >= 0 && output_d < D) { 2025-05-07T20:00:04.6144567Z ^ 2025-05-07T20:00:04.6144829Z 2025-05-07T20:00:04.6146423Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_int8_codegen_cuda.cu(288): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T20:00:04.6148330Z if (output_d >= 0 && output_d < D) { 2025-05-07T20:00:04.6148901Z ^ 2025-05-07T20:00:04.6149177Z 2025-05-07T20:00:04.6150549Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_int8_codegen_cuda.cu(254): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T20:00:04.6152819Z if (output_d >= 0 && output_d < D && packed_bag_store_idx < num_packed_bags) { 2025-05-07T20:00:04.6153568Z ^ 2025-05-07T20:00:04.6153846Z 2025-05-07T20:00:04.6154279Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:00:04.6154981Z 2025-05-07T20:00:04.6156447Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_int8_codegen_cuda.cu(259): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T20:00:04.6158396Z if (output_d >= 0 && output_d < D) { 2025-05-07T20:00:04.6158938Z ^ 2025-05-07T20:00:04.6159221Z 2025-05-07T20:00:04.6160937Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_int8_codegen_cuda.cu(276): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T20:00:04.6162881Z if (output_d >= 0 && output_d < D) { 2025-05-07T20:00:04.6163457Z ^ 2025-05-07T20:00:04.6163726Z 2025-05-07T20:00:04.6165322Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_int8_codegen_cuda.cu(288): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T20:00:04.6167328Z if (output_d >= 0 && output_d < D) { 2025-05-07T20:00:04.6167907Z ^ 2025-05-07T20:00:04.6168186Z 2025-05-07T20:00:04.6169748Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_int8_codegen_cuda.cu(254): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T20:00:04.6172097Z if (output_d >= 0 && output_d < D && packed_bag_store_idx < num_packed_bags) { 2025-05-07T20:00:04.6172881Z ^ 2025-05-07T20:00:04.6173168Z 2025-05-07T20:00:04.6173644Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:00:04.6174359Z 2025-05-07T20:00:04.6175973Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_int8_codegen_cuda.cu(259): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T20:00:04.6178007Z if (output_d >= 0 && output_d < D) { 2025-05-07T20:00:04.6178570Z ^ 2025-05-07T20:00:04.6178849Z 2025-05-07T20:00:04.6180472Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_int8_codegen_cuda.cu(276): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T20:00:04.6182476Z if (output_d >= 0 && output_d < D) { 2025-05-07T20:00:04.6183059Z ^ 2025-05-07T20:00:04.6183345Z 2025-05-07T20:00:04.6185168Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_int8_codegen_cuda.cu(288): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T20:00:04.6187197Z if (output_d >= 0 && output_d < D) { 2025-05-07T20:00:04.6187702Z ^ 2025-05-07T20:00:04.6187936Z 2025-05-07T20:00:04.6189470Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_int8_codegen_cuda.cu(254): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T20:00:04.6191455Z if (output_d >= 0 && output_d < D && packed_bag_store_idx < num_packed_bags) { 2025-05-07T20:00:04.6192222Z ^ 2025-05-07T20:00:04.6192506Z 2025-05-07T20:00:04.6192953Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:00:04.6193733Z 2025-05-07T20:00:04.6195246Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_int8_codegen_cuda.cu(259): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T20:00:04.6197207Z if (output_d >= 0 && output_d < D) { 2025-05-07T20:00:04.6197756Z ^ 2025-05-07T20:00:04.6198033Z 2025-05-07T20:00:04.6199408Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_int8_codegen_cuda.cu(276): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T20:00:04.6201251Z if (output_d >= 0 && output_d < D) { 2025-05-07T20:00:04.6201793Z ^ 2025-05-07T20:00:04.6202063Z 2025-05-07T20:00:04.6203665Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_int8_codegen_cuda.cu(288): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T20:00:04.6205551Z if (output_d >= 0 && output_d < D) { 2025-05-07T20:00:04.6206077Z ^ 2025-05-07T20:00:04.6206333Z 2025-05-07T20:00:05.7706047Z [323/608] : && /github/home/miniconda/envs/build_binary/bin/x86_64-conda-linux-gnu-c++ -fPIC -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/lib -O3 -DNDEBUG -Wl,-O2 -Wl,--sort-common -Wl,--as-needed -Wl,-z,relro -Wl,-z,now -Wl,--disable-new-dtags -Wl,--gc-sections -Wl,--allow-shlib-undefined -Wl,-rpath,/github/home/miniconda/envs/build_binary/lib -Wl,-rpath-link,/github/home/miniconda/envs/build_binary/lib -L/github/home/miniconda/envs/build_binary/lib -L/github/home/miniconda/envs/build_binary/targets/x86_64-linux/lib -L/github/home/miniconda/envs/build_binary/targets/x86_64-linux/lib/stubs -s -shared -Wl,-soname,fbgemm_gpu_tbe_inference.so -o fbgemm_gpu_tbe_inference.so CMakeFiles/fbgemm_gpu_tbe_inference.dir/codegen/inference/embedding_forward_quantized_host_cpu.cpp.o CMakeFiles/fbgemm_gpu_tbe_inference.dir/gen_embedding_forward_quantized_unweighted_codegen_cpu.cpp.o CMakeFiles/fbgemm_gpu_tbe_inference.dir/gen_embedding_forward_quantized_weighted_codegen_cpu.cpp.o CMakeFiles/fbgemm_gpu_tbe_inference.dir/codegen/inference/embedding_forward_quantized_host.cpp.o CMakeFiles/fbgemm_gpu_tbe_inference.dir/codegen/inference/embedding_forward_quantized_split_lookup.cu.o CMakeFiles/fbgemm_gpu_tbe_inference.dir/gen_embedding_forward_quantized_split_nbit_kernel_weighted_fp32_codegen_cuda.cu.o CMakeFiles/fbgemm_gpu_tbe_inference.dir/gen_embedding_forward_quantized_split_nbit_kernel_weighted_fp16_codegen_cuda.cu.o CMakeFiles/fbgemm_gpu_tbe_inference.dir/gen_embedding_forward_quantized_split_nbit_kernel_weighted_fp8_codegen_cuda.cu.o CMakeFiles/fbgemm_gpu_tbe_inference.dir/gen_embedding_forward_quantized_split_nbit_kernel_weighted_int8_codegen_cuda.cu.o CMakeFiles/fbgemm_gpu_tbe_inference.dir/gen_embedding_forward_quantized_split_nbit_kernel_weighted_int4_codegen_cuda.cu.o CMakeFiles/fbgemm_gpu_tbe_inference.dir/gen_embedding_forward_quantized_split_nbit_kernel_weighted_int2_codegen_cuda.cu.o CMakeFiles/fbgemm_gpu_tbe_inference.dir/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_nobag_fp32_codegen_cuda.cu.o CMakeFiles/fbgemm_gpu_tbe_inference.dir/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_nobag_fp16_codegen_cuda.cu.o CMakeFiles/fbgemm_gpu_tbe_inference.dir/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_nobag_fp8_codegen_cuda.cu.o CMakeFiles/fbgemm_gpu_tbe_inference.dir/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_nobag_int8_codegen_cuda.cu.o CMakeFiles/fbgemm_gpu_tbe_inference.dir/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_nobag_int4_codegen_cuda.cu.o CMakeFiles/fbgemm_gpu_tbe_inference.dir/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_nobag_int2_codegen_cuda.cu.o CMakeFiles/fbgemm_gpu_tbe_inference.dir/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_fp32_codegen_cuda.cu.o CMakeFiles/fbgemm_gpu_tbe_inference.dir/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_fp16_codegen_cuda.cu.o CMakeFiles/fbgemm_gpu_tbe_inference.dir/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_fp8_codegen_cuda.cu.o CMakeFiles/fbgemm_gpu_tbe_inference.dir/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_int8_codegen_cuda.cu.o CMakeFiles/fbgemm_gpu_tbe_inference.dir/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_int4_codegen_cuda.cu.o CMakeFiles/fbgemm_gpu_tbe_inference.dir/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_int2_codegen_cuda.cu.o CMakeFiles/fbgemm_gpu_tbe_inference.dir/gen_embedding_forward_quantized_split_nbit_host_weighted_codegen_cuda.cu.o CMakeFiles/fbgemm_gpu_tbe_inference.dir/gen_embedding_forward_quantized_split_nbit_host_unweighted_nobag_codegen_cuda.cu.o CMakeFiles/fbgemm_gpu_tbe_inference.dir/gen_embedding_forward_quantized_split_nbit_host_unweighted_codegen_cuda.cu.o -L/lib/intel64 -L/lib/intel64_win -L/lib/win-x64 -Wl,-rpath,"\$ORIGIN" /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/lib/libc10.so /github/home/miniconda/envs/build_binary/targets/x86_64-linux/lib/libnvrtc.so /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/lib/libc10_cuda.so /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/lib/libnccl.so.2 /github/home/miniconda/envs/build_binary/targets/x86_64-linux/lib/stubs/libcuda.so fbgemm.so fbgemm_gpu_tbe_cache.so fbgemm_gpu_config.so /github/home/miniconda/envs/build_binary/lib/stubs/libnvidia-ml.so -Wl,--no-as-needed,"/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/lib/libtorch.so" -Wl,--as-needed asmjit.so /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/lib/libtorch.so -Wl,--no-as-needed,"/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/lib/libtorch_cpu.so" -Wl,--as-needed -Wl,--no-as-needed,"/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/lib/libtorch_cuda.so" -Wl,--as-needed /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/lib/libc10_cuda.so /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/lib/libc10.so /github/home/miniconda/envs/build_binary/targets/x86_64-linux/lib/libcudart.so -lcudadevrt -lcudart_static -lrt -lpthread -ldl && : 2025-05-07T20:00:09.9903004Z [324/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_forward_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -gencode arch=compute_90,code=sm_90 -gencode arch=compute_90a,code=sm_90a -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++20 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_forward.dir/gen_embedding_forward_ssd_unweighted_kernel.cu.o -MF CMakeFiles/fbgemm_gpu_tbe_training_forward.dir/gen_embedding_forward_ssd_unweighted_kernel.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_forward_ssd_unweighted_kernel.cu -o CMakeFiles/fbgemm_gpu_tbe_training_forward.dir/gen_embedding_forward_ssd_unweighted_kernel.cu.o 2025-05-07T20:00:15.0136051Z [325/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_forward_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -gencode arch=compute_90,code=sm_90 -gencode arch=compute_90a,code=sm_90a -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++20 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_forward.dir/gen_embedding_forward_ssd_weighted_kernel.cu.o -MF CMakeFiles/fbgemm_gpu_tbe_training_forward.dir/gen_embedding_forward_ssd_weighted_kernel.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_forward_ssd_weighted_kernel.cu -o CMakeFiles/fbgemm_gpu_tbe_training_forward.dir/gen_embedding_forward_ssd_weighted_kernel.cu.o 2025-05-07T20:00:49.3072262Z [326/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -gencode arch=compute_90,code=sm_90 -gencode arch=compute_90a,code=sm_90a -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++20 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_none_split_weighted_kernel_warp.cu.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_none_split_weighted_kernel_warp.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_backward_none_split_weighted_kernel_warp.cu -o CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_none_split_weighted_kernel_warp.cu.o 2025-05-07T20:00:57.9462967Z [327/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -gencode arch=compute_90,code=sm_90 -gencode arch=compute_90a,code=sm_90a -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++20 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_none_split_unweighted_nobag_kernel_warp.cu.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_none_split_unweighted_nobag_kernel_warp.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_backward_none_split_unweighted_nobag_kernel_warp.cu -o CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_none_split_unweighted_nobag_kernel_warp.cu.o 2025-05-07T20:00:59.0710892Z [328/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -gencode arch=compute_90,code=sm_90 -gencode arch=compute_90a,code=sm_90a -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++20 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_none_split_unweighted_kernel_warp.cu.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_none_split_unweighted_kernel_warp.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_backward_none_split_unweighted_kernel_warp.cu -o CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_none_split_unweighted_kernel_warp.cu.o 2025-05-07T20:01:01.5095034Z [329/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -gencode arch=compute_90,code=sm_90 -gencode arch=compute_90a,code=sm_90a -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++20 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_partial_rowwise_lamb_split_unweighted_nobag_cuda.cu.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_partial_rowwise_lamb_split_unweighted_nobag_cuda.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_backward_partial_rowwise_lamb_split_unweighted_nobag_cuda.cu -o CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_partial_rowwise_lamb_split_unweighted_nobag_cuda.cu.o 2025-05-07T20:01:02.1389811Z [330/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -gencode arch=compute_90,code=sm_90 -gencode arch=compute_90a,code=sm_90a -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++20 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_partial_rowwise_lamb_split_weighted_cuda.cu.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_partial_rowwise_lamb_split_weighted_cuda.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_backward_partial_rowwise_lamb_split_weighted_cuda.cu -o CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_partial_rowwise_lamb_split_weighted_cuda.cu.o 2025-05-07T20:01:07.5911986Z [331/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -gencode arch=compute_90,code=sm_90 -gencode arch=compute_90a,code=sm_90a -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++20 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_none_split_unweighted_nobag_cuda.cu.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_none_split_unweighted_nobag_cuda.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_backward_none_split_unweighted_nobag_cuda.cu -o CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_none_split_unweighted_nobag_cuda.cu.o 2025-05-07T20:01:07.9008249Z [332/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -gencode arch=compute_90,code=sm_90 -gencode arch=compute_90a,code=sm_90a -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++20 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_none_split_weighted_cuda.cu.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_none_split_weighted_cuda.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_backward_none_split_weighted_cuda.cu -o CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_none_split_weighted_cuda.cu.o 2025-05-07T20:01:08.4584666Z [333/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -gencode arch=compute_90,code=sm_90 -gencode arch=compute_90a,code=sm_90a -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++20 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_partial_rowwise_lamb_split_unweighted_cuda.cu.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_partial_rowwise_lamb_split_unweighted_cuda.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_backward_partial_rowwise_lamb_split_unweighted_cuda.cu -o CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_partial_rowwise_lamb_split_unweighted_cuda.cu.o 2025-05-07T20:01:09.0170795Z [334/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_forward_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -gencode arch=compute_90,code=sm_90 -gencode arch=compute_90a,code=sm_90a -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++20 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_forward.dir/gen_embedding_forward_split_weighted_vbe_kernel.cu.o -MF CMakeFiles/fbgemm_gpu_tbe_training_forward.dir/gen_embedding_forward_split_weighted_vbe_kernel.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_forward_split_weighted_vbe_kernel.cu -o CMakeFiles/fbgemm_gpu_tbe_training_forward.dir/gen_embedding_forward_split_weighted_vbe_kernel.cu.o 2025-05-07T20:01:17.3757574Z [335/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -gencode arch=compute_90,code=sm_90 -gencode arch=compute_90a,code=sm_90a -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++20 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_none_split_weighted_kernel_cta.cu.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_none_split_weighted_kernel_cta.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_backward_none_split_weighted_kernel_cta.cu -o CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_none_split_weighted_kernel_cta.cu.o 2025-05-07T20:01:17.8956280Z [336/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -gencode arch=compute_90,code=sm_90 -gencode arch=compute_90a,code=sm_90a -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++20 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_none_split_unweighted_cuda.cu.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_none_split_unweighted_cuda.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_backward_none_split_unweighted_cuda.cu -o CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_none_split_unweighted_cuda.cu.o 2025-05-07T20:01:18.3287799Z [337/608] /github/home/miniconda/envs/build_binary/bin/x86_64-conda-linux-gnu-c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_vbe_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/lib -O3 -DNDEBUG -std=c++20 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -mavx2 -mf16c -mfma -fopenmp -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward_vbe.dir/gen_embedding_backward_rowwise_adagrad_split_weighted_vbe_meta.cpp.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward_vbe.dir/gen_embedding_backward_rowwise_adagrad_split_weighted_vbe_meta.cpp.o.d -o CMakeFiles/fbgemm_gpu_tbe_training_backward_vbe.dir/gen_embedding_backward_rowwise_adagrad_split_weighted_vbe_meta.cpp.o -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_backward_rowwise_adagrad_split_weighted_vbe_meta.cpp 2025-05-07T20:01:18.8425209Z [338/608] /github/home/miniconda/envs/build_binary/bin/x86_64-conda-linux-gnu-c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_vbe_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/lib -O3 -DNDEBUG -std=c++20 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -mavx2 -mf16c -mfma -fopenmp -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward_vbe.dir/gen_embedding_backward_rowwise_adagrad_ssd_weighted_vbe_meta.cpp.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward_vbe.dir/gen_embedding_backward_rowwise_adagrad_ssd_weighted_vbe_meta.cpp.o.d -o CMakeFiles/fbgemm_gpu_tbe_training_backward_vbe.dir/gen_embedding_backward_rowwise_adagrad_ssd_weighted_vbe_meta.cpp.o -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_backward_rowwise_adagrad_ssd_weighted_vbe_meta.cpp 2025-05-07T20:01:19.4835512Z [339/608] /github/home/miniconda/envs/build_binary/bin/x86_64-conda-linux-gnu-c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_vbe_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/lib -O3 -DNDEBUG -std=c++20 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -mavx2 -mf16c -mfma -fopenmp -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward_vbe.dir/gen_embedding_backward_rowwise_adagrad_split_unweighted_vbe_meta.cpp.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward_vbe.dir/gen_embedding_backward_rowwise_adagrad_split_unweighted_vbe_meta.cpp.o.d -o CMakeFiles/fbgemm_gpu_tbe_training_backward_vbe.dir/gen_embedding_backward_rowwise_adagrad_split_unweighted_vbe_meta.cpp.o -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_backward_rowwise_adagrad_split_unweighted_vbe_meta.cpp 2025-05-07T20:01:26.4829169Z [340/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -gencode arch=compute_90,code=sm_90 -gencode arch=compute_90a,code=sm_90a -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++20 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_none_split_unweighted_kernel_cta.cu.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_none_split_unweighted_kernel_cta.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_backward_none_split_unweighted_kernel_cta.cu -o CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_none_split_unweighted_kernel_cta.cu.o 2025-05-07T20:01:27.1203442Z [341/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -gencode arch=compute_90,code=sm_90 -gencode arch=compute_90a,code=sm_90a -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++20 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_none_split_unweighted_nobag_kernel_cta.cu.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_none_split_unweighted_nobag_kernel_cta.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_backward_none_split_unweighted_nobag_kernel_cta.cu -o CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_none_split_unweighted_nobag_kernel_cta.cu.o 2025-05-07T20:01:27.6489853Z [342/608] /github/home/miniconda/envs/build_binary/bin/x86_64-conda-linux-gnu-c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_vbe_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/lib -O3 -DNDEBUG -std=c++20 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -mavx2 -mf16c -mfma -fopenmp -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward_vbe.dir/gen_embedding_backward_rowwise_adagrad_ssd_unweighted_vbe_meta.cpp.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward_vbe.dir/gen_embedding_backward_rowwise_adagrad_ssd_unweighted_vbe_meta.cpp.o.d -o CMakeFiles/fbgemm_gpu_tbe_training_backward_vbe.dir/gen_embedding_backward_rowwise_adagrad_ssd_unweighted_vbe_meta.cpp.o -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_backward_rowwise_adagrad_ssd_unweighted_vbe_meta.cpp 2025-05-07T20:01:28.0112242Z [343/608] /github/home/miniconda/envs/build_binary/bin/x86_64-conda-linux-gnu-c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_vbe_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/lib -O3 -DNDEBUG -std=c++20 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -mavx2 -mf16c -mfma -fopenmp -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward_vbe.dir/gen_embedding_backward_rowwise_adagrad_with_counter_split_weighted_vbe_meta.cpp.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward_vbe.dir/gen_embedding_backward_rowwise_adagrad_with_counter_split_weighted_vbe_meta.cpp.o.d -o CMakeFiles/fbgemm_gpu_tbe_training_backward_vbe.dir/gen_embedding_backward_rowwise_adagrad_with_counter_split_weighted_vbe_meta.cpp.o -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_backward_rowwise_adagrad_with_counter_split_weighted_vbe_meta.cpp 2025-05-07T20:01:28.3614542Z [344/608] /github/home/miniconda/envs/build_binary/bin/x86_64-conda-linux-gnu-c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_vbe_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/lib -O3 -DNDEBUG -std=c++20 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -mavx2 -mf16c -mfma -fopenmp -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward_vbe.dir/gen_embedding_backward_rowwise_adagrad_with_counter_split_unweighted_vbe_meta.cpp.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward_vbe.dir/gen_embedding_backward_rowwise_adagrad_with_counter_split_unweighted_vbe_meta.cpp.o.d -o CMakeFiles/fbgemm_gpu_tbe_training_backward_vbe.dir/gen_embedding_backward_rowwise_adagrad_with_counter_split_unweighted_vbe_meta.cpp.o -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_backward_rowwise_adagrad_with_counter_split_unweighted_vbe_meta.cpp 2025-05-07T20:01:28.5241408Z [345/608] /github/home/miniconda/envs/build_binary/bin/x86_64-conda-linux-gnu-c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_vbe_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/lib -O3 -DNDEBUG -std=c++20 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -mavx2 -mf16c -mfma -fopenmp -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward_vbe.dir/gen_embedding_backward_sgd_split_weighted_vbe_meta.cpp.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward_vbe.dir/gen_embedding_backward_sgd_split_weighted_vbe_meta.cpp.o.d -o CMakeFiles/fbgemm_gpu_tbe_training_backward_vbe.dir/gen_embedding_backward_sgd_split_weighted_vbe_meta.cpp.o -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_backward_sgd_split_weighted_vbe_meta.cpp 2025-05-07T20:01:29.2323507Z [346/608] /github/home/miniconda/envs/build_binary/bin/x86_64-conda-linux-gnu-c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_vbe_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/lib -O3 -DNDEBUG -std=c++20 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -mavx2 -mf16c -mfma -fopenmp -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward_vbe.dir/gen_embedding_backward_sgd_split_unweighted_vbe_meta.cpp.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward_vbe.dir/gen_embedding_backward_sgd_split_unweighted_vbe_meta.cpp.o.d -o CMakeFiles/fbgemm_gpu_tbe_training_backward_vbe.dir/gen_embedding_backward_sgd_split_unweighted_vbe_meta.cpp.o -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_backward_sgd_split_unweighted_vbe_meta.cpp 2025-05-07T20:01:35.5205777Z [347/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -gencode arch=compute_90,code=sm_90 -gencode arch=compute_90a,code=sm_90a -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++20 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_partial_rowwise_lamb_split_weighted_kernel_warp.cu.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_partial_rowwise_lamb_split_weighted_kernel_warp.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_backward_partial_rowwise_lamb_split_weighted_kernel_warp.cu -o CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_partial_rowwise_lamb_split_weighted_kernel_warp.cu.o 2025-05-07T20:01:36.0938688Z [348/608] /github/home/miniconda/envs/build_binary/bin/x86_64-conda-linux-gnu-c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_vbe_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/lib -O3 -DNDEBUG -std=c++20 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -mavx2 -mf16c -mfma -fopenmp -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward_vbe.dir/gen_embedding_backward_dense_split_weighted_vbe_meta.cpp.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward_vbe.dir/gen_embedding_backward_dense_split_weighted_vbe_meta.cpp.o.d -o CMakeFiles/fbgemm_gpu_tbe_training_backward_vbe.dir/gen_embedding_backward_dense_split_weighted_vbe_meta.cpp.o -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_backward_dense_split_weighted_vbe_meta.cpp 2025-05-07T20:01:36.9229142Z [349/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_forward_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -gencode arch=compute_90,code=sm_90 -gencode arch=compute_90a,code=sm_90a -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++20 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_forward.dir/gen_embedding_forward_split_unweighted_vbe_kernel.cu.o -MF CMakeFiles/fbgemm_gpu_tbe_training_forward.dir/gen_embedding_forward_split_unweighted_vbe_kernel.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_forward_split_unweighted_vbe_kernel.cu -o CMakeFiles/fbgemm_gpu_tbe_training_forward.dir/gen_embedding_forward_split_unweighted_vbe_kernel.cu.o 2025-05-07T20:01:37.4889945Z [350/608] /github/home/miniconda/envs/build_binary/bin/x86_64-conda-linux-gnu-c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_vbe_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/lib -O3 -DNDEBUG -std=c++20 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -mavx2 -mf16c -mfma -fopenmp -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward_vbe.dir/gen_embedding_backward_dense_split_unweighted_vbe_meta.cpp.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward_vbe.dir/gen_embedding_backward_dense_split_unweighted_vbe_meta.cpp.o.d -o CMakeFiles/fbgemm_gpu_tbe_training_backward_vbe.dir/gen_embedding_backward_dense_split_unweighted_vbe_meta.cpp.o -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_backward_dense_split_unweighted_vbe_meta.cpp 2025-05-07T20:01:37.6369280Z [351/608] /github/home/miniconda/envs/build_binary/bin/x86_64-conda-linux-gnu-c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_vbe_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/lib -O3 -DNDEBUG -std=c++20 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -mavx2 -mf16c -mfma -fopenmp -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward_vbe.dir/gen_embedding_backward_adam_split_weighted_vbe_meta.cpp.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward_vbe.dir/gen_embedding_backward_adam_split_weighted_vbe_meta.cpp.o.d -o CMakeFiles/fbgemm_gpu_tbe_training_backward_vbe.dir/gen_embedding_backward_adam_split_weighted_vbe_meta.cpp.o -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_backward_adam_split_weighted_vbe_meta.cpp 2025-05-07T20:01:38.0009802Z [352/608] /github/home/miniconda/envs/build_binary/bin/x86_64-conda-linux-gnu-c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_vbe_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/lib -O3 -DNDEBUG -std=c++20 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -mavx2 -mf16c -mfma -fopenmp -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward_vbe.dir/gen_embedding_backward_adam_split_unweighted_vbe_meta.cpp.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward_vbe.dir/gen_embedding_backward_adam_split_unweighted_vbe_meta.cpp.o.d -o CMakeFiles/fbgemm_gpu_tbe_training_backward_vbe.dir/gen_embedding_backward_adam_split_unweighted_vbe_meta.cpp.o -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_backward_adam_split_unweighted_vbe_meta.cpp 2025-05-07T20:01:41.6893424Z [353/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -gencode arch=compute_90,code=sm_90 -gencode arch=compute_90a,code=sm_90a -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++20 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_rowwise_adagrad_with_counter_split_unweighted_nobag_cuda.cu.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_rowwise_adagrad_with_counter_split_unweighted_nobag_cuda.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_backward_rowwise_adagrad_with_counter_split_unweighted_nobag_cuda.cu -o CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_rowwise_adagrad_with_counter_split_unweighted_nobag_cuda.cu.o 2025-05-07T20:01:43.3877658Z [354/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -gencode arch=compute_90,code=sm_90 -gencode arch=compute_90a,code=sm_90a -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++20 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_partial_rowwise_lamb_split_unweighted_kernel_warp.cu.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_partial_rowwise_lamb_split_unweighted_kernel_warp.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_backward_partial_rowwise_lamb_split_unweighted_kernel_warp.cu -o CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_partial_rowwise_lamb_split_unweighted_kernel_warp.cu.o 2025-05-07T20:01:44.6862132Z [355/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -gencode arch=compute_90,code=sm_90 -gencode arch=compute_90a,code=sm_90a -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++20 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_partial_rowwise_lamb_split_unweighted_nobag_kernel_warp.cu.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_partial_rowwise_lamb_split_unweighted_nobag_kernel_warp.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_backward_partial_rowwise_lamb_split_unweighted_nobag_kernel_warp.cu -o CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_partial_rowwise_lamb_split_unweighted_nobag_kernel_warp.cu.o 2025-05-07T20:01:48.2073190Z [356/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -gencode arch=compute_90,code=sm_90 -gencode arch=compute_90a,code=sm_90a -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++20 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_rowwise_adagrad_with_counter_split_weighted_cuda.cu.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_rowwise_adagrad_with_counter_split_weighted_cuda.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_backward_rowwise_adagrad_with_counter_split_weighted_cuda.cu -o CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_rowwise_adagrad_with_counter_split_weighted_cuda.cu.o 2025-05-07T20:01:50.2833707Z [357/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_gwd_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -gencode arch=compute_90,code=sm_90 -gencode arch=compute_90a,code=sm_90a -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++20 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward_gwd.dir/gen_embedding_backward_rowwise_adagrad_split_weighted_gwd_cuda.cu.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward_gwd.dir/gen_embedding_backward_rowwise_adagrad_split_weighted_gwd_cuda.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_backward_rowwise_adagrad_split_weighted_gwd_cuda.cu -o CMakeFiles/fbgemm_gpu_tbe_training_backward_gwd.dir/gen_embedding_backward_rowwise_adagrad_split_weighted_gwd_cuda.cu.o 2025-05-07T20:01:53.4240963Z [358/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -gencode arch=compute_90,code=sm_90 -gencode arch=compute_90a,code=sm_90a -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++20 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_rowwise_adagrad_with_counter_split_unweighted_cuda.cu.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_rowwise_adagrad_with_counter_split_unweighted_cuda.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_backward_rowwise_adagrad_with_counter_split_unweighted_cuda.cu -o CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_rowwise_adagrad_with_counter_split_unweighted_cuda.cu.o 2025-05-07T20:01:55.2863704Z [359/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_forward_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -gencode arch=compute_90,code=sm_90 -gencode arch=compute_90a,code=sm_90a -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++20 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_forward.dir/gen_embedding_forward_ssd_weighted_vbe_kernel.cu.o -MF CMakeFiles/fbgemm_gpu_tbe_training_forward.dir/gen_embedding_forward_ssd_weighted_vbe_kernel.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_forward_ssd_weighted_vbe_kernel.cu -o CMakeFiles/fbgemm_gpu_tbe_training_forward.dir/gen_embedding_forward_ssd_weighted_vbe_kernel.cu.o 2025-05-07T20:02:04.0442229Z [360/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -gencode arch=compute_90,code=sm_90 -gencode arch=compute_90a,code=sm_90a -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++20 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_partial_rowwise_adam_split_weighted_cuda.cu.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_partial_rowwise_adam_split_weighted_cuda.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_backward_partial_rowwise_adam_split_weighted_cuda.cu -o CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_partial_rowwise_adam_split_weighted_cuda.cu.o 2025-05-07T20:02:07.2331692Z [361/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_gwd_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -gencode arch=compute_90,code=sm_90 -gencode arch=compute_90a,code=sm_90a -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++20 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward_gwd.dir/gen_embedding_backward_rowwise_adagrad_split_unweighted_gwd_cuda.cu.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward_gwd.dir/gen_embedding_backward_rowwise_adagrad_split_unweighted_gwd_cuda.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_backward_rowwise_adagrad_split_unweighted_gwd_cuda.cu -o CMakeFiles/fbgemm_gpu_tbe_training_backward_gwd.dir/gen_embedding_backward_rowwise_adagrad_split_unweighted_gwd_cuda.cu.o 2025-05-07T20:02:12.1043524Z [362/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -gencode arch=compute_90,code=sm_90 -gencode arch=compute_90a,code=sm_90a -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++20 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_partial_rowwise_adam_split_unweighted_nobag_cuda.cu.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_partial_rowwise_adam_split_unweighted_nobag_cuda.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_backward_partial_rowwise_adam_split_unweighted_nobag_cuda.cu -o CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_partial_rowwise_adam_split_unweighted_nobag_cuda.cu.o 2025-05-07T20:02:14.3157546Z [363/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_forward_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -gencode arch=compute_90,code=sm_90 -gencode arch=compute_90a,code=sm_90a -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++20 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_forward.dir/gen_embedding_forward_ssd_unweighted_vbe_kernel.cu.o -MF CMakeFiles/fbgemm_gpu_tbe_training_forward.dir/gen_embedding_forward_ssd_unweighted_vbe_kernel.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_forward_ssd_unweighted_vbe_kernel.cu -o CMakeFiles/fbgemm_gpu_tbe_training_forward.dir/gen_embedding_forward_ssd_unweighted_vbe_kernel.cu.o 2025-05-07T20:02:15.3829704Z [364/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -gencode arch=compute_90,code=sm_90 -gencode arch=compute_90a,code=sm_90a -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++20 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_partial_rowwise_lamb_split_weighted_kernel_cta.cu.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_partial_rowwise_lamb_split_weighted_kernel_cta.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_backward_partial_rowwise_lamb_split_weighted_kernel_cta.cu -o CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_partial_rowwise_lamb_split_weighted_kernel_cta.cu.o 2025-05-07T20:02:18.3266780Z [365/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -gencode arch=compute_90,code=sm_90 -gencode arch=compute_90a,code=sm_90a -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++20 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_partial_rowwise_adam_split_unweighted_cuda.cu.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_partial_rowwise_adam_split_unweighted_cuda.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_backward_partial_rowwise_adam_split_unweighted_cuda.cu -o CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_partial_rowwise_adam_split_unweighted_cuda.cu.o 2025-05-07T20:02:19.4568170Z [366/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -gencode arch=compute_90,code=sm_90 -gencode arch=compute_90a,code=sm_90a -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++20 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_partial_rowwise_lamb_split_unweighted_kernel_cta.cu.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_partial_rowwise_lamb_split_unweighted_kernel_cta.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_backward_partial_rowwise_lamb_split_unweighted_kernel_cta.cu -o CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_partial_rowwise_lamb_split_unweighted_kernel_cta.cu.o 2025-05-07T20:02:20.8787702Z [367/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -gencode arch=compute_90,code=sm_90 -gencode arch=compute_90a,code=sm_90a -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++20 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_partial_rowwise_lamb_split_unweighted_nobag_kernel_cta.cu.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_partial_rowwise_lamb_split_unweighted_nobag_kernel_cta.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_backward_partial_rowwise_lamb_split_unweighted_nobag_kernel_cta.cu -o CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_partial_rowwise_lamb_split_unweighted_nobag_kernel_cta.cu.o 2025-05-07T20:02:22.5379082Z [368/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_gwd_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -gencode arch=compute_90,code=sm_90 -gencode arch=compute_90a,code=sm_90a -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++20 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward_gwd.dir/gen_embedding_backward_rowwise_adagrad_split_weighted_gwd_kernel_warp.cu.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward_gwd.dir/gen_embedding_backward_rowwise_adagrad_split_weighted_gwd_kernel_warp.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_backward_rowwise_adagrad_split_weighted_gwd_kernel_warp.cu -o CMakeFiles/fbgemm_gpu_tbe_training_backward_gwd.dir/gen_embedding_backward_rowwise_adagrad_split_weighted_gwd_kernel_warp.cu.o 2025-05-07T20:02:31.0962208Z [369/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -gencode arch=compute_90,code=sm_90 -gencode arch=compute_90a,code=sm_90a -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++20 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_rowwise_adagrad_with_counter_split_weighted_kernel_warp.cu.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_rowwise_adagrad_with_counter_split_weighted_kernel_warp.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_backward_rowwise_adagrad_with_counter_split_weighted_kernel_warp.cu -o CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_rowwise_adagrad_with_counter_split_weighted_kernel_warp.cu.o 2025-05-07T20:02:37.9360028Z [370/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -gencode arch=compute_90,code=sm_90 -gencode arch=compute_90a,code=sm_90a -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++20 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_rowwise_adagrad_with_counter_split_unweighted_kernel_warp.cu.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_rowwise_adagrad_with_counter_split_unweighted_kernel_warp.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_backward_rowwise_adagrad_with_counter_split_unweighted_kernel_warp.cu -o CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_rowwise_adagrad_with_counter_split_unweighted_kernel_warp.cu.o 2025-05-07T20:02:40.6643840Z [371/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_gwd_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -gencode arch=compute_90,code=sm_90 -gencode arch=compute_90a,code=sm_90a -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++20 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward_gwd.dir/gen_embedding_backward_rowwise_adagrad_split_unweighted_gwd_kernel_warp.cu.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward_gwd.dir/gen_embedding_backward_rowwise_adagrad_split_unweighted_gwd_kernel_warp.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_backward_rowwise_adagrad_split_unweighted_gwd_kernel_warp.cu -o CMakeFiles/fbgemm_gpu_tbe_training_backward_gwd.dir/gen_embedding_backward_rowwise_adagrad_split_unweighted_gwd_kernel_warp.cu.o 2025-05-07T20:02:41.3469135Z [372/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -gencode arch=compute_90,code=sm_90 -gencode arch=compute_90a,code=sm_90a -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++20 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_rowwise_adagrad_with_counter_split_unweighted_nobag_kernel_warp.cu.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_rowwise_adagrad_with_counter_split_unweighted_nobag_kernel_warp.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_backward_rowwise_adagrad_with_counter_split_unweighted_nobag_kernel_warp.cu -o CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_rowwise_adagrad_with_counter_split_unweighted_nobag_kernel_warp.cu.o 2025-05-07T20:02:52.8415953Z [373/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_gwd_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -gencode arch=compute_90,code=sm_90 -gencode arch=compute_90a,code=sm_90a -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++20 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward_gwd.dir/gen_embedding_backward_rowwise_adagrad_split_weighted_vbe_gwd_cuda.cu.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward_gwd.dir/gen_embedding_backward_rowwise_adagrad_split_weighted_vbe_gwd_cuda.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_backward_rowwise_adagrad_split_weighted_vbe_gwd_cuda.cu -o CMakeFiles/fbgemm_gpu_tbe_training_backward_gwd.dir/gen_embedding_backward_rowwise_adagrad_split_weighted_vbe_gwd_cuda.cu.o 2025-05-07T20:02:56.5489818Z [374/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_gwd_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -gencode arch=compute_90,code=sm_90 -gencode arch=compute_90a,code=sm_90a -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++20 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward_gwd.dir/gen_embedding_backward_rowwise_adagrad_split_weighted_gwd_kernel_cta.cu.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward_gwd.dir/gen_embedding_backward_rowwise_adagrad_split_weighted_gwd_kernel_cta.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_backward_rowwise_adagrad_split_weighted_gwd_kernel_cta.cu -o CMakeFiles/fbgemm_gpu_tbe_training_backward_gwd.dir/gen_embedding_backward_rowwise_adagrad_split_weighted_gwd_kernel_cta.cu.o 2025-05-07T20:03:02.7144761Z [375/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_gwd_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -gencode arch=compute_90,code=sm_90 -gencode arch=compute_90a,code=sm_90a -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++20 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward_gwd.dir/gen_embedding_backward_rowwise_adagrad_split_unweighted_vbe_gwd_cuda.cu.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward_gwd.dir/gen_embedding_backward_rowwise_adagrad_split_unweighted_vbe_gwd_cuda.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_backward_rowwise_adagrad_split_unweighted_vbe_gwd_cuda.cu -o CMakeFiles/fbgemm_gpu_tbe_training_backward_gwd.dir/gen_embedding_backward_rowwise_adagrad_split_unweighted_vbe_gwd_cuda.cu.o 2025-05-07T20:03:10.8260969Z [376/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -gencode arch=compute_90,code=sm_90 -gencode arch=compute_90a,code=sm_90a -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++20 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_rowwise_adagrad_with_counter_split_weighted_kernel_cta.cu.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_rowwise_adagrad_with_counter_split_weighted_kernel_cta.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_backward_rowwise_adagrad_with_counter_split_weighted_kernel_cta.cu -o CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_rowwise_adagrad_with_counter_split_weighted_kernel_cta.cu.o 2025-05-07T20:03:11.0488018Z [377/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_gwd_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -gencode arch=compute_90,code=sm_90 -gencode arch=compute_90a,code=sm_90a -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++20 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward_gwd.dir/gen_embedding_backward_rowwise_adagrad_split_unweighted_gwd_kernel_cta.cu.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward_gwd.dir/gen_embedding_backward_rowwise_adagrad_split_unweighted_gwd_kernel_cta.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_backward_rowwise_adagrad_split_unweighted_gwd_kernel_cta.cu -o CMakeFiles/fbgemm_gpu_tbe_training_backward_gwd.dir/gen_embedding_backward_rowwise_adagrad_split_unweighted_gwd_kernel_cta.cu.o 2025-05-07T20:03:14.8561018Z [378/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -gencode arch=compute_90,code=sm_90 -gencode arch=compute_90a,code=sm_90a -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++20 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_rowwise_adagrad_with_counter_split_unweighted_nobag_kernel_cta.cu.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_rowwise_adagrad_with_counter_split_unweighted_nobag_kernel_cta.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_backward_rowwise_adagrad_with_counter_split_unweighted_nobag_kernel_cta.cu -o CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_rowwise_adagrad_with_counter_split_unweighted_nobag_kernel_cta.cu.o 2025-05-07T20:03:16.0010002Z [379/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -gencode arch=compute_90,code=sm_90 -gencode arch=compute_90a,code=sm_90a -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++20 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_rowwise_adagrad_with_counter_split_unweighted_kernel_cta.cu.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_rowwise_adagrad_with_counter_split_unweighted_kernel_cta.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_backward_rowwise_adagrad_with_counter_split_unweighted_kernel_cta.cu -o CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_rowwise_adagrad_with_counter_split_unweighted_kernel_cta.cu.o 2025-05-07T20:03:27.7302022Z [380/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_gwd_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -gencode arch=compute_90,code=sm_90 -gencode arch=compute_90a,code=sm_90a -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++20 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward_gwd.dir/gen_embedding_backward_rowwise_adagrad_split_weighted_vbe_gwd_kernel_warp.cu.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward_gwd.dir/gen_embedding_backward_rowwise_adagrad_split_weighted_vbe_gwd_kernel_warp.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_backward_rowwise_adagrad_split_weighted_vbe_gwd_kernel_warp.cu -o CMakeFiles/fbgemm_gpu_tbe_training_backward_gwd.dir/gen_embedding_backward_rowwise_adagrad_split_weighted_vbe_gwd_kernel_warp.cu.o 2025-05-07T20:03:30.8814199Z [381/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_vbe_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -gencode arch=compute_90,code=sm_90 -gencode arch=compute_90a,code=sm_90a -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++20 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward_vbe.dir/gen_embedding_backward_rowwise_adagrad_split_weighted_vbe_cuda.cu.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward_vbe.dir/gen_embedding_backward_rowwise_adagrad_split_weighted_vbe_cuda.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_backward_rowwise_adagrad_split_weighted_vbe_cuda.cu -o CMakeFiles/fbgemm_gpu_tbe_training_backward_vbe.dir/gen_embedding_backward_rowwise_adagrad_split_weighted_vbe_cuda.cu.o 2025-05-07T20:03:34.6522055Z [382/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_gwd_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -gencode arch=compute_90,code=sm_90 -gencode arch=compute_90a,code=sm_90a -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++20 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward_gwd.dir/gen_embedding_backward_rowwise_adagrad_split_unweighted_vbe_gwd_kernel_warp.cu.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward_gwd.dir/gen_embedding_backward_rowwise_adagrad_split_unweighted_vbe_gwd_kernel_warp.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_backward_rowwise_adagrad_split_unweighted_vbe_gwd_kernel_warp.cu -o CMakeFiles/fbgemm_gpu_tbe_training_backward_gwd.dir/gen_embedding_backward_rowwise_adagrad_split_unweighted_vbe_gwd_kernel_warp.cu.o 2025-05-07T20:03:36.5319050Z [383/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_vbe_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -gencode arch=compute_90,code=sm_90 -gencode arch=compute_90a,code=sm_90a -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++20 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward_vbe.dir/gen_embedding_backward_rowwise_adagrad_split_unweighted_vbe_cuda.cu.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward_vbe.dir/gen_embedding_backward_rowwise_adagrad_split_unweighted_vbe_cuda.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_backward_rowwise_adagrad_split_unweighted_vbe_cuda.cu -o CMakeFiles/fbgemm_gpu_tbe_training_backward_vbe.dir/gen_embedding_backward_rowwise_adagrad_split_unweighted_vbe_cuda.cu.o 2025-05-07T20:03:37.3616221Z [384/608] /github/home/miniconda/envs/build_binary/bin/x86_64-conda-linux-gnu-c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_dense_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/lib -O3 -DNDEBUG -std=c++20 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -mavx2 -mf16c -mfma -fopenmp -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward_dense.dir/gen_embedding_backward_dense_split_weighted_meta.cpp.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward_dense.dir/gen_embedding_backward_dense_split_weighted_meta.cpp.o.d -o CMakeFiles/fbgemm_gpu_tbe_training_backward_dense.dir/gen_embedding_backward_dense_split_weighted_meta.cpp.o -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_backward_dense_split_weighted_meta.cpp 2025-05-07T20:03:40.5751972Z [385/608] /github/home/miniconda/envs/build_binary/bin/x86_64-conda-linux-gnu-c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_dense_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/lib -O3 -DNDEBUG -std=c++20 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -mavx2 -mf16c -mfma -fopenmp -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward_dense.dir/gen_embedding_backward_dense_split_unweighted_nobag_meta.cpp.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward_dense.dir/gen_embedding_backward_dense_split_unweighted_nobag_meta.cpp.o.d -o CMakeFiles/fbgemm_gpu_tbe_training_backward_dense.dir/gen_embedding_backward_dense_split_unweighted_nobag_meta.cpp.o -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_backward_dense_split_unweighted_nobag_meta.cpp 2025-05-07T20:03:40.7522327Z [386/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_vbe_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -gencode arch=compute_90,code=sm_90 -gencode arch=compute_90a,code=sm_90a -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++20 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward_vbe.dir/gen_embedding_backward_rowwise_adagrad_ssd_weighted_vbe_cuda.cu.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward_vbe.dir/gen_embedding_backward_rowwise_adagrad_ssd_weighted_vbe_cuda.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_backward_rowwise_adagrad_ssd_weighted_vbe_cuda.cu -o CMakeFiles/fbgemm_gpu_tbe_training_backward_vbe.dir/gen_embedding_backward_rowwise_adagrad_ssd_weighted_vbe_cuda.cu.o 2025-05-07T20:03:40.9641991Z [387/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_vbe_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -gencode arch=compute_90,code=sm_90 -gencode arch=compute_90a,code=sm_90a -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++20 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward_vbe.dir/gen_embedding_backward_rowwise_adagrad_ssd_unweighted_vbe_cuda.cu.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward_vbe.dir/gen_embedding_backward_rowwise_adagrad_ssd_unweighted_vbe_cuda.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_backward_rowwise_adagrad_ssd_unweighted_vbe_cuda.cu -o CMakeFiles/fbgemm_gpu_tbe_training_backward_vbe.dir/gen_embedding_backward_rowwise_adagrad_ssd_unweighted_vbe_cuda.cu.o 2025-05-07T20:03:44.5657246Z [388/608] /github/home/miniconda/envs/build_binary/bin/x86_64-conda-linux-gnu-c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_dense_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/lib -O3 -DNDEBUG -std=c++20 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -mavx2 -mf16c -mfma -fopenmp -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward_dense.dir/gen_embedding_backward_dense_split_unweighted_meta.cpp.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward_dense.dir/gen_embedding_backward_dense_split_unweighted_meta.cpp.o.d -o CMakeFiles/fbgemm_gpu_tbe_training_backward_dense.dir/gen_embedding_backward_dense_split_unweighted_meta.cpp.o -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_backward_dense_split_unweighted_meta.cpp 2025-05-07T20:03:51.3919793Z [389/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_vbe_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -gencode arch=compute_90,code=sm_90 -gencode arch=compute_90a,code=sm_90a -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++20 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward_vbe.dir/gen_embedding_backward_rowwise_adagrad_split_weighted_vbe_kernel_warp.cu.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward_vbe.dir/gen_embedding_backward_rowwise_adagrad_split_weighted_vbe_kernel_warp.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_backward_rowwise_adagrad_split_weighted_vbe_kernel_warp.cu -o CMakeFiles/fbgemm_gpu_tbe_training_backward_vbe.dir/gen_embedding_backward_rowwise_adagrad_split_weighted_vbe_kernel_warp.cu.o 2025-05-07T20:03:57.1656891Z [390/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_vbe_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -gencode arch=compute_90,code=sm_90 -gencode arch=compute_90a,code=sm_90a -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++20 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward_vbe.dir/gen_embedding_backward_rowwise_adagrad_split_unweighted_vbe_kernel_warp.cu.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward_vbe.dir/gen_embedding_backward_rowwise_adagrad_split_unweighted_vbe_kernel_warp.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_backward_rowwise_adagrad_split_unweighted_vbe_kernel_warp.cu -o CMakeFiles/fbgemm_gpu_tbe_training_backward_vbe.dir/gen_embedding_backward_rowwise_adagrad_split_unweighted_vbe_kernel_warp.cu.o 2025-05-07T20:03:57.7413415Z [391/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_vbe_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -gencode arch=compute_90,code=sm_90 -gencode arch=compute_90a,code=sm_90a -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++20 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward_vbe.dir/gen_embedding_backward_rowwise_adagrad_ssd_weighted_vbe_kernel_warp.cu.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward_vbe.dir/gen_embedding_backward_rowwise_adagrad_ssd_weighted_vbe_kernel_warp.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_backward_rowwise_adagrad_ssd_weighted_vbe_kernel_warp.cu -o CMakeFiles/fbgemm_gpu_tbe_training_backward_vbe.dir/gen_embedding_backward_rowwise_adagrad_ssd_weighted_vbe_kernel_warp.cu.o 2025-05-07T20:04:00.3632286Z [392/608] /github/home/miniconda/envs/build_binary/bin/x86_64-conda-linux-gnu-c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_dense_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/lib -O3 -DNDEBUG -std=c++20 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -mavx2 -mf16c -mfma -fopenmp -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward_dense.dir/gen_embedding_backward_split_dense.cpp.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward_dense.dir/gen_embedding_backward_split_dense.cpp.o.d -o CMakeFiles/fbgemm_gpu_tbe_training_backward_dense.dir/gen_embedding_backward_split_dense.cpp.o -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_backward_split_dense.cpp 2025-05-07T20:04:01.3381458Z [393/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_vbe_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -gencode arch=compute_90,code=sm_90 -gencode arch=compute_90a,code=sm_90a -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++20 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward_vbe.dir/gen_embedding_backward_rowwise_adagrad_with_counter_split_weighted_vbe_cuda.cu.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward_vbe.dir/gen_embedding_backward_rowwise_adagrad_with_counter_split_weighted_vbe_cuda.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_backward_rowwise_adagrad_with_counter_split_weighted_vbe_cuda.cu -o CMakeFiles/fbgemm_gpu_tbe_training_backward_vbe.dir/gen_embedding_backward_rowwise_adagrad_with_counter_split_weighted_vbe_cuda.cu.o 2025-05-07T20:04:02.2731681Z [394/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_vbe_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -gencode arch=compute_90,code=sm_90 -gencode arch=compute_90a,code=sm_90a -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++20 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward_vbe.dir/gen_embedding_backward_rowwise_adagrad_ssd_unweighted_vbe_kernel_warp.cu.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward_vbe.dir/gen_embedding_backward_rowwise_adagrad_ssd_unweighted_vbe_kernel_warp.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_backward_rowwise_adagrad_ssd_unweighted_vbe_kernel_warp.cu -o CMakeFiles/fbgemm_gpu_tbe_training_backward_vbe.dir/gen_embedding_backward_rowwise_adagrad_ssd_unweighted_vbe_kernel_warp.cu.o 2025-05-07T20:04:03.4359978Z [395/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_gwd_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -gencode arch=compute_90,code=sm_90 -gencode arch=compute_90a,code=sm_90a -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++20 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward_gwd.dir/gen_embedding_backward_rowwise_adagrad_split_weighted_vbe_gwd_kernel_cta.cu.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward_gwd.dir/gen_embedding_backward_rowwise_adagrad_split_weighted_vbe_gwd_kernel_cta.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_backward_rowwise_adagrad_split_weighted_vbe_gwd_kernel_cta.cu -o CMakeFiles/fbgemm_gpu_tbe_training_backward_gwd.dir/gen_embedding_backward_rowwise_adagrad_split_weighted_vbe_gwd_kernel_cta.cu.o 2025-05-07T20:04:04.1562610Z [396/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_vbe_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -gencode arch=compute_90,code=sm_90 -gencode arch=compute_90a,code=sm_90a -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++20 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward_vbe.dir/gen_embedding_backward_rowwise_adagrad_with_counter_split_unweighted_vbe_cuda.cu.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward_vbe.dir/gen_embedding_backward_rowwise_adagrad_with_counter_split_unweighted_vbe_cuda.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_backward_rowwise_adagrad_with_counter_split_unweighted_vbe_cuda.cu -o CMakeFiles/fbgemm_gpu_tbe_training_backward_vbe.dir/gen_embedding_backward_rowwise_adagrad_with_counter_split_unweighted_vbe_cuda.cu.o 2025-05-07T20:04:04.4696949Z [397/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_gwd_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -gencode arch=compute_90,code=sm_90 -gencode arch=compute_90a,code=sm_90a -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++20 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward_gwd.dir/gen_embedding_backward_rowwise_adagrad_split_unweighted_vbe_gwd_kernel_cta.cu.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward_gwd.dir/gen_embedding_backward_rowwise_adagrad_split_unweighted_vbe_gwd_kernel_cta.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_backward_rowwise_adagrad_split_unweighted_vbe_gwd_kernel_cta.cu -o CMakeFiles/fbgemm_gpu_tbe_training_backward_gwd.dir/gen_embedding_backward_rowwise_adagrad_split_unweighted_vbe_gwd_kernel_cta.cu.o 2025-05-07T20:04:04.6201719Z [398/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_vbe_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -gencode arch=compute_90,code=sm_90 -gencode arch=compute_90a,code=sm_90a -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++20 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward_vbe.dir/gen_embedding_backward_dense_split_weighted_vbe_cuda.cu.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward_vbe.dir/gen_embedding_backward_dense_split_weighted_vbe_cuda.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_backward_dense_split_weighted_vbe_cuda.cu -o CMakeFiles/fbgemm_gpu_tbe_training_backward_vbe.dir/gen_embedding_backward_dense_split_weighted_vbe_cuda.cu.o 2025-05-07T20:04:06.3739992Z [399/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_vbe_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -gencode arch=compute_90,code=sm_90 -gencode arch=compute_90a,code=sm_90a -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++20 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward_vbe.dir/gen_embedding_backward_sgd_split_weighted_vbe_kernel_warp.cu.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward_vbe.dir/gen_embedding_backward_sgd_split_weighted_vbe_kernel_warp.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_backward_sgd_split_weighted_vbe_kernel_warp.cu -o CMakeFiles/fbgemm_gpu_tbe_training_backward_vbe.dir/gen_embedding_backward_sgd_split_weighted_vbe_kernel_warp.cu.o 2025-05-07T20:04:08.3620175Z [400/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_vbe_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -gencode arch=compute_90,code=sm_90 -gencode arch=compute_90a,code=sm_90a -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++20 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward_vbe.dir/gen_embedding_backward_sgd_split_unweighted_vbe_kernel_warp.cu.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward_vbe.dir/gen_embedding_backward_sgd_split_unweighted_vbe_kernel_warp.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_backward_sgd_split_unweighted_vbe_kernel_warp.cu -o CMakeFiles/fbgemm_gpu_tbe_training_backward_vbe.dir/gen_embedding_backward_sgd_split_unweighted_vbe_kernel_warp.cu.o 2025-05-07T20:04:10.1496613Z [401/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_vbe_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -gencode arch=compute_90,code=sm_90 -gencode arch=compute_90a,code=sm_90a -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++20 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward_vbe.dir/gen_embedding_backward_sgd_split_weighted_vbe_cuda.cu.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward_vbe.dir/gen_embedding_backward_sgd_split_weighted_vbe_cuda.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_backward_sgd_split_weighted_vbe_cuda.cu -o CMakeFiles/fbgemm_gpu_tbe_training_backward_vbe.dir/gen_embedding_backward_sgd_split_weighted_vbe_cuda.cu.o 2025-05-07T20:04:12.5914799Z [402/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_vbe_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -gencode arch=compute_90,code=sm_90 -gencode arch=compute_90a,code=sm_90a -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++20 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward_vbe.dir/gen_embedding_backward_dense_split_weighted_vbe_kernel_warp.cu.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward_vbe.dir/gen_embedding_backward_dense_split_weighted_vbe_kernel_warp.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_backward_dense_split_weighted_vbe_kernel_warp.cu -o CMakeFiles/fbgemm_gpu_tbe_training_backward_vbe.dir/gen_embedding_backward_dense_split_weighted_vbe_kernel_warp.cu.o 2025-05-07T20:04:18.5720902Z [403/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_vbe_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -gencode arch=compute_90,code=sm_90 -gencode arch=compute_90a,code=sm_90a -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++20 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward_vbe.dir/gen_embedding_backward_sgd_split_unweighted_vbe_cuda.cu.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward_vbe.dir/gen_embedding_backward_sgd_split_unweighted_vbe_cuda.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_backward_sgd_split_unweighted_vbe_cuda.cu -o CMakeFiles/fbgemm_gpu_tbe_training_backward_vbe.dir/gen_embedding_backward_sgd_split_unweighted_vbe_cuda.cu.o 2025-05-07T20:04:20.6214829Z [404/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_vbe_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -gencode arch=compute_90,code=sm_90 -gencode arch=compute_90a,code=sm_90a -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++20 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward_vbe.dir/gen_embedding_backward_dense_split_unweighted_vbe_cuda.cu.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward_vbe.dir/gen_embedding_backward_dense_split_unweighted_vbe_cuda.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_backward_dense_split_unweighted_vbe_cuda.cu -o CMakeFiles/fbgemm_gpu_tbe_training_backward_vbe.dir/gen_embedding_backward_dense_split_unweighted_vbe_cuda.cu.o 2025-05-07T20:04:24.3673305Z [405/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_vbe_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -gencode arch=compute_90,code=sm_90 -gencode arch=compute_90a,code=sm_90a -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++20 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward_vbe.dir/gen_embedding_backward_dense_split_unweighted_vbe_kernel_warp.cu.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward_vbe.dir/gen_embedding_backward_dense_split_unweighted_vbe_kernel_warp.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_backward_dense_split_unweighted_vbe_kernel_warp.cu -o CMakeFiles/fbgemm_gpu_tbe_training_backward_vbe.dir/gen_embedding_backward_dense_split_unweighted_vbe_kernel_warp.cu.o 2025-05-07T20:04:24.9873448Z [406/608] /github/home/miniconda/envs/build_binary/bin/x86_64-conda-linux-gnu-c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_split_host_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/lib -O3 -DNDEBUG -std=c++20 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -mavx2 -mf16c -mfma -fopenmp -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward_split_host.dir/gen_embedding_backward_split_adagrad.cpp.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward_split_host.dir/gen_embedding_backward_split_adagrad.cpp.o.d -o CMakeFiles/fbgemm_gpu_tbe_training_backward_split_host.dir/gen_embedding_backward_split_adagrad.cpp.o -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_backward_split_adagrad.cpp 2025-05-07T20:04:25.5260497Z [407/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_vbe_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -gencode arch=compute_90,code=sm_90 -gencode arch=compute_90a,code=sm_90a -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++20 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward_vbe.dir/gen_embedding_backward_rowwise_adagrad_split_weighted_vbe_kernel_cta.cu.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward_vbe.dir/gen_embedding_backward_rowwise_adagrad_split_weighted_vbe_kernel_cta.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_backward_rowwise_adagrad_split_weighted_vbe_kernel_cta.cu -o CMakeFiles/fbgemm_gpu_tbe_training_backward_vbe.dir/gen_embedding_backward_rowwise_adagrad_split_weighted_vbe_kernel_cta.cu.o 2025-05-07T20:04:28.5808639Z [408/608] /github/home/miniconda/envs/build_binary/bin/x86_64-conda-linux-gnu-c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_split_host_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/lib -O3 -DNDEBUG -std=c++20 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -mavx2 -mf16c -mfma -fopenmp -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward_split_host.dir/gen_embedding_backward_split_lars_sgd.cpp.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward_split_host.dir/gen_embedding_backward_split_lars_sgd.cpp.o.d -o CMakeFiles/fbgemm_gpu_tbe_training_backward_split_host.dir/gen_embedding_backward_split_lars_sgd.cpp.o -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_backward_split_lars_sgd.cpp 2025-05-07T20:04:28.9649168Z [409/608] /github/home/miniconda/envs/build_binary/bin/x86_64-conda-linux-gnu-c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_split_host_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/lib -O3 -DNDEBUG -std=c++20 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -mavx2 -mf16c -mfma -fopenmp -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward_split_host.dir/gen_embedding_backward_split_sgd.cpp.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward_split_host.dir/gen_embedding_backward_split_sgd.cpp.o.d -o CMakeFiles/fbgemm_gpu_tbe_training_backward_split_host.dir/gen_embedding_backward_split_sgd.cpp.o -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_backward_split_sgd.cpp 2025-05-07T20:04:29.3720273Z [410/608] /github/home/miniconda/envs/build_binary/bin/x86_64-conda-linux-gnu-c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_split_host_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/lib -O3 -DNDEBUG -std=c++20 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -mavx2 -mf16c -mfma -fopenmp -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward_split_host.dir/gen_embedding_backward_split_lamb.cpp.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward_split_host.dir/gen_embedding_backward_split_lamb.cpp.o.d -o CMakeFiles/fbgemm_gpu_tbe_training_backward_split_host.dir/gen_embedding_backward_split_lamb.cpp.o -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_backward_split_lamb.cpp 2025-05-07T20:04:29.4500663Z [411/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_vbe_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -gencode arch=compute_90,code=sm_90 -gencode arch=compute_90a,code=sm_90a -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++20 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward_vbe.dir/gen_embedding_backward_rowwise_adagrad_ssd_weighted_vbe_kernel_cta.cu.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward_vbe.dir/gen_embedding_backward_rowwise_adagrad_ssd_weighted_vbe_kernel_cta.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_backward_rowwise_adagrad_ssd_weighted_vbe_kernel_cta.cu -o CMakeFiles/fbgemm_gpu_tbe_training_backward_vbe.dir/gen_embedding_backward_rowwise_adagrad_ssd_weighted_vbe_kernel_cta.cu.o 2025-05-07T20:04:30.1117333Z [412/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_vbe_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -gencode arch=compute_90,code=sm_90 -gencode arch=compute_90a,code=sm_90a -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++20 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward_vbe.dir/gen_embedding_backward_adam_split_weighted_vbe_kernel_cta.cu.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward_vbe.dir/gen_embedding_backward_adam_split_weighted_vbe_kernel_cta.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_backward_adam_split_weighted_vbe_kernel_cta.cu -o CMakeFiles/fbgemm_gpu_tbe_training_backward_vbe.dir/gen_embedding_backward_adam_split_weighted_vbe_kernel_cta.cu.o 2025-05-07T20:04:30.9286972Z [413/608] /github/home/miniconda/envs/build_binary/bin/x86_64-conda-linux-gnu-c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_split_host_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/lib -O3 -DNDEBUG -std=c++20 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -mavx2 -mf16c -mfma -fopenmp -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward_split_host.dir/gen_embedding_backward_split_partial_rowwise_adam.cpp.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward_split_host.dir/gen_embedding_backward_split_partial_rowwise_adam.cpp.o.d -o CMakeFiles/fbgemm_gpu_tbe_training_backward_split_host.dir/gen_embedding_backward_split_partial_rowwise_adam.cpp.o -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_backward_split_partial_rowwise_adam.cpp 2025-05-07T20:04:31.0650707Z [414/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_vbe_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -gencode arch=compute_90,code=sm_90 -gencode arch=compute_90a,code=sm_90a -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++20 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward_vbe.dir/gen_embedding_backward_adam_split_weighted_vbe_kernel_warp.cu.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward_vbe.dir/gen_embedding_backward_adam_split_weighted_vbe_kernel_warp.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_backward_adam_split_weighted_vbe_kernel_warp.cu -o CMakeFiles/fbgemm_gpu_tbe_training_backward_vbe.dir/gen_embedding_backward_adam_split_weighted_vbe_kernel_warp.cu.o 2025-05-07T20:04:31.5777896Z [415/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_vbe_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -gencode arch=compute_90,code=sm_90 -gencode arch=compute_90a,code=sm_90a -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++20 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward_vbe.dir/gen_embedding_backward_rowwise_adagrad_split_unweighted_vbe_kernel_cta.cu.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward_vbe.dir/gen_embedding_backward_rowwise_adagrad_split_unweighted_vbe_kernel_cta.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_backward_rowwise_adagrad_split_unweighted_vbe_kernel_cta.cu -o CMakeFiles/fbgemm_gpu_tbe_training_backward_vbe.dir/gen_embedding_backward_rowwise_adagrad_split_unweighted_vbe_kernel_cta.cu.o 2025-05-07T20:04:32.7672506Z [416/608] /github/home/miniconda/envs/build_binary/bin/x86_64-conda-linux-gnu-c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_split_host_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/lib -O3 -DNDEBUG -std=c++20 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -mavx2 -mf16c -mfma -fopenmp -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward_split_host.dir/gen_embedding_backward_split_partial_rowwise_lamb.cpp.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward_split_host.dir/gen_embedding_backward_split_partial_rowwise_lamb.cpp.o.d -o CMakeFiles/fbgemm_gpu_tbe_training_backward_split_host.dir/gen_embedding_backward_split_partial_rowwise_lamb.cpp.o -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_backward_split_partial_rowwise_lamb.cpp 2025-05-07T20:04:33.0000713Z [417/608] /github/home/miniconda/envs/build_binary/bin/x86_64-conda-linux-gnu-c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_split_host_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/lib -O3 -DNDEBUG -std=c++20 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -mavx2 -mf16c -mfma -fopenmp -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward_split_host.dir/gen_embedding_backward_split_approx_sgd.cpp.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward_split_host.dir/gen_embedding_backward_split_approx_sgd.cpp.o.d -o CMakeFiles/fbgemm_gpu_tbe_training_backward_split_host.dir/gen_embedding_backward_split_approx_sgd.cpp.o -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_backward_split_approx_sgd.cpp 2025-05-07T20:04:33.3013863Z [418/608] /github/home/miniconda/envs/build_binary/bin/x86_64-conda-linux-gnu-c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_split_host_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/lib -O3 -DNDEBUG -std=c++20 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -mavx2 -mf16c -mfma -fopenmp -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward_split_host.dir/gen_embedding_backward_split_none.cpp.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward_split_host.dir/gen_embedding_backward_split_none.cpp.o.d -o CMakeFiles/fbgemm_gpu_tbe_training_backward_split_host.dir/gen_embedding_backward_split_none.cpp.o -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_backward_split_none.cpp 2025-05-07T20:04:34.5658422Z [419/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_vbe_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -gencode arch=compute_90,code=sm_90 -gencode arch=compute_90a,code=sm_90a -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++20 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward_vbe.dir/gen_embedding_backward_adam_split_weighted_vbe_cuda.cu.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward_vbe.dir/gen_embedding_backward_adam_split_weighted_vbe_cuda.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_backward_adam_split_weighted_vbe_cuda.cu -o CMakeFiles/fbgemm_gpu_tbe_training_backward_vbe.dir/gen_embedding_backward_adam_split_weighted_vbe_cuda.cu.o 2025-05-07T20:04:34.6465216Z [420/608] /github/home/miniconda/envs/build_binary/bin/x86_64-conda-linux-gnu-c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_split_host_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/lib -O3 -DNDEBUG -std=c++20 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -mavx2 -mf16c -mfma -fopenmp -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward_split_host.dir/gen_embedding_backward_split_adam.cpp.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward_split_host.dir/gen_embedding_backward_split_adam.cpp.o.d -o CMakeFiles/fbgemm_gpu_tbe_training_backward_split_host.dir/gen_embedding_backward_split_adam.cpp.o -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_backward_split_adam.cpp 2025-05-07T20:04:34.7555055Z [421/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_vbe_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -gencode arch=compute_90,code=sm_90 -gencode arch=compute_90a,code=sm_90a -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++20 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward_vbe.dir/gen_embedding_backward_dense_split_weighted_vbe_kernel_cta.cu.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward_vbe.dir/gen_embedding_backward_dense_split_weighted_vbe_kernel_cta.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_backward_dense_split_weighted_vbe_kernel_cta.cu -o CMakeFiles/fbgemm_gpu_tbe_training_backward_vbe.dir/gen_embedding_backward_dense_split_weighted_vbe_kernel_cta.cu.o 2025-05-07T20:04:34.8209906Z [422/608] /github/home/miniconda/envs/build_binary/bin/x86_64-conda-linux-gnu-c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_split_host_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/lib -O3 -DNDEBUG -std=c++20 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -mavx2 -mf16c -mfma -fopenmp -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward_split_host.dir/gen_embedding_backward_split_approx_rowwise_adagrad.cpp.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward_split_host.dir/gen_embedding_backward_split_approx_rowwise_adagrad.cpp.o.d -o CMakeFiles/fbgemm_gpu_tbe_training_backward_split_host.dir/gen_embedding_backward_split_approx_rowwise_adagrad.cpp.o -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_backward_split_approx_rowwise_adagrad.cpp 2025-05-07T20:04:35.5377716Z [423/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_vbe_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -gencode arch=compute_90,code=sm_90 -gencode arch=compute_90a,code=sm_90a -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++20 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward_vbe.dir/gen_embedding_backward_adam_split_unweighted_vbe_kernel_warp.cu.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward_vbe.dir/gen_embedding_backward_adam_split_unweighted_vbe_kernel_warp.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_backward_adam_split_unweighted_vbe_kernel_warp.cu -o CMakeFiles/fbgemm_gpu_tbe_training_backward_vbe.dir/gen_embedding_backward_adam_split_unweighted_vbe_kernel_warp.cu.o 2025-05-07T20:04:35.6681869Z [424/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_vbe_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -gencode arch=compute_90,code=sm_90 -gencode arch=compute_90a,code=sm_90a -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++20 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward_vbe.dir/gen_embedding_backward_rowwise_adagrad_ssd_unweighted_vbe_kernel_cta.cu.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward_vbe.dir/gen_embedding_backward_rowwise_adagrad_ssd_unweighted_vbe_kernel_cta.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_backward_rowwise_adagrad_ssd_unweighted_vbe_kernel_cta.cu -o CMakeFiles/fbgemm_gpu_tbe_training_backward_vbe.dir/gen_embedding_backward_rowwise_adagrad_ssd_unweighted_vbe_kernel_cta.cu.o 2025-05-07T20:04:35.7708114Z [425/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_vbe_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -gencode arch=compute_90,code=sm_90 -gencode arch=compute_90a,code=sm_90a -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++20 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward_vbe.dir/gen_embedding_backward_sgd_split_weighted_vbe_kernel_cta.cu.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward_vbe.dir/gen_embedding_backward_sgd_split_weighted_vbe_kernel_cta.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_backward_sgd_split_weighted_vbe_kernel_cta.cu -o CMakeFiles/fbgemm_gpu_tbe_training_backward_vbe.dir/gen_embedding_backward_sgd_split_weighted_vbe_kernel_cta.cu.o 2025-05-07T20:04:36.4044149Z [426/608] /github/home/miniconda/envs/build_binary/bin/x86_64-conda-linux-gnu-c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_split_host_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/lib -O3 -DNDEBUG -std=c++20 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -mavx2 -mf16c -mfma -fopenmp -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward_split_host.dir/gen_embedding_backward_split_rowwise_adagrad.cpp.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward_split_host.dir/gen_embedding_backward_split_rowwise_adagrad.cpp.o.d -o CMakeFiles/fbgemm_gpu_tbe_training_backward_split_host.dir/gen_embedding_backward_split_rowwise_adagrad.cpp.o -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_backward_split_rowwise_adagrad.cpp 2025-05-07T20:04:36.5589854Z [427/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_vbe_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -gencode arch=compute_90,code=sm_90 -gencode arch=compute_90a,code=sm_90a -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++20 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward_vbe.dir/gen_embedding_backward_sgd_split_unweighted_vbe_kernel_cta.cu.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward_vbe.dir/gen_embedding_backward_sgd_split_unweighted_vbe_kernel_cta.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_backward_sgd_split_unweighted_vbe_kernel_cta.cu -o CMakeFiles/fbgemm_gpu_tbe_training_backward_vbe.dir/gen_embedding_backward_sgd_split_unweighted_vbe_kernel_cta.cu.o 2025-05-07T20:04:38.8490979Z [428/608] /github/home/miniconda/envs/build_binary/bin/x86_64-conda-linux-gnu-c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_split_host_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/lib -O3 -DNDEBUG -std=c++20 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -mavx2 -mf16c -mfma -fopenmp -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward_split_host.dir/gen_embedding_backward_split_approx_rowwise_adagrad_with_counter.cpp.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward_split_host.dir/gen_embedding_backward_split_approx_rowwise_adagrad_with_counter.cpp.o.d -o CMakeFiles/fbgemm_gpu_tbe_training_backward_split_host.dir/gen_embedding_backward_split_approx_rowwise_adagrad_with_counter.cpp.o -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_backward_split_approx_rowwise_adagrad_with_counter.cpp 2025-05-07T20:04:38.9263586Z [429/608] /github/home/miniconda/envs/build_binary/bin/x86_64-conda-linux-gnu-c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_split_host_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/lib -O3 -DNDEBUG -std=c++20 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -mavx2 -mf16c -mfma -fopenmp -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward_split_host.dir/gen_embedding_backward_adagrad_split_weighted_meta.cpp.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward_split_host.dir/gen_embedding_backward_adagrad_split_weighted_meta.cpp.o.d -o CMakeFiles/fbgemm_gpu_tbe_training_backward_split_host.dir/gen_embedding_backward_adagrad_split_weighted_meta.cpp.o -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_backward_adagrad_split_weighted_meta.cpp 2025-05-07T20:04:39.4146494Z [430/608] /github/home/miniconda/envs/build_binary/bin/x86_64-conda-linux-gnu-c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_split_host_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/lib -O3 -DNDEBUG -std=c++20 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -mavx2 -mf16c -mfma -fopenmp -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward_split_host.dir/gen_embedding_backward_split_approx_rowwise_adagrad_with_weight_decay.cpp.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward_split_host.dir/gen_embedding_backward_split_approx_rowwise_adagrad_with_weight_decay.cpp.o.d -o CMakeFiles/fbgemm_gpu_tbe_training_backward_split_host.dir/gen_embedding_backward_split_approx_rowwise_adagrad_with_weight_decay.cpp.o -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_backward_split_approx_rowwise_adagrad_with_weight_decay.cpp 2025-05-07T20:04:39.4933015Z [431/608] /github/home/miniconda/envs/build_binary/bin/x86_64-conda-linux-gnu-c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_split_host_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/lib -O3 -DNDEBUG -std=c++20 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -mavx2 -mf16c -mfma -fopenmp -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward_split_host.dir/gen_embedding_backward_adagrad_split_unweighted_meta.cpp.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward_split_host.dir/gen_embedding_backward_adagrad_split_unweighted_meta.cpp.o.d -o CMakeFiles/fbgemm_gpu_tbe_training_backward_split_host.dir/gen_embedding_backward_adagrad_split_unweighted_meta.cpp.o -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_backward_adagrad_split_unweighted_meta.cpp 2025-05-07T20:04:40.3372422Z [432/608] /github/home/miniconda/envs/build_binary/bin/x86_64-conda-linux-gnu-c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_split_host_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/lib -O3 -DNDEBUG -std=c++20 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -mavx2 -mf16c -mfma -fopenmp -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward_split_host.dir/gen_embedding_backward_split_rowwise_adagrad_with_weight_decay.cpp.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward_split_host.dir/gen_embedding_backward_split_rowwise_adagrad_with_weight_decay.cpp.o.d -o CMakeFiles/fbgemm_gpu_tbe_training_backward_split_host.dir/gen_embedding_backward_split_rowwise_adagrad_with_weight_decay.cpp.o -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_backward_split_rowwise_adagrad_with_weight_decay.cpp 2025-05-07T20:04:40.4113287Z [433/608] /github/home/miniconda/envs/build_binary/bin/x86_64-conda-linux-gnu-c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_split_host_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/lib -O3 -DNDEBUG -std=c++20 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -mavx2 -mf16c -mfma -fopenmp -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward_split_host.dir/gen_embedding_backward_rowwise_adagrad_split_weighted_meta.cpp.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward_split_host.dir/gen_embedding_backward_rowwise_adagrad_split_weighted_meta.cpp.o.d -o CMakeFiles/fbgemm_gpu_tbe_training_backward_split_host.dir/gen_embedding_backward_rowwise_adagrad_split_weighted_meta.cpp.o -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_backward_rowwise_adagrad_split_weighted_meta.cpp 2025-05-07T20:04:40.5878807Z [434/608] /github/home/miniconda/envs/build_binary/bin/x86_64-conda-linux-gnu-c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_split_host_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/lib -O3 -DNDEBUG -std=c++20 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -mavx2 -mf16c -mfma -fopenmp -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward_split_host.dir/gen_embedding_backward_sgd_split_weighted_meta.cpp.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward_split_host.dir/gen_embedding_backward_sgd_split_weighted_meta.cpp.o.d -o CMakeFiles/fbgemm_gpu_tbe_training_backward_split_host.dir/gen_embedding_backward_sgd_split_weighted_meta.cpp.o -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_backward_sgd_split_weighted_meta.cpp 2025-05-07T20:04:40.8896688Z [435/608] /github/home/miniconda/envs/build_binary/bin/x86_64-conda-linux-gnu-c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_split_host_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/lib -O3 -DNDEBUG -std=c++20 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -mavx2 -mf16c -mfma -fopenmp -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward_split_host.dir/gen_embedding_backward_rowwise_adagrad_split_unweighted_meta.cpp.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward_split_host.dir/gen_embedding_backward_rowwise_adagrad_split_unweighted_meta.cpp.o.d -o CMakeFiles/fbgemm_gpu_tbe_training_backward_split_host.dir/gen_embedding_backward_rowwise_adagrad_split_unweighted_meta.cpp.o -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_backward_rowwise_adagrad_split_unweighted_meta.cpp 2025-05-07T20:04:41.1596267Z [436/608] /github/home/miniconda/envs/build_binary/bin/x86_64-conda-linux-gnu-c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_split_host_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/lib -O3 -DNDEBUG -std=c++20 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -mavx2 -mf16c -mfma -fopenmp -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward_split_host.dir/gen_embedding_backward_sgd_split_unweighted_meta.cpp.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward_split_host.dir/gen_embedding_backward_sgd_split_unweighted_meta.cpp.o.d -o CMakeFiles/fbgemm_gpu_tbe_training_backward_split_host.dir/gen_embedding_backward_sgd_split_unweighted_meta.cpp.o -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_backward_sgd_split_unweighted_meta.cpp 2025-05-07T20:04:42.4847262Z [437/608] /github/home/miniconda/envs/build_binary/bin/x86_64-conda-linux-gnu-c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_split_host_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/lib -O3 -DNDEBUG -std=c++20 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -mavx2 -mf16c -mfma -fopenmp -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward_split_host.dir/gen_embedding_backward_adam_split_weighted_meta.cpp.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward_split_host.dir/gen_embedding_backward_adam_split_weighted_meta.cpp.o.d -o CMakeFiles/fbgemm_gpu_tbe_training_backward_split_host.dir/gen_embedding_backward_adam_split_weighted_meta.cpp.o -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_backward_adam_split_weighted_meta.cpp 2025-05-07T20:04:42.7522766Z [438/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_vbe_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -gencode arch=compute_90,code=sm_90 -gencode arch=compute_90a,code=sm_90a -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++20 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward_vbe.dir/gen_embedding_backward_adam_split_unweighted_vbe_kernel_cta.cu.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward_vbe.dir/gen_embedding_backward_adam_split_unweighted_vbe_kernel_cta.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_backward_adam_split_unweighted_vbe_kernel_cta.cu -o CMakeFiles/fbgemm_gpu_tbe_training_backward_vbe.dir/gen_embedding_backward_adam_split_unweighted_vbe_kernel_cta.cu.o 2025-05-07T20:04:42.7863865Z [439/608] /github/home/miniconda/envs/build_binary/bin/x86_64-conda-linux-gnu-c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_split_host_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/lib -O3 -DNDEBUG -std=c++20 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -mavx2 -mf16c -mfma -fopenmp -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward_split_host.dir/gen_embedding_backward_adam_split_unweighted_meta.cpp.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward_split_host.dir/gen_embedding_backward_adam_split_unweighted_meta.cpp.o.d -o CMakeFiles/fbgemm_gpu_tbe_training_backward_split_host.dir/gen_embedding_backward_adam_split_unweighted_meta.cpp.o -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_backward_adam_split_unweighted_meta.cpp 2025-05-07T20:04:42.9030655Z [440/608] /github/home/miniconda/envs/build_binary/bin/x86_64-conda-linux-gnu-c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_split_host_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/lib -O3 -DNDEBUG -std=c++20 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -mavx2 -mf16c -mfma -fopenmp -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward_split_host.dir/gen_embedding_backward_split_rowwise_weighted_adagrad.cpp.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward_split_host.dir/gen_embedding_backward_split_rowwise_weighted_adagrad.cpp.o.d -o CMakeFiles/fbgemm_gpu_tbe_training_backward_split_host.dir/gen_embedding_backward_split_rowwise_weighted_adagrad.cpp.o -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_backward_split_rowwise_weighted_adagrad.cpp 2025-05-07T20:04:42.9749359Z [441/608] /github/home/miniconda/envs/build_binary/bin/x86_64-conda-linux-gnu-c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_split_host_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/lib -O3 -DNDEBUG -std=c++20 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -mavx2 -mf16c -mfma -fopenmp -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward_split_host.dir/gen_embedding_backward_lamb_split_weighted_meta.cpp.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward_split_host.dir/gen_embedding_backward_lamb_split_weighted_meta.cpp.o.d -o CMakeFiles/fbgemm_gpu_tbe_training_backward_split_host.dir/gen_embedding_backward_lamb_split_weighted_meta.cpp.o -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_backward_lamb_split_weighted_meta.cpp 2025-05-07T20:04:43.3163372Z [442/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_vbe_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -gencode arch=compute_90,code=sm_90 -gencode arch=compute_90a,code=sm_90a -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++20 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward_vbe.dir/gen_embedding_backward_dense_split_unweighted_vbe_kernel_cta.cu.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward_vbe.dir/gen_embedding_backward_dense_split_unweighted_vbe_kernel_cta.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_backward_dense_split_unweighted_vbe_kernel_cta.cu -o CMakeFiles/fbgemm_gpu_tbe_training_backward_vbe.dir/gen_embedding_backward_dense_split_unweighted_vbe_kernel_cta.cu.o 2025-05-07T20:04:43.5260779Z [443/608] /github/home/miniconda/envs/build_binary/bin/x86_64-conda-linux-gnu-c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_split_host_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/lib -O3 -DNDEBUG -std=c++20 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -mavx2 -mf16c -mfma -fopenmp -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward_split_host.dir/gen_embedding_backward_split_rowwise_adagrad_with_counter.cpp.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward_split_host.dir/gen_embedding_backward_split_rowwise_adagrad_with_counter.cpp.o.d -o CMakeFiles/fbgemm_gpu_tbe_training_backward_split_host.dir/gen_embedding_backward_split_rowwise_adagrad_with_counter.cpp.o -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_backward_split_rowwise_adagrad_with_counter.cpp 2025-05-07T20:04:44.1584646Z [444/608] /github/home/miniconda/envs/build_binary/bin/x86_64-conda-linux-gnu-c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_split_host_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/lib -O3 -DNDEBUG -std=c++20 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -mavx2 -mf16c -mfma -fopenmp -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward_split_host.dir/gen_embedding_backward_lars_sgd_split_weighted_meta.cpp.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward_split_host.dir/gen_embedding_backward_lars_sgd_split_weighted_meta.cpp.o.d -o CMakeFiles/fbgemm_gpu_tbe_training_backward_split_host.dir/gen_embedding_backward_lars_sgd_split_weighted_meta.cpp.o -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_backward_lars_sgd_split_weighted_meta.cpp 2025-05-07T20:04:44.2040406Z [445/608] /github/home/miniconda/envs/build_binary/bin/x86_64-conda-linux-gnu-c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_split_host_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/lib -O3 -DNDEBUG -std=c++20 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -mavx2 -mf16c -mfma -fopenmp -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward_split_host.dir/gen_embedding_backward_lamb_split_unweighted_meta.cpp.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward_split_host.dir/gen_embedding_backward_lamb_split_unweighted_meta.cpp.o.d -o CMakeFiles/fbgemm_gpu_tbe_training_backward_split_host.dir/gen_embedding_backward_lamb_split_unweighted_meta.cpp.o -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_backward_lamb_split_unweighted_meta.cpp 2025-05-07T20:04:44.3842365Z [446/608] /github/home/miniconda/envs/build_binary/bin/x86_64-conda-linux-gnu-c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_split_host_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/lib -O3 -DNDEBUG -std=c++20 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -mavx2 -mf16c -mfma -fopenmp -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward_split_host.dir/gen_embedding_backward_partial_rowwise_adam_split_weighted_meta.cpp.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward_split_host.dir/gen_embedding_backward_partial_rowwise_adam_split_weighted_meta.cpp.o.d -o CMakeFiles/fbgemm_gpu_tbe_training_backward_split_host.dir/gen_embedding_backward_partial_rowwise_adam_split_weighted_meta.cpp.o -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_backward_partial_rowwise_adam_split_weighted_meta.cpp 2025-05-07T20:04:44.7856699Z [447/608] /github/home/miniconda/envs/build_binary/bin/x86_64-conda-linux-gnu-c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_split_host_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/lib -O3 -DNDEBUG -std=c++20 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -mavx2 -mf16c -mfma -fopenmp -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward_split_host.dir/gen_embedding_backward_lars_sgd_split_unweighted_meta.cpp.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward_split_host.dir/gen_embedding_backward_lars_sgd_split_unweighted_meta.cpp.o.d -o CMakeFiles/fbgemm_gpu_tbe_training_backward_split_host.dir/gen_embedding_backward_lars_sgd_split_unweighted_meta.cpp.o -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_backward_lars_sgd_split_unweighted_meta.cpp 2025-05-07T20:04:45.1365088Z [448/608] /github/home/miniconda/envs/build_binary/bin/x86_64-conda-linux-gnu-c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_split_host_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/lib -O3 -DNDEBUG -std=c++20 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -mavx2 -mf16c -mfma -fopenmp -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward_split_host.dir/gen_embedding_backward_partial_rowwise_adam_split_unweighted_meta.cpp.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward_split_host.dir/gen_embedding_backward_partial_rowwise_adam_split_unweighted_meta.cpp.o.d -o CMakeFiles/fbgemm_gpu_tbe_training_backward_split_host.dir/gen_embedding_backward_partial_rowwise_adam_split_unweighted_meta.cpp.o -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_backward_partial_rowwise_adam_split_unweighted_meta.cpp 2025-05-07T20:04:45.1970794Z [449/608] /github/home/miniconda/envs/build_binary/bin/x86_64-conda-linux-gnu-c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_split_host_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/lib -O3 -DNDEBUG -std=c++20 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -mavx2 -mf16c -mfma -fopenmp -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward_split_host.dir/gen_embedding_backward_partial_rowwise_lamb_split_weighted_meta.cpp.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward_split_host.dir/gen_embedding_backward_partial_rowwise_lamb_split_weighted_meta.cpp.o.d -o CMakeFiles/fbgemm_gpu_tbe_training_backward_split_host.dir/gen_embedding_backward_partial_rowwise_lamb_split_weighted_meta.cpp.o -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_backward_partial_rowwise_lamb_split_weighted_meta.cpp 2025-05-07T20:04:45.4390100Z [450/608] /github/home/miniconda/envs/build_binary/bin/x86_64-conda-linux-gnu-c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_index_select_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/lib -O3 -DNDEBUG -std=c++20 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -mavx2 -mf16c -mfma -fopenmp -MD -MT CMakeFiles/fbgemm_gpu_tbe_index_select.dir/codegen/training/index_select/batch_index_select_dim0_ops.cpp.o -MF CMakeFiles/fbgemm_gpu_tbe_index_select.dir/codegen/training/index_select/batch_index_select_dim0_ops.cpp.o.d -o CMakeFiles/fbgemm_gpu_tbe_index_select.dir/codegen/training/index_select/batch_index_select_dim0_ops.cpp.o -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/codegen/training/index_select/batch_index_select_dim0_ops.cpp 2025-05-07T20:04:45.7488810Z [451/608] /github/home/miniconda/envs/build_binary/bin/x86_64-conda-linux-gnu-c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_split_host_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/lib -O3 -DNDEBUG -std=c++20 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -mavx2 -mf16c -mfma -fopenmp -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward_split_host.dir/gen_embedding_backward_partial_rowwise_lamb_split_unweighted_meta.cpp.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward_split_host.dir/gen_embedding_backward_partial_rowwise_lamb_split_unweighted_meta.cpp.o.d -o CMakeFiles/fbgemm_gpu_tbe_training_backward_split_host.dir/gen_embedding_backward_partial_rowwise_lamb_split_unweighted_meta.cpp.o -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_backward_partial_rowwise_lamb_split_unweighted_meta.cpp 2025-05-07T20:04:46.0742790Z [452/608] /github/home/miniconda/envs/build_binary/bin/x86_64-conda-linux-gnu-c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_split_host_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/lib -O3 -DNDEBUG -std=c++20 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -mavx2 -mf16c -mfma -fopenmp -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward_split_host.dir/gen_embedding_backward_none_split_weighted_meta.cpp.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward_split_host.dir/gen_embedding_backward_none_split_weighted_meta.cpp.o.d -o CMakeFiles/fbgemm_gpu_tbe_training_backward_split_host.dir/gen_embedding_backward_none_split_weighted_meta.cpp.o -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_backward_none_split_weighted_meta.cpp 2025-05-07T20:04:46.4203988Z [453/608] /github/home/miniconda/envs/build_binary/bin/x86_64-conda-linux-gnu-c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_split_host_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/lib -O3 -DNDEBUG -std=c++20 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -mavx2 -mf16c -mfma -fopenmp -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward_split_host.dir/gen_embedding_backward_none_split_unweighted_meta.cpp.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward_split_host.dir/gen_embedding_backward_none_split_unweighted_meta.cpp.o.d -o CMakeFiles/fbgemm_gpu_tbe_training_backward_split_host.dir/gen_embedding_backward_none_split_unweighted_meta.cpp.o -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_backward_none_split_unweighted_meta.cpp 2025-05-07T20:04:48.5020426Z [454/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_vbe_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -gencode arch=compute_90,code=sm_90 -gencode arch=compute_90a,code=sm_90a -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++20 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward_vbe.dir/gen_embedding_backward_adam_split_unweighted_vbe_cuda.cu.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward_vbe.dir/gen_embedding_backward_adam_split_unweighted_vbe_cuda.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_backward_adam_split_unweighted_vbe_cuda.cu -o CMakeFiles/fbgemm_gpu_tbe_training_backward_vbe.dir/gen_embedding_backward_adam_split_unweighted_vbe_cuda.cu.o 2025-05-07T20:04:48.7174592Z [455/608] /github/home/miniconda/envs/build_binary/bin/x86_64-conda-linux-gnu-c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_split_host_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/lib -O3 -DNDEBUG -std=c++20 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -mavx2 -mf16c -mfma -fopenmp -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward_split_host.dir/gen_embedding_backward_rowwise_adagrad_with_counter_split_unweighted_meta.cpp.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward_split_host.dir/gen_embedding_backward_rowwise_adagrad_with_counter_split_unweighted_meta.cpp.o.d -o CMakeFiles/fbgemm_gpu_tbe_training_backward_split_host.dir/gen_embedding_backward_rowwise_adagrad_with_counter_split_unweighted_meta.cpp.o -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_backward_rowwise_adagrad_with_counter_split_unweighted_meta.cpp 2025-05-07T20:04:48.9508662Z [456/608] /github/home/miniconda/envs/build_binary/bin/x86_64-conda-linux-gnu-c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_split_host_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/lib -O3 -DNDEBUG -std=c++20 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -mavx2 -mf16c -mfma -fopenmp -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward_split_host.dir/gen_embedding_backward_rowwise_adagrad_with_counter_split_weighted_meta.cpp.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward_split_host.dir/gen_embedding_backward_rowwise_adagrad_with_counter_split_weighted_meta.cpp.o.d -o CMakeFiles/fbgemm_gpu_tbe_training_backward_split_host.dir/gen_embedding_backward_rowwise_adagrad_with_counter_split_weighted_meta.cpp.o -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_backward_rowwise_adagrad_with_counter_split_weighted_meta.cpp 2025-05-07T20:04:50.0419449Z [457/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_vbe_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -gencode arch=compute_90,code=sm_90 -gencode arch=compute_90a,code=sm_90a -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++20 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward_vbe.dir/gen_embedding_backward_rowwise_adagrad_with_counter_split_weighted_vbe_kernel_warp.cu.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward_vbe.dir/gen_embedding_backward_rowwise_adagrad_with_counter_split_weighted_vbe_kernel_warp.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_backward_rowwise_adagrad_with_counter_split_weighted_vbe_kernel_warp.cu -o CMakeFiles/fbgemm_gpu_tbe_training_backward_vbe.dir/gen_embedding_backward_rowwise_adagrad_with_counter_split_weighted_vbe_kernel_warp.cu.o 2025-05-07T20:04:51.2734336Z [458/608] /github/home/miniconda/envs/build_binary/bin/x86_64-conda-linux-gnu-c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_py_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/lib -O3 -DNDEBUG -std=c++20 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -mavx2 -mf16c -mfma -fopenmp -MD -MT CMakeFiles/fbgemm_gpu_py.dir/src/memory_utils/memory_utils.cpp.o -MF CMakeFiles/fbgemm_gpu_py.dir/src/memory_utils/memory_utils.cpp.o.d -o CMakeFiles/fbgemm_gpu_py.dir/src/memory_utils/memory_utils.cpp.o -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/src/memory_utils/memory_utils.cpp 2025-05-07T20:04:53.6368401Z [459/608] /github/home/miniconda/envs/build_binary/bin/x86_64-conda-linux-gnu-c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_py_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/lib -O3 -DNDEBUG -std=c++20 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -mavx2 -mf16c -mfma -fopenmp -MD -MT CMakeFiles/fbgemm_gpu_py.dir/src/memory_utils/memory_utils_ops.cpp.o -MF CMakeFiles/fbgemm_gpu_py.dir/src/memory_utils/memory_utils_ops.cpp.o.d -o CMakeFiles/fbgemm_gpu_py.dir/src/memory_utils/memory_utils_ops.cpp.o -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/src/memory_utils/memory_utils_ops.cpp 2025-05-07T20:04:54.4196495Z [460/608] /github/home/miniconda/envs/build_binary/bin/x86_64-conda-linux-gnu-c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_py_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/lib -O3 -DNDEBUG -std=c++20 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -mavx2 -mf16c -mfma -fopenmp -MD -MT CMakeFiles/fbgemm_gpu_py.dir/src/merge_pooled_embedding_ops/merge_pooled_embedding_ops_cpu.cpp.o -MF CMakeFiles/fbgemm_gpu_py.dir/src/merge_pooled_embedding_ops/merge_pooled_embedding_ops_cpu.cpp.o.d -o CMakeFiles/fbgemm_gpu_py.dir/src/merge_pooled_embedding_ops/merge_pooled_embedding_ops_cpu.cpp.o -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/src/merge_pooled_embedding_ops/merge_pooled_embedding_ops_cpu.cpp 2025-05-07T20:04:56.8804738Z [461/608] /github/home/miniconda/envs/build_binary/bin/x86_64-conda-linux-gnu-c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_split_host_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/lib -O3 -DNDEBUG -std=c++20 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -mavx2 -mf16c -mfma -fopenmp -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward_split_host.dir/gen_embedding_backward_ssd_rowwise_adagrad.cpp.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward_split_host.dir/gen_embedding_backward_ssd_rowwise_adagrad.cpp.o.d -o CMakeFiles/fbgemm_gpu_tbe_training_backward_split_host.dir/gen_embedding_backward_ssd_rowwise_adagrad.cpp.o -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_backward_ssd_rowwise_adagrad.cpp 2025-05-07T20:04:57.0747188Z [462/608] /github/home/miniconda/envs/build_binary/bin/x86_64-conda-linux-gnu-c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_py_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/lib -O3 -DNDEBUG -std=c++20 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -mavx2 -mf16c -mfma -fopenmp -MD -MT CMakeFiles/fbgemm_gpu_py.dir/src/permute_pooled_embedding_ops/permute_pooled_embedding_function.cpp.o -MF CMakeFiles/fbgemm_gpu_py.dir/src/permute_pooled_embedding_ops/permute_pooled_embedding_function.cpp.o.d -o CMakeFiles/fbgemm_gpu_py.dir/src/permute_pooled_embedding_ops/permute_pooled_embedding_function.cpp.o -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/src/permute_pooled_embedding_ops/permute_pooled_embedding_function.cpp 2025-05-07T20:04:57.4290317Z [463/608] /github/home/miniconda/envs/build_binary/bin/x86_64-conda-linux-gnu-c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_py_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/lib -O3 -DNDEBUG -std=c++20 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -mavx2 -mf16c -mfma -fopenmp -MD -MT CMakeFiles/fbgemm_gpu_py.dir/src/permute_multi_embedding_ops/permute_multi_embedding_function.cpp.o -MF CMakeFiles/fbgemm_gpu_py.dir/src/permute_multi_embedding_ops/permute_multi_embedding_function.cpp.o.d -o CMakeFiles/fbgemm_gpu_py.dir/src/permute_multi_embedding_ops/permute_multi_embedding_function.cpp.o -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/src/permute_multi_embedding_ops/permute_multi_embedding_function.cpp 2025-05-07T20:04:57.5087752Z [464/608] : && /github/home/miniconda/envs/build_binary/bin/x86_64-conda-linux-gnu-c++ -fPIC -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/lib -O3 -DNDEBUG -Wl,-O2 -Wl,--sort-common -Wl,--as-needed -Wl,-z,relro -Wl,-z,now -Wl,--disable-new-dtags -Wl,--gc-sections -Wl,--allow-shlib-undefined -Wl,-rpath,/github/home/miniconda/envs/build_binary/lib -Wl,-rpath-link,/github/home/miniconda/envs/build_binary/lib -L/github/home/miniconda/envs/build_binary/lib -L/github/home/miniconda/envs/build_binary/targets/x86_64-linux/lib -L/github/home/miniconda/envs/build_binary/targets/x86_64-linux/lib/stubs -s -shared -Wl,-soname,fbgemm_gpu_tbe_training_backward_split_host.so -o fbgemm_gpu_tbe_training_backward_split_host.so CMakeFiles/fbgemm_gpu_tbe_training_backward_split_host.dir/gen_embedding_backward_split_adagrad.cpp.o CMakeFiles/fbgemm_gpu_tbe_training_backward_split_host.dir/gen_embedding_backward_split_rowwise_adagrad.cpp.o CMakeFiles/fbgemm_gpu_tbe_training_backward_split_host.dir/gen_embedding_backward_split_sgd.cpp.o CMakeFiles/fbgemm_gpu_tbe_training_backward_split_host.dir/gen_embedding_backward_split_adam.cpp.o CMakeFiles/fbgemm_gpu_tbe_training_backward_split_host.dir/gen_embedding_backward_split_lamb.cpp.o CMakeFiles/fbgemm_gpu_tbe_training_backward_split_host.dir/gen_embedding_backward_split_lars_sgd.cpp.o CMakeFiles/fbgemm_gpu_tbe_training_backward_split_host.dir/gen_embedding_backward_split_partial_rowwise_adam.cpp.o CMakeFiles/fbgemm_gpu_tbe_training_backward_split_host.dir/gen_embedding_backward_split_partial_rowwise_lamb.cpp.o CMakeFiles/fbgemm_gpu_tbe_training_backward_split_host.dir/gen_embedding_backward_split_none.cpp.o CMakeFiles/fbgemm_gpu_tbe_training_backward_split_host.dir/gen_embedding_backward_split_rowwise_adagrad_with_counter.cpp.o CMakeFiles/fbgemm_gpu_tbe_training_backward_split_host.dir/gen_embedding_backward_split_approx_sgd.cpp.o CMakeFiles/fbgemm_gpu_tbe_training_backward_split_host.dir/gen_embedding_backward_split_approx_rowwise_adagrad.cpp.o CMakeFiles/fbgemm_gpu_tbe_training_backward_split_host.dir/gen_embedding_backward_split_approx_rowwise_adagrad_with_counter.cpp.o CMakeFiles/fbgemm_gpu_tbe_training_backward_split_host.dir/gen_embedding_backward_split_approx_rowwise_adagrad_with_weight_decay.cpp.o CMakeFiles/fbgemm_gpu_tbe_training_backward_split_host.dir/gen_embedding_backward_split_rowwise_adagrad_with_weight_decay.cpp.o CMakeFiles/fbgemm_gpu_tbe_training_backward_split_host.dir/gen_embedding_backward_split_rowwise_weighted_adagrad.cpp.o CMakeFiles/fbgemm_gpu_tbe_training_backward_split_host.dir/gen_embedding_backward_ssd_rowwise_adagrad.cpp.o CMakeFiles/fbgemm_gpu_tbe_training_backward_split_host.dir/gen_embedding_backward_adagrad_split_weighted_meta.cpp.o CMakeFiles/fbgemm_gpu_tbe_training_backward_split_host.dir/gen_embedding_backward_adagrad_split_unweighted_meta.cpp.o CMakeFiles/fbgemm_gpu_tbe_training_backward_split_host.dir/gen_embedding_backward_rowwise_adagrad_split_weighted_meta.cpp.o CMakeFiles/fbgemm_gpu_tbe_training_backward_split_host.dir/gen_embedding_backward_rowwise_adagrad_split_unweighted_meta.cpp.o CMakeFiles/fbgemm_gpu_tbe_training_backward_split_host.dir/gen_embedding_backward_sgd_split_weighted_meta.cpp.o CMakeFiles/fbgemm_gpu_tbe_training_backward_split_host.dir/gen_embedding_backward_sgd_split_unweighted_meta.cpp.o CMakeFiles/fbgemm_gpu_tbe_training_backward_split_host.dir/gen_embedding_backward_adam_split_weighted_meta.cpp.o CMakeFiles/fbgemm_gpu_tbe_training_backward_split_host.dir/gen_embedding_backward_adam_split_unweighted_meta.cpp.o CMakeFiles/fbgemm_gpu_tbe_training_backward_split_host.dir/gen_embedding_backward_lamb_split_weighted_meta.cpp.o CMakeFiles/fbgemm_gpu_tbe_training_backward_split_host.dir/gen_embedding_backward_lamb_split_unweighted_meta.cpp.o CMakeFiles/fbgemm_gpu_tbe_training_backward_split_host.dir/gen_embedding_backward_lars_sgd_split_weighted_meta.cpp.o CMakeFiles/fbgemm_gpu_tbe_training_backward_split_host.dir/gen_embedding_backward_lars_sgd_split_unweighted_meta.cpp.o CMakeFiles/fbgemm_gpu_tbe_training_backward_split_host.dir/gen_embedding_backward_partial_rowwise_adam_split_weighted_meta.cpp.o CMakeFiles/fbgemm_gpu_tbe_training_backward_split_host.dir/gen_embedding_backward_partial_rowwise_adam_split_unweighted_meta.cpp.o CMakeFiles/fbgemm_gpu_tbe_training_backward_split_host.dir/gen_embedding_backward_partial_rowwise_lamb_split_weighted_meta.cpp.o CMakeFiles/fbgemm_gpu_tbe_training_backward_split_host.dir/gen_embedding_backward_partial_rowwise_lamb_split_unweighted_meta.cpp.o CMakeFiles/fbgemm_gpu_tbe_training_backward_split_host.dir/gen_embedding_backward_none_split_weighted_meta.cpp.o CMakeFiles/fbgemm_gpu_tbe_training_backward_split_host.dir/gen_embedding_backward_none_split_unweighted_meta.cpp.o CMakeFiles/fbgemm_gpu_tbe_training_backward_split_host.dir/gen_embedding_backward_rowwise_adagrad_with_counter_split_weighted_meta.cpp.o CMakeFiles/fbgemm_gpu_tbe_training_backward_split_host.dir/gen_embedding_backward_rowwise_adagrad_with_counter_split_unweighted_meta.cpp.o -L/lib/intel64 -L/lib/intel64_win -L/lib/win-x64 -Wl,-rpath,"\$ORIGIN" /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/lib/libc10.so /github/home/miniconda/envs/build_binary/targets/x86_64-linux/lib/libnvrtc.so /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/lib/libc10_cuda.so /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/lib/libnccl.so.2 /github/home/miniconda/envs/build_binary/targets/x86_64-linux/lib/stubs/libcuda.so fbgemm_gpu_config.so fbgemm_gpu_tbe_utils.so /github/home/miniconda/envs/build_binary/lib/stubs/libnvidia-ml.so -Wl,--no-as-needed,"/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/lib/libtorch.so" -Wl,--as-needed /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/lib/libtorch.so -Wl,--no-as-needed,"/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/lib/libtorch_cpu.so" -Wl,--as-needed -Wl,--no-as-needed,"/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/lib/libtorch_cuda.so" -Wl,--as-needed /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/lib/libc10_cuda.so /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/lib/libc10.so /github/home/miniconda/envs/build_binary/targets/x86_64-linux/lib/libcudart.so && : 2025-05-07T20:04:57.8103415Z [465/608] /github/home/miniconda/envs/build_binary/bin/x86_64-conda-linux-gnu-c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_py_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/lib -O3 -DNDEBUG -std=c++20 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -mavx2 -mf16c -mfma -fopenmp -MD -MT CMakeFiles/fbgemm_gpu_py.dir/src/tbe/eeg/eeg_models.cpp.o -MF CMakeFiles/fbgemm_gpu_py.dir/src/tbe/eeg/eeg_models.cpp.o.d -o CMakeFiles/fbgemm_gpu_py.dir/src/tbe/eeg/eeg_models.cpp.o -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/src/tbe/eeg/eeg_models.cpp 2025-05-07T20:04:58.4945880Z [466/608] /github/home/miniconda/envs/build_binary/bin/x86_64-conda-linux-gnu-c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_py_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/lib -O3 -DNDEBUG -std=c++20 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -mavx2 -mf16c -mfma -fopenmp -MD -MT CMakeFiles/fbgemm_gpu_py.dir/src/layout_transform_ops/layout_transform_ops_cpu.cpp.o -MF CMakeFiles/fbgemm_gpu_py.dir/src/layout_transform_ops/layout_transform_ops_cpu.cpp.o.d -o CMakeFiles/fbgemm_gpu_py.dir/src/layout_transform_ops/layout_transform_ops_cpu.cpp.o -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/src/layout_transform_ops/layout_transform_ops_cpu.cpp 2025-05-07T20:04:58.9472820Z [467/608] /github/home/miniconda/envs/build_binary/bin/x86_64-conda-linux-gnu-c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_py_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/lib -O3 -DNDEBUG -std=c++20 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -mavx2 -mf16c -mfma -fopenmp -MD -MT CMakeFiles/fbgemm_gpu_py.dir/src/jagged_tensor_ops/jagged_tensor_ops_meta.cpp.o -MF CMakeFiles/fbgemm_gpu_py.dir/src/jagged_tensor_ops/jagged_tensor_ops_meta.cpp.o.d -o CMakeFiles/fbgemm_gpu_py.dir/src/jagged_tensor_ops/jagged_tensor_ops_meta.cpp.o -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/src/jagged_tensor_ops/jagged_tensor_ops_meta.cpp 2025-05-07T20:04:59.3318341Z [468/608] /github/home/miniconda/envs/build_binary/bin/x86_64-conda-linux-gnu-c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_index_select_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/lib -O3 -DNDEBUG -std=c++20 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -mavx2 -mf16c -mfma -fopenmp -MD -MT CMakeFiles/fbgemm_gpu_tbe_index_select.dir/codegen/training/index_select/batch_index_select_dim0_cpu_host.cpp.o -MF CMakeFiles/fbgemm_gpu_tbe_index_select.dir/codegen/training/index_select/batch_index_select_dim0_cpu_host.cpp.o.d -o CMakeFiles/fbgemm_gpu_tbe_index_select.dir/codegen/training/index_select/batch_index_select_dim0_cpu_host.cpp.o -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/codegen/training/index_select/batch_index_select_dim0_cpu_host.cpp 2025-05-07T20:05:00.1511931Z [469/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_vbe_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -gencode arch=compute_90,code=sm_90 -gencode arch=compute_90a,code=sm_90a -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++20 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward_vbe.dir/gen_embedding_backward_rowwise_adagrad_with_counter_split_unweighted_vbe_kernel_warp.cu.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward_vbe.dir/gen_embedding_backward_rowwise_adagrad_with_counter_split_unweighted_vbe_kernel_warp.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_backward_rowwise_adagrad_with_counter_split_unweighted_vbe_kernel_warp.cu -o CMakeFiles/fbgemm_gpu_tbe_training_backward_vbe.dir/gen_embedding_backward_rowwise_adagrad_with_counter_split_unweighted_vbe_kernel_warp.cu.o 2025-05-07T20:05:00.5938732Z [470/608] /github/home/miniconda/envs/build_binary/bin/x86_64-conda-linux-gnu-c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_index_select_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/lib -O3 -DNDEBUG -std=c++20 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -mavx2 -mf16c -mfma -fopenmp -MD -MT CMakeFiles/fbgemm_gpu_tbe_index_select.dir/codegen/training/index_select/batch_index_select_dim0_host.cpp.o -MF CMakeFiles/fbgemm_gpu_tbe_index_select.dir/codegen/training/index_select/batch_index_select_dim0_host.cpp.o.d -o CMakeFiles/fbgemm_gpu_tbe_index_select.dir/codegen/training/index_select/batch_index_select_dim0_host.cpp.o -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/codegen/training/index_select/batch_index_select_dim0_host.cpp 2025-05-07T20:05:00.8814893Z [471/608] /github/home/miniconda/envs/build_binary/bin/x86_64-conda-linux-gnu-c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_py_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/lib -O3 -DNDEBUG -std=c++20 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -mavx2 -mf16c -mfma -fopenmp -MD -MT CMakeFiles/fbgemm_gpu_py.dir/src/permute_pooled_embedding_ops/permute_pooled_embedding_ops_cpu.cpp.o -MF CMakeFiles/fbgemm_gpu_py.dir/src/permute_pooled_embedding_ops/permute_pooled_embedding_ops_cpu.cpp.o.d -o CMakeFiles/fbgemm_gpu_py.dir/src/permute_pooled_embedding_ops/permute_pooled_embedding_ops_cpu.cpp.o -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/src/permute_pooled_embedding_ops/permute_pooled_embedding_ops_cpu.cpp 2025-05-07T20:05:01.4401463Z [472/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_forward_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -gencode arch=compute_90,code=sm_90 -gencode arch=compute_90a,code=sm_90a -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++20 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_forward.dir/gen_embedding_forward_split_unweighted_vbe_gwd_kernel.cu.o -MF CMakeFiles/fbgemm_gpu_tbe_training_forward.dir/gen_embedding_forward_split_unweighted_vbe_gwd_kernel.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_forward_split_unweighted_vbe_gwd_kernel.cu -o CMakeFiles/fbgemm_gpu_tbe_training_forward.dir/gen_embedding_forward_split_unweighted_vbe_gwd_kernel.cu.o 2025-05-07T20:05:01.4423160Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_forward_split_unweighted_vbe_gwd_kernel.cu(236): warning #177-D: variable "offset_idx" was declared but never referenced 2025-05-07T20:05:01.4424923Z const auto offset_idx = idx * D_emb; 2025-05-07T20:05:01.4425426Z ^ 2025-05-07T20:05:01.4425734Z 2025-05-07T20:05:01.4426156Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:05:01.4426763Z 2025-05-07T20:05:01.4428012Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_forward_split_unweighted_vbe_gwd_kernel.cu(236): warning #177-D: variable "offset_idx" was declared but never referenced 2025-05-07T20:05:01.4430003Z const auto offset_idx = idx * D_emb; 2025-05-07T20:05:01.4430560Z ^ 2025-05-07T20:05:01.4430803Z 2025-05-07T20:05:01.4431248Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:05:01.4431878Z 2025-05-07T20:05:01.4433155Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_forward_split_unweighted_vbe_gwd_kernel.cu(236): warning #177-D: variable "offset_idx" was declared but never referenced 2025-05-07T20:05:01.4434759Z const auto offset_idx = idx * D_emb; 2025-05-07T20:05:01.4435259Z ^ 2025-05-07T20:05:01.4435522Z 2025-05-07T20:05:01.4435941Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:05:01.4436557Z 2025-05-07T20:05:01.4437841Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_forward_split_unweighted_vbe_gwd_kernel.cu(236): warning #177-D: variable "offset_idx" was declared but never referenced 2025-05-07T20:05:01.4439481Z const auto offset_idx = idx * D_emb; 2025-05-07T20:05:01.4440015Z ^ 2025-05-07T20:05:01.4440254Z 2025-05-07T20:05:01.4441028Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:05:01.4441678Z 2025-05-07T20:05:01.9137998Z [473/608] /github/home/miniconda/envs/build_binary/bin/x86_64-conda-linux-gnu-c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_py_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/lib -O3 -DNDEBUG -std=c++20 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -mavx2 -mf16c -mfma -fopenmp -MD -MT CMakeFiles/fbgemm_gpu_py.dir/src/quantize_ops/quantize_ops_cpu.cpp.o -MF CMakeFiles/fbgemm_gpu_py.dir/src/quantize_ops/quantize_ops_cpu.cpp.o.d -o CMakeFiles/fbgemm_gpu_py.dir/src/quantize_ops/quantize_ops_cpu.cpp.o -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/src/quantize_ops/quantize_ops_cpu.cpp 2025-05-07T20:05:02.2637858Z [474/608] /github/home/miniconda/envs/build_binary/bin/x86_64-conda-linux-gnu-c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_py_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/lib -O3 -DNDEBUG -std=c++20 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -mavx2 -mf16c -mfma -fopenmp -MD -MT CMakeFiles/fbgemm_gpu_py.dir/src/quantize_ops/quantize_ops_meta.cpp.o -MF CMakeFiles/fbgemm_gpu_py.dir/src/quantize_ops/quantize_ops_meta.cpp.o.d -o CMakeFiles/fbgemm_gpu_py.dir/src/quantize_ops/quantize_ops_meta.cpp.o -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/src/quantize_ops/quantize_ops_meta.cpp 2025-05-07T20:05:02.5734464Z [475/608] /github/home/miniconda/envs/build_binary/bin/x86_64-conda-linux-gnu-c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_py_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/lib -O3 -DNDEBUG -std=c++20 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -mavx2 -mf16c -mfma -fopenmp -MD -MT CMakeFiles/fbgemm_gpu_py.dir/src/permute_multi_embedding_ops/permute_multi_embedding_ops_cpu.cpp.o -MF CMakeFiles/fbgemm_gpu_py.dir/src/permute_multi_embedding_ops/permute_multi_embedding_ops_cpu.cpp.o.d -o CMakeFiles/fbgemm_gpu_py.dir/src/permute_multi_embedding_ops/permute_multi_embedding_ops_cpu.cpp.o -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/src/permute_multi_embedding_ops/permute_multi_embedding_ops_cpu.cpp 2025-05-07T20:05:03.1575495Z [476/608] /github/home/miniconda/envs/build_binary/bin/x86_64-conda-linux-gnu-c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_py_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/lib -O3 -DNDEBUG -std=c++20 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -mavx2 -mf16c -mfma -fopenmp -MD -MT CMakeFiles/fbgemm_gpu_py.dir/src/sparse_ops/sparse_async_batched_cumsum.cpp.o -MF CMakeFiles/fbgemm_gpu_py.dir/src/sparse_ops/sparse_async_batched_cumsum.cpp.o.d -o CMakeFiles/fbgemm_gpu_py.dir/src/sparse_ops/sparse_async_batched_cumsum.cpp.o -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/src/sparse_ops/sparse_async_batched_cumsum.cpp 2025-05-07T20:05:03.6372217Z [477/608] /github/home/miniconda/envs/build_binary/bin/x86_64-conda-linux-gnu-c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_embedding_inplace_ops_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/lib -O3 -DNDEBUG -std=c++20 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -mavx2 -mf16c -mfma -fopenmp -MD -MT CMakeFiles/fbgemm_gpu_embedding_inplace_ops.dir/src/embedding_inplace_ops/embedding_inplace_update_gpu.cpp.o -MF CMakeFiles/fbgemm_gpu_embedding_inplace_ops.dir/src/embedding_inplace_ops/embedding_inplace_update_gpu.cpp.o.d -o CMakeFiles/fbgemm_gpu_embedding_inplace_ops.dir/src/embedding_inplace_ops/embedding_inplace_update_gpu.cpp.o -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/src/embedding_inplace_ops/embedding_inplace_update_gpu.cpp 2025-05-07T20:05:03.7524115Z [478/608] /github/home/miniconda/envs/build_binary/bin/x86_64-conda-linux-gnu-c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_embedding_inplace_ops_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/lib -O3 -DNDEBUG -std=c++20 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -mavx2 -mf16c -mfma -fopenmp -MD -MT CMakeFiles/fbgemm_gpu_embedding_inplace_ops.dir/src/embedding_inplace_ops/embedding_inplace_update_cpu.cpp.o -MF CMakeFiles/fbgemm_gpu_embedding_inplace_ops.dir/src/embedding_inplace_ops/embedding_inplace_update_cpu.cpp.o.d -o CMakeFiles/fbgemm_gpu_embedding_inplace_ops.dir/src/embedding_inplace_ops/embedding_inplace_update_cpu.cpp.o -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/src/embedding_inplace_ops/embedding_inplace_update_cpu.cpp 2025-05-07T20:05:03.7828202Z [479/608] /github/home/miniconda/envs/build_binary/bin/x86_64-conda-linux-gnu-c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_py_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/lib -O3 -DNDEBUG -std=c++20 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -mavx2 -mf16c -mfma -fopenmp -MD -MT CMakeFiles/fbgemm_gpu_py.dir/src/input_combine_ops/input_combine_cpu.cpp.o -MF CMakeFiles/fbgemm_gpu_py.dir/src/input_combine_ops/input_combine_cpu.cpp.o.d -o CMakeFiles/fbgemm_gpu_py.dir/src/input_combine_ops/input_combine_cpu.cpp.o -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/src/input_combine_ops/input_combine_cpu.cpp 2025-05-07T20:05:03.8884008Z [480/608] /github/home/miniconda/envs/build_binary/bin/x86_64-conda-linux-gnu-c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_py_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/lib -O3 -DNDEBUG -std=c++20 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -mavx2 -mf16c -mfma -fopenmp -MD -MT CMakeFiles/fbgemm_gpu_py.dir/src/permute_pooled_embedding_ops/permute_pooled_embedding_ops_split_cpu.cpp.o -MF CMakeFiles/fbgemm_gpu_py.dir/src/permute_pooled_embedding_ops/permute_pooled_embedding_ops_split_cpu.cpp.o.d -o CMakeFiles/fbgemm_gpu_py.dir/src/permute_pooled_embedding_ops/permute_pooled_embedding_ops_split_cpu.cpp.o -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/src/permute_pooled_embedding_ops/permute_pooled_embedding_ops_split_cpu.cpp 2025-05-07T20:05:06.7313012Z [481/608] /github/home/miniconda/envs/build_binary/bin/x86_64-conda-linux-gnu-c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_py_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/lib -O3 -DNDEBUG -std=c++20 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -mavx2 -mf16c -mfma -fopenmp -MD -MT CMakeFiles/fbgemm_gpu_py.dir/src/topology_utils.cpp.o -MF CMakeFiles/fbgemm_gpu_py.dir/src/topology_utils.cpp.o.d -o CMakeFiles/fbgemm_gpu_py.dir/src/topology_utils.cpp.o -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/src/topology_utils.cpp 2025-05-07T20:05:06.9955851Z [482/608] /github/home/miniconda/envs/build_binary/bin/x86_64-conda-linux-gnu-c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_py_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/lib -O3 -DNDEBUG -std=c++20 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -mavx2 -mf16c -mfma -fopenmp -MD -MT CMakeFiles/fbgemm_gpu_py.dir/src/sparse_ops/sparse_ops_meta.cpp.o -MF CMakeFiles/fbgemm_gpu_py.dir/src/sparse_ops/sparse_ops_meta.cpp.o.d -o CMakeFiles/fbgemm_gpu_py.dir/src/sparse_ops/sparse_ops_meta.cpp.o -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/src/sparse_ops/sparse_ops_meta.cpp 2025-05-07T20:05:11.4232277Z [483/608] /github/home/miniconda/envs/build_binary/bin/x86_64-conda-linux-gnu-c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_py_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/lib -O3 -DNDEBUG -std=c++20 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -mavx2 -mf16c -mfma -fopenmp -MD -MT CMakeFiles/fbgemm_gpu_py.dir/src/metric_ops/metric_ops_host.cpp.o -MF CMakeFiles/fbgemm_gpu_py.dir/src/metric_ops/metric_ops_host.cpp.o.d -o CMakeFiles/fbgemm_gpu_py.dir/src/metric_ops/metric_ops_host.cpp.o -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/src/metric_ops/metric_ops_host.cpp 2025-05-07T20:05:11.9020027Z [484/608] /github/home/miniconda/envs/build_binary/bin/x86_64-conda-linux-gnu-c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_py_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/lib -O3 -DNDEBUG -std=c++20 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -mavx2 -mf16c -mfma -fopenmp -MD -MT CMakeFiles/fbgemm_gpu_py.dir/src/layout_transform_ops/layout_transform_ops_gpu.cpp.o -MF CMakeFiles/fbgemm_gpu_py.dir/src/layout_transform_ops/layout_transform_ops_gpu.cpp.o.d -o CMakeFiles/fbgemm_gpu_py.dir/src/layout_transform_ops/layout_transform_ops_gpu.cpp.o -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/src/layout_transform_ops/layout_transform_ops_gpu.cpp 2025-05-07T20:05:12.4303778Z [485/608] /github/home/miniconda/envs/build_binary/bin/x86_64-conda-linux-gnu-c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_py_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/lib -O3 -DNDEBUG -std=c++20 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -mavx2 -mf16c -mfma -fopenmp -MD -MT CMakeFiles/fbgemm_gpu_py.dir/src/input_combine_ops/input_combine_gpu.cpp.o -MF CMakeFiles/fbgemm_gpu_py.dir/src/input_combine_ops/input_combine_gpu.cpp.o.d -o CMakeFiles/fbgemm_gpu_py.dir/src/input_combine_ops/input_combine_gpu.cpp.o -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/src/input_combine_ops/input_combine_gpu.cpp 2025-05-07T20:05:13.1652181Z [486/608] /github/home/miniconda/envs/build_binary/bin/x86_64-conda-linux-gnu-c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_py_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/lib -O3 -DNDEBUG -std=c++20 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -mavx2 -mf16c -mfma -fopenmp -MD -MT CMakeFiles/fbgemm_gpu_py.dir/src/quantize_ops/quantize_ops_gpu.cpp.o -MF CMakeFiles/fbgemm_gpu_py.dir/src/quantize_ops/quantize_ops_gpu.cpp.o.d -o CMakeFiles/fbgemm_gpu_py.dir/src/quantize_ops/quantize_ops_gpu.cpp.o -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/src/quantize_ops/quantize_ops_gpu.cpp 2025-05-07T20:05:14.8730685Z [487/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_dense_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -gencode arch=compute_90,code=sm_90 -gencode arch=compute_90a,code=sm_90a -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++20 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward_dense.dir/gen_embedding_backward_dense_split_weighted_kernel_warp.cu.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward_dense.dir/gen_embedding_backward_dense_split_weighted_kernel_warp.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_backward_dense_split_weighted_kernel_warp.cu -o CMakeFiles/fbgemm_gpu_tbe_training_backward_dense.dir/gen_embedding_backward_dense_split_weighted_kernel_warp.cu.o 2025-05-07T20:05:14.8894116Z [488/608] /github/home/miniconda/envs/build_binary/bin/x86_64-conda-linux-gnu-c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_py_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/lib -O3 -DNDEBUG -std=c++20 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -mavx2 -mf16c -mfma -fopenmp -MD -MT CMakeFiles/fbgemm_gpu_py.dir/src/merge_pooled_embedding_ops/merge_pooled_embedding_ops_gpu.cpp.o -MF CMakeFiles/fbgemm_gpu_py.dir/src/merge_pooled_embedding_ops/merge_pooled_embedding_ops_gpu.cpp.o.d -o CMakeFiles/fbgemm_gpu_py.dir/src/merge_pooled_embedding_ops/merge_pooled_embedding_ops_gpu.cpp.o -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/src/merge_pooled_embedding_ops/merge_pooled_embedding_ops_gpu.cpp 2025-05-07T20:05:17.0024963Z [489/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_forward_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -gencode arch=compute_90,code=sm_90 -gencode arch=compute_90a,code=sm_90a -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++20 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_forward.dir/gen_embedding_forward_split_weighted_vbe_gwd_kernel.cu.o -MF CMakeFiles/fbgemm_gpu_tbe_training_forward.dir/gen_embedding_forward_split_weighted_vbe_gwd_kernel.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_forward_split_weighted_vbe_gwd_kernel.cu -o CMakeFiles/fbgemm_gpu_tbe_training_forward.dir/gen_embedding_forward_split_weighted_vbe_gwd_kernel.cu.o 2025-05-07T20:05:17.0047847Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_forward_split_weighted_vbe_gwd_kernel.cu(245): warning #177-D: variable "offset_idx" was declared but never referenced 2025-05-07T20:05:17.0049612Z const auto offset_idx = idx * D_emb; 2025-05-07T20:05:17.0050180Z ^ 2025-05-07T20:05:17.0050601Z 2025-05-07T20:05:17.0051082Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:05:17.0051752Z 2025-05-07T20:05:17.0053181Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_forward_split_weighted_vbe_gwd_kernel.cu(245): warning #177-D: variable "offset_idx" was declared but never referenced 2025-05-07T20:05:17.0054821Z const auto offset_idx = idx * D_emb; 2025-05-07T20:05:17.0055372Z ^ 2025-05-07T20:05:17.0055598Z 2025-05-07T20:05:17.0056019Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:05:17.0056715Z 2025-05-07T20:05:17.0058163Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_forward_split_weighted_vbe_gwd_kernel.cu(245): warning #177-D: variable "offset_idx" was declared but never referenced 2025-05-07T20:05:17.0059797Z const auto offset_idx = idx * D_emb; 2025-05-07T20:05:17.0060271Z ^ 2025-05-07T20:05:17.0060528Z 2025-05-07T20:05:17.0060970Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:05:17.0061664Z 2025-05-07T20:05:17.0063397Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_forward_split_weighted_vbe_gwd_kernel.cu(245): warning #177-D: variable "offset_idx" was declared but never referenced 2025-05-07T20:05:17.0065230Z const auto offset_idx = idx * D_emb; 2025-05-07T20:05:17.0065800Z ^ 2025-05-07T20:05:17.0066063Z 2025-05-07T20:05:17.0066514Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:05:17.0067181Z 2025-05-07T20:05:17.2393152Z [490/608] /github/home/miniconda/envs/build_binary/bin/x86_64-conda-linux-gnu-c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_py_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/lib -O3 -DNDEBUG -std=c++20 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -mavx2 -mf16c -mfma -fopenmp -MD -MT CMakeFiles/fbgemm_gpu_py.dir/src/jagged_tensor_ops/jagged_tensor_ops_autograd.cpp.o -MF CMakeFiles/fbgemm_gpu_py.dir/src/jagged_tensor_ops/jagged_tensor_ops_autograd.cpp.o.d -o CMakeFiles/fbgemm_gpu_py.dir/src/jagged_tensor_ops/jagged_tensor_ops_autograd.cpp.o -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/src/jagged_tensor_ops/jagged_tensor_ops_autograd.cpp 2025-05-07T20:05:17.4349959Z [491/608] /github/home/miniconda/envs/build_binary/bin/x86_64-conda-linux-gnu-c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_py_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/lib -O3 -DNDEBUG -std=c++20 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -mavx2 -mf16c -mfma -fopenmp -MD -MT CMakeFiles/fbgemm_gpu_py.dir/src/tbe/eeg/eeg_utils.cpp.o -MF CMakeFiles/fbgemm_gpu_py.dir/src/tbe/eeg/eeg_utils.cpp.o.d -o CMakeFiles/fbgemm_gpu_py.dir/src/tbe/eeg/eeg_utils.cpp.o -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/src/tbe/eeg/eeg_utils.cpp 2025-05-07T20:05:17.6185221Z [492/608] /github/home/miniconda/envs/build_binary/bin/x86_64-conda-linux-gnu-c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_py_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/lib -O3 -DNDEBUG -std=c++20 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -mavx2 -mf16c -mfma -fopenmp -MD -MT CMakeFiles/fbgemm_gpu_py.dir/src/tbe/eeg/indices_estimator_ops.cpp.o -MF CMakeFiles/fbgemm_gpu_py.dir/src/tbe/eeg/indices_estimator_ops.cpp.o.d -o CMakeFiles/fbgemm_gpu_py.dir/src/tbe/eeg/indices_estimator_ops.cpp.o -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/src/tbe/eeg/indices_estimator_ops.cpp 2025-05-07T20:05:18.1705879Z [493/608] /github/home/miniconda/envs/build_binary/bin/x86_64-conda-linux-gnu-c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_py_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/lib -O3 -DNDEBUG -std=c++20 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -mavx2 -mf16c -mfma -fopenmp -MD -MT CMakeFiles/fbgemm_gpu_py.dir/src/tbe/eeg/indices_estimator.cpp.o -MF CMakeFiles/fbgemm_gpu_py.dir/src/tbe/eeg/indices_estimator.cpp.o.d -o CMakeFiles/fbgemm_gpu_py.dir/src/tbe/eeg/indices_estimator.cpp.o -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/src/tbe/eeg/indices_estimator.cpp 2025-05-07T20:05:18.2779070Z [494/608] /github/home/miniconda/envs/build_binary/bin/x86_64-conda-linux-gnu-c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_py_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/lib -O3 -DNDEBUG -std=c++20 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -mavx2 -mf16c -mfma -fopenmp -MD -MT CMakeFiles/fbgemm_gpu_py.dir/src/permute_pooled_embedding_ops/permute_pooled_embedding_ops_gpu.cpp.o -MF CMakeFiles/fbgemm_gpu_py.dir/src/permute_pooled_embedding_ops/permute_pooled_embedding_ops_gpu.cpp.o.d -o CMakeFiles/fbgemm_gpu_py.dir/src/permute_pooled_embedding_ops/permute_pooled_embedding_ops_gpu.cpp.o -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/src/permute_pooled_embedding_ops/permute_pooled_embedding_ops_gpu.cpp 2025-05-07T20:05:18.4381838Z [495/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_dense_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -gencode arch=compute_90,code=sm_90 -gencode arch=compute_90a,code=sm_90a -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++20 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward_dense.dir/gen_embedding_backward_dense_split_unweighted_nobag_cuda.cu.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward_dense.dir/gen_embedding_backward_dense_split_unweighted_nobag_cuda.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_backward_dense_split_unweighted_nobag_cuda.cu -o CMakeFiles/fbgemm_gpu_tbe_training_backward_dense.dir/gen_embedding_backward_dense_split_unweighted_nobag_cuda.cu.o 2025-05-07T20:05:18.4661080Z [496/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_dense_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -gencode arch=compute_90,code=sm_90 -gencode arch=compute_90a,code=sm_90a -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++20 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward_dense.dir/gen_embedding_backward_dense_split_weighted_cuda.cu.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward_dense.dir/gen_embedding_backward_dense_split_weighted_cuda.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_backward_dense_split_weighted_cuda.cu -o CMakeFiles/fbgemm_gpu_tbe_training_backward_dense.dir/gen_embedding_backward_dense_split_weighted_cuda.cu.o 2025-05-07T20:05:18.5742758Z [497/608] : && /github/home/miniconda/envs/build_binary/bin/x86_64-conda-linux-gnu-c++ -fPIC -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/lib -O3 -DNDEBUG -Wl,-O2 -Wl,--sort-common -Wl,--as-needed -Wl,-z,relro -Wl,-z,now -Wl,--disable-new-dtags -Wl,--gc-sections -Wl,--allow-shlib-undefined -Wl,-rpath,/github/home/miniconda/envs/build_binary/lib -Wl,-rpath-link,/github/home/miniconda/envs/build_binary/lib -L/github/home/miniconda/envs/build_binary/lib -L/github/home/miniconda/envs/build_binary/targets/x86_64-linux/lib -L/github/home/miniconda/envs/build_binary/targets/x86_64-linux/lib/stubs -s -shared -Wl,-soname,fbgemm_gpu_tbe_training_forward.so -o fbgemm_gpu_tbe_training_forward.so CMakeFiles/fbgemm_gpu_tbe_training_forward.dir/gen_embedding_forward_split_pt2_cpu_wrapper.cpp.o CMakeFiles/fbgemm_gpu_tbe_training_forward.dir/gen_embedding_forward_split_weighted_codegen_meta.cpp.o CMakeFiles/fbgemm_gpu_tbe_training_forward.dir/gen_embedding_forward_split_unweighted_codegen_meta.cpp.o CMakeFiles/fbgemm_gpu_tbe_training_forward.dir/gen_embedding_forward_dense_weighted_codegen_meta.cpp.o CMakeFiles/fbgemm_gpu_tbe_training_forward.dir/gen_embedding_forward_dense_unweighted_codegen_meta.cpp.o CMakeFiles/fbgemm_gpu_tbe_training_forward.dir/gen_embedding_forward_ssd_weighted_codegen_meta.cpp.o CMakeFiles/fbgemm_gpu_tbe_training_forward.dir/gen_embedding_forward_ssd_unweighted_codegen_meta.cpp.o CMakeFiles/fbgemm_gpu_tbe_training_forward.dir/gen_embedding_forward_split_weighted_vbe_codegen_meta.cpp.o CMakeFiles/fbgemm_gpu_tbe_training_forward.dir/gen_embedding_forward_ssd_weighted_vbe_codegen_meta.cpp.o CMakeFiles/fbgemm_gpu_tbe_training_forward.dir/gen_embedding_forward_split_unweighted_vbe_codegen_meta.cpp.o CMakeFiles/fbgemm_gpu_tbe_training_forward.dir/gen_embedding_forward_ssd_unweighted_vbe_codegen_meta.cpp.o CMakeFiles/fbgemm_gpu_tbe_training_forward.dir/gen_embedding_forward_split_pt2_cuda_wrapper.cpp.o CMakeFiles/fbgemm_gpu_tbe_training_forward.dir/gen_embedding_forward_ssd_pt2_cuda_wrapper.cpp.o CMakeFiles/fbgemm_gpu_tbe_training_forward.dir/gen_embedding_forward_split_weighted_kernel.cu.o CMakeFiles/fbgemm_gpu_tbe_training_forward.dir/gen_embedding_forward_dense_weighted_kernel.cu.o CMakeFiles/fbgemm_gpu_tbe_training_forward.dir/gen_embedding_forward_ssd_weighted_kernel.cu.o CMakeFiles/fbgemm_gpu_tbe_training_forward.dir/gen_embedding_forward_split_unweighted_nobag_kernel.cu.o CMakeFiles/fbgemm_gpu_tbe_training_forward.dir/gen_embedding_forward_dense_unweighted_nobag_kernel.cu.o CMakeFiles/fbgemm_gpu_tbe_training_forward.dir/gen_embedding_forward_ssd_unweighted_nobag_kernel.cu.o CMakeFiles/fbgemm_gpu_tbe_training_forward.dir/gen_embedding_forward_split_unweighted_kernel.cu.o CMakeFiles/fbgemm_gpu_tbe_training_forward.dir/gen_embedding_forward_dense_unweighted_kernel.cu.o CMakeFiles/fbgemm_gpu_tbe_training_forward.dir/gen_embedding_forward_ssd_unweighted_kernel.cu.o CMakeFiles/fbgemm_gpu_tbe_training_forward.dir/gen_embedding_forward_split_weighted_codegen_cuda.cu.o CMakeFiles/fbgemm_gpu_tbe_training_forward.dir/gen_embedding_forward_split_unweighted_codegen_cuda.cu.o CMakeFiles/fbgemm_gpu_tbe_training_forward.dir/gen_embedding_forward_dense_weighted_codegen_cuda.cu.o CMakeFiles/fbgemm_gpu_tbe_training_forward.dir/gen_embedding_forward_dense_unweighted_codegen_cuda.cu.o CMakeFiles/fbgemm_gpu_tbe_training_forward.dir/gen_embedding_forward_ssd_weighted_codegen_cuda.cu.o CMakeFiles/fbgemm_gpu_tbe_training_forward.dir/gen_embedding_forward_ssd_unweighted_codegen_cuda.cu.o CMakeFiles/fbgemm_gpu_tbe_training_forward.dir/gen_embedding_forward_split_weighted_vbe_codegen_cuda.cu.o CMakeFiles/fbgemm_gpu_tbe_training_forward.dir/gen_embedding_forward_split_weighted_vbe_kernel.cu.o CMakeFiles/fbgemm_gpu_tbe_training_forward.dir/gen_embedding_forward_split_weighted_v2_kernel.cu.o CMakeFiles/fbgemm_gpu_tbe_training_forward.dir/gen_embedding_forward_split_weighted_gwd_codegen_cuda.cu.o CMakeFiles/fbgemm_gpu_tbe_training_forward.dir/gen_embedding_forward_split_weighted_vbe_gwd_codegen_cuda.cu.o CMakeFiles/fbgemm_gpu_tbe_training_forward.dir/gen_embedding_forward_split_weighted_gwd_kernel.cu.o CMakeFiles/fbgemm_gpu_tbe_training_forward.dir/gen_embedding_forward_dense_weighted_vbe_codegen_cuda.cu.o CMakeFiles/fbgemm_gpu_tbe_training_forward.dir/gen_embedding_forward_dense_weighted_vbe_kernel.cu.o CMakeFiles/fbgemm_gpu_tbe_training_forward.dir/gen_embedding_forward_split_weighted_vbe_gwd_kernel.cu.o CMakeFiles/fbgemm_gpu_tbe_training_forward.dir/gen_embedding_forward_ssd_weighted_vbe_codegen_cuda.cu.o CMakeFiles/fbgemm_gpu_tbe_training_forward.dir/gen_embedding_forward_ssd_weighted_vbe_kernel.cu.o CMakeFiles/fbgemm_gpu_tbe_training_forward.dir/gen_embedding_forward_split_unweighted_vbe_codegen_cuda.cu.o CMakeFiles/fbgemm_gpu_tbe_training_forward.dir/gen_embedding_forward_split_unweighted_vbe_kernel.cu.o CMakeFiles/fbgemm_gpu_tbe_training_forward.dir/gen_embedding_forward_split_unweighted_v2_kernel.cu.o CMakeFiles/fbgemm_gpu_tbe_training_forward.dir/gen_embedding_forward_split_unweighted_gwd_codegen_cuda.cu.o CMakeFiles/fbgemm_gpu_tbe_training_forward.dir/gen_embedding_forward_split_unweighted_vbe_gwd_codegen_cuda.cu.o CMakeFiles/fbgemm_gpu_tbe_training_forward.dir/gen_embedding_forward_split_unweighted_gwd_kernel.cu.o CMakeFiles/fbgemm_gpu_tbe_training_forward.dir/gen_embedding_forward_dense_unweighted_vbe_codegen_cuda.cu.o CMakeFiles/fbgemm_gpu_tbe_training_forward.dir/gen_embedding_forward_dense_unweighted_vbe_kernel.cu.o CMakeFiles/fbgemm_gpu_tbe_training_forward.dir/gen_embedding_forward_split_unweighted_vbe_gwd_kernel.cu.o CMakeFiles/fbgemm_gpu_tbe_training_forward.dir/gen_embedding_forward_ssd_unweighted_vbe_codegen_cuda.cu.o CMakeFiles/fbgemm_gpu_tbe_training_forward.dir/gen_embedding_forward_ssd_unweighted_vbe_kernel.cu.o CMakeFiles/fbgemm_gpu_tbe_training_forward.dir/gen_embedding_forward_split_unweighted_nobag_kernel_small.cu.o CMakeFiles/fbgemm_gpu_tbe_training_forward.dir/gen_embedding_forward_dense_unweighted_nobag_kernel_small.cu.o CMakeFiles/fbgemm_gpu_tbe_training_forward.dir/gen_embedding_forward_ssd_unweighted_nobag_kernel_small.cu.o -L/lib/intel64 -L/lib/intel64_win -L/lib/win-x64 -Wl,-rpath,"\$ORIGIN" /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/lib/libc10.so /github/home/miniconda/envs/build_binary/targets/x86_64-linux/lib/libnvrtc.so /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/lib/libc10_cuda.so /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/lib/libnccl.so.2 /github/home/miniconda/envs/build_binary/targets/x86_64-linux/lib/stubs/libcuda.so fbgemm_gpu_tbe_common.so /github/home/miniconda/envs/build_binary/lib/stubs/libnvidia-ml.so -Wl,--no-as-needed,"/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/lib/libtorch.so" -Wl,--as-needed /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/lib/libtorch.so -Wl,--no-as-needed,"/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/lib/libtorch_cpu.so" -Wl,--as-needed -Wl,--no-as-needed,"/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/lib/libtorch_cuda.so" -Wl,--as-needed /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/lib/libc10_cuda.so /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/lib/libc10.so /github/home/miniconda/envs/build_binary/targets/x86_64-linux/lib/libcudart.so -lcudadevrt -lcudart_static -lrt -lpthread -ldl -Wl,-rpath-link,/__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build && : 2025-05-07T20:05:19.0290949Z [498/608] /github/home/miniconda/envs/build_binary/bin/x86_64-conda-linux-gnu-c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_py_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/lib -O3 -DNDEBUG -std=c++20 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -mavx2 -mf16c -mfma -fopenmp -MD -MT CMakeFiles/fbgemm_gpu_py.dir/src/permute_pooled_embedding_ops/permute_pooled_embedding_ops_split_gpu.cpp.o -MF CMakeFiles/fbgemm_gpu_py.dir/src/permute_pooled_embedding_ops/permute_pooled_embedding_ops_split_gpu.cpp.o.d -o CMakeFiles/fbgemm_gpu_py.dir/src/permute_pooled_embedding_ops/permute_pooled_embedding_ops_split_gpu.cpp.o -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/src/permute_pooled_embedding_ops/permute_pooled_embedding_ops_split_gpu.cpp 2025-05-07T20:05:19.1306152Z [499/608] /github/home/miniconda/envs/build_binary/bin/x86_64-conda-linux-gnu-c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_py_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/lib -O3 -DNDEBUG -std=c++20 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -mavx2 -mf16c -mfma -fopenmp -MD -MT CMakeFiles/fbgemm_gpu_py.dir/src/tbe/eeg/indices_generator_ops.cpp.o -MF CMakeFiles/fbgemm_gpu_py.dir/src/tbe/eeg/indices_generator_ops.cpp.o.d -o CMakeFiles/fbgemm_gpu_py.dir/src/tbe/eeg/indices_generator_ops.cpp.o -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/src/tbe/eeg/indices_generator_ops.cpp 2025-05-07T20:05:19.3510227Z [500/608] /github/home/miniconda/envs/build_binary/bin/x86_64-conda-linux-gnu-c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_py_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/lib -O3 -DNDEBUG -std=c++20 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -mavx2 -mf16c -mfma -fopenmp -MD -MT CMakeFiles/fbgemm_gpu_py.dir/src/tbe/eeg/indices_generator.cpp.o -MF CMakeFiles/fbgemm_gpu_py.dir/src/tbe/eeg/indices_generator.cpp.o.d -o CMakeFiles/fbgemm_gpu_py.dir/src/tbe/eeg/indices_generator.cpp.o -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/src/tbe/eeg/indices_generator.cpp 2025-05-07T20:05:20.3593962Z [501/608] /github/home/miniconda/envs/build_binary/bin/x86_64-conda-linux-gnu-c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_py_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/lib -O3 -DNDEBUG -std=c++20 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -mavx2 -mf16c -mfma -fopenmp -MD -MT CMakeFiles/fbgemm_gpu_py.dir/src/intraining_embedding_pruning_ops/intraining_embedding_pruning_gpu.cpp.o -MF CMakeFiles/fbgemm_gpu_py.dir/src/intraining_embedding_pruning_ops/intraining_embedding_pruning_gpu.cpp.o.d -o CMakeFiles/fbgemm_gpu_py.dir/src/intraining_embedding_pruning_ops/intraining_embedding_pruning_gpu.cpp.o -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/src/intraining_embedding_pruning_ops/intraining_embedding_pruning_gpu.cpp 2025-05-07T20:05:21.7241249Z [502/608] /github/home/miniconda/envs/build_binary/bin/x86_64-conda-linux-gnu-c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_py_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/lib -O3 -DNDEBUG -std=c++20 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -mavx2 -mf16c -mfma -fopenmp -MD -MT CMakeFiles/fbgemm_gpu_py.dir/src/sparse_ops/sparse_ops_gpu.cpp.o -MF CMakeFiles/fbgemm_gpu_py.dir/src/sparse_ops/sparse_ops_gpu.cpp.o.d -o CMakeFiles/fbgemm_gpu_py.dir/src/sparse_ops/sparse_ops_gpu.cpp.o -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/src/sparse_ops/sparse_ops_gpu.cpp 2025-05-07T20:05:24.3214411Z [503/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_vbe_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -gencode arch=compute_90,code=sm_90 -gencode arch=compute_90a,code=sm_90a -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++20 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward_vbe.dir/gen_embedding_backward_rowwise_adagrad_with_counter_split_weighted_vbe_kernel_cta.cu.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward_vbe.dir/gen_embedding_backward_rowwise_adagrad_with_counter_split_weighted_vbe_kernel_cta.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_backward_rowwise_adagrad_with_counter_split_weighted_vbe_kernel_cta.cu -o CMakeFiles/fbgemm_gpu_tbe_training_backward_vbe.dir/gen_embedding_backward_rowwise_adagrad_with_counter_split_weighted_vbe_kernel_cta.cu.o 2025-05-07T20:05:26.8707613Z [504/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_dense_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -gencode arch=compute_90,code=sm_90 -gencode arch=compute_90a,code=sm_90a -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++20 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward_dense.dir/gen_embedding_backward_dense_split_unweighted_nobag_kernel_warp.cu.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward_dense.dir/gen_embedding_backward_dense_split_unweighted_nobag_kernel_warp.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_backward_dense_split_unweighted_nobag_kernel_warp.cu -o CMakeFiles/fbgemm_gpu_tbe_training_backward_dense.dir/gen_embedding_backward_dense_split_unweighted_nobag_kernel_warp.cu.o 2025-05-07T20:05:31.7303715Z [505/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_dense_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -gencode arch=compute_90,code=sm_90 -gencode arch=compute_90a,code=sm_90a -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++20 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward_dense.dir/gen_embedding_backward_dense_split_unweighted_kernel_warp.cu.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward_dense.dir/gen_embedding_backward_dense_split_unweighted_kernel_warp.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_backward_dense_split_unweighted_kernel_warp.cu -o CMakeFiles/fbgemm_gpu_tbe_training_backward_dense.dir/gen_embedding_backward_dense_split_unweighted_kernel_warp.cu.o 2025-05-07T20:05:33.0855059Z [506/608] /github/home/miniconda/envs/build_binary/bin/x86_64-conda-linux-gnu-c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_py_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/lib -O3 -DNDEBUG -std=c++20 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -mavx2 -mf16c -mfma -fopenmp -MD -MT CMakeFiles/fbgemm_gpu_py.dir/src/sparse_ops/sparse_ops_cpu.cpp.o -MF CMakeFiles/fbgemm_gpu_py.dir/src/sparse_ops/sparse_ops_cpu.cpp.o.d -o CMakeFiles/fbgemm_gpu_py.dir/src/sparse_ops/sparse_ops_cpu.cpp.o -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/src/sparse_ops/sparse_ops_cpu.cpp 2025-05-07T20:05:35.0445391Z [507/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_vbe_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -gencode arch=compute_90,code=sm_90 -gencode arch=compute_90a,code=sm_90a -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++20 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward_vbe.dir/gen_embedding_backward_rowwise_adagrad_with_counter_split_unweighted_vbe_kernel_cta.cu.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward_vbe.dir/gen_embedding_backward_rowwise_adagrad_with_counter_split_unweighted_vbe_kernel_cta.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_backward_rowwise_adagrad_with_counter_split_unweighted_vbe_kernel_cta.cu -o CMakeFiles/fbgemm_gpu_tbe_training_backward_vbe.dir/gen_embedding_backward_rowwise_adagrad_with_counter_split_unweighted_vbe_kernel_cta.cu.o 2025-05-07T20:05:36.9718951Z [508/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_dense_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -gencode arch=compute_90,code=sm_90 -gencode arch=compute_90a,code=sm_90a -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++20 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward_dense.dir/gen_embedding_backward_dense_split_unweighted_cuda.cu.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward_dense.dir/gen_embedding_backward_dense_split_unweighted_cuda.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_backward_dense_split_unweighted_cuda.cu -o CMakeFiles/fbgemm_gpu_tbe_training_backward_dense.dir/gen_embedding_backward_dense_split_unweighted_cuda.cu.o 2025-05-07T20:05:44.4563642Z [509/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_dense_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -gencode arch=compute_90,code=sm_90 -gencode arch=compute_90a,code=sm_90a -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++20 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward_dense.dir/gen_embedding_backward_dense_split_weighted_kernel_cta.cu.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward_dense.dir/gen_embedding_backward_dense_split_weighted_kernel_cta.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_backward_dense_split_weighted_kernel_cta.cu -o CMakeFiles/fbgemm_gpu_tbe_training_backward_dense.dir/gen_embedding_backward_dense_split_weighted_kernel_cta.cu.o 2025-05-07T20:05:47.2743421Z [510/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_index_select_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -gencode arch=compute_90,code=sm_90 -gencode arch=compute_90a,code=sm_90a -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++20 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_tbe_index_select.dir/gen_embedding_backward_split_grad_index_select.cu.o -MF CMakeFiles/fbgemm_gpu_tbe_index_select.dir/gen_embedding_backward_split_grad_index_select.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_backward_split_grad_index_select.cu -o CMakeFiles/fbgemm_gpu_tbe_index_select.dir/gen_embedding_backward_split_grad_index_select.cu.o 2025-05-07T20:05:49.3491829Z [511/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_dense_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -gencode arch=compute_90,code=sm_90 -gencode arch=compute_90a,code=sm_90a -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++20 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward_dense.dir/gen_embedding_backward_dense_split_unweighted_nobag_kernel_cta.cu.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward_dense.dir/gen_embedding_backward_dense_split_unweighted_nobag_kernel_cta.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_backward_dense_split_unweighted_nobag_kernel_cta.cu -o CMakeFiles/fbgemm_gpu_tbe_training_backward_dense.dir/gen_embedding_backward_dense_split_unweighted_nobag_kernel_cta.cu.o 2025-05-07T20:05:50.9296158Z [512/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_index_select_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -gencode arch=compute_90,code=sm_90 -gencode arch=compute_90a,code=sm_90a -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++20 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_tbe_index_select.dir/gen_batch_index_select_dim0_forward_codegen_cuda.cu.o -MF CMakeFiles/fbgemm_gpu_tbe_index_select.dir/gen_batch_index_select_dim0_forward_codegen_cuda.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_batch_index_select_dim0_forward_codegen_cuda.cu -o CMakeFiles/fbgemm_gpu_tbe_index_select.dir/gen_batch_index_select_dim0_forward_codegen_cuda.cu.o 2025-05-07T20:05:54.8574545Z [513/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_index_select_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -gencode arch=compute_90,code=sm_90 -gencode arch=compute_90a,code=sm_90a -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++20 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_tbe_index_select.dir/gen_batch_index_select_dim0_forward_kernel.cu.o -MF CMakeFiles/fbgemm_gpu_tbe_index_select.dir/gen_batch_index_select_dim0_forward_kernel.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_batch_index_select_dim0_forward_kernel.cu -o CMakeFiles/fbgemm_gpu_tbe_index_select.dir/gen_batch_index_select_dim0_forward_kernel.cu.o 2025-05-07T20:05:55.8409474Z [514/608] /github/home/miniconda/envs/build_binary/bin/x86_64-conda-linux-gnu-c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_py_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/lib -O3 -DNDEBUG -std=c++20 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -mavx2 -mf16c -mfma -fopenmp -MD -MT CMakeFiles/fbgemm_gpu_py.dir/src/jagged_tensor_ops/jagged_tensor_ops_cpu.cpp.o -MF CMakeFiles/fbgemm_gpu_py.dir/src/jagged_tensor_ops/jagged_tensor_ops_cpu.cpp.o.d -o CMakeFiles/fbgemm_gpu_py.dir/src/jagged_tensor_ops/jagged_tensor_ops_cpu.cpp.o -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/src/jagged_tensor_ops/jagged_tensor_ops_cpu.cpp 2025-05-07T20:05:57.8296257Z [515/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_dense_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -gencode arch=compute_90,code=sm_90 -gencode arch=compute_90a,code=sm_90a -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++20 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward_dense.dir/gen_embedding_backward_dense_split_unweighted_kernel_cta.cu.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward_dense.dir/gen_embedding_backward_dense_split_unweighted_kernel_cta.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_backward_dense_split_unweighted_kernel_cta.cu -o CMakeFiles/fbgemm_gpu_tbe_training_backward_dense.dir/gen_embedding_backward_dense_split_unweighted_kernel_cta.cu.o 2025-05-07T20:05:58.0284349Z [516/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_py_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -gencode arch=compute_90,code=sm_90 -gencode arch=compute_90a,code=sm_90a -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++20 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_py.dir/src/input_combine_ops/input_combine.cu.o -MF CMakeFiles/fbgemm_gpu_py.dir/src/input_combine_ops/input_combine.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/src/input_combine_ops/input_combine.cu -o CMakeFiles/fbgemm_gpu_py.dir/src/input_combine_ops/input_combine.cu.o 2025-05-07T20:06:02.9140316Z [517/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_py_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -gencode arch=compute_90,code=sm_90 -gencode arch=compute_90a,code=sm_90a -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++20 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_py.dir/src/memory_utils/memory_utils.cu.o -MF CMakeFiles/fbgemm_gpu_py.dir/src/memory_utils/memory_utils.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/src/memory_utils/memory_utils.cu -o CMakeFiles/fbgemm_gpu_py.dir/src/memory_utils/memory_utils.cu.o 2025-05-07T20:06:08.8435035Z [518/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_index_select_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -gencode arch=compute_90,code=sm_90 -gencode arch=compute_90a,code=sm_90a -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++20 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_tbe_index_select.dir/gen_batch_index_select_dim0_forward_kernel_small.cu.o -MF CMakeFiles/fbgemm_gpu_tbe_index_select.dir/gen_batch_index_select_dim0_forward_kernel_small.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_batch_index_select_dim0_forward_kernel_small.cu -o CMakeFiles/fbgemm_gpu_tbe_index_select.dir/gen_batch_index_select_dim0_forward_kernel_small.cu.o 2025-05-07T20:06:10.0921741Z [519/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_py_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -gencode arch=compute_90,code=sm_90 -gencode arch=compute_90a,code=sm_90a -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++20 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_py.dir/src/memory_utils/memory_utils_ops.cu.o -MF CMakeFiles/fbgemm_gpu_py.dir/src/memory_utils/memory_utils_ops.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/src/memory_utils/memory_utils_ops.cu -o CMakeFiles/fbgemm_gpu_py.dir/src/memory_utils/memory_utils_ops.cu.o 2025-05-07T20:06:18.5020852Z [520/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_index_select_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -gencode arch=compute_90,code=sm_90 -gencode arch=compute_90a,code=sm_90a -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++20 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_tbe_index_select.dir/gen_batch_index_select_dim0_backward_codegen_cuda.cu.o -MF CMakeFiles/fbgemm_gpu_tbe_index_select.dir/gen_batch_index_select_dim0_backward_codegen_cuda.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_batch_index_select_dim0_backward_codegen_cuda.cu -o CMakeFiles/fbgemm_gpu_tbe_index_select.dir/gen_batch_index_select_dim0_backward_codegen_cuda.cu.o 2025-05-07T20:06:20.4625894Z [521/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_py_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -gencode arch=compute_90,code=sm_90 -gencode arch=compute_90a,code=sm_90a -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++20 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_py.dir/src/histogram_binning_calibration_ops.cu.o -MF CMakeFiles/fbgemm_gpu_py.dir/src/histogram_binning_calibration_ops.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/src/histogram_binning_calibration_ops.cu -o CMakeFiles/fbgemm_gpu_py.dir/src/histogram_binning_calibration_ops.cu.o 2025-05-07T20:06:23.2452336Z [522/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -gencode arch=compute_90,code=sm_90 -gencode arch=compute_90a,code=sm_90a -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++20 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_partial_rowwise_adam_split_unweighted_kernel_warp.cu.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_partial_rowwise_adam_split_unweighted_kernel_warp.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_backward_partial_rowwise_adam_split_unweighted_kernel_warp.cu -o CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_partial_rowwise_adam_split_unweighted_kernel_warp.cu.o 2025-05-07T20:06:23.6124401Z [523/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_index_select_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -gencode arch=compute_90,code=sm_90 -gencode arch=compute_90a,code=sm_90a -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++20 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_tbe_index_select.dir/gen_batch_index_select_dim0_backward_kernel_warp.cu.o -MF CMakeFiles/fbgemm_gpu_tbe_index_select.dir/gen_batch_index_select_dim0_backward_kernel_warp.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_batch_index_select_dim0_backward_kernel_warp.cu -o CMakeFiles/fbgemm_gpu_tbe_index_select.dir/gen_batch_index_select_dim0_backward_kernel_warp.cu.o 2025-05-07T20:06:25.8267744Z [524/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_py_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -gencode arch=compute_90,code=sm_90 -gencode arch=compute_90a,code=sm_90a -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++20 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_py.dir/src/permute_pooled_embedding_ops/permute_pooled_embedding_ops_split.cu.o -MF CMakeFiles/fbgemm_gpu_py.dir/src/permute_pooled_embedding_ops/permute_pooled_embedding_ops_split.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/src/permute_pooled_embedding_ops/permute_pooled_embedding_ops_split.cu -o CMakeFiles/fbgemm_gpu_py.dir/src/permute_pooled_embedding_ops/permute_pooled_embedding_ops_split.cu.o 2025-05-07T20:06:28.3271135Z [525/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -gencode arch=compute_90,code=sm_90 -gencode arch=compute_90a,code=sm_90a -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++20 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_partial_rowwise_adam_split_weighted_kernel_warp.cu.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_partial_rowwise_adam_split_weighted_kernel_warp.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_backward_partial_rowwise_adam_split_weighted_kernel_warp.cu -o CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_partial_rowwise_adam_split_weighted_kernel_warp.cu.o 2025-05-07T20:06:32.1702116Z [526/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -gencode arch=compute_90,code=sm_90 -gencode arch=compute_90a,code=sm_90a -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++20 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_partial_rowwise_adam_split_unweighted_nobag_kernel_warp.cu.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_partial_rowwise_adam_split_unweighted_nobag_kernel_warp.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_backward_partial_rowwise_adam_split_unweighted_nobag_kernel_warp.cu -o CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_partial_rowwise_adam_split_unweighted_nobag_kernel_warp.cu.o 2025-05-07T20:06:33.3132090Z [527/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_py_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -gencode arch=compute_90,code=sm_90 -gencode arch=compute_90a,code=sm_90a -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++20 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_py.dir/src/jagged_tensor_ops/batched_dense_vec_jagged_2d_mul_forward.cu.o -MF CMakeFiles/fbgemm_gpu_py.dir/src/jagged_tensor_ops/batched_dense_vec_jagged_2d_mul_forward.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/src/jagged_tensor_ops/batched_dense_vec_jagged_2d_mul_forward.cu -o CMakeFiles/fbgemm_gpu_py.dir/src/jagged_tensor_ops/batched_dense_vec_jagged_2d_mul_forward.cu.o 2025-05-07T20:06:34.5515061Z [528/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_py_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -gencode arch=compute_90,code=sm_90 -gencode arch=compute_90a,code=sm_90a -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++20 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_py.dir/src/jagged_tensor_ops/batched_dense_vec_jagged_2d_mul_backward.cu.o -MF CMakeFiles/fbgemm_gpu_py.dir/src/jagged_tensor_ops/batched_dense_vec_jagged_2d_mul_backward.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/src/jagged_tensor_ops/batched_dense_vec_jagged_2d_mul_backward.cu -o CMakeFiles/fbgemm_gpu_py.dir/src/jagged_tensor_ops/batched_dense_vec_jagged_2d_mul_backward.cu.o 2025-05-07T20:06:35.6114876Z [529/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_py_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -gencode arch=compute_90,code=sm_90 -gencode arch=compute_90a,code=sm_90a -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++20 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_py.dir/src/metric_ops/metric_ops.cu.o -MF CMakeFiles/fbgemm_gpu_py.dir/src/metric_ops/metric_ops.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/src/metric_ops/metric_ops.cu -o CMakeFiles/fbgemm_gpu_py.dir/src/metric_ops/metric_ops.cu.o 2025-05-07T20:06:36.1382406Z [530/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_py_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -gencode arch=compute_90,code=sm_90 -gencode arch=compute_90a,code=sm_90a -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++20 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_py.dir/src/jagged_tensor_ops/jagged_dense_bmm_forward.cu.o -MF CMakeFiles/fbgemm_gpu_py.dir/src/jagged_tensor_ops/jagged_dense_bmm_forward.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/src/jagged_tensor_ops/jagged_dense_bmm_forward.cu -o CMakeFiles/fbgemm_gpu_py.dir/src/jagged_tensor_ops/jagged_dense_bmm_forward.cu.o 2025-05-07T20:06:37.3642711Z [531/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_py_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -gencode arch=compute_90,code=sm_90 -gencode arch=compute_90a,code=sm_90a -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++20 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_py.dir/src/layout_transform_ops/layout_transform_ops.cu.o -MF CMakeFiles/fbgemm_gpu_py.dir/src/layout_transform_ops/layout_transform_ops.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/src/layout_transform_ops/layout_transform_ops.cu -o CMakeFiles/fbgemm_gpu_py.dir/src/layout_transform_ops/layout_transform_ops.cu.o 2025-05-07T20:06:38.5535042Z [532/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_py_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -gencode arch=compute_90,code=sm_90 -gencode arch=compute_90a,code=sm_90a -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++20 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_py.dir/src/jagged_tensor_ops/jagged_tensor_ops.cu.o -MF CMakeFiles/fbgemm_gpu_py.dir/src/jagged_tensor_ops/jagged_tensor_ops.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/src/jagged_tensor_ops/jagged_tensor_ops.cu -o CMakeFiles/fbgemm_gpu_py.dir/src/jagged_tensor_ops/jagged_tensor_ops.cu.o 2025-05-07T20:06:38.6294831Z [533/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_py_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -gencode arch=compute_90,code=sm_90 -gencode arch=compute_90a,code=sm_90a -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++20 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_py.dir/src/jagged_tensor_ops/jagged_index_select_2d_forward.cu.o -MF CMakeFiles/fbgemm_gpu_py.dir/src/jagged_tensor_ops/jagged_index_select_2d_forward.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/src/jagged_tensor_ops/jagged_index_select_2d_forward.cu -o CMakeFiles/fbgemm_gpu_py.dir/src/jagged_tensor_ops/jagged_index_select_2d_forward.cu.o 2025-05-07T20:06:39.5959954Z [534/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_py_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -gencode arch=compute_90,code=sm_90 -gencode arch=compute_90a,code=sm_90a -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++20 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_py.dir/src/jagged_tensor_ops/jagged_index_add_2d_forward.cu.o -MF CMakeFiles/fbgemm_gpu_py.dir/src/jagged_tensor_ops/jagged_index_add_2d_forward.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/src/jagged_tensor_ops/jagged_index_add_2d_forward.cu -o CMakeFiles/fbgemm_gpu_py.dir/src/jagged_tensor_ops/jagged_index_add_2d_forward.cu.o 2025-05-07T20:06:39.6037818Z [535/608] cd /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build && bash /__w/FBGEMM/FBGEMM/fbgemm_gpu/../.github/scripts/fbgemm_gpu_postbuild.bash /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/asmjit.so 2025-05-07T20:06:39.6040105Z ################################################################################ 2025-05-07T20:06:39.6040736Z [CMAKE] Running post-build script ... 2025-05-07T20:06:39.6041990Z Target file: /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/asmjit.so 2025-05-07T20:06:39.6042934Z Removing all RPATHs ... 2025-05-07T20:06:39.6043435Z ################################################################################ 2025-05-07T20:06:39.6241340Z [536/608] cd /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build && bash /__w/FBGEMM/FBGEMM/fbgemm_gpu/../.github/scripts/fbgemm_gpu_postbuild.bash /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/fbgemm.so 1 2025-05-07T20:06:39.6243499Z ################################################################################ 2025-05-07T20:06:39.6244141Z [CMAKE] Running post-build script ... 2025-05-07T20:06:39.6245112Z Target file: /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/fbgemm.so 2025-05-07T20:06:39.6246084Z Resetting RPATH to $ORIGIN ... 2025-05-07T20:06:39.6246985Z 0x000000000000000f (RPATH) Library rpath: [$ORIGIN] 2025-05-07T20:06:39.6247728Z ################################################################################ 2025-05-07T20:06:39.6944047Z [537/608] cd /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build && bash /__w/FBGEMM/FBGEMM/fbgemm_gpu/../.github/scripts/fbgemm_gpu_postbuild.bash /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/fbgemm_gpu_tbe_cache.so 2025-05-07T20:06:39.6946377Z ################################################################################ 2025-05-07T20:06:39.6946997Z [CMAKE] Running post-build script ... 2025-05-07T20:06:39.6948052Z Target file: /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/fbgemm_gpu_tbe_cache.so 2025-05-07T20:06:39.6949110Z Removing all RPATHs ... 2025-05-07T20:06:39.6949866Z ################################################################################ 2025-05-07T20:06:40.5569931Z [538/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_py_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -gencode arch=compute_90,code=sm_90 -gencode arch=compute_90a,code=sm_90a -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++20 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_py.dir/src/jagged_tensor_ops/jagged_softmax_backward.cu.o -MF CMakeFiles/fbgemm_gpu_py.dir/src/jagged_tensor_ops/jagged_softmax_backward.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/src/jagged_tensor_ops/jagged_softmax_backward.cu -o CMakeFiles/fbgemm_gpu_py.dir/src/jagged_tensor_ops/jagged_softmax_backward.cu.o 2025-05-07T20:06:40.5629851Z [539/608] cd /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build && bash /__w/FBGEMM/FBGEMM/fbgemm_gpu/../.github/scripts/fbgemm_gpu_postbuild.bash /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/fbgemm_gpu_config.so 2025-05-07T20:06:40.5632007Z ################################################################################ 2025-05-07T20:06:40.5632666Z [CMAKE] Running post-build script ... 2025-05-07T20:06:40.5633631Z Target file: /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/fbgemm_gpu_config.so 2025-05-07T20:06:40.5634604Z Removing all RPATHs ... 2025-05-07T20:06:40.5635306Z ################################################################################ 2025-05-07T20:06:40.8873654Z [540/608] cd /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build && bash /__w/FBGEMM/FBGEMM/fbgemm_gpu/../.github/scripts/fbgemm_gpu_postbuild.bash /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/fbgemm_gpu_tbe_utils.so 2025-05-07T20:06:40.8876024Z ################################################################################ 2025-05-07T20:06:40.8876622Z [CMAKE] Running post-build script ... 2025-05-07T20:06:40.8877611Z Target file: /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/fbgemm_gpu_tbe_utils.so 2025-05-07T20:06:40.8878623Z Removing all RPATHs ... 2025-05-07T20:06:40.8879089Z ################################################################################ 2025-05-07T20:06:40.9210253Z [541/608] cd /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build && bash /__w/FBGEMM/FBGEMM/fbgemm_gpu/../.github/scripts/fbgemm_gpu_postbuild.bash /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/fbgemm_gpu_sparse_async_cumsum.so 1 2025-05-07T20:06:40.9212674Z ################################################################################ 2025-05-07T20:06:40.9213277Z [CMAKE] Running post-build script ... 2025-05-07T20:06:40.9214285Z Target file: /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/fbgemm_gpu_sparse_async_cumsum.so 2025-05-07T20:06:40.9215319Z Resetting RPATH to $ORIGIN ... 2025-05-07T20:06:40.9215981Z 0x000000000000000f (RPATH) Library rpath: [$ORIGIN] 2025-05-07T20:06:40.9216690Z ################################################################################ 2025-05-07T20:06:40.9789902Z [542/608] cd /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build && bash /__w/FBGEMM/FBGEMM/fbgemm_gpu/../.github/scripts/fbgemm_gpu_postbuild.bash /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/fbgemm_gpu_tbe_common.so 1 2025-05-07T20:06:40.9792208Z ################################################################################ 2025-05-07T20:06:40.9792805Z [CMAKE] Running post-build script ... 2025-05-07T20:06:40.9793796Z Target file: /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/fbgemm_gpu_tbe_common.so 2025-05-07T20:06:40.9794743Z Resetting RPATH to $ORIGIN ... 2025-05-07T20:06:40.9795405Z 0x000000000000000f (RPATH) Library rpath: [$ORIGIN] 2025-05-07T20:06:40.9796126Z ################################################################################ 2025-05-07T20:06:41.2235945Z [543/608] cd /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build && bash /__w/FBGEMM/FBGEMM/fbgemm_gpu/../.github/scripts/fbgemm_gpu_postbuild.bash /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/fbgemm_gpu_tbe_optimizers.so 2025-05-07T20:06:41.2238309Z ################################################################################ 2025-05-07T20:06:41.2238921Z [CMAKE] Running post-build script ... 2025-05-07T20:06:41.2240305Z Target file: /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/fbgemm_gpu_tbe_optimizers.so 2025-05-07T20:06:41.2241326Z Removing all RPATHs ... 2025-05-07T20:06:41.2241824Z ################################################################################ 2025-05-07T20:06:41.4699419Z [544/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_py_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -gencode arch=compute_90,code=sm_90 -gencode arch=compute_90a,code=sm_90a -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++20 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_py.dir/src/jagged_tensor_ops/jagged_jagged_bmm_forward.cu.o -MF CMakeFiles/fbgemm_gpu_py.dir/src/jagged_tensor_ops/jagged_jagged_bmm_forward.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/src/jagged_tensor_ops/jagged_jagged_bmm_forward.cu -o CMakeFiles/fbgemm_gpu_py.dir/src/jagged_tensor_ops/jagged_jagged_bmm_forward.cu.o 2025-05-07T20:06:41.5083936Z [545/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_py_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -gencode arch=compute_90,code=sm_90 -gencode arch=compute_90a,code=sm_90a -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++20 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_py.dir/src/jagged_tensor_ops/jagged_unique_indices.cu.o -MF CMakeFiles/fbgemm_gpu_py.dir/src/jagged_tensor_ops/jagged_unique_indices.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/src/jagged_tensor_ops/jagged_unique_indices.cu -o CMakeFiles/fbgemm_gpu_py.dir/src/jagged_tensor_ops/jagged_unique_indices.cu.o 2025-05-07T20:06:41.6375561Z [546/608] cd /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build && bash /__w/FBGEMM/FBGEMM/fbgemm_gpu/../.github/scripts/fbgemm_gpu_postbuild.bash /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/fbgemm_gpu_tbe_training_backward_split_host.so 1 2025-05-07T20:06:41.6378665Z ################################################################################ 2025-05-07T20:06:41.6379514Z [CMAKE] Running post-build script ... 2025-05-07T20:06:41.6381000Z Target file: /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/fbgemm_gpu_tbe_training_backward_split_host.so 2025-05-07T20:06:41.6382598Z Resetting RPATH to $ORIGIN ... 2025-05-07T20:06:41.6383714Z 0x000000000000000f (RPATH) Library rpath: [$ORIGIN] 2025-05-07T20:06:41.6384706Z ################################################################################ 2025-05-07T20:06:41.6699465Z [547/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_py_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -gencode arch=compute_90,code=sm_90 -gencode arch=compute_90a,code=sm_90a -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++20 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_py.dir/src/jagged_tensor_ops/jagged_softmax_forward.cu.o -MF CMakeFiles/fbgemm_gpu_py.dir/src/jagged_tensor_ops/jagged_softmax_forward.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/src/jagged_tensor_ops/jagged_softmax_forward.cu -o CMakeFiles/fbgemm_gpu_py.dir/src/jagged_tensor_ops/jagged_softmax_forward.cu.o 2025-05-07T20:06:41.6969345Z [548/608] cd /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build && bash /__w/FBGEMM/FBGEMM/fbgemm_gpu/../.github/scripts/fbgemm_gpu_postbuild.bash /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/fbgemm_gpu_tbe_training_backward_pt2.so 1 2025-05-07T20:06:41.6971296Z ################################################################################ 2025-05-07T20:06:41.6971901Z [CMAKE] Running post-build script ... 2025-05-07T20:06:41.6972769Z Target file: /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/fbgemm_gpu_tbe_training_backward_pt2.so 2025-05-07T20:06:41.6973907Z Resetting RPATH to $ORIGIN ... 2025-05-07T20:06:41.6974439Z 0x000000000000000f (RPATH) Library rpath: [$ORIGIN] 2025-05-07T20:06:41.6975019Z ################################################################################ 2025-05-07T20:06:43.5426998Z [549/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_py_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -gencode arch=compute_90,code=sm_90 -gencode arch=compute_90a,code=sm_90a -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++20 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_py.dir/src/permute_pooled_embedding_ops/permute_pooled_embedding_ops.cu.o -MF CMakeFiles/fbgemm_gpu_py.dir/src/permute_pooled_embedding_ops/permute_pooled_embedding_ops.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/src/permute_pooled_embedding_ops/permute_pooled_embedding_ops.cu -o CMakeFiles/fbgemm_gpu_py.dir/src/permute_pooled_embedding_ops/permute_pooled_embedding_ops.cu.o 2025-05-07T20:06:44.9719595Z [550/608] cd /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build && bash /__w/FBGEMM/FBGEMM/fbgemm_gpu/../.github/scripts/fbgemm_gpu_postbuild.bash /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/fbgemm_gpu_tbe_inference.so 1 2025-05-07T20:06:44.9722242Z ################################################################################ 2025-05-07T20:06:44.9722860Z [CMAKE] Running post-build script ... 2025-05-07T20:06:44.9723858Z Target file: /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/fbgemm_gpu_tbe_inference.so 2025-05-07T20:06:44.9724887Z Resetting RPATH to $ORIGIN ... 2025-05-07T20:06:44.9725516Z 0x000000000000000f (RPATH) Library rpath: [$ORIGIN] 2025-05-07T20:06:44.9726236Z ################################################################################ 2025-05-07T20:06:45.6597305Z [551/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_py_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -gencode arch=compute_90,code=sm_90 -gencode arch=compute_90a,code=sm_90a -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++20 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_py.dir/src/jagged_tensor_ops/jagged_to_padded_dense_backward.cu.o -MF CMakeFiles/fbgemm_gpu_py.dir/src/jagged_tensor_ops/jagged_to_padded_dense_backward.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/src/jagged_tensor_ops/jagged_to_padded_dense_backward.cu -o CMakeFiles/fbgemm_gpu_py.dir/src/jagged_tensor_ops/jagged_to_padded_dense_backward.cu.o 2025-05-07T20:06:45.7396855Z [552/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_py_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -gencode arch=compute_90,code=sm_90 -gencode arch=compute_90a,code=sm_90a -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++20 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_py.dir/src/jagged_tensor_ops/dense_to_jagged_forward.cu.o -MF CMakeFiles/fbgemm_gpu_py.dir/src/jagged_tensor_ops/dense_to_jagged_forward.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/src/jagged_tensor_ops/dense_to_jagged_forward.cu -o CMakeFiles/fbgemm_gpu_py.dir/src/jagged_tensor_ops/dense_to_jagged_forward.cu.o 2025-05-07T20:06:46.3800269Z [553/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_py_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -gencode arch=compute_90,code=sm_90 -gencode arch=compute_90a,code=sm_90a -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++20 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_py.dir/src/jagged_tensor_ops/jagged_dense_elementwise_mul_forward.cu.o -MF CMakeFiles/fbgemm_gpu_py.dir/src/jagged_tensor_ops/jagged_dense_elementwise_mul_forward.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/src/jagged_tensor_ops/jagged_dense_elementwise_mul_forward.cu -o CMakeFiles/fbgemm_gpu_py.dir/src/jagged_tensor_ops/jagged_dense_elementwise_mul_forward.cu.o 2025-05-07T20:06:47.0436262Z [554/608] cd /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build && bash /__w/FBGEMM/FBGEMM/fbgemm_gpu/../.github/scripts/fbgemm_gpu_postbuild.bash /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/fbgemm_gpu_tbe_training_forward.so 1 2025-05-07T20:06:47.0438262Z ################################################################################ 2025-05-07T20:06:47.0438773Z [CMAKE] Running post-build script ... 2025-05-07T20:06:47.0439660Z Target file: /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/fbgemm_gpu_tbe_training_forward.so 2025-05-07T20:06:47.0440557Z Resetting RPATH to $ORIGIN ... 2025-05-07T20:06:47.0441070Z 0x000000000000000f (RPATH) Library rpath: [$ORIGIN] 2025-05-07T20:06:47.0441663Z ################################################################################ 2025-05-07T20:06:47.3648970Z [555/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_py_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -gencode arch=compute_90,code=sm_90 -gencode arch=compute_90a,code=sm_90a -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++20 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_py.dir/src/jagged_tensor_ops/jagged_dense_elementwise_mul_backward.cu.o -MF CMakeFiles/fbgemm_gpu_py.dir/src/jagged_tensor_ops/jagged_dense_elementwise_mul_backward.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/src/jagged_tensor_ops/jagged_dense_elementwise_mul_backward.cu -o CMakeFiles/fbgemm_gpu_py.dir/src/jagged_tensor_ops/jagged_dense_elementwise_mul_backward.cu.o 2025-05-07T20:06:47.6947797Z [556/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_index_select_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -gencode arch=compute_90,code=sm_90 -gencode arch=compute_90a,code=sm_90a -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++20 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_tbe_index_select.dir/gen_batch_index_select_dim0_backward_kernel_cta.cu.o -MF CMakeFiles/fbgemm_gpu_tbe_index_select.dir/gen_batch_index_select_dim0_backward_kernel_cta.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_batch_index_select_dim0_backward_kernel_cta.cu -o CMakeFiles/fbgemm_gpu_tbe_index_select.dir/gen_batch_index_select_dim0_backward_kernel_cta.cu.o 2025-05-07T20:06:47.9974656Z [557/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_embedding_inplace_ops_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -gencode arch=compute_90,code=sm_90 -gencode arch=compute_90a,code=sm_90a -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++20 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_embedding_inplace_ops.dir/src/embedding_inplace_ops/embedding_inplace_update.cu.o -MF CMakeFiles/fbgemm_gpu_embedding_inplace_ops.dir/src/embedding_inplace_ops/embedding_inplace_update.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/src/embedding_inplace_ops/embedding_inplace_update.cu -o CMakeFiles/fbgemm_gpu_embedding_inplace_ops.dir/src/embedding_inplace_ops/embedding_inplace_update.cu.o 2025-05-07T20:06:48.3435619Z [558/608] : && /github/home/miniconda/envs/build_binary/bin/x86_64-conda-linux-gnu-c++ -fPIC -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/lib -O3 -DNDEBUG -Wl,-O2 -Wl,--sort-common -Wl,--as-needed -Wl,-z,relro -Wl,-z,now -Wl,--disable-new-dtags -Wl,--gc-sections -Wl,--allow-shlib-undefined -Wl,-rpath,/github/home/miniconda/envs/build_binary/lib -Wl,-rpath-link,/github/home/miniconda/envs/build_binary/lib -L/github/home/miniconda/envs/build_binary/lib -L/github/home/miniconda/envs/build_binary/targets/x86_64-linux/lib -L/github/home/miniconda/envs/build_binary/targets/x86_64-linux/lib/stubs -s -shared -Wl,-soname,fbgemm_gpu_tbe_index_select.so -o fbgemm_gpu_tbe_index_select.so CMakeFiles/fbgemm_gpu_tbe_index_select.dir/codegen/training/index_select/batch_index_select_dim0_cpu_host.cpp.o CMakeFiles/fbgemm_gpu_tbe_index_select.dir/codegen/training/index_select/batch_index_select_dim0_ops.cpp.o CMakeFiles/fbgemm_gpu_tbe_index_select.dir/codegen/training/index_select/batch_index_select_dim0_host.cpp.o CMakeFiles/fbgemm_gpu_tbe_index_select.dir/gen_batch_index_select_dim0_forward_codegen_cuda.cu.o CMakeFiles/fbgemm_gpu_tbe_index_select.dir/gen_batch_index_select_dim0_forward_kernel.cu.o CMakeFiles/fbgemm_gpu_tbe_index_select.dir/gen_batch_index_select_dim0_forward_kernel_small.cu.o CMakeFiles/fbgemm_gpu_tbe_index_select.dir/gen_batch_index_select_dim0_backward_codegen_cuda.cu.o CMakeFiles/fbgemm_gpu_tbe_index_select.dir/gen_batch_index_select_dim0_backward_kernel_cta.cu.o CMakeFiles/fbgemm_gpu_tbe_index_select.dir/gen_batch_index_select_dim0_backward_kernel_warp.cu.o CMakeFiles/fbgemm_gpu_tbe_index_select.dir/gen_embedding_backward_split_grad_index_select.cu.o -L/lib/intel64 -L/lib/intel64_win -L/lib/win-x64 -Wl,-rpath,"\$ORIGIN" /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/lib/libc10.so /github/home/miniconda/envs/build_binary/targets/x86_64-linux/lib/libnvrtc.so /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/lib/libc10_cuda.so /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/lib/libnccl.so.2 /github/home/miniconda/envs/build_binary/targets/x86_64-linux/lib/stubs/libcuda.so fbgemm_gpu_sparse_async_cumsum.so fbgemm_gpu_tbe_utils.so /github/home/miniconda/envs/build_binary/lib/stubs/libnvidia-ml.so -Wl,--no-as-needed,"/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/lib/libtorch.so" -Wl,--as-needed /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/lib/libtorch.so -Wl,--no-as-needed,"/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/lib/libtorch_cpu.so" -Wl,--as-needed -Wl,--no-as-needed,"/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/lib/libtorch_cuda.so" -Wl,--as-needed /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/lib/libc10_cuda.so /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/lib/libc10.so /github/home/miniconda/envs/build_binary/targets/x86_64-linux/lib/libcudart.so -lcudadevrt -lcudart_static -lrt -lpthread -ldl && : 2025-05-07T20:06:48.5861042Z [559/608] : && /github/home/miniconda/envs/build_binary/bin/x86_64-conda-linux-gnu-c++ -fPIC -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/lib -O3 -DNDEBUG -Wl,-O2 -Wl,--sort-common -Wl,--as-needed -Wl,-z,relro -Wl,-z,now -Wl,--disable-new-dtags -Wl,--gc-sections -Wl,--allow-shlib-undefined -Wl,-rpath,/github/home/miniconda/envs/build_binary/lib -Wl,-rpath-link,/github/home/miniconda/envs/build_binary/lib -L/github/home/miniconda/envs/build_binary/lib -L/github/home/miniconda/envs/build_binary/targets/x86_64-linux/lib -L/github/home/miniconda/envs/build_binary/targets/x86_64-linux/lib/stubs -s -shared -Wl,-soname,fbgemm_gpu_embedding_inplace_ops.so -o fbgemm_gpu_embedding_inplace_ops.so CMakeFiles/fbgemm_gpu_embedding_inplace_ops.dir/src/embedding_inplace_ops/embedding_inplace_update_cpu.cpp.o CMakeFiles/fbgemm_gpu_embedding_inplace_ops.dir/src/embedding_inplace_ops/embedding_inplace_update_gpu.cpp.o CMakeFiles/fbgemm_gpu_embedding_inplace_ops.dir/src/embedding_inplace_ops/embedding_inplace_update.cu.o -L/lib/intel64 -L/lib/intel64_win -L/lib/win-x64 -Wl,-rpath,/lib/intel64:/lib/intel64_win:/lib/win-x64:/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/lib:/github/home/miniconda/envs/build_binary/lib/stubs: /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/lib/libtorch.so /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/lib/libc10.so /github/home/miniconda/envs/build_binary/targets/x86_64-linux/lib/libnvrtc.so /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/lib/libc10_cuda.so /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/lib/libnccl.so.2 /github/home/miniconda/envs/build_binary/targets/x86_64-linux/lib/stubs/libcuda.so /github/home/miniconda/envs/build_binary/lib/stubs/libnvidia-ml.so -Wl,--no-as-needed,"/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/lib/libtorch_cpu.so" -Wl,--as-needed -Wl,--no-as-needed,"/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/lib/libtorch_cuda.so" -Wl,--as-needed /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/lib/libc10_cuda.so /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/lib/libc10.so /github/home/miniconda/envs/build_binary/targets/x86_64-linux/lib/libcudart.so -Wl,--no-as-needed,"/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/lib/libtorch.so" -Wl,--as-needed -lcudadevrt -lcudart_static -lrt -lpthread -ldl && : 2025-05-07T20:06:48.6173305Z [560/608] cd /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build && bash /__w/FBGEMM/FBGEMM/fbgemm_gpu/../.github/scripts/fbgemm_gpu_postbuild.bash /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/fbgemm_gpu_embedding_inplace_ops.so 2025-05-07T20:06:48.6175740Z ################################################################################ 2025-05-07T20:06:48.6176350Z [CMAKE] Running post-build script ... 2025-05-07T20:06:48.6177437Z Target file: /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/fbgemm_gpu_embedding_inplace_ops.so 2025-05-07T20:06:48.6178514Z Removing all RPATHs ... 2025-05-07T20:06:48.6179010Z ################################################################################ 2025-05-07T20:06:48.8257586Z [561/608] cd /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build && bash /__w/FBGEMM/FBGEMM/fbgemm_gpu/../.github/scripts/fbgemm_gpu_postbuild.bash /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/fbgemm_gpu_tbe_index_select.so 1 2025-05-07T20:06:48.8259957Z ################################################################################ 2025-05-07T20:06:48.8260545Z [CMAKE] Running post-build script ... 2025-05-07T20:06:48.8261458Z Target file: /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/fbgemm_gpu_tbe_index_select.so 2025-05-07T20:06:48.8262462Z Resetting RPATH to $ORIGIN ... 2025-05-07T20:06:48.8263049Z 0x000000000000000f (RPATH) Library rpath: [$ORIGIN] 2025-05-07T20:06:48.8263743Z ################################################################################ 2025-05-07T20:06:49.9134241Z [562/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_py_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -gencode arch=compute_90,code=sm_90 -gencode arch=compute_90a,code=sm_90a -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++20 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_py.dir/src/jagged_tensor_ops/jagged_dense_dense_elementwise_add_jagged_output_forward.cu.o -MF CMakeFiles/fbgemm_gpu_py.dir/src/jagged_tensor_ops/jagged_dense_dense_elementwise_add_jagged_output_forward.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/src/jagged_tensor_ops/jagged_dense_dense_elementwise_add_jagged_output_forward.cu -o CMakeFiles/fbgemm_gpu_py.dir/src/jagged_tensor_ops/jagged_dense_dense_elementwise_add_jagged_output_forward.cu.o 2025-05-07T20:06:51.6238316Z [563/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_py_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -gencode arch=compute_90,code=sm_90 -gencode arch=compute_90a,code=sm_90a -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++20 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_py.dir/src/permute_multi_embedding_ops/permute_multi_embedding_ops.cu.o -MF CMakeFiles/fbgemm_gpu_py.dir/src/permute_multi_embedding_ops/permute_multi_embedding_ops.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/src/permute_multi_embedding_ops/permute_multi_embedding_ops.cu -o CMakeFiles/fbgemm_gpu_py.dir/src/permute_multi_embedding_ops/permute_multi_embedding_ops.cu.o 2025-05-07T20:06:51.8548422Z [564/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_py_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -gencode arch=compute_90,code=sm_90 -gencode arch=compute_90a,code=sm_90a -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++20 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_py.dir/src/quantize_ops/quantize_fp8_rowwise.cu.o -MF CMakeFiles/fbgemm_gpu_py.dir/src/quantize_ops/quantize_fp8_rowwise.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/src/quantize_ops/quantize_fp8_rowwise.cu -o CMakeFiles/fbgemm_gpu_py.dir/src/quantize_ops/quantize_fp8_rowwise.cu.o 2025-05-07T20:06:55.3620776Z [565/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_py_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -gencode arch=compute_90,code=sm_90 -gencode arch=compute_90a,code=sm_90a -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++20 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_py.dir/src/quantize_ops/quantize_bfloat16.cu.o -MF CMakeFiles/fbgemm_gpu_py.dir/src/quantize_ops/quantize_bfloat16.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/src/quantize_ops/quantize_bfloat16.cu -o CMakeFiles/fbgemm_gpu_py.dir/src/quantize_ops/quantize_bfloat16.cu.o 2025-05-07T20:06:55.4581177Z [566/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_py_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -gencode arch=compute_90,code=sm_90 -gencode arch=compute_90a,code=sm_90a -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++20 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_py.dir/src/quantize_ops/quantize_fused_8bit_rowwise.cu.o -MF CMakeFiles/fbgemm_gpu_py.dir/src/quantize_ops/quantize_fused_8bit_rowwise.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/src/quantize_ops/quantize_fused_8bit_rowwise.cu -o CMakeFiles/fbgemm_gpu_py.dir/src/quantize_ops/quantize_fused_8bit_rowwise.cu.o 2025-05-07T20:07:01.7378275Z [567/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_py_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -gencode arch=compute_90,code=sm_90 -gencode arch=compute_90a,code=sm_90a -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++20 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_py.dir/src/quantize_ops/quantize_mx.cu.o -MF CMakeFiles/fbgemm_gpu_py.dir/src/quantize_ops/quantize_mx.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/src/quantize_ops/quantize_mx.cu -o CMakeFiles/fbgemm_gpu_py.dir/src/quantize_ops/quantize_mx.cu.o 2025-05-07T20:07:02.2255103Z [568/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_py_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -gencode arch=compute_90,code=sm_90 -gencode arch=compute_90a,code=sm_90a -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++20 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_py.dir/src/quantize_ops/quantize_fused_nbit_rowwise.cu.o -MF CMakeFiles/fbgemm_gpu_py.dir/src/quantize_ops/quantize_fused_nbit_rowwise.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/src/quantize_ops/quantize_fused_nbit_rowwise.cu -o CMakeFiles/fbgemm_gpu_py.dir/src/quantize_ops/quantize_fused_nbit_rowwise.cu.o 2025-05-07T20:07:07.0747737Z [569/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_py_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -gencode arch=compute_90,code=sm_90 -gencode arch=compute_90a,code=sm_90a -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++20 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_py.dir/src/quantize_ops/quantize_padded_fp8_rowwise.cu.o -MF CMakeFiles/fbgemm_gpu_py.dir/src/quantize_ops/quantize_padded_fp8_rowwise.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/src/quantize_ops/quantize_padded_fp8_rowwise.cu -o CMakeFiles/fbgemm_gpu_py.dir/src/quantize_ops/quantize_padded_fp8_rowwise.cu.o 2025-05-07T20:07:08.2836630Z [570/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_py_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -gencode arch=compute_90,code=sm_90 -gencode arch=compute_90a,code=sm_90a -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++20 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_py.dir/src/jagged_tensor_ops/keyed_jagged_index_select_dim1.cu.o -MF CMakeFiles/fbgemm_gpu_py.dir/src/jagged_tensor_ops/keyed_jagged_index_select_dim1.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/src/jagged_tensor_ops/keyed_jagged_index_select_dim1.cu -o CMakeFiles/fbgemm_gpu_py.dir/src/jagged_tensor_ops/keyed_jagged_index_select_dim1.cu.o 2025-05-07T20:07:09.9169323Z [571/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_py_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -gencode arch=compute_90,code=sm_90 -gencode arch=compute_90a,code=sm_90a -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++20 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_py.dir/src/quantize_ops/quantize_msfp.cu.o -MF CMakeFiles/fbgemm_gpu_py.dir/src/quantize_ops/quantize_msfp.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/src/quantize_ops/quantize_msfp.cu -o CMakeFiles/fbgemm_gpu_py.dir/src/quantize_ops/quantize_msfp.cu.o 2025-05-07T20:07:10.7935669Z [572/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_py_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -gencode arch=compute_90,code=sm_90 -gencode arch=compute_90a,code=sm_90a -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++20 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_py.dir/src/quantize_ops/quantize_hfp8.cu.o -MF CMakeFiles/fbgemm_gpu_py.dir/src/quantize_ops/quantize_hfp8.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/src/quantize_ops/quantize_hfp8.cu -o CMakeFiles/fbgemm_gpu_py.dir/src/quantize_ops/quantize_hfp8.cu.o 2025-05-07T20:07:15.8449675Z [573/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_py_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -gencode arch=compute_90,code=sm_90 -gencode arch=compute_90a,code=sm_90a -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++20 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_py.dir/src/jagged_tensor_ops/jagged_to_padded_dense_forward.cu.o -MF CMakeFiles/fbgemm_gpu_py.dir/src/jagged_tensor_ops/jagged_to_padded_dense_forward.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/src/jagged_tensor_ops/jagged_to_padded_dense_forward.cu -o CMakeFiles/fbgemm_gpu_py.dir/src/jagged_tensor_ops/jagged_to_padded_dense_forward.cu.o 2025-05-07T20:07:19.4656623Z [574/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_py_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -gencode arch=compute_90,code=sm_90 -gencode arch=compute_90a,code=sm_90a -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++20 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_py.dir/src/sparse_ops/sparse_bucketize_features.cu.o -MF CMakeFiles/fbgemm_gpu_py.dir/src/sparse_ops/sparse_bucketize_features.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/src/sparse_ops/sparse_bucketize_features.cu -o CMakeFiles/fbgemm_gpu_py.dir/src/sparse_ops/sparse_bucketize_features.cu.o 2025-05-07T20:07:22.6363767Z [575/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_py_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -gencode arch=compute_90,code=sm_90 -gencode arch=compute_90a,code=sm_90a -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++20 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_py.dir/src/sparse_ops/sparse_batched_unary_embeddings.cu.o -MF CMakeFiles/fbgemm_gpu_py.dir/src/sparse_ops/sparse_batched_unary_embeddings.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/src/sparse_ops/sparse_batched_unary_embeddings.cu -o CMakeFiles/fbgemm_gpu_py.dir/src/sparse_ops/sparse_batched_unary_embeddings.cu.o 2025-05-07T20:07:22.7627613Z [576/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_py_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -gencode arch=compute_90,code=sm_90 -gencode arch=compute_90a,code=sm_90a -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++20 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_py.dir/src/sparse_ops/sparse_async_batched_cumsum.cu.o -MF CMakeFiles/fbgemm_gpu_py.dir/src/sparse_ops/sparse_async_batched_cumsum.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/src/sparse_ops/sparse_async_batched_cumsum.cu -o CMakeFiles/fbgemm_gpu_py.dir/src/sparse_ops/sparse_async_batched_cumsum.cu.o 2025-05-07T20:07:22.7639033Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/src/sparse_ops/sparse_async_batched_cumsum.cu(16): warning #177-D: variable "kMaxThreads" was declared but never referenced 2025-05-07T20:07:22.7641462Z static constexpr uint32_t kMaxThreads = 1024; 2025-05-07T20:07:22.7641805Z ^ 2025-05-07T20:07:22.7641970Z 2025-05-07T20:07:22.7642210Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:07:22.7642569Z 2025-05-07T20:07:22.7643101Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/src/sparse_ops/sparse_async_batched_cumsum.cu(16): warning #177-D: variable "kMaxThreads" was declared but never referenced 2025-05-07T20:07:22.7643856Z static constexpr uint32_t kMaxThreads = 1024; 2025-05-07T20:07:22.7644177Z ^ 2025-05-07T20:07:22.7644337Z 2025-05-07T20:07:22.7644581Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:07:22.7644931Z 2025-05-07T20:07:22.7645563Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/src/sparse_ops/sparse_async_batched_cumsum.cu(16): warning #177-D: variable "kMaxThreads" was declared but never referenced 2025-05-07T20:07:22.7646399Z static constexpr uint32_t kMaxThreads = 1024; 2025-05-07T20:07:22.7646719Z ^ 2025-05-07T20:07:22.7646881Z 2025-05-07T20:07:22.7647138Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:07:22.7647488Z 2025-05-07T20:07:22.7648018Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/src/sparse_ops/sparse_async_batched_cumsum.cu(16): warning #177-D: variable "kMaxThreads" was declared but never referenced 2025-05-07T20:07:22.7648801Z static constexpr uint32_t kMaxThreads = 1024; 2025-05-07T20:07:22.7649122Z ^ 2025-05-07T20:07:22.7649311Z 2025-05-07T20:07:22.7649550Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:07:22.7649901Z 2025-05-07T20:07:25.5553869Z [577/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_py_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -gencode arch=compute_90,code=sm_90 -gencode arch=compute_90a,code=sm_90a -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++20 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_py.dir/src/sparse_ops/sparse_compute_frequency_sequence.cu.o -MF CMakeFiles/fbgemm_gpu_py.dir/src/sparse_ops/sparse_compute_frequency_sequence.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/src/sparse_ops/sparse_compute_frequency_sequence.cu -o CMakeFiles/fbgemm_gpu_py.dir/src/sparse_ops/sparse_compute_frequency_sequence.cu.o 2025-05-07T20:07:28.4262627Z [578/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_py_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -gencode arch=compute_90,code=sm_90 -gencode arch=compute_90a,code=sm_90a -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++20 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_py.dir/src/sparse_ops/sparse_expand_into_jagged_permute.cu.o -MF CMakeFiles/fbgemm_gpu_py.dir/src/sparse_ops/sparse_expand_into_jagged_permute.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/src/sparse_ops/sparse_expand_into_jagged_permute.cu -o CMakeFiles/fbgemm_gpu_py.dir/src/sparse_ops/sparse_expand_into_jagged_permute.cu.o 2025-05-07T20:07:38.0840260Z [579/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_py_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -gencode arch=compute_90,code=sm_90 -gencode arch=compute_90a,code=sm_90a -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++20 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_py.dir/src/sparse_ops/sparse_group_index.cu.o -MF CMakeFiles/fbgemm_gpu_py.dir/src/sparse_ops/sparse_group_index.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/src/sparse_ops/sparse_group_index.cu -o CMakeFiles/fbgemm_gpu_py.dir/src/sparse_ops/sparse_group_index.cu.o 2025-05-07T20:07:38.7230107Z [580/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_py_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -gencode arch=compute_90,code=sm_90 -gencode arch=compute_90a,code=sm_90a -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++20 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_py.dir/src/sparse_ops/sparse_index_add.cu.o -MF CMakeFiles/fbgemm_gpu_py.dir/src/sparse_ops/sparse_index_add.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/src/sparse_ops/sparse_index_add.cu -o CMakeFiles/fbgemm_gpu_py.dir/src/sparse_ops/sparse_index_add.cu.o 2025-05-07T20:07:39.8107179Z [581/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_py_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -gencode arch=compute_90,code=sm_90 -gencode arch=compute_90a,code=sm_90a -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++20 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_py.dir/src/sparse_ops/sparse_block_bucketize_features.cu.o -MF CMakeFiles/fbgemm_gpu_py.dir/src/sparse_ops/sparse_block_bucketize_features.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/src/sparse_ops/sparse_block_bucketize_features.cu -o CMakeFiles/fbgemm_gpu_py.dir/src/sparse_ops/sparse_block_bucketize_features.cu.o 2025-05-07T20:07:40.8914495Z [582/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_py_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -gencode arch=compute_90,code=sm_90 -gencode arch=compute_90a,code=sm_90a -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++20 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_py.dir/src/sparse_ops/sparse_invert_permute.cu.o -MF CMakeFiles/fbgemm_gpu_py.dir/src/sparse_ops/sparse_invert_permute.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/src/sparse_ops/sparse_invert_permute.cu -o CMakeFiles/fbgemm_gpu_py.dir/src/sparse_ops/sparse_invert_permute.cu.o 2025-05-07T20:07:41.6110570Z [583/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_py_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -gencode arch=compute_90,code=sm_90 -gencode arch=compute_90a,code=sm_90a -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++20 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_py.dir/src/sparse_ops/sparse_index_select.cu.o -MF CMakeFiles/fbgemm_gpu_py.dir/src/sparse_ops/sparse_index_select.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/src/sparse_ops/sparse_index_select.cu -o CMakeFiles/fbgemm_gpu_py.dir/src/sparse_ops/sparse_index_select.cu.o 2025-05-07T20:07:45.5854863Z [584/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_py_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -gencode arch=compute_90,code=sm_90 -gencode arch=compute_90a,code=sm_90a -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++20 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_py.dir/src/sparse_ops/sparse_pack_segments_backward.cu.o -MF CMakeFiles/fbgemm_gpu_py.dir/src/sparse_ops/sparse_pack_segments_backward.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/src/sparse_ops/sparse_pack_segments_backward.cu -o CMakeFiles/fbgemm_gpu_py.dir/src/sparse_ops/sparse_pack_segments_backward.cu.o 2025-05-07T20:07:50.3765532Z [585/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_py_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -gencode arch=compute_90,code=sm_90 -gencode arch=compute_90a,code=sm_90a -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++20 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_py.dir/src/intraining_embedding_pruning_ops/intraining_embedding_pruning.cu.o -MF CMakeFiles/fbgemm_gpu_py.dir/src/intraining_embedding_pruning_ops/intraining_embedding_pruning.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/src/intraining_embedding_pruning_ops/intraining_embedding_pruning.cu -o CMakeFiles/fbgemm_gpu_py.dir/src/intraining_embedding_pruning_ops/intraining_embedding_pruning.cu.o 2025-05-07T20:07:50.8342626Z [586/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_py_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -gencode arch=compute_90,code=sm_90 -gencode arch=compute_90a,code=sm_90a -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++20 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_py.dir/src/sparse_ops/sparse_permute102.cu.o -MF CMakeFiles/fbgemm_gpu_py.dir/src/sparse_ops/sparse_permute102.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/src/sparse_ops/sparse_permute102.cu -o CMakeFiles/fbgemm_gpu_py.dir/src/sparse_ops/sparse_permute102.cu.o 2025-05-07T20:07:51.4074337Z [587/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_py_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -gencode arch=compute_90,code=sm_90 -gencode arch=compute_90a,code=sm_90a -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++20 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_py.dir/src/sparse_ops/sparse_pack_segments_forward.cu.o -MF CMakeFiles/fbgemm_gpu_py.dir/src/sparse_ops/sparse_pack_segments_forward.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/src/sparse_ops/sparse_pack_segments_forward.cu -o CMakeFiles/fbgemm_gpu_py.dir/src/sparse_ops/sparse_pack_segments_forward.cu.o 2025-05-07T20:07:52.4376224Z [588/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_py_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -gencode arch=compute_90,code=sm_90 -gencode arch=compute_90a,code=sm_90a -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++20 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_py.dir/src/sparse_ops/sparse_permute_embeddings.cu.o -MF CMakeFiles/fbgemm_gpu_py.dir/src/sparse_ops/sparse_permute_embeddings.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/src/sparse_ops/sparse_permute_embeddings.cu -o CMakeFiles/fbgemm_gpu_py.dir/src/sparse_ops/sparse_permute_embeddings.cu.o 2025-05-07T20:07:53.3655134Z [589/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_py_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -gencode arch=compute_90,code=sm_90 -gencode arch=compute_90a,code=sm_90a -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++20 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_py.dir/src/sparse_ops/sparse_range.cu.o -MF CMakeFiles/fbgemm_gpu_py.dir/src/sparse_ops/sparse_range.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/src/sparse_ops/sparse_range.cu -o CMakeFiles/fbgemm_gpu_py.dir/src/sparse_ops/sparse_range.cu.o 2025-05-07T20:07:53.4281196Z [590/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_py_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -gencode arch=compute_90,code=sm_90 -gencode arch=compute_90a,code=sm_90a -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++20 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_py.dir/src/sparse_ops/sparse_permute_1d.cu.o -MF CMakeFiles/fbgemm_gpu_py.dir/src/sparse_ops/sparse_permute_1d.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/src/sparse_ops/sparse_permute_1d.cu -o CMakeFiles/fbgemm_gpu_py.dir/src/sparse_ops/sparse_permute_1d.cu.o 2025-05-07T20:07:54.2226087Z [591/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_py_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -gencode arch=compute_90,code=sm_90 -gencode arch=compute_90a,code=sm_90a -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++20 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_py.dir/src/sparse_ops/sparse_zipf.cu.o -MF CMakeFiles/fbgemm_gpu_py.dir/src/sparse_ops/sparse_zipf.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/src/sparse_ops/sparse_zipf.cu -o CMakeFiles/fbgemm_gpu_py.dir/src/sparse_ops/sparse_zipf.cu.o 2025-05-07T20:07:55.9903744Z [592/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_py_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -gencode arch=compute_90,code=sm_90 -gencode arch=compute_90a,code=sm_90a -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++20 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_py.dir/src/sparse_ops/sparse_permute_2d.cu.o -MF CMakeFiles/fbgemm_gpu_py.dir/src/sparse_ops/sparse_permute_2d.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/src/sparse_ops/sparse_permute_2d.cu -o CMakeFiles/fbgemm_gpu_py.dir/src/sparse_ops/sparse_permute_2d.cu.o 2025-05-07T20:07:56.7092891Z [593/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_py_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -gencode arch=compute_90,code=sm_90 -gencode arch=compute_90a,code=sm_90a -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++20 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_py.dir/src/sparse_ops/sparse_segment_sum_csr.cu.o -MF CMakeFiles/fbgemm_gpu_py.dir/src/sparse_ops/sparse_segment_sum_csr.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/src/sparse_ops/sparse_segment_sum_csr.cu -o CMakeFiles/fbgemm_gpu_py.dir/src/sparse_ops/sparse_segment_sum_csr.cu.o 2025-05-07T20:07:57.8303053Z [594/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_py_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -gencode arch=compute_90,code=sm_90 -gencode arch=compute_90a,code=sm_90a -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++20 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_py.dir/src/sparse_ops/sparse_reorder_batched_ad.cu.o -MF CMakeFiles/fbgemm_gpu_py.dir/src/sparse_ops/sparse_reorder_batched_ad.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/src/sparse_ops/sparse_reorder_batched_ad.cu -o CMakeFiles/fbgemm_gpu_py.dir/src/sparse_ops/sparse_reorder_batched_ad.cu.o 2025-05-07T20:07:58.6597297Z [595/608] : && /github/home/miniconda/envs/build_binary/bin/x86_64-conda-linux-gnu-c++ -fPIC -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/lib -O3 -DNDEBUG -Wl,-O2 -Wl,--sort-common -Wl,--as-needed -Wl,-z,relro -Wl,-z,now -Wl,--disable-new-dtags -Wl,--gc-sections -Wl,--allow-shlib-undefined -Wl,-rpath,/github/home/miniconda/envs/build_binary/lib -Wl,-rpath-link,/github/home/miniconda/envs/build_binary/lib -L/github/home/miniconda/envs/build_binary/lib -L/github/home/miniconda/envs/build_binary/targets/x86_64-linux/lib -L/github/home/miniconda/envs/build_binary/targets/x86_64-linux/lib/stubs -s -shared -Wl,-soname,fbgemm_gpu_py.so -o fbgemm_gpu_py.so CMakeFiles/fbgemm_gpu_py.dir/src/memory_utils/memory_utils.cpp.o CMakeFiles/fbgemm_gpu_py.dir/src/memory_utils/memory_utils_ops.cpp.o CMakeFiles/fbgemm_gpu_py.dir/src/merge_pooled_embedding_ops/merge_pooled_embedding_ops_cpu.cpp.o CMakeFiles/fbgemm_gpu_py.dir/src/permute_multi_embedding_ops/permute_multi_embedding_function.cpp.o CMakeFiles/fbgemm_gpu_py.dir/src/permute_multi_embedding_ops/permute_multi_embedding_ops_cpu.cpp.o CMakeFiles/fbgemm_gpu_py.dir/src/permute_pooled_embedding_ops/permute_pooled_embedding_function.cpp.o CMakeFiles/fbgemm_gpu_py.dir/src/permute_pooled_embedding_ops/permute_pooled_embedding_ops_cpu.cpp.o CMakeFiles/fbgemm_gpu_py.dir/src/permute_pooled_embedding_ops/permute_pooled_embedding_ops_split_cpu.cpp.o CMakeFiles/fbgemm_gpu_py.dir/src/jagged_tensor_ops/jagged_tensor_ops_autograd.cpp.o CMakeFiles/fbgemm_gpu_py.dir/src/jagged_tensor_ops/jagged_tensor_ops_meta.cpp.o CMakeFiles/fbgemm_gpu_py.dir/src/jagged_tensor_ops/jagged_tensor_ops_cpu.cpp.o CMakeFiles/fbgemm_gpu_py.dir/src/input_combine_ops/input_combine_cpu.cpp.o CMakeFiles/fbgemm_gpu_py.dir/src/layout_transform_ops/layout_transform_ops_cpu.cpp.o CMakeFiles/fbgemm_gpu_py.dir/src/quantize_ops/quantize_ops_cpu.cpp.o CMakeFiles/fbgemm_gpu_py.dir/src/quantize_ops/quantize_ops_meta.cpp.o CMakeFiles/fbgemm_gpu_py.dir/src/sparse_ops/sparse_async_batched_cumsum.cpp.o CMakeFiles/fbgemm_gpu_py.dir/src/sparse_ops/sparse_ops_cpu.cpp.o CMakeFiles/fbgemm_gpu_py.dir/src/sparse_ops/sparse_ops_meta.cpp.o CMakeFiles/fbgemm_gpu_py.dir/src/tbe/eeg/eeg_models.cpp.o CMakeFiles/fbgemm_gpu_py.dir/src/tbe/eeg/eeg_utils.cpp.o CMakeFiles/fbgemm_gpu_py.dir/src/tbe/eeg/indices_estimator_ops.cpp.o CMakeFiles/fbgemm_gpu_py.dir/src/tbe/eeg/indices_estimator.cpp.o CMakeFiles/fbgemm_gpu_py.dir/src/tbe/eeg/indices_generator_ops.cpp.o CMakeFiles/fbgemm_gpu_py.dir/src/tbe/eeg/indices_generator.cpp.o CMakeFiles/fbgemm_gpu_py.dir/src/intraining_embedding_pruning_ops/intraining_embedding_pruning_gpu.cpp.o CMakeFiles/fbgemm_gpu_py.dir/src/layout_transform_ops/layout_transform_ops_gpu.cpp.o CMakeFiles/fbgemm_gpu_py.dir/src/permute_pooled_embedding_ops/permute_pooled_embedding_ops_gpu.cpp.o CMakeFiles/fbgemm_gpu_py.dir/src/permute_pooled_embedding_ops/permute_pooled_embedding_ops_split_gpu.cpp.o CMakeFiles/fbgemm_gpu_py.dir/src/quantize_ops/quantize_ops_gpu.cpp.o CMakeFiles/fbgemm_gpu_py.dir/src/sparse_ops/sparse_ops_gpu.cpp.o CMakeFiles/fbgemm_gpu_py.dir/src/metric_ops/metric_ops_host.cpp.o CMakeFiles/fbgemm_gpu_py.dir/src/input_combine_ops/input_combine_gpu.cpp.o CMakeFiles/fbgemm_gpu_py.dir/src/merge_pooled_embedding_ops/merge_pooled_embedding_ops_gpu.cpp.o CMakeFiles/fbgemm_gpu_py.dir/src/topology_utils.cpp.o CMakeFiles/fbgemm_gpu_py.dir/src/histogram_binning_calibration_ops.cu.o CMakeFiles/fbgemm_gpu_py.dir/src/input_combine_ops/input_combine.cu.o CMakeFiles/fbgemm_gpu_py.dir/src/intraining_embedding_pruning_ops/intraining_embedding_pruning.cu.o CMakeFiles/fbgemm_gpu_py.dir/src/memory_utils/memory_utils.cu.o CMakeFiles/fbgemm_gpu_py.dir/src/memory_utils/memory_utils_ops.cu.o CMakeFiles/fbgemm_gpu_py.dir/src/jagged_tensor_ops/batched_dense_vec_jagged_2d_mul_backward.cu.o CMakeFiles/fbgemm_gpu_py.dir/src/jagged_tensor_ops/batched_dense_vec_jagged_2d_mul_forward.cu.o CMakeFiles/fbgemm_gpu_py.dir/src/jagged_tensor_ops/dense_to_jagged_forward.cu.o CMakeFiles/fbgemm_gpu_py.dir/src/jagged_tensor_ops/jagged_dense_bmm_forward.cu.o CMakeFiles/fbgemm_gpu_py.dir/src/jagged_tensor_ops/jagged_dense_dense_elementwise_add_jagged_output_forward.cu.o CMakeFiles/fbgemm_gpu_py.dir/src/jagged_tensor_ops/jagged_dense_elementwise_mul_backward.cu.o CMakeFiles/fbgemm_gpu_py.dir/src/jagged_tensor_ops/jagged_dense_elementwise_mul_forward.cu.o CMakeFiles/fbgemm_gpu_py.dir/src/jagged_tensor_ops/jagged_index_add_2d_forward.cu.o CMakeFiles/fbgemm_gpu_py.dir/src/jagged_tensor_ops/jagged_index_select_2d_forward.cu.o CMakeFiles/fbgemm_gpu_py.dir/src/jagged_tensor_ops/jagged_jagged_bmm_forward.cu.o CMakeFiles/fbgemm_gpu_py.dir/src/jagged_tensor_ops/jagged_softmax_backward.cu.o CMakeFiles/fbgemm_gpu_py.dir/src/jagged_tensor_ops/jagged_softmax_forward.cu.o CMakeFiles/fbgemm_gpu_py.dir/src/jagged_tensor_ops/jagged_tensor_ops.cu.o CMakeFiles/fbgemm_gpu_py.dir/src/jagged_tensor_ops/jagged_to_padded_dense_backward.cu.o CMakeFiles/fbgemm_gpu_py.dir/src/jagged_tensor_ops/jagged_to_padded_dense_forward.cu.o CMakeFiles/fbgemm_gpu_py.dir/src/jagged_tensor_ops/jagged_unique_indices.cu.o CMakeFiles/fbgemm_gpu_py.dir/src/jagged_tensor_ops/keyed_jagged_index_select_dim1.cu.o CMakeFiles/fbgemm_gpu_py.dir/src/layout_transform_ops/layout_transform_ops.cu.o CMakeFiles/fbgemm_gpu_py.dir/src/metric_ops/metric_ops.cu.o CMakeFiles/fbgemm_gpu_py.dir/src/permute_pooled_embedding_ops/permute_pooled_embedding_ops_split.cu.o CMakeFiles/fbgemm_gpu_py.dir/src/permute_pooled_embedding_ops/permute_pooled_embedding_ops.cu.o CMakeFiles/fbgemm_gpu_py.dir/src/permute_multi_embedding_ops/permute_multi_embedding_ops.cu.o CMakeFiles/fbgemm_gpu_py.dir/src/quantize_ops/quantize_bfloat16.cu.o CMakeFiles/fbgemm_gpu_py.dir/src/quantize_ops/quantize_fp8_rowwise.cu.o CMakeFiles/fbgemm_gpu_py.dir/src/quantize_ops/quantize_fused_8bit_rowwise.cu.o CMakeFiles/fbgemm_gpu_py.dir/src/quantize_ops/quantize_fused_nbit_rowwise.cu.o CMakeFiles/fbgemm_gpu_py.dir/src/quantize_ops/quantize_hfp8.cu.o CMakeFiles/fbgemm_gpu_py.dir/src/quantize_ops/quantize_msfp.cu.o CMakeFiles/fbgemm_gpu_py.dir/src/quantize_ops/quantize_padded_fp8_rowwise.cu.o CMakeFiles/fbgemm_gpu_py.dir/src/quantize_ops/quantize_mx.cu.o CMakeFiles/fbgemm_gpu_py.dir/src/sparse_ops/sparse_async_batched_cumsum.cu.o CMakeFiles/fbgemm_gpu_py.dir/src/sparse_ops/sparse_block_bucketize_features.cu.o CMakeFiles/fbgemm_gpu_py.dir/src/sparse_ops/sparse_bucketize_features.cu.o CMakeFiles/fbgemm_gpu_py.dir/src/sparse_ops/sparse_batched_unary_embeddings.cu.o CMakeFiles/fbgemm_gpu_py.dir/src/sparse_ops/sparse_compute_frequency_sequence.cu.o CMakeFiles/fbgemm_gpu_py.dir/src/sparse_ops/sparse_expand_into_jagged_permute.cu.o CMakeFiles/fbgemm_gpu_py.dir/src/sparse_ops/sparse_group_index.cu.o CMakeFiles/fbgemm_gpu_py.dir/src/sparse_ops/sparse_index_add.cu.o CMakeFiles/fbgemm_gpu_py.dir/src/sparse_ops/sparse_index_select.cu.o CMakeFiles/fbgemm_gpu_py.dir/src/sparse_ops/sparse_invert_permute.cu.o CMakeFiles/fbgemm_gpu_py.dir/src/sparse_ops/sparse_pack_segments_backward.cu.o CMakeFiles/fbgemm_gpu_py.dir/src/sparse_ops/sparse_pack_segments_forward.cu.o CMakeFiles/fbgemm_gpu_py.dir/src/sparse_ops/sparse_permute_1d.cu.o CMakeFiles/fbgemm_gpu_py.dir/src/sparse_ops/sparse_permute_2d.cu.o CMakeFiles/fbgemm_gpu_py.dir/src/sparse_ops/sparse_permute102.cu.o CMakeFiles/fbgemm_gpu_py.dir/src/sparse_ops/sparse_permute_embeddings.cu.o CMakeFiles/fbgemm_gpu_py.dir/src/sparse_ops/sparse_range.cu.o CMakeFiles/fbgemm_gpu_py.dir/src/sparse_ops/sparse_reorder_batched_ad.cu.o CMakeFiles/fbgemm_gpu_py.dir/src/sparse_ops/sparse_segment_sum_csr.cu.o CMakeFiles/fbgemm_gpu_py.dir/src/sparse_ops/sparse_zipf.cu.o -L/lib/intel64 -L/lib/intel64_win -L/lib/win-x64 -Wl,-rpath,"\$ORIGIN" /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/lib/libc10.so /github/home/miniconda/envs/build_binary/targets/x86_64-linux/lib/libnvrtc.so /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/lib/libc10_cuda.so /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/lib/libnccl.so.2 /github/home/miniconda/envs/build_binary/targets/x86_64-linux/lib/stubs/libcuda.so fbgemm.so fbgemm_gpu_embedding_inplace_ops.so fbgemm_gpu_tbe_index_select.so fbgemm_gpu_tbe_cache.so fbgemm_gpu_tbe_optimizers.so /github/home/miniconda/envs/build_binary/lib/stubs/libnvidia-ml.so -Wl,--no-as-needed,"/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/lib/libtorch.so" -Wl,--as-needed fbgemm_gpu_sparse_async_cumsum.so fbgemm_gpu_tbe_utils.so /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/lib/libtorch.so -Wl,--no-as-needed,"/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/lib/libtorch_cpu.so" -Wl,--as-needed -Wl,--no-as-needed,"/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/lib/libtorch_cuda.so" -Wl,--as-needed /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/lib/libc10_cuda.so /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/lib/libc10.so /github/home/miniconda/envs/build_binary/targets/x86_64-linux/lib/libcudart.so -lcudadevrt -lcudart_static -lrt -lpthread -ldl -Wl,-rpath-link,/__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build && : 2025-05-07T20:07:58.8729446Z [596/608] cd /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build && bash /__w/FBGEMM/FBGEMM/fbgemm_gpu/../.github/scripts/fbgemm_gpu_postbuild.bash /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/fbgemm_gpu_py.so 1 2025-05-07T20:07:58.8730739Z ################################################################################ 2025-05-07T20:07:58.8731079Z [CMAKE] Running post-build script ... 2025-05-07T20:07:58.8731616Z Target file: /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/fbgemm_gpu_py.so 2025-05-07T20:07:58.8732145Z Resetting RPATH to $ORIGIN ... 2025-05-07T20:07:58.8732510Z 0x000000000000000f (RPATH) Library rpath: [$ORIGIN] 2025-05-07T20:07:58.8732904Z ################################################################################ 2025-05-07T20:09:07.4222559Z [597/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -gencode arch=compute_90,code=sm_90 -gencode arch=compute_90a,code=sm_90a -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++20 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_partial_rowwise_adam_split_unweighted_kernel_cta.cu.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_partial_rowwise_adam_split_unweighted_kernel_cta.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_backward_partial_rowwise_adam_split_unweighted_kernel_cta.cu -o CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_partial_rowwise_adam_split_unweighted_kernel_cta.cu.o 2025-05-07T20:09:14.0080848Z [598/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -gencode arch=compute_90,code=sm_90 -gencode arch=compute_90a,code=sm_90a -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++20 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_partial_rowwise_adam_split_unweighted_nobag_kernel_cta.cu.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_partial_rowwise_adam_split_unweighted_nobag_kernel_cta.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_backward_partial_rowwise_adam_split_unweighted_nobag_kernel_cta.cu -o CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_partial_rowwise_adam_split_unweighted_nobag_kernel_cta.cu.o 2025-05-07T20:09:16.4960141Z [599/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -gencode arch=compute_90,code=sm_90 -gencode arch=compute_90a,code=sm_90a -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++20 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_partial_rowwise_adam_split_weighted_kernel_cta.cu.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_partial_rowwise_adam_split_weighted_kernel_cta.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/gen_embedding_backward_partial_rowwise_adam_split_weighted_kernel_cta.cu -o CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_partial_rowwise_adam_split_weighted_kernel_cta.cu.o 2025-05-07T20:09:18.3285753Z [600/608] : && /github/home/miniconda/envs/build_binary/bin/x86_64-conda-linux-gnu-c++ -fPIC -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/lib -O3 -DNDEBUG -Wl,-O2 -Wl,--sort-common -Wl,--as-needed -Wl,-z,relro -Wl,-z,now -Wl,--disable-new-dtags -Wl,--gc-sections -Wl,--allow-shlib-undefined -Wl,-rpath,/github/home/miniconda/envs/build_binary/lib -Wl,-rpath-link,/github/home/miniconda/envs/build_binary/lib -L/github/home/miniconda/envs/build_binary/lib -L/github/home/miniconda/envs/build_binary/targets/x86_64-linux/lib -L/github/home/miniconda/envs/build_binary/targets/x86_64-linux/lib/stubs -s -shared -Wl,-soname,fbgemm_gpu_tbe_training_backward.so -o fbgemm_gpu_tbe_training_backward.so CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/codegen/training/backward/embedding_backward_dense_host_cpu.cpp.o CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_dense_split_cpu.cpp.o CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_split_adagrad_cpu.cpp.o CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_split_rowwise_adagrad_cpu.cpp.o CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_split_sgd_cpu.cpp.o CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_split_adam_cpu.cpp.o CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_split_lamb_cpu.cpp.o CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_split_lars_sgd_cpu.cpp.o CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_split_partial_rowwise_adam_cpu.cpp.o CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_split_partial_rowwise_lamb_cpu.cpp.o CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_split_none_cpu.cpp.o CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_split_rowwise_adagrad_with_counter_cpu.cpp.o CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_split_approx_sgd_cpu.cpp.o CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_split_approx_rowwise_adagrad_cpu.cpp.o CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_split_approx_rowwise_adagrad_with_counter_cpu.cpp.o CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_split_approx_rowwise_adagrad_with_weight_decay_cpu.cpp.o CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_split_rowwise_adagrad_with_weight_decay_cpu.cpp.o CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_split_rowwise_weighted_adagrad_cpu.cpp.o CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_adagrad_split_cpu.cpp.o CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_rowwise_adagrad_split_cpu.cpp.o CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_sgd_split_cpu.cpp.o CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_split_grad_embedding_ops.cu.o CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_split_indice_weights_codegen_cuda.cu.o CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_dense_indice_weights_codegen_cuda.cu.o CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_ssd_indice_weights_codegen_cuda.cu.o CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_rowwise_adagrad_ssd_weighted_cuda.cu.o CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_rowwise_adagrad_ssd_weighted_kernel_cta.cu.o CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_rowwise_adagrad_ssd_weighted_kernel_warp.cu.o CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_rowwise_adagrad_ssd_unweighted_nobag_cuda.cu.o CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_rowwise_adagrad_ssd_unweighted_nobag_kernel_cta.cu.o CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_rowwise_adagrad_ssd_unweighted_nobag_kernel_warp.cu.o CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_rowwise_adagrad_ssd_unweighted_cuda.cu.o CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_rowwise_adagrad_ssd_unweighted_kernel_cta.cu.o CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_rowwise_adagrad_ssd_unweighted_kernel_warp.cu.o CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_adagrad_split_weighted_cuda.cu.o CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_adagrad_split_weighted_kernel_cta.cu.o CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_adagrad_split_weighted_kernel_warp.cu.o CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_adagrad_split_unweighted_nobag_cuda.cu.o CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_adagrad_split_unweighted_nobag_kernel_cta.cu.o CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_adagrad_split_unweighted_nobag_kernel_warp.cu.o CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_adagrad_split_unweighted_cuda.cu.o CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_adagrad_split_unweighted_kernel_cta.cu.o CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_adagrad_split_unweighted_kernel_warp.cu.o CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_rowwise_adagrad_split_weighted_cuda.cu.o CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_rowwise_adagrad_split_weighted_kernel_cta.cu.o CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_rowwise_adagrad_split_weighted_kernel_warp.cu.o CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_rowwise_adagrad_split_unweighted_nobag_cuda.cu.o CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_rowwise_adagrad_split_unweighted_nobag_kernel_cta.cu.o CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_rowwise_adagrad_split_unweighted_nobag_kernel_warp.cu.o CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_rowwise_adagrad_split_unweighted_cuda.cu.o CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_rowwise_adagrad_split_unweighted_kernel_cta.cu.o CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_rowwise_adagrad_split_unweighted_kernel_warp.cu.o CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_sgd_split_weighted_cuda.cu.o CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_sgd_split_weighted_kernel_cta.cu.o CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_sgd_split_weighted_kernel_warp.cu.o CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_sgd_split_unweighted_nobag_cuda.cu.o CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_sgd_split_unweighted_nobag_kernel_cta.cu.o CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_sgd_split_unweighted_nobag_kernel_warp.cu.o CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_sgd_split_unweighted_cuda.cu.o CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_sgd_split_unweighted_kernel_cta.cu.o CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_sgd_split_unweighted_kernel_warp.cu.o CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_adam_split_weighted_cuda.cu.o CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_adam_split_weighted_kernel_cta.cu.o CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_adam_split_weighted_kernel_warp.cu.o CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_adam_split_unweighted_nobag_cuda.cu.o CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_adam_split_unweighted_nobag_kernel_cta.cu.o CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_adam_split_unweighted_nobag_kernel_warp.cu.o CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_adam_split_unweighted_cuda.cu.o CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_adam_split_unweighted_kernel_cta.cu.o CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_adam_split_unweighted_kernel_warp.cu.o CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_lamb_split_weighted_cuda.cu.o CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_lamb_split_weighted_kernel_cta.cu.o CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_lamb_split_weighted_kernel_warp.cu.o CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_lamb_split_unweighted_nobag_cuda.cu.o CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_lamb_split_unweighted_nobag_kernel_cta.cu.o CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_lamb_split_unweighted_nobag_kernel_warp.cu.o CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_lamb_split_unweighted_cuda.cu.o CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_lamb_split_unweighted_kernel_cta.cu.o CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_lamb_split_unweighted_kernel_warp.cu.o CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_lars_sgd_split_weighted_cuda.cu.o CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_lars_sgd_split_weighted_kernel_cta.cu.o CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_lars_sgd_split_weighted_kernel_warp.cu.o CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_lars_sgd_split_unweighted_nobag_cuda.cu.o CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_lars_sgd_split_unweighted_nobag_kernel_cta.cu.o CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_lars_sgd_split_unweighted_nobag_kernel_warp.cu.o CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_lars_sgd_split_unweighted_cuda.cu.o CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_lars_sgd_split_unweighted_kernel_cta.cu.o CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_lars_sgd_split_unweighted_kernel_warp.cu.o CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_partial_rowwise_adam_split_weighted_cuda.cu.o CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_partial_rowwise_adam_split_weighted_kernel_cta.cu.o CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_partial_rowwise_adam_split_weighted_kernel_warp.cu.o CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_partial_rowwise_adam_split_unweighted_nobag_cuda.cu.o CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_partial_rowwise_adam_split_unweighted_nobag_kernel_cta.cu.o CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_partial_rowwise_adam_split_unweighted_nobag_kernel_warp.cu.o CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_partial_rowwise_adam_split_unweighted_cuda.cu.o CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_partial_rowwise_adam_split_unweighted_kernel_cta.cu.o CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_partial_rowwise_adam_split_unweighted_kernel_warp.cu.o CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_partial_rowwise_lamb_split_weighted_cuda.cu.o CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_partial_rowwise_lamb_split_weighted_kernel_cta.cu.o CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_partial_rowwise_lamb_split_weighted_kernel_warp.cu.o CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_partial_rowwise_lamb_split_unweighted_nobag_cuda.cu.o CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_partial_rowwise_lamb_split_unweighted_nobag_kernel_cta.cu.o CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_partial_rowwise_lamb_split_unweighted_nobag_kernel_warp.cu.o CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_partial_rowwise_lamb_split_unweighted_cuda.cu.o CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_partial_rowwise_lamb_split_unweighted_kernel_cta.cu.o CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_partial_rowwise_lamb_split_unweighted_kernel_warp.cu.o CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_none_split_weighted_cuda.cu.o CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_none_split_weighted_kernel_cta.cu.o CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_none_split_weighted_kernel_warp.cu.o CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_none_split_unweighted_nobag_cuda.cu.o CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_none_split_unweighted_nobag_kernel_cta.cu.o CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_none_split_unweighted_nobag_kernel_warp.cu.o CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_none_split_unweighted_cuda.cu.o CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_none_split_unweighted_kernel_cta.cu.o CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_none_split_unweighted_kernel_warp.cu.o CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_rowwise_adagrad_with_counter_split_weighted_cuda.cu.o CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_rowwise_adagrad_with_counter_split_weighted_kernel_cta.cu.o CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_rowwise_adagrad_with_counter_split_weighted_kernel_warp.cu.o CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_rowwise_adagrad_with_counter_split_unweighted_nobag_cuda.cu.o CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_rowwise_adagrad_with_counter_split_unweighted_nobag_kernel_cta.cu.o CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_rowwise_adagrad_with_counter_split_unweighted_nobag_kernel_warp.cu.o CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_rowwise_adagrad_with_counter_split_unweighted_cuda.cu.o CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_rowwise_adagrad_with_counter_split_unweighted_kernel_cta.cu.o CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_rowwise_adagrad_with_counter_split_unweighted_kernel_warp.cu.o -L/lib/intel64 -L/lib/intel64_win -L/lib/win-x64 -Wl,-rpath,"\$ORIGIN" /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/lib/libc10.so /github/home/miniconda/envs/build_binary/targets/x86_64-linux/lib/libnvrtc.so /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/lib/libc10_cuda.so /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/lib/libnccl.so.2 /github/home/miniconda/envs/build_binary/targets/x86_64-linux/lib/stubs/libcuda.so fbgemm_gpu_tbe_cache.so fbgemm_gpu_tbe_common.so fbgemm_gpu_sparse_async_cumsum.so /github/home/miniconda/envs/build_binary/lib/stubs/libnvidia-ml.so -Wl,--no-as-needed,"/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/lib/libtorch.so" -Wl,--as-needed fbgemm.so fbgemm_gpu_config.so fbgemm_gpu_tbe_utils.so /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/lib/libtorch.so -Wl,--no-as-needed,"/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/lib/libtorch_cpu.so" -Wl,--as-needed -Wl,--no-as-needed,"/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/lib/libtorch_cuda.so" -Wl,--as-needed /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/lib/libc10_cuda.so /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/lib/libc10.so /github/home/miniconda/envs/build_binary/targets/x86_64-linux/lib/libcudart.so -lcudadevrt -lcudart_static -lrt -lpthread -ldl -Wl,-rpath-link,/__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build && : 2025-05-07T20:09:18.9653695Z [601/608] : && /github/home/miniconda/envs/build_binary/bin/x86_64-conda-linux-gnu-c++ -fPIC -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/lib -O3 -DNDEBUG -Wl,-O2 -Wl,--sort-common -Wl,--as-needed -Wl,-z,relro -Wl,-z,now -Wl,--disable-new-dtags -Wl,--gc-sections -Wl,--allow-shlib-undefined -Wl,-rpath,/github/home/miniconda/envs/build_binary/lib -Wl,-rpath-link,/github/home/miniconda/envs/build_binary/lib -L/github/home/miniconda/envs/build_binary/lib -L/github/home/miniconda/envs/build_binary/targets/x86_64-linux/lib -L/github/home/miniconda/envs/build_binary/targets/x86_64-linux/lib/stubs -s -shared -Wl,-soname,fbgemm_gpu_tbe_training_backward_dense.so -o fbgemm_gpu_tbe_training_backward_dense.so CMakeFiles/fbgemm_gpu_tbe_training_backward_dense.dir/gen_embedding_backward_dense_split_weighted_meta.cpp.o CMakeFiles/fbgemm_gpu_tbe_training_backward_dense.dir/gen_embedding_backward_dense_split_unweighted_nobag_meta.cpp.o CMakeFiles/fbgemm_gpu_tbe_training_backward_dense.dir/gen_embedding_backward_dense_split_unweighted_meta.cpp.o CMakeFiles/fbgemm_gpu_tbe_training_backward_dense.dir/gen_embedding_backward_split_dense.cpp.o CMakeFiles/fbgemm_gpu_tbe_training_backward_dense.dir/gen_embedding_backward_dense_split_weighted_cuda.cu.o CMakeFiles/fbgemm_gpu_tbe_training_backward_dense.dir/gen_embedding_backward_dense_split_weighted_kernel_cta.cu.o CMakeFiles/fbgemm_gpu_tbe_training_backward_dense.dir/gen_embedding_backward_dense_split_weighted_kernel_warp.cu.o CMakeFiles/fbgemm_gpu_tbe_training_backward_dense.dir/gen_embedding_backward_dense_split_unweighted_nobag_cuda.cu.o CMakeFiles/fbgemm_gpu_tbe_training_backward_dense.dir/gen_embedding_backward_dense_split_unweighted_nobag_kernel_cta.cu.o CMakeFiles/fbgemm_gpu_tbe_training_backward_dense.dir/gen_embedding_backward_dense_split_unweighted_nobag_kernel_warp.cu.o CMakeFiles/fbgemm_gpu_tbe_training_backward_dense.dir/gen_embedding_backward_dense_split_unweighted_cuda.cu.o CMakeFiles/fbgemm_gpu_tbe_training_backward_dense.dir/gen_embedding_backward_dense_split_unweighted_kernel_cta.cu.o CMakeFiles/fbgemm_gpu_tbe_training_backward_dense.dir/gen_embedding_backward_dense_split_unweighted_kernel_warp.cu.o -L/lib/intel64 -L/lib/intel64_win -L/lib/win-x64 -Wl,-rpath,"\$ORIGIN" /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/lib/libc10.so /github/home/miniconda/envs/build_binary/targets/x86_64-linux/lib/libnvrtc.so /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/lib/libc10_cuda.so /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/lib/libnccl.so.2 /github/home/miniconda/envs/build_binary/targets/x86_64-linux/lib/stubs/libcuda.so fbgemm_gpu_tbe_training_backward.so /github/home/miniconda/envs/build_binary/lib/stubs/libnvidia-ml.so -Wl,--no-as-needed,"/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/lib/libtorch.so" -Wl,--as-needed /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/lib/libtorch.so -Wl,--no-as-needed,"/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/lib/libtorch_cpu.so" -Wl,--as-needed -Wl,--no-as-needed,"/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/lib/libtorch_cuda.so" -Wl,--as-needed /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/lib/libc10_cuda.so /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/lib/libc10.so /github/home/miniconda/envs/build_binary/targets/x86_64-linux/lib/libcudart.so -lcudadevrt -lcudart_static -lrt -lpthread -ldl -Wl,-rpath-link,/__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build && : 2025-05-07T20:09:19.0651776Z [602/608] : && /github/home/miniconda/envs/build_binary/bin/c++ -fPIC -Wl,-O2 -Wl,--sort-common -Wl,--as-needed -Wl,-z,relro -Wl,-z,now -Wl,--disable-new-dtags -Wl,--gc-sections -Wl,--allow-shlib-undefined -Wl,-rpath,/github/home/miniconda/envs/build_binary/lib -Wl,-rpath-link,/github/home/miniconda/envs/build_binary/lib -L/github/home/miniconda/envs/build_binary/lib -L/github/home/miniconda/envs/build_binary/targets/x86_64-linux/lib -L/github/home/miniconda/envs/build_binary/targets/x86_64-linux/lib/stubs -s -shared -Wl,-soname,fbgemm_gpu_tbe_training_backward_gwd.so -o fbgemm_gpu_tbe_training_backward_gwd.so CMakeFiles/fbgemm_gpu_tbe_training_backward_gwd.dir/gen_embedding_backward_rowwise_adagrad_split_weighted_gwd_cuda.cu.o CMakeFiles/fbgemm_gpu_tbe_training_backward_gwd.dir/gen_embedding_backward_rowwise_adagrad_split_weighted_gwd_kernel_cta.cu.o CMakeFiles/fbgemm_gpu_tbe_training_backward_gwd.dir/gen_embedding_backward_rowwise_adagrad_split_weighted_gwd_kernel_warp.cu.o CMakeFiles/fbgemm_gpu_tbe_training_backward_gwd.dir/gen_embedding_backward_rowwise_adagrad_split_unweighted_gwd_cuda.cu.o CMakeFiles/fbgemm_gpu_tbe_training_backward_gwd.dir/gen_embedding_backward_rowwise_adagrad_split_unweighted_gwd_kernel_cta.cu.o CMakeFiles/fbgemm_gpu_tbe_training_backward_gwd.dir/gen_embedding_backward_rowwise_adagrad_split_unweighted_gwd_kernel_warp.cu.o CMakeFiles/fbgemm_gpu_tbe_training_backward_gwd.dir/gen_embedding_backward_rowwise_adagrad_split_weighted_vbe_gwd_cuda.cu.o CMakeFiles/fbgemm_gpu_tbe_training_backward_gwd.dir/gen_embedding_backward_rowwise_adagrad_split_weighted_vbe_gwd_kernel_cta.cu.o CMakeFiles/fbgemm_gpu_tbe_training_backward_gwd.dir/gen_embedding_backward_rowwise_adagrad_split_weighted_vbe_gwd_kernel_warp.cu.o CMakeFiles/fbgemm_gpu_tbe_training_backward_gwd.dir/gen_embedding_backward_rowwise_adagrad_split_unweighted_vbe_gwd_cuda.cu.o CMakeFiles/fbgemm_gpu_tbe_training_backward_gwd.dir/gen_embedding_backward_rowwise_adagrad_split_unweighted_vbe_gwd_kernel_cta.cu.o CMakeFiles/fbgemm_gpu_tbe_training_backward_gwd.dir/gen_embedding_backward_rowwise_adagrad_split_unweighted_vbe_gwd_kernel_warp.cu.o -L/lib/intel64 -L/lib/intel64_win -L/lib/win-x64 -Wl,-rpath,"\$ORIGIN" /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/lib/libc10.so /github/home/miniconda/envs/build_binary/targets/x86_64-linux/lib/libnvrtc.so /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/lib/libc10_cuda.so /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/lib/libnccl.so.2 /github/home/miniconda/envs/build_binary/targets/x86_64-linux/lib/stubs/libcuda.so fbgemm_gpu_tbe_training_backward.so /github/home/miniconda/envs/build_binary/lib/stubs/libnvidia-ml.so -Wl,--no-as-needed,"/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/lib/libtorch.so" -Wl,--as-needed /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/lib/libtorch.so -Wl,--no-as-needed,"/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/lib/libtorch_cpu.so" -Wl,--as-needed -Wl,--no-as-needed,"/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/lib/libtorch_cuda.so" -Wl,--as-needed /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/lib/libc10_cuda.so /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/lib/libc10.so /github/home/miniconda/envs/build_binary/targets/x86_64-linux/lib/libcudart.so -lcudadevrt -lcudart_static -lrt -lpthread -ldl -Wl,-rpath-link,/__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build -L"/github/home/miniconda/envs/build_binary/targets/x86_64-linux/lib/stubs" -L"/github/home/miniconda/envs/build_binary/targets/x86_64-linux/lib" && : 2025-05-07T20:09:19.1414356Z [603/608] cd /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build && bash /__w/FBGEMM/FBGEMM/fbgemm_gpu/../.github/scripts/fbgemm_gpu_postbuild.bash /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/fbgemm_gpu_tbe_training_backward_dense.so 1 2025-05-07T20:09:19.1415716Z ################################################################################ 2025-05-07T20:09:19.1416156Z [CMAKE] Running post-build script ... 2025-05-07T20:09:19.1416827Z Target file: /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/fbgemm_gpu_tbe_training_backward_dense.so 2025-05-07T20:09:19.1417507Z Resetting RPATH to $ORIGIN ... 2025-05-07T20:09:19.1417904Z 0x000000000000000f (RPATH) Library rpath: [$ORIGIN] 2025-05-07T20:09:19.1418317Z ################################################################################ 2025-05-07T20:09:19.6589556Z [604/608] cd /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build && bash /__w/FBGEMM/FBGEMM/fbgemm_gpu/../.github/scripts/fbgemm_gpu_postbuild.bash /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/fbgemm_gpu_tbe_training_backward.so 1 2025-05-07T20:09:19.6591002Z ################################################################################ 2025-05-07T20:09:19.6591418Z [CMAKE] Running post-build script ... 2025-05-07T20:09:19.6592325Z Target file: /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/fbgemm_gpu_tbe_training_backward.so 2025-05-07T20:09:19.6592990Z Resetting RPATH to $ORIGIN ... 2025-05-07T20:09:19.6593424Z 0x000000000000000f (RPATH) Library rpath: [$ORIGIN] 2025-05-07T20:09:19.6593925Z ################################################################################ 2025-05-07T20:09:20.4321713Z [605/608] cd /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build && bash /__w/FBGEMM/FBGEMM/fbgemm_gpu/../.github/scripts/fbgemm_gpu_postbuild.bash /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/fbgemm_gpu_tbe_training_backward_gwd.so 1 2025-05-07T20:09:20.4323018Z ################################################################################ 2025-05-07T20:09:20.4323382Z [CMAKE] Running post-build script ... 2025-05-07T20:09:20.4324048Z Target file: /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/fbgemm_gpu_tbe_training_backward_gwd.so 2025-05-07T20:09:20.4324726Z Resetting RPATH to $ORIGIN ... 2025-05-07T20:09:20.4325109Z 0x000000000000000f (RPATH) Library rpath: [$ORIGIN] 2025-05-07T20:09:20.4325795Z ################################################################################ 2025-05-07T20:09:26.8720536Z [606/608] : && /github/home/miniconda/envs/build_binary/bin/x86_64-conda-linux-gnu-c++ -fPIC -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/lib -O3 -DNDEBUG -Wl,-O2 -Wl,--sort-common -Wl,--as-needed -Wl,-z,relro -Wl,-z,now -Wl,--disable-new-dtags -Wl,--gc-sections -Wl,--allow-shlib-undefined -Wl,-rpath,/github/home/miniconda/envs/build_binary/lib -Wl,-rpath-link,/github/home/miniconda/envs/build_binary/lib -L/github/home/miniconda/envs/build_binary/lib -L/github/home/miniconda/envs/build_binary/targets/x86_64-linux/lib -L/github/home/miniconda/envs/build_binary/targets/x86_64-linux/lib/stubs -s -shared -Wl,-soname,fbgemm_gpu_tbe_training_backward_vbe.so -o fbgemm_gpu_tbe_training_backward_vbe.so CMakeFiles/fbgemm_gpu_tbe_training_backward_vbe.dir/gen_embedding_backward_rowwise_adagrad_split_weighted_vbe_meta.cpp.o CMakeFiles/fbgemm_gpu_tbe_training_backward_vbe.dir/gen_embedding_backward_rowwise_adagrad_ssd_weighted_vbe_meta.cpp.o CMakeFiles/fbgemm_gpu_tbe_training_backward_vbe.dir/gen_embedding_backward_rowwise_adagrad_split_unweighted_vbe_meta.cpp.o CMakeFiles/fbgemm_gpu_tbe_training_backward_vbe.dir/gen_embedding_backward_rowwise_adagrad_ssd_unweighted_vbe_meta.cpp.o CMakeFiles/fbgemm_gpu_tbe_training_backward_vbe.dir/gen_embedding_backward_rowwise_adagrad_with_counter_split_weighted_vbe_meta.cpp.o CMakeFiles/fbgemm_gpu_tbe_training_backward_vbe.dir/gen_embedding_backward_rowwise_adagrad_with_counter_split_unweighted_vbe_meta.cpp.o CMakeFiles/fbgemm_gpu_tbe_training_backward_vbe.dir/gen_embedding_backward_sgd_split_weighted_vbe_meta.cpp.o CMakeFiles/fbgemm_gpu_tbe_training_backward_vbe.dir/gen_embedding_backward_sgd_split_unweighted_vbe_meta.cpp.o CMakeFiles/fbgemm_gpu_tbe_training_backward_vbe.dir/gen_embedding_backward_dense_split_weighted_vbe_meta.cpp.o CMakeFiles/fbgemm_gpu_tbe_training_backward_vbe.dir/gen_embedding_backward_dense_split_unweighted_vbe_meta.cpp.o CMakeFiles/fbgemm_gpu_tbe_training_backward_vbe.dir/gen_embedding_backward_adam_split_weighted_vbe_meta.cpp.o CMakeFiles/fbgemm_gpu_tbe_training_backward_vbe.dir/gen_embedding_backward_adam_split_unweighted_vbe_meta.cpp.o CMakeFiles/fbgemm_gpu_tbe_training_backward_vbe.dir/gen_embedding_backward_rowwise_adagrad_split_weighted_vbe_cuda.cu.o CMakeFiles/fbgemm_gpu_tbe_training_backward_vbe.dir/gen_embedding_backward_rowwise_adagrad_split_weighted_vbe_kernel_cta.cu.o CMakeFiles/fbgemm_gpu_tbe_training_backward_vbe.dir/gen_embedding_backward_rowwise_adagrad_split_weighted_vbe_kernel_warp.cu.o CMakeFiles/fbgemm_gpu_tbe_training_backward_vbe.dir/gen_embedding_backward_rowwise_adagrad_ssd_weighted_vbe_cuda.cu.o CMakeFiles/fbgemm_gpu_tbe_training_backward_vbe.dir/gen_embedding_backward_rowwise_adagrad_ssd_weighted_vbe_kernel_cta.cu.o CMakeFiles/fbgemm_gpu_tbe_training_backward_vbe.dir/gen_embedding_backward_rowwise_adagrad_ssd_weighted_vbe_kernel_warp.cu.o CMakeFiles/fbgemm_gpu_tbe_training_backward_vbe.dir/gen_embedding_backward_rowwise_adagrad_split_unweighted_vbe_cuda.cu.o CMakeFiles/fbgemm_gpu_tbe_training_backward_vbe.dir/gen_embedding_backward_rowwise_adagrad_split_unweighted_vbe_kernel_cta.cu.o CMakeFiles/fbgemm_gpu_tbe_training_backward_vbe.dir/gen_embedding_backward_rowwise_adagrad_split_unweighted_vbe_kernel_warp.cu.o CMakeFiles/fbgemm_gpu_tbe_training_backward_vbe.dir/gen_embedding_backward_rowwise_adagrad_ssd_unweighted_vbe_cuda.cu.o CMakeFiles/fbgemm_gpu_tbe_training_backward_vbe.dir/gen_embedding_backward_rowwise_adagrad_ssd_unweighted_vbe_kernel_cta.cu.o CMakeFiles/fbgemm_gpu_tbe_training_backward_vbe.dir/gen_embedding_backward_rowwise_adagrad_ssd_unweighted_vbe_kernel_warp.cu.o CMakeFiles/fbgemm_gpu_tbe_training_backward_vbe.dir/gen_embedding_backward_rowwise_adagrad_with_counter_split_weighted_vbe_cuda.cu.o CMakeFiles/fbgemm_gpu_tbe_training_backward_vbe.dir/gen_embedding_backward_rowwise_adagrad_with_counter_split_weighted_vbe_kernel_cta.cu.o CMakeFiles/fbgemm_gpu_tbe_training_backward_vbe.dir/gen_embedding_backward_rowwise_adagrad_with_counter_split_weighted_vbe_kernel_warp.cu.o CMakeFiles/fbgemm_gpu_tbe_training_backward_vbe.dir/gen_embedding_backward_rowwise_adagrad_with_counter_split_unweighted_vbe_cuda.cu.o CMakeFiles/fbgemm_gpu_tbe_training_backward_vbe.dir/gen_embedding_backward_rowwise_adagrad_with_counter_split_unweighted_vbe_kernel_cta.cu.o CMakeFiles/fbgemm_gpu_tbe_training_backward_vbe.dir/gen_embedding_backward_rowwise_adagrad_with_counter_split_unweighted_vbe_kernel_warp.cu.o CMakeFiles/fbgemm_gpu_tbe_training_backward_vbe.dir/gen_embedding_backward_sgd_split_weighted_vbe_cuda.cu.o CMakeFiles/fbgemm_gpu_tbe_training_backward_vbe.dir/gen_embedding_backward_sgd_split_weighted_vbe_kernel_cta.cu.o CMakeFiles/fbgemm_gpu_tbe_training_backward_vbe.dir/gen_embedding_backward_sgd_split_weighted_vbe_kernel_warp.cu.o CMakeFiles/fbgemm_gpu_tbe_training_backward_vbe.dir/gen_embedding_backward_sgd_split_unweighted_vbe_cuda.cu.o CMakeFiles/fbgemm_gpu_tbe_training_backward_vbe.dir/gen_embedding_backward_sgd_split_unweighted_vbe_kernel_cta.cu.o CMakeFiles/fbgemm_gpu_tbe_training_backward_vbe.dir/gen_embedding_backward_sgd_split_unweighted_vbe_kernel_warp.cu.o CMakeFiles/fbgemm_gpu_tbe_training_backward_vbe.dir/gen_embedding_backward_dense_split_weighted_vbe_cuda.cu.o CMakeFiles/fbgemm_gpu_tbe_training_backward_vbe.dir/gen_embedding_backward_dense_split_weighted_vbe_kernel_cta.cu.o CMakeFiles/fbgemm_gpu_tbe_training_backward_vbe.dir/gen_embedding_backward_dense_split_weighted_vbe_kernel_warp.cu.o CMakeFiles/fbgemm_gpu_tbe_training_backward_vbe.dir/gen_embedding_backward_dense_split_unweighted_vbe_cuda.cu.o CMakeFiles/fbgemm_gpu_tbe_training_backward_vbe.dir/gen_embedding_backward_dense_split_unweighted_vbe_kernel_cta.cu.o CMakeFiles/fbgemm_gpu_tbe_training_backward_vbe.dir/gen_embedding_backward_dense_split_unweighted_vbe_kernel_warp.cu.o CMakeFiles/fbgemm_gpu_tbe_training_backward_vbe.dir/gen_embedding_backward_adam_split_weighted_vbe_cuda.cu.o CMakeFiles/fbgemm_gpu_tbe_training_backward_vbe.dir/gen_embedding_backward_adam_split_weighted_vbe_kernel_cta.cu.o CMakeFiles/fbgemm_gpu_tbe_training_backward_vbe.dir/gen_embedding_backward_adam_split_weighted_vbe_kernel_warp.cu.o CMakeFiles/fbgemm_gpu_tbe_training_backward_vbe.dir/gen_embedding_backward_adam_split_unweighted_vbe_cuda.cu.o CMakeFiles/fbgemm_gpu_tbe_training_backward_vbe.dir/gen_embedding_backward_adam_split_unweighted_vbe_kernel_cta.cu.o CMakeFiles/fbgemm_gpu_tbe_training_backward_vbe.dir/gen_embedding_backward_adam_split_unweighted_vbe_kernel_warp.cu.o -L/lib/intel64 -L/lib/intel64_win -L/lib/win-x64 -Wl,-rpath,"\$ORIGIN" /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/lib/libc10.so /github/home/miniconda/envs/build_binary/targets/x86_64-linux/lib/libnvrtc.so /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/lib/libc10_cuda.so /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/nvidia/nccl/lib/libnccl.so.2 /github/home/miniconda/envs/build_binary/targets/x86_64-linux/lib/stubs/libcuda.so fbgemm_gpu_tbe_training_backward.so /github/home/miniconda/envs/build_binary/lib/stubs/libnvidia-ml.so -Wl,--no-as-needed,"/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/lib/libtorch.so" -Wl,--as-needed /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/lib/libtorch.so -Wl,--no-as-needed,"/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/lib/libtorch_cpu.so" -Wl,--as-needed -Wl,--no-as-needed,"/github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/lib/libtorch_cuda.so" -Wl,--as-needed /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/lib/libc10_cuda.so /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/torch/lib/libc10.so /github/home/miniconda/envs/build_binary/targets/x86_64-linux/lib/libcudart.so -lcudadevrt -lcudart_static -lrt -lpthread -ldl -Wl,-rpath-link,/__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build && : 2025-05-07T20:09:29.5210456Z [607/608] cd /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build && bash /__w/FBGEMM/FBGEMM/fbgemm_gpu/../.github/scripts/fbgemm_gpu_postbuild.bash /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/fbgemm_gpu_tbe_training_backward_vbe.so 1 2025-05-07T20:09:29.5211809Z ################################################################################ 2025-05-07T20:09:29.5212190Z [CMAKE] Running post-build script ... 2025-05-07T20:09:29.5212867Z Target file: /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build/fbgemm_gpu_tbe_training_backward_vbe.so 2025-05-07T20:09:29.5213546Z Resetting RPATH to $ORIGIN ... 2025-05-07T20:09:29.5213935Z 0x000000000000000f (RPATH) Library rpath: [$ORIGIN] 2025-05-07T20:09:29.5214633Z ################################################################################ 2025-05-07T20:09:29.5215634Z [607/608] cd /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-build && /github/home/miniconda/envs/build_binary/lib/python3.13/site-packages/cmake/data/bin/cmake -P cmake_install.cmake 2025-05-07T20:09:29.5264790Z -- Install configuration: "Release" 2025-05-07T20:09:29.5265479Z -- Installing: /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-install/fbgemm_gpu/asmjit.so 2025-05-07T20:09:29.5281695Z -- Installing: /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-install/fbgemm_gpu/fbgemm.so 2025-05-07T20:09:29.5284272Z -- Installing: /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-install/fbgemm_gpu/fbgemm_gpu_tbe_cache.so 2025-05-07T20:09:29.5300493Z -- Installing: /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-install/fbgemm_gpu/fbgemm_gpu_tbe_inference.so 2025-05-07T20:09:29.5301658Z -- Installing: /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-install/fbgemm_gpu/fbgemm_gpu_config.so 2025-05-07T20:09:29.5331927Z -- Installing: /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-install/fbgemm_gpu/fbgemm_gpu_tbe_utils.so 2025-05-07T20:09:29.5350606Z -- Installing: /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-install/fbgemm_gpu/fbgemm_gpu_sparse_async_cumsum.so 2025-05-07T20:09:29.5352222Z -- Installing: /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-install/fbgemm_gpu/fbgemm_gpu_tbe_common.so 2025-05-07T20:09:29.5353309Z -- Installing: /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-install/fbgemm_gpu/fbgemm_gpu_tbe_optimizers.so 2025-05-07T20:09:29.5369966Z -- Installing: /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-install/fbgemm_gpu/fbgemm_gpu_tbe_training_forward.so 2025-05-07T20:09:29.5371276Z -- Installing: /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-install/fbgemm_gpu/fbgemm_gpu_tbe_training_backward_pt2.so 2025-05-07T20:09:29.5373982Z -- Installing: /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-install/fbgemm_gpu/fbgemm_gpu_tbe_training_backward.so 2025-05-07T20:09:29.5375183Z -- Installing: /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-install/fbgemm_gpu/fbgemm_gpu_tbe_training_backward_gwd.so 2025-05-07T20:09:29.5376278Z -- Installing: /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-install/fbgemm_gpu/fbgemm_gpu_tbe_training_backward_vbe.so 2025-05-07T20:09:29.5377397Z -- Installing: /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-install/fbgemm_gpu/fbgemm_gpu_tbe_training_backward_dense.so 2025-05-07T20:09:29.5378589Z -- Installing: /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-install/fbgemm_gpu/fbgemm_gpu_tbe_training_backward_split_host.so 2025-05-07T20:09:29.5379687Z -- Installing: /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-install/fbgemm_gpu/fbgemm_gpu_tbe_index_select.so 2025-05-07T20:09:29.5380795Z -- Installing: /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-install/fbgemm_gpu/split_embedding_codegen_lookup_invokers/lookup_adagrad.py 2025-05-07T20:09:29.5382124Z -- Installing: /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-install/fbgemm_gpu/split_embedding_codegen_lookup_invokers/lookup_rowwise_adagrad.py 2025-05-07T20:09:29.5383390Z -- Installing: /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-install/fbgemm_gpu/split_embedding_codegen_lookup_invokers/lookup_sgd.py 2025-05-07T20:09:29.5384576Z -- Installing: /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-install/fbgemm_gpu/split_embedding_codegen_lookup_invokers/lookup_adam.py 2025-05-07T20:09:29.5385797Z -- Installing: /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-install/fbgemm_gpu/split_embedding_codegen_lookup_invokers/lookup_lamb.py 2025-05-07T20:09:29.5387033Z -- Installing: /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-install/fbgemm_gpu/split_embedding_codegen_lookup_invokers/lookup_lars_sgd.py 2025-05-07T20:09:29.5388359Z -- Installing: /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-install/fbgemm_gpu/split_embedding_codegen_lookup_invokers/lookup_partial_rowwise_adam.py 2025-05-07T20:09:29.5389708Z -- Installing: /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-install/fbgemm_gpu/split_embedding_codegen_lookup_invokers/lookup_partial_rowwise_lamb.py 2025-05-07T20:09:29.5390980Z -- Installing: /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-install/fbgemm_gpu/split_embedding_codegen_lookup_invokers/lookup_none.py 2025-05-07T20:09:29.5392267Z -- Installing: /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-install/fbgemm_gpu/split_embedding_codegen_lookup_invokers/lookup_rowwise_adagrad_with_counter.py 2025-05-07T20:09:29.5393647Z -- Installing: /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-install/fbgemm_gpu/split_embedding_codegen_lookup_invokers/lookup_rowwise_adagrad_ssd.py 2025-05-07T20:09:29.5394938Z -- Installing: /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-install/fbgemm_gpu/split_embedding_codegen_lookup_invokers/__init__.py 2025-05-07T20:09:29.5396113Z -- Installing: /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-install/fbgemm_gpu/split_embedding_codegen_lookup_invokers/lookup_args.py 2025-05-07T20:09:29.5397346Z -- Installing: /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-install/fbgemm_gpu/split_embedding_codegen_lookup_invokers/lookup_args_ssd.py 2025-05-07T20:09:29.5398680Z -- Installing: /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-install/fbgemm_gpu/split_embedding_optimizer_codegen/split_embedding_optimizer_rowwise_adagrad.py 2025-05-07T20:09:29.5399962Z -- Installing: /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-install/fbgemm_gpu/split_embedding_optimizer_codegen/optimizer_args.py 2025-05-07T20:09:29.5401094Z -- Installing: /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-install/fbgemm_gpu/fbgemm_gpu_embedding_inplace_ops.so 2025-05-07T20:09:29.5412054Z -- Installing: /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.13/cmake-install/fbgemm_gpu/fbgemm_gpu_py.so 2025-05-07T20:09:29.5468507Z 2025-05-07T20:09:29.5504419Z 2025-05-07T20:09:29.5505343Z copying fbgemm_gpu/__init__.py -> _skbuild/linux-x86_64-3.13/cmake-install/fbgemm_gpu/__init__.py 2025-05-07T20:09:29.5507019Z copying fbgemm_gpu/batched_unary_embeddings_ops.py -> _skbuild/linux-x86_64-3.13/cmake-install/fbgemm_gpu/batched_unary_embeddings_ops.py 2025-05-07T20:09:29.5508664Z copying fbgemm_gpu/enums.py -> _skbuild/linux-x86_64-3.13/cmake-install/fbgemm_gpu/enums.py 2025-05-07T20:09:29.5509958Z copying fbgemm_gpu/metrics.py -> _skbuild/linux-x86_64-3.13/cmake-install/fbgemm_gpu/metrics.py 2025-05-07T20:09:29.5511740Z copying fbgemm_gpu/permute_pooled_embedding_modules.py -> _skbuild/linux-x86_64-3.13/cmake-install/fbgemm_gpu/permute_pooled_embedding_modules.py 2025-05-07T20:09:29.5513973Z copying fbgemm_gpu/permute_pooled_embedding_modules_split.py -> _skbuild/linux-x86_64-3.13/cmake-install/fbgemm_gpu/permute_pooled_embedding_modules_split.py 2025-05-07T20:09:29.5516031Z copying fbgemm_gpu/quantize_comm.py -> _skbuild/linux-x86_64-3.13/cmake-install/fbgemm_gpu/quantize_comm.py 2025-05-07T20:09:29.5517574Z copying fbgemm_gpu/quantize_utils.py -> _skbuild/linux-x86_64-3.13/cmake-install/fbgemm_gpu/quantize_utils.py 2025-05-07T20:09:29.5519114Z copying fbgemm_gpu/runtime_monitor.py -> _skbuild/linux-x86_64-3.13/cmake-install/fbgemm_gpu/runtime_monitor.py 2025-05-07T20:09:29.5520623Z copying fbgemm_gpu/sparse_ops.py -> _skbuild/linux-x86_64-3.13/cmake-install/fbgemm_gpu/sparse_ops.py 2025-05-07T20:09:29.5522285Z copying fbgemm_gpu/split_embedding_configs.py -> _skbuild/linux-x86_64-3.13/cmake-install/fbgemm_gpu/split_embedding_configs.py 2025-05-07T20:09:29.5524285Z copying fbgemm_gpu/split_embedding_inference_converter.py -> _skbuild/linux-x86_64-3.13/cmake-install/fbgemm_gpu/split_embedding_inference_converter.py 2025-05-07T20:09:29.5526411Z copying fbgemm_gpu/split_embedding_optimizer_ops.py -> _skbuild/linux-x86_64-3.13/cmake-install/fbgemm_gpu/split_embedding_optimizer_ops.py 2025-05-07T20:09:29.5528083Z copying fbgemm_gpu/split_embedding_utils.py -> _skbuild/linux-x86_64-3.13/cmake-install/fbgemm_gpu/split_embedding_utils.py 2025-05-07T20:09:29.5530115Z copying fbgemm_gpu/split_table_batched_embeddings_ops.py -> _skbuild/linux-x86_64-3.13/cmake-install/fbgemm_gpu/split_table_batched_embeddings_ops.py 2025-05-07T20:09:29.5532441Z copying fbgemm_gpu/split_table_batched_embeddings_ops_common.py -> _skbuild/linux-x86_64-3.13/cmake-install/fbgemm_gpu/split_table_batched_embeddings_ops_common.py 2025-05-07T20:09:29.5534837Z copying fbgemm_gpu/split_table_batched_embeddings_ops_inference.py -> _skbuild/linux-x86_64-3.13/cmake-install/fbgemm_gpu/split_table_batched_embeddings_ops_inference.py 2025-05-07T20:09:29.5537290Z copying fbgemm_gpu/split_table_batched_embeddings_ops_training.py -> _skbuild/linux-x86_64-3.13/cmake-install/fbgemm_gpu/split_table_batched_embeddings_ops_training.py 2025-05-07T20:09:29.5539847Z copying fbgemm_gpu/split_table_batched_embeddings_ops_training_common.py -> _skbuild/linux-x86_64-3.13/cmake-install/fbgemm_gpu/split_table_batched_embeddings_ops_training_common.py 2025-05-07T20:09:29.5542347Z copying fbgemm_gpu/ssd_split_table_batched_embeddings_ops.py -> _skbuild/linux-x86_64-3.13/cmake-install/fbgemm_gpu/ssd_split_table_batched_embeddings_ops.py 2025-05-07T20:09:29.5544399Z copying fbgemm_gpu/tbe_input_multiplexer.py -> _skbuild/linux-x86_64-3.13/cmake-install/fbgemm_gpu/tbe_input_multiplexer.py 2025-05-07T20:09:29.5545891Z copying fbgemm_gpu/uvm.py -> _skbuild/linux-x86_64-3.13/cmake-install/fbgemm_gpu/uvm.py 2025-05-07T20:09:29.5546992Z creating directory _skbuild/linux-x86_64-3.13/cmake-install/fbgemm_gpu/config 2025-05-07T20:09:29.5548283Z copying fbgemm_gpu/config/__init__.py -> _skbuild/linux-x86_64-3.13/cmake-install/fbgemm_gpu/config/__init__.py 2025-05-07T20:09:29.5550017Z copying fbgemm_gpu/config/feature_list.py -> _skbuild/linux-x86_64-3.13/cmake-install/fbgemm_gpu/config/feature_list.py 2025-05-07T20:09:29.5551468Z creating directory _skbuild/linux-x86_64-3.13/cmake-install/fbgemm_gpu/docs 2025-05-07T20:09:29.5552793Z copying fbgemm_gpu/docs/__init__.py -> _skbuild/linux-x86_64-3.13/cmake-install/fbgemm_gpu/docs/__init__.py 2025-05-07T20:09:29.5554274Z copying fbgemm_gpu/docs/common.py -> _skbuild/linux-x86_64-3.13/cmake-install/fbgemm_gpu/docs/common.py 2025-05-07T20:09:29.5555724Z copying fbgemm_gpu/docs/examples.py -> _skbuild/linux-x86_64-3.13/cmake-install/fbgemm_gpu/docs/examples.py 2025-05-07T20:09:29.5557428Z copying fbgemm_gpu/docs/jagged_tensor_ops.py -> _skbuild/linux-x86_64-3.13/cmake-install/fbgemm_gpu/docs/jagged_tensor_ops.py 2025-05-07T20:09:29.5559344Z copying fbgemm_gpu/docs/merge_pooled_embedding_ops.py -> _skbuild/linux-x86_64-3.13/cmake-install/fbgemm_gpu/docs/merge_pooled_embedding_ops.py 2025-05-07T20:09:29.5561403Z copying fbgemm_gpu/docs/permute_pooled_embedding_ops.py -> _skbuild/linux-x86_64-3.13/cmake-install/fbgemm_gpu/docs/permute_pooled_embedding_ops.py 2025-05-07T20:09:29.5563340Z copying fbgemm_gpu/docs/quantize_ops.py -> _skbuild/linux-x86_64-3.13/cmake-install/fbgemm_gpu/docs/quantize_ops.py 2025-05-07T20:09:29.5565531Z copying fbgemm_gpu/docs/sparse_ops.py -> _skbuild/linux-x86_64-3.13/cmake-install/fbgemm_gpu/docs/sparse_ops.py 2025-05-07T20:09:29.5567005Z copying fbgemm_gpu/docs/version.py -> _skbuild/linux-x86_64-3.13/cmake-install/fbgemm_gpu/docs/version.py 2025-05-07T20:09:29.5568332Z creating directory _skbuild/linux-x86_64-3.13/cmake-install/fbgemm_gpu/quantize 2025-05-07T20:09:29.5569620Z copying fbgemm_gpu/quantize/__init__.py -> _skbuild/linux-x86_64-3.13/cmake-install/fbgemm_gpu/quantize/__init__.py 2025-05-07T20:09:29.5571416Z copying fbgemm_gpu/quantize/quantize_ops.py -> _skbuild/linux-x86_64-3.13/cmake-install/fbgemm_gpu/quantize/quantize_ops.py 2025-05-07T20:09:29.5572820Z creating directory _skbuild/linux-x86_64-3.13/cmake-install/fbgemm_gpu/sll 2025-05-07T20:09:29.5574187Z copying fbgemm_gpu/sll/__init__.py -> _skbuild/linux-x86_64-3.13/cmake-install/fbgemm_gpu/sll/__init__.py 2025-05-07T20:09:29.5575477Z creating directory _skbuild/linux-x86_64-3.13/cmake-install/fbgemm_gpu/tbe 2025-05-07T20:09:29.5576742Z copying fbgemm_gpu/tbe/__init__.py -> _skbuild/linux-x86_64-3.13/cmake-install/fbgemm_gpu/tbe/__init__.py 2025-05-07T20:09:29.5578034Z creating directory _skbuild/linux-x86_64-3.13/cmake-install/fbgemm_gpu/triton 2025-05-07T20:09:29.5579359Z copying fbgemm_gpu/triton/__init__.py -> _skbuild/linux-x86_64-3.13/cmake-install/fbgemm_gpu/triton/__init__.py 2025-05-07T20:09:29.5580843Z copying fbgemm_gpu/triton/common.py -> _skbuild/linux-x86_64-3.13/cmake-install/fbgemm_gpu/triton/common.py 2025-05-07T20:09:29.5582444Z copying fbgemm_gpu/triton/quantize.py -> _skbuild/linux-x86_64-3.13/cmake-install/fbgemm_gpu/triton/quantize.py 2025-05-07T20:09:29.5584080Z copying fbgemm_gpu/triton/quantize_ref.py -> _skbuild/linux-x86_64-3.13/cmake-install/fbgemm_gpu/triton/quantize_ref.py 2025-05-07T20:09:29.5585538Z creating directory _skbuild/linux-x86_64-3.13/cmake-install/fbgemm_gpu/utils 2025-05-07T20:09:29.5586886Z copying fbgemm_gpu/utils/__init__.py -> _skbuild/linux-x86_64-3.13/cmake-install/fbgemm_gpu/utils/__init__.py 2025-05-07T20:09:29.5588432Z copying fbgemm_gpu/utils/filestore.py -> _skbuild/linux-x86_64-3.13/cmake-install/fbgemm_gpu/utils/filestore.py 2025-05-07T20:09:29.5589911Z copying fbgemm_gpu/utils/loader.py -> _skbuild/linux-x86_64-3.13/cmake-install/fbgemm_gpu/utils/loader.py 2025-05-07T20:09:29.5591488Z copying fbgemm_gpu/utils/torch_library.py -> _skbuild/linux-x86_64-3.13/cmake-install/fbgemm_gpu/utils/torch_library.py 2025-05-07T20:09:29.5592929Z creating directory _skbuild/linux-x86_64-3.13/cmake-install/fbgemm_gpu/sll/cpu 2025-05-07T20:09:29.5594232Z copying fbgemm_gpu/sll/cpu/__init__.py -> _skbuild/linux-x86_64-3.13/cmake-install/fbgemm_gpu/sll/cpu/__init__.py 2025-05-07T20:09:29.5595766Z copying fbgemm_gpu/sll/cpu/cpu_sll.py -> _skbuild/linux-x86_64-3.13/cmake-install/fbgemm_gpu/sll/cpu/cpu_sll.py 2025-05-07T20:09:29.5597270Z creating directory _skbuild/linux-x86_64-3.13/cmake-install/fbgemm_gpu/sll/meta 2025-05-07T20:09:29.5598702Z copying fbgemm_gpu/sll/meta/__init__.py -> _skbuild/linux-x86_64-3.13/cmake-install/fbgemm_gpu/sll/meta/__init__.py 2025-05-07T20:09:29.5600285Z copying fbgemm_gpu/sll/meta/meta_sll.py -> _skbuild/linux-x86_64-3.13/cmake-install/fbgemm_gpu/sll/meta/meta_sll.py 2025-05-07T20:09:29.5601696Z creating directory _skbuild/linux-x86_64-3.13/cmake-install/fbgemm_gpu/sll/triton 2025-05-07T20:09:29.5603100Z copying fbgemm_gpu/sll/triton/__init__.py -> _skbuild/linux-x86_64-3.13/cmake-install/fbgemm_gpu/sll/triton/__init__.py 2025-05-07T20:09:29.5604740Z copying fbgemm_gpu/sll/triton/common.py -> _skbuild/linux-x86_64-3.13/cmake-install/fbgemm_gpu/sll/triton/common.py 2025-05-07T20:09:29.5606773Z copying fbgemm_gpu/sll/triton/triton_dense_jagged_cat_jagged_out.py -> _skbuild/linux-x86_64-3.13/cmake-install/fbgemm_gpu/sll/triton/triton_dense_jagged_cat_jagged_out.py 2025-05-07T20:09:29.5609305Z copying fbgemm_gpu/sll/triton/triton_jagged2_to_padded_dense.py -> _skbuild/linux-x86_64-3.13/cmake-install/fbgemm_gpu/sll/triton/triton_jagged2_to_padded_dense.py 2025-05-07T20:09:29.5611672Z copying fbgemm_gpu/sll/triton/triton_jagged_bmm.py -> _skbuild/linux-x86_64-3.13/cmake-install/fbgemm_gpu/sll/triton/triton_jagged_bmm.py 2025-05-07T20:09:29.5613640Z copying fbgemm_gpu/sll/triton/triton_jagged_bmm_jagged_out.py -> _skbuild/linux-x86_64-3.13/cmake-install/fbgemm_gpu/sll/triton/triton_jagged_bmm_jagged_out.py 2025-05-07T20:09:29.5616034Z copying fbgemm_gpu/sll/triton/triton_jagged_dense_elementwise_add.py -> _skbuild/linux-x86_64-3.13/cmake-install/fbgemm_gpu/sll/triton/triton_jagged_dense_elementwise_add.py 2025-05-07T20:09:29.5618795Z copying fbgemm_gpu/sll/triton/triton_jagged_dense_elementwise_mul_jagged_out.py -> _skbuild/linux-x86_64-3.13/cmake-install/fbgemm_gpu/sll/triton/triton_jagged_dense_elementwise_mul_jagged_out.py 2025-05-07T20:09:29.5621607Z copying fbgemm_gpu/sll/triton/triton_jagged_dense_flash_attention.py -> _skbuild/linux-x86_64-3.13/cmake-install/fbgemm_gpu/sll/triton/triton_jagged_dense_flash_attention.py 2025-05-07T20:09:29.5624160Z copying fbgemm_gpu/sll/triton/triton_jagged_flash_attention_basic.py -> _skbuild/linux-x86_64-3.13/cmake-install/fbgemm_gpu/sll/triton/triton_jagged_flash_attention_basic.py 2025-05-07T20:09:29.5626807Z copying fbgemm_gpu/sll/triton/triton_jagged_self_substraction_jagged_out.py -> _skbuild/linux-x86_64-3.13/cmake-install/fbgemm_gpu/sll/triton/triton_jagged_self_substraction_jagged_out.py 2025-05-07T20:09:29.5629482Z copying fbgemm_gpu/sll/triton/triton_jagged_softmax.py -> _skbuild/linux-x86_64-3.13/cmake-install/fbgemm_gpu/sll/triton/triton_jagged_softmax.py 2025-05-07T20:09:29.5631563Z copying fbgemm_gpu/sll/triton/triton_multi_head_jagged_flash_attention.py -> _skbuild/linux-x86_64-3.13/cmake-install/fbgemm_gpu/sll/triton/triton_multi_head_jagged_flash_attention.py 2025-05-07T20:09:29.5633226Z creating directory _skbuild/linux-x86_64-3.13/cmake-install/fbgemm_gpu/tbe/bench 2025-05-07T20:09:29.5634430Z copying fbgemm_gpu/tbe/bench/__init__.py -> _skbuild/linux-x86_64-3.13/cmake-install/fbgemm_gpu/tbe/bench/__init__.py 2025-05-07T20:09:29.5636020Z copying fbgemm_gpu/tbe/bench/bench_config.py -> _skbuild/linux-x86_64-3.13/cmake-install/fbgemm_gpu/tbe/bench/bench_config.py 2025-05-07T20:09:29.5637683Z copying fbgemm_gpu/tbe/bench/bench_runs.py -> _skbuild/linux-x86_64-3.13/cmake-install/fbgemm_gpu/tbe/bench/bench_runs.py 2025-05-07T20:09:29.5639511Z copying fbgemm_gpu/tbe/bench/eeg_cli.py -> _skbuild/linux-x86_64-3.13/cmake-install/fbgemm_gpu/tbe/bench/eeg_cli.py 2025-05-07T20:09:29.5641450Z copying fbgemm_gpu/tbe/bench/embedding_ops_common_config.py -> _skbuild/linux-x86_64-3.13/cmake-install/fbgemm_gpu/tbe/bench/embedding_ops_common_config.py 2025-05-07T20:09:29.5643630Z copying fbgemm_gpu/tbe/bench/eval_compression.py -> _skbuild/linux-x86_64-3.13/cmake-install/fbgemm_gpu/tbe/bench/eval_compression.py 2025-05-07T20:09:29.5645613Z copying fbgemm_gpu/tbe/bench/reporter.py -> _skbuild/linux-x86_64-3.13/cmake-install/fbgemm_gpu/tbe/bench/reporter.py 2025-05-07T20:09:29.5647507Z copying fbgemm_gpu/tbe/bench/tbe_data_config.py -> _skbuild/linux-x86_64-3.13/cmake-install/fbgemm_gpu/tbe/bench/tbe_data_config.py 2025-05-07T20:09:29.5649405Z copying fbgemm_gpu/tbe/bench/tbe_data_config_loader.py -> _skbuild/linux-x86_64-3.13/cmake-install/fbgemm_gpu/tbe/bench/tbe_data_config_loader.py 2025-05-07T20:09:29.5651860Z copying fbgemm_gpu/tbe/bench/tbe_data_config_param_models.py -> _skbuild/linux-x86_64-3.13/cmake-install/fbgemm_gpu/tbe/bench/tbe_data_config_param_models.py 2025-05-07T20:09:29.5653777Z copying fbgemm_gpu/tbe/bench/utils.py -> _skbuild/linux-x86_64-3.13/cmake-install/fbgemm_gpu/tbe/bench/utils.py 2025-05-07T20:09:29.5655341Z creating directory _skbuild/linux-x86_64-3.13/cmake-install/fbgemm_gpu/tbe/cache 2025-05-07T20:09:29.5656758Z copying fbgemm_gpu/tbe/cache/__init__.py -> _skbuild/linux-x86_64-3.13/cmake-install/fbgemm_gpu/tbe/cache/__init__.py 2025-05-07T20:09:29.5658701Z copying fbgemm_gpu/tbe/cache/split_embeddings_cache_ops.py -> _skbuild/linux-x86_64-3.13/cmake-install/fbgemm_gpu/tbe/cache/split_embeddings_cache_ops.py 2025-05-07T20:09:29.5660568Z creating directory _skbuild/linux-x86_64-3.13/cmake-install/fbgemm_gpu/tbe/ssd 2025-05-07T20:09:29.5661900Z copying fbgemm_gpu/tbe/ssd/__init__.py -> _skbuild/linux-x86_64-3.13/cmake-install/fbgemm_gpu/tbe/ssd/__init__.py 2025-05-07T20:09:29.5663576Z copying fbgemm_gpu/tbe/ssd/common.py -> _skbuild/linux-x86_64-3.13/cmake-install/fbgemm_gpu/tbe/ssd/common.py 2025-05-07T20:09:29.5665100Z copying fbgemm_gpu/tbe/ssd/inference.py -> _skbuild/linux-x86_64-3.13/cmake-install/fbgemm_gpu/tbe/ssd/inference.py 2025-05-07T20:09:29.5666633Z copying fbgemm_gpu/tbe/ssd/training.py -> _skbuild/linux-x86_64-3.13/cmake-install/fbgemm_gpu/tbe/ssd/training.py 2025-05-07T20:09:29.5667960Z creating directory _skbuild/linux-x86_64-3.13/cmake-install/fbgemm_gpu/tbe/stats 2025-05-07T20:09:29.5669436Z copying fbgemm_gpu/tbe/stats/__init__.py -> _skbuild/linux-x86_64-3.13/cmake-install/fbgemm_gpu/tbe/stats/__init__.py 2025-05-07T20:09:29.5671144Z copying fbgemm_gpu/tbe/stats/bench_params_reporter.py -> _skbuild/linux-x86_64-3.13/cmake-install/fbgemm_gpu/tbe/stats/bench_params_reporter.py 2025-05-07T20:09:29.5672694Z creating directory _skbuild/linux-x86_64-3.13/cmake-install/fbgemm_gpu/tbe/utils 2025-05-07T20:09:29.5673983Z copying fbgemm_gpu/tbe/utils/__init__.py -> _skbuild/linux-x86_64-3.13/cmake-install/fbgemm_gpu/tbe/utils/__init__.py 2025-05-07T20:09:29.5675498Z copying fbgemm_gpu/tbe/utils/common.py -> _skbuild/linux-x86_64-3.13/cmake-install/fbgemm_gpu/tbe/utils/common.py 2025-05-07T20:09:29.5677018Z copying fbgemm_gpu/tbe/utils/offsets.py -> _skbuild/linux-x86_64-3.13/cmake-install/fbgemm_gpu/tbe/utils/offsets.py 2025-05-07T20:09:29.5678575Z copying fbgemm_gpu/tbe/utils/quantize.py -> _skbuild/linux-x86_64-3.13/cmake-install/fbgemm_gpu/tbe/utils/quantize.py 2025-05-07T20:09:29.5680185Z copying fbgemm_gpu/tbe/utils/requests.py -> _skbuild/linux-x86_64-3.13/cmake-install/fbgemm_gpu/tbe/utils/requests.py 2025-05-07T20:09:29.5681533Z creating directory _skbuild/linux-x86_64-3.13/cmake-install/fbgemm_gpu/tbe/ssd/utils 2025-05-07T20:09:29.5682917Z copying fbgemm_gpu/tbe/ssd/utils/__init__.py -> _skbuild/linux-x86_64-3.13/cmake-install/fbgemm_gpu/tbe/ssd/utils/__init__.py 2025-05-07T20:09:29.5684869Z copying fbgemm_gpu/tbe/ssd/utils/partially_materialized_tensor.py -> _skbuild/linux-x86_64-3.13/cmake-install/fbgemm_gpu/tbe/ssd/utils/partially_materialized_tensor.py 2025-05-07T20:09:29.5686602Z creating directory _skbuild/linux-x86_64-3.13/cmake-install/fbgemm_gpu/triton/jagged 2025-05-07T20:09:29.5688006Z copying fbgemm_gpu/triton/jagged/__init__.py -> _skbuild/linux-x86_64-3.13/cmake-install/fbgemm_gpu/triton/jagged/__init__.py 2025-05-07T20:09:29.5689888Z copying fbgemm_gpu/triton/jagged/triton_jagged_tensor_ops.py -> _skbuild/linux-x86_64-3.13/cmake-install/fbgemm_gpu/triton/jagged/triton_jagged_tensor_ops.py 2025-05-07T20:09:29.5691442Z 2025-05-07T20:09:29.5778775Z INFO:root:running bdist_wheel 2025-05-07T20:09:29.5827170Z INFO:root:running build 2025-05-07T20:09:29.5827838Z INFO:root:running build_py 2025-05-07T20:09:29.5832249Z INFO:root:creating _skbuild/linux-x86_64-3.13/setuptools/lib.linux-x86_64-cpython-313/fbgemm_gpu 2025-05-07T20:09:29.5834064Z INFO:root:copying _skbuild/linux-x86_64-3.13/cmake-install/fbgemm_gpu/__init__.py -> _skbuild/linux-x86_64-3.13/setuptools/lib.linux-x86_64-cpython-313/fbgemm_gpu 2025-05-07T20:09:29.5836744Z INFO:root:copying _skbuild/linux-x86_64-3.13/cmake-install/fbgemm_gpu/batched_unary_embeddings_ops.py -> _skbuild/linux-x86_64-3.13/setuptools/lib.linux-x86_64-cpython-313/fbgemm_gpu 2025-05-07T20:09:29.5839247Z INFO:root:copying _skbuild/linux-x86_64-3.13/cmake-install/fbgemm_gpu/enums.py -> _skbuild/linux-x86_64-3.13/setuptools/lib.linux-x86_64-cpython-313/fbgemm_gpu 2025-05-07T20:09:29.5841601Z INFO:root:copying _skbuild/linux-x86_64-3.13/cmake-install/fbgemm_gpu/metrics.py -> _skbuild/linux-x86_64-3.13/setuptools/lib.linux-x86_64-cpython-313/fbgemm_gpu 2025-05-07T20:09:29.5844243Z INFO:root:copying _skbuild/linux-x86_64-3.13/cmake-install/fbgemm_gpu/permute_pooled_embedding_modules.py -> _skbuild/linux-x86_64-3.13/setuptools/lib.linux-x86_64-cpython-313/fbgemm_gpu 2025-05-07T20:09:29.5846913Z INFO:root:copying _skbuild/linux-x86_64-3.13/cmake-install/fbgemm_gpu/permute_pooled_embedding_modules_split.py -> _skbuild/linux-x86_64-3.13/setuptools/lib.linux-x86_64-cpython-313/fbgemm_gpu 2025-05-07T20:09:29.5849489Z INFO:root:copying _skbuild/linux-x86_64-3.13/cmake-install/fbgemm_gpu/quantize_comm.py -> _skbuild/linux-x86_64-3.13/setuptools/lib.linux-x86_64-cpython-313/fbgemm_gpu 2025-05-07T20:09:29.5851930Z INFO:root:copying _skbuild/linux-x86_64-3.13/cmake-install/fbgemm_gpu/quantize_utils.py -> _skbuild/linux-x86_64-3.13/setuptools/lib.linux-x86_64-cpython-313/fbgemm_gpu 2025-05-07T20:09:29.5854196Z INFO:root:copying _skbuild/linux-x86_64-3.13/cmake-install/fbgemm_gpu/runtime_monitor.py -> _skbuild/linux-x86_64-3.13/setuptools/lib.linux-x86_64-cpython-313/fbgemm_gpu 2025-05-07T20:09:29.5856649Z INFO:root:copying _skbuild/linux-x86_64-3.13/cmake-install/fbgemm_gpu/sparse_ops.py -> _skbuild/linux-x86_64-3.13/setuptools/lib.linux-x86_64-cpython-313/fbgemm_gpu 2025-05-07T20:09:29.5859163Z INFO:root:copying _skbuild/linux-x86_64-3.13/cmake-install/fbgemm_gpu/split_embedding_configs.py -> _skbuild/linux-x86_64-3.13/setuptools/lib.linux-x86_64-cpython-313/fbgemm_gpu 2025-05-07T20:09:29.5861798Z INFO:root:copying _skbuild/linux-x86_64-3.13/cmake-install/fbgemm_gpu/split_embedding_inference_converter.py -> _skbuild/linux-x86_64-3.13/setuptools/lib.linux-x86_64-cpython-313/fbgemm_gpu 2025-05-07T20:09:29.5864490Z INFO:root:copying _skbuild/linux-x86_64-3.13/cmake-install/fbgemm_gpu/split_embedding_optimizer_ops.py -> _skbuild/linux-x86_64-3.13/setuptools/lib.linux-x86_64-cpython-313/fbgemm_gpu 2025-05-07T20:09:29.5866995Z INFO:root:copying _skbuild/linux-x86_64-3.13/cmake-install/fbgemm_gpu/split_embedding_utils.py -> _skbuild/linux-x86_64-3.13/setuptools/lib.linux-x86_64-cpython-313/fbgemm_gpu 2025-05-07T20:09:29.5869516Z INFO:root:copying _skbuild/linux-x86_64-3.13/cmake-install/fbgemm_gpu/split_table_batched_embeddings_ops.py -> _skbuild/linux-x86_64-3.13/setuptools/lib.linux-x86_64-cpython-313/fbgemm_gpu 2025-05-07T20:09:29.5872140Z INFO:root:copying _skbuild/linux-x86_64-3.13/cmake-install/fbgemm_gpu/split_table_batched_embeddings_ops_common.py -> _skbuild/linux-x86_64-3.13/setuptools/lib.linux-x86_64-cpython-313/fbgemm_gpu 2025-05-07T20:09:29.5874722Z INFO:root:copying _skbuild/linux-x86_64-3.13/cmake-install/fbgemm_gpu/split_table_batched_embeddings_ops_inference.py -> _skbuild/linux-x86_64-3.13/setuptools/lib.linux-x86_64-cpython-313/fbgemm_gpu 2025-05-07T20:09:29.5877508Z INFO:root:copying _skbuild/linux-x86_64-3.13/cmake-install/fbgemm_gpu/split_table_batched_embeddings_ops_training.py -> _skbuild/linux-x86_64-3.13/setuptools/lib.linux-x86_64-cpython-313/fbgemm_gpu 2025-05-07T20:09:29.5880522Z INFO:root:copying _skbuild/linux-x86_64-3.13/cmake-install/fbgemm_gpu/split_table_batched_embeddings_ops_training_common.py -> _skbuild/linux-x86_64-3.13/setuptools/lib.linux-x86_64-cpython-313/fbgemm_gpu 2025-05-07T20:09:29.5883346Z INFO:root:copying _skbuild/linux-x86_64-3.13/cmake-install/fbgemm_gpu/ssd_split_table_batched_embeddings_ops.py -> _skbuild/linux-x86_64-3.13/setuptools/lib.linux-x86_64-cpython-313/fbgemm_gpu 2025-05-07T20:09:29.5885901Z INFO:root:copying _skbuild/linux-x86_64-3.13/cmake-install/fbgemm_gpu/tbe_input_multiplexer.py -> _skbuild/linux-x86_64-3.13/setuptools/lib.linux-x86_64-cpython-313/fbgemm_gpu 2025-05-07T20:09:29.5888255Z INFO:root:copying _skbuild/linux-x86_64-3.13/cmake-install/fbgemm_gpu/uvm.py -> _skbuild/linux-x86_64-3.13/setuptools/lib.linux-x86_64-cpython-313/fbgemm_gpu 2025-05-07T20:09:29.5889997Z INFO:root:creating _skbuild/linux-x86_64-3.13/setuptools/lib.linux-x86_64-cpython-313/fbgemm_gpu/config 2025-05-07T20:09:29.5891978Z INFO:root:copying _skbuild/linux-x86_64-3.13/cmake-install/fbgemm_gpu/config/__init__.py -> _skbuild/linux-x86_64-3.13/setuptools/lib.linux-x86_64-cpython-313/fbgemm_gpu/config 2025-05-07T20:09:29.5894573Z INFO:root:copying _skbuild/linux-x86_64-3.13/cmake-install/fbgemm_gpu/config/feature_list.py -> _skbuild/linux-x86_64-3.13/setuptools/lib.linux-x86_64-cpython-313/fbgemm_gpu/config 2025-05-07T20:09:29.5896582Z INFO:root:creating _skbuild/linux-x86_64-3.13/setuptools/lib.linux-x86_64-cpython-313/fbgemm_gpu/docs 2025-05-07T20:09:29.5898433Z INFO:root:copying _skbuild/linux-x86_64-3.13/cmake-install/fbgemm_gpu/docs/__init__.py -> _skbuild/linux-x86_64-3.13/setuptools/lib.linux-x86_64-cpython-313/fbgemm_gpu/docs 2025-05-07T20:09:29.5900815Z INFO:root:copying _skbuild/linux-x86_64-3.13/cmake-install/fbgemm_gpu/docs/common.py -> _skbuild/linux-x86_64-3.13/setuptools/lib.linux-x86_64-cpython-313/fbgemm_gpu/docs 2025-05-07T20:09:29.5903255Z INFO:root:copying _skbuild/linux-x86_64-3.13/cmake-install/fbgemm_gpu/docs/examples.py -> _skbuild/linux-x86_64-3.13/setuptools/lib.linux-x86_64-cpython-313/fbgemm_gpu/docs 2025-05-07T20:09:29.5905794Z INFO:root:copying _skbuild/linux-x86_64-3.13/cmake-install/fbgemm_gpu/docs/jagged_tensor_ops.py -> _skbuild/linux-x86_64-3.13/setuptools/lib.linux-x86_64-cpython-313/fbgemm_gpu/docs 2025-05-07T20:09:29.5908392Z INFO:root:copying _skbuild/linux-x86_64-3.13/cmake-install/fbgemm_gpu/docs/merge_pooled_embedding_ops.py -> _skbuild/linux-x86_64-3.13/setuptools/lib.linux-x86_64-cpython-313/fbgemm_gpu/docs 2025-05-07T20:09:29.5911018Z INFO:root:copying _skbuild/linux-x86_64-3.13/cmake-install/fbgemm_gpu/docs/permute_pooled_embedding_ops.py -> _skbuild/linux-x86_64-3.13/setuptools/lib.linux-x86_64-cpython-313/fbgemm_gpu/docs 2025-05-07T20:09:29.5913587Z INFO:root:copying _skbuild/linux-x86_64-3.13/cmake-install/fbgemm_gpu/docs/quantize_ops.py -> _skbuild/linux-x86_64-3.13/setuptools/lib.linux-x86_64-cpython-313/fbgemm_gpu/docs 2025-05-07T20:09:29.5915997Z INFO:root:copying _skbuild/linux-x86_64-3.13/cmake-install/fbgemm_gpu/docs/sparse_ops.py -> _skbuild/linux-x86_64-3.13/setuptools/lib.linux-x86_64-cpython-313/fbgemm_gpu/docs 2025-05-07T20:09:29.5918446Z INFO:root:copying _skbuild/linux-x86_64-3.13/cmake-install/fbgemm_gpu/docs/version.py -> _skbuild/linux-x86_64-3.13/setuptools/lib.linux-x86_64-cpython-313/fbgemm_gpu/docs 2025-05-07T20:09:29.5920300Z INFO:root:creating _skbuild/linux-x86_64-3.13/setuptools/lib.linux-x86_64-cpython-313/fbgemm_gpu/quantize 2025-05-07T20:09:29.5922287Z INFO:root:copying _skbuild/linux-x86_64-3.13/cmake-install/fbgemm_gpu/quantize/__init__.py -> _skbuild/linux-x86_64-3.13/setuptools/lib.linux-x86_64-cpython-313/fbgemm_gpu/quantize 2025-05-07T20:09:29.5924760Z INFO:root:copying _skbuild/linux-x86_64-3.13/cmake-install/fbgemm_gpu/quantize/quantize_ops.py -> _skbuild/linux-x86_64-3.13/setuptools/lib.linux-x86_64-cpython-313/fbgemm_gpu/quantize 2025-05-07T20:09:29.5926692Z INFO:root:creating _skbuild/linux-x86_64-3.13/setuptools/lib.linux-x86_64-cpython-313/fbgemm_gpu/sll 2025-05-07T20:09:29.5928869Z INFO:root:copying _skbuild/linux-x86_64-3.13/cmake-install/fbgemm_gpu/sll/__init__.py -> _skbuild/linux-x86_64-3.13/setuptools/lib.linux-x86_64-cpython-313/fbgemm_gpu/sll 2025-05-07T20:09:29.5930890Z INFO:root:creating _skbuild/linux-x86_64-3.13/setuptools/lib.linux-x86_64-cpython-313/fbgemm_gpu/tbe 2025-05-07T20:09:29.5932706Z INFO:root:copying _skbuild/linux-x86_64-3.13/cmake-install/fbgemm_gpu/tbe/__init__.py -> _skbuild/linux-x86_64-3.13/setuptools/lib.linux-x86_64-cpython-313/fbgemm_gpu/tbe 2025-05-07T20:09:29.5934510Z INFO:root:creating _skbuild/linux-x86_64-3.13/setuptools/lib.linux-x86_64-cpython-313/fbgemm_gpu/triton 2025-05-07T20:09:29.5936388Z INFO:root:copying _skbuild/linux-x86_64-3.13/cmake-install/fbgemm_gpu/triton/__init__.py -> _skbuild/linux-x86_64-3.13/setuptools/lib.linux-x86_64-cpython-313/fbgemm_gpu/triton 2025-05-07T20:09:29.5938804Z INFO:root:copying _skbuild/linux-x86_64-3.13/cmake-install/fbgemm_gpu/triton/common.py -> _skbuild/linux-x86_64-3.13/setuptools/lib.linux-x86_64-cpython-313/fbgemm_gpu/triton 2025-05-07T20:09:29.5941305Z INFO:root:copying _skbuild/linux-x86_64-3.13/cmake-install/fbgemm_gpu/triton/quantize.py -> _skbuild/linux-x86_64-3.13/setuptools/lib.linux-x86_64-cpython-313/fbgemm_gpu/triton 2025-05-07T20:09:29.5943951Z INFO:root:copying _skbuild/linux-x86_64-3.13/cmake-install/fbgemm_gpu/triton/quantize_ref.py -> _skbuild/linux-x86_64-3.13/setuptools/lib.linux-x86_64-cpython-313/fbgemm_gpu/triton 2025-05-07T20:09:29.5945889Z INFO:root:creating _skbuild/linux-x86_64-3.13/setuptools/lib.linux-x86_64-cpython-313/fbgemm_gpu/utils 2025-05-07T20:09:29.5947786Z INFO:root:copying _skbuild/linux-x86_64-3.13/cmake-install/fbgemm_gpu/utils/__init__.py -> _skbuild/linux-x86_64-3.13/setuptools/lib.linux-x86_64-cpython-313/fbgemm_gpu/utils 2025-05-07T20:09:29.5950178Z INFO:root:copying _skbuild/linux-x86_64-3.13/cmake-install/fbgemm_gpu/utils/filestore.py -> _skbuild/linux-x86_64-3.13/setuptools/lib.linux-x86_64-cpython-313/fbgemm_gpu/utils 2025-05-07T20:09:29.5952779Z INFO:root:copying _skbuild/linux-x86_64-3.13/cmake-install/fbgemm_gpu/utils/loader.py -> _skbuild/linux-x86_64-3.13/setuptools/lib.linux-x86_64-cpython-313/fbgemm_gpu/utils 2025-05-07T20:09:29.5955322Z INFO:root:copying _skbuild/linux-x86_64-3.13/cmake-install/fbgemm_gpu/utils/torch_library.py -> _skbuild/linux-x86_64-3.13/setuptools/lib.linux-x86_64-cpython-313/fbgemm_gpu/utils 2025-05-07T20:09:29.5957339Z INFO:root:creating _skbuild/linux-x86_64-3.13/setuptools/lib.linux-x86_64-cpython-313/fbgemm_gpu/sll/cpu 2025-05-07T20:09:29.5959435Z INFO:root:copying _skbuild/linux-x86_64-3.13/cmake-install/fbgemm_gpu/sll/cpu/__init__.py -> _skbuild/linux-x86_64-3.13/setuptools/lib.linux-x86_64-cpython-313/fbgemm_gpu/sll/cpu 2025-05-07T20:09:29.5961897Z INFO:root:copying _skbuild/linux-x86_64-3.13/cmake-install/fbgemm_gpu/sll/cpu/cpu_sll.py -> _skbuild/linux-x86_64-3.13/setuptools/lib.linux-x86_64-cpython-313/fbgemm_gpu/sll/cpu 2025-05-07T20:09:29.5963930Z INFO:root:creating _skbuild/linux-x86_64-3.13/setuptools/lib.linux-x86_64-cpython-313/fbgemm_gpu/sll/meta 2025-05-07T20:09:29.5965980Z INFO:root:copying _skbuild/linux-x86_64-3.13/cmake-install/fbgemm_gpu/sll/meta/__init__.py -> _skbuild/linux-x86_64-3.13/setuptools/lib.linux-x86_64-cpython-313/fbgemm_gpu/sll/meta 2025-05-07T20:09:29.5968518Z INFO:root:copying _skbuild/linux-x86_64-3.13/cmake-install/fbgemm_gpu/sll/meta/meta_sll.py -> _skbuild/linux-x86_64-3.13/setuptools/lib.linux-x86_64-cpython-313/fbgemm_gpu/sll/meta 2025-05-07T20:09:29.5970650Z INFO:root:creating _skbuild/linux-x86_64-3.13/setuptools/lib.linux-x86_64-cpython-313/fbgemm_gpu/sll/triton 2025-05-07T20:09:29.5972709Z INFO:root:copying _skbuild/linux-x86_64-3.13/cmake-install/fbgemm_gpu/sll/triton/__init__.py -> _skbuild/linux-x86_64-3.13/setuptools/lib.linux-x86_64-cpython-313/fbgemm_gpu/sll/triton 2025-05-07T20:09:29.5975252Z INFO:root:copying _skbuild/linux-x86_64-3.13/cmake-install/fbgemm_gpu/sll/triton/common.py -> _skbuild/linux-x86_64-3.13/setuptools/lib.linux-x86_64-cpython-313/fbgemm_gpu/sll/triton 2025-05-07T20:09:29.5978110Z INFO:root:copying _skbuild/linux-x86_64-3.13/cmake-install/fbgemm_gpu/sll/triton/triton_dense_jagged_cat_jagged_out.py -> _skbuild/linux-x86_64-3.13/setuptools/lib.linux-x86_64-cpython-313/fbgemm_gpu/sll/triton 2025-05-07T20:09:29.5981218Z INFO:root:copying _skbuild/linux-x86_64-3.13/cmake-install/fbgemm_gpu/sll/triton/triton_jagged2_to_padded_dense.py -> _skbuild/linux-x86_64-3.13/setuptools/lib.linux-x86_64-cpython-313/fbgemm_gpu/sll/triton 2025-05-07T20:09:29.5984065Z INFO:root:copying _skbuild/linux-x86_64-3.13/cmake-install/fbgemm_gpu/sll/triton/triton_jagged_bmm.py -> _skbuild/linux-x86_64-3.13/setuptools/lib.linux-x86_64-cpython-313/fbgemm_gpu/sll/triton 2025-05-07T20:09:29.5986957Z INFO:root:copying _skbuild/linux-x86_64-3.13/cmake-install/fbgemm_gpu/sll/triton/triton_jagged_bmm_jagged_out.py -> _skbuild/linux-x86_64-3.13/setuptools/lib.linux-x86_64-cpython-313/fbgemm_gpu/sll/triton 2025-05-07T20:09:29.5989950Z INFO:root:copying _skbuild/linux-x86_64-3.13/cmake-install/fbgemm_gpu/sll/triton/triton_jagged_dense_elementwise_add.py -> _skbuild/linux-x86_64-3.13/setuptools/lib.linux-x86_64-cpython-313/fbgemm_gpu/sll/triton 2025-05-07T20:09:29.5993174Z INFO:root:copying _skbuild/linux-x86_64-3.13/cmake-install/fbgemm_gpu/sll/triton/triton_jagged_dense_elementwise_mul_jagged_out.py -> _skbuild/linux-x86_64-3.13/setuptools/lib.linux-x86_64-cpython-313/fbgemm_gpu/sll/triton 2025-05-07T20:09:29.5996348Z INFO:root:copying _skbuild/linux-x86_64-3.13/cmake-install/fbgemm_gpu/sll/triton/triton_jagged_dense_flash_attention.py -> _skbuild/linux-x86_64-3.13/setuptools/lib.linux-x86_64-cpython-313/fbgemm_gpu/sll/triton 2025-05-07T20:09:29.5999424Z INFO:root:copying _skbuild/linux-x86_64-3.13/cmake-install/fbgemm_gpu/sll/triton/triton_jagged_flash_attention_basic.py -> _skbuild/linux-x86_64-3.13/setuptools/lib.linux-x86_64-cpython-313/fbgemm_gpu/sll/triton 2025-05-07T20:09:29.6002518Z INFO:root:copying _skbuild/linux-x86_64-3.13/cmake-install/fbgemm_gpu/sll/triton/triton_jagged_self_substraction_jagged_out.py -> _skbuild/linux-x86_64-3.13/setuptools/lib.linux-x86_64-cpython-313/fbgemm_gpu/sll/triton 2025-05-07T20:09:29.6005664Z INFO:root:copying _skbuild/linux-x86_64-3.13/cmake-install/fbgemm_gpu/sll/triton/triton_jagged_softmax.py -> _skbuild/linux-x86_64-3.13/setuptools/lib.linux-x86_64-cpython-313/fbgemm_gpu/sll/triton 2025-05-07T20:09:29.6008666Z INFO:root:copying _skbuild/linux-x86_64-3.13/cmake-install/fbgemm_gpu/sll/triton/triton_multi_head_jagged_flash_attention.py -> _skbuild/linux-x86_64-3.13/setuptools/lib.linux-x86_64-cpython-313/fbgemm_gpu/sll/triton 2025-05-07T20:09:29.6011013Z INFO:root:creating _skbuild/linux-x86_64-3.13/setuptools/lib.linux-x86_64-cpython-313/fbgemm_gpu/tbe/bench 2025-05-07T20:09:29.6013060Z INFO:root:copying _skbuild/linux-x86_64-3.13/cmake-install/fbgemm_gpu/tbe/bench/__init__.py -> _skbuild/linux-x86_64-3.13/setuptools/lib.linux-x86_64-cpython-313/fbgemm_gpu/tbe/bench 2025-05-07T20:09:29.6015694Z INFO:root:copying _skbuild/linux-x86_64-3.13/cmake-install/fbgemm_gpu/tbe/bench/bench_config.py -> _skbuild/linux-x86_64-3.13/setuptools/lib.linux-x86_64-cpython-313/fbgemm_gpu/tbe/bench 2025-05-07T20:09:29.6018294Z INFO:root:copying _skbuild/linux-x86_64-3.13/cmake-install/fbgemm_gpu/tbe/bench/bench_runs.py -> _skbuild/linux-x86_64-3.13/setuptools/lib.linux-x86_64-cpython-313/fbgemm_gpu/tbe/bench 2025-05-07T20:09:29.6020874Z INFO:root:copying _skbuild/linux-x86_64-3.13/cmake-install/fbgemm_gpu/tbe/bench/eeg_cli.py -> _skbuild/linux-x86_64-3.13/setuptools/lib.linux-x86_64-cpython-313/fbgemm_gpu/tbe/bench 2025-05-07T20:09:29.6023564Z INFO:root:copying _skbuild/linux-x86_64-3.13/cmake-install/fbgemm_gpu/tbe/bench/embedding_ops_common_config.py -> _skbuild/linux-x86_64-3.13/setuptools/lib.linux-x86_64-cpython-313/fbgemm_gpu/tbe/bench 2025-05-07T20:09:29.6026310Z INFO:root:copying _skbuild/linux-x86_64-3.13/cmake-install/fbgemm_gpu/tbe/bench/eval_compression.py -> _skbuild/linux-x86_64-3.13/setuptools/lib.linux-x86_64-cpython-313/fbgemm_gpu/tbe/bench 2025-05-07T20:09:29.6029250Z INFO:root:copying _skbuild/linux-x86_64-3.13/cmake-install/fbgemm_gpu/tbe/bench/reporter.py -> _skbuild/linux-x86_64-3.13/setuptools/lib.linux-x86_64-cpython-313/fbgemm_gpu/tbe/bench 2025-05-07T20:09:29.6032029Z INFO:root:copying _skbuild/linux-x86_64-3.13/cmake-install/fbgemm_gpu/tbe/bench/tbe_data_config.py -> _skbuild/linux-x86_64-3.13/setuptools/lib.linux-x86_64-cpython-313/fbgemm_gpu/tbe/bench 2025-05-07T20:09:29.6034736Z INFO:root:copying _skbuild/linux-x86_64-3.13/cmake-install/fbgemm_gpu/tbe/bench/tbe_data_config_loader.py -> _skbuild/linux-x86_64-3.13/setuptools/lib.linux-x86_64-cpython-313/fbgemm_gpu/tbe/bench 2025-05-07T20:09:29.6037563Z INFO:root:copying _skbuild/linux-x86_64-3.13/cmake-install/fbgemm_gpu/tbe/bench/tbe_data_config_param_models.py -> _skbuild/linux-x86_64-3.13/setuptools/lib.linux-x86_64-cpython-313/fbgemm_gpu/tbe/bench 2025-05-07T20:09:29.6040308Z INFO:root:copying _skbuild/linux-x86_64-3.13/cmake-install/fbgemm_gpu/tbe/bench/utils.py -> _skbuild/linux-x86_64-3.13/setuptools/lib.linux-x86_64-cpython-313/fbgemm_gpu/tbe/bench 2025-05-07T20:09:29.6042357Z INFO:root:creating _skbuild/linux-x86_64-3.13/setuptools/lib.linux-x86_64-cpython-313/fbgemm_gpu/tbe/cache 2025-05-07T20:09:29.6044481Z INFO:root:copying _skbuild/linux-x86_64-3.13/cmake-install/fbgemm_gpu/tbe/cache/__init__.py -> _skbuild/linux-x86_64-3.13/setuptools/lib.linux-x86_64-cpython-313/fbgemm_gpu/tbe/cache 2025-05-07T20:09:29.6047150Z INFO:root:copying _skbuild/linux-x86_64-3.13/cmake-install/fbgemm_gpu/tbe/cache/split_embeddings_cache_ops.py -> _skbuild/linux-x86_64-3.13/setuptools/lib.linux-x86_64-cpython-313/fbgemm_gpu/tbe/cache 2025-05-07T20:09:29.6049255Z INFO:root:creating _skbuild/linux-x86_64-3.13/setuptools/lib.linux-x86_64-cpython-313/fbgemm_gpu/tbe/ssd 2025-05-07T20:09:29.6051307Z INFO:root:copying _skbuild/linux-x86_64-3.13/cmake-install/fbgemm_gpu/tbe/ssd/__init__.py -> _skbuild/linux-x86_64-3.13/setuptools/lib.linux-x86_64-cpython-313/fbgemm_gpu/tbe/ssd 2025-05-07T20:09:29.6053904Z INFO:root:copying _skbuild/linux-x86_64-3.13/cmake-install/fbgemm_gpu/tbe/ssd/common.py -> _skbuild/linux-x86_64-3.13/setuptools/lib.linux-x86_64-cpython-313/fbgemm_gpu/tbe/ssd 2025-05-07T20:09:29.6056462Z INFO:root:copying _skbuild/linux-x86_64-3.13/cmake-install/fbgemm_gpu/tbe/ssd/inference.py -> _skbuild/linux-x86_64-3.13/setuptools/lib.linux-x86_64-cpython-313/fbgemm_gpu/tbe/ssd 2025-05-07T20:09:29.6058983Z INFO:root:copying _skbuild/linux-x86_64-3.13/cmake-install/fbgemm_gpu/tbe/ssd/training.py -> _skbuild/linux-x86_64-3.13/setuptools/lib.linux-x86_64-cpython-313/fbgemm_gpu/tbe/ssd 2025-05-07T20:09:29.6061238Z INFO:root:creating _skbuild/linux-x86_64-3.13/setuptools/lib.linux-x86_64-cpython-313/fbgemm_gpu/tbe/stats 2025-05-07T20:09:29.6063355Z INFO:root:copying _skbuild/linux-x86_64-3.13/cmake-install/fbgemm_gpu/tbe/stats/__init__.py -> _skbuild/linux-x86_64-3.13/setuptools/lib.linux-x86_64-cpython-313/fbgemm_gpu/tbe/stats 2025-05-07T20:09:29.6066194Z INFO:root:copying _skbuild/linux-x86_64-3.13/cmake-install/fbgemm_gpu/tbe/stats/bench_params_reporter.py -> _skbuild/linux-x86_64-3.13/setuptools/lib.linux-x86_64-cpython-313/fbgemm_gpu/tbe/stats 2025-05-07T20:09:29.6068582Z INFO:root:creating _skbuild/linux-x86_64-3.13/setuptools/lib.linux-x86_64-cpython-313/fbgemm_gpu/tbe/utils 2025-05-07T20:09:29.6070666Z INFO:root:copying _skbuild/linux-x86_64-3.13/cmake-install/fbgemm_gpu/tbe/utils/__init__.py -> _skbuild/linux-x86_64-3.13/setuptools/lib.linux-x86_64-cpython-313/fbgemm_gpu/tbe/utils 2025-05-07T20:09:29.6073309Z INFO:root:copying _skbuild/linux-x86_64-3.13/cmake-install/fbgemm_gpu/tbe/utils/common.py -> _skbuild/linux-x86_64-3.13/setuptools/lib.linux-x86_64-cpython-313/fbgemm_gpu/tbe/utils 2025-05-07T20:09:29.6076111Z INFO:root:copying _skbuild/linux-x86_64-3.13/cmake-install/fbgemm_gpu/tbe/utils/offsets.py -> _skbuild/linux-x86_64-3.13/setuptools/lib.linux-x86_64-cpython-313/fbgemm_gpu/tbe/utils 2025-05-07T20:09:29.6078969Z INFO:root:copying _skbuild/linux-x86_64-3.13/cmake-install/fbgemm_gpu/tbe/utils/quantize.py -> _skbuild/linux-x86_64-3.13/setuptools/lib.linux-x86_64-cpython-313/fbgemm_gpu/tbe/utils 2025-05-07T20:09:29.6081744Z INFO:root:copying _skbuild/linux-x86_64-3.13/cmake-install/fbgemm_gpu/tbe/utils/requests.py -> _skbuild/linux-x86_64-3.13/setuptools/lib.linux-x86_64-cpython-313/fbgemm_gpu/tbe/utils 2025-05-07T20:09:29.6083917Z INFO:root:creating _skbuild/linux-x86_64-3.13/setuptools/lib.linux-x86_64-cpython-313/fbgemm_gpu/tbe/ssd/utils 2025-05-07T20:09:29.6086090Z INFO:root:copying _skbuild/linux-x86_64-3.13/cmake-install/fbgemm_gpu/tbe/ssd/utils/__init__.py -> _skbuild/linux-x86_64-3.13/setuptools/lib.linux-x86_64-cpython-313/fbgemm_gpu/tbe/ssd/utils 2025-05-07T20:09:29.6089073Z INFO:root:copying _skbuild/linux-x86_64-3.13/cmake-install/fbgemm_gpu/tbe/ssd/utils/partially_materialized_tensor.py -> _skbuild/linux-x86_64-3.13/setuptools/lib.linux-x86_64-cpython-313/fbgemm_gpu/tbe/ssd/utils 2025-05-07T20:09:29.6091754Z INFO:root:creating _skbuild/linux-x86_64-3.13/setuptools/lib.linux-x86_64-cpython-313/fbgemm_gpu/triton/jagged 2025-05-07T20:09:29.6094015Z INFO:root:copying _skbuild/linux-x86_64-3.13/cmake-install/fbgemm_gpu/triton/jagged/__init__.py -> _skbuild/linux-x86_64-3.13/setuptools/lib.linux-x86_64-cpython-313/fbgemm_gpu/triton/jagged 2025-05-07T20:09:29.6097140Z INFO:root:copying _skbuild/linux-x86_64-3.13/cmake-install/fbgemm_gpu/triton/jagged/triton_jagged_tensor_ops.py -> _skbuild/linux-x86_64-3.13/setuptools/lib.linux-x86_64-cpython-313/fbgemm_gpu/triton/jagged 2025-05-07T20:09:29.6113881Z INFO:root:copying _skbuild/linux-x86_64-3.13/cmake-install/fbgemm_gpu/asmjit.so -> _skbuild/linux-x86_64-3.13/setuptools/lib.linux-x86_64-cpython-313/fbgemm_gpu 2025-05-07T20:09:29.6138535Z INFO:root:copying _skbuild/linux-x86_64-3.13/cmake-install/fbgemm_gpu/fbgemm.so -> _skbuild/linux-x86_64-3.13/setuptools/lib.linux-x86_64-cpython-313/fbgemm_gpu 2025-05-07T20:09:29.6373344Z INFO:root:copying _skbuild/linux-x86_64-3.13/cmake-install/fbgemm_gpu/fbgemm_gpu_tbe_cache.so -> _skbuild/linux-x86_64-3.13/setuptools/lib.linux-x86_64-cpython-313/fbgemm_gpu 2025-05-07T20:09:29.7311944Z INFO:root:copying _skbuild/linux-x86_64-3.13/cmake-install/fbgemm_gpu/fbgemm_gpu_tbe_inference.so -> _skbuild/linux-x86_64-3.13/setuptools/lib.linux-x86_64-cpython-313/fbgemm_gpu 2025-05-07T20:09:33.1749293Z INFO:root:copying _skbuild/linux-x86_64-3.13/cmake-install/fbgemm_gpu/fbgemm_gpu_config.so -> _skbuild/linux-x86_64-3.13/setuptools/lib.linux-x86_64-cpython-313/fbgemm_gpu 2025-05-07T20:09:33.1752586Z INFO:root:copying _skbuild/linux-x86_64-3.13/cmake-install/fbgemm_gpu/fbgemm_gpu_tbe_utils.so -> _skbuild/linux-x86_64-3.13/setuptools/lib.linux-x86_64-cpython-313/fbgemm_gpu 2025-05-07T20:09:33.3066979Z INFO:root:copying _skbuild/linux-x86_64-3.13/cmake-install/fbgemm_gpu/fbgemm_gpu_sparse_async_cumsum.so -> _skbuild/linux-x86_64-3.13/setuptools/lib.linux-x86_64-cpython-313/fbgemm_gpu 2025-05-07T20:09:33.3173204Z INFO:root:copying _skbuild/linux-x86_64-3.13/cmake-install/fbgemm_gpu/fbgemm_gpu_tbe_common.so -> _skbuild/linux-x86_64-3.13/setuptools/lib.linux-x86_64-cpython-313/fbgemm_gpu 2025-05-07T20:09:33.3395657Z INFO:root:copying _skbuild/linux-x86_64-3.13/cmake-install/fbgemm_gpu/fbgemm_gpu_tbe_optimizers.so -> _skbuild/linux-x86_64-3.13/setuptools/lib.linux-x86_64-cpython-313/fbgemm_gpu 2025-05-07T20:09:33.4092150Z INFO:root:copying _skbuild/linux-x86_64-3.13/cmake-install/fbgemm_gpu/fbgemm_gpu_tbe_training_forward.so -> _skbuild/linux-x86_64-3.13/setuptools/lib.linux-x86_64-cpython-313/fbgemm_gpu 2025-05-07T20:09:36.1510667Z INFO:root:copying _skbuild/linux-x86_64-3.13/cmake-install/fbgemm_gpu/fbgemm_gpu_tbe_training_backward_pt2.so -> _skbuild/linux-x86_64-3.13/setuptools/lib.linux-x86_64-cpython-313/fbgemm_gpu 2025-05-07T20:09:36.2072860Z INFO:root:copying _skbuild/linux-x86_64-3.13/cmake-install/fbgemm_gpu/fbgemm_gpu_tbe_training_backward.so -> _skbuild/linux-x86_64-3.13/setuptools/lib.linux-x86_64-cpython-313/fbgemm_gpu 2025-05-07T20:09:43.5509263Z INFO:root:copying _skbuild/linux-x86_64-3.13/cmake-install/fbgemm_gpu/fbgemm_gpu_tbe_training_backward_gwd.so -> _skbuild/linux-x86_64-3.13/setuptools/lib.linux-x86_64-cpython-313/fbgemm_gpu 2025-05-07T20:09:44.6829030Z INFO:root:copying _skbuild/linux-x86_64-3.13/cmake-install/fbgemm_gpu/fbgemm_gpu_tbe_training_backward_vbe.so -> _skbuild/linux-x86_64-3.13/setuptools/lib.linux-x86_64-cpython-313/fbgemm_gpu 2025-05-07T20:09:47.4087379Z INFO:root:copying _skbuild/linux-x86_64-3.13/cmake-install/fbgemm_gpu/fbgemm_gpu_tbe_training_backward_dense.so -> _skbuild/linux-x86_64-3.13/setuptools/lib.linux-x86_64-cpython-313/fbgemm_gpu 2025-05-07T20:09:47.8736011Z INFO:root:copying _skbuild/linux-x86_64-3.13/cmake-install/fbgemm_gpu/fbgemm_gpu_tbe_training_backward_split_host.so -> _skbuild/linux-x86_64-3.13/setuptools/lib.linux-x86_64-cpython-313/fbgemm_gpu 2025-05-07T20:09:47.9031396Z INFO:root:copying _skbuild/linux-x86_64-3.13/cmake-install/fbgemm_gpu/fbgemm_gpu_tbe_index_select.so -> _skbuild/linux-x86_64-3.13/setuptools/lib.linux-x86_64-cpython-313/fbgemm_gpu 2025-05-07T20:09:48.1740018Z INFO:root:creating _skbuild/linux-x86_64-3.13/setuptools/lib.linux-x86_64-cpython-313/fbgemm_gpu/split_embedding_codegen_lookup_invokers 2025-05-07T20:09:48.1741825Z INFO:root:copying _skbuild/linux-x86_64-3.13/cmake-install/fbgemm_gpu/split_embedding_codegen_lookup_invokers/lookup_adagrad.py -> _skbuild/linux-x86_64-3.13/setuptools/lib.linux-x86_64-cpython-313/fbgemm_gpu/split_embedding_codegen_lookup_invokers 2025-05-07T20:09:48.1743962Z INFO:root:copying _skbuild/linux-x86_64-3.13/cmake-install/fbgemm_gpu/split_embedding_codegen_lookup_invokers/lookup_rowwise_adagrad.py -> _skbuild/linux-x86_64-3.13/setuptools/lib.linux-x86_64-cpython-313/fbgemm_gpu/split_embedding_codegen_lookup_invokers 2025-05-07T20:09:48.1748065Z INFO:root:copying _skbuild/linux-x86_64-3.13/cmake-install/fbgemm_gpu/split_embedding_codegen_lookup_invokers/lookup_sgd.py -> _skbuild/linux-x86_64-3.13/setuptools/lib.linux-x86_64-cpython-313/fbgemm_gpu/split_embedding_codegen_lookup_invokers 2025-05-07T20:09:48.1753195Z INFO:root:copying _skbuild/linux-x86_64-3.13/cmake-install/fbgemm_gpu/split_embedding_codegen_lookup_invokers/lookup_adam.py -> _skbuild/linux-x86_64-3.13/setuptools/lib.linux-x86_64-cpython-313/fbgemm_gpu/split_embedding_codegen_lookup_invokers 2025-05-07T20:09:48.1758521Z INFO:root:copying _skbuild/linux-x86_64-3.13/cmake-install/fbgemm_gpu/split_embedding_codegen_lookup_invokers/lookup_lamb.py -> _skbuild/linux-x86_64-3.13/setuptools/lib.linux-x86_64-cpython-313/fbgemm_gpu/split_embedding_codegen_lookup_invokers 2025-05-07T20:09:48.1763190Z INFO:root:copying _skbuild/linux-x86_64-3.13/cmake-install/fbgemm_gpu/split_embedding_codegen_lookup_invokers/lookup_lars_sgd.py -> _skbuild/linux-x86_64-3.13/setuptools/lib.linux-x86_64-cpython-313/fbgemm_gpu/split_embedding_codegen_lookup_invokers 2025-05-07T20:09:48.1768622Z INFO:root:copying _skbuild/linux-x86_64-3.13/cmake-install/fbgemm_gpu/split_embedding_codegen_lookup_invokers/lookup_partial_rowwise_adam.py -> _skbuild/linux-x86_64-3.13/setuptools/lib.linux-x86_64-cpython-313/fbgemm_gpu/split_embedding_codegen_lookup_invokers 2025-05-07T20:09:48.1774390Z INFO:root:copying _skbuild/linux-x86_64-3.13/cmake-install/fbgemm_gpu/split_embedding_codegen_lookup_invokers/lookup_partial_rowwise_lamb.py -> _skbuild/linux-x86_64-3.13/setuptools/lib.linux-x86_64-cpython-313/fbgemm_gpu/split_embedding_codegen_lookup_invokers 2025-05-07T20:09:48.1779401Z INFO:root:copying _skbuild/linux-x86_64-3.13/cmake-install/fbgemm_gpu/split_embedding_codegen_lookup_invokers/lookup_none.py -> _skbuild/linux-x86_64-3.13/setuptools/lib.linux-x86_64-cpython-313/fbgemm_gpu/split_embedding_codegen_lookup_invokers 2025-05-07T20:09:48.1784265Z INFO:root:copying _skbuild/linux-x86_64-3.13/cmake-install/fbgemm_gpu/split_embedding_codegen_lookup_invokers/lookup_rowwise_adagrad_with_counter.py -> _skbuild/linux-x86_64-3.13/setuptools/lib.linux-x86_64-cpython-313/fbgemm_gpu/split_embedding_codegen_lookup_invokers 2025-05-07T20:09:48.1789461Z INFO:root:copying _skbuild/linux-x86_64-3.13/cmake-install/fbgemm_gpu/split_embedding_codegen_lookup_invokers/lookup_rowwise_adagrad_ssd.py -> _skbuild/linux-x86_64-3.13/setuptools/lib.linux-x86_64-cpython-313/fbgemm_gpu/split_embedding_codegen_lookup_invokers 2025-05-07T20:09:48.1794314Z INFO:root:copying _skbuild/linux-x86_64-3.13/cmake-install/fbgemm_gpu/split_embedding_codegen_lookup_invokers/__init__.py -> _skbuild/linux-x86_64-3.13/setuptools/lib.linux-x86_64-cpython-313/fbgemm_gpu/split_embedding_codegen_lookup_invokers 2025-05-07T20:09:48.1799995Z INFO:root:copying _skbuild/linux-x86_64-3.13/cmake-install/fbgemm_gpu/split_embedding_codegen_lookup_invokers/lookup_args.py -> _skbuild/linux-x86_64-3.13/setuptools/lib.linux-x86_64-cpython-313/fbgemm_gpu/split_embedding_codegen_lookup_invokers 2025-05-07T20:09:48.1804394Z INFO:root:copying _skbuild/linux-x86_64-3.13/cmake-install/fbgemm_gpu/split_embedding_codegen_lookup_invokers/lookup_args_ssd.py -> _skbuild/linux-x86_64-3.13/setuptools/lib.linux-x86_64-cpython-313/fbgemm_gpu/split_embedding_codegen_lookup_invokers 2025-05-07T20:09:48.1808462Z INFO:root:creating _skbuild/linux-x86_64-3.13/setuptools/lib.linux-x86_64-cpython-313/fbgemm_gpu/split_embedding_optimizer_codegen 2025-05-07T20:09:48.1810082Z INFO:root:copying _skbuild/linux-x86_64-3.13/cmake-install/fbgemm_gpu/split_embedding_optimizer_codegen/split_embedding_optimizer_rowwise_adagrad.py -> _skbuild/linux-x86_64-3.13/setuptools/lib.linux-x86_64-cpython-313/fbgemm_gpu/split_embedding_optimizer_codegen 2025-05-07T20:09:48.1814600Z INFO:root:copying _skbuild/linux-x86_64-3.13/cmake-install/fbgemm_gpu/split_embedding_optimizer_codegen/optimizer_args.py -> _skbuild/linux-x86_64-3.13/setuptools/lib.linux-x86_64-cpython-313/fbgemm_gpu/split_embedding_optimizer_codegen 2025-05-07T20:09:48.1822100Z INFO:root:copying _skbuild/linux-x86_64-3.13/cmake-install/fbgemm_gpu/fbgemm_gpu_embedding_inplace_ops.so -> _skbuild/linux-x86_64-3.13/setuptools/lib.linux-x86_64-cpython-313/fbgemm_gpu 2025-05-07T20:09:48.1856262Z INFO:root:copying _skbuild/linux-x86_64-3.13/cmake-install/fbgemm_gpu/fbgemm_gpu_py.so -> _skbuild/linux-x86_64-3.13/setuptools/lib.linux-x86_64-cpython-313/fbgemm_gpu 2025-05-07T20:09:48.7650407Z INFO:root:copying _skbuild/linux-x86_64-3.13/cmake-install/fbgemm_gpu/__init__.py -> _skbuild/linux-x86_64-3.13/setuptools/lib.linux-x86_64-cpython-313/fbgemm_gpu 2025-05-07T20:09:48.7651844Z INFO:root:copying _skbuild/linux-x86_64-3.13/cmake-install/fbgemm_gpu/batched_unary_embeddings_ops.py -> _skbuild/linux-x86_64-3.13/setuptools/lib.linux-x86_64-cpython-313/fbgemm_gpu 2025-05-07T20:09:48.7653193Z INFO:root:copying _skbuild/linux-x86_64-3.13/cmake-install/fbgemm_gpu/enums.py -> _skbuild/linux-x86_64-3.13/setuptools/lib.linux-x86_64-cpython-313/fbgemm_gpu 2025-05-07T20:09:48.7654449Z INFO:root:copying _skbuild/linux-x86_64-3.13/cmake-install/fbgemm_gpu/metrics.py -> _skbuild/linux-x86_64-3.13/setuptools/lib.linux-x86_64-cpython-313/fbgemm_gpu 2025-05-07T20:09:48.7655775Z INFO:root:copying _skbuild/linux-x86_64-3.13/cmake-install/fbgemm_gpu/permute_pooled_embedding_modules.py -> _skbuild/linux-x86_64-3.13/setuptools/lib.linux-x86_64-cpython-313/fbgemm_gpu 2025-05-07T20:09:48.7657287Z INFO:root:copying _skbuild/linux-x86_64-3.13/cmake-install/fbgemm_gpu/permute_pooled_embedding_modules_split.py -> _skbuild/linux-x86_64-3.13/setuptools/lib.linux-x86_64-cpython-313/fbgemm_gpu 2025-05-07T20:09:48.7658700Z INFO:root:copying _skbuild/linux-x86_64-3.13/cmake-install/fbgemm_gpu/quantize_comm.py -> _skbuild/linux-x86_64-3.13/setuptools/lib.linux-x86_64-cpython-313/fbgemm_gpu 2025-05-07T20:09:48.7659982Z INFO:root:copying _skbuild/linux-x86_64-3.13/cmake-install/fbgemm_gpu/quantize_utils.py -> _skbuild/linux-x86_64-3.13/setuptools/lib.linux-x86_64-cpython-313/fbgemm_gpu 2025-05-07T20:09:48.7661294Z INFO:root:copying _skbuild/linux-x86_64-3.13/cmake-install/fbgemm_gpu/runtime_monitor.py -> _skbuild/linux-x86_64-3.13/setuptools/lib.linux-x86_64-cpython-313/fbgemm_gpu 2025-05-07T20:09:48.7662577Z INFO:root:copying _skbuild/linux-x86_64-3.13/cmake-install/fbgemm_gpu/sparse_ops.py -> _skbuild/linux-x86_64-3.13/setuptools/lib.linux-x86_64-cpython-313/fbgemm_gpu 2025-05-07T20:09:48.7664061Z INFO:root:copying _skbuild/linux-x86_64-3.13/cmake-install/fbgemm_gpu/split_embedding_configs.py -> _skbuild/linux-x86_64-3.13/setuptools/lib.linux-x86_64-cpython-313/fbgemm_gpu 2025-05-07T20:09:48.7665580Z INFO:root:copying _skbuild/linux-x86_64-3.13/cmake-install/fbgemm_gpu/split_embedding_inference_converter.py -> _skbuild/linux-x86_64-3.13/setuptools/lib.linux-x86_64-cpython-313/fbgemm_gpu 2025-05-07T20:09:48.7667057Z INFO:root:copying _skbuild/linux-x86_64-3.13/cmake-install/fbgemm_gpu/split_embedding_optimizer_ops.py -> _skbuild/linux-x86_64-3.13/setuptools/lib.linux-x86_64-cpython-313/fbgemm_gpu 2025-05-07T20:09:48.7668457Z INFO:root:copying _skbuild/linux-x86_64-3.13/cmake-install/fbgemm_gpu/split_embedding_utils.py -> _skbuild/linux-x86_64-3.13/setuptools/lib.linux-x86_64-cpython-313/fbgemm_gpu 2025-05-07T20:09:48.7669849Z INFO:root:copying _skbuild/linux-x86_64-3.13/cmake-install/fbgemm_gpu/split_table_batched_embeddings_ops.py -> _skbuild/linux-x86_64-3.13/setuptools/lib.linux-x86_64-cpython-313/fbgemm_gpu 2025-05-07T20:09:48.7671350Z INFO:root:copying _skbuild/linux-x86_64-3.13/cmake-install/fbgemm_gpu/split_table_batched_embeddings_ops_common.py -> _skbuild/linux-x86_64-3.13/setuptools/lib.linux-x86_64-cpython-313/fbgemm_gpu 2025-05-07T20:09:48.7672963Z INFO:root:copying _skbuild/linux-x86_64-3.13/cmake-install/fbgemm_gpu/split_table_batched_embeddings_ops_inference.py -> _skbuild/linux-x86_64-3.13/setuptools/lib.linux-x86_64-cpython-313/fbgemm_gpu 2025-05-07T20:09:48.7674481Z INFO:root:copying _skbuild/linux-x86_64-3.13/cmake-install/fbgemm_gpu/split_table_batched_embeddings_ops_training.py -> _skbuild/linux-x86_64-3.13/setuptools/lib.linux-x86_64-cpython-313/fbgemm_gpu 2025-05-07T20:09:48.7677337Z INFO:root:copying _skbuild/linux-x86_64-3.13/cmake-install/fbgemm_gpu/split_table_batched_embeddings_ops_training_common.py -> _skbuild/linux-x86_64-3.13/setuptools/lib.linux-x86_64-cpython-313/fbgemm_gpu 2025-05-07T20:09:48.7678969Z INFO:root:copying _skbuild/linux-x86_64-3.13/cmake-install/fbgemm_gpu/ssd_split_table_batched_embeddings_ops.py -> _skbuild/linux-x86_64-3.13/setuptools/lib.linux-x86_64-cpython-313/fbgemm_gpu 2025-05-07T20:09:48.7680376Z INFO:root:copying _skbuild/linux-x86_64-3.13/cmake-install/fbgemm_gpu/tbe_input_multiplexer.py -> _skbuild/linux-x86_64-3.13/setuptools/lib.linux-x86_64-cpython-313/fbgemm_gpu 2025-05-07T20:09:48.7681997Z INFO:root:copying _skbuild/linux-x86_64-3.13/cmake-install/fbgemm_gpu/uvm.py -> _skbuild/linux-x86_64-3.13/setuptools/lib.linux-x86_64-cpython-313/fbgemm_gpu 2025-05-07T20:09:48.7683312Z INFO:root:copying _skbuild/linux-x86_64-3.13/cmake-install/fbgemm_gpu/config/__init__.py -> _skbuild/linux-x86_64-3.13/setuptools/lib.linux-x86_64-cpython-313/fbgemm_gpu/config 2025-05-07T20:09:48.7684724Z INFO:root:copying _skbuild/linux-x86_64-3.13/cmake-install/fbgemm_gpu/config/feature_list.py -> _skbuild/linux-x86_64-3.13/setuptools/lib.linux-x86_64-cpython-313/fbgemm_gpu/config 2025-05-07T20:09:48.7686132Z INFO:root:copying _skbuild/linux-x86_64-3.13/cmake-install/fbgemm_gpu/docs/__init__.py -> _skbuild/linux-x86_64-3.13/setuptools/lib.linux-x86_64-cpython-313/fbgemm_gpu/docs 2025-05-07T20:09:48.7687523Z INFO:root:copying _skbuild/linux-x86_64-3.13/cmake-install/fbgemm_gpu/docs/common.py -> _skbuild/linux-x86_64-3.13/setuptools/lib.linux-x86_64-cpython-313/fbgemm_gpu/docs 2025-05-07T20:09:48.7688929Z INFO:root:copying _skbuild/linux-x86_64-3.13/cmake-install/fbgemm_gpu/docs/examples.py -> _skbuild/linux-x86_64-3.13/setuptools/lib.linux-x86_64-cpython-313/fbgemm_gpu/docs 2025-05-07T20:09:48.7690457Z INFO:root:copying _skbuild/linux-x86_64-3.13/cmake-install/fbgemm_gpu/docs/jagged_tensor_ops.py -> _skbuild/linux-x86_64-3.13/setuptools/lib.linux-x86_64-cpython-313/fbgemm_gpu/docs 2025-05-07T20:09:48.7692049Z INFO:root:copying _skbuild/linux-x86_64-3.13/cmake-install/fbgemm_gpu/docs/merge_pooled_embedding_ops.py -> _skbuild/linux-x86_64-3.13/setuptools/lib.linux-x86_64-cpython-313/fbgemm_gpu/docs 2025-05-07T20:09:48.7694036Z INFO:root:copying _skbuild/linux-x86_64-3.13/cmake-install/fbgemm_gpu/docs/permute_pooled_embedding_ops.py -> _skbuild/linux-x86_64-3.13/setuptools/lib.linux-x86_64-cpython-313/fbgemm_gpu/docs 2025-05-07T20:09:48.7695696Z INFO:root:copying _skbuild/linux-x86_64-3.13/cmake-install/fbgemm_gpu/docs/quantize_ops.py -> _skbuild/linux-x86_64-3.13/setuptools/lib.linux-x86_64-cpython-313/fbgemm_gpu/docs 2025-05-07T20:09:48.7697083Z INFO:root:copying _skbuild/linux-x86_64-3.13/cmake-install/fbgemm_gpu/docs/sparse_ops.py -> _skbuild/linux-x86_64-3.13/setuptools/lib.linux-x86_64-cpython-313/fbgemm_gpu/docs 2025-05-07T20:09:48.7698646Z INFO:root:copying _skbuild/linux-x86_64-3.13/cmake-install/fbgemm_gpu/docs/version.py -> _skbuild/linux-x86_64-3.13/setuptools/lib.linux-x86_64-cpython-313/fbgemm_gpu/docs 2025-05-07T20:09:48.7700162Z INFO:root:copying _skbuild/linux-x86_64-3.13/cmake-install/fbgemm_gpu/quantize/__init__.py -> _skbuild/linux-x86_64-3.13/setuptools/lib.linux-x86_64-cpython-313/fbgemm_gpu/quantize 2025-05-07T20:09:48.7703591Z INFO:root:copying _skbuild/linux-x86_64-3.13/cmake-install/fbgemm_gpu/quantize/quantize_ops.py -> _skbuild/linux-x86_64-3.13/setuptools/lib.linux-x86_64-cpython-313/fbgemm_gpu/quantize 2025-05-07T20:09:48.7705035Z INFO:root:copying _skbuild/linux-x86_64-3.13/cmake-install/fbgemm_gpu/sll/__init__.py -> _skbuild/linux-x86_64-3.13/setuptools/lib.linux-x86_64-cpython-313/fbgemm_gpu/sll 2025-05-07T20:09:48.7706490Z INFO:root:copying _skbuild/linux-x86_64-3.13/cmake-install/fbgemm_gpu/tbe/__init__.py -> _skbuild/linux-x86_64-3.13/setuptools/lib.linux-x86_64-cpython-313/fbgemm_gpu/tbe 2025-05-07T20:09:48.7707986Z INFO:root:copying _skbuild/linux-x86_64-3.13/cmake-install/fbgemm_gpu/triton/__init__.py -> _skbuild/linux-x86_64-3.13/setuptools/lib.linux-x86_64-cpython-313/fbgemm_gpu/triton 2025-05-07T20:09:48.7709478Z INFO:root:copying _skbuild/linux-x86_64-3.13/cmake-install/fbgemm_gpu/triton/common.py -> _skbuild/linux-x86_64-3.13/setuptools/lib.linux-x86_64-cpython-313/fbgemm_gpu/triton 2025-05-07T20:09:48.7711213Z INFO:root:copying _skbuild/linux-x86_64-3.13/cmake-install/fbgemm_gpu/triton/quantize.py -> _skbuild/linux-x86_64-3.13/setuptools/lib.linux-x86_64-cpython-313/fbgemm_gpu/triton 2025-05-07T20:09:48.7712647Z INFO:root:copying _skbuild/linux-x86_64-3.13/cmake-install/fbgemm_gpu/triton/quantize_ref.py -> _skbuild/linux-x86_64-3.13/setuptools/lib.linux-x86_64-cpython-313/fbgemm_gpu/triton 2025-05-07T20:09:48.7714045Z INFO:root:copying _skbuild/linux-x86_64-3.13/cmake-install/fbgemm_gpu/utils/__init__.py -> _skbuild/linux-x86_64-3.13/setuptools/lib.linux-x86_64-cpython-313/fbgemm_gpu/utils 2025-05-07T20:09:48.7715453Z INFO:root:copying _skbuild/linux-x86_64-3.13/cmake-install/fbgemm_gpu/utils/filestore.py -> _skbuild/linux-x86_64-3.13/setuptools/lib.linux-x86_64-cpython-313/fbgemm_gpu/utils 2025-05-07T20:09:48.7717260Z INFO:root:copying _skbuild/linux-x86_64-3.13/cmake-install/fbgemm_gpu/utils/loader.py -> _skbuild/linux-x86_64-3.13/setuptools/lib.linux-x86_64-cpython-313/fbgemm_gpu/utils 2025-05-07T20:09:48.7718691Z INFO:root:copying _skbuild/linux-x86_64-3.13/cmake-install/fbgemm_gpu/utils/torch_library.py -> _skbuild/linux-x86_64-3.13/setuptools/lib.linux-x86_64-cpython-313/fbgemm_gpu/utils 2025-05-07T20:09:48.7720158Z INFO:root:copying _skbuild/linux-x86_64-3.13/cmake-install/fbgemm_gpu/sll/cpu/__init__.py -> _skbuild/linux-x86_64-3.13/setuptools/lib.linux-x86_64-cpython-313/fbgemm_gpu/sll/cpu 2025-05-07T20:09:48.7721586Z INFO:root:copying _skbuild/linux-x86_64-3.13/cmake-install/fbgemm_gpu/sll/cpu/cpu_sll.py -> _skbuild/linux-x86_64-3.13/setuptools/lib.linux-x86_64-cpython-313/fbgemm_gpu/sll/cpu 2025-05-07T20:09:48.7723223Z INFO:root:copying _skbuild/linux-x86_64-3.13/cmake-install/fbgemm_gpu/sll/meta/__init__.py -> _skbuild/linux-x86_64-3.13/setuptools/lib.linux-x86_64-cpython-313/fbgemm_gpu/sll/meta 2025-05-07T20:09:48.7724716Z INFO:root:copying _skbuild/linux-x86_64-3.13/cmake-install/fbgemm_gpu/sll/meta/meta_sll.py -> _skbuild/linux-x86_64-3.13/setuptools/lib.linux-x86_64-cpython-313/fbgemm_gpu/sll/meta 2025-05-07T20:09:48.7726278Z INFO:root:copying _skbuild/linux-x86_64-3.13/cmake-install/fbgemm_gpu/sll/triton/__init__.py -> _skbuild/linux-x86_64-3.13/setuptools/lib.linux-x86_64-cpython-313/fbgemm_gpu/sll/triton 2025-05-07T20:09:48.7727789Z INFO:root:copying _skbuild/linux-x86_64-3.13/cmake-install/fbgemm_gpu/sll/triton/common.py -> _skbuild/linux-x86_64-3.13/setuptools/lib.linux-x86_64-cpython-313/fbgemm_gpu/sll/triton 2025-05-07T20:09:48.7729536Z INFO:root:copying _skbuild/linux-x86_64-3.13/cmake-install/fbgemm_gpu/sll/triton/triton_dense_jagged_cat_jagged_out.py -> _skbuild/linux-x86_64-3.13/setuptools/lib.linux-x86_64-cpython-313/fbgemm_gpu/sll/triton 2025-05-07T20:09:48.7731248Z INFO:root:copying _skbuild/linux-x86_64-3.13/cmake-install/fbgemm_gpu/sll/triton/triton_jagged2_to_padded_dense.py -> _skbuild/linux-x86_64-3.13/setuptools/lib.linux-x86_64-cpython-313/fbgemm_gpu/sll/triton 2025-05-07T20:09:48.7732803Z INFO:root:copying _skbuild/linux-x86_64-3.13/cmake-install/fbgemm_gpu/sll/triton/triton_jagged_bmm.py -> _skbuild/linux-x86_64-3.13/setuptools/lib.linux-x86_64-cpython-313/fbgemm_gpu/sll/triton 2025-05-07T20:09:48.7734463Z INFO:root:copying _skbuild/linux-x86_64-3.13/cmake-install/fbgemm_gpu/sll/triton/triton_jagged_bmm_jagged_out.py -> _skbuild/linux-x86_64-3.13/setuptools/lib.linux-x86_64-cpython-313/fbgemm_gpu/sll/triton 2025-05-07T20:09:48.7736114Z INFO:root:copying _skbuild/linux-x86_64-3.13/cmake-install/fbgemm_gpu/sll/triton/triton_jagged_dense_elementwise_add.py -> _skbuild/linux-x86_64-3.13/setuptools/lib.linux-x86_64-cpython-313/fbgemm_gpu/sll/triton 2025-05-07T20:09:48.7737796Z INFO:root:copying _skbuild/linux-x86_64-3.13/cmake-install/fbgemm_gpu/sll/triton/triton_jagged_dense_elementwise_mul_jagged_out.py -> _skbuild/linux-x86_64-3.13/setuptools/lib.linux-x86_64-cpython-313/fbgemm_gpu/sll/triton 2025-05-07T20:09:48.7739498Z INFO:root:copying _skbuild/linux-x86_64-3.13/cmake-install/fbgemm_gpu/sll/triton/triton_jagged_dense_flash_attention.py -> _skbuild/linux-x86_64-3.13/setuptools/lib.linux-x86_64-cpython-313/fbgemm_gpu/sll/triton 2025-05-07T20:09:48.7741234Z INFO:root:copying _skbuild/linux-x86_64-3.13/cmake-install/fbgemm_gpu/sll/triton/triton_jagged_flash_attention_basic.py -> _skbuild/linux-x86_64-3.13/setuptools/lib.linux-x86_64-cpython-313/fbgemm_gpu/sll/triton 2025-05-07T20:09:48.7742946Z INFO:root:copying _skbuild/linux-x86_64-3.13/cmake-install/fbgemm_gpu/sll/triton/triton_jagged_self_substraction_jagged_out.py -> _skbuild/linux-x86_64-3.13/setuptools/lib.linux-x86_64-cpython-313/fbgemm_gpu/sll/triton 2025-05-07T20:09:48.7744564Z INFO:root:copying _skbuild/linux-x86_64-3.13/cmake-install/fbgemm_gpu/sll/triton/triton_jagged_softmax.py -> _skbuild/linux-x86_64-3.13/setuptools/lib.linux-x86_64-cpython-313/fbgemm_gpu/sll/triton 2025-05-07T20:09:48.7746201Z INFO:root:copying _skbuild/linux-x86_64-3.13/cmake-install/fbgemm_gpu/sll/triton/triton_multi_head_jagged_flash_attention.py -> _skbuild/linux-x86_64-3.13/setuptools/lib.linux-x86_64-cpython-313/fbgemm_gpu/sll/triton 2025-05-07T20:09:48.7747763Z INFO:root:copying _skbuild/linux-x86_64-3.13/cmake-install/fbgemm_gpu/tbe/bench/__init__.py -> _skbuild/linux-x86_64-3.13/setuptools/lib.linux-x86_64-cpython-313/fbgemm_gpu/tbe/bench 2025-05-07T20:09:48.7749192Z INFO:root:copying _skbuild/linux-x86_64-3.13/cmake-install/fbgemm_gpu/tbe/bench/bench_config.py -> _skbuild/linux-x86_64-3.13/setuptools/lib.linux-x86_64-cpython-313/fbgemm_gpu/tbe/bench 2025-05-07T20:09:48.7750655Z INFO:root:copying _skbuild/linux-x86_64-3.13/cmake-install/fbgemm_gpu/tbe/bench/bench_runs.py -> _skbuild/linux-x86_64-3.13/setuptools/lib.linux-x86_64-cpython-313/fbgemm_gpu/tbe/bench 2025-05-07T20:09:48.7752108Z INFO:root:copying _skbuild/linux-x86_64-3.13/cmake-install/fbgemm_gpu/tbe/bench/eeg_cli.py -> _skbuild/linux-x86_64-3.13/setuptools/lib.linux-x86_64-cpython-313/fbgemm_gpu/tbe/bench 2025-05-07T20:09:48.7753596Z INFO:root:copying _skbuild/linux-x86_64-3.13/cmake-install/fbgemm_gpu/tbe/bench/embedding_ops_common_config.py -> _skbuild/linux-x86_64-3.13/setuptools/lib.linux-x86_64-cpython-313/fbgemm_gpu/tbe/bench 2025-05-07T20:09:48.7755292Z INFO:root:copying _skbuild/linux-x86_64-3.13/cmake-install/fbgemm_gpu/tbe/bench/eval_compression.py -> _skbuild/linux-x86_64-3.13/setuptools/lib.linux-x86_64-cpython-313/fbgemm_gpu/tbe/bench 2025-05-07T20:09:48.7758610Z INFO:root:copying _skbuild/linux-x86_64-3.13/cmake-install/fbgemm_gpu/tbe/bench/reporter.py -> _skbuild/linux-x86_64-3.13/setuptools/lib.linux-x86_64-cpython-313/fbgemm_gpu/tbe/bench 2025-05-07T20:09:48.7760086Z INFO:root:copying _skbuild/linux-x86_64-3.13/cmake-install/fbgemm_gpu/tbe/bench/tbe_data_config.py -> _skbuild/linux-x86_64-3.13/setuptools/lib.linux-x86_64-cpython-313/fbgemm_gpu/tbe/bench 2025-05-07T20:09:48.7761580Z INFO:root:copying _skbuild/linux-x86_64-3.13/cmake-install/fbgemm_gpu/tbe/bench/tbe_data_config_loader.py -> _skbuild/linux-x86_64-3.13/setuptools/lib.linux-x86_64-cpython-313/fbgemm_gpu/tbe/bench 2025-05-07T20:09:48.7763152Z INFO:root:copying _skbuild/linux-x86_64-3.13/cmake-install/fbgemm_gpu/tbe/bench/tbe_data_config_param_models.py -> _skbuild/linux-x86_64-3.13/setuptools/lib.linux-x86_64-cpython-313/fbgemm_gpu/tbe/bench 2025-05-07T20:09:48.7764700Z INFO:root:copying _skbuild/linux-x86_64-3.13/cmake-install/fbgemm_gpu/tbe/bench/utils.py -> _skbuild/linux-x86_64-3.13/setuptools/lib.linux-x86_64-cpython-313/fbgemm_gpu/tbe/bench 2025-05-07T20:09:48.7766086Z INFO:root:copying _skbuild/linux-x86_64-3.13/cmake-install/fbgemm_gpu/tbe/cache/__init__.py -> _skbuild/linux-x86_64-3.13/setuptools/lib.linux-x86_64-cpython-313/fbgemm_gpu/tbe/cache 2025-05-07T20:09:48.7767577Z INFO:root:copying _skbuild/linux-x86_64-3.13/cmake-install/fbgemm_gpu/tbe/cache/split_embeddings_cache_ops.py -> _skbuild/linux-x86_64-3.13/setuptools/lib.linux-x86_64-cpython-313/fbgemm_gpu/tbe/cache 2025-05-07T20:09:48.7769059Z INFO:root:copying _skbuild/linux-x86_64-3.13/cmake-install/fbgemm_gpu/tbe/ssd/__init__.py -> _skbuild/linux-x86_64-3.13/setuptools/lib.linux-x86_64-cpython-313/fbgemm_gpu/tbe/ssd 2025-05-07T20:09:48.7770515Z INFO:root:copying _skbuild/linux-x86_64-3.13/cmake-install/fbgemm_gpu/tbe/ssd/common.py -> _skbuild/linux-x86_64-3.13/setuptools/lib.linux-x86_64-cpython-313/fbgemm_gpu/tbe/ssd 2025-05-07T20:09:48.7771970Z INFO:root:copying _skbuild/linux-x86_64-3.13/cmake-install/fbgemm_gpu/tbe/ssd/inference.py -> _skbuild/linux-x86_64-3.13/setuptools/lib.linux-x86_64-cpython-313/fbgemm_gpu/tbe/ssd 2025-05-07T20:09:48.7773410Z INFO:root:copying _skbuild/linux-x86_64-3.13/cmake-install/fbgemm_gpu/tbe/ssd/training.py -> _skbuild/linux-x86_64-3.13/setuptools/lib.linux-x86_64-cpython-313/fbgemm_gpu/tbe/ssd 2025-05-07T20:09:48.7775769Z INFO:root:copying _skbuild/linux-x86_64-3.13/cmake-install/fbgemm_gpu/tbe/stats/__init__.py -> _skbuild/linux-x86_64-3.13/setuptools/lib.linux-x86_64-cpython-313/fbgemm_gpu/tbe/stats 2025-05-07T20:09:48.7777324Z INFO:root:copying _skbuild/linux-x86_64-3.13/cmake-install/fbgemm_gpu/tbe/stats/bench_params_reporter.py -> _skbuild/linux-x86_64-3.13/setuptools/lib.linux-x86_64-cpython-313/fbgemm_gpu/tbe/stats 2025-05-07T20:09:48.7778810Z INFO:root:copying _skbuild/linux-x86_64-3.13/cmake-install/fbgemm_gpu/tbe/utils/__init__.py -> _skbuild/linux-x86_64-3.13/setuptools/lib.linux-x86_64-cpython-313/fbgemm_gpu/tbe/utils 2025-05-07T20:09:48.7780287Z INFO:root:copying _skbuild/linux-x86_64-3.13/cmake-install/fbgemm_gpu/tbe/utils/common.py -> _skbuild/linux-x86_64-3.13/setuptools/lib.linux-x86_64-cpython-313/fbgemm_gpu/tbe/utils 2025-05-07T20:09:48.7781699Z INFO:root:copying _skbuild/linux-x86_64-3.13/cmake-install/fbgemm_gpu/tbe/utils/offsets.py -> _skbuild/linux-x86_64-3.13/setuptools/lib.linux-x86_64-cpython-313/fbgemm_gpu/tbe/utils 2025-05-07T20:09:48.7783142Z INFO:root:copying _skbuild/linux-x86_64-3.13/cmake-install/fbgemm_gpu/tbe/utils/quantize.py -> _skbuild/linux-x86_64-3.13/setuptools/lib.linux-x86_64-cpython-313/fbgemm_gpu/tbe/utils 2025-05-07T20:09:48.7784595Z INFO:root:copying _skbuild/linux-x86_64-3.13/cmake-install/fbgemm_gpu/tbe/utils/requests.py -> _skbuild/linux-x86_64-3.13/setuptools/lib.linux-x86_64-cpython-313/fbgemm_gpu/tbe/utils 2025-05-07T20:09:48.7786112Z INFO:root:copying _skbuild/linux-x86_64-3.13/cmake-install/fbgemm_gpu/tbe/ssd/utils/__init__.py -> _skbuild/linux-x86_64-3.13/setuptools/lib.linux-x86_64-cpython-313/fbgemm_gpu/tbe/ssd/utils 2025-05-07T20:09:48.7787911Z INFO:root:copying _skbuild/linux-x86_64-3.13/cmake-install/fbgemm_gpu/tbe/ssd/utils/partially_materialized_tensor.py -> _skbuild/linux-x86_64-3.13/setuptools/lib.linux-x86_64-cpython-313/fbgemm_gpu/tbe/ssd/utils 2025-05-07T20:09:48.7789501Z INFO:root:copying _skbuild/linux-x86_64-3.13/cmake-install/fbgemm_gpu/triton/jagged/__init__.py -> _skbuild/linux-x86_64-3.13/setuptools/lib.linux-x86_64-cpython-313/fbgemm_gpu/triton/jagged 2025-05-07T20:09:48.7791073Z INFO:root:copying _skbuild/linux-x86_64-3.13/cmake-install/fbgemm_gpu/triton/jagged/triton_jagged_tensor_ops.py -> _skbuild/linux-x86_64-3.13/setuptools/lib.linux-x86_64-cpython-313/fbgemm_gpu/triton/jagged 2025-05-07T20:09:48.7802336Z INFO:skbuild:copied 90 files 2025-05-07T20:09:48.7802660Z INFO:root:running build_ext 2025-05-07T20:09:48.7804483Z INFO:root:installing to _skbuild/linux-x86_64-3.13/setuptools/bdist.linux-x86_64/wheel 2025-05-07T20:09:48.7805058Z INFO:root:running install 2025-05-07T20:09:48.7859382Z INFO:root:running install_lib 2025-05-07T20:09:48.7860384Z INFO:root:creating _skbuild/linux-x86_64-3.13/setuptools/bdist.linux-x86_64/wheel 2025-05-07T20:09:48.7862184Z INFO:root:creating _skbuild/linux-x86_64-3.13/setuptools/bdist.linux-x86_64/wheel/fbgemm_gpu 2025-05-07T20:09:48.7863350Z INFO:root:creating _skbuild/linux-x86_64-3.13/setuptools/bdist.linux-x86_64/wheel/fbgemm_gpu/config 2025-05-07T20:09:48.7864555Z INFO:root:copying _skbuild/linux-x86_64-3.13/setuptools/lib.linux-x86_64-cpython-313/fbgemm_gpu/config/__init__.py -> _skbuild/linux-x86_64-3.13/setuptools/bdist.linux-x86_64/wheel/./fbgemm_gpu/config 2025-05-07T20:09:48.7866125Z INFO:root:copying _skbuild/linux-x86_64-3.13/setuptools/lib.linux-x86_64-cpython-313/fbgemm_gpu/config/feature_list.py -> _skbuild/linux-x86_64-3.13/setuptools/bdist.linux-x86_64/wheel/./fbgemm_gpu/config 2025-05-07T20:09:48.7867453Z INFO:root:creating _skbuild/linux-x86_64-3.13/setuptools/bdist.linux-x86_64/wheel/fbgemm_gpu/docs 2025-05-07T20:09:48.7868615Z INFO:root:copying _skbuild/linux-x86_64-3.13/setuptools/lib.linux-x86_64-cpython-313/fbgemm_gpu/docs/__init__.py -> _skbuild/linux-x86_64-3.13/setuptools/bdist.linux-x86_64/wheel/./fbgemm_gpu/docs 2025-05-07T20:09:48.7870111Z INFO:root:copying _skbuild/linux-x86_64-3.13/setuptools/lib.linux-x86_64-cpython-313/fbgemm_gpu/docs/common.py -> _skbuild/linux-x86_64-3.13/setuptools/bdist.linux-x86_64/wheel/./fbgemm_gpu/docs 2025-05-07T20:09:48.7871649Z INFO:root:copying _skbuild/linux-x86_64-3.13/setuptools/lib.linux-x86_64-cpython-313/fbgemm_gpu/docs/examples.py -> _skbuild/linux-x86_64-3.13/setuptools/bdist.linux-x86_64/wheel/./fbgemm_gpu/docs 2025-05-07T20:09:48.7873227Z INFO:root:copying _skbuild/linux-x86_64-3.13/setuptools/lib.linux-x86_64-cpython-313/fbgemm_gpu/docs/jagged_tensor_ops.py -> _skbuild/linux-x86_64-3.13/setuptools/bdist.linux-x86_64/wheel/./fbgemm_gpu/docs 2025-05-07T20:09:48.7874843Z INFO:root:copying _skbuild/linux-x86_64-3.13/setuptools/lib.linux-x86_64-cpython-313/fbgemm_gpu/docs/merge_pooled_embedding_ops.py -> _skbuild/linux-x86_64-3.13/setuptools/bdist.linux-x86_64/wheel/./fbgemm_gpu/docs 2025-05-07T20:09:48.7876528Z INFO:root:copying _skbuild/linux-x86_64-3.13/setuptools/lib.linux-x86_64-cpython-313/fbgemm_gpu/docs/permute_pooled_embedding_ops.py -> _skbuild/linux-x86_64-3.13/setuptools/bdist.linux-x86_64/wheel/./fbgemm_gpu/docs 2025-05-07T20:09:48.7878183Z INFO:root:copying _skbuild/linux-x86_64-3.13/setuptools/lib.linux-x86_64-cpython-313/fbgemm_gpu/docs/quantize_ops.py -> _skbuild/linux-x86_64-3.13/setuptools/bdist.linux-x86_64/wheel/./fbgemm_gpu/docs 2025-05-07T20:09:48.7879714Z INFO:root:copying _skbuild/linux-x86_64-3.13/setuptools/lib.linux-x86_64-cpython-313/fbgemm_gpu/docs/sparse_ops.py -> _skbuild/linux-x86_64-3.13/setuptools/bdist.linux-x86_64/wheel/./fbgemm_gpu/docs 2025-05-07T20:09:48.7881321Z INFO:root:copying _skbuild/linux-x86_64-3.13/setuptools/lib.linux-x86_64-cpython-313/fbgemm_gpu/docs/version.py -> _skbuild/linux-x86_64-3.13/setuptools/bdist.linux-x86_64/wheel/./fbgemm_gpu/docs 2025-05-07T20:09:48.7882551Z INFO:root:creating _skbuild/linux-x86_64-3.13/setuptools/bdist.linux-x86_64/wheel/fbgemm_gpu/quantize 2025-05-07T20:09:48.7883732Z INFO:root:copying _skbuild/linux-x86_64-3.13/setuptools/lib.linux-x86_64-cpython-313/fbgemm_gpu/quantize/__init__.py -> _skbuild/linux-x86_64-3.13/setuptools/bdist.linux-x86_64/wheel/./fbgemm_gpu/quantize 2025-05-07T20:09:48.7885353Z INFO:root:copying _skbuild/linux-x86_64-3.13/setuptools/lib.linux-x86_64-cpython-313/fbgemm_gpu/quantize/quantize_ops.py -> _skbuild/linux-x86_64-3.13/setuptools/bdist.linux-x86_64/wheel/./fbgemm_gpu/quantize 2025-05-07T20:09:48.7886548Z INFO:root:creating _skbuild/linux-x86_64-3.13/setuptools/bdist.linux-x86_64/wheel/fbgemm_gpu/sll 2025-05-07T20:09:48.7887311Z INFO:root:creating _skbuild/linux-x86_64-3.13/setuptools/bdist.linux-x86_64/wheel/fbgemm_gpu/sll/cpu 2025-05-07T20:09:48.7888494Z INFO:root:copying _skbuild/linux-x86_64-3.13/setuptools/lib.linux-x86_64-cpython-313/fbgemm_gpu/sll/cpu/__init__.py -> _skbuild/linux-x86_64-3.13/setuptools/bdist.linux-x86_64/wheel/./fbgemm_gpu/sll/cpu 2025-05-07T20:09:48.7890205Z INFO:root:copying _skbuild/linux-x86_64-3.13/setuptools/lib.linux-x86_64-cpython-313/fbgemm_gpu/sll/cpu/cpu_sll.py -> _skbuild/linux-x86_64-3.13/setuptools/bdist.linux-x86_64/wheel/./fbgemm_gpu/sll/cpu 2025-05-07T20:09:48.7891381Z INFO:root:creating _skbuild/linux-x86_64-3.13/setuptools/bdist.linux-x86_64/wheel/fbgemm_gpu/sll/meta 2025-05-07T20:09:48.7892579Z INFO:root:copying _skbuild/linux-x86_64-3.13/setuptools/lib.linux-x86_64-cpython-313/fbgemm_gpu/sll/meta/__init__.py -> _skbuild/linux-x86_64-3.13/setuptools/bdist.linux-x86_64/wheel/./fbgemm_gpu/sll/meta 2025-05-07T20:09:48.7894176Z INFO:root:copying _skbuild/linux-x86_64-3.13/setuptools/lib.linux-x86_64-cpython-313/fbgemm_gpu/sll/meta/meta_sll.py -> _skbuild/linux-x86_64-3.13/setuptools/bdist.linux-x86_64/wheel/./fbgemm_gpu/sll/meta 2025-05-07T20:09:48.7895388Z INFO:root:creating _skbuild/linux-x86_64-3.13/setuptools/bdist.linux-x86_64/wheel/fbgemm_gpu/sll/triton 2025-05-07T20:09:48.7896578Z INFO:root:copying _skbuild/linux-x86_64-3.13/setuptools/lib.linux-x86_64-cpython-313/fbgemm_gpu/sll/triton/__init__.py -> _skbuild/linux-x86_64-3.13/setuptools/bdist.linux-x86_64/wheel/./fbgemm_gpu/sll/triton 2025-05-07T20:09:48.7898171Z INFO:root:copying _skbuild/linux-x86_64-3.13/setuptools/lib.linux-x86_64-cpython-313/fbgemm_gpu/sll/triton/common.py -> _skbuild/linux-x86_64-3.13/setuptools/bdist.linux-x86_64/wheel/./fbgemm_gpu/sll/triton 2025-05-07T20:09:48.7899859Z INFO:root:copying _skbuild/linux-x86_64-3.13/setuptools/lib.linux-x86_64-cpython-313/fbgemm_gpu/sll/triton/triton_dense_jagged_cat_jagged_out.py -> _skbuild/linux-x86_64-3.13/setuptools/bdist.linux-x86_64/wheel/./fbgemm_gpu/sll/triton 2025-05-07T20:09:48.7901672Z INFO:root:copying _skbuild/linux-x86_64-3.13/setuptools/lib.linux-x86_64-cpython-313/fbgemm_gpu/sll/triton/triton_jagged2_to_padded_dense.py -> _skbuild/linux-x86_64-3.13/setuptools/bdist.linux-x86_64/wheel/./fbgemm_gpu/sll/triton 2025-05-07T20:09:48.7903401Z INFO:root:copying _skbuild/linux-x86_64-3.13/setuptools/lib.linux-x86_64-cpython-313/fbgemm_gpu/sll/triton/triton_jagged_bmm.py -> _skbuild/linux-x86_64-3.13/setuptools/bdist.linux-x86_64/wheel/./fbgemm_gpu/sll/triton 2025-05-07T20:09:48.7905105Z INFO:root:copying _skbuild/linux-x86_64-3.13/setuptools/lib.linux-x86_64-cpython-313/fbgemm_gpu/sll/triton/triton_jagged_bmm_jagged_out.py -> _skbuild/linux-x86_64-3.13/setuptools/bdist.linux-x86_64/wheel/./fbgemm_gpu/sll/triton 2025-05-07T20:09:48.7906902Z INFO:root:copying _skbuild/linux-x86_64-3.13/setuptools/lib.linux-x86_64-cpython-313/fbgemm_gpu/sll/triton/triton_jagged_dense_elementwise_add.py -> _skbuild/linux-x86_64-3.13/setuptools/bdist.linux-x86_64/wheel/./fbgemm_gpu/sll/triton 2025-05-07T20:09:48.7908808Z INFO:root:copying _skbuild/linux-x86_64-3.13/setuptools/lib.linux-x86_64-cpython-313/fbgemm_gpu/sll/triton/triton_jagged_dense_elementwise_mul_jagged_out.py -> _skbuild/linux-x86_64-3.13/setuptools/bdist.linux-x86_64/wheel/./fbgemm_gpu/sll/triton 2025-05-07T20:09:48.7910687Z INFO:root:copying _skbuild/linux-x86_64-3.13/setuptools/lib.linux-x86_64-cpython-313/fbgemm_gpu/sll/triton/triton_jagged_dense_flash_attention.py -> _skbuild/linux-x86_64-3.13/setuptools/bdist.linux-x86_64/wheel/./fbgemm_gpu/sll/triton 2025-05-07T20:09:48.7912520Z INFO:root:copying _skbuild/linux-x86_64-3.13/setuptools/lib.linux-x86_64-cpython-313/fbgemm_gpu/sll/triton/triton_jagged_flash_attention_basic.py -> _skbuild/linux-x86_64-3.13/setuptools/bdist.linux-x86_64/wheel/./fbgemm_gpu/sll/triton 2025-05-07T20:09:48.7914382Z INFO:root:copying _skbuild/linux-x86_64-3.13/setuptools/lib.linux-x86_64-cpython-313/fbgemm_gpu/sll/triton/triton_jagged_self_substraction_jagged_out.py -> _skbuild/linux-x86_64-3.13/setuptools/bdist.linux-x86_64/wheel/./fbgemm_gpu/sll/triton 2025-05-07T20:09:48.7916222Z INFO:root:copying _skbuild/linux-x86_64-3.13/setuptools/lib.linux-x86_64-cpython-313/fbgemm_gpu/sll/triton/triton_jagged_softmax.py -> _skbuild/linux-x86_64-3.13/setuptools/bdist.linux-x86_64/wheel/./fbgemm_gpu/sll/triton 2025-05-07T20:09:48.7918047Z INFO:root:copying _skbuild/linux-x86_64-3.13/setuptools/lib.linux-x86_64-cpython-313/fbgemm_gpu/sll/triton/triton_multi_head_jagged_flash_attention.py -> _skbuild/linux-x86_64-3.13/setuptools/bdist.linux-x86_64/wheel/./fbgemm_gpu/sll/triton 2025-05-07T20:09:48.7919746Z INFO:root:copying _skbuild/linux-x86_64-3.13/setuptools/lib.linux-x86_64-cpython-313/fbgemm_gpu/sll/__init__.py -> _skbuild/linux-x86_64-3.13/setuptools/bdist.linux-x86_64/wheel/./fbgemm_gpu/sll 2025-05-07T20:09:48.7920879Z INFO:root:creating _skbuild/linux-x86_64-3.13/setuptools/bdist.linux-x86_64/wheel/fbgemm_gpu/tbe 2025-05-07T20:09:48.7921649Z INFO:root:creating _skbuild/linux-x86_64-3.13/setuptools/bdist.linux-x86_64/wheel/fbgemm_gpu/tbe/bench 2025-05-07T20:09:48.7922897Z INFO:root:copying _skbuild/linux-x86_64-3.13/setuptools/lib.linux-x86_64-cpython-313/fbgemm_gpu/tbe/bench/__init__.py -> _skbuild/linux-x86_64-3.13/setuptools/bdist.linux-x86_64/wheel/./fbgemm_gpu/tbe/bench 2025-05-07T20:09:48.7924535Z INFO:root:copying _skbuild/linux-x86_64-3.13/setuptools/lib.linux-x86_64-cpython-313/fbgemm_gpu/tbe/bench/bench_config.py -> _skbuild/linux-x86_64-3.13/setuptools/bdist.linux-x86_64/wheel/./fbgemm_gpu/tbe/bench 2025-05-07T20:09:48.7926177Z INFO:root:copying _skbuild/linux-x86_64-3.13/setuptools/lib.linux-x86_64-cpython-313/fbgemm_gpu/tbe/bench/bench_runs.py -> _skbuild/linux-x86_64-3.13/setuptools/bdist.linux-x86_64/wheel/./fbgemm_gpu/tbe/bench 2025-05-07T20:09:48.7927800Z INFO:root:copying _skbuild/linux-x86_64-3.13/setuptools/lib.linux-x86_64-cpython-313/fbgemm_gpu/tbe/bench/eeg_cli.py -> _skbuild/linux-x86_64-3.13/setuptools/bdist.linux-x86_64/wheel/./fbgemm_gpu/tbe/bench 2025-05-07T20:09:48.7929650Z INFO:root:copying _skbuild/linux-x86_64-3.13/setuptools/lib.linux-x86_64-cpython-313/fbgemm_gpu/tbe/bench/embedding_ops_common_config.py -> _skbuild/linux-x86_64-3.13/setuptools/bdist.linux-x86_64/wheel/./fbgemm_gpu/tbe/bench 2025-05-07T20:09:48.7931429Z INFO:root:copying _skbuild/linux-x86_64-3.13/setuptools/lib.linux-x86_64-cpython-313/fbgemm_gpu/tbe/bench/eval_compression.py -> _skbuild/linux-x86_64-3.13/setuptools/bdist.linux-x86_64/wheel/./fbgemm_gpu/tbe/bench 2025-05-07T20:09:48.7933108Z INFO:root:copying _skbuild/linux-x86_64-3.13/setuptools/lib.linux-x86_64-cpython-313/fbgemm_gpu/tbe/bench/reporter.py -> _skbuild/linux-x86_64-3.13/setuptools/bdist.linux-x86_64/wheel/./fbgemm_gpu/tbe/bench 2025-05-07T20:09:48.7934768Z INFO:root:copying _skbuild/linux-x86_64-3.13/setuptools/lib.linux-x86_64-cpython-313/fbgemm_gpu/tbe/bench/tbe_data_config.py -> _skbuild/linux-x86_64-3.13/setuptools/bdist.linux-x86_64/wheel/./fbgemm_gpu/tbe/bench 2025-05-07T20:09:48.7936510Z INFO:root:copying _skbuild/linux-x86_64-3.13/setuptools/lib.linux-x86_64-cpython-313/fbgemm_gpu/tbe/bench/tbe_data_config_loader.py -> _skbuild/linux-x86_64-3.13/setuptools/bdist.linux-x86_64/wheel/./fbgemm_gpu/tbe/bench 2025-05-07T20:09:48.7938287Z INFO:root:copying _skbuild/linux-x86_64-3.13/setuptools/lib.linux-x86_64-cpython-313/fbgemm_gpu/tbe/bench/tbe_data_config_param_models.py -> _skbuild/linux-x86_64-3.13/setuptools/bdist.linux-x86_64/wheel/./fbgemm_gpu/tbe/bench 2025-05-07T20:09:48.7939946Z INFO:root:copying _skbuild/linux-x86_64-3.13/setuptools/lib.linux-x86_64-cpython-313/fbgemm_gpu/tbe/bench/utils.py -> _skbuild/linux-x86_64-3.13/setuptools/bdist.linux-x86_64/wheel/./fbgemm_gpu/tbe/bench 2025-05-07T20:09:48.7941106Z INFO:root:creating _skbuild/linux-x86_64-3.13/setuptools/bdist.linux-x86_64/wheel/fbgemm_gpu/tbe/cache 2025-05-07T20:09:48.7942273Z INFO:root:copying _skbuild/linux-x86_64-3.13/setuptools/lib.linux-x86_64-cpython-313/fbgemm_gpu/tbe/cache/__init__.py -> _skbuild/linux-x86_64-3.13/setuptools/bdist.linux-x86_64/wheel/./fbgemm_gpu/tbe/cache 2025-05-07T20:09:48.7943920Z INFO:root:copying _skbuild/linux-x86_64-3.13/setuptools/lib.linux-x86_64-cpython-313/fbgemm_gpu/tbe/cache/split_embeddings_cache_ops.py -> _skbuild/linux-x86_64-3.13/setuptools/bdist.linux-x86_64/wheel/./fbgemm_gpu/tbe/cache 2025-05-07T20:09:48.7945193Z INFO:root:creating _skbuild/linux-x86_64-3.13/setuptools/bdist.linux-x86_64/wheel/fbgemm_gpu/tbe/ssd 2025-05-07T20:09:48.7945975Z INFO:root:creating _skbuild/linux-x86_64-3.13/setuptools/bdist.linux-x86_64/wheel/fbgemm_gpu/tbe/ssd/utils 2025-05-07T20:09:48.7947191Z INFO:root:copying _skbuild/linux-x86_64-3.13/setuptools/lib.linux-x86_64-cpython-313/fbgemm_gpu/tbe/ssd/utils/__init__.py -> _skbuild/linux-x86_64-3.13/setuptools/bdist.linux-x86_64/wheel/./fbgemm_gpu/tbe/ssd/utils 2025-05-07T20:09:48.7948926Z INFO:root:copying _skbuild/linux-x86_64-3.13/setuptools/lib.linux-x86_64-cpython-313/fbgemm_gpu/tbe/ssd/utils/partially_materialized_tensor.py -> _skbuild/linux-x86_64-3.13/setuptools/bdist.linux-x86_64/wheel/./fbgemm_gpu/tbe/ssd/utils 2025-05-07T20:09:48.7950682Z INFO:root:copying _skbuild/linux-x86_64-3.13/setuptools/lib.linux-x86_64-cpython-313/fbgemm_gpu/tbe/ssd/__init__.py -> _skbuild/linux-x86_64-3.13/setuptools/bdist.linux-x86_64/wheel/./fbgemm_gpu/tbe/ssd 2025-05-07T20:09:48.7952211Z INFO:root:copying _skbuild/linux-x86_64-3.13/setuptools/lib.linux-x86_64-cpython-313/fbgemm_gpu/tbe/ssd/common.py -> _skbuild/linux-x86_64-3.13/setuptools/bdist.linux-x86_64/wheel/./fbgemm_gpu/tbe/ssd 2025-05-07T20:09:48.7953754Z INFO:root:copying _skbuild/linux-x86_64-3.13/setuptools/lib.linux-x86_64-cpython-313/fbgemm_gpu/tbe/ssd/inference.py -> _skbuild/linux-x86_64-3.13/setuptools/bdist.linux-x86_64/wheel/./fbgemm_gpu/tbe/ssd 2025-05-07T20:09:48.7955323Z INFO:root:copying _skbuild/linux-x86_64-3.13/setuptools/lib.linux-x86_64-cpython-313/fbgemm_gpu/tbe/ssd/training.py -> _skbuild/linux-x86_64-3.13/setuptools/bdist.linux-x86_64/wheel/./fbgemm_gpu/tbe/ssd 2025-05-07T20:09:48.7956491Z INFO:root:creating _skbuild/linux-x86_64-3.13/setuptools/bdist.linux-x86_64/wheel/fbgemm_gpu/tbe/stats 2025-05-07T20:09:48.7957663Z INFO:root:copying _skbuild/linux-x86_64-3.13/setuptools/lib.linux-x86_64-cpython-313/fbgemm_gpu/tbe/stats/__init__.py -> _skbuild/linux-x86_64-3.13/setuptools/bdist.linux-x86_64/wheel/./fbgemm_gpu/tbe/stats 2025-05-07T20:09:48.7959331Z INFO:root:copying _skbuild/linux-x86_64-3.13/setuptools/lib.linux-x86_64-cpython-313/fbgemm_gpu/tbe/stats/bench_params_reporter.py -> _skbuild/linux-x86_64-3.13/setuptools/bdist.linux-x86_64/wheel/./fbgemm_gpu/tbe/stats 2025-05-07T20:09:48.7960603Z INFO:root:creating _skbuild/linux-x86_64-3.13/setuptools/bdist.linux-x86_64/wheel/fbgemm_gpu/tbe/utils 2025-05-07T20:09:48.7961791Z INFO:root:copying _skbuild/linux-x86_64-3.13/setuptools/lib.linux-x86_64-cpython-313/fbgemm_gpu/tbe/utils/__init__.py -> _skbuild/linux-x86_64-3.13/setuptools/bdist.linux-x86_64/wheel/./fbgemm_gpu/tbe/utils 2025-05-07T20:09:48.7963434Z INFO:root:copying _skbuild/linux-x86_64-3.13/setuptools/lib.linux-x86_64-cpython-313/fbgemm_gpu/tbe/utils/common.py -> _skbuild/linux-x86_64-3.13/setuptools/bdist.linux-x86_64/wheel/./fbgemm_gpu/tbe/utils 2025-05-07T20:09:48.7965098Z INFO:root:copying _skbuild/linux-x86_64-3.13/setuptools/lib.linux-x86_64-cpython-313/fbgemm_gpu/tbe/utils/offsets.py -> _skbuild/linux-x86_64-3.13/setuptools/bdist.linux-x86_64/wheel/./fbgemm_gpu/tbe/utils 2025-05-07T20:09:48.7966706Z INFO:root:copying _skbuild/linux-x86_64-3.13/setuptools/lib.linux-x86_64-cpython-313/fbgemm_gpu/tbe/utils/quantize.py -> _skbuild/linux-x86_64-3.13/setuptools/bdist.linux-x86_64/wheel/./fbgemm_gpu/tbe/utils 2025-05-07T20:09:48.7968340Z INFO:root:copying _skbuild/linux-x86_64-3.13/setuptools/lib.linux-x86_64-cpython-313/fbgemm_gpu/tbe/utils/requests.py -> _skbuild/linux-x86_64-3.13/setuptools/bdist.linux-x86_64/wheel/./fbgemm_gpu/tbe/utils 2025-05-07T20:09:48.7969896Z INFO:root:copying _skbuild/linux-x86_64-3.13/setuptools/lib.linux-x86_64-cpython-313/fbgemm_gpu/tbe/__init__.py -> _skbuild/linux-x86_64-3.13/setuptools/bdist.linux-x86_64/wheel/./fbgemm_gpu/tbe 2025-05-07T20:09:48.7971064Z INFO:root:creating _skbuild/linux-x86_64-3.13/setuptools/bdist.linux-x86_64/wheel/fbgemm_gpu/triton 2025-05-07T20:09:48.7971915Z INFO:root:creating _skbuild/linux-x86_64-3.13/setuptools/bdist.linux-x86_64/wheel/fbgemm_gpu/triton/jagged 2025-05-07T20:09:48.7973162Z INFO:root:copying _skbuild/linux-x86_64-3.13/setuptools/lib.linux-x86_64-cpython-313/fbgemm_gpu/triton/jagged/__init__.py -> _skbuild/linux-x86_64-3.13/setuptools/bdist.linux-x86_64/wheel/./fbgemm_gpu/triton/jagged 2025-05-07T20:09:48.7974926Z INFO:root:copying _skbuild/linux-x86_64-3.13/setuptools/lib.linux-x86_64-cpython-313/fbgemm_gpu/triton/jagged/triton_jagged_tensor_ops.py -> _skbuild/linux-x86_64-3.13/setuptools/bdist.linux-x86_64/wheel/./fbgemm_gpu/triton/jagged 2025-05-07T20:09:48.7976620Z INFO:root:copying _skbuild/linux-x86_64-3.13/setuptools/lib.linux-x86_64-cpython-313/fbgemm_gpu/triton/__init__.py -> _skbuild/linux-x86_64-3.13/setuptools/bdist.linux-x86_64/wheel/./fbgemm_gpu/triton 2025-05-07T20:09:48.7978206Z INFO:root:copying _skbuild/linux-x86_64-3.13/setuptools/lib.linux-x86_64-cpython-313/fbgemm_gpu/triton/common.py -> _skbuild/linux-x86_64-3.13/setuptools/bdist.linux-x86_64/wheel/./fbgemm_gpu/triton 2025-05-07T20:09:48.7979757Z INFO:root:copying _skbuild/linux-x86_64-3.13/setuptools/lib.linux-x86_64-cpython-313/fbgemm_gpu/triton/quantize.py -> _skbuild/linux-x86_64-3.13/setuptools/bdist.linux-x86_64/wheel/./fbgemm_gpu/triton 2025-05-07T20:09:48.7981320Z INFO:root:copying _skbuild/linux-x86_64-3.13/setuptools/lib.linux-x86_64-cpython-313/fbgemm_gpu/triton/quantize_ref.py -> _skbuild/linux-x86_64-3.13/setuptools/bdist.linux-x86_64/wheel/./fbgemm_gpu/triton 2025-05-07T20:09:48.7982471Z INFO:root:creating _skbuild/linux-x86_64-3.13/setuptools/bdist.linux-x86_64/wheel/fbgemm_gpu/utils 2025-05-07T20:09:48.7983662Z INFO:root:copying _skbuild/linux-x86_64-3.13/setuptools/lib.linux-x86_64-cpython-313/fbgemm_gpu/utils/__init__.py -> _skbuild/linux-x86_64-3.13/setuptools/bdist.linux-x86_64/wheel/./fbgemm_gpu/utils 2025-05-07T20:09:48.7985140Z INFO:root:copying _skbuild/linux-x86_64-3.13/setuptools/lib.linux-x86_64-cpython-313/fbgemm_gpu/utils/filestore.py -> _skbuild/linux-x86_64-3.13/setuptools/bdist.linux-x86_64/wheel/./fbgemm_gpu/utils 2025-05-07T20:09:48.7986614Z INFO:root:copying _skbuild/linux-x86_64-3.13/setuptools/lib.linux-x86_64-cpython-313/fbgemm_gpu/utils/loader.py -> _skbuild/linux-x86_64-3.13/setuptools/bdist.linux-x86_64/wheel/./fbgemm_gpu/utils 2025-05-07T20:09:48.7988118Z INFO:root:copying _skbuild/linux-x86_64-3.13/setuptools/lib.linux-x86_64-cpython-313/fbgemm_gpu/utils/torch_library.py -> _skbuild/linux-x86_64-3.13/setuptools/bdist.linux-x86_64/wheel/./fbgemm_gpu/utils 2025-05-07T20:09:48.7989560Z INFO:root:copying _skbuild/linux-x86_64-3.13/setuptools/lib.linux-x86_64-cpython-313/fbgemm_gpu/asmjit.so -> _skbuild/linux-x86_64-3.13/setuptools/bdist.linux-x86_64/wheel/./fbgemm_gpu 2025-05-07T20:09:48.7990940Z INFO:root:copying _skbuild/linux-x86_64-3.13/setuptools/lib.linux-x86_64-cpython-313/fbgemm_gpu/fbgemm.so -> _skbuild/linux-x86_64-3.13/setuptools/bdist.linux-x86_64/wheel/./fbgemm_gpu 2025-05-07T20:09:48.7999426Z INFO:root:copying _skbuild/linux-x86_64-3.13/setuptools/lib.linux-x86_64-cpython-313/fbgemm_gpu/fbgemm_gpu_tbe_cache.so -> _skbuild/linux-x86_64-3.13/setuptools/bdist.linux-x86_64/wheel/./fbgemm_gpu 2025-05-07T20:09:48.8132155Z INFO:root:copying _skbuild/linux-x86_64-3.13/setuptools/lib.linux-x86_64-cpython-313/fbgemm_gpu/fbgemm_gpu_tbe_inference.so -> _skbuild/linux-x86_64-3.13/setuptools/bdist.linux-x86_64/wheel/./fbgemm_gpu 2025-05-07T20:09:49.0895796Z INFO:root:copying _skbuild/linux-x86_64-3.13/setuptools/lib.linux-x86_64-cpython-313/fbgemm_gpu/fbgemm_gpu_config.so -> _skbuild/linux-x86_64-3.13/setuptools/bdist.linux-x86_64/wheel/./fbgemm_gpu 2025-05-07T20:09:49.0897382Z INFO:root:copying _skbuild/linux-x86_64-3.13/setuptools/lib.linux-x86_64-cpython-313/fbgemm_gpu/fbgemm_gpu_tbe_utils.so -> _skbuild/linux-x86_64-3.13/setuptools/bdist.linux-x86_64/wheel/./fbgemm_gpu 2025-05-07T20:09:49.1004833Z INFO:root:copying _skbuild/linux-x86_64-3.13/setuptools/lib.linux-x86_64-cpython-313/fbgemm_gpu/fbgemm_gpu_sparse_async_cumsum.so -> _skbuild/linux-x86_64-3.13/setuptools/bdist.linux-x86_64/wheel/./fbgemm_gpu 2025-05-07T20:09:49.1012899Z INFO:root:copying _skbuild/linux-x86_64-3.13/setuptools/lib.linux-x86_64-cpython-313/fbgemm_gpu/fbgemm_gpu_tbe_common.so -> _skbuild/linux-x86_64-3.13/setuptools/bdist.linux-x86_64/wheel/./fbgemm_gpu 2025-05-07T20:09:49.1033866Z INFO:root:copying _skbuild/linux-x86_64-3.13/setuptools/lib.linux-x86_64-cpython-313/fbgemm_gpu/fbgemm_gpu_tbe_optimizers.so -> _skbuild/linux-x86_64-3.13/setuptools/bdist.linux-x86_64/wheel/./fbgemm_gpu 2025-05-07T20:09:49.1092746Z INFO:root:copying _skbuild/linux-x86_64-3.13/setuptools/lib.linux-x86_64-cpython-313/fbgemm_gpu/fbgemm_gpu_tbe_training_forward.so -> _skbuild/linux-x86_64-3.13/setuptools/bdist.linux-x86_64/wheel/./fbgemm_gpu 2025-05-07T20:09:49.3282624Z INFO:root:copying _skbuild/linux-x86_64-3.13/setuptools/lib.linux-x86_64-cpython-313/fbgemm_gpu/fbgemm_gpu_tbe_training_backward_pt2.so -> _skbuild/linux-x86_64-3.13/setuptools/bdist.linux-x86_64/wheel/./fbgemm_gpu 2025-05-07T20:09:49.3328805Z INFO:root:copying _skbuild/linux-x86_64-3.13/setuptools/lib.linux-x86_64-cpython-313/fbgemm_gpu/fbgemm_gpu_tbe_training_backward.so -> _skbuild/linux-x86_64-3.13/setuptools/bdist.linux-x86_64/wheel/./fbgemm_gpu 2025-05-07T20:09:49.8971794Z INFO:root:copying _skbuild/linux-x86_64-3.13/setuptools/lib.linux-x86_64-cpython-313/fbgemm_gpu/fbgemm_gpu_tbe_training_backward_gwd.so -> _skbuild/linux-x86_64-3.13/setuptools/bdist.linux-x86_64/wheel/./fbgemm_gpu 2025-05-07T20:09:49.9853320Z INFO:root:copying _skbuild/linux-x86_64-3.13/setuptools/lib.linux-x86_64-cpython-313/fbgemm_gpu/fbgemm_gpu_tbe_training_backward_vbe.so -> _skbuild/linux-x86_64-3.13/setuptools/bdist.linux-x86_64/wheel/./fbgemm_gpu 2025-05-07T20:09:50.1875571Z INFO:root:copying _skbuild/linux-x86_64-3.13/setuptools/lib.linux-x86_64-cpython-313/fbgemm_gpu/fbgemm_gpu_tbe_training_backward_dense.so -> _skbuild/linux-x86_64-3.13/setuptools/bdist.linux-x86_64/wheel/./fbgemm_gpu 2025-05-07T20:09:50.2236839Z INFO:root:copying _skbuild/linux-x86_64-3.13/setuptools/lib.linux-x86_64-cpython-313/fbgemm_gpu/fbgemm_gpu_tbe_training_backward_split_host.so -> _skbuild/linux-x86_64-3.13/setuptools/bdist.linux-x86_64/wheel/./fbgemm_gpu 2025-05-07T20:09:50.2257433Z INFO:root:copying _skbuild/linux-x86_64-3.13/setuptools/lib.linux-x86_64-cpython-313/fbgemm_gpu/fbgemm_gpu_tbe_index_select.so -> _skbuild/linux-x86_64-3.13/setuptools/bdist.linux-x86_64/wheel/./fbgemm_gpu 2025-05-07T20:09:50.2472724Z INFO:root:creating _skbuild/linux-x86_64-3.13/setuptools/bdist.linux-x86_64/wheel/fbgemm_gpu/split_embedding_codegen_lookup_invokers 2025-05-07T20:09:50.2474409Z INFO:root:copying _skbuild/linux-x86_64-3.13/setuptools/lib.linux-x86_64-cpython-313/fbgemm_gpu/split_embedding_codegen_lookup_invokers/lookup_adagrad.py -> _skbuild/linux-x86_64-3.13/setuptools/bdist.linux-x86_64/wheel/./fbgemm_gpu/split_embedding_codegen_lookup_invokers 2025-05-07T20:09:50.2476900Z INFO:root:copying _skbuild/linux-x86_64-3.13/setuptools/lib.linux-x86_64-cpython-313/fbgemm_gpu/split_embedding_codegen_lookup_invokers/lookup_rowwise_adagrad.py -> _skbuild/linux-x86_64-3.13/setuptools/bdist.linux-x86_64/wheel/./fbgemm_gpu/split_embedding_codegen_lookup_invokers 2025-05-07T20:09:50.2479052Z INFO:root:copying _skbuild/linux-x86_64-3.13/setuptools/lib.linux-x86_64-cpython-313/fbgemm_gpu/split_embedding_codegen_lookup_invokers/lookup_sgd.py -> _skbuild/linux-x86_64-3.13/setuptools/bdist.linux-x86_64/wheel/./fbgemm_gpu/split_embedding_codegen_lookup_invokers 2025-05-07T20:09:50.2481133Z INFO:root:copying _skbuild/linux-x86_64-3.13/setuptools/lib.linux-x86_64-cpython-313/fbgemm_gpu/split_embedding_codegen_lookup_invokers/lookup_adam.py -> _skbuild/linux-x86_64-3.13/setuptools/bdist.linux-x86_64/wheel/./fbgemm_gpu/split_embedding_codegen_lookup_invokers 2025-05-07T20:09:50.2483229Z INFO:root:copying _skbuild/linux-x86_64-3.13/setuptools/lib.linux-x86_64-cpython-313/fbgemm_gpu/split_embedding_codegen_lookup_invokers/lookup_lamb.py -> _skbuild/linux-x86_64-3.13/setuptools/bdist.linux-x86_64/wheel/./fbgemm_gpu/split_embedding_codegen_lookup_invokers 2025-05-07T20:09:50.2485396Z INFO:root:copying _skbuild/linux-x86_64-3.13/setuptools/lib.linux-x86_64-cpython-313/fbgemm_gpu/split_embedding_codegen_lookup_invokers/lookup_lars_sgd.py -> _skbuild/linux-x86_64-3.13/setuptools/bdist.linux-x86_64/wheel/./fbgemm_gpu/split_embedding_codegen_lookup_invokers 2025-05-07T20:09:50.2487623Z INFO:root:copying _skbuild/linux-x86_64-3.13/setuptools/lib.linux-x86_64-cpython-313/fbgemm_gpu/split_embedding_codegen_lookup_invokers/lookup_partial_rowwise_adam.py -> _skbuild/linux-x86_64-3.13/setuptools/bdist.linux-x86_64/wheel/./fbgemm_gpu/split_embedding_codegen_lookup_invokers 2025-05-07T20:09:50.2489907Z INFO:root:copying _skbuild/linux-x86_64-3.13/setuptools/lib.linux-x86_64-cpython-313/fbgemm_gpu/split_embedding_codegen_lookup_invokers/lookup_partial_rowwise_lamb.py -> _skbuild/linux-x86_64-3.13/setuptools/bdist.linux-x86_64/wheel/./fbgemm_gpu/split_embedding_codegen_lookup_invokers 2025-05-07T20:09:50.2492291Z INFO:root:copying _skbuild/linux-x86_64-3.13/setuptools/lib.linux-x86_64-cpython-313/fbgemm_gpu/split_embedding_codegen_lookup_invokers/lookup_none.py -> _skbuild/linux-x86_64-3.13/setuptools/bdist.linux-x86_64/wheel/./fbgemm_gpu/split_embedding_codegen_lookup_invokers 2025-05-07T20:09:50.2494487Z INFO:root:copying _skbuild/linux-x86_64-3.13/setuptools/lib.linux-x86_64-cpython-313/fbgemm_gpu/split_embedding_codegen_lookup_invokers/lookup_rowwise_adagrad_with_counter.py -> _skbuild/linux-x86_64-3.13/setuptools/bdist.linux-x86_64/wheel/./fbgemm_gpu/split_embedding_codegen_lookup_invokers 2025-05-07T20:09:50.2496766Z INFO:root:copying _skbuild/linux-x86_64-3.13/setuptools/lib.linux-x86_64-cpython-313/fbgemm_gpu/split_embedding_codegen_lookup_invokers/lookup_rowwise_adagrad_ssd.py -> _skbuild/linux-x86_64-3.13/setuptools/bdist.linux-x86_64/wheel/./fbgemm_gpu/split_embedding_codegen_lookup_invokers 2025-05-07T20:09:50.2498937Z INFO:root:copying _skbuild/linux-x86_64-3.13/setuptools/lib.linux-x86_64-cpython-313/fbgemm_gpu/split_embedding_codegen_lookup_invokers/__init__.py -> _skbuild/linux-x86_64-3.13/setuptools/bdist.linux-x86_64/wheel/./fbgemm_gpu/split_embedding_codegen_lookup_invokers 2025-05-07T20:09:50.2501018Z INFO:root:copying _skbuild/linux-x86_64-3.13/setuptools/lib.linux-x86_64-cpython-313/fbgemm_gpu/split_embedding_codegen_lookup_invokers/lookup_args.py -> _skbuild/linux-x86_64-3.13/setuptools/bdist.linux-x86_64/wheel/./fbgemm_gpu/split_embedding_codegen_lookup_invokers 2025-05-07T20:09:50.2503155Z INFO:root:copying _skbuild/linux-x86_64-3.13/setuptools/lib.linux-x86_64-cpython-313/fbgemm_gpu/split_embedding_codegen_lookup_invokers/lookup_args_ssd.py -> _skbuild/linux-x86_64-3.13/setuptools/bdist.linux-x86_64/wheel/./fbgemm_gpu/split_embedding_codegen_lookup_invokers 2025-05-07T20:09:50.2504751Z INFO:root:creating _skbuild/linux-x86_64-3.13/setuptools/bdist.linux-x86_64/wheel/fbgemm_gpu/split_embedding_optimizer_codegen 2025-05-07T20:09:50.2506504Z INFO:root:copying _skbuild/linux-x86_64-3.13/setuptools/lib.linux-x86_64-cpython-313/fbgemm_gpu/split_embedding_optimizer_codegen/split_embedding_optimizer_rowwise_adagrad.py -> _skbuild/linux-x86_64-3.13/setuptools/bdist.linux-x86_64/wheel/./fbgemm_gpu/split_embedding_optimizer_codegen 2025-05-07T20:09:50.2508657Z INFO:root:copying _skbuild/linux-x86_64-3.13/setuptools/lib.linux-x86_64-cpython-313/fbgemm_gpu/split_embedding_optimizer_codegen/optimizer_args.py -> _skbuild/linux-x86_64-3.13/setuptools/bdist.linux-x86_64/wheel/./fbgemm_gpu/split_embedding_optimizer_codegen 2025-05-07T20:09:50.2510534Z INFO:root:copying _skbuild/linux-x86_64-3.13/setuptools/lib.linux-x86_64-cpython-313/fbgemm_gpu/fbgemm_gpu_embedding_inplace_ops.so -> _skbuild/linux-x86_64-3.13/setuptools/bdist.linux-x86_64/wheel/./fbgemm_gpu 2025-05-07T20:09:50.2512129Z INFO:root:copying _skbuild/linux-x86_64-3.13/setuptools/lib.linux-x86_64-cpython-313/fbgemm_gpu/fbgemm_gpu_py.so -> _skbuild/linux-x86_64-3.13/setuptools/bdist.linux-x86_64/wheel/./fbgemm_gpu 2025-05-07T20:09:50.2953919Z INFO:root:copying _skbuild/linux-x86_64-3.13/setuptools/lib.linux-x86_64-cpython-313/fbgemm_gpu/__init__.py -> _skbuild/linux-x86_64-3.13/setuptools/bdist.linux-x86_64/wheel/./fbgemm_gpu 2025-05-07T20:09:50.2955783Z INFO:root:copying _skbuild/linux-x86_64-3.13/setuptools/lib.linux-x86_64-cpython-313/fbgemm_gpu/batched_unary_embeddings_ops.py -> _skbuild/linux-x86_64-3.13/setuptools/bdist.linux-x86_64/wheel/./fbgemm_gpu 2025-05-07T20:09:50.2957324Z INFO:root:copying _skbuild/linux-x86_64-3.13/setuptools/lib.linux-x86_64-cpython-313/fbgemm_gpu/enums.py -> _skbuild/linux-x86_64-3.13/setuptools/bdist.linux-x86_64/wheel/./fbgemm_gpu 2025-05-07T20:09:50.2958744Z INFO:root:copying _skbuild/linux-x86_64-3.13/setuptools/lib.linux-x86_64-cpython-313/fbgemm_gpu/metrics.py -> _skbuild/linux-x86_64-3.13/setuptools/bdist.linux-x86_64/wheel/./fbgemm_gpu 2025-05-07T20:09:50.2960335Z INFO:root:copying _skbuild/linux-x86_64-3.13/setuptools/lib.linux-x86_64-cpython-313/fbgemm_gpu/permute_pooled_embedding_modules.py -> _skbuild/linux-x86_64-3.13/setuptools/bdist.linux-x86_64/wheel/./fbgemm_gpu 2025-05-07T20:09:50.2962110Z INFO:root:copying _skbuild/linux-x86_64-3.13/setuptools/lib.linux-x86_64-cpython-313/fbgemm_gpu/permute_pooled_embedding_modules_split.py -> _skbuild/linux-x86_64-3.13/setuptools/bdist.linux-x86_64/wheel/./fbgemm_gpu 2025-05-07T20:09:50.2963673Z INFO:root:copying _skbuild/linux-x86_64-3.13/setuptools/lib.linux-x86_64-cpython-313/fbgemm_gpu/quantize_comm.py -> _skbuild/linux-x86_64-3.13/setuptools/bdist.linux-x86_64/wheel/./fbgemm_gpu 2025-05-07T20:09:50.2965186Z INFO:root:copying _skbuild/linux-x86_64-3.13/setuptools/lib.linux-x86_64-cpython-313/fbgemm_gpu/quantize_utils.py -> _skbuild/linux-x86_64-3.13/setuptools/bdist.linux-x86_64/wheel/./fbgemm_gpu 2025-05-07T20:09:50.2966692Z INFO:root:copying _skbuild/linux-x86_64-3.13/setuptools/lib.linux-x86_64-cpython-313/fbgemm_gpu/runtime_monitor.py -> _skbuild/linux-x86_64-3.13/setuptools/bdist.linux-x86_64/wheel/./fbgemm_gpu 2025-05-07T20:09:50.2968167Z INFO:root:copying _skbuild/linux-x86_64-3.13/setuptools/lib.linux-x86_64-cpython-313/fbgemm_gpu/sparse_ops.py -> _skbuild/linux-x86_64-3.13/setuptools/bdist.linux-x86_64/wheel/./fbgemm_gpu 2025-05-07T20:09:50.2969694Z INFO:root:copying _skbuild/linux-x86_64-3.13/setuptools/lib.linux-x86_64-cpython-313/fbgemm_gpu/split_embedding_configs.py -> _skbuild/linux-x86_64-3.13/setuptools/bdist.linux-x86_64/wheel/./fbgemm_gpu 2025-05-07T20:09:50.2971422Z INFO:root:copying _skbuild/linux-x86_64-3.13/setuptools/lib.linux-x86_64-cpython-313/fbgemm_gpu/split_embedding_inference_converter.py -> _skbuild/linux-x86_64-3.13/setuptools/bdist.linux-x86_64/wheel/./fbgemm_gpu 2025-05-07T20:09:50.2973092Z INFO:root:copying _skbuild/linux-x86_64-3.13/setuptools/lib.linux-x86_64-cpython-313/fbgemm_gpu/split_embedding_optimizer_ops.py -> _skbuild/linux-x86_64-3.13/setuptools/bdist.linux-x86_64/wheel/./fbgemm_gpu 2025-05-07T20:09:50.2974764Z INFO:root:copying _skbuild/linux-x86_64-3.13/setuptools/lib.linux-x86_64-cpython-313/fbgemm_gpu/split_embedding_utils.py -> _skbuild/linux-x86_64-3.13/setuptools/bdist.linux-x86_64/wheel/./fbgemm_gpu 2025-05-07T20:09:50.2976435Z INFO:root:copying _skbuild/linux-x86_64-3.13/setuptools/lib.linux-x86_64-cpython-313/fbgemm_gpu/split_table_batched_embeddings_ops.py -> _skbuild/linux-x86_64-3.13/setuptools/bdist.linux-x86_64/wheel/./fbgemm_gpu 2025-05-07T20:09:50.2978126Z INFO:root:copying _skbuild/linux-x86_64-3.13/setuptools/lib.linux-x86_64-cpython-313/fbgemm_gpu/split_table_batched_embeddings_ops_common.py -> _skbuild/linux-x86_64-3.13/setuptools/bdist.linux-x86_64/wheel/./fbgemm_gpu 2025-05-07T20:09:50.2979817Z INFO:root:copying _skbuild/linux-x86_64-3.13/setuptools/lib.linux-x86_64-cpython-313/fbgemm_gpu/split_table_batched_embeddings_ops_inference.py -> _skbuild/linux-x86_64-3.13/setuptools/bdist.linux-x86_64/wheel/./fbgemm_gpu 2025-05-07T20:09:50.2981561Z INFO:root:copying _skbuild/linux-x86_64-3.13/setuptools/lib.linux-x86_64-cpython-313/fbgemm_gpu/split_table_batched_embeddings_ops_training.py -> _skbuild/linux-x86_64-3.13/setuptools/bdist.linux-x86_64/wheel/./fbgemm_gpu 2025-05-07T20:09:50.2983361Z INFO:root:copying _skbuild/linux-x86_64-3.13/setuptools/lib.linux-x86_64-cpython-313/fbgemm_gpu/split_table_batched_embeddings_ops_training_common.py -> _skbuild/linux-x86_64-3.13/setuptools/bdist.linux-x86_64/wheel/./fbgemm_gpu 2025-05-07T20:09:50.2985089Z INFO:root:copying _skbuild/linux-x86_64-3.13/setuptools/lib.linux-x86_64-cpython-313/fbgemm_gpu/ssd_split_table_batched_embeddings_ops.py -> _skbuild/linux-x86_64-3.13/setuptools/bdist.linux-x86_64/wheel/./fbgemm_gpu 2025-05-07T20:09:50.2986711Z INFO:root:copying _skbuild/linux-x86_64-3.13/setuptools/lib.linux-x86_64-cpython-313/fbgemm_gpu/tbe_input_multiplexer.py -> _skbuild/linux-x86_64-3.13/setuptools/bdist.linux-x86_64/wheel/./fbgemm_gpu 2025-05-07T20:09:50.2988208Z INFO:root:copying _skbuild/linux-x86_64-3.13/setuptools/lib.linux-x86_64-cpython-313/fbgemm_gpu/uvm.py -> _skbuild/linux-x86_64-3.13/setuptools/bdist.linux-x86_64/wheel/./fbgemm_gpu 2025-05-07T20:09:50.2989102Z INFO:skbuild:copied 125 files 2025-05-07T20:09:50.2989452Z INFO:root:running install_egg_info 2025-05-07T20:09:50.3028064Z INFO:root:running egg_info 2025-05-07T20:09:50.3067054Z INFO:root:writing fbgemm_gpu_nightly.egg-info/PKG-INFO 2025-05-07T20:09:50.3068328Z INFO:root:writing dependency_links to fbgemm_gpu_nightly.egg-info/dependency_links.txt 2025-05-07T20:09:50.3069421Z INFO:root:writing requirements to fbgemm_gpu_nightly.egg-info/requires.txt 2025-05-07T20:09:50.3070396Z INFO:root:writing top-level names to fbgemm_gpu_nightly.egg-info/top_level.txt 2025-05-07T20:09:50.3163866Z INFO:root:reading manifest file 'fbgemm_gpu_nightly.egg-info/SOURCES.txt' 2025-05-07T20:09:50.3195882Z INFO:root:writing manifest file 'fbgemm_gpu_nightly.egg-info/SOURCES.txt' 2025-05-07T20:09:50.3197253Z INFO:root:Copying fbgemm_gpu_nightly.egg-info to _skbuild/linux-x86_64-3.13/setuptools/bdist.linux-x86_64/wheel/./fbgemm_gpu_nightly-2025.5.7-py3.13.egg-info 2025-05-07T20:09:50.3200037Z INFO:root:running install_scripts 2025-05-07T20:09:50.3200421Z INFO:skbuild:copied 0 files 2025-05-07T20:09:52.9848142Z INFO:root:creating _skbuild/linux-x86_64-3.13/setuptools/bdist.linux-x86_64/wheel/fbgemm_gpu_nightly-2025.5.7.dist-info/WHEEL 2025-05-07T20:09:52.9849601Z INFO:wheel:creating '/__w/FBGEMM/FBGEMM/fbgemm_gpu/dist/.tmp-3xp3vcls/fbgemm_gpu_nightly-2025.5.7-cp313-cp313-manylinux_2_28_x86_64.whl' and adding '_skbuild/linux-x86_64-3.13/setuptools/bdist.linux-x86_64/wheel' to it 2025-05-07T20:09:52.9850819Z INFO:wheel:adding 'fbgemm_gpu/__init__.py' 2025-05-07T20:09:53.0121643Z INFO:wheel:adding 'fbgemm_gpu/asmjit.so' 2025-05-07T20:09:53.0132119Z INFO:wheel:adding 'fbgemm_gpu/batched_unary_embeddings_ops.py' 2025-05-07T20:09:53.0132943Z INFO:wheel:adding 'fbgemm_gpu/enums.py' 2025-05-07T20:09:53.1748714Z INFO:wheel:adding 'fbgemm_gpu/fbgemm.so' 2025-05-07T20:09:53.1879903Z INFO:wheel:adding 'fbgemm_gpu/fbgemm_gpu_config.so' 2025-05-07T20:09:53.2013410Z INFO:wheel:adding 'fbgemm_gpu/fbgemm_gpu_embedding_inplace_ops.so' 2025-05-07T20:09:54.9715488Z INFO:wheel:adding 'fbgemm_gpu/fbgemm_gpu_py.so' 2025-05-07T20:09:55.1770034Z INFO:wheel:adding 'fbgemm_gpu/fbgemm_gpu_sparse_async_cumsum.so' 2025-05-07T20:09:55.8978446Z INFO:wheel:adding 'fbgemm_gpu/fbgemm_gpu_tbe_cache.so' 2025-05-07T20:09:56.0110976Z INFO:wheel:adding 'fbgemm_gpu/fbgemm_gpu_tbe_common.so' 2025-05-07T20:09:56.6084575Z INFO:wheel:adding 'fbgemm_gpu/fbgemm_gpu_tbe_index_select.so' 2025-05-07T20:10:14.7847458Z INFO:wheel:adding 'fbgemm_gpu/fbgemm_gpu_tbe_inference.so' 2025-05-07T20:10:16.0543984Z INFO:wheel:adding 'fbgemm_gpu/fbgemm_gpu_tbe_optimizers.so' 2025-05-07T20:10:44.2895942Z INFO:wheel:adding 'fbgemm_gpu/fbgemm_gpu_tbe_training_backward.so' 2025-05-07T20:10:47.1250314Z INFO:wheel:adding 'fbgemm_gpu/fbgemm_gpu_tbe_training_backward_dense.so' 2025-05-07T20:10:50.7971797Z INFO:wheel:adding 'fbgemm_gpu/fbgemm_gpu_tbe_training_backward_gwd.so' 2025-05-07T20:10:51.3892556Z INFO:wheel:adding 'fbgemm_gpu/fbgemm_gpu_tbe_training_backward_pt2.so' 2025-05-07T20:10:51.5642889Z INFO:wheel:adding 'fbgemm_gpu/fbgemm_gpu_tbe_training_backward_split_host.so' 2025-05-07T20:11:00.3329206Z INFO:wheel:adding 'fbgemm_gpu/fbgemm_gpu_tbe_training_backward_vbe.so' 2025-05-07T20:11:11.4956291Z INFO:wheel:adding 'fbgemm_gpu/fbgemm_gpu_tbe_training_forward.so' 2025-05-07T20:11:12.9769995Z INFO:wheel:adding 'fbgemm_gpu/fbgemm_gpu_tbe_utils.so' 2025-05-07T20:11:13.0125598Z INFO:wheel:adding 'fbgemm_gpu/metrics.py' 2025-05-07T20:11:13.0126218Z INFO:wheel:adding 'fbgemm_gpu/permute_pooled_embedding_modules.py' 2025-05-07T20:11:13.0130607Z INFO:wheel:adding 'fbgemm_gpu/permute_pooled_embedding_modules_split.py' 2025-05-07T20:11:13.0131224Z INFO:wheel:adding 'fbgemm_gpu/quantize_comm.py' 2025-05-07T20:11:13.0132610Z INFO:wheel:adding 'fbgemm_gpu/quantize_utils.py' 2025-05-07T20:11:13.0135567Z INFO:wheel:adding 'fbgemm_gpu/runtime_monitor.py' 2025-05-07T20:11:13.0146190Z INFO:wheel:adding 'fbgemm_gpu/sparse_ops.py' 2025-05-07T20:11:13.0149636Z INFO:wheel:adding 'fbgemm_gpu/split_embedding_configs.py' 2025-05-07T20:11:13.0152364Z INFO:wheel:adding 'fbgemm_gpu/split_embedding_inference_converter.py' 2025-05-07T20:11:13.0153787Z INFO:wheel:adding 'fbgemm_gpu/split_embedding_optimizer_ops.py' 2025-05-07T20:11:13.0155144Z INFO:wheel:adding 'fbgemm_gpu/split_embedding_utils.py' 2025-05-07T20:11:13.0156790Z INFO:wheel:adding 'fbgemm_gpu/split_table_batched_embeddings_ops.py' 2025-05-07T20:11:13.0159852Z INFO:wheel:adding 'fbgemm_gpu/split_table_batched_embeddings_ops_common.py' 2025-05-07T20:11:13.0180179Z INFO:wheel:adding 'fbgemm_gpu/split_table_batched_embeddings_ops_inference.py' 2025-05-07T20:11:13.0221887Z INFO:wheel:adding 'fbgemm_gpu/split_table_batched_embeddings_ops_training.py' 2025-05-07T20:11:13.0226924Z INFO:wheel:adding 'fbgemm_gpu/split_table_batched_embeddings_ops_training_common.py' 2025-05-07T20:11:13.0228313Z INFO:wheel:adding 'fbgemm_gpu/ssd_split_table_batched_embeddings_ops.py' 2025-05-07T20:11:13.0230395Z INFO:wheel:adding 'fbgemm_gpu/tbe_input_multiplexer.py' 2025-05-07T20:11:13.0231851Z INFO:wheel:adding 'fbgemm_gpu/uvm.py' 2025-05-07T20:11:13.0233722Z INFO:wheel:adding 'fbgemm_gpu/config/__init__.py' 2025-05-07T20:11:13.0235385Z INFO:wheel:adding 'fbgemm_gpu/config/feature_list.py' 2025-05-07T20:11:13.0237026Z INFO:wheel:adding 'fbgemm_gpu/docs/__init__.py' 2025-05-07T20:11:13.0238219Z INFO:wheel:adding 'fbgemm_gpu/docs/common.py' 2025-05-07T20:11:13.0239934Z INFO:wheel:adding 'fbgemm_gpu/docs/examples.py' 2025-05-07T20:11:13.0242200Z INFO:wheel:adding 'fbgemm_gpu/docs/jagged_tensor_ops.py' 2025-05-07T20:11:13.0243809Z INFO:wheel:adding 'fbgemm_gpu/docs/merge_pooled_embedding_ops.py' 2025-05-07T20:11:13.0245877Z INFO:wheel:adding 'fbgemm_gpu/docs/permute_pooled_embedding_ops.py' 2025-05-07T20:11:13.0247380Z INFO:wheel:adding 'fbgemm_gpu/docs/quantize_ops.py' 2025-05-07T20:11:13.0253193Z INFO:wheel:adding 'fbgemm_gpu/docs/sparse_ops.py' 2025-05-07T20:11:13.0254925Z INFO:wheel:adding 'fbgemm_gpu/docs/version.py' 2025-05-07T20:11:13.0256659Z INFO:wheel:adding 'fbgemm_gpu/quantize/__init__.py' 2025-05-07T20:11:13.0258130Z INFO:wheel:adding 'fbgemm_gpu/quantize/quantize_ops.py' 2025-05-07T20:11:13.0260040Z INFO:wheel:adding 'fbgemm_gpu/sll/__init__.py' 2025-05-07T20:11:13.0262006Z INFO:wheel:adding 'fbgemm_gpu/sll/cpu/__init__.py' 2025-05-07T20:11:13.0267920Z INFO:wheel:adding 'fbgemm_gpu/sll/cpu/cpu_sll.py' 2025-05-07T20:11:13.0270254Z INFO:wheel:adding 'fbgemm_gpu/sll/meta/__init__.py' 2025-05-07T20:11:13.0272523Z INFO:wheel:adding 'fbgemm_gpu/sll/meta/meta_sll.py' 2025-05-07T20:11:13.0274728Z INFO:wheel:adding 'fbgemm_gpu/sll/triton/__init__.py' 2025-05-07T20:11:13.0276171Z INFO:wheel:adding 'fbgemm_gpu/sll/triton/common.py' 2025-05-07T20:11:13.0277828Z INFO:wheel:adding 'fbgemm_gpu/sll/triton/triton_dense_jagged_cat_jagged_out.py' 2025-05-07T20:11:13.0280058Z INFO:wheel:adding 'fbgemm_gpu/sll/triton/triton_jagged2_to_padded_dense.py' 2025-05-07T20:11:13.0283413Z INFO:wheel:adding 'fbgemm_gpu/sll/triton/triton_jagged_bmm.py' 2025-05-07T20:11:13.0287832Z INFO:wheel:adding 'fbgemm_gpu/sll/triton/triton_jagged_bmm_jagged_out.py' 2025-05-07T20:11:13.0289852Z INFO:wheel:adding 'fbgemm_gpu/sll/triton/triton_jagged_dense_elementwise_add.py' 2025-05-07T20:11:13.0292062Z INFO:wheel:adding 'fbgemm_gpu/sll/triton/triton_jagged_dense_elementwise_mul_jagged_out.py' 2025-05-07T20:11:13.0297386Z INFO:wheel:adding 'fbgemm_gpu/sll/triton/triton_jagged_dense_flash_attention.py' 2025-05-07T20:11:13.0302561Z INFO:wheel:adding 'fbgemm_gpu/sll/triton/triton_jagged_flash_attention_basic.py' 2025-05-07T20:11:13.0304595Z INFO:wheel:adding 'fbgemm_gpu/sll/triton/triton_jagged_self_substraction_jagged_out.py' 2025-05-07T20:11:13.0308257Z INFO:wheel:adding 'fbgemm_gpu/sll/triton/triton_jagged_softmax.py' 2025-05-07T20:11:13.0313344Z INFO:wheel:adding 'fbgemm_gpu/sll/triton/triton_multi_head_jagged_flash_attention.py' 2025-05-07T20:11:13.0315841Z INFO:wheel:adding 'fbgemm_gpu/split_embedding_codegen_lookup_invokers/__init__.py' 2025-05-07T20:11:13.0318712Z INFO:wheel:adding 'fbgemm_gpu/split_embedding_codegen_lookup_invokers/lookup_adagrad.py' 2025-05-07T20:11:13.0322076Z INFO:wheel:adding 'fbgemm_gpu/split_embedding_codegen_lookup_invokers/lookup_adam.py' 2025-05-07T20:11:13.0324092Z INFO:wheel:adding 'fbgemm_gpu/split_embedding_codegen_lookup_invokers/lookup_args.py' 2025-05-07T20:11:13.0325838Z INFO:wheel:adding 'fbgemm_gpu/split_embedding_codegen_lookup_invokers/lookup_args_ssd.py' 2025-05-07T20:11:13.0328747Z INFO:wheel:adding 'fbgemm_gpu/split_embedding_codegen_lookup_invokers/lookup_lamb.py' 2025-05-07T20:11:13.0332153Z INFO:wheel:adding 'fbgemm_gpu/split_embedding_codegen_lookup_invokers/lookup_lars_sgd.py' 2025-05-07T20:11:13.0334816Z INFO:wheel:adding 'fbgemm_gpu/split_embedding_codegen_lookup_invokers/lookup_none.py' 2025-05-07T20:11:13.0337818Z INFO:wheel:adding 'fbgemm_gpu/split_embedding_codegen_lookup_invokers/lookup_partial_rowwise_adam.py' 2025-05-07T20:11:13.0340743Z INFO:wheel:adding 'fbgemm_gpu/split_embedding_codegen_lookup_invokers/lookup_partial_rowwise_lamb.py' 2025-05-07T20:11:13.0343607Z INFO:wheel:adding 'fbgemm_gpu/split_embedding_codegen_lookup_invokers/lookup_rowwise_adagrad.py' 2025-05-07T20:11:13.0346611Z INFO:wheel:adding 'fbgemm_gpu/split_embedding_codegen_lookup_invokers/lookup_rowwise_adagrad_ssd.py' 2025-05-07T20:11:13.0349851Z INFO:wheel:adding 'fbgemm_gpu/split_embedding_codegen_lookup_invokers/lookup_rowwise_adagrad_with_counter.py' 2025-05-07T20:11:13.0352558Z INFO:wheel:adding 'fbgemm_gpu/split_embedding_codegen_lookup_invokers/lookup_sgd.py' 2025-05-07T20:11:13.0354373Z INFO:wheel:adding 'fbgemm_gpu/split_embedding_optimizer_codegen/optimizer_args.py' 2025-05-07T20:11:13.0356735Z INFO:wheel:adding 'fbgemm_gpu/split_embedding_optimizer_codegen/split_embedding_optimizer_rowwise_adagrad.py' 2025-05-07T20:11:13.0358086Z INFO:wheel:adding 'fbgemm_gpu/tbe/__init__.py' 2025-05-07T20:11:13.0359942Z INFO:wheel:adding 'fbgemm_gpu/tbe/bench/__init__.py' 2025-05-07T20:11:13.0362252Z INFO:wheel:adding 'fbgemm_gpu/tbe/bench/bench_config.py' 2025-05-07T20:11:13.0366738Z INFO:wheel:adding 'fbgemm_gpu/tbe/bench/bench_runs.py' 2025-05-07T20:11:13.0369052Z INFO:wheel:adding 'fbgemm_gpu/tbe/bench/eeg_cli.py' 2025-05-07T20:11:13.0371363Z INFO:wheel:adding 'fbgemm_gpu/tbe/bench/embedding_ops_common_config.py' 2025-05-07T20:11:13.0373096Z INFO:wheel:adding 'fbgemm_gpu/tbe/bench/eval_compression.py' 2025-05-07T20:11:13.0374483Z INFO:wheel:adding 'fbgemm_gpu/tbe/bench/reporter.py' 2025-05-07T20:11:13.0377469Z INFO:wheel:adding 'fbgemm_gpu/tbe/bench/tbe_data_config.py' 2025-05-07T20:11:13.0380072Z INFO:wheel:adding 'fbgemm_gpu/tbe/bench/tbe_data_config_loader.py' 2025-05-07T20:11:13.0382346Z INFO:wheel:adding 'fbgemm_gpu/tbe/bench/tbe_data_config_param_models.py' 2025-05-07T20:11:13.0383889Z INFO:wheel:adding 'fbgemm_gpu/tbe/bench/utils.py' 2025-05-07T20:11:13.0385388Z INFO:wheel:adding 'fbgemm_gpu/tbe/cache/__init__.py' 2025-05-07T20:11:13.0386872Z INFO:wheel:adding 'fbgemm_gpu/tbe/cache/split_embeddings_cache_ops.py' 2025-05-07T20:11:13.0388225Z INFO:wheel:adding 'fbgemm_gpu/tbe/ssd/__init__.py' 2025-05-07T20:11:13.0389436Z INFO:wheel:adding 'fbgemm_gpu/tbe/ssd/common.py' 2025-05-07T20:11:13.0395097Z INFO:wheel:adding 'fbgemm_gpu/tbe/ssd/inference.py' 2025-05-07T20:11:13.0420282Z INFO:wheel:adding 'fbgemm_gpu/tbe/ssd/training.py' 2025-05-07T20:11:13.0423093Z INFO:wheel:adding 'fbgemm_gpu/tbe/ssd/utils/__init__.py' 2025-05-07T20:11:13.0425767Z INFO:wheel:adding 'fbgemm_gpu/tbe/ssd/utils/partially_materialized_tensor.py' 2025-05-07T20:11:13.0427234Z INFO:wheel:adding 'fbgemm_gpu/tbe/stats/__init__.py' 2025-05-07T20:11:13.0430094Z INFO:wheel:adding 'fbgemm_gpu/tbe/stats/bench_params_reporter.py' 2025-05-07T20:11:13.0431824Z INFO:wheel:adding 'fbgemm_gpu/tbe/utils/__init__.py' 2025-05-07T20:11:13.0433096Z INFO:wheel:adding 'fbgemm_gpu/tbe/utils/common.py' 2025-05-07T20:11:13.0434613Z INFO:wheel:adding 'fbgemm_gpu/tbe/utils/offsets.py' 2025-05-07T20:11:13.0436963Z INFO:wheel:adding 'fbgemm_gpu/tbe/utils/quantize.py' 2025-05-07T20:11:13.0442350Z INFO:wheel:adding 'fbgemm_gpu/tbe/utils/requests.py' 2025-05-07T20:11:13.0444307Z INFO:wheel:adding 'fbgemm_gpu/triton/__init__.py' 2025-05-07T20:11:13.0445864Z INFO:wheel:adding 'fbgemm_gpu/triton/common.py' 2025-05-07T20:11:13.0453396Z INFO:wheel:adding 'fbgemm_gpu/triton/quantize.py' 2025-05-07T20:11:13.0457821Z INFO:wheel:adding 'fbgemm_gpu/triton/quantize_ref.py' 2025-05-07T20:11:13.0459581Z INFO:wheel:adding 'fbgemm_gpu/triton/jagged/__init__.py' 2025-05-07T20:11:13.0467373Z INFO:wheel:adding 'fbgemm_gpu/triton/jagged/triton_jagged_tensor_ops.py' 2025-05-07T20:11:13.0469425Z INFO:wheel:adding 'fbgemm_gpu/utils/__init__.py' 2025-05-07T20:11:13.0471506Z INFO:wheel:adding 'fbgemm_gpu/utils/filestore.py' 2025-05-07T20:11:13.0472948Z INFO:wheel:adding 'fbgemm_gpu/utils/loader.py' 2025-05-07T20:11:13.0474933Z INFO:wheel:adding 'fbgemm_gpu/utils/torch_library.py' 2025-05-07T20:11:13.0477302Z INFO:wheel:adding 'fbgemm_gpu_nightly-2025.5.7.dist-info/METADATA' 2025-05-07T20:11:13.0478282Z INFO:wheel:adding 'fbgemm_gpu_nightly-2025.5.7.dist-info/WHEEL' 2025-05-07T20:11:13.0479093Z INFO:wheel:adding 'fbgemm_gpu_nightly-2025.5.7.dist-info/top_level.txt' 2025-05-07T20:11:13.0485363Z INFO:wheel:adding 'fbgemm_gpu_nightly-2025.5.7.dist-info/RECORD' 2025-05-07T20:11:13.0488212Z INFO:root:removing _skbuild/linux-x86_64-3.13/setuptools/bdist.linux-x86_64/wheel 2025-05-07T20:11:13.2070679Z ╒════════════════════════════╤════════════════════════════════════════════════╕ 2025-05-07T20:11:13.2071267Z │ │ Version │ 2025-05-07T20:11:13.2071837Z ╞════════════════════════════╪════════════════════════════════════════════════╡ 2025-05-07T20:11:13.2072354Z │ PyTorch │ 2.8.0.dev20250507+cu126 │ 2025-05-07T20:11:13.2072928Z ├────────────────────────────┼────────────────────────────────────────────────┤ 2025-05-07T20:11:13.2073486Z │ CUDA (Declared by PyTorch) │ 12.6 │ 2025-05-07T20:11:13.2074251Z ├────────────────────────────┼────────────────────────────────────────────────┤ 2025-05-07T20:11:13.2074819Z │ CUDA (Actual) │ nvcc: NVIDIA (R) Cuda compiler driver │ 2025-05-07T20:11:13.2075587Z │ │ Copyright (c) 2005-2024 NVIDIA Corporation │ 2025-05-07T20:11:13.2076113Z │ │ Built on Tue_Oct_29_23:50:19_PDT_2024 │ 2025-05-07T20:11:13.2076629Z │ │ Cuda compilation tools, release 12.6, V12.6.85 │ 2025-05-07T20:11:13.2077122Z │ │ Build cuda_12.6.r12.6/compiler.35059454_0 │ 2025-05-07T20:11:13.2077687Z ╘════════════════════════════╧════════════════════════════════════════════════╛ 2025-05-07T20:11:13.4679170Z Successfully built fbgemm_gpu_nightly-2025.5.7-cp313-cp313-manylinux_2_28_x86_64.whl 2025-05-07T20:11:13.5436704Z 2025-05-07T20:11:13.5586933Z ################################################################################ 2025-05-07T20:11:13.5587706Z [CHECK] BUILT LIBRARY: ./_skbuild/linux-x86_64-3.13/cmake-build/asmjit.so 2025-05-07T20:11:13.5588166Z [CHECK] Listing out library size: 2025-05-07T20:11:13.5588647Z + du -h --block-size=1M ./_skbuild/linux-x86_64-3.13/cmake-build/asmjit.so 2025-05-07T20:11:13.5588986Z 2025-05-07T20:11:13.5606083Z 1 ./_skbuild/linux-x86_64-3.13/cmake-build/asmjit.so 2025-05-07T20:11:13.5607451Z 2025-05-07T20:11:13.5607843Z [CHECK] Listing out the GLIBC versions referenced by: ./_skbuild/linux-x86_64-3.13/cmake-build/asmjit.so 2025-05-07T20:11:13.5608746Z + objdump -TC ./_skbuild/linux-x86_64-3.13/cmake-build/asmjit.so | grep GLIBC_ | sed 's/.*GLIBC_\([.0-9]*\).*/GLIBC_\1/g' | sort -Vu | cat 2025-05-07T20:11:13.5609318Z 2025-05-07T20:11:13.5689835Z GLIBC_2.2.5 2025-05-07T20:11:13.5690342Z GLIBC_2.14 2025-05-07T20:11:13.5692157Z 2025-05-07T20:11:13.5692170Z 2025-05-07T20:11:13.5692544Z [CHECK] Listing out the GLIBCXX versions referenced by: ./_skbuild/linux-x86_64-3.13/cmake-build/asmjit.so 2025-05-07T20:11:13.5697227Z + objdump -TC ./_skbuild/linux-x86_64-3.13/cmake-build/asmjit.so | grep GLIBCXX_ | sed 's/.*GLIBCXX_\([.0-9]*\).*/GLIBCXX_\1/g' | sort -Vu | cat 2025-05-07T20:11:13.5697808Z 2025-05-07T20:11:13.5762301Z 2025-05-07T20:11:13.5762391Z 2025-05-07T20:11:13.5782781Z + nm -gDC ./_skbuild/linux-x86_64-3.13/cmake-build/asmjit.so > /tmp/tmp.P22urRSV1b.symbols.txt 2025-05-07T20:11:13.5784033Z 2025-05-07T20:11:13.5813875Z 2025-05-07T20:11:13.5847026Z [CHECK] Total Number of symbols: 803 2025-05-07T20:11:13.5862529Z [CHECK] Number of fbgemm symbols: 0 2025-05-07T20:11:13.5883726Z + nm -gDCu ./_skbuild/linux-x86_64-3.13/cmake-build/asmjit.so > /tmp/tmp.05ZaA89j93.usymbols.txt 2025-05-07T20:11:13.5885001Z 2025-05-07T20:11:13.5899395Z 2025-05-07T20:11:13.5931379Z [CHECK] Listing out undefined symbols (49 total): 2025-05-07T20:11:13.5947536Z U _Unwind_Resume@GCC_3.0 2025-05-07T20:11:13.5948339Z U __cxa_guard_acquire@CXXABI_1.3 2025-05-07T20:11:13.5948798Z U __cxa_guard_release@CXXABI_1.3 2025-05-07T20:11:13.5949171Z U __errno_location@GLIBC_2.2.5 2025-05-07T20:11:13.5949520Z U __gxx_personality_v0@CXXABI_1.3 2025-05-07T20:11:13.5949880Z U __popcountdi2@GCC_3.4 2025-05-07T20:11:13.5950194Z U abort@GLIBC_2.2.5 2025-05-07T20:11:13.5950510Z U close@GLIBC_2.2.5 2025-05-07T20:11:13.5950856Z U fputs@GLIBC_2.2.5 2025-05-07T20:11:13.5951156Z U free@GLIBC_2.2.5 2025-05-07T20:11:13.5951478Z U ftruncate64@GLIBC_2.2.5 2025-05-07T20:11:13.5951791Z U fwrite@GLIBC_2.2.5 2025-05-07T20:11:13.5952111Z U getenv@GLIBC_2.2.5 2025-05-07T20:11:13.5952420Z U getpagesize@GLIBC_2.2.5 2025-05-07T20:11:13.5952765Z U madvise@GLIBC_2.2.5 2025-05-07T20:11:13.5953069Z U malloc@GLIBC_2.2.5 2025-05-07T20:11:13.5953399Z U memcmp@GLIBC_2.2.5 2025-05-07T20:11:13.5953698Z U memcpy@GLIBC_2.14 2025-05-07T20:11:13.5954177Z U memmove@GLIBC_2.2.5 2025-05-07T20:11:13.5954595Z U memset@GLIBC_2.2.5 2025-05-07T20:11:13.5954892Z U mmap@GLIBC_2.2.5 2025-05-07T20:11:13.5955278Z U mprotect@GLIBC_2.2.5 2025-05-07T20:11:13.5955592Z U munmap@GLIBC_2.2.5 2025-05-07T20:11:13.5955920Z U open64@GLIBC_2.2.5 2025-05-07T20:11:13.5956272Z U operator delete(void*, unsigned long)@CXXABI_1.3.9 2025-05-07T20:11:13.5956709Z U pthread_mutex_destroy@GLIBC_2.2.5 2025-05-07T20:11:13.5957171Z U pthread_mutex_lock@GLIBC_2.2.5 2025-05-07T20:11:13.5957705Z U pthread_mutex_unlock@GLIBC_2.2.5 2025-05-07T20:11:13.5958064Z U read@GLIBC_2.2.5 2025-05-07T20:11:13.5958363Z U realloc@GLIBC_2.2.5 2025-05-07T20:11:13.5958685Z U shm_open 2025-05-07T20:11:13.5958952Z U shm_unlink 2025-05-07T20:11:13.5959258Z U snprintf@GLIBC_2.2.5 2025-05-07T20:11:13.5959568Z U stderr@GLIBC_2.2.5 2025-05-07T20:11:13.5959901Z U strcmp@GLIBC_2.2.5 2025-05-07T20:11:13.5960200Z U strlen@GLIBC_2.2.5 2025-05-07T20:11:13.5960526Z U strtol@GLIBC_2.2.5 2025-05-07T20:11:13.5960854Z U syscall@GLIBC_2.2.5 2025-05-07T20:11:13.5961157Z U sysconf@GLIBC_2.2.5 2025-05-07T20:11:13.5961481Z U uname@GLIBC_2.2.5 2025-05-07T20:11:13.5961778Z U unlink@GLIBC_2.2.5 2025-05-07T20:11:13.5962117Z U vsnprintf@GLIBC_2.2.5 2025-05-07T20:11:13.5962517Z U vtable for __cxxabiv1::__class_type_info@CXXABI_1.3 2025-05-07T20:11:13.5962987Z U vtable for __cxxabiv1::__si_class_type_info@CXXABI_1.3 2025-05-07T20:11:13.5963468Z U vtable for __cxxabiv1::__vmi_class_type_info@CXXABI_1.3 2025-05-07T20:11:13.5963915Z w _ITM_deregisterTMCloneTable 2025-05-07T20:11:13.5964286Z w _ITM_registerTMCloneTable 2025-05-07T20:11:13.5964617Z w __cxa_finalize@GLIBC_2.2.5 2025-05-07T20:11:13.5964958Z w __gmon_start__ 2025-05-07T20:11:13.5965300Z [CHECK] Listing out external shared libraries linked: 2025-05-07T20:11:13.5965740Z + ldd ./_skbuild/linux-x86_64-3.13/cmake-build/asmjit.so 2025-05-07T20:11:13.5966012Z 2025-05-07T20:11:13.5992427Z linux-vdso.so.1 (0x00007fffb6f90000) 2025-05-07T20:11:13.5993812Z libtorch_cpu.so => not found 2025-05-07T20:11:13.5994239Z libtorch_cuda.so => not found 2025-05-07T20:11:13.5994588Z libtorch.so => not found 2025-05-07T20:11:13.5994946Z libstdc++.so.6 => /lib64/libstdc++.so.6 (0x00007f78686e8000) 2025-05-07T20:11:13.5995403Z libgcc_s.so.1 => /lib64/libgcc_s.so.1 (0x00007f78686ba000) 2025-05-07T20:11:13.5995808Z libc.so.6 => /lib64/libc.so.6 (0x00007f78684b0000) 2025-05-07T20:11:13.5996168Z libm.so.6 => /lib64/libm.so.6 (0x00007f78683d5000) 2025-05-07T20:11:13.5996600Z /lib64/ld-linux-x86-64.so.2 (0x00007f78689cb000) 2025-05-07T20:11:13.5996851Z 2025-05-07T20:11:13.5996984Z [CHECK] Displaying ELF information: 2025-05-07T20:11:13.5997408Z + readelf -d ./_skbuild/linux-x86_64-3.13/cmake-build/asmjit.so 2025-05-07T20:11:13.5997694Z 2025-05-07T20:11:13.6030874Z 2025-05-07T20:11:13.6031198Z Dynamic section at offset 0x78e78 contains 33 entries: 2025-05-07T20:11:13.6031618Z Tag Type Name/Value 2025-05-07T20:11:13.6032067Z 0x0000000000000001 (NEEDED) Shared library: [libtorch_cpu.so] 2025-05-07T20:11:13.6032631Z 0x0000000000000001 (NEEDED) Shared library: [libtorch_cuda.so] 2025-05-07T20:11:13.6033144Z 0x0000000000000001 (NEEDED) Shared library: [libtorch.so] 2025-05-07T20:11:13.6033666Z 0x0000000000000001 (NEEDED) Shared library: [libstdc++.so.6] 2025-05-07T20:11:13.6034173Z 0x0000000000000001 (NEEDED) Shared library: [libgcc_s.so.1] 2025-05-07T20:11:13.6036296Z 0x0000000000000001 (NEEDED) Shared library: [libc.so.6] 2025-05-07T20:11:13.6036825Z 0x000000000000000e (SONAME) Library soname: [asmjit.so] 2025-05-07T20:11:13.6037344Z 0x000000000000000c (INIT) 0x1a000 2025-05-07T20:11:13.6037753Z 0x000000000000000d (FINI) 0x5af2c 2025-05-07T20:11:13.6038080Z 0x0000000000000019 (INIT_ARRAY) 0x780a0 2025-05-07T20:11:13.6038432Z 0x000000000000001b (INIT_ARRAYSZ) 8 (bytes) 2025-05-07T20:11:13.6038772Z 0x000000000000001a (FINI_ARRAY) 0x780a8 2025-05-07T20:11:13.6039127Z 0x000000000000001c (FINI_ARRAYSZ) 8 (bytes) 2025-05-07T20:11:13.6039460Z 0x0000000000000004 (HASH) 0x200 2025-05-07T20:11:13.6039804Z 0x000000006ffffef5 (GNU_HASH) 0x1e18 2025-05-07T20:11:13.6040126Z 0x0000000000000005 (STRTAB) 0x86e0 2025-05-07T20:11:13.6040463Z 0x0000000000000006 (SYMTAB) 0x3b80 2025-05-07T20:11:13.6040826Z 0x000000000000000a (STRSZ) 45342 (bytes) 2025-05-07T20:11:13.6041178Z 0x000000000000000b (SYMENT) 24 (bytes) 2025-05-07T20:11:13.6041531Z 0x0000000000000003 (PLTGOT) 0x790d8 2025-05-07T20:11:13.6041881Z 0x0000000000000002 (PLTRELSZ) 8064 (bytes) 2025-05-07T20:11:13.6042234Z 0x0000000000000014 (PLTREL) RELA 2025-05-07T20:11:13.6042549Z 0x0000000000000017 (JMPREL) 0x17220 2025-05-07T20:11:13.6042883Z 0x0000000000000007 (RELA) 0x13ed8 2025-05-07T20:11:13.6043240Z 0x0000000000000008 (RELASZ) 13128 (bytes) 2025-05-07T20:11:13.6043589Z 0x0000000000000009 (RELAENT) 24 (bytes) 2025-05-07T20:11:13.6043925Z 0x0000000000000018 (BIND_NOW) 2025-05-07T20:11:13.6044242Z 0x000000006ffffffb (FLAGS_1) Flags: NOW 2025-05-07T20:11:13.6044601Z 0x000000006ffffffe (VERNEED) 0x13e48 2025-05-07T20:11:13.6045033Z 0x000000006fffffff (VERNEEDNUM) 3 2025-05-07T20:11:13.6045411Z 0x000000006ffffff0 (VERSYM) 0x137fe 2025-05-07T20:11:13.6045735Z 0x000000006ffffff9 (RELACOUNT) 3 2025-05-07T20:11:13.6046212Z 0x0000000000000000 (NULL) 0x0 2025-05-07T20:11:13.6046427Z 2025-05-07T20:11:13.6046562Z ################################################################################ 2025-05-07T20:11:13.6046789Z 2025-05-07T20:11:13.6046793Z 2025-05-07T20:11:13.6046904Z ################################################################################ 2025-05-07T20:11:13.6047386Z [CHECK] BUILT LIBRARY: ./_skbuild/linux-x86_64-3.13/cmake-build/fbgemm_gpu_config.so 2025-05-07T20:11:13.6047851Z [CHECK] Listing out library size: 2025-05-07T20:11:13.6048356Z + du -h --block-size=1M ./_skbuild/linux-x86_64-3.13/cmake-build/fbgemm_gpu_config.so 2025-05-07T20:11:13.6048707Z 2025-05-07T20:11:13.6048988Z 1 ./_skbuild/linux-x86_64-3.13/cmake-build/fbgemm_gpu_config.so 2025-05-07T20:11:13.6049278Z 2025-05-07T20:11:13.6051352Z [CHECK] Listing out the GLIBC versions referenced by: ./_skbuild/linux-x86_64-3.13/cmake-build/fbgemm_gpu_config.so 2025-05-07T20:11:13.6054244Z + objdump -TC ./_skbuild/linux-x86_64-3.13/cmake-build/fbgemm_gpu_config.so | grep GLIBC_ | sed 's/.*GLIBC_\([.0-9]*\).*/GLIBC_\1/g' | sort -Vu | cat 2025-05-07T20:11:13.6055988Z 2025-05-07T20:11:13.6098536Z GLIBC_2.2.5 2025-05-07T20:11:13.6099171Z GLIBC_2.14 2025-05-07T20:11:13.6099989Z 2025-05-07T20:11:13.6100002Z 2025-05-07T20:11:13.6101742Z [CHECK] Listing out the GLIBCXX versions referenced by: ./_skbuild/linux-x86_64-3.13/cmake-build/fbgemm_gpu_config.so 2025-05-07T20:11:13.6104426Z + objdump -TC ./_skbuild/linux-x86_64-3.13/cmake-build/fbgemm_gpu_config.so | grep GLIBCXX_ | sed 's/.*GLIBCXX_\([.0-9]*\).*/GLIBCXX_\1/g' | sort -Vu | cat 2025-05-07T20:11:13.6105036Z 2025-05-07T20:11:13.6157706Z GLIBCXX_3.4 2025-05-07T20:11:13.6158340Z GLIBCXX_3.4.9 2025-05-07T20:11:13.6158618Z GLIBCXX_3.4.21 2025-05-07T20:11:13.6158765Z 2025-05-07T20:11:13.6158770Z 2025-05-07T20:11:13.6178030Z + nm -gDC ./_skbuild/linux-x86_64-3.13/cmake-build/fbgemm_gpu_config.so > /tmp/tmp.fhrJVB4nZU.symbols.txt 2025-05-07T20:11:13.6179691Z 2025-05-07T20:11:13.6207717Z 2025-05-07T20:11:13.6235828Z [CHECK] Total Number of symbols: 107 2025-05-07T20:11:13.6251466Z [CHECK] Number of fbgemm symbols: 4 2025-05-07T20:11:13.6269620Z + nm -gDCu ./_skbuild/linux-x86_64-3.13/cmake-build/fbgemm_gpu_config.so > /tmp/tmp.JaEZWU3U0v.usymbols.txt 2025-05-07T20:11:13.6270129Z 2025-05-07T20:11:13.6287072Z 2025-05-07T20:11:13.6314299Z [CHECK] Listing out undefined symbols (57 total): 2025-05-07T20:11:13.6330022Z U VTT for std::__cxx11::basic_ostringstream, std::allocator >@GLIBCXX_3.4.21 2025-05-07T20:11:13.6331937Z U _Unwind_Resume@GCC_3.0 2025-05-07T20:11:13.6332824Z U __cxa_atexit@GLIBC_2.2.5 2025-05-07T20:11:13.6333758Z U __cxa_begin_catch@CXXABI_1.3 2025-05-07T20:11:13.6334681Z U __cxa_end_catch@CXXABI_1.3 2025-05-07T20:11:13.6335619Z U __cxa_guard_acquire@CXXABI_1.3 2025-05-07T20:11:13.6336537Z U __cxa_guard_release@CXXABI_1.3 2025-05-07T20:11:13.6337468Z U __cxa_rethrow@CXXABI_1.3 2025-05-07T20:11:13.6338338Z U __errno_location@GLIBC_2.2.5 2025-05-07T20:11:13.6339289Z U __gxx_personality_v0@CXXABI_1.3 2025-05-07T20:11:13.6339812Z U c10::BoolType::get() 2025-05-07T20:11:13.6340290Z U c10::StringType::get() 2025-05-07T20:11:13.6340631Z U c10::UndefinedTensorImpl::_singleton 2025-05-07T20:11:13.6341331Z U c10::detail::infer_schema::make_function_schema(c10::ArrayRef, c10::ArrayRef) 2025-05-07T20:11:13.6342493Z U c10::detail::torchInternalAssertFail(char const*, char const*, unsigned int, char const*, std::__cxx11::basic_string, std::allocator > const&) 2025-05-07T20:11:13.6343400Z U getenv@GLIBC_2.2.5 2025-05-07T20:11:13.6343686Z U memcmp@GLIBC_2.2.5 2025-05-07T20:11:13.6343990Z U memcpy@GLIBC_2.14 2025-05-07T20:11:13.6344270Z U memmove@GLIBC_2.2.5 2025-05-07T20:11:13.6344575Z U memset@GLIBC_2.2.5 2025-05-07T20:11:13.6344901Z U operator delete(void*, unsigned long)@CXXABI_1.3.9 2025-05-07T20:11:13.6345302Z U operator new(unsigned long)@GLIBCXX_3.4 2025-05-07T20:11:13.6345694Z U std::_Rb_tree_decrement(std::_Rb_tree_node_base*)@GLIBCXX_3.4 2025-05-07T20:11:13.6346352Z U std::_Rb_tree_insert_and_rebalance(bool, std::_Rb_tree_node_base*, std::_Rb_tree_node_base*, std::_Rb_tree_node_base&)@GLIBCXX_3.4 2025-05-07T20:11:13.6347202Z U std::__cxx11::basic_ostringstream, std::allocator >::~basic_ostringstream()@GLIBCXX_3.4.21 2025-05-07T20:11:13.6347788Z U std::__throw_bad_alloc()@GLIBCXX_3.4 2025-05-07T20:11:13.6348184Z U std::__throw_invalid_argument(char const*)@GLIBCXX_3.4 2025-05-07T20:11:13.6348611Z U std::__throw_length_error(char const*)@GLIBCXX_3.4 2025-05-07T20:11:13.6348994Z U std::__throw_logic_error(char const*)@GLIBCXX_3.4 2025-05-07T20:11:13.6349395Z U std::__throw_out_of_range(char const*)@GLIBCXX_3.4 2025-05-07T20:11:13.6349964Z U std::basic_ios >::init(std::basic_streambuf >*)@GLIBCXX_3.4 2025-05-07T20:11:13.6350950Z U std::basic_ostream >& std::__ostream_insert >(std::basic_ostream >&, char const*, long)@GLIBCXX_3.4.9 2025-05-07T20:11:13.6351725Z U std::ios_base::Init::Init()@GLIBCXX_3.4 2025-05-07T20:11:13.6352065Z U std::ios_base::Init::~Init()@GLIBCXX_3.4 2025-05-07T20:11:13.6352430Z U std::ios_base::ios_base()@GLIBCXX_3.4 2025-05-07T20:11:13.6352812Z U std::ios_base::~ios_base()@GLIBCXX_3.4 2025-05-07T20:11:13.6353215Z U std::locale::locale()@GLIBCXX_3.4 2025-05-07T20:11:13.6353609Z U std::locale::~locale()@GLIBCXX_3.4 2025-05-07T20:11:13.6353918Z U strlen@GLIBC_2.2.5 2025-05-07T20:11:13.6354219Z U strtol@GLIBC_2.2.5 2025-05-07T20:11:13.6354525Z U torch::CppFunction::~CppFunction() 2025-05-07T20:11:13.6355315Z U torch::Library::Library(torch::Library::Kind, std::__cxx11::basic_string, std::allocator >, std::optional, char const*, unsigned int) 2025-05-07T20:11:13.6356469Z U torch::Library::_def(std::variant&&, torch::CppFunction&&, std::vector > const&) & 2025-05-07T20:11:13.6357411Z U torch::jit::parseSchemaOrName(std::__cxx11::basic_string, std::allocator > const&, bool) 2025-05-07T20:11:13.6358049Z U typeinfo for std::invalid_argument@GLIBCXX_3.4 2025-05-07T20:11:13.6358437Z U vtable for __cxxabiv1::__class_type_info@CXXABI_1.3 2025-05-07T20:11:13.6358868Z U vtable for __cxxabiv1::__function_type_info@CXXABI_1.3 2025-05-07T20:11:13.6359301Z U vtable for __cxxabiv1::__si_class_type_info@CXXABI_1.3 2025-05-07T20:11:13.6359886Z U vtable for std::__cxx11::basic_ostringstream, std::allocator >@GLIBCXX_3.4.21 2025-05-07T20:11:13.6360660Z U vtable for std::__cxx11::basic_stringbuf, std::allocator >@GLIBCXX_3.4.21 2025-05-07T20:11:13.6361289Z U vtable for std::basic_ios >@GLIBCXX_3.4 2025-05-07T20:11:13.6361799Z U vtable for std::basic_streambuf >@GLIBCXX_3.4 2025-05-07T20:11:13.6362288Z w _ITM_deregisterTMCloneTable 2025-05-07T20:11:13.6362604Z w _ITM_registerTMCloneTable 2025-05-07T20:11:13.6362931Z w __cxa_finalize@GLIBC_2.2.5 2025-05-07T20:11:13.6363219Z w __gmon_start__ 2025-05-07T20:11:13.6363512Z w __pthread_key_create 2025-05-07T20:11:13.6363870Z [CHECK] Listing out external shared libraries linked: 2025-05-07T20:11:13.6364294Z + ldd ./_skbuild/linux-x86_64-3.13/cmake-build/fbgemm_gpu_config.so 2025-05-07T20:11:13.6364577Z 2025-05-07T20:11:13.6372548Z linux-vdso.so.1 (0x00007fff4b9ba000) 2025-05-07T20:11:13.6373412Z libc10.so => not found 2025-05-07T20:11:13.6374146Z libtorch_cpu.so => not found 2025-05-07T20:11:13.6374912Z libtorch_cuda.so => not found 2025-05-07T20:11:13.6375691Z libtorch.so => not found 2025-05-07T20:11:13.6376630Z libstdc++.so.6 => /lib64/libstdc++.so.6 (0x00007fc8b3596000) 2025-05-07T20:11:13.6377867Z libgcc_s.so.1 => /lib64/libgcc_s.so.1 (0x00007fc8b3566000) 2025-05-07T20:11:13.6379015Z libc.so.6 => /lib64/libc.so.6 (0x00007fc8b335e000) 2025-05-07T20:11:13.6379385Z libm.so.6 => /lib64/libm.so.6 (0x00007fc8b3283000) 2025-05-07T20:11:13.6379784Z /lib64/ld-linux-x86-64.so.2 (0x00007fc8b380a000) 2025-05-07T20:11:13.6380036Z 2025-05-07T20:11:13.6380151Z [CHECK] Displaying ELF information: 2025-05-07T20:11:13.6380598Z + readelf -d ./_skbuild/linux-x86_64-3.13/cmake-build/fbgemm_gpu_config.so 2025-05-07T20:11:13.6380928Z 2025-05-07T20:11:13.6406840Z 2025-05-07T20:11:13.6407568Z Dynamic section at offset 0xab00 contains 34 entries: 2025-05-07T20:11:13.6408700Z Tag Type Name/Value 2025-05-07T20:11:13.6409987Z 0x0000000000000001 (NEEDED) Shared library: [libc10.so] 2025-05-07T20:11:13.6411750Z 0x0000000000000001 (NEEDED) Shared library: [libtorch_cpu.so] 2025-05-07T20:11:13.6413283Z 0x0000000000000001 (NEEDED) Shared library: [libtorch_cuda.so] 2025-05-07T20:11:13.6415072Z 0x0000000000000001 (NEEDED) Shared library: [libtorch.so] 2025-05-07T20:11:13.6415641Z 0x0000000000000001 (NEEDED) Shared library: [libstdc++.so.6] 2025-05-07T20:11:13.6416231Z 0x0000000000000001 (NEEDED) Shared library: [libgcc_s.so.1] 2025-05-07T20:11:13.6416843Z 0x0000000000000001 (NEEDED) Shared library: [libc.so.6] 2025-05-07T20:11:13.6417383Z 0x000000000000000e (SONAME) Library soname: [fbgemm_gpu_config.so] 2025-05-07T20:11:13.6417849Z 0x000000000000000c (INIT) 0x4000 2025-05-07T20:11:13.6418181Z 0x000000000000000d (FINI) 0x817c 2025-05-07T20:11:13.6418535Z 0x0000000000000019 (INIT_ARRAY) 0xaa58 2025-05-07T20:11:13.6418872Z 0x000000000000001b (INIT_ARRAYSZ) 16 (bytes) 2025-05-07T20:11:13.6419245Z 0x000000000000001a (FINI_ARRAY) 0xaa68 2025-05-07T20:11:13.6419582Z 0x000000000000001c (FINI_ARRAYSZ) 8 (bytes) 2025-05-07T20:11:13.6419943Z 0x0000000000000004 (HASH) 0x200 2025-05-07T20:11:13.6420270Z 0x000000006ffffef5 (GNU_HASH) 0x700 2025-05-07T20:11:13.6420799Z 0x0000000000000005 (STRTAB) 0x13b0 2025-05-07T20:11:13.6421159Z 0x0000000000000006 (SYMTAB) 0x990 2025-05-07T20:11:13.6421510Z 0x000000000000000a (STRSZ) 6890 (bytes) 2025-05-07T20:11:13.6421899Z 0x000000000000000b (SYMENT) 24 (bytes) 2025-05-07T20:11:13.6422246Z 0x0000000000000003 (PLTGOT) 0xad70 2025-05-07T20:11:13.6422628Z 0x0000000000000002 (PLTRELSZ) 1272 (bytes) 2025-05-07T20:11:13.6422980Z 0x0000000000000014 (PLTREL) RELA 2025-05-07T20:11:13.6423334Z 0x0000000000000017 (JMPREL) 0x34a8 2025-05-07T20:11:13.6423693Z 0x0000000000000007 (RELA) 0x3028 2025-05-07T20:11:13.6424190Z 0x0000000000000008 (RELASZ) 1152 (bytes) 2025-05-07T20:11:13.6424573Z 0x0000000000000009 (RELAENT) 24 (bytes) 2025-05-07T20:11:13.6424948Z 0x0000000000000018 (BIND_NOW) 2025-05-07T20:11:13.6425309Z 0x000000006ffffffb (FLAGS_1) Flags: NOW 2025-05-07T20:11:13.6425668Z 0x000000006ffffffe (VERNEED) 0x2f78 2025-05-07T20:11:13.6426031Z 0x000000006fffffff (VERNEEDNUM) 3 2025-05-07T20:11:13.6426365Z 0x000000006ffffff0 (VERSYM) 0x2e9a 2025-05-07T20:11:13.6426721Z 0x000000006ffffff9 (RELACOUNT) 4 2025-05-07T20:11:13.6427062Z 0x0000000000000000 (NULL) 0x0 2025-05-07T20:11:13.6427270Z 2025-05-07T20:11:13.6427389Z ################################################################################ 2025-05-07T20:11:13.6427623Z 2025-05-07T20:11:13.6427627Z 2025-05-07T20:11:13.6427766Z ################################################################################ 2025-05-07T20:11:13.6428208Z [CHECK] BUILT LIBRARY: ./_skbuild/linux-x86_64-3.13/cmake-build/fbgemm.so 2025-05-07T20:11:13.6428851Z [CHECK] Listing out library size: 2025-05-07T20:11:13.6429262Z + du -h --block-size=1M ./_skbuild/linux-x86_64-3.13/cmake-build/fbgemm.so 2025-05-07T20:11:13.6429685Z 2025-05-07T20:11:13.6429848Z 6 ./_skbuild/linux-x86_64-3.13/cmake-build/fbgemm.so 2025-05-07T20:11:13.6430097Z 2025-05-07T20:11:13.6430463Z [CHECK] Listing out the GLIBC versions referenced by: ./_skbuild/linux-x86_64-3.13/cmake-build/fbgemm.so 2025-05-07T20:11:13.6431373Z + objdump -TC ./_skbuild/linux-x86_64-3.13/cmake-build/fbgemm.so | grep GLIBC_ | sed 's/.*GLIBC_\([.0-9]*\).*/GLIBC_\1/g' | sort -Vu | cat 2025-05-07T20:11:13.6431909Z 2025-05-07T20:11:13.6685833Z GLIBC_2.2.5 2025-05-07T20:11:13.6686168Z GLIBC_2.3 2025-05-07T20:11:13.6686484Z GLIBC_2.14 2025-05-07T20:11:13.6686803Z 2025-05-07T20:11:13.6686894Z 2025-05-07T20:11:13.6687254Z [CHECK] Listing out the GLIBCXX versions referenced by: ./_skbuild/linux-x86_64-3.13/cmake-build/fbgemm.so 2025-05-07T20:11:13.6688211Z + objdump -TC ./_skbuild/linux-x86_64-3.13/cmake-build/fbgemm.so | grep GLIBCXX_ | sed 's/.*GLIBCXX_\([.0-9]*\).*/GLIBCXX_\1/g' | sort -Vu | cat 2025-05-07T20:11:13.6688779Z 2025-05-07T20:11:13.6943600Z GLIBCXX_3.4 2025-05-07T20:11:13.6944285Z GLIBCXX_3.4.9 2025-05-07T20:11:13.6945034Z GLIBCXX_3.4.11 2025-05-07T20:11:13.6945611Z GLIBCXX_3.4.14 2025-05-07T20:11:13.6946208Z GLIBCXX_3.4.15 2025-05-07T20:11:13.6946863Z GLIBCXX_3.4.18 2025-05-07T20:11:13.6947460Z GLIBCXX_3.4.21 2025-05-07T20:11:13.6947814Z 2025-05-07T20:11:13.6947839Z 2025-05-07T20:11:13.6966863Z + nm -gDC ./_skbuild/linux-x86_64-3.13/cmake-build/fbgemm.so > /tmp/tmp.6ixvyACon7.symbols.txt 2025-05-07T20:11:13.6967339Z 2025-05-07T20:11:13.7181774Z 2025-05-07T20:11:13.7208952Z [CHECK] Total Number of symbols: 4871 2025-05-07T20:11:13.7227804Z [CHECK] Number of fbgemm symbols: 3365 2025-05-07T20:11:13.7245971Z + nm -gDCu ./_skbuild/linux-x86_64-3.13/cmake-build/fbgemm.so > /tmp/tmp.A6NZiACZr3.usymbols.txt 2025-05-07T20:11:13.7247287Z 2025-05-07T20:11:13.7273177Z 2025-05-07T20:11:13.7311051Z [CHECK] Listing out undefined symbols (135 total): 2025-05-07T20:11:13.7329180Z U _Unwind_Resume@GCC_3.0 2025-05-07T20:11:13.7329661Z U __cxa_allocate_exception@CXXABI_1.3 2025-05-07T20:11:13.7330083Z U __cxa_atexit@GLIBC_2.2.5 2025-05-07T20:11:13.7330578Z U __cxa_begin_catch@CXXABI_1.3 2025-05-07T20:11:13.7330927Z U __cxa_end_catch@CXXABI_1.3 2025-05-07T20:11:13.7331289Z U __cxa_free_exception@CXXABI_1.3 2025-05-07T20:11:13.7331625Z U __cxa_guard_abort@CXXABI_1.3 2025-05-07T20:11:13.7331980Z U __cxa_guard_acquire@CXXABI_1.3 2025-05-07T20:11:13.7332333Z U __cxa_guard_release@CXXABI_1.3 2025-05-07T20:11:13.7332691Z U __cxa_init_primary_exception@CXXABI_1.3.11 2025-05-07T20:11:13.7333070Z U __cxa_rethrow@CXXABI_1.3 2025-05-07T20:11:13.7333392Z U __cxa_thread_atexit@CXXABI_1.3.7 2025-05-07T20:11:13.7333745Z U __cxa_throw@CXXABI_1.3 2025-05-07T20:11:13.7334257Z U __cxa_throw_bad_array_new_length@CXXABI_1.3.8 2025-05-07T20:11:13.7334651Z U __gxx_personality_v0@CXXABI_1.3 2025-05-07T20:11:13.7334993Z U __once_proxy@GLIBCXX_3.4.11 2025-05-07T20:11:13.7335341Z U __tls_get_addr@GLIBC_2.3 2025-05-07T20:11:13.7335676Z U abort@GLIBC_2.2.5 2025-05-07T20:11:13.7336103Z U asmjit::_abi_1_13::BaseAssembler::bind(asmjit::_abi_1_13::Label const&) 2025-05-07T20:11:13.7336612Z U asmjit::_abi_1_13::BaseAssembler::newLabel() 2025-05-07T20:11:13.7337140Z U asmjit::_abi_1_13::BaseEmitter::_emitI(unsigned int, asmjit::_abi_1_13::Operand_ const&) 2025-05-07T20:11:13.7338135Z U asmjit::_abi_1_13::BaseEmitter::_emitI(unsigned int, asmjit::_abi_1_13::Operand_ const&, asmjit::_abi_1_13::Operand_ const&) 2025-05-07T20:11:13.7339054Z U asmjit::_abi_1_13::BaseEmitter::_emitI(unsigned int, asmjit::_abi_1_13::Operand_ const&, asmjit::_abi_1_13::Operand_ const&, asmjit::_abi_1_13::Operand_ const&) 2025-05-07T20:11:13.7340159Z U asmjit::_abi_1_13::BaseEmitter::_emitI(unsigned int, asmjit::_abi_1_13::Operand_ const&, asmjit::_abi_1_13::Operand_ const&, asmjit::_abi_1_13::Operand_ const&, asmjit::_abi_1_13::Operand_ const&) 2025-05-07T20:11:13.7341241Z U asmjit::_abi_1_13::BaseEmitter::emitArgsAssignment(asmjit::_abi_1_13::FuncFrame const&, asmjit::_abi_1_13::FuncArgsAssignment const&) 2025-05-07T20:11:13.7342014Z U asmjit::_abi_1_13::BaseEmitter::emitEpilog(asmjit::_abi_1_13::FuncFrame const&) 2025-05-07T20:11:13.7342571Z U asmjit::_abi_1_13::BaseEmitter::emitProlog(asmjit::_abi_1_13::FuncFrame const&) 2025-05-07T20:11:13.7343173Z U asmjit::_abi_1_13::CodeHolder::CodeHolder(asmjit::_abi_1_13::Support::Temporary const*) 2025-05-07T20:11:13.7343802Z U asmjit::_abi_1_13::CodeHolder::init(asmjit::_abi_1_13::Environment const&, unsigned long) 2025-05-07T20:11:13.7344292Z U asmjit::_abi_1_13::CodeHolder::~CodeHolder() 2025-05-07T20:11:13.7344905Z U asmjit::_abi_1_13::FuncArgsAssignment::updateFuncFrame(asmjit::_abi_1_13::FuncFrame&) const 2025-05-07T20:11:13.7345715Z U asmjit::_abi_1_13::FuncDetail::init(asmjit::_abi_1_13::FuncSignature const&, asmjit::_abi_1_13::Environment const&) 2025-05-07T20:11:13.7346311Z U asmjit::_abi_1_13::FuncFrame::finalize() 2025-05-07T20:11:13.7346763Z U asmjit::_abi_1_13::FuncFrame::init(asmjit::_abi_1_13::FuncDetail const&) 2025-05-07T20:11:13.7347360Z U asmjit::_abi_1_13::JitRuntime::JitRuntime(asmjit::_abi_1_13::JitAllocator::CreateParams const*) 2025-05-07T20:11:13.7347987Z U asmjit::_abi_1_13::JitRuntime::_add(void**, asmjit::_abi_1_13::CodeHolder*) 2025-05-07T20:11:13.7348424Z U asmjit::_abi_1_13::JitRuntime::~JitRuntime() 2025-05-07T20:11:13.7348884Z U asmjit::_abi_1_13::x86::Assembler::Assembler(asmjit::_abi_1_13::CodeHolder*) 2025-05-07T20:11:13.7349388Z U asmjit::_abi_1_13::x86::Assembler::~Assembler() 2025-05-07T20:11:13.7349748Z U cpuinfo_get_packages 2025-05-07T20:11:13.7350053Z U cpuinfo_get_packages_count 2025-05-07T20:11:13.7350388Z U cpuinfo_initialize 2025-05-07T20:11:13.7350671Z U cpuinfo_isa 2025-05-07T20:11:13.7350963Z U fma@GLIBC_2.2.5 2025-05-07T20:11:13.7351239Z U fmaf@GLIBC_2.2.5 2025-05-07T20:11:13.7351535Z U fminf@GLIBC_2.2.5 2025-05-07T20:11:13.7351809Z U free@GLIBC_2.2.5 2025-05-07T20:11:13.7352104Z U fwrite@GLIBC_2.2.5 2025-05-07T20:11:13.7352379Z U getenv@GLIBC_2.2.5 2025-05-07T20:11:13.7352676Z U log2@GLIBC_2.2.5 2025-05-07T20:11:13.7352966Z U log2f@GLIBC_2.2.5 2025-05-07T20:11:13.7353242Z U lrintf@GLIBC_2.2.5 2025-05-07T20:11:13.7353576Z U memcmp@GLIBC_2.2.5 2025-05-07T20:11:13.7353853Z U memcpy@GLIBC_2.14 2025-05-07T20:11:13.7354153Z U memmove@GLIBC_2.2.5 2025-05-07T20:11:13.7354430Z U memset@GLIBC_2.2.5 2025-05-07T20:11:13.7354734Z U nearbyint@GLIBC_2.2.5 2025-05-07T20:11:13.7355026Z U nearbyintf@GLIBC_2.2.5 2025-05-07T20:11:13.7355390Z U operator delete(void*, unsigned long)@CXXABI_1.3.9 2025-05-07T20:11:13.7355784Z U operator delete[](void*)@GLIBCXX_3.4 2025-05-07T20:11:13.7356121Z U operator new(unsigned long)@GLIBCXX_3.4 2025-05-07T20:11:13.7356490Z U operator new[](unsigned long)@GLIBCXX_3.4 2025-05-07T20:11:13.7356829Z U posix_memalign@GLIBC_2.2.5 2025-05-07T20:11:13.7357148Z U pow@GLIBC_2.2.5 2025-05-07T20:11:13.7357417Z U sqrtf@GLIBC_2.2.5 2025-05-07T20:11:13.7357824Z U std::_Hash_bytes(void const*, unsigned long, unsigned long)@CXXABI_1.3.5 2025-05-07T20:11:13.7358334Z U std::_Rb_tree_decrement(std::_Rb_tree_node_base*)@GLIBCXX_3.4 2025-05-07T20:11:13.7358777Z U std::_Rb_tree_increment(std::_Rb_tree_node_base*)@GLIBCXX_3.4 2025-05-07T20:11:13.7359434Z U std::_Rb_tree_insert_and_rebalance(bool, std::_Rb_tree_node_base*, std::_Rb_tree_node_base*, std::_Rb_tree_node_base&)@GLIBCXX_3.4 2025-05-07T20:11:13.7360153Z U std::__atomic_futex_unsigned_base::_M_futex_notify_all(unsigned int*)@GLIBCXX_3.4.21 2025-05-07T20:11:13.7361151Z U std::__atomic_futex_unsigned_base::_M_futex_wait_until(unsigned int*, unsigned int, bool, std::chrono::duration >, std::chrono::duration >)@GLIBCXX_3.4.21 2025-05-07T20:11:13.7362257Z U std::__detail::_Prime_rehash_policy::_M_need_rehash(unsigned long, unsigned long, unsigned long) const@GLIBCXX_3.4.18 2025-05-07T20:11:13.7362984Z U std::__detail::_Prime_rehash_policy::_M_next_bkt(unsigned long) const@GLIBCXX_3.4.18 2025-05-07T20:11:13.7363522Z U std::__exception_ptr::exception_ptr::_M_addref() 2025-05-07T20:11:13.7363956Z U std::__exception_ptr::exception_ptr::_M_release() 2025-05-07T20:11:13.7364410Z U std::__exception_ptr::exception_ptr::exception_ptr(void*)@CXXABI_1.3.11 2025-05-07T20:11:13.7364927Z U std::__future_base::_Result_base::_Result_base()@GLIBCXX_3.4.15 2025-05-07T20:11:13.7365378Z U std::__future_base::_Result_base::~_Result_base()@GLIBCXX_3.4.15 2025-05-07T20:11:13.7365794Z U std::__once_call@GLIBCXX_3.4.11 2025-05-07T20:11:13.7366147Z U std::__once_callable@GLIBCXX_3.4.11 2025-05-07T20:11:13.7366477Z U std::__throw_bad_alloc()@GLIBCXX_3.4 2025-05-07T20:11:13.7366842Z U std::__throw_bad_array_new_length() 2025-05-07T20:11:13.7367173Z U std::__throw_bad_cast()@GLIBCXX_3.4 2025-05-07T20:11:13.7367548Z U std::__throw_bad_function_call()@GLIBCXX_3.4.14 2025-05-07T20:11:13.7367925Z U std::__throw_future_error(int)@GLIBCXX_3.4.14 2025-05-07T20:11:13.7368331Z U std::__throw_length_error(char const*)@GLIBCXX_3.4 2025-05-07T20:11:13.7368747Z U std::__throw_logic_error(char const*)@GLIBCXX_3.4 2025-05-07T20:11:13.7369128Z U std::__throw_system_error(int)@GLIBCXX_3.4.11 2025-05-07T20:11:13.7369519Z U std::bad_alloc::~bad_alloc()@GLIBCXX_3.4 2025-05-07T20:11:13.7370570Z U std::basic_ostream >& std::__ostream_insert >(std::basic_ostream >&, char const*, long)@GLIBCXX_3.4.9 2025-05-07T20:11:13.7371571Z U std::cerr@GLIBCXX_3.4 2025-05-07T20:11:13.7371924Z U std::cout@GLIBCXX_3.4 2025-05-07T20:11:13.7372390Z U std::ctype::_M_widen_init() const@GLIBCXX_3.4.11 2025-05-07T20:11:13.7372834Z U std::future_category()@GLIBCXX_3.4.15 2025-05-07T20:11:13.7373227Z U std::future_error::~future_error()@GLIBCXX_3.4.14 2025-05-07T20:11:13.7373649Z U std::ios_base::Init::Init()@GLIBCXX_3.4 2025-05-07T20:11:13.7374043Z U std::ios_base::Init::~Init()@GLIBCXX_3.4 2025-05-07T20:11:13.7374697Z U std::logic_error::logic_error(std::__cxx11::basic_string, std::allocator > const&)@GLIBCXX_3.4.21 2025-05-07T20:11:13.7375442Z U std::logic_error::logic_error(std::logic_error const&)@GLIBCXX_3.4.21 2025-05-07T20:11:13.7375973Z U std::ostream& std::ostream::_M_insert(double)@GLIBCXX_3.4.9 2025-05-07T20:11:13.7376502Z U std::ostream& std::ostream::_M_insert(long)@GLIBCXX_3.4.9 2025-05-07T20:11:13.7377081Z U std::ostream& std::ostream::_M_insert(unsigned long)@GLIBCXX_3.4.9 2025-05-07T20:11:13.7377558Z U std::ostream::flush()@GLIBCXX_3.4 2025-05-07T20:11:13.7377949Z U std::ostream::operator<<(int)@GLIBCXX_3.4 2025-05-07T20:11:13.7378312Z U std::ostream::put(char)@GLIBCXX_3.4 2025-05-07T20:11:13.7378792Z U std::rethrow_exception(std::__exception_ptr::exception_ptr)@CXXABI_1.3.3 2025-05-07T20:11:13.7379343Z U std::runtime_error::runtime_error(char const*)@GLIBCXX_3.4.21 2025-05-07T20:11:13.7379784Z U std::runtime_error::~runtime_error()@GLIBCXX_3.4 2025-05-07T20:11:13.7380175Z U std::terminate()@GLIBCXX_3.4 2025-05-07T20:11:13.7380498Z U stderr@GLIBC_2.2.5 2025-05-07T20:11:13.7380822Z U strcmp@GLIBC_2.2.5 2025-05-07T20:11:13.7381115Z U strlen@GLIBC_2.2.5 2025-05-07T20:11:13.7381427Z U strstr@GLIBC_2.2.5 2025-05-07T20:11:13.7381750Z U tolower@GLIBC_2.2.5 2025-05-07T20:11:13.7382079Z U toupper@GLIBC_2.2.5 2025-05-07T20:11:13.7382515Z U typeinfo for std::__future_base::_Result_base@GLIBCXX_3.4.15 2025-05-07T20:11:13.7382998Z U typeinfo for std::bad_alloc@GLIBCXX_3.4 2025-05-07T20:11:13.7383414Z U typeinfo for std::future_error@GLIBCXX_3.4.14 2025-05-07T20:11:13.7383808Z U typeinfo for std::runtime_error@GLIBCXX_3.4 2025-05-07T20:11:13.7384245Z U vtable for __cxxabiv1::__class_type_info@CXXABI_1.3 2025-05-07T20:11:13.7384700Z U vtable for __cxxabiv1::__si_class_type_info@CXXABI_1.3 2025-05-07T20:11:13.7385106Z U vtable for std::bad_alloc@GLIBCXX_3.4 2025-05-07T20:11:13.7385501Z U vtable for std::future_error@GLIBCXX_3.4.14 2025-05-07T20:11:13.7385867Z w _ITM_deregisterTMCloneTable 2025-05-07T20:11:13.7386236Z w _ITM_registerTMCloneTable 2025-05-07T20:11:13.7386571Z w __cxa_finalize@GLIBC_2.2.5 2025-05-07T20:11:13.7386907Z w __gmon_start__ 2025-05-07T20:11:13.7387215Z w __pthread_key_create 2025-05-07T20:11:13.7387542Z w pthread_mutex_lock@GLIBC_2.2.5 2025-05-07T20:11:13.7387911Z w pthread_mutex_unlock@GLIBC_2.2.5 2025-05-07T20:11:13.7388235Z w pthread_once 2025-05-07T20:11:13.7388549Z w pthread_rwlock_rdlock 2025-05-07T20:11:13.7388863Z w pthread_rwlock_unlock 2025-05-07T20:11:13.7389193Z w pthread_rwlock_wrlock 2025-05-07T20:11:13.7389505Z w pthread_self@GLIBC_2.2.5 2025-05-07T20:11:13.7390010Z [CHECK] Listing out external shared libraries linked: 2025-05-07T20:11:13.7390439Z + ldd ./_skbuild/linux-x86_64-3.13/cmake-build/fbgemm.so 2025-05-07T20:11:13.7390693Z 2025-05-07T20:11:13.7390829Z linux-vdso.so.1 (0x00007ffcd9df6000) 2025-05-07T20:11:13.7391433Z libc10.so => not found 2025-05-07T20:11:13.7391927Z asmjit.so => /__w/FBGEMM/FBGEMM/fbgemm_gpu/./_skbuild/linux-x86_64-3.13/cmake-build/asmjit.so (0x00007f9894385000) 2025-05-07T20:11:13.7392491Z libtorch.so => not found 2025-05-07T20:11:13.7392750Z libtorch_cpu.so => not found 2025-05-07T20:11:13.7393048Z libtorch_cuda.so => not found 2025-05-07T20:11:13.7393397Z libstdc++.so.6 => /lib64/libstdc++.so.6 (0x00007f9894121000) 2025-05-07T20:11:13.7393953Z libm.so.6 => /lib64/libm.so.6 (0x00007f9894046000) 2025-05-07T20:11:13.7394361Z libgcc_s.so.1 => /lib64/libgcc_s.so.1 (0x00007f9894957000) 2025-05-07T20:11:13.7394743Z libc.so.6 => /lib64/libc.so.6 (0x00007f9893e3e000) 2025-05-07T20:11:13.7395132Z /lib64/ld-linux-x86-64.so.2 (0x00007f989498b000) 2025-05-07T20:11:13.7395467Z libtorch_cpu.so => not found 2025-05-07T20:11:13.7395770Z libtorch_cuda.so => not found 2025-05-07T20:11:13.7396047Z libtorch.so => not found 2025-05-07T20:11:13.7396237Z 2025-05-07T20:11:13.7396352Z [CHECK] Displaying ELF information: 2025-05-07T20:11:13.7396760Z + readelf -d ./_skbuild/linux-x86_64-3.13/cmake-build/fbgemm.so 2025-05-07T20:11:13.7397040Z 2025-05-07T20:11:13.7424983Z 2025-05-07T20:11:13.7425440Z Dynamic section at offset 0x51fb38 contains 38 entries: 2025-05-07T20:11:13.7425897Z Tag Type Name/Value 2025-05-07T20:11:13.7426328Z 0x0000000000000001 (NEEDED) Shared library: [libc10.so] 2025-05-07T20:11:13.7426868Z 0x0000000000000001 (NEEDED) Shared library: [asmjit.so] 2025-05-07T20:11:13.7427397Z 0x0000000000000001 (NEEDED) Shared library: [libtorch.so] 2025-05-07T20:11:13.7427914Z 0x0000000000000001 (NEEDED) Shared library: [libtorch_cpu.so] 2025-05-07T20:11:13.7428713Z 0x0000000000000001 (NEEDED) Shared library: [libtorch_cuda.so] 2025-05-07T20:11:13.7429251Z 0x0000000000000001 (NEEDED) Shared library: [libstdc++.so.6] 2025-05-07T20:11:13.7429790Z 0x0000000000000001 (NEEDED) Shared library: [libm.so.6] 2025-05-07T20:11:13.7430464Z 0x0000000000000001 (NEEDED) Shared library: [libgcc_s.so.1] 2025-05-07T20:11:13.7431009Z 0x0000000000000001 (NEEDED) Shared library: [libc.so.6] 2025-05-07T20:11:13.7431653Z 0x0000000000000001 (NEEDED) Shared library: [ld-linux-x86-64.so.2] 2025-05-07T20:11:13.7432185Z 0x000000000000000e (SONAME) Library soname: [fbgemm.so] 2025-05-07T20:11:13.7432704Z 0x000000000000000f (RPATH) Library rpath: [$ORIGIN] 2025-05-07T20:11:13.7433148Z 0x000000000000000c (INIT) 0xf6000 2025-05-07T20:11:13.7433499Z 0x000000000000000d (FINI) 0x4c8fb0 2025-05-07T20:11:13.7433883Z 0x0000000000000019 (INIT_ARRAY) 0x51dac0 2025-05-07T20:11:13.7434242Z 0x000000000000001b (INIT_ARRAYSZ) 56 (bytes) 2025-05-07T20:11:13.7434632Z 0x000000000000001a (FINI_ARRAY) 0x51daf8 2025-05-07T20:11:13.7434985Z 0x000000000000001c (FINI_ARRAYSZ) 8 (bytes) 2025-05-07T20:11:13.7435359Z 0x0000000000000004 (HASH) 0x238 2025-05-07T20:11:13.7435699Z 0x000000006ffffef5 (GNU_HASH) 0x6e20 2025-05-07T20:11:13.7436067Z 0x0000000000000005 (STRTAB) 0x2b0a0 2025-05-07T20:11:13.7436428Z 0x0000000000000006 (SYMTAB) 0xe7e0 2025-05-07T20:11:13.7436807Z 0x000000000000000a (STRSZ) 708057 (bytes) 2025-05-07T20:11:13.7437206Z 0x000000000000000b (SYMENT) 24 (bytes) 2025-05-07T20:11:13.7437572Z 0x0000000000000003 (PLTGOT) 0x520dd8 2025-05-07T20:11:13.7437972Z 0x0000000000000002 (PLTRELSZ) 24312 (bytes) 2025-05-07T20:11:13.7438334Z 0x0000000000000014 (PLTREL) RELA 2025-05-07T20:11:13.7438706Z 0x0000000000000017 (JMPREL) 0xef8e0 2025-05-07T20:11:13.7439050Z 0x0000000000000007 (RELA) 0xda610 2025-05-07T20:11:13.7439443Z 0x0000000000000008 (RELASZ) 86736 (bytes) 2025-05-07T20:11:13.7439838Z 0x0000000000000009 (RELAENT) 24 (bytes) 2025-05-07T20:11:13.7440229Z 0x0000000000000018 (BIND_NOW) 2025-05-07T20:11:13.7440604Z 0x000000006ffffffb (FLAGS_1) Flags: NOW 2025-05-07T20:11:13.7440979Z 0x000000006ffffffe (VERNEED) 0xda490 2025-05-07T20:11:13.7441360Z 0x000000006fffffff (VERNEEDNUM) 5 2025-05-07T20:11:13.7441709Z 0x000000006ffffff0 (VERSYM) 0xd7e7a 2025-05-07T20:11:13.7442083Z 0x000000006ffffff9 (RELACOUNT) 9 2025-05-07T20:11:13.7442435Z 0x0000000000000000 (NULL) 0x0 2025-05-07T20:11:13.7442650Z 2025-05-07T20:11:13.7442770Z ################################################################################ 2025-05-07T20:11:13.7443006Z 2025-05-07T20:11:13.7443010Z 2025-05-07T20:11:13.7443162Z ################################################################################ 2025-05-07T20:11:13.7443664Z [CHECK] BUILT LIBRARY: ./_skbuild/linux-x86_64-3.13/cmake-build/fbgemm_gpu_tbe_common.so 2025-05-07T20:11:13.7444184Z [CHECK] Listing out library size: 2025-05-07T20:11:13.7444661Z + du -h --block-size=1M ./_skbuild/linux-x86_64-3.13/cmake-build/fbgemm_gpu_tbe_common.so 2025-05-07T20:11:13.7445061Z 2025-05-07T20:11:13.7445270Z 3 ./_skbuild/linux-x86_64-3.13/cmake-build/fbgemm_gpu_tbe_common.so 2025-05-07T20:11:13.7446032Z 2025-05-07T20:11:13.7446791Z [CHECK] Listing out the GLIBC versions referenced by: ./_skbuild/linux-x86_64-3.13/cmake-build/fbgemm_gpu_tbe_common.so 2025-05-07T20:11:13.7447846Z + objdump -TC ./_skbuild/linux-x86_64-3.13/cmake-build/fbgemm_gpu_tbe_common.so | grep GLIBC_ | sed 's/.*GLIBC_\([.0-9]*\).*/GLIBC_\1/g' | sort -Vu | cat 2025-05-07T20:11:13.7448436Z 2025-05-07T20:11:13.7512067Z GLIBC_2.2.5 2025-05-07T20:11:13.7512808Z GLIBC_2.3 2025-05-07T20:11:13.7513120Z GLIBC_2.14 2025-05-07T20:11:13.7513278Z 2025-05-07T20:11:13.7513283Z 2025-05-07T20:11:13.7514121Z [CHECK] Listing out the GLIBCXX versions referenced by: ./_skbuild/linux-x86_64-3.13/cmake-build/fbgemm_gpu_tbe_common.so 2025-05-07T20:11:13.7515318Z + objdump -TC ./_skbuild/linux-x86_64-3.13/cmake-build/fbgemm_gpu_tbe_common.so | grep GLIBCXX_ | sed 's/.*GLIBCXX_\([.0-9]*\).*/GLIBCXX_\1/g' | sort -Vu | cat 2025-05-07T20:11:13.7515936Z 2025-05-07T20:11:13.7575918Z GLIBCXX_3.4 2025-05-07T20:11:13.7576189Z GLIBCXX_3.4.9 2025-05-07T20:11:13.7576413Z GLIBCXX_3.4.14 2025-05-07T20:11:13.7576809Z GLIBCXX_3.4.20 2025-05-07T20:11:13.7577063Z GLIBCXX_3.4.21 2025-05-07T20:11:13.7577281Z GLIBCXX_3.4.29 2025-05-07T20:11:13.7577699Z 2025-05-07T20:11:13.7577971Z 2025-05-07T20:11:13.7602625Z + nm -gDC ./_skbuild/linux-x86_64-3.13/cmake-build/fbgemm_gpu_tbe_common.so > /tmp/tmp.qGDKvcICtW.symbols.txt 2025-05-07T20:11:13.7603138Z 2025-05-07T20:11:13.7640073Z 2025-05-07T20:11:13.7672897Z [CHECK] Total Number of symbols: 505 2025-05-07T20:11:13.7687837Z [CHECK] Number of fbgemm symbols: 47 2025-05-07T20:11:13.7705383Z + nm -gDCu ./_skbuild/linux-x86_64-3.13/cmake-build/fbgemm_gpu_tbe_common.so > /tmp/tmp.ydZD99QcN0.usymbols.txt 2025-05-07T20:11:13.7706877Z 2025-05-07T20:11:13.7723268Z 2025-05-07T20:11:13.7748321Z [CHECK] Listing out undefined symbols (195 total): 2025-05-07T20:11:13.7764202Z U GOMP_barrier 2025-05-07T20:11:13.7764677Z U GOMP_parallel 2025-05-07T20:11:13.7765284Z U VTT for std::__cxx11::basic_ostringstream, std::allocator >@GLIBCXX_3.4.21 2025-05-07T20:11:13.7765871Z U _Unwind_Resume@GCC_3.0 2025-05-07T20:11:13.7766242Z U __cudaPopCallConfiguration@libcudart.so.12 2025-05-07T20:11:13.7766677Z U __cudaPushCallConfiguration@libcudart.so.12 2025-05-07T20:11:13.7767104Z U __cudaRegisterFatBinary@libcudart.so.12 2025-05-07T20:11:13.7767492Z U __cudaRegisterFatBinaryEnd@libcudart.so.12 2025-05-07T20:11:13.7767899Z U __cudaRegisterFunction@libcudart.so.12 2025-05-07T20:11:13.7768275Z U __cudaRegisterVar@libcudart.so.12 2025-05-07T20:11:13.7768674Z U __cudaUnregisterFatBinary@libcudart.so.12 2025-05-07T20:11:13.7769216Z U __cxa_allocate_exception@CXXABI_1.3 2025-05-07T20:11:13.7769587Z U __cxa_atexit@GLIBC_2.2.5 2025-05-07T20:11:13.7769916Z U __cxa_begin_catch@CXXABI_1.3 2025-05-07T20:11:13.7770387Z U __cxa_end_catch@CXXABI_1.3 2025-05-07T20:11:13.7770746Z U __cxa_free_exception@CXXABI_1.3 2025-05-07T20:11:13.7771101Z U __cxa_guard_abort@CXXABI_1.3 2025-05-07T20:11:13.7771457Z U __cxa_guard_acquire@CXXABI_1.3 2025-05-07T20:11:13.7771787Z U __cxa_guard_release@CXXABI_1.3 2025-05-07T20:11:13.7772133Z U __cxa_rethrow@CXXABI_1.3 2025-05-07T20:11:13.7772442Z U __cxa_throw@CXXABI_1.3 2025-05-07T20:11:13.7772787Z U __gxx_personality_v0@CXXABI_1.3 2025-05-07T20:11:13.7773110Z U __tls_get_addr@GLIBC_2.3 2025-05-07T20:11:13.7773630Z U at::Tensor::index_put_(std::initializer_list, at::Tensor const&) 2025-05-07T20:11:13.7774243Z U at::TensorBase::__dispatch_contiguous(c10::MemoryFormat) const 2025-05-07T20:11:13.7774716Z U at::_ops::copy_::call(at::Tensor&, at::Tensor const&, bool) 2025-05-07T20:11:13.7775669Z U at::_ops::empty_memory_format::call(c10::ArrayRef, std::optional, std::optional, std::optional, std::optional, std::optional) 2025-05-07T20:11:13.7776606Z U at::_ops::is_nonzero::call(at::Tensor const&) 2025-05-07T20:11:13.7777054Z U at::_ops::reshape::call(at::Tensor const&, c10::ArrayRef) 2025-05-07T20:11:13.7777569Z U at::_ops::select_int::call(at::Tensor const&, long, c10::SymInt) 2025-05-07T20:11:13.7778227Z U at::_ops::slice_Tensor::call(at::Tensor const&, long, std::optional, std::optional, c10::SymInt) 2025-05-07T20:11:13.7779440Z U at::_ops::to_dtype_layout::call(at::Tensor const&, std::optional, std::optional, std::optional, std::optional, bool, bool, std::optional) 2025-05-07T20:11:13.7780428Z U at::_ops::zero_::call(at::Tensor&) 2025-05-07T20:11:13.7781234Z U at::_ops::zeros_like::call(at::Tensor const&, std::optional, std::optional, std::optional, std::optional, std::optional) 2025-05-07T20:11:13.7782086Z U at::cuda::getCurrentDeviceProperties() 2025-05-07T20:11:13.7782463Z U at::get_num_threads() 2025-05-07T20:11:13.7782876Z U at::get_thread_num() 2025-05-07T20:11:13.7783189Z U at::in_parallel_region() 2025-05-07T20:11:13.7783484Z U at::init_num_threads() 2025-05-07T20:11:13.7783809Z U at::internal::set_thread_num(int) 2025-05-07T20:11:13.7784161Z U at::toAccumulateType(c10::ScalarType, bool) 2025-05-07T20:11:13.7784518Z U c10::BoolType::get() 2025-05-07T20:11:13.7784886Z U c10::DeviceTypeName[abi:cxx11](c10::DeviceType, bool) 2025-05-07T20:11:13.7785507Z U c10::Error::Error(c10::SourceLocation, std::__cxx11::basic_string, std::allocator >) 2025-05-07T20:11:13.7786091Z U c10::Error::what() const 2025-05-07T20:11:13.7786435Z U c10::Half* at::TensorBase::data_ptr() const 2025-05-07T20:11:13.7786886Z U c10::Half* at::TensorBase::mutable_data_ptr() const 2025-05-07T20:11:13.7787321Z U c10::IValue::reportToTensorTypeError() const 2025-05-07T20:11:13.7787657Z U c10::IntType::get() 2025-05-07T20:11:13.7788017Z U c10::MessageLogger::MessageLogger(char const*, int, int) 2025-05-07T20:11:13.7788381Z U c10::MessageLogger::~MessageLogger() 2025-05-07T20:11:13.7788839Z U c10::OptionalType::get(c10::Type::SingletonOrSharedTypePtr) 2025-05-07T20:11:13.7789284Z U c10::ParallelGuard::ParallelGuard(bool) 2025-05-07T20:11:13.7789614Z U c10::ParallelGuard::~ParallelGuard() 2025-05-07T20:11:13.7789971Z U c10::SymBool::expect_true(char const*, long) const 2025-05-07T20:11:13.7790340Z U c10::SymBool::guard_bool(char const*, long) const 2025-05-07T20:11:13.7790999Z U c10::SymInt::SymInt(c10::intrusive_ptr >) 2025-05-07T20:11:13.7791607Z U c10::SymInt::guard_int(char const*, long) const 2025-05-07T20:11:13.7791979Z U c10::SymInt::operator/(c10::SymInt const&) const 2025-05-07T20:11:13.7792312Z U c10::SymInt::promote_to_negative() 2025-05-07T20:11:13.7792652Z U c10::SymInt::sym_ne(c10::SymInt const&) const 2025-05-07T20:11:13.7792977Z U c10::SymInt::toSymNode() const 2025-05-07T20:11:13.7793312Z U c10::SymIntType::get() 2025-05-07T20:11:13.7793690Z U c10::SymbolicShapeMeta::init_is_contiguous() const 2025-05-07T20:11:13.7794046Z U c10::TensorType::get() 2025-05-07T20:11:13.7794397Z U c10::UndefinedTensorImpl::_singleton 2025-05-07T20:11:13.7795290Z U c10::Warning::Warning(std::variant, c10::SourceLocation const&, std::__cxx11::basic_string, std::allocator >, bool) 2025-05-07T20:11:13.7796232Z U c10::cuda::CUDACachingAllocator::allocator 2025-05-07T20:11:13.7796664Z U c10::cuda::CUDAKernelLaunchRegistry::get_singleton_ref() 2025-05-07T20:11:13.7797201Z U c10::cuda::CUDAKernelLaunchRegistry::get_uvm_assertions_ptr_for_current_device() 2025-05-07T20:11:13.7797938Z U c10::cuda::CUDAKernelLaunchRegistry::insert(char const*, char const*, unsigned int, char const*, int) 2025-05-07T20:11:13.7798504Z U c10::cuda::CUDAStream::stream() const 2025-05-07T20:11:13.7798891Z U c10::cuda::ExchangeDevice(signed char) 2025-05-07T20:11:13.7799254Z U c10::cuda::GetDevice(signed char*) 2025-05-07T20:11:13.7799587Z U c10::cuda::MaybeSetDevice(signed char) 2025-05-07T20:11:13.7799946Z U c10::cuda::SetDevice(signed char) 2025-05-07T20:11:13.7800391Z U c10::cuda::c10_cuda_check_implementation(int, char const*, char const*, int, bool) 2025-05-07T20:11:13.7800830Z U c10::cuda::device_count() 2025-05-07T20:11:13.7801164Z U c10::cuda::getCurrentCUDAStream(signed char) 2025-05-07T20:11:13.7801513Z U c10::cuda::getDefaultCUDAStream(signed char) 2025-05-07T20:11:13.7801885Z U c10::cuda::getStreamFromPool(bool, signed char) 2025-05-07T20:11:13.7802242Z U c10::cuda::getStreamFromPool(int, signed char) 2025-05-07T20:11:13.7802636Z U c10::cuda::setCurrentCUDAStream(c10::cuda::CUDAStream) 2025-05-07T20:11:13.7802987Z U c10::cuda::warn_or_error_on_sync() 2025-05-07T20:11:13.7803673Z U c10::detail::infer_schema::make_function_schema(c10::ArrayRef, c10::ArrayRef) 2025-05-07T20:11:13.7804490Z U c10::detail::torchCheckFail(char const*, char const*, unsigned int, char const*) 2025-05-07T20:11:13.7805349Z U c10::detail::torchCheckFail(char const*, char const*, unsigned int, std::__cxx11::basic_string, std::allocator > const&) 2025-05-07T20:11:13.7806424Z U c10::detail::torchInternalAssertFail(char const*, char const*, unsigned int, char const*, char const*) 2025-05-07T20:11:13.7807655Z U c10::detail::torchInternalAssertFail(char const*, char const*, unsigned int, char const*, std::__cxx11::basic_string, std::allocator > const&) 2025-05-07T20:11:13.7808462Z U c10::impl::GPUTrace::gpuTraceState 2025-05-07T20:11:13.7808815Z U c10::impl::GPUTrace::haveState 2025-05-07T20:11:13.7809177Z U c10::impl::device_guard_impl_registry 2025-05-07T20:11:13.7809534Z U c10::operator-(c10::SymInt const&, int) 2025-05-07T20:11:13.7809916Z U c10::operator<(c10::SymInt const&, int) 2025-05-07T20:11:13.7810386Z U c10::operator<<(std::ostream&, c10::Device const&) 2025-05-07T20:11:13.7810826Z U c10::operator<<(std::ostream&, c10::DeviceType) 2025-05-07T20:11:13.7811317Z U c10::operator<<(std::ostream&, c10::SymInt const&) 2025-05-07T20:11:13.7811730Z U c10::operator>(c10::SymInt const&, int) 2025-05-07T20:11:13.7812124Z U c10::operator>=(c10::SymInt const&, int) 2025-05-07T20:11:13.7812480Z U c10::warn(c10::Warning const&) 2025-05-07T20:11:13.7812927Z U caffe2::TypeMeta::error_unsupported_typemeta(caffe2::TypeMeta) 2025-05-07T20:11:13.7813384Z U cudaDeviceSynchronize@libcudart.so.12 2025-05-07T20:11:13.7813790Z U cudaEventCreateWithFlags@libcudart.so.12 2025-05-07T20:11:13.7814162Z U cudaEventDestroy@libcudart.so.12 2025-05-07T20:11:13.7814736Z U cudaEventElapsedTime@libcudart.so.12 2025-05-07T20:11:13.7815118Z U cudaEventQuery@libcudart.so.12 2025-05-07T20:11:13.7815467Z U cudaEventRecord@libcudart.so.12 2025-05-07T20:11:13.7815829Z U cudaEventSynchronize@libcudart.so.12 2025-05-07T20:11:13.7816179Z U cudaGetErrorString@libcudart.so.12 2025-05-07T20:11:13.7816529Z U cudaGetLastError@libcudart.so.12 2025-05-07T20:11:13.7816866Z U cudaLaunchKernel@libcudart.so.12 2025-05-07T20:11:13.7818725Z U cudaStreamQuery@libcudart.so.12 2025-05-07T20:11:13.7819133Z U cudaStreamSynchronize@libcudart.so.12 2025-05-07T20:11:13.7819516Z U cudaStreamWaitEvent@libcudart.so.12 2025-05-07T20:11:13.7820532Z U fbgemm::EmbeddingSpMDMKernelSignature::Type fbgemm::GenerateEmbeddingSpMDMWithStrides(long, bool, bool, int, bool, bool, long, long, bool, bool, bool, bool) 2025-05-07T20:11:13.7822174Z U fbgemm::EmbeddingSpMDMKernelSignature::Type fbgemm::GenerateEmbeddingSpMDMWithStrides(long, bool, bool, int, bool, bool, long, long, bool, bool, bool, bool) 2025-05-07T20:11:13.7824048Z U fbgemm::EmbeddingSpMDMKernelSignature::Type fbgemm::GenerateEmbeddingSpMDMWithStrides(long, bool, bool, int, bool, bool, long, long, bool, bool, bool, bool) 2025-05-07T20:11:13.7825595Z U fbgemm::EmbeddingSpMDMKernelSignature::Type fbgemm::GenerateEmbeddingSpMDMWithStrides(long, bool, bool, int, bool, bool, long, long, bool, bool, bool, bool) 2025-05-07T20:11:13.7827193Z U fbgemm::EmbeddingSpMDMKernelSignature::Type fbgemm::GenerateEmbeddingSpMDMWithStrides(long, bool, bool, int, bool, bool, long, long, bool, bool, bool, bool) 2025-05-07T20:11:13.7829223Z U fbgemm::EmbeddingSpMDMKernelSignature::Type fbgemm::GenerateEmbeddingSpMDMWithStrides(long, bool, bool, int, bool, bool, long, long, bool, bool, bool, bool) 2025-05-07T20:11:13.7831013Z U fbgemm::EmbeddingSpMDMKernelSignature::Type fbgemm::GenerateEmbeddingSpMDMWithStrides(long, bool, bool, int, bool, bool, long, long, bool, bool, bool, bool) 2025-05-07T20:11:13.7832883Z U fbgemm::EmbeddingSpMDMKernelSignature::Type fbgemm::GenerateEmbeddingSpMDMWithStrides(long, bool, bool, int, bool, bool, long, long, bool, bool, bool, bool) 2025-05-07T20:11:13.7834679Z U fbgemm::EmbeddingSpMDMKernelSignature::Type fbgemm::GenerateEmbeddingSpMDMWithStrides(long, bool, bool, int, bool, bool, long, long, bool, bool, bool, bool) 2025-05-07T20:11:13.7836497Z U fbgemm::EmbeddingSpMDMKernelSignature::Type fbgemm::GenerateEmbeddingSpMDMWithStrides(long, bool, bool, int, bool, bool, long, long, bool, bool, bool, bool) 2025-05-07T20:11:13.7838290Z U fbgemm::EmbeddingSpMDMKernelSignature::Type fbgemm::GenerateEmbeddingSpMDMWithStrides(long, bool, bool, int, bool, bool, long, long, bool, bool, bool, bool) 2025-05-07T20:11:13.7840103Z U fbgemm::EmbeddingSpMDMKernelSignature::Type fbgemm::GenerateEmbeddingSpMDMWithStrides(long, bool, bool, int, bool, bool, long, long, bool, bool, bool, bool) 2025-05-07T20:11:13.7841273Z U fbgemm::fbgemmAlignedAlloc(unsigned long, unsigned long, bool) 2025-05-07T20:11:13.7841798Z U fbgemm::fbgemmAlignedFree(void*) 2025-05-07T20:11:13.7842422Z U fbgemm_gpu::config::is_feature_enabled(fbgemm_gpu::config::FeatureGateName const&) 2025-05-07T20:11:13.7842926Z U float* at::TensorBase::data_ptr() const 2025-05-07T20:11:13.7843313Z U float* at::TensorBase::mutable_data_ptr() const 2025-05-07T20:11:13.7843921Z U int* at::TensorBase::data_ptr() const 2025-05-07T20:11:13.7844297Z U int* at::TensorBase::mutable_data_ptr() const 2025-05-07T20:11:13.7844808Z U long c10::detail::maybe_wrap_dim_slow(long, long, bool) 2025-05-07T20:11:13.7845237Z U long* at::TensorBase::data_ptr() const 2025-05-07T20:11:13.7845629Z U long* at::TensorBase::mutable_data_ptr() const 2025-05-07T20:11:13.7846008Z U memcpy@GLIBC_2.14 2025-05-07T20:11:13.7846291Z U memmove@GLIBC_2.2.5 2025-05-07T20:11:13.7846586Z U memset@GLIBC_2.2.5 2025-05-07T20:11:13.7846862Z U omp_get_max_threads 2025-05-07T20:11:13.7847152Z U omp_get_num_threads 2025-05-07T20:11:13.7847432Z U omp_get_thread_num 2025-05-07T20:11:13.7847782Z U operator delete(void*, unsigned long)@CXXABI_1.3.9 2025-05-07T20:11:13.7848182Z U operator new(unsigned long)@GLIBCXX_3.4 2025-05-07T20:11:13.7848755Z U std::__cxx11::basic_ostringstream, std::allocator >::basic_ostringstream() 2025-05-07T20:11:13.7849605Z U std::__cxx11::basic_ostringstream, std::allocator >::~basic_ostringstream()@GLIBCXX_3.4.21 2025-05-07T20:11:13.7850498Z U std::__cxx11::basic_stringbuf, std::allocator >::str() const &@GLIBCXX_3.4.29 2025-05-07T20:11:13.7851098Z U std::__exception_ptr::exception_ptr::_M_addref() 2025-05-07T20:11:13.7851502Z U std::__exception_ptr::exception_ptr::_M_release() 2025-05-07T20:11:13.7851865Z U std::__throw_bad_alloc()@GLIBCXX_3.4 2025-05-07T20:11:13.7852237Z U std::__throw_bad_function_call()@GLIBCXX_3.4.14 2025-05-07T20:11:13.7852623Z U std::__throw_length_error(char const*)@GLIBCXX_3.4 2025-05-07T20:11:13.7853070Z U std::__throw_logic_error(char const*)@GLIBCXX_3.4 2025-05-07T20:11:13.7853502Z U std::__throw_out_of_range_fmt(char const*, ...)@GLIBCXX_3.4.20 2025-05-07T20:11:13.7854023Z U std::basic_ios >::clear(std::_Ios_Iostate)@GLIBCXX_3.4 2025-05-07T20:11:13.7854731Z U std::basic_ios >::init(std::basic_streambuf >*)@GLIBCXX_3.4 2025-05-07T20:11:13.7855771Z U std::basic_ostream >& std::__ostream_insert >(std::basic_ostream >&, char const*, long)@GLIBCXX_3.4.9 2025-05-07T20:11:13.7856947Z U std::basic_ostream >& std::operator<< >(std::basic_ostream >&, char const*)@GLIBCXX_3.4 2025-05-07T20:11:13.7857692Z U std::current_exception()@CXXABI_1.3.3 2025-05-07T20:11:13.7858049Z U std::exception::~exception()@GLIBCXX_3.4 2025-05-07T20:11:13.7858417Z U std::ios_base::Init::Init()@GLIBCXX_3.4 2025-05-07T20:11:13.7858781Z U std::ios_base::Init::~Init()@GLIBCXX_3.4 2025-05-07T20:11:13.7859124Z U std::ios_base::ios_base()@GLIBCXX_3.4 2025-05-07T20:11:13.7859479Z U std::ios_base::~ios_base()@GLIBCXX_3.4 2025-05-07T20:11:13.7859818Z U std::locale::locale()@GLIBCXX_3.4 2025-05-07T20:11:13.7860167Z U std::locale::~locale()@GLIBCXX_3.4 2025-05-07T20:11:13.7860583Z U std::ostream& std::ostream::_M_insert(long)@GLIBCXX_3.4.9 2025-05-07T20:11:13.7861117Z U std::ostream& std::ostream::_M_insert(unsigned long)@GLIBCXX_3.4.9 2025-05-07T20:11:13.7861605Z U std::ostream::operator<<(int)@GLIBCXX_3.4 2025-05-07T20:11:13.7862169Z U std::pair fbgemm::radix_sort_parallel(int*, int*, int*, int*, long, long, bool) 2025-05-07T20:11:13.7863154Z U std::pair*> fbgemm::radix_sort_parallel >(int*, std::pair*, int*, std::pair*, long, long, bool) 2025-05-07T20:11:13.7864308Z U std::pair*> fbgemm::radix_sort_parallel >(int*, std::pair*, int*, std::pair*, long, long, bool) 2025-05-07T20:11:13.7865135Z U std::rethrow_exception(std::__exception_ptr::exception_ptr)@CXXABI_1.3.3 2025-05-07T20:11:13.7865585Z U strlen@GLIBC_2.2.5 2025-05-07T20:11:13.7865915Z U torch::CppFunction::~CppFunction() 2025-05-07T20:11:13.7866738Z U torch::Library::Library(torch::Library::Kind, std::__cxx11::basic_string, std::allocator >, std::optional, char const*, unsigned int) 2025-05-07T20:11:13.7867911Z U torch::Library::_def(c10::FunctionSchema&&, c10::OperatorName*, std::vector > const&, torch::_RegisterOrVerify) & 2025-05-07T20:11:13.7868764Z U torch::Library::_impl(char const*, torch::CppFunction&&, torch::_RegisterOrVerify) & 2025-05-07T20:11:13.7869484Z U torch::jit::parseSchema(std::__cxx11::basic_string, std::allocator > const&, bool) 2025-05-07T20:11:13.7870065Z U typeinfo for c10::Error 2025-05-07T20:11:13.7870391Z U typeinfo for std::exception@GLIBCXX_3.4 2025-05-07T20:11:13.7870815Z U unsigned char* at::TensorBase::data_ptr() const 2025-05-07T20:11:13.7871273Z U vtable for __cxxabiv1::__class_type_info@CXXABI_1.3 2025-05-07T20:11:13.7871792Z U vtable for __cxxabiv1::__function_type_info@CXXABI_1.3 2025-05-07T20:11:13.7872240Z U vtable for __cxxabiv1::__si_class_type_info@CXXABI_1.3 2025-05-07T20:11:13.7872587Z U vtable for c10::Error 2025-05-07T20:11:13.7873131Z U vtable for std::__cxx11::basic_ostringstream, std::allocator >@GLIBCXX_3.4.21 2025-05-07T20:11:13.7873900Z U vtable for std::__cxx11::basic_stringbuf, std::allocator >@GLIBCXX_3.4.21 2025-05-07T20:11:13.7874519Z U vtable for std::basic_ios >@GLIBCXX_3.4 2025-05-07T20:11:13.7875056Z U vtable for std::basic_streambuf >@GLIBCXX_3.4 2025-05-07T20:11:13.7875494Z w _ITM_deregisterTMCloneTable 2025-05-07T20:11:13.7875852Z w _ITM_registerTMCloneTable 2025-05-07T20:11:13.7876171Z w __cxa_finalize@GLIBC_2.2.5 2025-05-07T20:11:13.7876501Z w __gmon_start__ 2025-05-07T20:11:13.7876804Z w __pthread_key_create 2025-05-07T20:11:13.7877148Z [CHECK] Listing out external shared libraries linked: 2025-05-07T20:11:13.7877630Z + ldd ./_skbuild/linux-x86_64-3.13/cmake-build/fbgemm_gpu_tbe_common.so 2025-05-07T20:11:13.7877943Z 2025-05-07T20:11:13.7878112Z linux-vdso.so.1 (0x00007fff52d79000) 2025-05-07T20:11:13.7878404Z libc10.so => not found 2025-05-07T20:11:13.7878681Z libc10_cuda.so => not found 2025-05-07T20:11:13.7879216Z fbgemm.so => /__w/FBGEMM/FBGEMM/fbgemm_gpu/./_skbuild/linux-x86_64-3.13/cmake-build/fbgemm.so (0x00007f8db3600000) 2025-05-07T20:11:13.7880118Z fbgemm_gpu_config.so => /__w/FBGEMM/FBGEMM/fbgemm_gpu/./_skbuild/linux-x86_64-3.13/cmake-build/fbgemm_gpu_config.so (0x00007f8db407f000) 2025-05-07T20:11:13.7880749Z libtorch.so => not found 2025-05-07T20:11:13.7881036Z libtorch_cpu.so => not found 2025-05-07T20:11:13.7881335Z libtorch_cuda.so => not found 2025-05-07T20:11:13.7881597Z libcudart.so.12 => not found 2025-05-07T20:11:13.7881950Z libstdc++.so.6 => /lib64/libstdc++.so.6 (0x00007f8db339c000) 2025-05-07T20:11:13.7882399Z libgcc_s.so.1 => /lib64/libgcc_s.so.1 (0x00007f8db404f000) 2025-05-07T20:11:13.7882835Z libc.so.6 => /lib64/libc.so.6 (0x00007f8db3194000) 2025-05-07T20:11:13.7883220Z /lib64/ld-linux-x86-64.so.2 (0x00007f8db408f000) 2025-05-07T20:11:13.7883568Z libc10.so => not found 2025-05-07T20:11:13.7884076Z asmjit.so => /__w/FBGEMM/FBGEMM/fbgemm_gpu/./_skbuild/linux-x86_64-3.13/cmake-build/asmjit.so (0x00007f8db3fd2000) 2025-05-07T20:11:13.7884664Z libtorch.so => not found 2025-05-07T20:11:13.7884955Z libtorch_cpu.so => not found 2025-05-07T20:11:13.7885226Z libtorch_cuda.so => not found 2025-05-07T20:11:13.7885555Z libm.so.6 => /lib64/libm.so.6 (0x00007f8db30b9000) 2025-05-07T20:11:13.7885878Z libc10.so => not found 2025-05-07T20:11:13.7886128Z libtorch_cpu.so => not found 2025-05-07T20:11:13.7886388Z libtorch_cuda.so => not found 2025-05-07T20:11:13.7886666Z libtorch.so => not found 2025-05-07T20:11:13.7886922Z libtorch_cpu.so => not found 2025-05-07T20:11:13.7887218Z libtorch_cuda.so => not found 2025-05-07T20:11:13.7887483Z libtorch.so => not found 2025-05-07T20:11:13.7887669Z 2025-05-07T20:11:13.7887784Z [CHECK] Displaying ELF information: 2025-05-07T20:11:13.7888233Z + readelf -d ./_skbuild/linux-x86_64-3.13/cmake-build/fbgemm_gpu_tbe_common.so 2025-05-07T20:11:13.7888571Z 2025-05-07T20:11:13.7888575Z 2025-05-07T20:11:13.7888731Z Dynamic section at offset 0x2c4138 contains 40 entries: 2025-05-07T20:11:13.7889125Z Tag Type Name/Value 2025-05-07T20:11:13.7889535Z 0x0000000000000001 (NEEDED) Shared library: [libc10.so] 2025-05-07T20:11:13.7890047Z 0x0000000000000001 (NEEDED) Shared library: [libc10_cuda.so] 2025-05-07T20:11:13.7890816Z 0x0000000000000001 (NEEDED) Shared library: [fbgemm.so] 2025-05-07T20:11:13.7891401Z 0x0000000000000001 (NEEDED) Shared library: [fbgemm_gpu_config.so] 2025-05-07T20:11:13.7891985Z 0x0000000000000001 (NEEDED) Shared library: [libtorch.so] 2025-05-07T20:11:13.7892500Z 0x0000000000000001 (NEEDED) Shared library: [libtorch_cpu.so] 2025-05-07T20:11:13.7893050Z 0x0000000000000001 (NEEDED) Shared library: [libtorch_cuda.so] 2025-05-07T20:11:13.7893577Z 0x0000000000000001 (NEEDED) Shared library: [libcudart.so.12] 2025-05-07T20:11:13.7894128Z 0x0000000000000001 (NEEDED) Shared library: [libstdc++.so.6] 2025-05-07T20:11:13.7894671Z 0x0000000000000001 (NEEDED) Shared library: [libgcc_s.so.1] 2025-05-07T20:11:13.7895180Z 0x0000000000000001 (NEEDED) Shared library: [libc.so.6] 2025-05-07T20:11:13.7895721Z 0x0000000000000001 (NEEDED) Shared library: [ld-linux-x86-64.so.2] 2025-05-07T20:11:13.7896280Z 0x000000000000000e (SONAME) Library soname: [fbgemm_gpu_tbe_common.so] 2025-05-07T20:11:13.7896830Z 0x000000000000000f (RPATH) Library rpath: [$ORIGIN] 2025-05-07T20:11:13.7897243Z 0x000000000000000c (INIT) 0x13000 2025-05-07T20:11:13.7897605Z 0x000000000000000d (FINI) 0x7422c 2025-05-07T20:11:13.7897965Z 0x0000000000000019 (INIT_ARRAY) 0x2c4cf8 2025-05-07T20:11:13.7898325Z 0x000000000000001b (INIT_ARRAYSZ) 72 (bytes) 2025-05-07T20:11:13.7898703Z 0x000000000000001a (FINI_ARRAY) 0x2c4d40 2025-05-07T20:11:13.7899054Z 0x000000000000001c (FINI_ARRAYSZ) 8 (bytes) 2025-05-07T20:11:13.7899419Z 0x0000000000000004 (HASH) 0x238 2025-05-07T20:11:13.7899752Z 0x000000006ffffef5 (GNU_HASH) 0x18b0 2025-05-07T20:11:13.7900109Z 0x0000000000000005 (STRTAB) 0x5790 2025-05-07T20:11:13.7900468Z 0x0000000000000006 (SYMTAB) 0x2820 2025-05-07T20:11:13.7900821Z 0x000000000000000a (STRSZ) 40152 (bytes) 2025-05-07T20:11:13.7901213Z 0x000000000000000b (SYMENT) 24 (bytes) 2025-05-07T20:11:13.7901571Z 0x0000000000000003 (PLTGOT) 0x2c53f8 2025-05-07T20:11:13.7901967Z 0x0000000000000002 (PLTRELSZ) 6768 (bytes) 2025-05-07T20:11:13.7916344Z 0x0000000000000014 (PLTREL) RELA 2025-05-07T20:11:13.7916749Z 0x0000000000000017 (JMPREL) 0x10f38 2025-05-07T20:11:13.7917127Z 0x0000000000000007 (RELA) 0xf990 2025-05-07T20:11:13.7917609Z 0x0000000000000008 (RELASZ) 5544 (bytes) 2025-05-07T20:11:13.7917981Z 0x0000000000000009 (RELAENT) 24 (bytes) 2025-05-07T20:11:13.7918343Z 0x0000000000000018 (BIND_NOW) 2025-05-07T20:11:13.7918681Z 0x000000006ffffffb (FLAGS_1) Flags: NOW 2025-05-07T20:11:13.7919068Z 0x000000006ffffffe (VERNEED) 0xf860 2025-05-07T20:11:13.7919410Z 0x000000006fffffff (VERNEEDNUM) 5 2025-05-07T20:11:13.7919776Z 0x000000006ffffff0 (VERSYM) 0xf468 2025-05-07T20:11:13.7920118Z 0x000000006ffffff9 (RELACOUNT) 17 2025-05-07T20:11:13.7920459Z 0x0000000000000000 (NULL) 0x0 2025-05-07T20:11:13.7920669Z 2025-05-07T20:11:13.7920821Z ################################################################################ 2025-05-07T20:11:13.7921055Z 2025-05-07T20:11:13.7921059Z 2025-05-07T20:11:13.7921183Z ################################################################################ 2025-05-07T20:11:13.7921732Z [CHECK] BUILT LIBRARY: ./_skbuild/linux-x86_64-3.13/cmake-build/fbgemm_gpu_tbe_optimizers.so 2025-05-07T20:11:13.7922244Z [CHECK] Listing out library size: 2025-05-07T20:11:13.7922747Z + du -h --block-size=1M ./_skbuild/linux-x86_64-3.13/cmake-build/fbgemm_gpu_tbe_optimizers.so 2025-05-07T20:11:13.7923135Z 2025-05-07T20:11:13.7923370Z 9 ./_skbuild/linux-x86_64-3.13/cmake-build/fbgemm_gpu_tbe_optimizers.so 2025-05-07T20:11:13.7923693Z 2025-05-07T20:11:13.7924096Z [CHECK] Listing out the GLIBC versions referenced by: ./_skbuild/linux-x86_64-3.13/cmake-build/fbgemm_gpu_tbe_optimizers.so 2025-05-07T20:11:13.7925134Z + objdump -TC ./_skbuild/linux-x86_64-3.13/cmake-build/fbgemm_gpu_tbe_optimizers.so | grep GLIBC_ | sed 's/.*GLIBC_\([.0-9]*\).*/GLIBC_\1/g' | sort -Vu | cat 2025-05-07T20:11:13.7925773Z 2025-05-07T20:11:13.7925911Z GLIBC_2.2.5 2025-05-07T20:11:13.7926127Z GLIBC_2.3 2025-05-07T20:11:13.7926363Z GLIBC_2.14 2025-05-07T20:11:13.7926483Z 2025-05-07T20:11:13.7926487Z 2025-05-07T20:11:13.7926905Z [CHECK] Listing out the GLIBCXX versions referenced by: ./_skbuild/linux-x86_64-3.13/cmake-build/fbgemm_gpu_tbe_optimizers.so 2025-05-07T20:11:13.7927967Z + objdump -TC ./_skbuild/linux-x86_64-3.13/cmake-build/fbgemm_gpu_tbe_optimizers.so | grep GLIBCXX_ | sed 's/.*GLIBCXX_\([.0-9]*\).*/GLIBCXX_\1/g' | sort -Vu | cat 2025-05-07T20:11:13.7928777Z 2025-05-07T20:11:13.7975993Z GLIBCXX_3.4 2025-05-07T20:11:13.7976641Z GLIBCXX_3.4.9 2025-05-07T20:11:13.7977272Z GLIBCXX_3.4.11 2025-05-07T20:11:13.7977838Z GLIBCXX_3.4.18 2025-05-07T20:11:13.7978427Z GLIBCXX_3.4.21 2025-05-07T20:11:13.7978983Z GLIBCXX_3.4.29 2025-05-07T20:11:13.7979359Z 2025-05-07T20:11:13.7979371Z 2025-05-07T20:11:13.7997347Z + nm -gDC ./_skbuild/linux-x86_64-3.13/cmake-build/fbgemm_gpu_tbe_optimizers.so > /tmp/tmp.zebDaSnw5k.symbols.txt 2025-05-07T20:11:13.7997887Z 2025-05-07T20:11:13.8023641Z 2025-05-07T20:11:13.8051605Z [CHECK] Total Number of symbols: 342 2025-05-07T20:11:13.8062535Z [CHECK] Number of fbgemm symbols: 14 2025-05-07T20:11:13.8081313Z + nm -gDCu ./_skbuild/linux-x86_64-3.13/cmake-build/fbgemm_gpu_tbe_optimizers.so > /tmp/tmp.iy0nGhOgBy.usymbols.txt 2025-05-07T20:11:13.8082830Z 2025-05-07T20:11:13.8094773Z 2025-05-07T20:11:13.8121182Z [CHECK] Listing out undefined symbols (129 total): 2025-05-07T20:11:13.8136592Z U VTT for std::__cxx11::basic_ostringstream, std::allocator >@GLIBCXX_3.4.21 2025-05-07T20:11:13.8139103Z U VTT for std::__cxx11::basic_stringstream, std::allocator >@GLIBCXX_3.4.21 2025-05-07T20:11:13.8140493Z U _Unwind_Resume@GCC_3.0 2025-05-07T20:11:13.8140867Z U __cudaPopCallConfiguration@libcudart.so.12 2025-05-07T20:11:13.8141418Z U __cudaPushCallConfiguration@libcudart.so.12 2025-05-07T20:11:13.8141829Z U __cudaRegisterFatBinary@libcudart.so.12 2025-05-07T20:11:13.8142258Z U __cudaRegisterFatBinaryEnd@libcudart.so.12 2025-05-07T20:11:13.8142696Z U __cudaRegisterFunction@libcudart.so.12 2025-05-07T20:11:13.8143074Z U __cudaRegisterVar@libcudart.so.12 2025-05-07T20:11:13.8143428Z U __cudaUnregisterFatBinary@libcudart.so.12 2025-05-07T20:11:13.8143794Z U __cxa_atexit@GLIBC_2.2.5 2025-05-07T20:11:13.8144109Z U __cxa_begin_catch@CXXABI_1.3 2025-05-07T20:11:13.8144446Z U __cxa_end_catch@CXXABI_1.3 2025-05-07T20:11:13.8144747Z U __cxa_rethrow@CXXABI_1.3 2025-05-07T20:11:13.8145070Z U __cxa_thread_atexit@CXXABI_1.3.7 2025-05-07T20:11:13.8145414Z U __gxx_personality_v0@CXXABI_1.3 2025-05-07T20:11:13.8145730Z U __tls_get_addr@GLIBC_2.3 2025-05-07T20:11:13.8146074Z U at::CUDAGeneratorImpl::device_type() 2025-05-07T20:11:13.8146462Z U at::CUDAGeneratorImpl::philox_cuda_state(unsigned long) 2025-05-07T20:11:13.8146934Z U at::_ops::flatten_using_ints::call(at::Tensor const&, long, long) 2025-05-07T20:11:13.8147377Z U at::cuda::detail::getDefaultCUDAGenerator(signed char) 2025-05-07T20:11:13.8147759Z U c10::BoolType::get() 2025-05-07T20:11:13.8148133Z U c10::DeviceTypeName[abi:cxx11](c10::DeviceType, bool) 2025-05-07T20:11:13.8148488Z U c10::FloatType::get() 2025-05-07T20:11:13.8148824Z U c10::GeneratorImpl::device() const 2025-05-07T20:11:13.8149208Z U c10::Half* at::TensorBase::mutable_data_ptr() const 2025-05-07T20:11:13.8149644Z U c10::IValue::reportToTensorTypeError() const 2025-05-07T20:11:13.8149982Z U c10::IntType::get() 2025-05-07T20:11:13.8150410Z U c10::MessageLogger::MessageLogger(char const*, int, int) 2025-05-07T20:11:13.8150830Z U c10::MessageLogger::~MessageLogger() 2025-05-07T20:11:13.8151192Z U c10::SymBool::guard_bool(char const*, long) const 2025-05-07T20:11:13.8151603Z U c10::SymFloat::guard_float(char const*, long) const 2025-05-07T20:11:13.8152240Z U c10::SymInt::SymInt(c10::intrusive_ptr >) 2025-05-07T20:11:13.8152874Z U c10::SymInt::guard_int(char const*, long) const 2025-05-07T20:11:13.8153231Z U c10::TensorType::get() 2025-05-07T20:11:13.8153552Z U c10::UndefinedTensorImpl::_singleton 2025-05-07T20:11:13.8154463Z U c10::Warning::Warning(std::variant, c10::SourceLocation const&, std::__cxx11::basic_string, std::allocator >, bool) 2025-05-07T20:11:13.8155397Z U c10::cuda::CUDACachingAllocator::allocator 2025-05-07T20:11:13.8155747Z U c10::cuda::CUDAStream::stream() const 2025-05-07T20:11:13.8156103Z U c10::cuda::ExchangeDevice(signed char) 2025-05-07T20:11:13.8156436Z U c10::cuda::GetDevice(signed char*) 2025-05-07T20:11:13.8156785Z U c10::cuda::MaybeSetDevice(signed char) 2025-05-07T20:11:13.8157138Z U c10::cuda::SetDevice(signed char) 2025-05-07T20:11:13.8157585Z U c10::cuda::c10_cuda_check_implementation(int, char const*, char const*, int, bool) 2025-05-07T20:11:13.8158055Z U c10::cuda::device_count() 2025-05-07T20:11:13.8158390Z U c10::cuda::getCurrentCUDAStream(signed char) 2025-05-07T20:11:13.8158785Z U c10::cuda::getDefaultCUDAStream(signed char) 2025-05-07T20:11:13.8159153Z U c10::cuda::getStreamFromPool(bool, signed char) 2025-05-07T20:11:13.8159605Z U c10::cuda::getStreamFromPool(int, signed char) 2025-05-07T20:11:13.8160031Z U c10::cuda::setCurrentCUDAStream(c10::cuda::CUDAStream) 2025-05-07T20:11:13.8160456Z U c10::cuda::warn_or_error_on_sync() 2025-05-07T20:11:13.8161159Z U c10::detail::infer_schema::make_function_schema(c10::ArrayRef, c10::ArrayRef) 2025-05-07T20:11:13.8162014Z U c10::detail::torchCheckFail(char const*, char const*, unsigned int, char const*) 2025-05-07T20:11:13.8162856Z U c10::detail::torchCheckFail(char const*, char const*, unsigned int, std::__cxx11::basic_string, std::allocator > const&) 2025-05-07T20:11:13.8163740Z U c10::detail::torchInternalAssertFail(char const*, char const*, unsigned int, char const*, char const*) 2025-05-07T20:11:13.8164310Z U c10::impl::GPUTrace::gpuTraceState 2025-05-07T20:11:13.8164666Z U c10::impl::GPUTrace::haveState 2025-05-07T20:11:13.8165001Z U c10::impl::device_guard_impl_registry 2025-05-07T20:11:13.8165374Z U c10::operator<<(std::ostream&, c10::Device const&) 2025-05-07T20:11:13.8165745Z U c10::operator<<(std::ostream&, c10::DeviceType) 2025-05-07T20:11:13.8166101Z U c10::warn(c10::Warning const&) 2025-05-07T20:11:13.8166506Z U caffe2::TypeMeta::error_unsupported_typemeta(caffe2::TypeMeta) 2025-05-07T20:11:13.8166920Z U cudaDeviceSynchronize@libcudart.so.12 2025-05-07T20:11:13.8167299Z U cudaEventCreateWithFlags@libcudart.so.12 2025-05-07T20:11:13.8167649Z U cudaEventDestroy@libcudart.so.12 2025-05-07T20:11:13.8168006Z U cudaEventElapsedTime@libcudart.so.12 2025-05-07T20:11:13.8168338Z U cudaEventQuery@libcudart.so.12 2025-05-07T20:11:13.8168709Z U cudaEventRecord@libcudart.so.12 2025-05-07T20:11:13.8169068Z U cudaEventSynchronize@libcudart.so.12 2025-05-07T20:11:13.8169424Z U cudaGetDeviceProperties_v2@libcudart.so.12 2025-05-07T20:11:13.8169798Z U cudaGetErrorString@libcudart.so.12 2025-05-07T20:11:13.8170124Z U cudaGetLastError@libcudart.so.12 2025-05-07T20:11:13.8170763Z U cudaLaunchKernel@libcudart.so.12 2025-05-07T20:11:13.8171115Z U cudaStreamQuery@libcudart.so.12 2025-05-07T20:11:13.8171500Z U cudaStreamSynchronize@libcudart.so.12 2025-05-07T20:11:13.8171868Z U cudaStreamWaitEvent@libcudart.so.12 2025-05-07T20:11:13.8172247Z U float at::Tensor::item() const 2025-05-07T20:11:13.8172659Z U float* at::TensorBase::mutable_data_ptr() const 2025-05-07T20:11:13.8173081Z U int* at::TensorBase::mutable_data_ptr() const 2025-05-07T20:11:13.8173518Z U long* at::TensorBase::mutable_data_ptr() const 2025-05-07T20:11:13.8173886Z U memcpy@GLIBC_2.14 2025-05-07T20:11:13.8174200Z U memmove@GLIBC_2.2.5 2025-05-07T20:11:13.8174496Z U memset@GLIBC_2.2.5 2025-05-07T20:11:13.8174863Z U operator delete(void*, unsigned long)@CXXABI_1.3.9 2025-05-07T20:11:13.8175268Z U operator new(unsigned long)@GLIBCXX_3.4 2025-05-07T20:11:13.8175845Z U std::__cxx11::basic_ostringstream, std::allocator >::basic_ostringstream() 2025-05-07T20:11:13.8176709Z U std::__cxx11::basic_ostringstream, std::allocator >::~basic_ostringstream()@GLIBCXX_3.4.21 2025-05-07T20:11:13.8177553Z U std::__cxx11::basic_stringbuf, std::allocator >::str() const &@GLIBCXX_3.4.29 2025-05-07T20:11:13.8178491Z U std::__detail::_Prime_rehash_policy::_M_need_rehash(unsigned long, unsigned long, unsigned long) const@GLIBCXX_3.4.18 2025-05-07T20:11:13.8179148Z U std::__throw_bad_alloc()@GLIBCXX_3.4 2025-05-07T20:11:13.8179508Z U std::__throw_bad_array_new_length() 2025-05-07T20:11:13.8179910Z U std::__throw_length_error(char const*)@GLIBCXX_3.4 2025-05-07T20:11:13.8180332Z U std::__throw_logic_error(char const*)@GLIBCXX_3.4 2025-05-07T20:11:13.8180709Z U std::__throw_system_error(int)@GLIBCXX_3.4.11 2025-05-07T20:11:13.8181203Z U std::basic_ios >::clear(std::_Ios_Iostate)@GLIBCXX_3.4 2025-05-07T20:11:13.8181887Z U std::basic_ios >::init(std::basic_streambuf >*)@GLIBCXX_3.4 2025-05-07T20:11:13.8182941Z U std::basic_ostream >& std::__ostream_insert >(std::basic_ostream >&, char const*, long)@GLIBCXX_3.4.9 2025-05-07T20:11:13.8184139Z U std::basic_ostream >& std::operator<< >(std::basic_ostream >&, char const*)@GLIBCXX_3.4 2025-05-07T20:11:13.8184882Z U std::ios_base::Init::Init()@GLIBCXX_3.4 2025-05-07T20:11:13.8185260Z U std::ios_base::Init::~Init()@GLIBCXX_3.4 2025-05-07T20:11:13.8185633Z U std::ios_base::ios_base()@GLIBCXX_3.4 2025-05-07T20:11:13.8185981Z U std::ios_base::~ios_base()@GLIBCXX_3.4 2025-05-07T20:11:13.8186337Z U std::locale::locale()@GLIBCXX_3.4 2025-05-07T20:11:13.8186678Z U std::locale::~locale()@GLIBCXX_3.4 2025-05-07T20:11:13.8187096Z U std::ostream& std::ostream::_M_insert(long)@GLIBCXX_3.4.9 2025-05-07T20:11:13.8187627Z U std::ostream& std::ostream::_M_insert(unsigned long)@GLIBCXX_3.4.9 2025-05-07T20:11:13.8188152Z U std::ostream::operator<<(int)@GLIBCXX_3.4 2025-05-07T20:11:13.8188516Z U std::terminate()@GLIBCXX_3.4 2025-05-07T20:11:13.8188828Z U strlen@GLIBC_2.2.5 2025-05-07T20:11:13.8189156Z U torch::CppFunction::~CppFunction() 2025-05-07T20:11:13.8190070Z U torch::Library::Library(torch::Library::Kind, std::__cxx11::basic_string, std::allocator >, std::optional, char const*, unsigned int) 2025-05-07T20:11:13.8191211Z U torch::Library::_def(c10::FunctionSchema&&, c10::OperatorName*, std::vector > const&, torch::_RegisterOrVerify) & 2025-05-07T20:11:13.8192037Z U torch::Library::_impl(char const*, torch::CppFunction&&, torch::_RegisterOrVerify) & 2025-05-07T20:11:13.8192754Z U torch::jit::parseSchema(std::__cxx11::basic_string, std::allocator > const&, bool) 2025-05-07T20:11:13.8193381Z U vtable for __cxxabiv1::__class_type_info@CXXABI_1.3 2025-05-07T20:11:13.8193821Z U vtable for __cxxabiv1::__function_type_info@CXXABI_1.3 2025-05-07T20:11:13.8194237Z U vtable for __cxxabiv1::__si_class_type_info@CXXABI_1.3 2025-05-07T20:11:13.8194854Z U vtable for std::__cxx11::basic_ostringstream, std::allocator >@GLIBCXX_3.4.21 2025-05-07T20:11:13.8195708Z U vtable for std::__cxx11::basic_stringbuf, std::allocator >@GLIBCXX_3.4.21 2025-05-07T20:11:13.8196447Z U vtable for std::__cxx11::basic_stringstream, std::allocator >@GLIBCXX_3.4.21 2025-05-07T20:11:13.8197062Z U vtable for std::basic_ios >@GLIBCXX_3.4 2025-05-07T20:11:13.8197585Z U vtable for std::basic_streambuf >@GLIBCXX_3.4 2025-05-07T20:11:13.8198028Z w _ITM_deregisterTMCloneTable 2025-05-07T20:11:13.8198354Z w _ITM_registerTMCloneTable 2025-05-07T20:11:13.8198686Z w __cxa_finalize@GLIBC_2.2.5 2025-05-07T20:11:13.8198989Z w __gmon_start__ 2025-05-07T20:11:13.8199244Z w __pthread_key_create 2025-05-07T20:11:13.8199546Z w pthread_mutex_lock@GLIBC_2.2.5 2025-05-07T20:11:13.8199853Z w pthread_mutex_unlock@GLIBC_2.2.5 2025-05-07T20:11:13.8200223Z [CHECK] Listing out external shared libraries linked: 2025-05-07T20:11:13.8200660Z + ldd ./_skbuild/linux-x86_64-3.13/cmake-build/fbgemm_gpu_tbe_optimizers.so 2025-05-07T20:11:13.8200987Z 2025-05-07T20:11:13.8201113Z linux-vdso.so.1 (0x00007fff6a37a000) 2025-05-07T20:11:13.8201417Z libtorch.so => not found 2025-05-07T20:11:13.8201652Z libc10.so => not found 2025-05-07T20:11:13.8201908Z libc10_cuda.so => not found 2025-05-07T20:11:13.8202160Z libtorch_cpu.so => not found 2025-05-07T20:11:13.8202435Z libtorch_cuda.so => not found 2025-05-07T20:11:13.8202694Z libcudart.so.12 => not found 2025-05-07T20:11:13.8203028Z libstdc++.so.6 => /lib64/libstdc++.so.6 (0x00007fa1d4d9c000) 2025-05-07T20:11:13.8203430Z libgcc_s.so.1 => /lib64/libgcc_s.so.1 (0x00007fa1d5a63000) 2025-05-07T20:11:13.8203816Z libc.so.6 => /lib64/libc.so.6 (0x00007fa1d4b94000) 2025-05-07T20:11:13.8204179Z /lib64/ld-linux-x86-64.so.2 (0x00007fa1d5a97000) 2025-05-07T20:11:13.8204525Z libm.so.6 => /lib64/libm.so.6 (0x00007fa1d5988000) 2025-05-07T20:11:13.8204742Z 2025-05-07T20:11:13.8204868Z [CHECK] Displaying ELF information: 2025-05-07T20:11:13.8205287Z + readelf -d ./_skbuild/linux-x86_64-3.13/cmake-build/fbgemm_gpu_tbe_optimizers.so 2025-05-07T20:11:13.8205639Z 2025-05-07T20:11:13.8208444Z 2025-05-07T20:11:13.8208609Z Dynamic section at offset 0x8a8558 contains 37 entries: 2025-05-07T20:11:13.8209253Z Tag Type Name/Value 2025-05-07T20:11:13.8209703Z 0x0000000000000001 (NEEDED) Shared library: [libtorch.so] 2025-05-07T20:11:13.8210385Z 0x0000000000000001 (NEEDED) Shared library: [libc10.so] 2025-05-07T20:11:13.8210906Z 0x0000000000000001 (NEEDED) Shared library: [libc10_cuda.so] 2025-05-07T20:11:13.8211455Z 0x0000000000000001 (NEEDED) Shared library: [libtorch_cpu.so] 2025-05-07T20:11:13.8212059Z 0x0000000000000001 (NEEDED) Shared library: [libtorch_cuda.so] 2025-05-07T20:11:13.8212591Z 0x0000000000000001 (NEEDED) Shared library: [libcudart.so.12] 2025-05-07T20:11:13.8213133Z 0x0000000000000001 (NEEDED) Shared library: [libstdc++.so.6] 2025-05-07T20:11:13.8213646Z 0x0000000000000001 (NEEDED) Shared library: [libgcc_s.so.1] 2025-05-07T20:11:13.8214172Z 0x0000000000000001 (NEEDED) Shared library: [libc.so.6] 2025-05-07T20:11:13.8214688Z 0x0000000000000001 (NEEDED) Shared library: [ld-linux-x86-64.so.2] 2025-05-07T20:11:13.8215281Z 0x000000000000000e (SONAME) Library soname: [fbgemm_gpu_tbe_optimizers.so] 2025-05-07T20:11:13.8215772Z 0x000000000000000c (INIT) 0x10000 2025-05-07T20:11:13.8216112Z 0x000000000000000d (FINI) 0x3464c 2025-05-07T20:11:13.8216472Z 0x0000000000000019 (INIT_ARRAY) 0x8a82d8 2025-05-07T20:11:13.8216821Z 0x000000000000001b (INIT_ARRAYSZ) 48 (bytes) 2025-05-07T20:11:13.8217196Z 0x000000000000001a (FINI_ARRAY) 0x8a8308 2025-05-07T20:11:13.8217542Z 0x000000000000001c (FINI_ARRAYSZ) 8 (bytes) 2025-05-07T20:11:13.8217901Z 0x0000000000000004 (HASH) 0x238 2025-05-07T20:11:13.8218241Z 0x000000006ffffef5 (GNU_HASH) 0xf58 2025-05-07T20:11:13.8218572Z 0x0000000000000005 (STRTAB) 0x3a30 2025-05-07T20:11:13.8218922Z 0x0000000000000006 (SYMTAB) 0x1a08 2025-05-07T20:11:13.8219274Z 0x000000000000000a (STRSZ) 36563 (bytes) 2025-05-07T20:11:13.8219695Z 0x000000000000000b (SYMENT) 24 (bytes) 2025-05-07T20:11:13.8220049Z 0x0000000000000003 (PLTGOT) 0x8a87f8 2025-05-07T20:11:13.8220465Z 0x0000000000000002 (PLTRELSZ) 3600 (bytes) 2025-05-07T20:11:13.8220841Z 0x0000000000000014 (PLTREL) RELA 2025-05-07T20:11:13.8221190Z 0x0000000000000017 (JMPREL) 0xe920 2025-05-07T20:11:13.8221543Z 0x0000000000000007 (RELA) 0xcce8 2025-05-07T20:11:13.8221888Z 0x0000000000000008 (RELASZ) 7224 (bytes) 2025-05-07T20:11:13.8222266Z 0x0000000000000009 (RELAENT) 24 (bytes) 2025-05-07T20:11:13.8222599Z 0x0000000000000018 (BIND_NOW) 2025-05-07T20:11:13.8222949Z 0x000000006ffffffb (FLAGS_1) Flags: NOW 2025-05-07T20:11:13.8223302Z 0x000000006ffffffe (VERNEED) 0xcbb8 2025-05-07T20:11:13.8223656Z 0x000000006fffffff (VERNEEDNUM) 5 2025-05-07T20:11:13.8223985Z 0x000000006ffffff0 (VERSYM) 0xc904 2025-05-07T20:11:13.8224340Z 0x000000006ffffff9 (RELACOUNT) 90 2025-05-07T20:11:13.8224674Z 0x0000000000000000 (NULL) 0x0 2025-05-07T20:11:13.8224885Z 2025-05-07T20:11:13.8225005Z ################################################################################ 2025-05-07T20:11:13.8225256Z 2025-05-07T20:11:13.8225260Z 2025-05-07T20:11:13.8225378Z ################################################################################ 2025-05-07T20:11:13.8225874Z [CHECK] BUILT LIBRARY: ./_skbuild/linux-x86_64-3.13/cmake-build/fbgemm_gpu_tbe_cache.so 2025-05-07T20:11:13.8226386Z [CHECK] Listing out library size: 2025-05-07T20:11:13.8226863Z + du -h --block-size=1M ./_skbuild/linux-x86_64-3.13/cmake-build/fbgemm_gpu_tbe_cache.so 2025-05-07T20:11:13.8227230Z 2025-05-07T20:11:13.8227438Z 21 ./_skbuild/linux-x86_64-3.13/cmake-build/fbgemm_gpu_tbe_cache.so 2025-05-07T20:11:13.8227767Z 2025-05-07T20:11:13.8228149Z [CHECK] Listing out the GLIBC versions referenced by: ./_skbuild/linux-x86_64-3.13/cmake-build/fbgemm_gpu_tbe_cache.so 2025-05-07T20:11:13.8229356Z + objdump -TC ./_skbuild/linux-x86_64-3.13/cmake-build/fbgemm_gpu_tbe_cache.so | grep GLIBC_ | sed 's/.*GLIBC_\([.0-9]*\).*/GLIBC_\1/g' | sort -Vu | cat 2025-05-07T20:11:13.8229981Z 2025-05-07T20:11:13.8295242Z GLIBC_2.2.5 2025-05-07T20:11:13.8295519Z GLIBC_2.14 2025-05-07T20:11:13.8296140Z 2025-05-07T20:11:13.8296375Z 2025-05-07T20:11:13.8296820Z [CHECK] Listing out the GLIBCXX versions referenced by: ./_skbuild/linux-x86_64-3.13/cmake-build/fbgemm_gpu_tbe_cache.so 2025-05-07T20:11:13.8297864Z + objdump -TC ./_skbuild/linux-x86_64-3.13/cmake-build/fbgemm_gpu_tbe_cache.so | grep GLIBCXX_ | sed 's/.*GLIBCXX_\([.0-9]*\).*/GLIBCXX_\1/g' | sort -Vu | cat 2025-05-07T20:11:13.8298463Z 2025-05-07T20:11:13.8373284Z GLIBCXX_3.4 2025-05-07T20:11:13.8373930Z GLIBCXX_3.4.9 2025-05-07T20:11:13.8374543Z GLIBCXX_3.4.11 2025-05-07T20:11:13.8375112Z GLIBCXX_3.4.20 2025-05-07T20:11:13.8375709Z GLIBCXX_3.4.21 2025-05-07T20:11:13.8376264Z GLIBCXX_3.4.29 2025-05-07T20:11:13.8376670Z 2025-05-07T20:11:13.8376683Z 2025-05-07T20:11:13.8395527Z + nm -gDC ./_skbuild/linux-x86_64-3.13/cmake-build/fbgemm_gpu_tbe_cache.so > /tmp/tmp.AETeB1nRaT.symbols.txt 2025-05-07T20:11:13.8396984Z 2025-05-07T20:11:13.8437629Z 2025-05-07T20:11:13.8461792Z [CHECK] Total Number of symbols: 811 2025-05-07T20:11:13.8478373Z [CHECK] Number of fbgemm symbols: 80 2025-05-07T20:11:13.8493496Z + nm -gDCu ./_skbuild/linux-x86_64-3.13/cmake-build/fbgemm_gpu_tbe_cache.so > /tmp/tmp.zuB9DoG8KD.usymbols.txt 2025-05-07T20:11:13.8494903Z 2025-05-07T20:11:13.8513298Z 2025-05-07T20:11:13.8536144Z [CHECK] Listing out undefined symbols (152 total): 2025-05-07T20:11:13.8554136Z U VTT for std::__cxx11::basic_ostringstream, std::allocator >@GLIBCXX_3.4.21 2025-05-07T20:11:13.8555890Z U _Unwind_Resume@GCC_3.0 2025-05-07T20:11:13.8556895Z U __cudaPopCallConfiguration@libcudart.so.12 2025-05-07T20:11:13.8557558Z U __cudaPushCallConfiguration@libcudart.so.12 2025-05-07T20:11:13.8558099Z U __cudaRegisterFatBinary@libcudart.so.12 2025-05-07T20:11:13.8558562Z U __cudaRegisterFatBinaryEnd@libcudart.so.12 2025-05-07T20:11:13.8559003Z U __cudaRegisterFunction@libcudart.so.12 2025-05-07T20:11:13.8559369Z U __cudaRegisterVar@libcudart.so.12 2025-05-07T20:11:13.8559756Z U __cudaUnregisterFatBinary@libcudart.so.12 2025-05-07T20:11:13.8560111Z U __cxa_atexit@GLIBC_2.2.5 2025-05-07T20:11:13.8560564Z U __cxa_begin_catch@CXXABI_1.3 2025-05-07T20:11:13.8560854Z U __cxa_end_catch@CXXABI_1.3 2025-05-07T20:11:13.8561169Z U __cxa_guard_abort@CXXABI_1.3 2025-05-07T20:11:13.8561578Z U __cxa_guard_acquire@CXXABI_1.3 2025-05-07T20:11:13.8561887Z U __cxa_guard_release@CXXABI_1.3 2025-05-07T20:11:13.8562194Z U __cxa_rethrow@CXXABI_1.3 2025-05-07T20:11:13.8562490Z U __gxx_personality_v0@CXXABI_1.3 2025-05-07T20:11:13.8562818Z U at::CUDAGeneratorImpl::device_type() 2025-05-07T20:11:13.8563206Z U at::CUDAGeneratorImpl::philox_cuda_state(unsigned long) 2025-05-07T20:11:13.8563906Z U at::_ops::arange::call(c10::Scalar const&, std::optional, std::optional, std::optional, std::optional) 2025-05-07T20:11:13.8564965Z U at::_ops::empty_like::call(at::Tensor const&, std::optional, std::optional, std::optional, std::optional, std::optional) 2025-05-07T20:11:13.8566209Z U at::_ops::empty_memory_format::call(c10::ArrayRef, std::optional, std::optional, std::optional, std::optional, std::optional) 2025-05-07T20:11:13.8567082Z U at::_ops::fill__Scalar::call(at::Tensor&, c10::Scalar const&) 2025-05-07T20:11:13.8567978Z U at::_ops::full_like::call(at::Tensor const&, c10::Scalar const&, std::optional, std::optional, std::optional, std::optional, std::optional) 2025-05-07T20:11:13.8568876Z U at::_ops::index_select::call(at::Tensor const&, long, at::Tensor const&) 2025-05-07T20:11:13.8569531Z U at::_ops::slice_Tensor::call(at::Tensor const&, long, std::optional, std::optional, c10::SymInt) 2025-05-07T20:11:13.8570504Z U at::_ops::zeros::call(c10::ArrayRef, std::optional, std::optional, std::optional, std::optional) 2025-05-07T20:11:13.8571912Z U at::_ops::zeros_like::call(at::Tensor const&, std::optional, std::optional, std::optional, std::optional, std::optional) 2025-05-07T20:11:13.8572788Z U at::cuda::detail::getDefaultCUDAGenerator(signed char) 2025-05-07T20:11:13.8573203Z U at::cuda::getDeviceProperties(signed char) 2025-05-07T20:11:13.8573756Z U c10::BoolType::get() 2025-05-07T20:11:13.8574131Z U c10::DeviceTypeName[abi:cxx11](c10::DeviceType, bool) 2025-05-07T20:11:13.8574534Z U c10::GeneratorImpl::device() const 2025-05-07T20:11:13.8574953Z U c10::Half* at::TensorBase::mutable_data_ptr() const 2025-05-07T20:11:13.8575384Z U c10::IValue::reportToTensorTypeError() const 2025-05-07T20:11:13.8575751Z U c10::IntType::get() 2025-05-07T20:11:13.8576170Z U c10::OptionalType::get(c10::Type::SingletonOrSharedTypePtr) 2025-05-07T20:11:13.8576692Z U c10::StorageImpl::throw_data_ptr_access_error() const 2025-05-07T20:11:13.8577131Z U c10::SymBool::guard_bool(char const*, long) const 2025-05-07T20:11:13.8577829Z U c10::SymInt::SymInt(c10::intrusive_ptr >) 2025-05-07T20:11:13.8578619Z U c10::SymInt::guard_int(char const*, long) const 2025-05-07T20:11:13.8578989Z U c10::SymInt::promote_to_negative() 2025-05-07T20:11:13.8579400Z U c10::TensorImpl::throw_data_ptr_access_error() const 2025-05-07T20:11:13.8579800Z U c10::TensorType::get() 2025-05-07T20:11:13.8580141Z U c10::UndefinedTensorImpl::_singleton 2025-05-07T20:11:13.8580887Z U c10::Warning::Warning(std::variant, c10::SourceLocation const&, std::__cxx11::basic_string, std::allocator >, bool) 2025-05-07T20:11:13.8581030Z U c10::cuda::CUDACachingAllocator::allocator 2025-05-07T20:11:13.8581156Z U c10::cuda::CUDAStream::stream() const 2025-05-07T20:11:13.8581312Z U c10::cuda::ExchangeDevice(signed char) 2025-05-07T20:11:13.8581439Z U c10::cuda::GetDevice(signed char*) 2025-05-07T20:11:13.8581561Z U c10::cuda::MaybeSetDevice(signed char) 2025-05-07T20:11:13.8581697Z U c10::cuda::SetDevice(signed char) 2025-05-07T20:11:13.8581951Z U c10::cuda::c10_cuda_check_implementation(int, char const*, char const*, int, bool) 2025-05-07T20:11:13.8582063Z U c10::cuda::current_device() 2025-05-07T20:11:13.8582169Z U c10::cuda::device_count() 2025-05-07T20:11:13.8582331Z U c10::cuda::getCurrentCUDAStream(signed char) 2025-05-07T20:11:13.8582474Z U c10::cuda::getDefaultCUDAStream(signed char) 2025-05-07T20:11:13.8582619Z U c10::cuda::getStreamFromPool(bool, signed char) 2025-05-07T20:11:13.8582782Z U c10::cuda::getStreamFromPool(int, signed char) 2025-05-07T20:11:13.8583081Z U c10::cuda::setCurrentCUDAStream(c10::cuda::CUDAStream) 2025-05-07T20:11:13.8583195Z U c10::cuda::warn_or_error_on_sync() 2025-05-07T20:11:13.8583691Z U c10::detail::infer_schema::make_function_schema(c10::ArrayRef, c10::ArrayRef) 2025-05-07T20:11:13.8583931Z U c10::detail::torchCheckFail(char const*, char const*, unsigned int, char const*) 2025-05-07T20:11:13.8584412Z U c10::detail::torchCheckFail(char const*, char const*, unsigned int, std::__cxx11::basic_string, std::allocator > const&) 2025-05-07T20:11:13.8584732Z U c10::detail::torchInternalAssertFail(char const*, char const*, unsigned int, char const*, char const*) 2025-05-07T20:11:13.8584848Z U c10::impl::GPUTrace::gpuTraceState 2025-05-07T20:11:13.8584976Z U c10::impl::GPUTrace::haveState 2025-05-07T20:11:13.8585123Z U c10::impl::cow::is_cow_data_ptr(c10::DataPtr const&) 2025-05-07T20:11:13.8585285Z U c10::impl::cow::materialize_cow_storage(c10::StorageImpl&) 2025-05-07T20:11:13.8585405Z U c10::impl::device_guard_impl_registry 2025-05-07T20:11:13.8585550Z U c10::operator<<(std::ostream&, c10::Device const&) 2025-05-07T20:11:13.8585681Z U c10::operator<<(std::ostream&, c10::DeviceType) 2025-05-07T20:11:13.8585792Z U c10::throwNullDataPtrError() 2025-05-07T20:11:13.8585908Z U c10::warn(c10::Warning const&) 2025-05-07T20:11:13.8586018Z U c10::warnDeprecatedDataPtr() 2025-05-07T20:11:13.8586203Z U caffe2::TypeMeta::error_unsupported_typemeta(caffe2::TypeMeta) 2025-05-07T20:11:13.8586333Z U caffe2::TypeMeta::typeMetaDatas() 2025-05-07T20:11:13.8586462Z U cudaDeviceGetAttribute@libcudart.so.12 2025-05-07T20:11:13.8586610Z U cudaDeviceSynchronize@libcudart.so.12 2025-05-07T20:11:13.8586763Z U cudaEventCreateWithFlags@libcudart.so.12 2025-05-07T20:11:13.8586895Z U cudaEventDestroy@libcudart.so.12 2025-05-07T20:11:13.8587040Z U cudaEventElapsedTime@libcudart.so.12 2025-05-07T20:11:13.8587153Z U cudaEventQuery@libcudart.so.12 2025-05-07T20:11:13.8587280Z U cudaEventRecord@libcudart.so.12 2025-05-07T20:11:13.8587400Z U cudaEventSynchronize@libcudart.so.12 2025-05-07T20:11:13.8587524Z U cudaFuncGetAttributes@libcudart.so.12 2025-05-07T20:11:13.8587648Z U cudaGetDevice@libcudart.so.12 2025-05-07T20:11:13.8587766Z U cudaGetDeviceCount@libcudart.so.12 2025-05-07T20:11:13.8587884Z U cudaGetErrorString@libcudart.so.12 2025-05-07T20:11:13.8588018Z U cudaGetLastError@libcudart.so.12 2025-05-07T20:11:13.8588132Z U cudaLaunchKernel@libcudart.so.12 2025-05-07T20:11:13.8588248Z U cudaMemcpyAsync@libcudart.so.12 2025-05-07T20:11:13.8588357Z U cudaMemsetAsync@libcudart.so.12 2025-05-07T20:11:13.8588656Z U cudaOccupancyMaxActiveBlocksPerMultiprocessorWithFlags@libcudart.so.12 2025-05-07T20:11:13.8588774Z U cudaPeekAtLastError@libcudart.so.12 2025-05-07T20:11:13.8588884Z U cudaSetDevice@libcudart.so.12 2025-05-07T20:11:13.8589013Z U cudaStreamQuery@libcudart.so.12 2025-05-07T20:11:13.8589135Z U cudaStreamSynchronize@libcudart.so.12 2025-05-07T20:11:13.8589253Z U cudaStreamWaitEvent@libcudart.so.12 2025-05-07T20:11:13.8589426Z U float* at::TensorBase::mutable_data_ptr() const 2025-05-07T20:11:13.8589551Z U int* at::TensorBase::data_ptr() const 2025-05-07T20:11:13.8589693Z U int* at::TensorBase::mutable_data_ptr() const 2025-05-07T20:11:13.8589813Z U log2f@GLIBC_2.2.5 2025-05-07T20:11:13.8590004Z U long c10::detail::maybe_wrap_dim_slow(long, long, bool) 2025-05-07T20:11:13.8590138Z U long* at::TensorBase::data_ptr() const 2025-05-07T20:11:13.8590287Z U long* at::TensorBase::mutable_data_ptr() const 2025-05-07T20:11:13.8590406Z U memcpy@GLIBC_2.14 2025-05-07T20:11:13.8590503Z U memmove@GLIBC_2.2.5 2025-05-07T20:11:13.8590595Z U memset@GLIBC_2.2.5 2025-05-07T20:11:13.8590756Z U operator delete(void*, unsigned long)@CXXABI_1.3.9 2025-05-07T20:11:13.8590879Z U operator new(unsigned long)@GLIBCXX_3.4 2025-05-07T20:11:13.8591201Z U std::__cxx11::basic_ostringstream, std::allocator >::basic_ostringstream() 2025-05-07T20:11:13.8591590Z U std::__cxx11::basic_ostringstream, std::allocator >::~basic_ostringstream()@GLIBCXX_3.4.21 2025-05-07T20:11:13.8591912Z U std::__cxx11::basic_stringbuf, std::allocator >::str() const &@GLIBCXX_3.4.29 2025-05-07T20:11:13.8592034Z U std::__throw_bad_alloc()@GLIBCXX_3.4 2025-05-07T20:11:13.8592364Z U std::__throw_length_error(char const*)@GLIBCXX_3.4 2025-05-07T20:11:13.8592510Z U std::__throw_logic_error(char const*)@GLIBCXX_3.4 2025-05-07T20:11:13.8592684Z U std::__throw_out_of_range_fmt(char const*, ...)@GLIBCXX_3.4.20 2025-05-07T20:11:13.8592835Z U std::__throw_system_error(int)@GLIBCXX_3.4.11 2025-05-07T20:11:13.8593073Z U std::basic_ios >::clear(std::_Ios_Iostate)@GLIBCXX_3.4 2025-05-07T20:11:13.8593410Z U std::basic_ios >::init(std::basic_streambuf >*)@GLIBCXX_3.4 2025-05-07T20:11:13.8594014Z U std::basic_ostream >& std::__ostream_insert >(std::basic_ostream >&, char const*, long)@GLIBCXX_3.4.9 2025-05-07T20:11:13.8595792Z U std::ios_base::Init::Init()@GLIBCXX_3.4 2025-05-07T20:11:13.8595930Z U std::ios_base::Init::~Init()@GLIBCXX_3.4 2025-05-07T20:11:13.8596074Z U std::ios_base::ios_base()@GLIBCXX_3.4 2025-05-07T20:11:13.8596192Z U std::ios_base::~ios_base()@GLIBCXX_3.4 2025-05-07T20:11:13.8596311Z U std::locale::locale()@GLIBCXX_3.4 2025-05-07T20:11:13.8596447Z U std::locale::~locale()@GLIBCXX_3.4 2025-05-07T20:11:13.8596639Z U std::ostream& std::ostream::_M_insert(long)@GLIBCXX_3.4.9 2025-05-07T20:11:13.8596881Z U std::ostream& std::ostream::_M_insert(unsigned long)@GLIBCXX_3.4.9 2025-05-07T20:11:13.8597014Z U std::terminate()@GLIBCXX_3.4 2025-05-07T20:11:13.8597108Z U strlen@GLIBC_2.2.5 2025-05-07T20:11:13.8597239Z U torch::CppFunction::~CppFunction() 2025-05-07T20:11:13.8597859Z U torch::Library::Library(torch::Library::Kind, std::__cxx11::basic_string, std::allocator >, std::optional, char const*, unsigned int) 2025-05-07T20:11:13.8598327Z U torch::Library::_def(c10::FunctionSchema&&, c10::OperatorName*, std::vector > const&, torch::_RegisterOrVerify) & 2025-05-07T20:11:13.8598586Z U torch::Library::_impl(char const*, torch::CppFunction&&, torch::_RegisterOrVerify) & 2025-05-07T20:11:13.8598973Z U torch::jit::parseSchema(std::__cxx11::basic_string, std::allocator > const&, bool) 2025-05-07T20:11:13.8599189Z U unsigned char* at::TensorBase::mutable_data_ptr() const 2025-05-07T20:11:13.8599386Z U vtable for __cxxabiv1::__class_type_info@CXXABI_1.3 2025-05-07T20:11:13.8599573Z U vtable for __cxxabiv1::__function_type_info@CXXABI_1.3 2025-05-07T20:11:13.8599738Z U vtable for __cxxabiv1::__si_class_type_info@CXXABI_1.3 2025-05-07T20:11:13.8600086Z U vtable for std::__cxx11::basic_ostringstream, std::allocator >@GLIBCXX_3.4.21 2025-05-07T20:11:13.8600429Z U vtable for std::__cxx11::basic_stringbuf, std::allocator >@GLIBCXX_3.4.21 2025-05-07T20:11:13.8600636Z U vtable for std::basic_ios >@GLIBCXX_3.4 2025-05-07T20:11:13.8600862Z U vtable for std::basic_streambuf >@GLIBCXX_3.4 2025-05-07T20:11:13.8600994Z w _ITM_deregisterTMCloneTable 2025-05-07T20:11:13.8601106Z w _ITM_registerTMCloneTable 2025-05-07T20:11:13.8601218Z w __cxa_finalize@GLIBC_2.2.5 2025-05-07T20:11:13.8601325Z w __gmon_start__ 2025-05-07T20:11:13.8601426Z w __pthread_key_create 2025-05-07T20:11:13.8601543Z w pthread_mutex_lock@GLIBC_2.2.5 2025-05-07T20:11:13.8601677Z w pthread_mutex_unlock@GLIBC_2.2.5 2025-05-07T20:11:13.8601827Z [CHECK] Listing out external shared libraries linked: 2025-05-07T20:11:13.8602033Z + ldd ./_skbuild/linux-x86_64-3.13/cmake-build/fbgemm_gpu_tbe_cache.so 2025-05-07T20:11:13.8602040Z 2025-05-07T20:11:13.8602186Z linux-vdso.so.1 (0x00007ffd3d7ce000) 2025-05-07T20:11:13.8602284Z libtorch.so => not found 2025-05-07T20:11:13.8602380Z libc10.so => not found 2025-05-07T20:11:13.8602481Z libc10_cuda.so => not found 2025-05-07T20:11:13.8602602Z libtorch_cpu.so => not found 2025-05-07T20:11:13.8602700Z libtorch_cuda.so => not found 2025-05-07T20:11:13.8602798Z libcudart.so.12 => not found 2025-05-07T20:11:13.8603014Z libstdc++.so.6 => /lib64/libstdc++.so.6 (0x00007f624979c000) 2025-05-07T20:11:13.8603148Z libm.so.6 => /lib64/libm.so.6 (0x00007f62496c1000) 2025-05-07T20:11:13.8603327Z libgcc_s.so.1 => /lib64/libgcc_s.so.1 (0x00007f6249693000) 2025-05-07T20:11:13.8603478Z libc.so.6 => /lib64/libc.so.6 (0x00007f624948b000) 2025-05-07T20:11:13.8603623Z /lib64/ld-linux-x86-64.so.2 (0x00007f624b10b000) 2025-05-07T20:11:13.8603628Z 2025-05-07T20:11:13.8603737Z [CHECK] Displaying ELF information: 2025-05-07T20:11:13.8603969Z + readelf -d ./_skbuild/linux-x86_64-3.13/cmake-build/fbgemm_gpu_tbe_cache.so 2025-05-07T20:11:13.8603974Z 2025-05-07T20:11:13.8630537Z 2025-05-07T20:11:13.8631630Z Dynamic section at offset 0x14c3b48 contains 37 entries: 2025-05-07T20:11:13.8631992Z Tag Type Name/Value 2025-05-07T20:11:13.8632659Z 0x0000000000000001 (NEEDED) Shared library: [libtorch.so] 2025-05-07T20:11:13.8633219Z 0x0000000000000001 (NEEDED) Shared library: [libc10.so] 2025-05-07T20:11:13.8633823Z 0x0000000000000001 (NEEDED) Shared library: [libc10_cuda.so] 2025-05-07T20:11:13.8634419Z 0x0000000000000001 (NEEDED) Shared library: [libtorch_cpu.so] 2025-05-07T20:11:13.8635080Z 0x0000000000000001 (NEEDED) Shared library: [libtorch_cuda.so] 2025-05-07T20:11:13.8635284Z 0x0000000000000001 (NEEDED) Shared library: [libcudart.so.12] 2025-05-07T20:11:13.8635481Z 0x0000000000000001 (NEEDED) Shared library: [libstdc++.so.6] 2025-05-07T20:11:13.8635687Z 0x0000000000000001 (NEEDED) Shared library: [libm.so.6] 2025-05-07T20:11:13.8635884Z 0x0000000000000001 (NEEDED) Shared library: [libgcc_s.so.1] 2025-05-07T20:11:13.8636072Z 0x0000000000000001 (NEEDED) Shared library: [libc.so.6] 2025-05-07T20:11:13.8636316Z 0x000000000000000e (SONAME) Library soname: [fbgemm_gpu_tbe_cache.so] 2025-05-07T20:11:13.8636432Z 0x000000000000000c (INIT) 0x2a000 2025-05-07T20:11:13.8637379Z 0x000000000000000d (FINI) 0xe445c 2025-05-07T20:11:13.8637535Z 0x0000000000000019 (INIT_ARRAY) 0x14c31b0 2025-05-07T20:11:13.8637668Z 0x000000000000001b (INIT_ARRAYSZ) 208 (bytes) 2025-05-07T20:11:13.8637795Z 0x000000000000001a (FINI_ARRAY) 0x14c3280 2025-05-07T20:11:13.8637917Z 0x000000000000001c (FINI_ARRAYSZ) 8 (bytes) 2025-05-07T20:11:13.8638049Z 0x0000000000000004 (HASH) 0x200 2025-05-07T20:11:13.8638168Z 0x000000006ffffef5 (GNU_HASH) 0x1eb8 2025-05-07T20:11:13.8638282Z 0x0000000000000005 (STRTAB) 0x8730 2025-05-07T20:11:13.8638411Z 0x0000000000000006 (SYMTAB) 0x3b10 2025-05-07T20:11:13.8638550Z 0x000000000000000a (STRSZ) 113475 (bytes) 2025-05-07T20:11:13.8638674Z 0x000000000000000b (SYMENT) 24 (bytes) 2025-05-07T20:11:13.8638797Z 0x0000000000000003 (PLTGOT) 0x14c3de8 2025-05-07T20:11:13.8638950Z 0x0000000000000002 (PLTRELSZ) 8736 (bytes) 2025-05-07T20:11:13.8639067Z 0x0000000000000014 (PLTREL) RELA 2025-05-07T20:11:13.8639182Z 0x0000000000000017 (JMPREL) 0x27c90 2025-05-07T20:11:13.8639310Z 0x0000000000000007 (RELA) 0x249f0 2025-05-07T20:11:13.8639445Z 0x0000000000000008 (RELASZ) 12960 (bytes) 2025-05-07T20:11:13.8639569Z 0x0000000000000009 (RELAENT) 24 (bytes) 2025-05-07T20:11:13.8639693Z 0x0000000000000018 (BIND_NOW) 2025-05-07T20:11:13.8639823Z 0x000000006ffffffb (FLAGS_1) Flags: NOW 2025-05-07T20:11:13.8639942Z 0x000000006ffffffe (VERNEED) 0x248d0 2025-05-07T20:11:13.8640063Z 0x000000006fffffff (VERNEEDNUM) 5 2025-05-07T20:11:13.8640170Z 0x000000006ffffff0 (VERSYM) 0x24274 2025-05-07T20:11:13.8640271Z 0x000000006ffffff9 (RELACOUNT) 39 2025-05-07T20:11:13.8640371Z 0x0000000000000000 (NULL) 0x0 2025-05-07T20:11:13.8640377Z 2025-05-07T20:11:13.8640487Z ################################################################################ 2025-05-07T20:11:13.8640546Z 2025-05-07T20:11:13.8640551Z 2025-05-07T20:11:13.8640655Z ################################################################################ 2025-05-07T20:11:13.8641000Z [CHECK] BUILT LIBRARY: ./_skbuild/linux-x86_64-3.13/cmake-build/fbgemm_gpu_tbe_utils.so 2025-05-07T20:11:13.8641100Z [CHECK] Listing out library size: 2025-05-07T20:11:13.8641354Z + du -h --block-size=1M ./_skbuild/linux-x86_64-3.13/cmake-build/fbgemm_gpu_tbe_utils.so 2025-05-07T20:11:13.8641358Z 2025-05-07T20:11:13.8642334Z 17 ./_skbuild/linux-x86_64-3.13/cmake-build/fbgemm_gpu_tbe_utils.so 2025-05-07T20:11:13.8643435Z 2025-05-07T20:11:13.8644376Z [CHECK] Listing out the GLIBC versions referenced by: ./_skbuild/linux-x86_64-3.13/cmake-build/fbgemm_gpu_tbe_utils.so 2025-05-07T20:11:13.8644869Z + objdump -TC ./_skbuild/linux-x86_64-3.13/cmake-build/fbgemm_gpu_tbe_utils.so | grep GLIBC_ | sed 's/.*GLIBC_\([.0-9]*\).*/GLIBC_\1/g' | sort -Vu | cat 2025-05-07T20:11:13.8644887Z 2025-05-07T20:11:13.8705181Z GLIBC_2.2.5 2025-05-07T20:11:13.8705987Z GLIBC_2.14 2025-05-07T20:11:13.8706036Z 2025-05-07T20:11:13.8706058Z 2025-05-07T20:11:13.8707283Z [CHECK] Listing out the GLIBCXX versions referenced by: ./_skbuild/linux-x86_64-3.13/cmake-build/fbgemm_gpu_tbe_utils.so 2025-05-07T20:11:13.8708784Z + objdump -TC ./_skbuild/linux-x86_64-3.13/cmake-build/fbgemm_gpu_tbe_utils.so | grep GLIBCXX_ | sed 's/.*GLIBCXX_\([.0-9]*\).*/GLIBCXX_\1/g' | sort -Vu | cat 2025-05-07T20:11:13.8708799Z 2025-05-07T20:11:13.8766953Z GLIBCXX_3.4 2025-05-07T20:11:13.8767382Z GLIBCXX_3.4.9 2025-05-07T20:11:13.8767636Z GLIBCXX_3.4.20 2025-05-07T20:11:13.8767864Z GLIBCXX_3.4.21 2025-05-07T20:11:13.8768075Z GLIBCXX_3.4.29 2025-05-07T20:11:13.8768091Z 2025-05-07T20:11:13.8768104Z 2025-05-07T20:11:13.8788961Z + nm -gDC ./_skbuild/linux-x86_64-3.13/cmake-build/fbgemm_gpu_tbe_utils.so > /tmp/tmp.mKoXZO3aax.symbols.txt 2025-05-07T20:11:13.8789003Z 2025-05-07T20:11:13.8813215Z 2025-05-07T20:11:13.8838178Z [CHECK] Total Number of symbols: 469 2025-05-07T20:11:13.8851268Z [CHECK] Number of fbgemm symbols: 12 2025-05-07T20:11:13.8870699Z + nm -gDCu ./_skbuild/linux-x86_64-3.13/cmake-build/fbgemm_gpu_tbe_utils.so > /tmp/tmp.dsJHv1ZFKp.usymbols.txt 2025-05-07T20:11:13.8870760Z 2025-05-07T20:11:13.8887791Z 2025-05-07T20:11:13.8913818Z [CHECK] Listing out undefined symbols (155 total): 2025-05-07T20:11:13.8928032Z U VTT for std::__cxx11::basic_ostringstream, std::allocator >@GLIBCXX_3.4.21 2025-05-07T20:11:13.8928304Z U _Unwind_Resume@GCC_3.0 2025-05-07T20:11:13.8928659Z U __cudaPopCallConfiguration@libcudart.so.12 2025-05-07T20:11:13.8928816Z U __cudaPushCallConfiguration@libcudart.so.12 2025-05-07T20:11:13.8928972Z U __cudaRegisterFatBinary@libcudart.so.12 2025-05-07T20:11:13.8929116Z U __cudaRegisterFatBinaryEnd@libcudart.so.12 2025-05-07T20:11:13.8929263Z U __cudaRegisterFunction@libcudart.so.12 2025-05-07T20:11:13.8929424Z U __cudaRegisterVar@libcudart.so.12 2025-05-07T20:11:13.8929559Z U __cudaUnregisterFatBinary@libcudart.so.12 2025-05-07T20:11:13.8929684Z U __cxa_allocate_exception@CXXABI_1.3 2025-05-07T20:11:13.8929788Z U __cxa_atexit@GLIBC_2.2.5 2025-05-07T20:11:13.8929888Z U __cxa_begin_catch@CXXABI_1.3 2025-05-07T20:11:13.8929990Z U __cxa_end_catch@CXXABI_1.3 2025-05-07T20:11:13.8930086Z U __cxa_guard_abort@CXXABI_1.3 2025-05-07T20:11:13.8930285Z U __cxa_guard_acquire@CXXABI_1.3 2025-05-07T20:11:13.8930385Z U __cxa_guard_release@CXXABI_1.3 2025-05-07T20:11:13.8930481Z U __cxa_rethrow@CXXABI_1.3 2025-05-07T20:11:13.8930571Z U __cxa_throw@CXXABI_1.3 2025-05-07T20:11:13.8930674Z U __gxx_personality_v0@CXXABI_1.3 2025-05-07T20:11:13.8931089Z U at::_ops::add_Tensor::call(at::Tensor const&, at::Tensor const&, c10::Scalar const&) 2025-05-07T20:11:13.8931249Z U at::_ops::copy_::call(at::Tensor&, at::Tensor const&, bool) 2025-05-07T20:11:13.8931920Z U at::_ops::empty_like::call(at::Tensor const&, std::optional, std::optional, std::optional, std::optional, std::optional) 2025-05-07T20:11:13.8932567Z U at::_ops::empty_memory_format::call(c10::ArrayRef, std::optional, std::optional, std::optional, std::optional, std::optional) 2025-05-07T20:11:13.8932726Z U at::_ops::fill__Scalar::call(at::Tensor&, c10::Scalar const&) 2025-05-07T20:11:13.8932903Z U at::_ops::mul_Scalar::call(at::Tensor const&, c10::Scalar const&) 2025-05-07T20:11:13.8933072Z U at::_ops::select_int::call(at::Tensor const&, long, c10::SymInt) 2025-05-07T20:11:13.8933289Z U at::_ops::sub__Tensor::call(at::Tensor&, at::Tensor const&, c10::Scalar const&) 2025-05-07T20:11:13.8933408Z U at::_ops::zero_::call(at::Tensor&) 2025-05-07T20:11:13.8933981Z U at::_ops::zeros::call(c10::ArrayRef, std::optional, std::optional, std::optional, std::optional) 2025-05-07T20:11:13.8934510Z U at::_ops::zeros_like::call(at::Tensor const&, std::optional, std::optional, std::optional, std::optional, std::optional) 2025-05-07T20:11:13.8934603Z U c10::BoolType::get() 2025-05-07T20:11:13.8934747Z U c10::DeviceTypeName[abi:cxx11](c10::DeviceType, bool) 2025-05-07T20:11:13.8934877Z U c10::IValue::reportToTensorTypeError() const 2025-05-07T20:11:13.8934970Z U c10::IntType::get() 2025-05-07T20:11:13.8935153Z U c10::MessageLogger::MessageLogger(char const*, int, int) 2025-05-07T20:11:13.8935265Z U c10::MessageLogger::~MessageLogger() 2025-05-07T20:11:13.8935480Z U c10::OptionalType::get(c10::Type::SingletonOrSharedTypePtr) 2025-05-07T20:11:13.8935620Z U c10::StorageImpl::throw_data_ptr_access_error() const 2025-05-07T20:11:13.8935749Z U c10::SymBool::guard_bool(char const*, long) const 2025-05-07T20:11:13.8936158Z U c10::SymInt::SymInt(c10::intrusive_ptr >) 2025-05-07T20:11:13.8936279Z U c10::SymInt::guard_int(char const*, long) const 2025-05-07T20:11:13.8936383Z U c10::SymInt::promote_to_negative() 2025-05-07T20:11:13.8936489Z U c10::SymInt::toSymNode() const 2025-05-07T20:11:13.8936577Z U c10::SymIntType::get() 2025-05-07T20:11:13.8936717Z U c10::SymbolicShapeMeta::init_is_contiguous() const 2025-05-07T20:11:13.8936870Z U c10::TensorImpl::throw_data_ptr_access_error() const 2025-05-07T20:11:13.8936961Z U c10::TensorType::get() 2025-05-07T20:11:13.8937076Z U c10::UndefinedTensorImpl::_singleton 2025-05-07T20:11:13.8937745Z U c10::Warning::Warning(std::variant, c10::SourceLocation const&, std::__cxx11::basic_string, std::allocator >, bool) 2025-05-07T20:11:13.8937867Z U c10::cuda::CUDACachingAllocator::allocator 2025-05-07T20:11:13.8937975Z U c10::cuda::CUDAStream::stream() const 2025-05-07T20:11:13.8938100Z U c10::cuda::ExchangeDevice(signed char) 2025-05-07T20:11:13.8938206Z U c10::cuda::GetDevice(signed char*) 2025-05-07T20:11:13.8938311Z U c10::cuda::MaybeSetDevice(signed char) 2025-05-07T20:11:13.8938422Z U c10::cuda::SetDevice(signed char) 2025-05-07T20:11:13.8938683Z U c10::cuda::c10_cuda_check_implementation(int, char const*, char const*, int, bool) 2025-05-07T20:11:13.8938805Z U c10::cuda::current_device() 2025-05-07T20:11:13.8938937Z U c10::cuda::device_count() 2025-05-07T20:11:13.8939067Z U c10::cuda::getCurrentCUDAStream(signed char) 2025-05-07T20:11:13.8939191Z U c10::cuda::getDefaultCUDAStream(signed char) 2025-05-07T20:11:13.8939332Z U c10::cuda::getStreamFromPool(bool, signed char) 2025-05-07T20:11:13.8939456Z U c10::cuda::getStreamFromPool(int, signed char) 2025-05-07T20:11:13.8939598Z U c10::cuda::setCurrentCUDAStream(c10::cuda::CUDAStream) 2025-05-07T20:11:13.8939702Z U c10::cuda::warn_or_error_on_sync() 2025-05-07T20:11:13.8940175Z U c10::detail::infer_schema::make_function_schema(c10::ArrayRef, c10::ArrayRef) 2025-05-07T20:11:13.8940412Z U c10::detail::torchCheckFail(char const*, char const*, unsigned int, char const*) 2025-05-07T20:11:13.8940874Z U c10::detail::torchCheckFail(char const*, char const*, unsigned int, std::__cxx11::basic_string, std::allocator > const&) 2025-05-07T20:11:13.8941187Z U c10::detail::torchInternalAssertFail(char const*, char const*, unsigned int, char const*, char const*) 2025-05-07T20:11:13.8941728Z U c10::detail::torchInternalAssertFail(char const*, char const*, unsigned int, char const*, std::__cxx11::basic_string, std::allocator > const&) 2025-05-07T20:11:13.8941854Z U c10::impl::GPUTrace::gpuTraceState 2025-05-07T20:11:13.8941956Z U c10::impl::GPUTrace::haveState 2025-05-07T20:11:13.8942091Z U c10::impl::cow::is_cow_data_ptr(c10::DataPtr const&) 2025-05-07T20:11:13.8942290Z U c10::impl::cow::materialize_cow_storage(c10::StorageImpl&) 2025-05-07T20:11:13.8942397Z U c10::impl::device_guard_impl_registry 2025-05-07T20:11:13.8942531Z U c10::operator<<(std::ostream&, c10::Device const&) 2025-05-07T20:11:13.8942677Z U c10::operator<<(std::ostream&, c10::DeviceType) 2025-05-07T20:11:13.8942783Z U c10::throwNullDataPtrError() 2025-05-07T20:11:13.8942875Z U c10::warn(c10::Warning const&) 2025-05-07T20:11:13.8942994Z U c10::warnDeprecatedDataPtr() 2025-05-07T20:11:13.8943171Z U caffe2::TypeMeta::error_unsupported_typemeta(caffe2::TypeMeta) 2025-05-07T20:11:13.8943279Z U caffe2::TypeMeta::typeMetaDatas() 2025-05-07T20:11:13.8943413Z U cudaDeviceGetAttribute@libcudart.so.12 2025-05-07T20:11:13.8943531Z U cudaDeviceSynchronize@libcudart.so.12 2025-05-07T20:11:13.8943660Z U cudaEventCreateWithFlags@libcudart.so.12 2025-05-07T20:11:13.8943764Z U cudaEventDestroy@libcudart.so.12 2025-05-07T20:11:13.8943885Z U cudaEventElapsedTime@libcudart.so.12 2025-05-07T20:11:13.8943989Z U cudaEventQuery@libcudart.so.12 2025-05-07T20:11:13.8944091Z U cudaEventRecord@libcudart.so.12 2025-05-07T20:11:13.8944216Z U cudaEventSynchronize@libcudart.so.12 2025-05-07T20:11:13.8944326Z U cudaFuncGetAttributes@libcudart.so.12 2025-05-07T20:11:13.8944424Z U cudaGetDevice@libcudart.so.12 2025-05-07T20:11:13.8944546Z U cudaGetDeviceCount@libcudart.so.12 2025-05-07T20:11:13.8944677Z U cudaGetDeviceProperties_v2@libcudart.so.12 2025-05-07T20:11:13.8944792Z U cudaGetErrorString@libcudart.so.12 2025-05-07T20:11:13.8944896Z U cudaGetLastError@libcudart.so.12 2025-05-07T20:11:13.8945015Z U cudaLaunchKernel@libcudart.so.12 2025-05-07T20:11:13.8945144Z U cudaMemcpyAsync@libcudart.so.12 2025-05-07T20:11:13.8945263Z U cudaMemsetAsync@libcudart.so.12 2025-05-07T20:11:13.8945610Z U cudaOccupancyMaxActiveBlocksPerMultiprocessorWithFlags@libcudart.so.12 2025-05-07T20:11:13.8945722Z U cudaPeekAtLastError@libcudart.so.12 2025-05-07T20:11:13.8945824Z U cudaSetDevice@libcudart.so.12 2025-05-07T20:11:13.8945923Z U cudaStreamQuery@libcudart.so.12 2025-05-07T20:11:13.8946055Z U cudaStreamSynchronize@libcudart.so.12 2025-05-07T20:11:13.8946165Z U cudaStreamWaitEvent@libcudart.so.12 2025-05-07T20:11:13.8946280Z U int* at::TensorBase::data_ptr() const 2025-05-07T20:11:13.8946422Z U int* at::TensorBase::mutable_data_ptr() const 2025-05-07T20:11:13.8946513Z U log2@GLIBC_2.2.5 2025-05-07T20:11:13.8946678Z U long c10::detail::maybe_wrap_dim_slow(long, long, bool) 2025-05-07T20:11:13.8946812Z U long* at::TensorBase::data_ptr() const 2025-05-07T20:11:13.8946954Z U long* at::TensorBase::mutable_data_ptr() const 2025-05-07T20:11:13.8947047Z U memcpy@GLIBC_2.14 2025-05-07T20:11:13.8947147Z U memmove@GLIBC_2.2.5 2025-05-07T20:11:13.8947238Z U memset@GLIBC_2.2.5 2025-05-07T20:11:13.8947377Z U operator delete(void*, unsigned long)@CXXABI_1.3.9 2025-05-07T20:11:13.8947495Z U operator new(unsigned long)@GLIBCXX_3.4 2025-05-07T20:11:13.8947821Z U std::__cxx11::basic_ostringstream, std::allocator >::basic_ostringstream() 2025-05-07T20:11:13.8948181Z U std::__cxx11::basic_ostringstream, std::allocator >::~basic_ostringstream()@GLIBCXX_3.4.21 2025-05-07T20:11:13.8948531Z U std::__cxx11::basic_stringbuf, std::allocator >::str() const &@GLIBCXX_3.4.29 2025-05-07T20:11:13.8948638Z U std::__throw_bad_alloc()@GLIBCXX_3.4 2025-05-07T20:11:13.8948762Z U std::__throw_length_error(char const*)@GLIBCXX_3.4 2025-05-07T20:11:13.8948906Z U std::__throw_logic_error(char const*)@GLIBCXX_3.4 2025-05-07T20:11:13.8949061Z U std::__throw_out_of_range_fmt(char const*, ...)@GLIBCXX_3.4.20 2025-05-07T20:11:13.8949275Z U std::basic_ios >::clear(std::_Ios_Iostate)@GLIBCXX_3.4 2025-05-07T20:11:13.8949599Z U std::basic_ios >::init(std::basic_streambuf >*)@GLIBCXX_3.4 2025-05-07T20:11:13.8950125Z U std::basic_ostream >& std::__ostream_insert >(std::basic_ostream >&, char const*, long)@GLIBCXX_3.4.9 2025-05-07T20:11:13.8950244Z U std::exception::~exception()@GLIBCXX_3.4 2025-05-07T20:11:13.8950377Z U std::ios_base::Init::Init()@GLIBCXX_3.4 2025-05-07T20:11:13.8950493Z U std::ios_base::Init::~Init()@GLIBCXX_3.4 2025-05-07T20:11:13.8950598Z U std::ios_base::ios_base()@GLIBCXX_3.4 2025-05-07T20:11:13.8950728Z U std::ios_base::~ios_base()@GLIBCXX_3.4 2025-05-07T20:11:13.8950825Z U std::locale::locale()@GLIBCXX_3.4 2025-05-07T20:11:13.8950927Z U std::locale::~locale()@GLIBCXX_3.4 2025-05-07T20:11:13.8951107Z U std::ostream& std::ostream::_M_insert(long)@GLIBCXX_3.4.9 2025-05-07T20:11:13.8951324Z U std::ostream& std::ostream::_M_insert(unsigned long)@GLIBCXX_3.4.9 2025-05-07T20:11:13.8951441Z U std::ostream::operator<<(int)@GLIBCXX_3.4 2025-05-07T20:11:13.8951543Z U std::terminate()@GLIBCXX_3.4 2025-05-07T20:11:13.8951664Z U strlen@GLIBC_2.2.5 2025-05-07T20:11:13.8951780Z U torch::CppFunction::~CppFunction() 2025-05-07T20:11:13.8952361Z U torch::Library::Library(torch::Library::Kind, std::__cxx11::basic_string, std::allocator >, std::optional, char const*, unsigned int) 2025-05-07T20:11:13.8952799Z U torch::Library::_def(c10::FunctionSchema&&, c10::OperatorName*, std::vector > const&, torch::_RegisterOrVerify) & 2025-05-07T20:11:13.8953041Z U torch::Library::_impl(char const*, torch::CppFunction&&, torch::_RegisterOrVerify) & 2025-05-07T20:11:13.8953390Z U torch::jit::parseSchema(std::__cxx11::basic_string, std::allocator > const&, bool) 2025-05-07T20:11:13.8953504Z U typeinfo for std::exception@GLIBCXX_3.4 2025-05-07T20:11:13.8953649Z U vtable for __cxxabiv1::__class_type_info@CXXABI_1.3 2025-05-07T20:11:13.8953806Z U vtable for __cxxabiv1::__function_type_info@CXXABI_1.3 2025-05-07T20:11:13.8953951Z U vtable for __cxxabiv1::__si_class_type_info@CXXABI_1.3 2025-05-07T20:11:13.8954276Z U vtable for std::__cxx11::basic_ostringstream, std::allocator >@GLIBCXX_3.4.21 2025-05-07T20:11:13.8954593Z U vtable for std::__cxx11::basic_stringbuf, std::allocator >@GLIBCXX_3.4.21 2025-05-07T20:11:13.8954776Z U vtable for std::basic_ios >@GLIBCXX_3.4 2025-05-07T20:11:13.8954988Z U vtable for std::basic_streambuf >@GLIBCXX_3.4 2025-05-07T20:11:13.8955104Z w _ITM_deregisterTMCloneTable 2025-05-07T20:11:13.8955204Z w _ITM_registerTMCloneTable 2025-05-07T20:11:13.8955318Z w __cxa_finalize@GLIBC_2.2.5 2025-05-07T20:11:13.8955402Z w __gmon_start__ 2025-05-07T20:11:13.8955508Z w __pthread_key_create 2025-05-07T20:11:13.8955643Z [CHECK] Listing out external shared libraries linked: 2025-05-07T20:11:13.8955825Z + ldd ./_skbuild/linux-x86_64-3.13/cmake-build/fbgemm_gpu_tbe_utils.so 2025-05-07T20:11:13.8955832Z 2025-05-07T20:11:13.8969191Z linux-vdso.so.1 (0x00007ffe20739000) 2025-05-07T20:11:13.8969342Z libtorch.so => not found 2025-05-07T20:11:13.8969454Z libc10.so => not found 2025-05-07T20:11:13.8969586Z libc10_cuda.so => not found 2025-05-07T20:11:13.8969724Z libtorch_cpu.so => not found 2025-05-07T20:11:13.8969819Z libtorch_cuda.so => not found 2025-05-07T20:11:13.8969928Z libcudart.so.12 => not found 2025-05-07T20:11:13.8970271Z libstdc++.so.6 => /lib64/libstdc++.so.6 (0x00007f9d4f19c000) 2025-05-07T20:11:13.8970425Z libm.so.6 => /lib64/libm.so.6 (0x00007f9d4f0c1000) 2025-05-07T20:11:13.8970589Z libgcc_s.so.1 => /lib64/libgcc_s.so.1 (0x00007f9d505af000) 2025-05-07T20:11:13.8970729Z libc.so.6 => /lib64/libc.so.6 (0x00007f9d4eeb9000) 2025-05-07T20:11:13.8970867Z /lib64/ld-linux-x86-64.so.2 (0x00007f9d505e3000) 2025-05-07T20:11:13.8970885Z 2025-05-07T20:11:13.8971015Z [CHECK] Displaying ELF information: 2025-05-07T20:11:13.8971319Z + readelf -d ./_skbuild/linux-x86_64-3.13/cmake-build/fbgemm_gpu_tbe_utils.so 2025-05-07T20:11:13.8971325Z 2025-05-07T20:11:13.9001888Z 2025-05-07T20:11:13.9002615Z Dynamic section at offset 0x106d2d0 contains 37 entries: 2025-05-07T20:11:13.9003015Z Tag Type Name/Value 2025-05-07T20:11:13.9003677Z 0x0000000000000001 (NEEDED) Shared library: [libtorch.so] 2025-05-07T20:11:13.9004238Z 0x0000000000000001 (NEEDED) Shared library: [libc10.so] 2025-05-07T20:11:13.9004848Z 0x0000000000000001 (NEEDED) Shared library: [libc10_cuda.so] 2025-05-07T20:11:13.9005442Z 0x0000000000000001 (NEEDED) Shared library: [libtorch_cpu.so] 2025-05-07T20:11:13.9006242Z 0x0000000000000001 (NEEDED) Shared library: [libtorch_cuda.so] 2025-05-07T20:11:13.9006872Z 0x0000000000000001 (NEEDED) Shared library: [libcudart.so.12] 2025-05-07T20:11:13.9007604Z 0x0000000000000001 (NEEDED) Shared library: [libstdc++.so.6] 2025-05-07T20:11:13.9008158Z 0x0000000000000001 (NEEDED) Shared library: [libm.so.6] 2025-05-07T20:11:13.9008751Z 0x0000000000000001 (NEEDED) Shared library: [libgcc_s.so.1] 2025-05-07T20:11:13.9009301Z 0x0000000000000001 (NEEDED) Shared library: [libc.so.6] 2025-05-07T20:11:13.9009521Z 0x000000000000000e (SONAME) Library soname: [fbgemm_gpu_tbe_utils.so] 2025-05-07T20:11:13.9009646Z 0x000000000000000c (INIT) 0x12000 2025-05-07T20:11:13.9009791Z 0x000000000000000d (FINI) 0xa2d3c 2025-05-07T20:11:13.9009922Z 0x0000000000000019 (INIT_ARRAY) 0x106de30 2025-05-07T20:11:13.9010055Z 0x000000000000001b (INIT_ARRAYSZ) 96 (bytes) 2025-05-07T20:11:13.9010317Z 0x000000000000001a (FINI_ARRAY) 0x106de90 2025-05-07T20:11:13.9010451Z 0x000000000000001c (FINI_ARRAYSZ) 8 (bytes) 2025-05-07T20:11:13.9010571Z 0x0000000000000004 (HASH) 0x200 2025-05-07T20:11:13.9010724Z 0x000000006ffffef5 (GNU_HASH) 0x1640 2025-05-07T20:11:13.9010842Z 0x0000000000000005 (STRTAB) 0x51f0 2025-05-07T20:11:13.9010958Z 0x0000000000000006 (SYMTAB) 0x25e0 2025-05-07T20:11:13.9011099Z 0x000000000000000a (STRSZ) 38760 (bytes) 2025-05-07T20:11:13.9011252Z 0x000000000000000b (SYMENT) 24 (bytes) 2025-05-07T20:11:13.9011424Z 0x0000000000000003 (PLTGOT) 0x106e570 2025-05-07T20:11:13.9011563Z 0x0000000000000002 (PLTRELSZ) 5376 (bytes) 2025-05-07T20:11:13.9011703Z 0x0000000000000014 (PLTREL) RELA 2025-05-07T20:11:13.9011819Z 0x0000000000000017 (JMPREL) 0x10600 2025-05-07T20:11:13.9011930Z 0x0000000000000007 (RELA) 0xee18 2025-05-07T20:11:13.9012101Z 0x0000000000000008 (RELASZ) 6120 (bytes) 2025-05-07T20:11:13.9012253Z 0x0000000000000009 (RELAENT) 24 (bytes) 2025-05-07T20:11:13.9012362Z 0x0000000000000018 (BIND_NOW) 2025-05-07T20:11:13.9012500Z 0x000000006ffffffb (FLAGS_1) Flags: NOW 2025-05-07T20:11:13.9012646Z 0x000000006ffffffe (VERNEED) 0xed08 2025-05-07T20:11:13.9012764Z 0x000000006fffffff (VERNEEDNUM) 5 2025-05-07T20:11:13.9012887Z 0x000000006ffffff0 (VERSYM) 0xe958 2025-05-07T20:11:13.9013029Z 0x000000006ffffff9 (RELACOUNT) 26 2025-05-07T20:11:13.9013135Z 0x0000000000000000 (NULL) 0x0 2025-05-07T20:11:13.9013140Z 2025-05-07T20:11:13.9013262Z ################################################################################ 2025-05-07T20:11:13.9013267Z 2025-05-07T20:11:13.9013271Z 2025-05-07T20:11:13.9013408Z ################################################################################ 2025-05-07T20:11:13.9013726Z [CHECK] BUILT LIBRARY: ./_skbuild/linux-x86_64-3.13/cmake-build/fbgemm_gpu_sparse_async_cumsum.so 2025-05-07T20:11:13.9013840Z [CHECK] Listing out library size: 2025-05-07T20:11:13.9014160Z + du -h --block-size=1M ./_skbuild/linux-x86_64-3.13/cmake-build/fbgemm_gpu_sparse_async_cumsum.so 2025-05-07T20:11:13.9014164Z 2025-05-07T20:11:13.9014749Z 2 ./_skbuild/linux-x86_64-3.13/cmake-build/fbgemm_gpu_sparse_async_cumsum.so 2025-05-07T20:11:13.9015794Z 2025-05-07T20:11:13.9016629Z [CHECK] Listing out the GLIBC versions referenced by: ./_skbuild/linux-x86_64-3.13/cmake-build/fbgemm_gpu_sparse_async_cumsum.so 2025-05-07T20:11:13.9017182Z + objdump -TC ./_skbuild/linux-x86_64-3.13/cmake-build/fbgemm_gpu_sparse_async_cumsum.so | grep GLIBC_ | sed 's/.*GLIBC_\([.0-9]*\).*/GLIBC_\1/g' | sort -Vu | cat 2025-05-07T20:11:13.9017188Z 2025-05-07T20:11:13.9070668Z GLIBC_2.2.5 2025-05-07T20:11:13.9071056Z GLIBC_2.3 2025-05-07T20:11:13.9071339Z GLIBC_2.14 2025-05-07T20:11:13.9071378Z 2025-05-07T20:11:13.9071391Z 2025-05-07T20:11:13.9072978Z [CHECK] Listing out the GLIBCXX versions referenced by: ./_skbuild/linux-x86_64-3.13/cmake-build/fbgemm_gpu_sparse_async_cumsum.so 2025-05-07T20:11:13.9074829Z + objdump -TC ./_skbuild/linux-x86_64-3.13/cmake-build/fbgemm_gpu_sparse_async_cumsum.so | grep GLIBCXX_ | sed 's/.*GLIBCXX_\([.0-9]*\).*/GLIBCXX_\1/g' | sort -Vu | cat 2025-05-07T20:11:13.9074849Z 2025-05-07T20:11:13.9127260Z GLIBCXX_3.4 2025-05-07T20:11:13.9128074Z GLIBCXX_3.4.9 2025-05-07T20:11:13.9128273Z GLIBCXX_3.4.21 2025-05-07T20:11:13.9128361Z GLIBCXX_3.4.29 2025-05-07T20:11:13.9128369Z 2025-05-07T20:11:13.9128376Z 2025-05-07T20:11:13.9148663Z + nm -gDC ./_skbuild/linux-x86_64-3.13/cmake-build/fbgemm_gpu_sparse_async_cumsum.so > /tmp/tmp.NffQrJjiZ0.symbols.txt 2025-05-07T20:11:13.9148883Z 2025-05-07T20:11:13.9169519Z 2025-05-07T20:11:13.9195770Z [CHECK] Total Number of symbols: 326 2025-05-07T20:11:13.9208059Z [CHECK] Number of fbgemm symbols: 56 2025-05-07T20:11:13.9227469Z + nm -gDCu ./_skbuild/linux-x86_64-3.13/cmake-build/fbgemm_gpu_sparse_async_cumsum.so > /tmp/tmp.WyIo1AhR3T.usymbols.txt 2025-05-07T20:11:13.9227573Z 2025-05-07T20:11:13.9240342Z 2025-05-07T20:11:13.9266888Z [CHECK] Listing out undefined symbols (143 total): 2025-05-07T20:11:13.9282479Z U GOMP_parallel 2025-05-07T20:11:13.9282952Z U VTT for std::__cxx11::basic_ostringstream, std::allocator >@GLIBCXX_3.4.21 2025-05-07T20:11:13.9283069Z U _Unwind_Resume@GCC_3.0 2025-05-07T20:11:13.9283246Z U __cudaPopCallConfiguration@libcudart.so.12 2025-05-07T20:11:13.9283425Z U __cudaPushCallConfiguration@libcudart.so.12 2025-05-07T20:11:13.9283591Z U __cudaRegisterFatBinary@libcudart.so.12 2025-05-07T20:11:13.9283740Z U __cudaRegisterFatBinaryEnd@libcudart.so.12 2025-05-07T20:11:13.9283874Z U __cudaRegisterFunction@libcudart.so.12 2025-05-07T20:11:13.9284160Z U __cudaRegisterVar@libcudart.so.12 2025-05-07T20:11:13.9284306Z U __cudaUnregisterFatBinary@libcudart.so.12 2025-05-07T20:11:13.9284430Z U __cxa_atexit@GLIBC_2.2.5 2025-05-07T20:11:13.9284535Z U __cxa_begin_catch@CXXABI_1.3 2025-05-07T20:11:13.9284642Z U __cxa_end_catch@CXXABI_1.3 2025-05-07T20:11:13.9284756Z U __cxa_guard_abort@CXXABI_1.3 2025-05-07T20:11:13.9284861Z U __cxa_guard_acquire@CXXABI_1.3 2025-05-07T20:11:13.9284963Z U __cxa_guard_release@CXXABI_1.3 2025-05-07T20:11:13.9285075Z U __gxx_personality_v0@CXXABI_1.3 2025-05-07T20:11:13.9285183Z U __tls_get_addr@GLIBC_2.3 2025-05-07T20:11:13.9285377Z U at::_ops::contiguous::call(at::Tensor const&, c10::MemoryFormat) 2025-05-07T20:11:13.9285986Z U at::_ops::empty_like::call(at::Tensor const&, std::optional, std::optional, std::optional, std::optional, std::optional) 2025-05-07T20:11:13.9286637Z U at::_ops::empty_memory_format::call(c10::ArrayRef, std::optional, std::optional, std::optional, std::optional, std::optional) 2025-05-07T20:11:13.9286816Z U at::_ops::select_int::call(at::Tensor const&, long, c10::SymInt) 2025-05-07T20:11:13.9286939Z U at::_ops::zero_::call(at::Tensor&) 2025-05-07T20:11:13.9287415Z U at::_ops::zeros::call(c10::ArrayRef, std::optional, std::optional, std::optional, std::optional) 2025-05-07T20:11:13.9287515Z U at::get_num_threads() 2025-05-07T20:11:13.9287623Z U at::get_thread_num() 2025-05-07T20:11:13.9287725Z U at::in_parallel_region() 2025-05-07T20:11:13.9287824Z U at::init_num_threads() 2025-05-07T20:11:13.9287952Z U at::internal::set_thread_num(int) 2025-05-07T20:11:13.9288668Z U at::native::empty_like(at::Tensor const&, std::optional, std::optional, std::optional, std::optional, std::optional) 2025-05-07T20:11:13.9289013Z U at::native::resize_(at::Tensor const&, c10::ArrayRef, std::optional) 2025-05-07T20:11:13.9289208Z U c10::BFloat16* at::TensorBase::data_ptr() const 2025-05-07T20:11:13.9289372Z U c10::DeviceTypeName[abi:cxx11](c10::DeviceType, bool) 2025-05-07T20:11:13.9289524Z U c10::Half* at::TensorBase::data_ptr() const 2025-05-07T20:11:13.9289682Z U c10::IValue::reportToTensorTypeError() const 2025-05-07T20:11:13.9289818Z U c10::ParallelGuard::ParallelGuard(bool) 2025-05-07T20:11:13.9289942Z U c10::ParallelGuard::~ParallelGuard() 2025-05-07T20:11:13.9290120Z U c10::StorageImpl::throw_data_ptr_access_error() const 2025-05-07T20:11:13.9290375Z U c10::SymBool::guard_bool(char const*, long) const 2025-05-07T20:11:13.9290537Z U c10::SymbolicShapeMeta::init_is_contiguous() const 2025-05-07T20:11:13.9290704Z U c10::TensorImpl::throw_data_ptr_access_error() const 2025-05-07T20:11:13.9290807Z U c10::TensorType::get() 2025-05-07T20:11:13.9290932Z U c10::UndefinedTensorImpl::_singleton 2025-05-07T20:11:13.9291661Z U c10::Warning::Warning(std::variant, c10::SourceLocation const&, std::__cxx11::basic_string, std::allocator >, bool) 2025-05-07T20:11:13.9291795Z U c10::cuda::CUDACachingAllocator::allocator 2025-05-07T20:11:13.9291912Z U c10::cuda::CUDAStream::stream() const 2025-05-07T20:11:13.9292086Z U c10::cuda::ExchangeDevice(signed char) 2025-05-07T20:11:13.9292202Z U c10::cuda::GetDevice(signed char*) 2025-05-07T20:11:13.9292323Z U c10::cuda::MaybeSetDevice(signed char) 2025-05-07T20:11:13.9292436Z U c10::cuda::SetDevice(signed char) 2025-05-07T20:11:13.9292695Z U c10::cuda::c10_cuda_check_implementation(int, char const*, char const*, int, bool) 2025-05-07T20:11:13.9292796Z U c10::cuda::device_count() 2025-05-07T20:11:13.9292934Z U c10::cuda::getCurrentCUDAStream(signed char) 2025-05-07T20:11:13.9293079Z U c10::cuda::getDefaultCUDAStream(signed char) 2025-05-07T20:11:13.9293220Z U c10::cuda::getStreamFromPool(bool, signed char) 2025-05-07T20:11:13.9293359Z U c10::cuda::getStreamFromPool(int, signed char) 2025-05-07T20:11:13.9293533Z U c10::cuda::setCurrentCUDAStream(c10::cuda::CUDAStream) 2025-05-07T20:11:13.9293646Z U c10::cuda::warn_or_error_on_sync() 2025-05-07T20:11:13.9294161Z U c10::detail::infer_schema::make_function_schema(c10::ArrayRef, c10::ArrayRef) 2025-05-07T20:11:13.9294433Z U c10::detail::torchCheckFail(char const*, char const*, unsigned int, char const*) 2025-05-07T20:11:13.9294924Z U c10::detail::torchCheckFail(char const*, char const*, unsigned int, std::__cxx11::basic_string, std::allocator > const&) 2025-05-07T20:11:13.9295264Z U c10::detail::torchInternalAssertFail(char const*, char const*, unsigned int, char const*, char const*) 2025-05-07T20:11:13.9295395Z U c10::impl::GPUTrace::gpuTraceState 2025-05-07T20:11:13.9295503Z U c10::impl::GPUTrace::haveState 2025-05-07T20:11:13.9295652Z U c10::impl::cow::is_cow_data_ptr(c10::DataPtr const&) 2025-05-07T20:11:13.9295869Z U c10::impl::cow::materialize_cow_storage(c10::StorageImpl&) 2025-05-07T20:11:13.9295989Z U c10::impl::device_guard_impl_registry 2025-05-07T20:11:13.9296163Z U c10::operator<<(std::ostream&, c10::Device const&) 2025-05-07T20:11:13.9296344Z U c10::operator<<(std::ostream&, c10::DeviceType) 2025-05-07T20:11:13.9296458Z U c10::throwNullDataPtrError() 2025-05-07T20:11:13.9296561Z U c10::warn(c10::Warning const&) 2025-05-07T20:11:13.9296685Z U c10::warnDeprecatedDataPtr() 2025-05-07T20:11:13.9296878Z U caffe2::TypeMeta::error_unsupported_typemeta(caffe2::TypeMeta) 2025-05-07T20:11:13.9296994Z U caffe2::TypeMeta::typeMetaDatas() 2025-05-07T20:11:13.9297160Z U cudaDeviceGetAttribute@libcudart.so.12 2025-05-07T20:11:13.9297286Z U cudaDeviceSynchronize@libcudart.so.12 2025-05-07T20:11:13.9297419Z U cudaEventCreateWithFlags@libcudart.so.12 2025-05-07T20:11:13.9297550Z U cudaEventDestroy@libcudart.so.12 2025-05-07T20:11:13.9297674Z U cudaEventElapsedTime@libcudart.so.12 2025-05-07T20:11:13.9297787Z U cudaEventQuery@libcudart.so.12 2025-05-07T20:11:13.9297918Z U cudaEventRecord@libcudart.so.12 2025-05-07T20:11:13.9298041Z U cudaEventSynchronize@libcudart.so.12 2025-05-07T20:11:13.9298166Z U cudaFuncGetAttributes@libcudart.so.12 2025-05-07T20:11:13.9298287Z U cudaGetDevice@libcudart.so.12 2025-05-07T20:11:13.9298405Z U cudaGetDeviceCount@libcudart.so.12 2025-05-07T20:11:13.9298523Z U cudaGetErrorString@libcudart.so.12 2025-05-07T20:11:13.9298642Z U cudaGetLastError@libcudart.so.12 2025-05-07T20:11:13.9298778Z U cudaLaunchKernel@libcudart.so.12 2025-05-07T20:11:13.9299074Z U cudaOccupancyMaxActiveBlocksPerMultiprocessorWithFlags@libcudart.so.12 2025-05-07T20:11:13.9299233Z U cudaPeekAtLastError@libcudart.so.12 2025-05-07T20:11:13.9299355Z U cudaSetDevice@libcudart.so.12 2025-05-07T20:11:13.9299470Z U cudaStreamQuery@libcudart.so.12 2025-05-07T20:11:13.9299600Z U cudaStreamSynchronize@libcudart.so.12 2025-05-07T20:11:13.9299734Z U cudaStreamWaitEvent@libcudart.so.12 2025-05-07T20:11:13.9299876Z U float* at::TensorBase::data_ptr() const 2025-05-07T20:11:13.9300003Z U int* at::TensorBase::data_ptr() const 2025-05-07T20:11:13.9300180Z U long c10::detail::maybe_wrap_dim_slow(long, long, bool) 2025-05-07T20:11:13.9300325Z U long* at::TensorBase::data_ptr() const 2025-05-07T20:11:13.9300418Z U memcpy@GLIBC_2.14 2025-05-07T20:11:13.9300512Z U memmove@GLIBC_2.2.5 2025-05-07T20:11:13.9300617Z U memset@GLIBC_2.2.5 2025-05-07T20:11:13.9300710Z U omp_get_num_threads 2025-05-07T20:11:13.9300802Z U omp_get_thread_num 2025-05-07T20:11:13.9300967Z U operator delete(void*, unsigned long)@CXXABI_1.3.9 2025-05-07T20:11:13.9301090Z U operator new(unsigned long)@GLIBCXX_3.4 2025-05-07T20:11:13.9301435Z U std::__cxx11::basic_ostringstream, std::allocator >::basic_ostringstream() 2025-05-07T20:11:13.9301836Z U std::__cxx11::basic_ostringstream, std::allocator >::~basic_ostringstream()@GLIBCXX_3.4.21 2025-05-07T20:11:13.9302170Z U std::__cxx11::basic_stringbuf, std::allocator >::str() const &@GLIBCXX_3.4.29 2025-05-07T20:11:13.9302311Z U std::__exception_ptr::exception_ptr::_M_addref() 2025-05-07T20:11:13.9302465Z U std::__exception_ptr::exception_ptr::_M_release() 2025-05-07T20:11:13.9302581Z U std::__throw_bad_alloc()@GLIBCXX_3.4 2025-05-07T20:11:13.9302854Z U std::__throw_length_error(char const*)@GLIBCXX_3.4 2025-05-07T20:11:13.9303024Z U std::__throw_logic_error(char const*)@GLIBCXX_3.4 2025-05-07T20:11:13.9303280Z U std::basic_ios >::clear(std::_Ios_Iostate)@GLIBCXX_3.4 2025-05-07T20:11:13.9303618Z U std::basic_ios >::init(std::basic_streambuf >*)@GLIBCXX_3.4 2025-05-07T20:11:13.9304211Z U std::basic_ostream >& std::__ostream_insert >(std::basic_ostream >&, char const*, long)@GLIBCXX_3.4.9 2025-05-07T20:11:13.9304335Z U std::current_exception()@CXXABI_1.3.3 2025-05-07T20:11:13.9304453Z U std::ios_base::Init::Init()@GLIBCXX_3.4 2025-05-07T20:11:13.9304591Z U std::ios_base::Init::~Init()@GLIBCXX_3.4 2025-05-07T20:11:13.9304707Z U std::ios_base::ios_base()@GLIBCXX_3.4 2025-05-07T20:11:13.9304991Z U std::ios_base::~ios_base()@GLIBCXX_3.4 2025-05-07T20:11:13.9305127Z U std::locale::locale()@GLIBCXX_3.4 2025-05-07T20:11:13.9305242Z U std::locale::~locale()@GLIBCXX_3.4 2025-05-07T20:11:13.9305430Z U std::ostream& std::ostream::_M_insert(long)@GLIBCXX_3.4.9 2025-05-07T20:11:13.9305672Z U std::rethrow_exception(std::__exception_ptr::exception_ptr)@CXXABI_1.3.3 2025-05-07T20:11:13.9305771Z U strlen@GLIBC_2.2.5 2025-05-07T20:11:13.9305897Z U torch::CppFunction::~CppFunction() 2025-05-07T20:11:13.9306503Z U torch::Library::Library(torch::Library::Kind, std::__cxx11::basic_string, std::allocator >, std::optional, char const*, unsigned int) 2025-05-07T20:11:13.9306996Z U torch::Library::_def(c10::FunctionSchema&&, c10::OperatorName*, std::vector > const&, torch::_RegisterOrVerify) & 2025-05-07T20:11:13.9307262Z U torch::Library::_impl(char const*, torch::CppFunction&&, torch::_RegisterOrVerify) & 2025-05-07T20:11:13.9307642Z U torch::jit::parseSchema(std::__cxx11::basic_string, std::allocator > const&, bool) 2025-05-07T20:11:13.9307797Z U vtable for __cxxabiv1::__class_type_info@CXXABI_1.3 2025-05-07T20:11:13.9307961Z U vtable for __cxxabiv1::__function_type_info@CXXABI_1.3 2025-05-07T20:11:13.9308140Z U vtable for __cxxabiv1::__si_class_type_info@CXXABI_1.3 2025-05-07T20:11:13.9308490Z U vtable for std::__cxx11::basic_ostringstream, std::allocator >@GLIBCXX_3.4.21 2025-05-07T20:11:13.9308820Z U vtable for std::__cxx11::basic_stringbuf, std::allocator >@GLIBCXX_3.4.21 2025-05-07T20:11:13.9309046Z U vtable for std::basic_ios >@GLIBCXX_3.4 2025-05-07T20:11:13.9309276Z U vtable for std::basic_streambuf >@GLIBCXX_3.4 2025-05-07T20:11:13.9309393Z w _ITM_deregisterTMCloneTable 2025-05-07T20:11:13.9309522Z w _ITM_registerTMCloneTable 2025-05-07T20:11:13.9309627Z w __cxa_finalize@GLIBC_2.2.5 2025-05-07T20:11:13.9309717Z w __gmon_start__ 2025-05-07T20:11:13.9309833Z w __pthread_key_create 2025-05-07T20:11:13.9309985Z [CHECK] Listing out external shared libraries linked: 2025-05-07T20:11:13.9310228Z + ldd ./_skbuild/linux-x86_64-3.13/cmake-build/fbgemm_gpu_sparse_async_cumsum.so 2025-05-07T20:11:13.9310235Z 2025-05-07T20:11:13.9323826Z linux-vdso.so.1 (0x00007ffc02f99000) 2025-05-07T20:11:13.9323995Z libc10.so => not found 2025-05-07T20:11:13.9324122Z libc10_cuda.so => not found 2025-05-07T20:11:13.9324323Z libtorch.so => not found 2025-05-07T20:11:13.9324487Z libtorch_cpu.so => not found 2025-05-07T20:11:13.9324655Z libtorch_cuda.so => not found 2025-05-07T20:11:13.9324853Z libcudart.so.12 => not found 2025-05-07T20:11:13.9326411Z libstdc++.so.6 => /lib64/libstdc++.so.6 (0x00007f4c073e7000) 2025-05-07T20:11:13.9326586Z libgcc_s.so.1 => /lib64/libgcc_s.so.1 (0x00007f4c073b9000) 2025-05-07T20:11:13.9326717Z libc.so.6 => /lib64/libc.so.6 (0x00007f4c071b1000) 2025-05-07T20:11:13.9326865Z /lib64/ld-linux-x86-64.so.2 (0x00007f4c07801000) 2025-05-07T20:11:13.9326993Z libm.so.6 => /lib64/libm.so.6 (0x00007f4c070d6000) 2025-05-07T20:11:13.9327000Z 2025-05-07T20:11:13.9327108Z [CHECK] Displaying ELF information: 2025-05-07T20:11:13.9327400Z + readelf -d ./_skbuild/linux-x86_64-3.13/cmake-build/fbgemm_gpu_sparse_async_cumsum.so 2025-05-07T20:11:13.9327406Z 2025-05-07T20:11:13.9357828Z 2025-05-07T20:11:13.9358121Z Dynamic section at offset 0x179670 contains 38 entries: 2025-05-07T20:11:13.9358255Z Tag Type Name/Value 2025-05-07T20:11:13.9358524Z 0x0000000000000001 (NEEDED) Shared library: [libc10.so] 2025-05-07T20:11:13.9360083Z 0x0000000000000001 (NEEDED) Shared library: [libc10_cuda.so] 2025-05-07T20:11:13.9360681Z 0x0000000000000001 (NEEDED) Shared library: [libtorch.so] 2025-05-07T20:11:13.9361278Z 0x0000000000000001 (NEEDED) Shared library: [libtorch_cpu.so] 2025-05-07T20:11:13.9361880Z 0x0000000000000001 (NEEDED) Shared library: [libtorch_cuda.so] 2025-05-07T20:11:13.9362454Z 0x0000000000000001 (NEEDED) Shared library: [libcudart.so.12] 2025-05-07T20:11:13.9363022Z 0x0000000000000001 (NEEDED) Shared library: [libstdc++.so.6] 2025-05-07T20:11:13.9363590Z 0x0000000000000001 (NEEDED) Shared library: [libgcc_s.so.1] 2025-05-07T20:11:13.9364127Z 0x0000000000000001 (NEEDED) Shared library: [libc.so.6] 2025-05-07T20:11:13.9365150Z 0x0000000000000001 (NEEDED) Shared library: [ld-linux-x86-64.so.2] 2025-05-07T20:11:13.9365424Z 0x000000000000000e (SONAME) Library soname: [fbgemm_gpu_sparse_async_cumsum.so] 2025-05-07T20:11:13.9365600Z 0x000000000000000f (RPATH) Library rpath: [$ORIGIN] 2025-05-07T20:11:13.9365709Z 0x000000000000000c (INIT) 0xc000 2025-05-07T20:11:13.9365832Z 0x000000000000000d (FINI) 0x237dc 2025-05-07T20:11:13.9365940Z 0x0000000000000019 (INIT_ARRAY) 0x1792c0 2025-05-07T20:11:13.9366052Z 0x000000000000001b (INIT_ARRAYSZ) 32 (bytes) 2025-05-07T20:11:13.9366184Z 0x000000000000001a (FINI_ARRAY) 0x1792e0 2025-05-07T20:11:13.9366301Z 0x000000000000001c (FINI_ARRAYSZ) 8 (bytes) 2025-05-07T20:11:13.9366406Z 0x0000000000000004 (HASH) 0x238 2025-05-07T20:11:13.9366515Z 0x000000006ffffef5 (GNU_HASH) 0x10f8 2025-05-07T20:11:13.9366633Z 0x0000000000000005 (STRTAB) 0x38a8 2025-05-07T20:11:13.9366733Z 0x0000000000000006 (SYMTAB) 0x1a00 2025-05-07T20:11:13.9366862Z 0x000000000000000a (STRSZ) 24404 (bytes) 2025-05-07T20:11:13.9366987Z 0x000000000000000b (SYMENT) 24 (bytes) 2025-05-07T20:11:13.9367103Z 0x0000000000000003 (PLTGOT) 0x179910 2025-05-07T20:11:13.9367226Z 0x0000000000000002 (PLTRELSZ) 3864 (bytes) 2025-05-07T20:11:13.9367347Z 0x0000000000000014 (PLTREL) RELA 2025-05-07T20:11:13.9367453Z 0x0000000000000017 (JMPREL) 0xaba8 2025-05-07T20:11:13.9367550Z 0x0000000000000007 (RELA) 0x9ba0 2025-05-07T20:11:13.9367668Z 0x0000000000000008 (RELASZ) 4104 (bytes) 2025-05-07T20:11:13.9367784Z 0x0000000000000009 (RELAENT) 24 (bytes) 2025-05-07T20:11:13.9367876Z 0x0000000000000018 (BIND_NOW) 2025-05-07T20:11:13.9367993Z 0x000000006ffffffb (FLAGS_1) Flags: NOW 2025-05-07T20:11:13.9368115Z 0x000000006ffffffe (VERNEED) 0x9a90 2025-05-07T20:11:13.9368294Z 0x000000006fffffff (VERNEEDNUM) 5 2025-05-07T20:11:13.9368402Z 0x000000006ffffff0 (VERSYM) 0x97fc 2025-05-07T20:11:13.9368558Z 0x000000006ffffff9 (RELACOUNT) 7 2025-05-07T20:11:13.9368716Z 0x0000000000000000 (NULL) 0x0 2025-05-07T20:11:13.9368722Z 2025-05-07T20:11:13.9368826Z ################################################################################ 2025-05-07T20:11:13.9368831Z 2025-05-07T20:11:13.9368836Z 2025-05-07T20:11:13.9368938Z ################################################################################ 2025-05-07T20:11:13.9369269Z [CHECK] BUILT LIBRARY: ./_skbuild/linux-x86_64-3.13/cmake-build/fbgemm_gpu_tbe_training_backward_pt2.so 2025-05-07T20:11:13.9369365Z [CHECK] Listing out library size: 2025-05-07T20:11:13.9369674Z + du -h --block-size=1M ./_skbuild/linux-x86_64-3.13/cmake-build/fbgemm_gpu_tbe_training_backward_pt2.so 2025-05-07T20:11:13.9369678Z 2025-05-07T20:11:13.9372143Z 8 ./_skbuild/linux-x86_64-3.13/cmake-build/fbgemm_gpu_tbe_training_backward_pt2.so 2025-05-07T20:11:13.9372152Z 2025-05-07T20:11:13.9372607Z [CHECK] Listing out the GLIBC versions referenced by: ./_skbuild/linux-x86_64-3.13/cmake-build/fbgemm_gpu_tbe_training_backward_pt2.so 2025-05-07T20:11:13.9373163Z + objdump -TC ./_skbuild/linux-x86_64-3.13/cmake-build/fbgemm_gpu_tbe_training_backward_pt2.so | grep GLIBC_ | sed 's/.*GLIBC_\([.0-9]*\).*/GLIBC_\1/g' | sort -Vu | cat 2025-05-07T20:11:13.9373168Z 2025-05-07T20:11:13.9810338Z GLIBC_2.2.5 2025-05-07T20:11:13.9810626Z GLIBC_2.3 2025-05-07T20:11:13.9810862Z GLIBC_2.14 2025-05-07T20:11:13.9810895Z 2025-05-07T20:11:13.9811119Z 2025-05-07T20:11:13.9812777Z [CHECK] Listing out the GLIBCXX versions referenced by: ./_skbuild/linux-x86_64-3.13/cmake-build/fbgemm_gpu_tbe_training_backward_pt2.so 2025-05-07T20:11:13.9814480Z + objdump -TC ./_skbuild/linux-x86_64-3.13/cmake-build/fbgemm_gpu_tbe_training_backward_pt2.so | grep GLIBCXX_ | sed 's/.*GLIBCXX_\([.0-9]*\).*/GLIBCXX_\1/g' | sort -Vu | cat 2025-05-07T20:11:13.9814799Z 2025-05-07T20:11:14.0235769Z GLIBCXX_3.4 2025-05-07T20:11:14.0236611Z GLIBCXX_3.4.9 2025-05-07T20:11:14.0236904Z GLIBCXX_3.4.11 2025-05-07T20:11:14.0237163Z GLIBCXX_3.4.15 2025-05-07T20:11:14.0237424Z GLIBCXX_3.4.18 2025-05-07T20:11:14.0237663Z GLIBCXX_3.4.20 2025-05-07T20:11:14.0237877Z GLIBCXX_3.4.21 2025-05-07T20:11:14.0238101Z GLIBCXX_3.4.29 2025-05-07T20:11:14.0238118Z 2025-05-07T20:11:14.0238130Z 2025-05-07T20:11:14.0254167Z + nm -gDC ./_skbuild/linux-x86_64-3.13/cmake-build/fbgemm_gpu_tbe_training_backward_pt2.so > /tmp/tmp.dV6Aylrsru.symbols.txt 2025-05-07T20:11:14.0254207Z 2025-05-07T20:11:14.0636985Z 2025-05-07T20:11:14.0661426Z [CHECK] Total Number of symbols: 4265 2025-05-07T20:11:14.0690876Z [CHECK] Number of fbgemm symbols: 4 2025-05-07T20:11:14.0708314Z + nm -gDCu ./_skbuild/linux-x86_64-3.13/cmake-build/fbgemm_gpu_tbe_training_backward_pt2.so > /tmp/tmp.9iDeVgjARJ.usymbols.txt 2025-05-07T20:11:14.0709948Z 2025-05-07T20:11:14.0736223Z 2025-05-07T20:11:14.0763920Z [CHECK] Listing out undefined symbols (190 total): 2025-05-07T20:11:14.0776270Z U VTT for std::__cxx11::basic_ostringstream, std::allocator >@GLIBCXX_3.4.21 2025-05-07T20:11:14.0776936Z U _Unwind_Resume@GCC_3.0 2025-05-07T20:11:14.0777291Z U __cxa_allocate_exception@CXXABI_1.3 2025-05-07T20:11:14.0777637Z U __cxa_atexit@GLIBC_2.2.5 2025-05-07T20:11:14.0777966Z U __cxa_begin_catch@CXXABI_1.3 2025-05-07T20:11:14.0778272Z U __cxa_end_catch@CXXABI_1.3 2025-05-07T20:11:14.0778596Z U __cxa_free_exception@CXXABI_1.3 2025-05-07T20:11:14.0778917Z U __cxa_guard_abort@CXXABI_1.3 2025-05-07T20:11:14.0779242Z U __cxa_guard_acquire@CXXABI_1.3 2025-05-07T20:11:14.0779562Z U __cxa_guard_release@CXXABI_1.3 2025-05-07T20:11:14.0779885Z U __cxa_rethrow@CXXABI_1.3 2025-05-07T20:11:14.0780210Z U __cxa_throw@CXXABI_1.3 2025-05-07T20:11:14.0780705Z U __gxx_personality_v0@CXXABI_1.3 2025-05-07T20:11:14.0781101Z U __tls_get_addr@GLIBC_2.3 2025-05-07T20:11:14.0781518Z U at::RecordFunction::RecordFunction(at::StepCallbacks&&) 2025-05-07T20:11:14.0781951Z U at::RecordFunction::currentThreadId() 2025-05-07T20:11:14.0782286Z U at::RecordFunction::end() 2025-05-07T20:11:14.0782645Z U at::RecordFunction::~RecordFunction() 2025-05-07T20:11:14.0783041Z U at::SavedTensorDefaultHooks::set_tracing(bool) 2025-05-07T20:11:14.0783700Z U at::Tensor fbgemm_gpu::reshape_vbe_offsets(at::Tensor const&, at::Tensor const&, long, int) 2025-05-07T20:11:14.0784477Z U at::Tensor fbgemm_gpu::reshape_vbe_offsets(at::Tensor const&, at::Tensor const&, long, int) 2025-05-07T20:11:14.0785142Z U at::_ops::clamp::call(at::Tensor const&, std::optional const&, std::optional const&) 2025-05-07T20:11:14.0785733Z U at::_ops::clone::call(at::Tensor const&, std::optional) 2025-05-07T20:11:14.0786636Z U at::_ops::empty_memory_format::call(c10::ArrayRef, std::optional, std::optional, std::optional, std::optional, std::optional) 2025-05-07T20:11:14.0787498Z U at::_ops::flatten_using_ints::call(at::Tensor const&, long, long) 2025-05-07T20:11:14.0787923Z U at::_ops::view_as::call(at::Tensor const&, at::Tensor const&) 2025-05-07T20:11:14.0788300Z U at::functorch::functorchTLSAccessor() 2025-05-07T20:11:14.0788659Z U at::getStepCallbacksUnlessEmpty(at::RecordScope) 2025-05-07T20:11:14.0789027Z U at::sequence_number::get_and_increment() 2025-05-07T20:11:14.0789331Z U c10::AnyType::get() 2025-05-07T20:11:14.0789645Z U c10::BoolType::get() 2025-05-07T20:11:14.0789969Z U c10::DeviceTypeName[abi:cxx11](c10::DeviceType, bool) 2025-05-07T20:11:14.0790410Z U c10::Dispatcher::findSchemaOrThrow(char const*, char const*) 2025-05-07T20:11:14.0790808Z U c10::Dispatcher::realSingleton() 2025-05-07T20:11:14.0791499Z U c10::Dispatcher::runRecordFunction(at::RecordFunction&, std::reference_wrapper, c10::DispatchKey, c10::DispatchKeySet) 2025-05-07T20:11:14.0792662Z U c10::Dispatcher::runRecordFunction(at::RecordFunction&, std::reference_wrapper, c10::DispatchKey, c10::DispatchKeySet, c10::ArrayRef) 2025-05-07T20:11:14.0793697Z U c10::Error::Error(c10::SourceLocation, std::__cxx11::basic_string, std::allocator >) 2025-05-07T20:11:14.0794233Z U c10::Error::what() const 2025-05-07T20:11:14.0794531Z U c10::FloatType::get() 2025-05-07T20:11:14.0794824Z U c10::GradMode::is_enabled() 2025-05-07T20:11:14.0795125Z U c10::GradMode::set_enabled(bool) 2025-05-07T20:11:14.0795500Z U c10::IValue::TagType::get(c10::IValue const&) 2025-05-07T20:11:14.0795856Z U c10::IValue::hash(c10::IValue const&) 2025-05-07T20:11:14.0796180Z U c10::IValue::isBoolList() const 2025-05-07T20:11:14.0796481Z U c10::IValue::isDoubleList() const 2025-05-07T20:11:14.0796795Z U c10::IValue::isIntList() const 2025-05-07T20:11:14.0797098Z U c10::IValue::isSymIntList() const 2025-05-07T20:11:14.0797402Z U c10::IValue::isTensorList() const 2025-05-07T20:11:14.0797741Z U c10::IValue::reportToTensorTypeError() const 2025-05-07T20:11:14.0798058Z U c10::IntType::get() 2025-05-07T20:11:14.0798715Z U c10::ListType::get(std::__cxx11::basic_string, std::allocator > const&, c10::Type::SingletonOrSharedTypePtr) 2025-05-07T20:11:14.0799437Z U c10::MessageLogger::MessageLogger(char const*, int, int) 2025-05-07T20:11:14.0800902Z U c10::MessageLogger::~MessageLogger() 2025-05-07T20:11:14.0801243Z U c10::OperatorHandle::~OperatorHandle() 2025-05-07T20:11:14.0801573Z U c10::OperatorHandle::~OperatorHandle() 2025-05-07T20:11:14.0801997Z U c10::OptionalType::get(c10::Type::SingletonOrSharedTypePtr) 2025-05-07T20:11:14.0802559Z U c10::SmallVectorBase::grow_pod(void const*, unsigned long, unsigned long) 2025-05-07T20:11:14.0803019Z U c10::StringType::get() 2025-05-07T20:11:14.0803348Z U c10::SymBool::expect_true(char const*, long) const 2025-05-07T20:11:14.0803704Z U c10::SymBool::guard_bool(char const*, long) const 2025-05-07T20:11:14.0804073Z U c10::SymFloat::guard_float(char const*, long) const 2025-05-07T20:11:14.0804450Z U c10::SymFloat::operator/(c10::SymFloat const&) const 2025-05-07T20:11:14.0805082Z U c10::SymInt::SymInt(c10::intrusive_ptr >) 2025-05-07T20:11:14.0805699Z U c10::SymInt::guard_int(char const*, long) const 2025-05-07T20:11:14.0806033Z U c10::SymInt::operator c10::SymFloat() const 2025-05-07T20:11:14.0806381Z U c10::SymInt::operator/(c10::SymInt const&) const 2025-05-07T20:11:14.0806717Z U c10::SymInt::promote_to_negative() 2025-05-07T20:11:14.0807051Z U c10::SymInt::sym_le(c10::SymInt const&) const 2025-05-07T20:11:14.0807367Z U c10::SymInt::toSymNode() const 2025-05-07T20:11:14.0807674Z U c10::SymIntType::get() 2025-05-07T20:11:14.0808021Z U c10::TensorImpl::requires_grad() const 2025-05-07T20:11:14.0808336Z U c10::TensorType::get() 2025-05-07T20:11:14.0808654Z U c10::UndefinedTensorImpl::_singleton 2025-05-07T20:11:14.0809269Z U c10::detail::ListImpl::ListImpl(std::vector >, c10::Type::SingletonOrSharedTypePtr) 2025-05-07T20:11:14.0810367Z U c10::detail::infer_schema::make_function_schema(c10::ArrayRef, c10::ArrayRef) 2025-05-07T20:11:14.0811514Z U c10::detail::torchCheckFail(char const*, char const*, unsigned int, char const*) 2025-05-07T20:11:14.0812373Z U c10::detail::torchCheckFail(char const*, char const*, unsigned int, std::__cxx11::basic_string, std::allocator > const&) 2025-05-07T20:11:14.0813322Z U c10::detail::torchInternalAssertFail(char const*, char const*, unsigned int, char const*, char const*) 2025-05-07T20:11:14.0814351Z U c10::detail::torchInternalAssertFail(char const*, char const*, unsigned int, char const*, std::__cxx11::basic_string, std::allocator > const&) 2025-05-07T20:11:14.0815360Z U c10::impl::OperatorEntry::assertSignatureIsCorrect(c10::impl::CppSignature const&, bool) const 2025-05-07T20:11:14.0815981Z U c10::impl::OperatorEntry::reportError(c10::DispatchKey) const 2025-05-07T20:11:14.0816400Z U c10::impl::raw_local_dispatch_key_set 2025-05-07T20:11:14.0816776Z U c10::initializeFunctionalityOffsetsAndMasks() 2025-05-07T20:11:14.0817412Z U c10::ivalue::ConstantString::create(std::__cxx11::basic_string, std::allocator >) 2025-05-07T20:11:14.0818000Z U c10::operator-(c10::SymInt const&, int) 2025-05-07T20:11:14.0818397Z U c10::operator<<(std::ostream&, c10::OperatorName const&) 2025-05-07T20:11:14.0818940Z U c10::operator<<(std::ostream&, c10::SymFloat const&) 2025-05-07T20:11:14.0819358Z U c10::operator<<(std::ostream&, c10::SymInt const&) 2025-05-07T20:11:14.0819828Z U caffe2::TypeMeta::error_unsupported_typemeta(caffe2::TypeMeta) 2025-05-07T20:11:14.0820246Z U caffe2::TypeMeta::typeMetaDatas() 2025-05-07T20:11:14.0820730Z U fbgemm_gpu::config::is_feature_enabled(fbgemm_gpu::config::FeatureGateName const&) 2025-05-07T20:11:14.0821381Z U fbgemm_gpu::reshape_vbe_output(at::Tensor const&, long, at::Tensor const&, at::Tensor const&) 2025-05-07T20:11:14.0821857Z U free@GLIBC_2.2.5 2025-05-07T20:11:14.0822216Z U long c10::detail::maybe_wrap_dim_slow(long, long, bool) 2025-05-07T20:11:14.0822582Z U memcmp@GLIBC_2.2.5 2025-05-07T20:11:14.0822988Z U memcpy@GLIBC_2.14 2025-05-07T20:11:14.0823257Z U memmove@GLIBC_2.2.5 2025-05-07T20:11:14.0823536Z U memset@GLIBC_2.2.5 2025-05-07T20:11:14.0823858Z U operator delete(void*, unsigned long)@CXXABI_1.3.9 2025-05-07T20:11:14.0824242Z U operator new(unsigned long)@GLIBCXX_3.4 2025-05-07T20:11:14.0824564Z U realloc@GLIBC_2.2.5 2025-05-07T20:11:14.0825134Z U std::_Hash_bytes(void const*, unsigned long, unsigned long)@CXXABI_1.3.5 2025-05-07T20:11:14.0825797Z U std::__cxx11::basic_ostringstream, std::allocator >::basic_ostringstream() 2025-05-07T20:11:14.0826629Z U std::__cxx11::basic_ostringstream, std::allocator >::~basic_ostringstream()@GLIBCXX_3.4.21 2025-05-07T20:11:14.0827467Z U std::__cxx11::basic_stringbuf, std::allocator >::str() const &@GLIBCXX_3.4.29 2025-05-07T20:11:14.0828265Z U std::__cxx11::basic_stringstream, std::allocator >::basic_stringstream() 2025-05-07T20:11:14.0829338Z U std::__cxx11::basic_stringstream, std::allocator >::~basic_stringstream()@GLIBCXX_3.4.21 2025-05-07T20:11:14.0830188Z U std::__detail::_Prime_rehash_policy::_M_need_rehash(unsigned long, unsigned long, unsigned long) const@GLIBCXX_3.4.18 2025-05-07T20:11:14.0830785Z U std::__throw_bad_alloc()@GLIBCXX_3.4 2025-05-07T20:11:14.0831122Z U std::__throw_bad_array_new_length() 2025-05-07T20:11:14.0831467Z U std::__throw_length_error(char const*)@GLIBCXX_3.4 2025-05-07T20:11:14.0831854Z U std::__throw_logic_error(char const*)@GLIBCXX_3.4 2025-05-07T20:11:14.0832259Z U std::__throw_out_of_range_fmt(char const*, ...)@GLIBCXX_3.4.20 2025-05-07T20:11:14.0832680Z U std::__throw_system_error(int)@GLIBCXX_3.4.11 2025-05-07T20:11:14.0833065Z U std::bad_weak_ptr::~bad_weak_ptr()@GLIBCXX_3.4.15 2025-05-07T20:11:14.0833546Z U std::basic_ios >::clear(std::_Ios_Iostate)@GLIBCXX_3.4 2025-05-07T20:11:14.0834244Z U std::basic_ios >::init(std::basic_streambuf >*)@GLIBCXX_3.4 2025-05-07T20:11:14.0835254Z U std::basic_ostream >& std::__ostream_insert >(std::basic_ostream >&, char const*, long)@GLIBCXX_3.4.9 2025-05-07T20:11:14.0836436Z U std::basic_ostream >& std::operator<< >(std::basic_ostream >&, char const*)@GLIBCXX_3.4 2025-05-07T20:11:14.0837179Z U std::exception::~exception()@GLIBCXX_3.4 2025-05-07T20:11:14.0837527Z U std::ios_base::Init::Init()@GLIBCXX_3.4 2025-05-07T20:11:14.0837946Z U std::ios_base::Init::~Init()@GLIBCXX_3.4 2025-05-07T20:11:14.0838358Z U std::ios_base::ios_base()@GLIBCXX_3.4 2025-05-07T20:11:14.0838777Z U std::ios_base::~ios_base()@GLIBCXX_3.4 2025-05-07T20:11:14.0839145Z U std::locale::locale()@GLIBCXX_3.4 2025-05-07T20:11:14.0839494Z U std::locale::~locale()@GLIBCXX_3.4 2025-05-07T20:11:14.0839933Z U std::ostream& std::ostream::_M_insert(long)@GLIBCXX_3.4.9 2025-05-07T20:11:14.0840480Z U std::ostream& std::ostream::_M_insert(unsigned long)@GLIBCXX_3.4.9 2025-05-07T20:11:14.0841112Z U std::ostream::operator<<(int)@GLIBCXX_3.4 2025-05-07T20:11:14.0841644Z U std::out_of_range::out_of_range(char const*)@GLIBCXX_3.4.21 2025-05-07T20:11:14.0842045Z U std::out_of_range::~out_of_range()@GLIBCXX_3.4 2025-05-07T20:11:14.0842717Z U std::runtime_error::runtime_error(std::__cxx11::basic_string, std::allocator > const&)@GLIBCXX_3.4.21 2025-05-07T20:11:14.0843358Z U std::runtime_error::~runtime_error()@GLIBCXX_3.4 2025-05-07T20:11:14.0843752Z U std::terminate()@GLIBCXX_3.4 2025-05-07T20:11:14.0844094Z U strcmp@GLIBC_2.2.5 2025-05-07T20:11:14.0844377Z U strlen@GLIBC_2.2.5 2025-05-07T20:11:14.0844717Z U torch::CppFunction::~CppFunction() 2025-05-07T20:11:14.0845487Z U torch::Library::Library(torch::Library::Kind, std::__cxx11::basic_string, std::allocator >, std::optional, char const*, unsigned int) 2025-05-07T20:11:14.0846610Z U torch::Library::_def(c10::FunctionSchema&&, c10::OperatorName*, std::vector > const&, torch::_RegisterOrVerify) & 2025-05-07T20:11:14.0847443Z U torch::Library::_impl(char const*, torch::CppFunction&&, torch::_RegisterOrVerify) & 2025-05-07T20:11:14.0847899Z U torch::autograd::AnomalyMode::_enabled 2025-05-07T20:11:14.0848415Z U torch::autograd::AutogradContext::AutogradContext(torch::dynamo::autograd::PackedArgs&) 2025-05-07T20:11:14.0848958Z U torch::autograd::AutogradContext::get_and_bump_dirty() const 2025-05-07T20:11:14.0849441Z U torch::autograd::AutogradContext::get_non_differentiable() const 2025-05-07T20:11:14.0849931Z U torch::autograd::AutogradContext::get_saved_variables() const 2025-05-07T20:11:14.0850791Z U torch::autograd::AutogradContext::save_for_backward(std::vector >) 2025-05-07T20:11:14.0851510Z U torch::autograd::AutogradContext::save_variables() 2025-05-07T20:11:14.0851966Z U torch::autograd::ForwardADLevel::try_get_by_idx(unsigned long) 2025-05-07T20:11:14.0852479Z U torch::autograd::InputMetadata::shape_as_dim_vector() const 2025-05-07T20:11:14.0852922Z U torch::autograd::Node::assign_parent() 2025-05-07T20:11:14.0853274Z U torch::autograd::Node::metadata() 2025-05-07T20:11:14.0853668Z U torch::autograd::Node::name[abi:cxx11]() const 2025-05-07T20:11:14.0854161Z U torch::autograd::SavedVariable::SavedVariable(at::Tensor const&, bool, bool) 2025-05-07T20:11:14.0854822Z U torch::autograd::SavedVariable::unpack(std::shared_ptr) const 2025-05-07T20:11:14.0855366Z U torch::autograd::VariableInfo::VariableInfo() 2025-05-07T20:11:14.0855842Z U torch::autograd::VariableInfo::VariableInfo(at::Tensor const&, bool) 2025-05-07T20:11:14.0856409Z U torch::autograd::VariableInfo::zeros(c10::OptionalDeviceGuard&) const 2025-05-07T20:11:14.0859490Z U torch::autograd::_wrap_outputs(std::vector > const&, std::unordered_set, std::equal_to, std::allocator > const&, std::unordered_set, std::equal_to, std::allocator > const&, c10::ArrayRef >, std::shared_ptr const&, std::function > (std::vector >, std::vector >)> const&, std::unordered_set, std::equal_to, std::allocator > const&, std::function const&) 2025-05-07T20:11:14.0862439Z U torch::autograd::deleteNode(torch::autograd::Node*) 2025-05-07T20:11:14.0862882Z U torch::autograd::get_current_graph_task_exec_info() 2025-05-07T20:11:14.0863433Z U torch::autograd::impl::gradient_edge(at::Tensor const&) 2025-05-07T20:11:14.0864518Z U torch::autograd::profiler::record_function_enter_new(std::__cxx11::basic_string, std::allocator > const&, std::optional, std::allocator > > const&) 2025-05-07T20:11:14.0865782Z U torch::dynamo::autograd::getPyCompilerInterface() 2025-05-07T20:11:14.0866476Z U torch::dynamo::autograd::get_input_metadata(std::vector > const&) 2025-05-07T20:11:14.0867379Z U torch::jit::parseSchema(std::__cxx11::basic_string, std::allocator > const&, bool) 2025-05-07T20:11:14.0867986Z U typeinfo for c10::Error 2025-05-07T20:11:14.0868330Z U typeinfo for std::bad_weak_ptr@GLIBCXX_3.4.15 2025-05-07T20:11:14.0868791Z U typeinfo for std::exception@GLIBCXX_3.4 2025-05-07T20:11:14.0869156Z U typeinfo for std::out_of_range@GLIBCXX_3.4 2025-05-07T20:11:14.0869554Z U typeinfo for std::runtime_error@GLIBCXX_3.4 2025-05-07T20:11:14.0869923Z U typeinfo for torch::autograd::Node 2025-05-07T20:11:14.0870322Z U vtable for __cxxabiv1::__class_type_info@CXXABI_1.3 2025-05-07T20:11:14.0870765Z U vtable for __cxxabiv1::__function_type_info@CXXABI_1.3 2025-05-07T20:11:14.0871192Z U vtable for __cxxabiv1::__pointer_type_info@CXXABI_1.3 2025-05-07T20:11:14.0871651Z U vtable for __cxxabiv1::__si_class_type_info@CXXABI_1.3 2025-05-07T20:11:14.0872028Z U vtable for c10::Error 2025-05-07T20:11:14.0872604Z U vtable for std::__cxx11::basic_ostringstream, std::allocator >@GLIBCXX_3.4.21 2025-05-07T20:11:14.0873420Z U vtable for std::__cxx11::basic_stringbuf, std::allocator >@GLIBCXX_3.4.21 2025-05-07T20:11:14.0873992Z U vtable for std::bad_weak_ptr@GLIBCXX_3.4.15 2025-05-07T20:11:14.0874453Z U vtable for std::basic_ios >@GLIBCXX_3.4 2025-05-07T20:11:14.0875003Z U vtable for std::basic_streambuf >@GLIBCXX_3.4 2025-05-07T20:11:14.0875471Z U vtable for torch::autograd::Node 2025-05-07T20:11:14.0875883Z w TLS init function for c10::impl::raw_local_dispatch_key_set 2025-05-07T20:11:14.0876282Z w _ITM_deregisterTMCloneTable 2025-05-07T20:11:14.0876632Z w _ITM_registerTMCloneTable 2025-05-07T20:11:14.0876949Z w __cxa_finalize@GLIBC_2.2.5 2025-05-07T20:11:14.0877265Z w __gmon_start__ 2025-05-07T20:11:14.0877548Z w __pthread_key_create 2025-05-07T20:11:14.0877870Z w pthread_mutex_lock@GLIBC_2.2.5 2025-05-07T20:11:14.0878241Z w pthread_mutex_unlock@GLIBC_2.2.5 2025-05-07T20:11:14.0878627Z [CHECK] Listing out external shared libraries linked: 2025-05-07T20:11:14.0879207Z + ldd ./_skbuild/linux-x86_64-3.13/cmake-build/fbgemm_gpu_tbe_training_backward_pt2.so 2025-05-07T20:11:14.0879580Z 2025-05-07T20:11:14.0879694Z linux-vdso.so.1 (0x00007ffc72b9c000) 2025-05-07T20:11:14.0880009Z libc10.so => not found 2025-05-07T20:11:14.0880751Z fbgemm_gpu_tbe_common.so => /__w/FBGEMM/FBGEMM/fbgemm_gpu/./_skbuild/linux-x86_64-3.13/cmake-build/fbgemm_gpu_tbe_common.so (0x00007f14a6c00000) 2025-05-07T20:11:14.0881396Z libtorch.so => not found 2025-05-07T20:11:14.0881983Z fbgemm_gpu_config.so => /__w/FBGEMM/FBGEMM/fbgemm_gpu/./_skbuild/linux-x86_64-3.13/cmake-build/fbgemm_gpu_config.so (0x00007f14a783e000) 2025-05-07T20:11:14.0882793Z libtorch_cpu.so => not found 2025-05-07T20:11:14.0883082Z libtorch_cuda.so => not found 2025-05-07T20:11:14.0883409Z libstdc++.so.6 => /lib64/libstdc++.so.6 (0x00007f14a699c000) 2025-05-07T20:11:14.0883844Z libgcc_s.so.1 => /lib64/libgcc_s.so.1 (0x00007f14a780e000) 2025-05-07T20:11:14.0884225Z libc.so.6 => /lib64/libc.so.6 (0x00007f14a6794000) 2025-05-07T20:11:14.0884597Z /lib64/ld-linux-x86-64.so.2 (0x00007f14a784e000) 2025-05-07T20:11:14.0885099Z libc10.so => not found 2025-05-07T20:11:14.0885408Z libc10_cuda.so => not found 2025-05-07T20:11:14.0885962Z fbgemm.so => /__w/FBGEMM/FBGEMM/fbgemm_gpu/./_skbuild/linux-x86_64-3.13/cmake-build/fbgemm.so (0x00007f14a6200000) 2025-05-07T20:11:14.0886525Z libtorch.so => not found 2025-05-07T20:11:14.0886801Z libtorch_cpu.so => not found 2025-05-07T20:11:14.0887079Z libtorch_cuda.so => not found 2025-05-07T20:11:14.0887371Z libcudart.so.12 => not found 2025-05-07T20:11:14.0887640Z libc10.so => not found 2025-05-07T20:11:14.0887904Z libtorch_cpu.so => not found 2025-05-07T20:11:14.0888177Z libtorch_cuda.so => not found 2025-05-07T20:11:14.0888473Z libtorch.so => not found 2025-05-07T20:11:14.0888837Z libm.so.6 => /lib64/libm.so.6 (0x00007f14a772f000) 2025-05-07T20:11:14.0889174Z libc10.so => not found 2025-05-07T20:11:14.0889700Z asmjit.so => /__w/FBGEMM/FBGEMM/fbgemm_gpu/./_skbuild/linux-x86_64-3.13/cmake-build/asmjit.so (0x00007f14a6f85000) 2025-05-07T20:11:14.0890380Z libtorch.so => not found 2025-05-07T20:11:14.0890675Z libtorch_cpu.so => not found 2025-05-07T20:11:14.0890953Z libtorch_cuda.so => not found 2025-05-07T20:11:14.0891297Z libtorch_cpu.so => not found 2025-05-07T20:11:14.0891569Z libtorch_cuda.so => not found 2025-05-07T20:11:14.0891870Z libtorch.so => not found 2025-05-07T20:11:14.0892036Z 2025-05-07T20:11:14.0892180Z [CHECK] Displaying ELF information: 2025-05-07T20:11:14.0892677Z + readelf -d ./_skbuild/linux-x86_64-3.13/cmake-build/fbgemm_gpu_tbe_training_backward_pt2.so 2025-05-07T20:11:14.0893077Z 2025-05-07T20:11:14.0893136Z 2025-05-07T20:11:14.0893298Z Dynamic section at offset 0x701230 contains 38 entries: 2025-05-07T20:11:14.0893683Z Tag Type Name/Value 2025-05-07T20:11:14.0894142Z 0x0000000000000001 (NEEDED) Shared library: [libc10.so] 2025-05-07T20:11:14.0894702Z 0x0000000000000001 (NEEDED) Shared library: [fbgemm_gpu_tbe_common.so] 2025-05-07T20:11:14.0895243Z 0x0000000000000001 (NEEDED) Shared library: [libtorch.so] 2025-05-07T20:11:14.0895800Z 0x0000000000000001 (NEEDED) Shared library: [fbgemm_gpu_config.so] 2025-05-07T20:11:14.0896337Z 0x0000000000000001 (NEEDED) Shared library: [libtorch_cpu.so] 2025-05-07T20:11:14.0896892Z 0x0000000000000001 (NEEDED) Shared library: [libtorch_cuda.so] 2025-05-07T20:11:14.0910343Z 0x0000000000000001 (NEEDED) Shared library: [libstdc++.so.6] 2025-05-07T20:11:14.0910858Z 0x0000000000000001 (NEEDED) Shared library: [libgcc_s.so.1] 2025-05-07T20:11:14.0911351Z 0x0000000000000001 (NEEDED) Shared library: [libc.so.6] 2025-05-07T20:11:14.0911862Z 0x0000000000000001 (NEEDED) Shared library: [ld-linux-x86-64.so.2] 2025-05-07T20:11:14.0912566Z 0x000000000000000e (SONAME) Library soname: [fbgemm_gpu_tbe_training_backward_pt2.so] 2025-05-07T20:11:14.0913153Z 0x000000000000000f (RPATH) Library rpath: [$ORIGIN] 2025-05-07T20:11:14.0913597Z 0x000000000000000c (INIT) 0x178000 2025-05-07T20:11:14.0913941Z 0x000000000000000d (FINI) 0x65b3d8 2025-05-07T20:11:14.0914272Z 0x0000000000000019 (INIT_ARRAY) 0x6fcd78 2025-05-07T20:11:14.0914626Z 0x000000000000001b (INIT_ARRAYSZ) 256 (bytes) 2025-05-07T20:11:14.0914972Z 0x000000000000001a (FINI_ARRAY) 0x6fce78 2025-05-07T20:11:14.0915416Z 0x000000000000001c (FINI_ARRAYSZ) 8 (bytes) 2025-05-07T20:11:14.0915723Z 0x0000000000000004 (HASH) 0x200 2025-05-07T20:11:14.0916051Z 0x000000006ffffef5 (GNU_HASH) 0x6490 2025-05-07T20:11:14.0916363Z 0x0000000000000005 (STRTAB) 0x25438 2025-05-07T20:11:14.0916663Z 0x0000000000000006 (SYMTAB) 0xc448 2025-05-07T20:11:14.0917001Z 0x000000000000000a (STRSZ) 1180638 (bytes) 2025-05-07T20:11:14.0917339Z 0x000000000000000b (SYMENT) 24 (bytes) 2025-05-07T20:11:14.0917677Z 0x0000000000000003 (PLTGOT) 0x7024d0 2025-05-07T20:11:14.0918007Z 0x0000000000000002 (PLTRELSZ) 20976 (bytes) 2025-05-07T20:11:14.0918345Z 0x0000000000000014 (PLTREL) RELA 2025-05-07T20:11:14.0918649Z 0x0000000000000017 (JMPREL) 0x171f98 2025-05-07T20:11:14.0918973Z 0x0000000000000007 (RELA) 0x147aa0 2025-05-07T20:11:14.0919308Z 0x0000000000000008 (RELASZ) 173304 (bytes) 2025-05-07T20:11:14.0919640Z 0x0000000000000009 (RELAENT) 24 (bytes) 2025-05-07T20:11:14.0919948Z 0x0000000000000018 (BIND_NOW) 2025-05-07T20:11:14.0920232Z 0x000000006ffffffb (FLAGS_1) Flags: NOW 2025-05-07T20:11:14.0920576Z 0x000000006ffffffe (VERNEED) 0x147970 2025-05-07T20:11:14.0920913Z 0x000000006fffffff (VERNEEDNUM) 4 2025-05-07T20:11:14.0921221Z 0x000000006ffffff0 (VERSYM) 0x145816 2025-05-07T20:11:14.0921518Z 0x000000006ffffff9 (RELACOUNT) 34 2025-05-07T20:11:14.0921829Z 0x0000000000000000 (NULL) 0x0 2025-05-07T20:11:14.0922021Z 2025-05-07T20:11:14.0922142Z ################################################################################ 2025-05-07T20:11:14.0922354Z 2025-05-07T20:11:14.0922358Z 2025-05-07T20:11:14.0922456Z ################################################################################ 2025-05-07T20:11:14.0922941Z [CHECK] BUILT LIBRARY: ./_skbuild/linux-x86_64-3.13/cmake-build/fbgemm_gpu_tbe_inference.so 2025-05-07T20:11:14.0923390Z [CHECK] Listing out library size: 2025-05-07T20:11:14.0923809Z + du -h --block-size=1M ./_skbuild/linux-x86_64-3.13/cmake-build/fbgemm_gpu_tbe_inference.so 2025-05-07T20:11:14.0924146Z 2025-05-07T20:11:14.0924345Z 432 ./_skbuild/linux-x86_64-3.13/cmake-build/fbgemm_gpu_tbe_inference.so 2025-05-07T20:11:14.0924628Z 2025-05-07T20:11:14.0924980Z [CHECK] Listing out the GLIBC versions referenced by: ./_skbuild/linux-x86_64-3.13/cmake-build/fbgemm_gpu_tbe_inference.so 2025-05-07T20:11:14.0925910Z + objdump -TC ./_skbuild/linux-x86_64-3.13/cmake-build/fbgemm_gpu_tbe_inference.so | grep GLIBC_ | sed 's/.*GLIBC_\([.0-9]*\).*/GLIBC_\1/g' | sort -Vu | cat 2025-05-07T20:11:14.0926453Z 2025-05-07T20:11:14.1270676Z GLIBC_2.2.5 2025-05-07T20:11:14.1271305Z GLIBC_2.3 2025-05-07T20:11:14.1271854Z GLIBC_2.14 2025-05-07T20:11:14.1272168Z 2025-05-07T20:11:14.1272182Z 2025-05-07T20:11:14.1273403Z [CHECK] Listing out the GLIBCXX versions referenced by: ./_skbuild/linux-x86_64-3.13/cmake-build/fbgemm_gpu_tbe_inference.so 2025-05-07T20:11:14.1276186Z + objdump -TC ./_skbuild/linux-x86_64-3.13/cmake-build/fbgemm_gpu_tbe_inference.so | grep GLIBCXX_ | sed 's/.*GLIBCXX_\([.0-9]*\).*/GLIBCXX_\1/g' | sort -Vu | cat 2025-05-07T20:11:14.1276792Z 2025-05-07T20:11:14.1652999Z GLIBCXX_3.4 2025-05-07T20:11:14.1653658Z GLIBCXX_3.4.9 2025-05-07T20:11:14.1654255Z GLIBCXX_3.4.11 2025-05-07T20:11:14.1655102Z GLIBCXX_3.4.14 2025-05-07T20:11:14.1655708Z GLIBCXX_3.4.18 2025-05-07T20:11:14.1656376Z GLIBCXX_3.4.20 2025-05-07T20:11:14.1656750Z GLIBCXX_3.4.21 2025-05-07T20:11:14.1656947Z GLIBCXX_3.4.29 2025-05-07T20:11:14.1657145Z 2025-05-07T20:11:14.1657150Z 2025-05-07T20:11:14.1671418Z + nm -gDC ./_skbuild/linux-x86_64-3.13/cmake-build/fbgemm_gpu_tbe_inference.so > /tmp/tmp.3DiSWGY8Lo.symbols.txt 2025-05-07T20:11:14.1671943Z 2025-05-07T20:11:14.2020024Z 2025-05-07T20:11:14.2050879Z [CHECK] Total Number of symbols: 4997 2025-05-07T20:11:14.2075459Z [CHECK] Number of fbgemm symbols: 3788 2025-05-07T20:11:14.2093003Z + nm -gDCu ./_skbuild/linux-x86_64-3.13/cmake-build/fbgemm_gpu_tbe_inference.so > /tmp/tmp.LdhNLFprU5.usymbols.txt 2025-05-07T20:11:14.2094477Z 2025-05-07T20:11:14.2125615Z 2025-05-07T20:11:14.2152173Z [CHECK] Listing out undefined symbols (258 total): 2025-05-07T20:11:14.2168814Z U GOMP_parallel 2025-05-07T20:11:14.2170900Z U VTT for std::__cxx11::basic_ostringstream, std::allocator >@GLIBCXX_3.4.21 2025-05-07T20:11:14.2173204Z U VTT for std::__cxx11::basic_stringstream, std::allocator >@GLIBCXX_3.4.21 2025-05-07T20:11:14.2174847Z U _Unwind_Resume@GCC_3.0 2025-05-07T20:11:14.2175225Z U __cudaPopCallConfiguration@libcudart.so.12 2025-05-07T20:11:14.2175642Z U __cudaPushCallConfiguration@libcudart.so.12 2025-05-07T20:11:14.2176071Z U __cudaRegisterFatBinary@libcudart.so.12 2025-05-07T20:11:14.2176474Z U __cudaRegisterFatBinaryEnd@libcudart.so.12 2025-05-07T20:11:14.2176872Z U __cudaRegisterFunction@libcudart.so.12 2025-05-07T20:11:14.2177249Z U __cudaRegisterVar@libcudart.so.12 2025-05-07T20:11:14.2177604Z U __cudaUnregisterFatBinary@libcudart.so.12 2025-05-07T20:11:14.2178167Z U __cxa_allocate_exception@CXXABI_1.3 2025-05-07T20:11:14.2178497Z U __cxa_atexit@GLIBC_2.2.5 2025-05-07T20:11:14.2178834Z U __cxa_begin_catch@CXXABI_1.3 2025-05-07T20:11:14.2179164Z U __cxa_end_catch@CXXABI_1.3 2025-05-07T20:11:14.2179469Z U __cxa_free_exception@CXXABI_1.3 2025-05-07T20:11:14.2179806Z U __cxa_guard_abort@CXXABI_1.3 2025-05-07T20:11:14.2180131Z U __cxa_guard_acquire@CXXABI_1.3 2025-05-07T20:11:14.2180462Z U __cxa_guard_release@CXXABI_1.3 2025-05-07T20:11:14.2180777Z U __cxa_rethrow@CXXABI_1.3 2025-05-07T20:11:14.2181088Z U __cxa_throw@CXXABI_1.3 2025-05-07T20:11:14.2181384Z U __gxx_personality_v0@CXXABI_1.3 2025-05-07T20:11:14.2181727Z U __tls_get_addr@GLIBC_2.3 2025-05-07T20:11:14.2182120Z U at::TensorBase::__dispatch_contiguous(c10::MemoryFormat) const 2025-05-07T20:11:14.2182981Z U at::_ops::arange_start::call(c10::Scalar const&, c10::Scalar const&, std::optional, std::optional, std::optional, std::optional) 2025-05-07T20:11:14.2184274Z U at::_ops::empty_like::call(at::Tensor const&, std::optional, std::optional, std::optional, std::optional, std::optional) 2025-05-07T20:11:14.2185499Z U at::_ops::empty_memory_format::call(c10::ArrayRef, std::optional, std::optional, std::optional, std::optional, std::optional) 2025-05-07T20:11:14.2186361Z U at::_ops::fill__Scalar::call(at::Tensor&, c10::Scalar const&) 2025-05-07T20:11:14.2187132Z U at::_ops::scalar_tensor::call(c10::Scalar const&, std::optional, std::optional, std::optional, std::optional) 2025-05-07T20:11:14.2187929Z U at::_ops::select_int::call(at::Tensor const&, long, c10::SymInt) 2025-05-07T20:11:14.2188483Z U at::_ops::to_dtype::call(at::Tensor const&, c10::ScalarType, bool, bool, std::optional) 2025-05-07T20:11:14.2189559Z U at::_ops::to_dtype_layout::call(at::Tensor const&, std::optional, std::optional, std::optional, std::optional, bool, bool, std::optional) 2025-05-07T20:11:14.2190693Z U at::_ops::zeros::call(c10::ArrayRef, std::optional, std::optional, std::optional, std::optional) 2025-05-07T20:11:14.2191343Z U at::detail::getCUDAHooks() 2025-05-07T20:11:14.2191640Z U at::detail::getHIPHooks() 2025-05-07T20:11:14.2191941Z U at::get_num_threads() 2025-05-07T20:11:14.2192218Z U at::get_thread_num() 2025-05-07T20:11:14.2192511Z U at::globalContext() 2025-05-07T20:11:14.2192789Z U at::in_parallel_region() 2025-05-07T20:11:14.2193091Z U at::init_num_threads() 2025-05-07T20:11:14.2193381Z U at::internal::set_thread_num(int) 2025-05-07T20:11:14.2193765Z U c10::AutogradMetaInterface::~AutogradMetaInterface() 2025-05-07T20:11:14.2194213Z U c10::BFloat16* at::TensorBase::data_ptr() const 2025-05-07T20:11:14.2194678Z U c10::BFloat16* at::TensorBase::mutable_data_ptr() const 2025-05-07T20:11:14.2195118Z U c10::ClassType::addMethod(torch::jit::Function*) 2025-05-07T20:11:14.2195697Z U c10::ClassType::getMethod(std::__cxx11::basic_string, std::allocator > const&) const 2025-05-07T20:11:14.2196313Z U c10::DeviceTypeName[abi:cxx11](c10::DeviceType, bool) 2025-05-07T20:11:14.2197185Z U c10::DictType::get(std::__cxx11::basic_string, std::allocator > const&, c10::Type::SingletonOrSharedTypePtr, c10::Type::SingletonOrSharedTypePtr) 2025-05-07T20:11:14.2198263Z U c10::Error::Error(c10::SourceLocation, std::__cxx11::basic_string, std::allocator >) 2025-05-07T20:11:14.2198821Z U c10::Error::what() const 2025-05-07T20:11:14.2199135Z U c10::GradMode::is_enabled() 2025-05-07T20:11:14.2199436Z U c10::GradMode::set_enabled(bool) 2025-05-07T20:11:14.2199789Z U c10::Half* at::TensorBase::data_ptr() const 2025-05-07T20:11:14.2200200Z U c10::Half* at::TensorBase::mutable_data_ptr() const 2025-05-07T20:11:14.2200624Z U c10::IValue::TagType::get(c10::IValue const&) 2025-05-07T20:11:14.2201003Z U c10::IValue::is(c10::IValue const&) const 2025-05-07T20:11:14.2201325Z U c10::IValue::isTensorList() const 2025-05-07T20:11:14.2201683Z U c10::IValue::reportToTensorTypeError() const 2025-05-07T20:11:14.2202009Z U c10::IntType::get() 2025-05-07T20:11:14.2202640Z U c10::ListType::get(std::__cxx11::basic_string, std::allocator > const&, c10::Type::SingletonOrSharedTypePtr) 2025-05-07T20:11:14.2203354Z U c10::MessageLogger::MessageLogger(char const*, int, int) 2025-05-07T20:11:14.2203719Z U c10::MessageLogger::~MessageLogger() 2025-05-07T20:11:14.2204033Z U c10::NoneType::get() 2025-05-07T20:11:14.2204420Z U c10::OptionalType::get(c10::Type::SingletonOrSharedTypePtr) 2025-05-07T20:11:14.2204873Z U c10::ParallelGuard::ParallelGuard(bool) 2025-05-07T20:11:14.2205204Z U c10::ParallelGuard::~ParallelGuard() 2025-05-07T20:11:14.2205574Z U c10::StorageImpl::throw_data_ptr_access_error() const 2025-05-07T20:11:14.2205957Z U c10::StringType::get() 2025-05-07T20:11:14.2206304Z U c10::SymBool::guard_bool(char const*, long) const 2025-05-07T20:11:14.2207083Z U c10::SymInt::SymInt(c10::intrusive_ptr >) 2025-05-07T20:11:14.2207689Z U c10::SymInt::guard_int(char const*, long) const 2025-05-07T20:11:14.2208037Z U c10::SymInt::promote_to_negative() 2025-05-07T20:11:14.2208395Z U c10::SymbolicShapeMeta::init_is_contiguous() const 2025-05-07T20:11:14.2209048Z U c10::TensorImpl::set_autograd_meta(std::unique_ptr >) 2025-05-07T20:11:14.2209665Z U c10::TensorType::get() 2025-05-07T20:11:14.2210965Z U c10::TupleType::TupleType(std::vector, std::allocator > >, std::optional, std::shared_ptr) 2025-05-07T20:11:14.2211986Z U c10::UndefinedTensorImpl::_singleton 2025-05-07T20:11:14.2212941Z U c10::Warning::Warning(std::variant, c10::SourceLocation const&, std::__cxx11::basic_string, std::allocator >, bool) 2025-05-07T20:11:14.2213985Z U c10::_fastEqualsForContainer(c10::IValue const&, c10::IValue const&) 2025-05-07T20:11:14.2214444Z U c10::cuda::CUDACachingAllocator::allocator 2025-05-07T20:11:14.2214820Z U c10::cuda::CUDAStream::stream() const 2025-05-07T20:11:14.2215167Z U c10::cuda::ExchangeDevice(signed char) 2025-05-07T20:11:14.2215522Z U c10::cuda::GetDevice(signed char*) 2025-05-07T20:11:14.2215893Z U c10::cuda::MaybeSetDevice(signed char) 2025-05-07T20:11:14.2216251Z U c10::cuda::SetDevice(signed char) 2025-05-07T20:11:14.2216726Z U c10::cuda::c10_cuda_check_implementation(int, char const*, char const*, int, bool) 2025-05-07T20:11:14.2217187Z U c10::cuda::device_count() 2025-05-07T20:11:14.2217543Z U c10::cuda::getCurrentCUDAStream(signed char) 2025-05-07T20:11:14.2217917Z U c10::cuda::getDefaultCUDAStream(signed char) 2025-05-07T20:11:14.2218326Z U c10::cuda::getStreamFromPool(bool, signed char) 2025-05-07T20:11:14.2218728Z U c10::cuda::getStreamFromPool(int, signed char) 2025-05-07T20:11:14.2219132Z U c10::cuda::setCurrentCUDAStream(c10::cuda::CUDAStream) 2025-05-07T20:11:14.2219528Z U c10::cuda::warn_or_error_on_sync() 2025-05-07T20:11:14.2220179Z U c10::detail::ListImpl::ListImpl(std::vector >, c10::Type::SingletonOrSharedTypePtr) 2025-05-07T20:11:14.2221238Z U c10::detail::infer_schema::make_function_schema(c10::ArrayRef, c10::ArrayRef) 2025-05-07T20:11:14.2222886Z U c10::detail::infer_schema::make_function_schema(std::__cxx11::basic_string, std::allocator >&&, std::__cxx11::basic_string, std::allocator >&&, c10::ArrayRef, c10::ArrayRef) 2025-05-07T20:11:14.2224261Z U c10::detail::torchCheckFail(char const*, char const*, unsigned int, char const*) 2025-05-07T20:11:14.2225134Z U c10::detail::torchCheckFail(char const*, char const*, unsigned int, std::__cxx11::basic_string, std::allocator > const&) 2025-05-07T20:11:14.2226114Z U c10::detail::torchInternalAssertFail(char const*, char const*, unsigned int, char const*, char const*) 2025-05-07T20:11:14.2227173Z U c10::detail::torchInternalAssertFail(char const*, char const*, unsigned int, char const*, std::__cxx11::basic_string, std::allocator > const&) 2025-05-07T20:11:14.2228057Z U c10::getCustomClassTypeImpl(std::type_index const&) 2025-05-07T20:11:14.2228635Z U c10::get_default_dtype() 2025-05-07T20:11:14.2229127Z U c10::impl::ExcludeDispatchKeyGuard::ExcludeDispatchKeyGuard(c10::DispatchKeySet) 2025-05-07T20:11:14.2229740Z U c10::impl::ExcludeDispatchKeyGuard::~ExcludeDispatchKeyGuard() 2025-05-07T20:11:14.2230182Z U c10::impl::GPUTrace::gpuTraceState 2025-05-07T20:11:14.2230524Z U c10::impl::GPUTrace::haveState 2025-05-07T20:11:14.2230893Z U c10::impl::PyObjectSlot::~PyObjectSlot() 2025-05-07T20:11:14.2231256Z U c10::impl::device_guard_impl_registry 2025-05-07T20:11:14.2231878Z U c10::ivalue::ConstantString::create(std::__cxx11::basic_string, std::allocator >) 2025-05-07T20:11:14.2232522Z U c10::ivalue::Future::extractStorages(c10::IValue const&) 2025-05-07T20:11:14.2232961Z U c10::ivalue::Object::resizeObject(unsigned long) 2025-05-07T20:11:14.2233476Z U c10::ivalue::checkCustomClassType(c10::ClassType const*, c10::Type const*) 2025-05-07T20:11:14.2233969Z U c10::operator<<(std::ostream&, c10::Device const&) 2025-05-07T20:11:14.2234366Z U c10::operator<<(std::ostream&, c10::DeviceType) 2025-05-07T20:11:14.2234772Z U c10::operator<<(std::ostream&, c10::FunctionSchema const&) 2025-05-07T20:11:14.2235155Z U c10::warn(c10::Warning const&) 2025-05-07T20:11:14.2235569Z U caffe2::TypeMeta::error_unsupported_typemeta(caffe2::TypeMeta) 2025-05-07T20:11:14.2236059Z U caffe2::TypeMeta::typeMetaDatas() 2025-05-07T20:11:14.2236425Z U cudaDeviceSynchronize@libcudart.so.12 2025-05-07T20:11:14.2236805Z U cudaEventCreateWithFlags@libcudart.so.12 2025-05-07T20:11:14.2237189Z U cudaEventDestroy@libcudart.so.12 2025-05-07T20:11:14.2237537Z U cudaEventElapsedTime@libcudart.so.12 2025-05-07T20:11:14.2237900Z U cudaEventQuery@libcudart.so.12 2025-05-07T20:11:14.2238250Z U cudaEventRecord@libcudart.so.12 2025-05-07T20:11:14.2238597Z U cudaEventSynchronize@libcudart.so.12 2025-05-07T20:11:14.2238965Z U cudaGetErrorString@libcudart.so.12 2025-05-07T20:11:14.2239305Z U cudaGetLastError@libcudart.so.12 2025-05-07T20:11:14.2239662Z U cudaLaunchKernel@libcudart.so.12 2025-05-07T20:11:14.2239997Z U cudaStreamQuery@libcudart.so.12 2025-05-07T20:11:14.2240370Z U cudaStreamSynchronize@libcudart.so.12 2025-05-07T20:11:14.2240735Z U cudaStreamWaitEvent@libcudart.so.12 2025-05-07T20:11:14.2241930Z U fbgemm::EmbeddingSpMDMKernelSignature::Type fbgemm::GenerateEmbeddingSpMDMWithStrides(long, bool, bool, int, bool, bool, long, long, bool, bool, bool, bool) 2025-05-07T20:11:14.2243514Z U fbgemm::EmbeddingSpMDMKernelSignature::Type fbgemm::GenerateEmbeddingSpMDMWithStrides(long, bool, bool, int, bool, bool, long, long, bool, bool, bool, bool) 2025-05-07T20:11:14.2245156Z U fbgemm::EmbeddingSpMDMKernelSignature::Type fbgemm::GenerateEmbeddingSpMDMWithStrides(long, bool, bool, int, bool, bool, long, long, bool, bool, bool, bool) 2025-05-07T20:11:14.2246774Z U fbgemm::EmbeddingSpMDMKernelSignature::Type fbgemm::GenerateEmbeddingSpMDMWithStrides(long, bool, bool, int, bool, bool, long, long, bool, bool, bool, bool) 2025-05-07T20:11:14.2248469Z U fbgemm::EmbeddingSpMDMKernelSignature::Type fbgemm::GenerateEmbeddingSpMDMWithStrides(long, bool, bool, int, bool, bool, long, long, bool, bool, bool, bool) 2025-05-07T20:11:14.2250216Z U fbgemm::EmbeddingSpMDMKernelSignature::Type fbgemm::GenerateEmbeddingSpMDMWithStrides(long, bool, bool, int, bool, bool, long, long, bool, bool, bool, bool) 2025-05-07T20:11:14.2252064Z U fbgemm::EmbeddingSpMDMKernelSignature::Type fbgemm::GenerateEmbeddingSpMDMFP8WithStrides(long, bool, bool, bool, long, long, int, int, bool) 2025-05-07T20:11:14.2253700Z U fbgemm::EmbeddingSpMDMKernelSignature::Type fbgemm::GenerateEmbeddingSpMDMNBitWithStrides(int, long, bool, bool, int, bool, bool, long, long, bool, bool, bool, int) 2025-05-07T20:11:14.2255473Z U fbgemm::EmbeddingSpMDMKernelSignature::Type fbgemm::GenerateEmbeddingSpMDMWithStrides(long, bool, bool, int, bool, bool, long, long, bool, bool, bool, bool) 2025-05-07T20:11:14.2257282Z U fbgemm::EmbeddingSpMDMKernelSignature::Type fbgemm::GenerateEmbeddingSpMDMNBitWithStrides(int, long, bool, bool, int, bool, bool, long, long, bool, bool, bool, int) 2025-05-07T20:11:14.2259138Z U fbgemm::EmbeddingSpMDMKernelSignature::Type fbgemm::GenerateEmbeddingSpMDMWithStrides(long, bool, bool, int, bool, bool, long, long, bool, bool, bool, bool) 2025-05-07T20:11:14.2260952Z U fbgemm::EmbeddingSpMDMKernelSignature::Type fbgemm::GenerateEmbeddingSpMDMFP8WithStrides(long, bool, bool, bool, long, long, int, int, bool) 2025-05-07T20:11:14.2262754Z U fbgemm::EmbeddingSpMDMKernelSignature::Type fbgemm::GenerateEmbeddingSpMDMNBitWithStrides(int, long, bool, bool, int, bool, bool, long, long, bool, bool, bool, int) 2025-05-07T20:11:14.2264654Z U fbgemm::EmbeddingSpMDMKernelSignature::Type fbgemm::GenerateEmbeddingSpMDMWithStrides(long, bool, bool, int, bool, bool, long, long, bool, bool, bool, bool) 2025-05-07T20:11:14.2266416Z U fbgemm::EmbeddingSpMDMKernelSignature::Type fbgemm::GenerateEmbeddingSpMDMFP8WithStrides(long, bool, bool, bool, long, long, int, int, bool) 2025-05-07T20:11:14.2268099Z U fbgemm::EmbeddingSpMDMKernelSignature::Type fbgemm::GenerateEmbeddingSpMDMNBitWithStrides(int, long, bool, bool, int, bool, bool, long, long, bool, bool, bool, int) 2025-05-07T20:11:14.2269857Z U fbgemm::EmbeddingSpMDMKernelSignature::Type fbgemm::GenerateEmbeddingSpMDMWithStrides(long, bool, bool, int, bool, bool, long, long, bool, bool, bool, bool) 2025-05-07T20:11:14.2271941Z U fbgemm::EmbeddingSpMDMKernelSignature::Type fbgemm::GenerateEmbeddingSpMDMNBitWithStrides(int, long, bool, bool, int, bool, bool, long, long, bool, bool, bool, int) 2025-05-07T20:11:14.2274026Z U fbgemm::EmbeddingSpMDMKernelSignature::Type fbgemm::GenerateEmbeddingSpMDMWithStrides(long, bool, bool, int, bool, bool, long, long, bool, bool, bool, bool) 2025-05-07T20:11:14.2275902Z U fbgemm::EmbeddingSpMDMKernelSignature::Type fbgemm::GenerateEmbeddingSpMDMFP8WithStrides(long, bool, bool, bool, long, long, int, int, bool) 2025-05-07T20:11:14.2277698Z U fbgemm::EmbeddingSpMDMKernelSignature::Type fbgemm::GenerateEmbeddingSpMDMNBitWithStrides(int, long, bool, bool, int, bool, bool, long, long, bool, bool, bool, int) 2025-05-07T20:11:14.2279623Z U fbgemm::EmbeddingSpMDMKernelSignature::Type fbgemm::GenerateEmbeddingSpMDMWithStrides(long, bool, bool, int, bool, bool, long, long, bool, bool, bool, bool) 2025-05-07T20:11:14.2281501Z U fbgemm::EmbeddingSpMDMKernelSignature::Type fbgemm::GenerateEmbeddingSpMDMWithStrides(long, bool, bool, int, bool, bool, long, long, bool, bool, bool, bool) 2025-05-07T20:11:14.2283351Z U fbgemm::EmbeddingSpMDMKernelSignature::Type fbgemm::GenerateEmbeddingSpMDMWithStrides(long, bool, bool, int, bool, bool, long, long, bool, bool, bool, bool) 2025-05-07T20:11:14.2285384Z U fbgemm::EmbeddingSpMDMKernelSignature::Type fbgemm::GenerateEmbeddingSpMDMWithStrides(long, bool, bool, int, bool, bool, long, long, bool, bool, bool, bool) 2025-05-07T20:11:14.2287209Z U fbgemm::EmbeddingSpMDMKernelSignature::Type fbgemm::GenerateEmbeddingSpMDMWithStrides(long, bool, bool, int, bool, bool, long, long, bool, bool, bool, bool) 2025-05-07T20:11:14.2289063Z U fbgemm::EmbeddingSpMDMKernelSignature::Type fbgemm::GenerateEmbeddingSpMDMWithStrides(long, bool, bool, int, bool, bool, long, long, bool, bool, bool, bool) 2025-05-07T20:11:14.2291227Z U fbgemm::EmbeddingSpMDMKernelSignature::Type fbgemm::GenerateEmbeddingSpMDMWithStrides(long, bool, bool, int, bool, bool, long, long, bool, bool, bool, bool) 2025-05-07T20:11:14.2292402Z U float* at::TensorBase::data_ptr() const 2025-05-07T20:11:14.2292817Z U float* at::TensorBase::mutable_data_ptr() const 2025-05-07T20:11:14.2293224Z U int* at::TensorBase::data_ptr() const 2025-05-07T20:11:14.2293610Z U int* at::TensorBase::mutable_data_ptr() const 2025-05-07T20:11:14.2294290Z U linearize_cache_indices_cuda(at::Tensor const&, at::Tensor const&, at::Tensor const&, std::optional const&, long, long) 2025-05-07T20:11:14.2294982Z U long c10::detail::maybe_wrap_dim_slow(long, long, bool) 2025-05-07T20:11:14.2295413Z U long* at::TensorBase::data_ptr() const 2025-05-07T20:11:14.2295825Z U long* at::TensorBase::mutable_data_ptr() const 2025-05-07T20:11:14.2296673Z U lru_cache_populate_byte_cuda(at::Tensor, at::Tensor, long, at::Tensor, at::Tensor, at::Tensor, at::Tensor, at::Tensor, at::Tensor, at::Tensor, long, at::Tensor, long, bool, std::optional) 2025-05-07T20:11:14.2297794Z U lxu_cache_lookup_cuda(at::Tensor, at::Tensor, long, bool, std::optional, std::optional, std::optional) 2025-05-07T20:11:14.2298461Z U memchr@GLIBC_2.2.5 2025-05-07T20:11:14.2298758Z U memcpy@GLIBC_2.14 2025-05-07T20:11:14.2299094Z U memmove@GLIBC_2.2.5 2025-05-07T20:11:14.2299401Z U memset@GLIBC_2.2.5 2025-05-07T20:11:14.2299694Z U omp_get_num_threads 2025-05-07T20:11:14.2299967Z U omp_get_thread_num 2025-05-07T20:11:14.2300300Z U operator delete(void*, unsigned long)@CXXABI_1.3.9 2025-05-07T20:11:14.2300690Z U operator new(unsigned long)@GLIBCXX_3.4 2025-05-07T20:11:14.2301122Z U std::_Hash_bytes(void const*, unsigned long, unsigned long)@CXXABI_1.3.5 2025-05-07T20:11:14.2301799Z U std::__cxx11::basic_ostringstream, std::allocator >::basic_ostringstream() 2025-05-07T20:11:14.2302631Z U std::__cxx11::basic_ostringstream, std::allocator >::~basic_ostringstream()@GLIBCXX_3.4.21 2025-05-07T20:11:14.2303634Z U std::__cxx11::basic_stringbuf, std::allocator >::_M_sync(char*, unsigned long, unsigned long)@GLIBCXX_3.4.21 2025-05-07T20:11:14.2304614Z U std::__cxx11::basic_stringbuf, std::allocator >::str() const &@GLIBCXX_3.4.29 2025-05-07T20:11:14.2305322Z U std::__cxx11::basic_stringstream, std::allocator >::basic_stringstream() 2025-05-07T20:11:14.2306090Z U std::__cxx11::basic_stringstream, std::allocator >::~basic_stringstream()@GLIBCXX_3.4.21 2025-05-07T20:11:14.2306884Z U std::__detail::_Prime_rehash_policy::_M_need_rehash(unsigned long, unsigned long, unsigned long) const@GLIBCXX_3.4.18 2025-05-07T20:11:14.2307454Z U std::__exception_ptr::exception_ptr::_M_addref() 2025-05-07T20:11:14.2307857Z U std::__exception_ptr::exception_ptr::_M_release() 2025-05-07T20:11:14.2308190Z U std::__throw_bad_alloc()@GLIBCXX_3.4 2025-05-07T20:11:14.2308523Z U std::__throw_bad_array_new_length() 2025-05-07T20:11:14.2308864Z U std::__throw_bad_function_call()@GLIBCXX_3.4.14 2025-05-07T20:11:14.2309225Z U std::__throw_length_error(char const*)@GLIBCXX_3.4 2025-05-07T20:11:14.2309594Z U std::__throw_logic_error(char const*)@GLIBCXX_3.4 2025-05-07T20:11:14.2310000Z U std::__throw_out_of_range_fmt(char const*, ...)@GLIBCXX_3.4.20 2025-05-07T20:11:14.2310383Z U std::__throw_system_error(int)@GLIBCXX_3.4.11 2025-05-07T20:11:14.2310836Z U std::basic_ios >::clear(std::_Ios_Iostate)@GLIBCXX_3.4 2025-05-07T20:11:14.2311490Z U std::basic_ios >::init(std::basic_streambuf >*)@GLIBCXX_3.4 2025-05-07T20:11:14.2312433Z U std::basic_ostream >& std::__ostream_insert >(std::basic_ostream >&, char const*, long)@GLIBCXX_3.4.9 2025-05-07T20:11:14.2313544Z U std::basic_ostream >& std::operator<< >(std::basic_ostream >&, char const*)@GLIBCXX_3.4 2025-05-07T20:11:14.2314294Z U std::condition_variable::condition_variable()@GLIBCXX_3.4.11 2025-05-07T20:11:14.2314696Z U std::condition_variable::notify_all()@GLIBCXX_3.4.11 2025-05-07T20:11:14.2315123Z U std::condition_variable::~condition_variable()@GLIBCXX_3.4.11 2025-05-07T20:11:14.2315525Z U std::current_exception()@CXXABI_1.3.3 2025-05-07T20:11:14.2315849Z U std::exception::~exception()@GLIBCXX_3.4 2025-05-07T20:11:14.2316191Z U std::ios_base::Init::Init()@GLIBCXX_3.4 2025-05-07T20:11:14.2317935Z U std::ios_base::Init::~Init()@GLIBCXX_3.4 2025-05-07T20:11:14.2318279Z U std::ios_base::ios_base()@GLIBCXX_3.4 2025-05-07T20:11:14.2318631Z U std::ios_base::~ios_base()@GLIBCXX_3.4 2025-05-07T20:11:14.2318841Z U std::istream& std::istream::_M_extract(long&)@GLIBCXX_3.4.9 2025-05-07T20:11:14.2318952Z U std::locale::locale()@GLIBCXX_3.4 2025-05-07T20:11:14.2319061Z U std::locale::~locale()@GLIBCXX_3.4 2025-05-07T20:11:14.2319452Z U std::logic_error::logic_error(std::__cxx11::basic_string, std::allocator > const&)@GLIBCXX_3.4.21 2025-05-07T20:11:14.2319577Z U std::logic_error::~logic_error()@GLIBCXX_3.4 2025-05-07T20:11:14.2319748Z U std::ostream& std::ostream::_M_insert(long)@GLIBCXX_3.4.9 2025-05-07T20:11:14.2319991Z U std::ostream& std::ostream::_M_insert(unsigned long)@GLIBCXX_3.4.9 2025-05-07T20:11:14.2320114Z U std::ostream::operator<<(int)@GLIBCXX_3.4 2025-05-07T20:11:14.2320267Z U std::out_of_range::out_of_range(char const*)@GLIBCXX_3.4.21 2025-05-07T20:11:14.2320405Z U std::out_of_range::~out_of_range()@GLIBCXX_3.4 2025-05-07T20:11:14.2320610Z U std::rethrow_exception(std::__exception_ptr::exception_ptr)@CXXABI_1.3.3 2025-05-07T20:11:14.2320711Z U std::terminate()@GLIBCXX_3.4 2025-05-07T20:11:14.2320820Z U strcmp@GLIBC_2.2.5 2025-05-07T20:11:14.2320910Z U strlen@GLIBC_2.2.5 2025-05-07T20:11:14.2321026Z U torch::CppFunction::~CppFunction() 2025-05-07T20:11:14.2321590Z U torch::Library::Library(torch::Library::Kind, std::__cxx11::basic_string, std::allocator >, std::optional, char const*, unsigned int) 2025-05-07T20:11:14.2322046Z U torch::Library::_def(c10::FunctionSchema&&, c10::OperatorName*, std::vector > const&, torch::_RegisterOrVerify) & 2025-05-07T20:11:14.2322284Z U torch::Library::_impl(char const*, torch::CppFunction&&, torch::_RegisterOrVerify) & 2025-05-07T20:11:14.2323281Z U torch::detail::class_base::class_base(std::__cxx11::basic_string, std::allocator > const&, std::__cxx11::basic_string, std::allocator > const&, std::__cxx11::basic_string, std::allocator >, std::type_info const&, std::type_info const&) 2025-05-07T20:11:14.2323619Z U torch::detail::class_base::withNewArguments(c10::FunctionSchema const&, std::initializer_list) 2025-05-07T20:11:14.2323966Z U torch::jit::parseSchema(std::__cxx11::basic_string, std::allocator > const&, bool) 2025-05-07T20:11:14.2324338Z U torch::registerCustomClassMethod(std::unique_ptr >) 2025-05-07T20:11:14.2324482Z U torch::serialize::InputArchive::InputArchive() 2025-05-07T20:11:14.2324793Z U torch::serialize::InputArchive::load_from(char const*, unsigned long, std::optional) 2025-05-07T20:11:14.2325213Z U torch::serialize::InputArchive::read(std::__cxx11::basic_string, std::allocator > const&, at::Tensor&, bool) 2025-05-07T20:11:14.2325507Z U torch::serialize::OutputArchive::OutputArchive(std::shared_ptr) 2025-05-07T20:11:14.2325678Z U torch::serialize::OutputArchive::save_to(std::ostream&) 2025-05-07T20:11:14.2326124Z U torch::serialize::OutputArchive::write(std::__cxx11::basic_string, std::allocator > const&, at::Tensor const&, bool) 2025-05-07T20:11:14.2326225Z U typeinfo for c10::Error 2025-05-07T20:11:14.2326375Z U typeinfo for std::exception@GLIBCXX_3.4 2025-05-07T20:11:14.2326516Z U typeinfo for std::logic_error@GLIBCXX_3.4 2025-05-07T20:11:14.2326658Z U typeinfo for std::out_of_range@GLIBCXX_3.4 2025-05-07T20:11:14.2326846Z U unsigned char* at::TensorBase::data_ptr() const 2025-05-07T20:11:14.2327043Z U unsigned char* at::TensorBase::mutable_data_ptr() const 2025-05-07T20:11:14.2327189Z U vtable for __cxxabiv1::__class_type_info@CXXABI_1.3 2025-05-07T20:11:14.2327350Z U vtable for __cxxabiv1::__function_type_info@CXXABI_1.3 2025-05-07T20:11:14.2327495Z U vtable for __cxxabiv1::__si_class_type_info@CXXABI_1.3 2025-05-07T20:11:14.2327591Z U vtable for c10::Error 2025-05-07T20:11:14.2327929Z U vtable for std::__cxx11::basic_ostringstream, std::allocator >@GLIBCXX_3.4.21 2025-05-07T20:11:14.2328230Z U vtable for std::__cxx11::basic_stringbuf, std::allocator >@GLIBCXX_3.4.21 2025-05-07T20:11:14.2328718Z U vtable for std::__cxx11::basic_stringstream, std::allocator >@GLIBCXX_3.4.21 2025-05-07T20:11:14.2329112Z U vtable for std::basic_ios >@GLIBCXX_3.4 2025-05-07T20:11:14.2329361Z U vtable for std::basic_streambuf >@GLIBCXX_3.4 2025-05-07T20:11:14.2329494Z U vtable for torch::autograd::AutogradMeta 2025-05-07T20:11:14.2329628Z w _ITM_deregisterTMCloneTable 2025-05-07T20:11:14.2329739Z w _ITM_registerTMCloneTable 2025-05-07T20:11:14.2329843Z w __cxa_finalize@GLIBC_2.2.5 2025-05-07T20:11:14.2329944Z w __gmon_start__ 2025-05-07T20:11:14.2330334Z w __pthread_key_create 2025-05-07T20:11:14.2330452Z w pthread_mutex_lock@GLIBC_2.2.5 2025-05-07T20:11:14.2330568Z w pthread_mutex_unlock@GLIBC_2.2.5 2025-05-07T20:11:14.2330721Z [CHECK] Listing out external shared libraries linked: 2025-05-07T20:11:14.2330945Z + ldd ./_skbuild/linux-x86_64-3.13/cmake-build/fbgemm_gpu_tbe_inference.so 2025-05-07T20:11:14.2330952Z 2025-05-07T20:11:14.2331172Z linux-vdso.so.1 (0x00007ffd3fdcb000) 2025-05-07T20:11:14.2331257Z libc10.so => not found 2025-05-07T20:11:14.2331353Z libc10_cuda.so => not found 2025-05-07T20:11:14.2331736Z fbgemm.so => /__w/FBGEMM/FBGEMM/fbgemm_gpu/./_skbuild/linux-x86_64-3.13/cmake-build/fbgemm.so (0x00007effe7a00000) 2025-05-07T20:11:14.2332199Z fbgemm_gpu_tbe_cache.so => /__w/FBGEMM/FBGEMM/fbgemm_gpu/./_skbuild/linux-x86_64-3.13/cmake-build/fbgemm_gpu_tbe_cache.so (0x00007effe6200000) 2025-05-07T20:11:14.2332297Z libtorch.so => not found 2025-05-07T20:11:14.2332417Z libtorch_cpu.so => not found 2025-05-07T20:11:14.2332510Z libtorch_cuda.so => not found 2025-05-07T20:11:14.2332610Z libcudart.so.12 => not found 2025-05-07T20:11:14.2332780Z libstdc++.so.6 => /lib64/libstdc++.so.6 (0x00007effe5f9c000) 2025-05-07T20:11:14.2332946Z libgcc_s.so.1 => /lib64/libgcc_s.so.1 (0x00007f00033c8000) 2025-05-07T20:11:14.2333069Z libc.so.6 => /lib64/libc.so.6 (0x00007effe5d94000) 2025-05-07T20:11:14.2333197Z /lib64/ld-linux-x86-64.so.2 (0x00007f00033fc000) 2025-05-07T20:11:14.2333304Z libc10.so => not found 2025-05-07T20:11:14.2333658Z asmjit.so => /__w/FBGEMM/FBGEMM/fbgemm_gpu/./_skbuild/linux-x86_64-3.13/cmake-build/asmjit.so (0x00007effe7f85000) 2025-05-07T20:11:14.2333755Z libtorch.so => not found 2025-05-07T20:11:14.2333851Z libtorch_cpu.so => not found 2025-05-07T20:11:14.2333964Z libtorch_cuda.so => not found 2025-05-07T20:11:14.2334094Z libm.so.6 => /lib64/libm.so.6 (0x00007effe7925000) 2025-05-07T20:11:14.2334188Z libtorch.so => not found 2025-05-07T20:11:14.2334288Z libc10.so => not found 2025-05-07T20:11:14.2334426Z libc10_cuda.so => not found 2025-05-07T20:11:14.2334518Z libtorch_cpu.so => not found 2025-05-07T20:11:14.2334652Z libtorch_cuda.so => not found 2025-05-07T20:11:14.2334761Z libcudart.so.12 => not found 2025-05-07T20:11:14.2334890Z libtorch_cpu.so => not found 2025-05-07T20:11:14.2334984Z libtorch_cuda.so => not found 2025-05-07T20:11:14.2335086Z libtorch.so => not found 2025-05-07T20:11:14.2335091Z 2025-05-07T20:11:14.2335197Z [CHECK] Displaying ELF information: 2025-05-07T20:11:14.2335441Z + readelf -d ./_skbuild/linux-x86_64-3.13/cmake-build/fbgemm_gpu_tbe_inference.so 2025-05-07T20:11:14.2335446Z 2025-05-07T20:11:14.2335450Z 2025-05-07T20:11:14.2335633Z Dynamic section at offset 0x1af13978 contains 40 entries: 2025-05-07T20:11:14.2335745Z Tag Type Name/Value 2025-05-07T20:11:14.2335938Z 0x0000000000000001 (NEEDED) Shared library: [libc10.so] 2025-05-07T20:11:14.2336157Z 0x0000000000000001 (NEEDED) Shared library: [libc10_cuda.so] 2025-05-07T20:11:14.2336352Z 0x0000000000000001 (NEEDED) Shared library: [fbgemm.so] 2025-05-07T20:11:14.2336570Z 0x0000000000000001 (NEEDED) Shared library: [fbgemm_gpu_tbe_cache.so] 2025-05-07T20:11:14.2336768Z 0x0000000000000001 (NEEDED) Shared library: [libtorch.so] 2025-05-07T20:11:14.2336981Z 0x0000000000000001 (NEEDED) Shared library: [libtorch_cpu.so] 2025-05-07T20:11:14.2337184Z 0x0000000000000001 (NEEDED) Shared library: [libtorch_cuda.so] 2025-05-07T20:11:14.2337389Z 0x0000000000000001 (NEEDED) Shared library: [libcudart.so.12] 2025-05-07T20:11:14.2337596Z 0x0000000000000001 (NEEDED) Shared library: [libstdc++.so.6] 2025-05-07T20:11:14.2337791Z 0x0000000000000001 (NEEDED) Shared library: [libgcc_s.so.1] 2025-05-07T20:11:14.2337981Z 0x0000000000000001 (NEEDED) Shared library: [libc.so.6] 2025-05-07T20:11:14.2338217Z 0x0000000000000001 (NEEDED) Shared library: [ld-linux-x86-64.so.2] 2025-05-07T20:11:14.2338481Z 0x000000000000000e (SONAME) Library soname: [fbgemm_gpu_tbe_inference.so] 2025-05-07T20:11:14.2338661Z 0x000000000000000f (RPATH) Library rpath: [$ORIGIN] 2025-05-07T20:11:14.2338796Z 0x000000000000000c (INIT) 0x19a000 2025-05-07T20:11:14.2338908Z 0x000000000000000d (FINI) 0x7e3f4c 2025-05-07T20:11:14.2339029Z 0x0000000000000019 (INIT_ARRAY) 0x1af13d58 2025-05-07T20:11:14.2339149Z 0x000000000000001b (INIT_ARRAYSZ) 392 (bytes) 2025-05-07T20:11:14.2339275Z 0x000000000000001a (FINI_ARRAY) 0x1af13ee0 2025-05-07T20:11:14.2339397Z 0x000000000000001c (FINI_ARRAYSZ) 8 (bytes) 2025-05-07T20:11:14.2339500Z 0x0000000000000004 (HASH) 0x238 2025-05-07T20:11:14.2339628Z 0x000000006ffffef5 (GNU_HASH) 0x7048 2025-05-07T20:11:14.2339746Z 0x0000000000000005 (STRTAB) 0x2bee8 2025-05-07T20:11:14.2339857Z 0x0000000000000006 (SYMTAB) 0xea58 2025-05-07T20:11:14.2340005Z 0x000000000000000a (STRSZ) 1363139 (bytes) 2025-05-07T20:11:14.2340124Z 0x000000000000000b (SYMENT) 24 (bytes) 2025-05-07T20:11:14.2340237Z 0x0000000000000003 (PLTGOT) 0x1af14c38 2025-05-07T20:11:14.2340366Z 0x0000000000000002 (PLTRELSZ) 15648 (bytes) 2025-05-07T20:11:14.2340489Z 0x0000000000000014 (PLTREL) RELA 2025-05-07T20:11:14.2340599Z 0x0000000000000017 (JMPREL) 0x195ff8 2025-05-07T20:11:14.2340704Z 0x0000000000000007 (RELA) 0x17b418 2025-05-07T20:11:14.2340849Z 0x0000000000000008 (RELASZ) 109536 (bytes) 2025-05-07T20:11:14.2340966Z 0x0000000000000009 (RELAENT) 24 (bytes) 2025-05-07T20:11:14.2341064Z 0x0000000000000018 (BIND_NOW) 2025-05-07T20:11:14.2341182Z 0x000000006ffffffb (FLAGS_1) Flags: NOW 2025-05-07T20:11:14.2341310Z 0x000000006ffffffe (VERNEED) 0x17b2b8 2025-05-07T20:11:14.2341421Z 0x000000006fffffff (VERNEEDNUM) 5 2025-05-07T20:11:14.2341576Z 0x000000006ffffff0 (VERSYM) 0x178bac 2025-05-07T20:11:14.2341716Z 0x000000006ffffff9 (RELACOUNT) 79 2025-05-07T20:11:14.2341813Z 0x0000000000000000 (NULL) 0x0 2025-05-07T20:11:14.2341841Z 2025-05-07T20:11:14.2341954Z ################################################################################ 2025-05-07T20:11:14.2341959Z 2025-05-07T20:11:14.2341963Z 2025-05-07T20:11:14.2342082Z ################################################################################ 2025-05-07T20:11:14.2342435Z [CHECK] BUILT LIBRARY: ./_skbuild/linux-x86_64-3.13/cmake-build/fbgemm_gpu_tbe_training_backward_split_host.so 2025-05-07T20:11:14.2342540Z [CHECK] Listing out library size: 2025-05-07T20:11:14.2343003Z + du -h --block-size=1M ./_skbuild/linux-x86_64-3.13/cmake-build/fbgemm_gpu_tbe_training_backward_split_host.so 2025-05-07T20:11:14.2343007Z 2025-05-07T20:11:14.2343257Z 4 ./_skbuild/linux-x86_64-3.13/cmake-build/fbgemm_gpu_tbe_training_backward_split_host.so 2025-05-07T20:11:14.2343263Z 2025-05-07T20:11:14.2343697Z [CHECK] Listing out the GLIBC versions referenced by: ./_skbuild/linux-x86_64-3.13/cmake-build/fbgemm_gpu_tbe_training_backward_split_host.so 2025-05-07T20:11:14.2344243Z + objdump -TC ./_skbuild/linux-x86_64-3.13/cmake-build/fbgemm_gpu_tbe_training_backward_split_host.so | grep GLIBC_ | sed 's/.*GLIBC_\([.0-9]*\).*/GLIBC_\1/g' | sort -Vu | cat 2025-05-07T20:11:14.2344248Z 2025-05-07T20:11:14.2493596Z GLIBC_2.2.5 2025-05-07T20:11:14.2493844Z GLIBC_2.3 2025-05-07T20:11:14.2494108Z GLIBC_2.14 2025-05-07T20:11:14.2494126Z 2025-05-07T20:11:14.2494146Z 2025-05-07T20:11:14.2495621Z [CHECK] Listing out the GLIBCXX versions referenced by: ./_skbuild/linux-x86_64-3.13/cmake-build/fbgemm_gpu_tbe_training_backward_split_host.so 2025-05-07T20:11:14.2497400Z + objdump -TC ./_skbuild/linux-x86_64-3.13/cmake-build/fbgemm_gpu_tbe_training_backward_split_host.so | grep GLIBCXX_ | sed 's/.*GLIBCXX_\([.0-9]*\).*/GLIBCXX_\1/g' | sort -Vu | cat 2025-05-07T20:11:14.2497714Z 2025-05-07T20:11:14.2719303Z GLIBCXX_3.4 2025-05-07T20:11:14.2720445Z GLIBCXX_3.4.9 2025-05-07T20:11:14.2720756Z GLIBCXX_3.4.11 2025-05-07T20:11:14.2720987Z GLIBCXX_3.4.15 2025-05-07T20:11:14.2721264Z GLIBCXX_3.4.18 2025-05-07T20:11:14.2721493Z GLIBCXX_3.4.20 2025-05-07T20:11:14.2721715Z GLIBCXX_3.4.21 2025-05-07T20:11:14.2721954Z GLIBCXX_3.4.29 2025-05-07T20:11:14.2721970Z 2025-05-07T20:11:14.2721983Z 2025-05-07T20:11:14.2739428Z + nm -gDC ./_skbuild/linux-x86_64-3.13/cmake-build/fbgemm_gpu_tbe_training_backward_split_host.so > /tmp/tmp.v5DjIBr6sY.symbols.txt 2025-05-07T20:11:14.2739457Z 2025-05-07T20:11:14.2918835Z 2025-05-07T20:11:14.2942920Z [CHECK] Total Number of symbols: 2654 2025-05-07T20:11:14.2964810Z [CHECK] Number of fbgemm symbols: 1 2025-05-07T20:11:14.2979682Z + nm -gDCu ./_skbuild/linux-x86_64-3.13/cmake-build/fbgemm_gpu_tbe_training_backward_split_host.so > /tmp/tmp.ezPlNyo98o.usymbols.txt 2025-05-07T20:11:14.2979718Z 2025-05-07T20:11:14.3004470Z 2025-05-07T20:11:14.3030420Z [CHECK] Listing out undefined symbols (194 total): 2025-05-07T20:11:14.3047505Z U VTT for std::__cxx11::basic_ostringstream, std::allocator >@GLIBCXX_3.4.21 2025-05-07T20:11:14.3047951Z U _Unwind_Resume@GCC_3.0 2025-05-07T20:11:14.3048299Z U __cxa_allocate_exception@CXXABI_1.3 2025-05-07T20:11:14.3048616Z U __cxa_atexit@GLIBC_2.2.5 2025-05-07T20:11:14.3048914Z U __cxa_begin_catch@CXXABI_1.3 2025-05-07T20:11:14.3049209Z U __cxa_end_catch@CXXABI_1.3 2025-05-07T20:11:14.3049520Z U __cxa_free_exception@CXXABI_1.3 2025-05-07T20:11:14.3049834Z U __cxa_guard_abort@CXXABI_1.3 2025-05-07T20:11:14.3050305Z U __cxa_guard_acquire@CXXABI_1.3 2025-05-07T20:11:14.3050652Z U __cxa_guard_release@CXXABI_1.3 2025-05-07T20:11:14.3050955Z U __cxa_rethrow@CXXABI_1.3 2025-05-07T20:11:14.3051231Z U __cxa_throw@CXXABI_1.3 2025-05-07T20:11:14.3051887Z U __gxx_personality_v0@CXXABI_1.3 2025-05-07T20:11:14.3052292Z U __tls_get_addr@GLIBC_2.3 2025-05-07T20:11:14.3052726Z U adjust_info_B_num_bits(int, int) 2025-05-07T20:11:14.3053259Z U at::RecordFunction::RecordFunction(at::StepCallbacks&&) 2025-05-07T20:11:14.3053642Z U at::RecordFunction::currentThreadId() 2025-05-07T20:11:14.3053963Z U at::RecordFunction::end() 2025-05-07T20:11:14.3054323Z U at::RecordFunction::~RecordFunction() 2025-05-07T20:11:14.3054749Z U at::SavedTensorDefaultHooks::set_tracing(bool) 2025-05-07T20:11:14.3056973Z U at::_ops::_sparse_coo_tensor_unsafe::call(at::Tensor const&, at::Tensor const&, c10::ArrayRef, std::optional, std::optional, std::optional, std::optional, std::optional) 2025-05-07T20:11:14.3057932Z U at::_ops::clamp::call(at::Tensor const&, std::optional const&, std::optional const&) 2025-05-07T20:11:14.3059673Z U at::_ops::empty_like::call(at::Tensor const&, std::optional, std::optional, std::optional, std::optional, std::optional) 2025-05-07T20:11:14.3060465Z U at::_ops::empty_memory_format::call(c10::ArrayRef, std::optional, std::optional, std::optional, std::optional, std::optional) 2025-05-07T20:11:14.3060620Z U at::_ops::fill__Scalar::call(at::Tensor&, c10::Scalar const&) 2025-05-07T20:11:14.3060810Z U at::_ops::flatten_using_ints::call(at::Tensor const&, long, long) 2025-05-07T20:11:14.3060991Z U at::_ops::reshape::call(at::Tensor const&, c10::ArrayRef) 2025-05-07T20:11:14.3061172Z U at::_ops::unsqueeze::call(at::Tensor const&, long) 2025-05-07T20:11:14.3061341Z U at::_ops::view_as::call(at::Tensor const&, at::Tensor const&) 2025-05-07T20:11:14.3061456Z U at::functorch::functorchTLSAccessor() 2025-05-07T20:11:14.3061636Z U at::getStepCallbacksUnlessEmpty(at::RecordScope) 2025-05-07T20:11:14.3061754Z U at::sequence_number::get_and_increment() 2025-05-07T20:11:14.3061851Z U c10::AnyType::get() 2025-05-07T20:11:14.3061958Z U c10::BoolType::get() 2025-05-07T20:11:14.3062125Z U c10::Dispatcher::findSchemaOrThrow(char const*, char const*) 2025-05-07T20:11:14.3062230Z U c10::Dispatcher::realSingleton() 2025-05-07T20:11:14.3062725Z U c10::Dispatcher::runRecordFunction(at::RecordFunction&, std::reference_wrapper, c10::DispatchKey, c10::DispatchKeySet) 2025-05-07T20:11:14.3063311Z U c10::Dispatcher::runRecordFunction(at::RecordFunction&, std::reference_wrapper, c10::DispatchKey, c10::DispatchKeySet, c10::ArrayRef) 2025-05-07T20:11:14.3063661Z U c10::Error::Error(c10::SourceLocation, std::__cxx11::basic_string, std::allocator >) 2025-05-07T20:11:14.3063771Z U c10::Error::what() const 2025-05-07T20:11:14.3063866Z U c10::FloatType::get() 2025-05-07T20:11:14.3063965Z U c10::GradMode::is_enabled() 2025-05-07T20:11:14.3064082Z U c10::GradMode::set_enabled(bool) 2025-05-07T20:11:14.3064229Z U c10::IValue::TagType::get(c10::IValue const&) 2025-05-07T20:11:14.3064342Z U c10::IValue::hash(c10::IValue const&) 2025-05-07T20:11:14.3064461Z U c10::IValue::isBoolList() const 2025-05-07T20:11:14.3064561Z U c10::IValue::isIntList() const 2025-05-07T20:11:14.3064670Z U c10::IValue::isSymIntList() const 2025-05-07T20:11:14.3064805Z U c10::IValue::isTensorList() const 2025-05-07T20:11:14.3064988Z U c10::IValue::reportToTensorTypeError() const 2025-05-07T20:11:14.3065110Z U c10::IntType::get() 2025-05-07T20:11:14.3065552Z U c10::ListType::get(std::__cxx11::basic_string, std::allocator > const&, c10::Type::SingletonOrSharedTypePtr) 2025-05-07T20:11:14.3065722Z U c10::MessageLogger::MessageLogger(char const*, int, int) 2025-05-07T20:11:14.3065843Z U c10::MessageLogger::~MessageLogger() 2025-05-07T20:11:14.3065962Z U c10::OperatorHandle::~OperatorHandle() 2025-05-07T20:11:14.3066086Z U c10::OperatorHandle::~OperatorHandle() 2025-05-07T20:11:14.3066298Z U c10::OptionalType::get(c10::Type::SingletonOrSharedTypePtr) 2025-05-07T20:11:14.3066565Z U c10::SmallVectorBase::grow_pod(void const*, unsigned long, unsigned long) 2025-05-07T20:11:14.3066680Z U c10::StringType::get() 2025-05-07T20:11:14.3066819Z U c10::SymBool::expect_true(char const*, long) const 2025-05-07T20:11:14.3066953Z U c10::SymBool::guard_bool(char const*, long) const 2025-05-07T20:11:14.3067134Z U c10::SymBool::guard_size_oblivious(char const*, long) const 2025-05-07T20:11:14.3067275Z U c10::SymFloat::guard_float(char const*, long) const 2025-05-07T20:11:14.3067423Z U c10::SymFloat::operator/(c10::SymFloat const&) const 2025-05-07T20:11:14.3067818Z U c10::SymInt::SymInt(c10::intrusive_ptr >) 2025-05-07T20:11:14.3067943Z U c10::SymInt::guard_int(char const*, long) const 2025-05-07T20:11:14.3068066Z U c10::SymInt::operator c10::SymFloat() const 2025-05-07T20:11:14.3068237Z U c10::SymInt::operator*(c10::SymInt const&) const 2025-05-07T20:11:14.3068365Z U c10::SymInt::operator/(c10::SymInt const&) const 2025-05-07T20:11:14.3068476Z U c10::SymInt::promote_to_negative() 2025-05-07T20:11:14.3068598Z U c10::SymInt::sym_eq(c10::SymInt const&) const 2025-05-07T20:11:14.3068721Z U c10::SymInt::sym_gt(c10::SymInt const&) const 2025-05-07T20:11:14.3068840Z U c10::SymInt::sym_le(c10::SymInt const&) const 2025-05-07T20:11:14.3068950Z U c10::SymInt::toSymNode() const 2025-05-07T20:11:14.3069038Z U c10::SymIntType::get() 2025-05-07T20:11:14.3069153Z U c10::TensorImpl::requires_grad() const 2025-05-07T20:11:14.3069249Z U c10::TensorType::get() 2025-05-07T20:11:14.3069373Z U c10::UndefinedTensorImpl::_singleton 2025-05-07T20:11:14.3069772Z U c10::detail::ListImpl::ListImpl(std::vector >, c10::Type::SingletonOrSharedTypePtr) 2025-05-07T20:11:14.3070263Z U c10::detail::infer_schema::make_function_schema(c10::ArrayRef, c10::ArrayRef) 2025-05-07T20:11:14.3070499Z U c10::detail::torchCheckFail(char const*, char const*, unsigned int, char const*) 2025-05-07T20:11:14.3070949Z U c10::detail::torchCheckFail(char const*, char const*, unsigned int, std::__cxx11::basic_string, std::allocator > const&) 2025-05-07T20:11:14.3071283Z U c10::detail::torchInternalAssertFail(char const*, char const*, unsigned int, char const*, char const*) 2025-05-07T20:11:14.3071813Z U c10::detail::torchInternalAssertFail(char const*, char const*, unsigned int, char const*, std::__cxx11::basic_string, std::allocator > const&) 2025-05-07T20:11:14.3072140Z U c10::impl::OperatorEntry::assertSignatureIsCorrect(c10::impl::CppSignature const&, bool) const 2025-05-07T20:11:14.3072353Z U c10::impl::OperatorEntry::reportError(c10::DispatchKey) const 2025-05-07T20:11:14.3072493Z U c10::impl::raw_local_dispatch_key_set 2025-05-07T20:11:14.3072644Z U c10::initializeFunctionalityOffsetsAndMasks() 2025-05-07T20:11:14.3073007Z U c10::ivalue::ConstantString::create(std::__cxx11::basic_string, std::allocator >) 2025-05-07T20:11:14.3073122Z U c10::operator-(c10::SymInt const&, int) 2025-05-07T20:11:14.3073275Z U c10::operator<<(std::ostream&, c10::OperatorName const&) 2025-05-07T20:11:14.3073429Z U c10::operator<<(std::ostream&, c10::SymFloat const&) 2025-05-07T20:11:14.3073555Z U c10::operator<<(std::ostream&, c10::SymInt const&) 2025-05-07T20:11:14.3073736Z U caffe2::TypeMeta::error_unsupported_typemeta(caffe2::TypeMeta) 2025-05-07T20:11:14.3073870Z U caffe2::TypeMeta::typeMetaDatas() 2025-05-07T20:11:14.3074113Z U fbgemm_gpu::config::is_feature_enabled(fbgemm_gpu::config::FeatureGateName const&) 2025-05-07T20:11:14.3074251Z U free@GLIBC_2.2.5 2025-05-07T20:11:14.3074418Z U long c10::detail::maybe_wrap_dim_slow(long, long, bool) 2025-05-07T20:11:14.3074507Z U memcmp@GLIBC_2.2.5 2025-05-07T20:11:14.3074598Z U memcpy@GLIBC_2.14 2025-05-07T20:11:14.3074685Z U memmove@GLIBC_2.2.5 2025-05-07T20:11:14.3074768Z U memset@GLIBC_2.2.5 2025-05-07T20:11:14.3074910Z U operator delete(void*, unsigned long)@CXXABI_1.3.9 2025-05-07T20:11:14.3075029Z U operator new(unsigned long)@GLIBCXX_3.4 2025-05-07T20:11:14.3075111Z U realloc@GLIBC_2.2.5 2025-05-07T20:11:14.3075389Z U std::_Hash_bytes(void const*, unsigned long, unsigned long)@CXXABI_1.3.5 2025-05-07T20:11:14.3075714Z U std::__cxx11::basic_ostringstream, std::allocator >::basic_ostringstream() 2025-05-07T20:11:14.3076073Z U std::__cxx11::basic_ostringstream, std::allocator >::~basic_ostringstream()@GLIBCXX_3.4.21 2025-05-07T20:11:14.3076384Z U std::__cxx11::basic_stringbuf, std::allocator >::str() const &@GLIBCXX_3.4.29 2025-05-07T20:11:14.3076696Z U std::__cxx11::basic_stringstream, std::allocator >::basic_stringstream() 2025-05-07T20:11:14.3077046Z U std::__cxx11::basic_stringstream, std::allocator >::~basic_stringstream()@GLIBCXX_3.4.21 2025-05-07T20:11:14.3077410Z U std::__detail::_Prime_rehash_policy::_M_need_rehash(unsigned long, unsigned long, unsigned long) const@GLIBCXX_3.4.18 2025-05-07T20:11:14.3077526Z U std::__throw_bad_alloc()@GLIBCXX_3.4 2025-05-07T20:11:14.3077637Z U std::__throw_bad_array_new_length() 2025-05-07T20:11:14.3077787Z U std::__throw_length_error(char const*)@GLIBCXX_3.4 2025-05-07T20:11:14.3077921Z U std::__throw_logic_error(char const*)@GLIBCXX_3.4 2025-05-07T20:11:14.3078081Z U std::__throw_out_of_range_fmt(char const*, ...)@GLIBCXX_3.4.20 2025-05-07T20:11:14.3078205Z U std::__throw_system_error(int)@GLIBCXX_3.4.11 2025-05-07T20:11:14.3078348Z U std::bad_weak_ptr::~bad_weak_ptr()@GLIBCXX_3.4.15 2025-05-07T20:11:14.3078568Z U std::basic_ios >::clear(std::_Ios_Iostate)@GLIBCXX_3.4 2025-05-07T20:11:14.3078884Z U std::basic_ios >::init(std::basic_streambuf >*)@GLIBCXX_3.4 2025-05-07T20:11:14.3079454Z U std::basic_ostream >& std::__ostream_insert >(std::basic_ostream >&, char const*, long)@GLIBCXX_3.4.9 2025-05-07T20:11:14.3079983Z U std::basic_ostream >& std::operator<< >(std::basic_ostream >&, char const*)@GLIBCXX_3.4 2025-05-07T20:11:14.3080113Z U std::exception::~exception()@GLIBCXX_3.4 2025-05-07T20:11:14.3080223Z U std::ios_base::Init::Init()@GLIBCXX_3.4 2025-05-07T20:11:14.3080342Z U std::ios_base::Init::~Init()@GLIBCXX_3.4 2025-05-07T20:11:14.3080458Z U std::ios_base::ios_base()@GLIBCXX_3.4 2025-05-07T20:11:14.3080565Z U std::ios_base::~ios_base()@GLIBCXX_3.4 2025-05-07T20:11:14.3080673Z U std::locale::locale()@GLIBCXX_3.4 2025-05-07T20:11:14.3080788Z U std::locale::~locale()@GLIBCXX_3.4 2025-05-07T20:11:14.3080959Z U std::ostream& std::ostream::_M_insert(long)@GLIBCXX_3.4.9 2025-05-07T20:11:14.3081183Z U std::ostream& std::ostream::_M_insert(unsigned long)@GLIBCXX_3.4.9 2025-05-07T20:11:14.3081316Z U std::ostream::operator<<(int)@GLIBCXX_3.4 2025-05-07T20:11:14.3081467Z U std::out_of_range::out_of_range(char const*)@GLIBCXX_3.4.21 2025-05-07T20:11:14.3081601Z U std::out_of_range::~out_of_range()@GLIBCXX_3.4 2025-05-07T20:11:14.3082017Z U std::runtime_error::runtime_error(std::__cxx11::basic_string, std::allocator > const&)@GLIBCXX_3.4.21 2025-05-07T20:11:14.3082146Z U std::runtime_error::~runtime_error()@GLIBCXX_3.4 2025-05-07T20:11:14.3082247Z U std::terminate()@GLIBCXX_3.4 2025-05-07T20:11:14.3082337Z U strcmp@GLIBC_2.2.5 2025-05-07T20:11:14.3082463Z U strlen@GLIBC_2.2.5 2025-05-07T20:11:14.3082577Z U torch::CppFunction::~CppFunction() 2025-05-07T20:11:14.3083128Z U torch::Library::Library(torch::Library::Kind, std::__cxx11::basic_string, std::allocator >, std::optional, char const*, unsigned int) 2025-05-07T20:11:14.3083559Z U torch::Library::_def(c10::FunctionSchema&&, c10::OperatorName*, std::vector > const&, torch::_RegisterOrVerify) & 2025-05-07T20:11:14.3083793Z U torch::Library::_impl(char const*, torch::CppFunction&&, torch::_RegisterOrVerify) & 2025-05-07T20:11:14.3083930Z U torch::autograd::AnomalyMode::_enabled 2025-05-07T20:11:14.3084203Z U torch::autograd::AutogradContext::AutogradContext(torch::dynamo::autograd::PackedArgs&) 2025-05-07T20:11:14.3084373Z U torch::autograd::AutogradContext::get_and_bump_dirty() const 2025-05-07T20:11:14.3084575Z U torch::autograd::AutogradContext::get_non_differentiable() const 2025-05-07T20:11:14.3084752Z U torch::autograd::AutogradContext::get_saved_variables() const 2025-05-07T20:11:14.3085072Z U torch::autograd::AutogradContext::save_for_backward(std::vector >) 2025-05-07T20:11:14.3085226Z U torch::autograd::AutogradContext::save_variables() 2025-05-07T20:11:14.3085402Z U torch::autograd::ForwardADLevel::try_get_by_idx(unsigned long) 2025-05-07T20:11:14.3085568Z U torch::autograd::InputMetadata::shape_as_dim_vector() const 2025-05-07T20:11:14.3085686Z U torch::autograd::Node::assign_parent() 2025-05-07T20:11:14.3085793Z U torch::autograd::Node::metadata() 2025-05-07T20:11:14.3085918Z U torch::autograd::Node::name[abi:cxx11]() const 2025-05-07T20:11:14.3086146Z U torch::autograd::SavedVariable::SavedVariable(at::Tensor const&, bool, bool) 2025-05-07T20:11:14.3086437Z U torch::autograd::SavedVariable::unpack(std::shared_ptr) const 2025-05-07T20:11:14.3086622Z U torch::autograd::VariableInfo::VariableInfo() 2025-05-07T20:11:14.3086817Z U torch::autograd::VariableInfo::VariableInfo(at::Tensor const&, bool) 2025-05-07T20:11:14.3087027Z U torch::autograd::VariableInfo::zeros(c10::OptionalDeviceGuard&) const 2025-05-07T20:11:14.3089988Z U torch::autograd::_wrap_outputs(std::vector > const&, std::unordered_set, std::equal_to, std::allocator > const&, std::unordered_set, std::equal_to, std::allocator > const&, c10::ArrayRef >, std::shared_ptr const&, std::function > (std::vector >, std::vector >)> const&, std::unordered_set, std::equal_to, std::allocator > const&, std::function const&) 2025-05-07T20:11:14.3090228Z U torch::autograd::deleteNode(torch::autograd::Node*) 2025-05-07T20:11:14.3090386Z U torch::autograd::get_current_graph_task_exec_info() 2025-05-07T20:11:14.3090569Z U torch::autograd::impl::gradient_edge(at::Tensor const&) 2025-05-07T20:11:14.3091361Z U torch::autograd::profiler::record_function_enter_new(std::__cxx11::basic_string, std::allocator > const&, std::optional, std::allocator > > const&) 2025-05-07T20:11:14.3091552Z U torch::dynamo::autograd::getPyCompilerInterface() 2025-05-07T20:11:14.3091979Z U torch::dynamo::autograd::get_input_metadata(std::vector > const&) 2025-05-07T20:11:14.3092344Z U torch::jit::parseSchema(std::__cxx11::basic_string, std::allocator > const&, bool) 2025-05-07T20:11:14.3092463Z U typeinfo for c10::Error 2025-05-07T20:11:14.3092606Z U typeinfo for std::bad_weak_ptr@GLIBCXX_3.4.15 2025-05-07T20:11:14.3092734Z U typeinfo for std::exception@GLIBCXX_3.4 2025-05-07T20:11:14.3092858Z U typeinfo for std::out_of_range@GLIBCXX_3.4 2025-05-07T20:11:14.3093011Z U typeinfo for std::runtime_error@GLIBCXX_3.4 2025-05-07T20:11:14.3093135Z U typeinfo for torch::autograd::Node 2025-05-07T20:11:14.3093283Z U vtable for __cxxabiv1::__class_type_info@CXXABI_1.3 2025-05-07T20:11:14.3093455Z U vtable for __cxxabiv1::__function_type_info@CXXABI_1.3 2025-05-07T20:11:14.3093609Z U vtable for __cxxabiv1::__pointer_type_info@CXXABI_1.3 2025-05-07T20:11:14.3093770Z U vtable for __cxxabiv1::__si_class_type_info@CXXABI_1.3 2025-05-07T20:11:14.3093875Z U vtable for c10::Error 2025-05-07T20:11:14.3094223Z U vtable for std::__cxx11::basic_ostringstream, std::allocator >@GLIBCXX_3.4.21 2025-05-07T20:11:14.3094552Z U vtable for std::__cxx11::basic_stringbuf, std::allocator >@GLIBCXX_3.4.21 2025-05-07T20:11:14.3094697Z U vtable for std::bad_weak_ptr@GLIBCXX_3.4.15 2025-05-07T20:11:14.3094893Z U vtable for std::basic_ios >@GLIBCXX_3.4 2025-05-07T20:11:14.3095120Z U vtable for std::basic_streambuf >@GLIBCXX_3.4 2025-05-07T20:11:14.3095281Z U vtable for torch::autograd::Node 2025-05-07T20:11:14.3095455Z w TLS init function for c10::impl::raw_local_dispatch_key_set 2025-05-07T20:11:14.3095603Z w _ITM_deregisterTMCloneTable 2025-05-07T20:11:14.3095753Z w _ITM_registerTMCloneTable 2025-05-07T20:11:14.3095854Z w __cxa_finalize@GLIBC_2.2.5 2025-05-07T20:11:14.3095948Z w __gmon_start__ 2025-05-07T20:11:14.3096057Z w __pthread_key_create 2025-05-07T20:11:14.3096164Z w pthread_mutex_lock@GLIBC_2.2.5 2025-05-07T20:11:14.3096270Z w pthread_mutex_unlock@GLIBC_2.2.5 2025-05-07T20:11:14.3096415Z [CHECK] Listing out external shared libraries linked: 2025-05-07T20:11:14.3096715Z + ldd ./_skbuild/linux-x86_64-3.13/cmake-build/fbgemm_gpu_tbe_training_backward_split_host.so 2025-05-07T20:11:14.3096723Z 2025-05-07T20:11:14.3096852Z linux-vdso.so.1 (0x00007ffebefa9000) 2025-05-07T20:11:14.3096942Z libc10.so => not found 2025-05-07T20:11:14.3097414Z fbgemm_gpu_config.so => /__w/FBGEMM/FBGEMM/fbgemm_gpu/./_skbuild/linux-x86_64-3.13/cmake-build/fbgemm_gpu_config.so (0x00007f904dbd4000) 2025-05-07T20:11:14.3097987Z fbgemm_gpu_tbe_utils.so => /__w/FBGEMM/FBGEMM/fbgemm_gpu/./_skbuild/linux-x86_64-3.13/cmake-build/fbgemm_gpu_tbe_utils.so (0x00007f904c600000) 2025-05-07T20:11:14.3098072Z libtorch.so => not found 2025-05-07T20:11:14.3098172Z libtorch_cpu.so => not found 2025-05-07T20:11:14.3098264Z libtorch_cuda.so => not found 2025-05-07T20:11:14.3098412Z libstdc++.so.6 => /lib64/libstdc++.so.6 (0x00007f904c39c000) 2025-05-07T20:11:14.3098560Z libgcc_s.so.1 => /lib64/libgcc_s.so.1 (0x00007f904d7d2000) 2025-05-07T20:11:14.3098674Z libc.so.6 => /lib64/libc.so.6 (0x00007f904c194000) 2025-05-07T20:11:14.3098788Z /lib64/ld-linux-x86-64.so.2 (0x00007f904dbe4000) 2025-05-07T20:11:14.3098867Z libc10.so => not found 2025-05-07T20:11:14.3098964Z libtorch_cpu.so => not found 2025-05-07T20:11:14.3099077Z libtorch_cuda.so => not found 2025-05-07T20:11:14.3099159Z libtorch.so => not found 2025-05-07T20:11:14.3099253Z libtorch.so => not found 2025-05-07T20:11:14.3099329Z libc10.so => not found 2025-05-07T20:11:14.3099413Z libc10_cuda.so => not found 2025-05-07T20:11:14.3099500Z libtorch_cpu.so => not found 2025-05-07T20:11:14.3099594Z libtorch_cuda.so => not found 2025-05-07T20:11:14.3099681Z libcudart.so.12 => not found 2025-05-07T20:11:14.3099793Z libm.so.6 => /lib64/libm.so.6 (0x00007f904c0b9000) 2025-05-07T20:11:14.3099797Z 2025-05-07T20:11:14.3099906Z [CHECK] Displaying ELF information: 2025-05-07T20:11:14.3100197Z + readelf -d ./_skbuild/linux-x86_64-3.13/cmake-build/fbgemm_gpu_tbe_training_backward_split_host.so 2025-05-07T20:11:14.3100202Z 2025-05-07T20:11:14.3127930Z 2025-05-07T20:11:14.3128734Z Dynamic section at offset 0x39abb0 contains 38 entries: 2025-05-07T20:11:14.3128872Z Tag Type Name/Value 2025-05-07T20:11:14.3129083Z 0x0000000000000001 (NEEDED) Shared library: [libc10.so] 2025-05-07T20:11:14.3129338Z 0x0000000000000001 (NEEDED) Shared library: [fbgemm_gpu_config.so] 2025-05-07T20:11:14.3129575Z 0x0000000000000001 (NEEDED) Shared library: [fbgemm_gpu_tbe_utils.so] 2025-05-07T20:11:14.3129775Z 0x0000000000000001 (NEEDED) Shared library: [libtorch.so] 2025-05-07T20:11:14.3129976Z 0x0000000000000001 (NEEDED) Shared library: [libtorch_cpu.so] 2025-05-07T20:11:14.3130283Z 0x0000000000000001 (NEEDED) Shared library: [libtorch_cuda.so] 2025-05-07T20:11:14.3130486Z 0x0000000000000001 (NEEDED) Shared library: [libstdc++.so.6] 2025-05-07T20:11:14.3130680Z 0x0000000000000001 (NEEDED) Shared library: [libgcc_s.so.1] 2025-05-07T20:11:14.3130883Z 0x0000000000000001 (NEEDED) Shared library: [libc.so.6] 2025-05-07T20:11:14.3131106Z 0x0000000000000001 (NEEDED) Shared library: [ld-linux-x86-64.so.2] 2025-05-07T20:11:14.3131586Z 0x000000000000000e (SONAME) Library soname: [fbgemm_gpu_tbe_training_backward_split_host.so] 2025-05-07T20:11:14.3131785Z 0x000000000000000f (RPATH) Library rpath: [$ORIGIN] 2025-05-07T20:11:14.3131957Z 0x000000000000000c (INIT) 0xb9000 2025-05-07T20:11:14.3132139Z 0x000000000000000d (FINI) 0x33effc 2025-05-07T20:11:14.3132256Z 0x0000000000000019 (INIT_ARRAY) 0x397b28 2025-05-07T20:11:14.3132392Z 0x000000000000001b (INIT_ARRAYSZ) 304 (bytes) 2025-05-07T20:11:14.3132504Z 0x000000000000001a (FINI_ARRAY) 0x397c58 2025-05-07T20:11:14.3132624Z 0x000000000000001c (FINI_ARRAYSZ) 8 (bytes) 2025-05-07T20:11:14.3132739Z 0x0000000000000004 (HASH) 0x200 2025-05-07T20:11:14.3132853Z 0x000000006ffffef5 (GNU_HASH) 0x3b08 2025-05-07T20:11:14.3132964Z 0x0000000000000005 (STRTAB) 0x17258 2025-05-07T20:11:14.3133071Z 0x0000000000000006 (SYMTAB) 0x7970 2025-05-07T20:11:14.3133226Z 0x000000000000000a (STRSZ) 529940 (bytes) 2025-05-07T20:11:14.3133354Z 0x000000000000000b (SYMENT) 24 (bytes) 2025-05-07T20:11:14.3133480Z 0x0000000000000003 (PLTGOT) 0x39ae50 2025-05-07T20:11:14.3133654Z 0x0000000000000002 (PLTRELSZ) 14112 (bytes) 2025-05-07T20:11:14.3133772Z 0x0000000000000014 (PLTREL) RELA 2025-05-07T20:11:14.3133892Z 0x0000000000000017 (JMPREL) 0xb52c8 2025-05-07T20:11:14.3134044Z 0x0000000000000007 (RELA) 0x99e60 2025-05-07T20:11:14.3134323Z 0x0000000000000008 (RELASZ) 111720 (bytes) 2025-05-07T20:11:14.3134448Z 0x0000000000000009 (RELAENT) 24 (bytes) 2025-05-07T20:11:14.3134570Z 0x0000000000000018 (BIND_NOW) 2025-05-07T20:11:14.3134699Z 0x000000006ffffffb (FLAGS_1) Flags: NOW 2025-05-07T20:11:14.3134815Z 0x000000006ffffffe (VERNEED) 0x99d30 2025-05-07T20:11:14.3134922Z 0x000000006fffffff (VERNEEDNUM) 4 2025-05-07T20:11:14.3135108Z 0x000000006ffffff0 (VERSYM) 0x9886c 2025-05-07T20:11:14.3135221Z 0x000000006ffffff9 (RELACOUNT) 40 2025-05-07T20:11:14.3135322Z 0x0000000000000000 (NULL) 0x0 2025-05-07T20:11:14.3135329Z 2025-05-07T20:11:14.3135477Z ################################################################################ 2025-05-07T20:11:14.3135482Z 2025-05-07T20:11:14.3135486Z 2025-05-07T20:11:14.3135591Z ################################################################################ 2025-05-07T20:11:14.3135899Z [CHECK] BUILT LIBRARY: ./_skbuild/linux-x86_64-3.13/cmake-build/fbgemm_gpu_tbe_training_forward.so 2025-05-07T20:11:14.3136038Z [CHECK] Listing out library size: 2025-05-07T20:11:14.3136336Z + du -h --block-size=1M ./_skbuild/linux-x86_64-3.13/cmake-build/fbgemm_gpu_tbe_training_forward.so 2025-05-07T20:11:14.3136341Z 2025-05-07T20:11:14.3143374Z 343 ./_skbuild/linux-x86_64-3.13/cmake-build/fbgemm_gpu_tbe_training_forward.so 2025-05-07T20:11:14.3143692Z 2025-05-07T20:11:14.3145360Z [CHECK] Listing out the GLIBC versions referenced by: ./_skbuild/linux-x86_64-3.13/cmake-build/fbgemm_gpu_tbe_training_forward.so 2025-05-07T20:11:14.3146112Z + objdump -TC ./_skbuild/linux-x86_64-3.13/cmake-build/fbgemm_gpu_tbe_training_forward.so | grep GLIBC_ | sed 's/.*GLIBC_\([.0-9]*\).*/GLIBC_\1/g' | sort -Vu | cat 2025-05-07T20:11:14.3146120Z 2025-05-07T20:11:14.4103035Z GLIBC_2.2.5 2025-05-07T20:11:14.4103954Z GLIBC_2.3 2025-05-07T20:11:14.4104214Z GLIBC_2.14 2025-05-07T20:11:14.4104261Z 2025-05-07T20:11:14.4104274Z 2025-05-07T20:11:14.4105633Z [CHECK] Listing out the GLIBCXX versions referenced by: ./_skbuild/linux-x86_64-3.13/cmake-build/fbgemm_gpu_tbe_training_forward.so 2025-05-07T20:11:14.4107292Z + objdump -TC ./_skbuild/linux-x86_64-3.13/cmake-build/fbgemm_gpu_tbe_training_forward.so | grep GLIBCXX_ | sed 's/.*GLIBCXX_\([.0-9]*\).*/GLIBCXX_\1/g' | sort -Vu | cat 2025-05-07T20:11:14.4107311Z 2025-05-07T20:11:14.5058194Z GLIBCXX_3.4 2025-05-07T20:11:14.5058664Z GLIBCXX_3.4.9 2025-05-07T20:11:14.5058979Z GLIBCXX_3.4.20 2025-05-07T20:11:14.5059220Z GLIBCXX_3.4.21 2025-05-07T20:11:14.5059758Z GLIBCXX_3.4.29 2025-05-07T20:11:14.5059781Z 2025-05-07T20:11:14.5059912Z 2025-05-07T20:11:14.5079849Z + nm -gDC ./_skbuild/linux-x86_64-3.13/cmake-build/fbgemm_gpu_tbe_training_forward.so > /tmp/tmp.3o8UcJABRv.symbols.txt 2025-05-07T20:11:14.5079875Z 2025-05-07T20:11:14.6009027Z 2025-05-07T20:11:14.6052280Z [CHECK] Total Number of symbols: 12731 2025-05-07T20:11:14.6098281Z [CHECK] Number of fbgemm symbols: 5268 2025-05-07T20:11:14.6117676Z + nm -gDCu ./_skbuild/linux-x86_64-3.13/cmake-build/fbgemm_gpu_tbe_training_forward.so > /tmp/tmp.lflL05P92d.usymbols.txt 2025-05-07T20:11:14.6119243Z 2025-05-07T20:11:14.6172299Z 2025-05-07T20:11:14.6212133Z [CHECK] Listing out undefined symbols (178 total): 2025-05-07T20:11:14.6232783Z U VTT for std::__cxx11::basic_ostringstream, std::allocator >@GLIBCXX_3.4.21 2025-05-07T20:11:14.6234464Z U _Unwind_Resume@GCC_3.0 2025-05-07T20:11:14.6235665Z U __cudaPopCallConfiguration@libcudart.so.12 2025-05-07T20:11:14.6236856Z U __cudaPushCallConfiguration@libcudart.so.12 2025-05-07T20:11:14.6237981Z U __cudaRegisterFatBinary@libcudart.so.12 2025-05-07T20:11:14.6239103Z U __cudaRegisterFatBinaryEnd@libcudart.so.12 2025-05-07T20:11:14.6240184Z U __cudaRegisterFunction@libcudart.so.12 2025-05-07T20:11:14.6241231Z U __cudaRegisterVar@libcudart.so.12 2025-05-07T20:11:14.6241836Z U __cudaUnregisterFatBinary@libcudart.so.12 2025-05-07T20:11:14.6242241Z U __cxa_allocate_exception@CXXABI_1.3 2025-05-07T20:11:14.6242585Z U __cxa_atexit@GLIBC_2.2.5 2025-05-07T20:11:14.6242936Z U __cxa_begin_catch@CXXABI_1.3 2025-05-07T20:11:14.6243289Z U __cxa_end_catch@CXXABI_1.3 2025-05-07T20:11:14.6243627Z U __cxa_free_exception@CXXABI_1.3 2025-05-07T20:11:14.6244191Z U __cxa_guard_abort@CXXABI_1.3 2025-05-07T20:11:14.6244650Z U __cxa_guard_acquire@CXXABI_1.3 2025-05-07T20:11:14.6245017Z U __cxa_guard_release@CXXABI_1.3 2025-05-07T20:11:14.6245343Z U __cxa_rethrow@CXXABI_1.3 2025-05-07T20:11:14.6245675Z U __cxa_throw@CXXABI_1.3 2025-05-07T20:11:14.6245993Z U __gxx_personality_v0@CXXABI_1.3 2025-05-07T20:11:14.6246338Z U __tls_get_addr@GLIBC_2.3 2025-05-07T20:11:14.6246752Z U at::RecordFunction::RecordFunction(at::StepCallbacks&&) 2025-05-07T20:11:14.6247169Z U at::RecordFunction::~RecordFunction() 2025-05-07T20:11:14.6247720Z U at::Tensor fbgemm_gpu::reshape_vbe_offsets(at::Tensor const&, at::Tensor const&, long, int) 2025-05-07T20:11:14.6248422Z U at::Tensor fbgemm_gpu::reshape_vbe_offsets(at::Tensor const&, at::Tensor const&, long, int) 2025-05-07T20:11:14.6249236Z U at::Tensor::index(std::initializer_list) const 2025-05-07T20:11:14.6249880Z U at::Tensor::index_put_(std::initializer_list, at::Tensor const&) 2025-05-07T20:11:14.6251066Z U at::_ops::empty_memory_format::call(c10::ArrayRef, std::optional, std::optional, std::optional, std::optional, std::optional) 2025-05-07T20:11:14.6252106Z U at::_ops::flatten_using_ints::call(at::Tensor const&, long, long) 2025-05-07T20:11:14.6252631Z U at::_ops::reshape::call(at::Tensor const&, c10::ArrayRef) 2025-05-07T20:11:14.6253131Z U at::_ops::select_int::call(at::Tensor const&, long, c10::SymInt) 2025-05-07T20:11:14.6253621Z U at::getStepCallbacksUnlessEmpty(at::RecordScope) 2025-05-07T20:11:14.6254078Z U c10::BFloat16* at::TensorBase::data_ptr() const 2025-05-07T20:11:14.6254701Z U c10::BFloat16* at::TensorBase::mutable_data_ptr() const 2025-05-07T20:11:14.6255212Z U c10::BoolType::get() 2025-05-07T20:11:14.6255631Z U c10::DeviceTypeName[abi:cxx11](c10::DeviceType, bool) 2025-05-07T20:11:14.6256116Z U c10::Dispatcher::findSchemaOrThrow(char const*, char const*) 2025-05-07T20:11:14.6256529Z U c10::Dispatcher::realSingleton() 2025-05-07T20:11:14.6257630Z U c10::Dispatcher::runRecordFunction(at::RecordFunction&, std::reference_wrapper, c10::DispatchKey, c10::DispatchKeySet) 2025-05-07T20:11:14.6258881Z U c10::Dispatcher::runRecordFunction(at::RecordFunction&, std::reference_wrapper, c10::DispatchKey, c10::DispatchKeySet, c10::ArrayRef) 2025-05-07T20:11:14.6259981Z U c10::Error::Error(c10::SourceLocation, std::__cxx11::basic_string, std::allocator >) 2025-05-07T20:11:14.6260584Z U c10::Error::what() const 2025-05-07T20:11:14.6260918Z U c10::FloatType::get() 2025-05-07T20:11:14.6261279Z U c10::Half* at::TensorBase::data_ptr() const 2025-05-07T20:11:14.6261745Z U c10::Half* at::TensorBase::mutable_data_ptr() const 2025-05-07T20:11:14.6262182Z U c10::IValue::reportToTensorTypeError() const 2025-05-07T20:11:14.6262581Z U c10::IntType::get() 2025-05-07T20:11:14.6262953Z U c10::MessageLogger::MessageLogger(char const*, int, int) 2025-05-07T20:11:14.6263400Z U c10::MessageLogger::~MessageLogger() 2025-05-07T20:11:14.6263804Z U c10::OperatorHandle::~OperatorHandle() 2025-05-07T20:11:14.6264178Z U c10::OperatorHandle::~OperatorHandle() 2025-05-07T20:11:14.6264596Z U c10::SymBool::expect_true(char const*, long) const 2025-05-07T20:11:14.6265045Z U c10::SymBool::guard_bool(char const*, long) const 2025-05-07T20:11:14.6265489Z U c10::SymFloat::guard_float(char const*, long) const 2025-05-07T20:11:14.6266166Z U c10::SymInt::SymInt(c10::intrusive_ptr >) 2025-05-07T20:11:14.6266849Z U c10::SymInt::guard_int(char const*, long) const 2025-05-07T20:11:14.6267273Z U c10::SymInt::operator+=(c10::SymInt const&) 2025-05-07T20:11:14.6267666Z U c10::SymInt::operator/(c10::SymInt const&) const 2025-05-07T20:11:14.6268084Z U c10::SymInt::promote_to_negative() 2025-05-07T20:11:14.6268456Z U c10::SymInt::sym_eq(c10::SymInt const&) const 2025-05-07T20:11:14.6268874Z U c10::SymInt::sym_ge(c10::SymInt const&) const 2025-05-07T20:11:14.6269290Z U c10::SymInt::sym_gt(c10::SymInt const&) const 2025-05-07T20:11:14.6269672Z U c10::SymInt::sym_le(c10::SymInt const&) const 2025-05-07T20:11:14.6270078Z U c10::SymInt::sym_ne(c10::SymInt const&) const 2025-05-07T20:11:14.6270434Z U c10::SymInt::toSymNode() const 2025-05-07T20:11:14.6270790Z U c10::SymIntType::get() 2025-05-07T20:11:14.6271110Z U c10::TensorType::get() 2025-05-07T20:11:14.6271473Z U c10::UndefinedTensorImpl::_singleton 2025-05-07T20:11:14.6272440Z U c10::Warning::Warning(std::variant, c10::SourceLocation const&, std::__cxx11::basic_string, std::allocator >, bool) 2025-05-07T20:11:14.6273399Z U c10::cuda::CUDACachingAllocator::allocator 2025-05-07T20:11:14.6273784Z U c10::cuda::CUDAStream::stream() const 2025-05-07T20:11:14.6274142Z U c10::cuda::ExchangeDevice(signed char) 2025-05-07T20:11:14.6274520Z U c10::cuda::GetDevice(signed char*) 2025-05-07T20:11:14.6274853Z U c10::cuda::MaybeSetDevice(signed char) 2025-05-07T20:11:14.6275219Z U c10::cuda::SetDevice(signed char) 2025-05-07T20:11:14.6275719Z U c10::cuda::c10_cuda_check_implementation(int, char const*, char const*, int, bool) 2025-05-07T20:11:14.6276166Z U c10::cuda::device_count() 2025-05-07T20:11:14.6276513Z U c10::cuda::getCurrentCUDAStream(signed char) 2025-05-07T20:11:14.6276893Z U c10::cuda::getDefaultCUDAStream(signed char) 2025-05-07T20:11:14.6277267Z U c10::cuda::getStreamFromPool(bool, signed char) 2025-05-07T20:11:14.6277659Z U c10::cuda::getStreamFromPool(int, signed char) 2025-05-07T20:11:14.6278042Z U c10::cuda::setCurrentCUDAStream(c10::cuda::CUDAStream) 2025-05-07T20:11:14.6278427Z U c10::cuda::warn_or_error_on_sync() 2025-05-07T20:11:14.6279152Z U c10::detail::infer_schema::make_function_schema(c10::ArrayRef, c10::ArrayRef) 2025-05-07T20:11:14.6279996Z U c10::detail::torchCheckFail(char const*, char const*, unsigned int, char const*) 2025-05-07T20:11:14.6280844Z U c10::detail::torchCheckFail(char const*, char const*, unsigned int, std::__cxx11::basic_string, std::allocator > const&) 2025-05-07T20:11:14.6281769Z U c10::detail::torchInternalAssertFail(char const*, char const*, unsigned int, char const*, char const*) 2025-05-07T20:11:14.6282774Z U c10::detail::torchInternalAssertFail(char const*, char const*, unsigned int, char const*, std::__cxx11::basic_string, std::allocator > const&) 2025-05-07T20:11:14.6283552Z U c10::get_default_dtype() 2025-05-07T20:11:14.6283896Z U c10::impl::GPUTrace::gpuTraceState 2025-05-07T20:11:14.6284233Z U c10::impl::GPUTrace::haveState 2025-05-07T20:11:14.6284767Z U c10::impl::OperatorEntry::assertSignatureIsCorrect(c10::impl::CppSignature const&, bool) const 2025-05-07T20:11:14.6285361Z U c10::impl::OperatorEntry::reportError(c10::DispatchKey) const 2025-05-07T20:11:14.6285787Z U c10::impl::device_guard_impl_registry 2025-05-07T20:11:14.6286114Z U c10::impl::raw_local_dispatch_key_set 2025-05-07T20:11:14.6286508Z U c10::initializeFunctionalityOffsetsAndMasks() 2025-05-07T20:11:14.6286912Z U c10::operator%(c10::SymInt const&, int) 2025-05-07T20:11:14.6287261Z U c10::operator*(c10::SymInt const&, long) 2025-05-07T20:11:14.6287622Z U c10::operator-(c10::SymInt const&, int) 2025-05-07T20:11:14.6287961Z U c10::operator<(c10::SymInt const&, int) 2025-05-07T20:11:14.6288342Z U c10::operator<<(std::ostream&, c10::Device const&) 2025-05-07T20:11:14.6288706Z U c10::operator<<(std::ostream&, c10::DeviceType) 2025-05-07T20:11:14.6289100Z U c10::operator<<(std::ostream&, c10::OperatorName const&) 2025-05-07T20:11:14.6289515Z U c10::operator<<(std::ostream&, c10::SymInt const&) 2025-05-07T20:11:14.6289855Z U c10::warn(c10::Warning const&) 2025-05-07T20:11:14.6290362Z U caffe2::TypeMeta::error_unsupported_typemeta(caffe2::TypeMeta) 2025-05-07T20:11:14.6290981Z U cudaDeviceSynchronize@libcudart.so.12 2025-05-07T20:11:14.6291448Z U cudaEventCreateWithFlags@libcudart.so.12 2025-05-07T20:11:14.6291804Z U cudaEventDestroy@libcudart.so.12 2025-05-07T20:11:14.6292167Z U cudaEventElapsedTime@libcudart.so.12 2025-05-07T20:11:14.6292527Z U cudaEventQuery@libcudart.so.12 2025-05-07T20:11:14.6292864Z U cudaEventRecord@libcudart.so.12 2025-05-07T20:11:14.6293273Z U cudaEventSynchronize@libcudart.so.12 2025-05-07T20:11:14.6293663Z U cudaGetErrorString@libcudart.so.12 2025-05-07T20:11:14.6294063Z U cudaGetLastError@libcudart.so.12 2025-05-07T20:11:14.6294409Z U cudaLaunchKernel@libcudart.so.12 2025-05-07T20:11:14.6294780Z U cudaStreamQuery@libcudart.so.12 2025-05-07T20:11:14.6295149Z U cudaStreamSynchronize@libcudart.so.12 2025-05-07T20:11:14.6295498Z U cudaStreamWaitEvent@libcudart.so.12 2025-05-07T20:11:14.6296029Z U fbgemm_gpu::reshape_vbe_output(at::Tensor const&, long, at::Tensor const&, at::Tensor const&) 2025-05-07T20:11:14.6296537Z U float at::Tensor::item() const 2025-05-07T20:11:14.6296917Z U float* at::TensorBase::data_ptr() const 2025-05-07T20:11:14.6297306Z U float* at::TensorBase::mutable_data_ptr() const 2025-05-07T20:11:14.6297675Z U free@GLIBC_2.2.5 2025-05-07T20:11:14.6297984Z U int at::Tensor::item() const 2025-05-07T20:11:14.6298326Z U int* at::TensorBase::data_ptr() const 2025-05-07T20:11:14.6298708Z U int* at::TensorBase::mutable_data_ptr() const 2025-05-07T20:11:14.6299133Z U long c10::detail::maybe_wrap_dim_slow(long, long, bool) 2025-05-07T20:11:14.6299557Z U long* at::TensorBase::data_ptr() const 2025-05-07T20:11:14.6299966Z U long* at::TensorBase::mutable_data_ptr() const 2025-05-07T20:11:14.6300305Z U memcpy@GLIBC_2.14 2025-05-07T20:11:14.6300609Z U memmove@GLIBC_2.2.5 2025-05-07T20:11:14.6300890Z U memset@GLIBC_2.2.5 2025-05-07T20:11:14.6301244Z U operator delete(void*, unsigned long)@CXXABI_1.3.9 2025-05-07T20:11:14.6301660Z U operator new(unsigned long)@GLIBCXX_3.4 2025-05-07T20:11:14.6302250Z U std::__cxx11::basic_ostringstream, std::allocator >::basic_ostringstream() 2025-05-07T20:11:14.6303103Z U std::__cxx11::basic_ostringstream, std::allocator >::~basic_ostringstream()@GLIBCXX_3.4.21 2025-05-07T20:11:14.6303923Z U std::__cxx11::basic_stringbuf, std::allocator >::str() const &@GLIBCXX_3.4.29 2025-05-07T20:11:14.6304495Z U std::__throw_bad_alloc()@GLIBCXX_3.4 2025-05-07T20:11:14.6304846Z U std::__throw_length_error(char const*)@GLIBCXX_3.4 2025-05-07T20:11:14.6305233Z U std::__throw_logic_error(char const*)@GLIBCXX_3.4 2025-05-07T20:11:14.6305653Z U std::__throw_out_of_range_fmt(char const*, ...)@GLIBCXX_3.4.20 2025-05-07T20:11:14.6306161Z U std::basic_ios >::clear(std::_Ios_Iostate)@GLIBCXX_3.4 2025-05-07T20:11:14.6306855Z U std::basic_ios >::init(std::basic_streambuf >*)@GLIBCXX_3.4 2025-05-07T20:11:14.6307879Z U std::basic_ostream >& std::__ostream_insert >(std::basic_ostream >&, char const*, long)@GLIBCXX_3.4.9 2025-05-07T20:11:14.6309048Z U std::basic_ostream >& std::operator<< >(std::basic_ostream >&, char const*)@GLIBCXX_3.4 2025-05-07T20:11:14.6309793Z U std::exception::~exception()@GLIBCXX_3.4 2025-05-07T20:11:14.6310156Z U std::ios_base::Init::Init()@GLIBCXX_3.4 2025-05-07T20:11:14.6310491Z U std::ios_base::Init::~Init()@GLIBCXX_3.4 2025-05-07T20:11:14.6310843Z U std::ios_base::ios_base()@GLIBCXX_3.4 2025-05-07T20:11:14.6311173Z U std::ios_base::~ios_base()@GLIBCXX_3.4 2025-05-07T20:11:14.6311548Z U std::locale::locale()@GLIBCXX_3.4 2025-05-07T20:11:14.6311932Z U std::locale::~locale()@GLIBCXX_3.4 2025-05-07T20:11:14.6312398Z U std::ostream& std::ostream::_M_insert(long)@GLIBCXX_3.4.9 2025-05-07T20:11:14.6312951Z U std::ostream& std::ostream::_M_insert(unsigned long)@GLIBCXX_3.4.9 2025-05-07T20:11:14.6313431Z U std::ostream::operator<<(int)@GLIBCXX_3.4 2025-05-07T20:11:14.6313800Z U std::terminate()@GLIBCXX_3.4 2025-05-07T20:11:14.6314110Z U strlen@GLIBC_2.2.5 2025-05-07T20:11:14.6314443Z U torch::CppFunction::~CppFunction() 2025-05-07T20:11:14.6315280Z U torch::Library::Library(torch::Library::Kind, std::__cxx11::basic_string, std::allocator >, std::optional, char const*, unsigned int) 2025-05-07T20:11:14.6316439Z U torch::Library::_def(c10::FunctionSchema&&, c10::OperatorName*, std::vector > const&, torch::_RegisterOrVerify) & 2025-05-07T20:11:14.6317290Z U torch::Library::_impl(char const*, torch::CppFunction&&, torch::_RegisterOrVerify) & 2025-05-07T20:11:14.6318031Z U torch::jit::parseSchema(std::__cxx11::basic_string, std::allocator > const&, bool) 2025-05-07T20:11:14.6318597Z U typeinfo for c10::Error 2025-05-07T20:11:14.6318957Z U vtable for __cxxabiv1::__class_type_info@CXXABI_1.3 2025-05-07T20:11:14.6319368Z U vtable for __cxxabiv1::__function_type_info@CXXABI_1.3 2025-05-07T20:11:14.6319807Z U vtable for __cxxabiv1::__si_class_type_info@CXXABI_1.3 2025-05-07T20:11:14.6320162Z U vtable for c10::Error 2025-05-07T20:11:14.6320713Z U vtable for std::__cxx11::basic_ostringstream, std::allocator >@GLIBCXX_3.4.21 2025-05-07T20:11:14.6321531Z U vtable for std::__cxx11::basic_stringbuf, std::allocator >@GLIBCXX_3.4.21 2025-05-07T20:11:14.6322160Z U vtable for std::basic_ios >@GLIBCXX_3.4 2025-05-07T20:11:14.6322695Z U vtable for std::basic_streambuf >@GLIBCXX_3.4 2025-05-07T20:11:14.6323208Z w TLS init function for c10::impl::raw_local_dispatch_key_set 2025-05-07T20:11:14.6323593Z w _ITM_deregisterTMCloneTable 2025-05-07T20:11:14.6323912Z w _ITM_registerTMCloneTable 2025-05-07T20:11:14.6324212Z w __cxa_finalize@GLIBC_2.2.5 2025-05-07T20:11:14.6324505Z w __gmon_start__ 2025-05-07T20:11:14.6324764Z w __pthread_key_create 2025-05-07T20:11:14.6325104Z [CHECK] Listing out external shared libraries linked: 2025-05-07T20:11:14.6325582Z + ldd ./_skbuild/linux-x86_64-3.13/cmake-build/fbgemm_gpu_tbe_training_forward.so 2025-05-07T20:11:14.6325943Z 2025-05-07T20:11:14.6326074Z linux-vdso.so.1 (0x00007ffeef577000) 2025-05-07T20:11:14.6326362Z libc10.so => not found 2025-05-07T20:11:14.6326596Z libc10_cuda.so => not found 2025-05-07T20:11:14.6327264Z fbgemm_gpu_tbe_common.so => /__w/FBGEMM/FBGEMM/fbgemm_gpu/./_skbuild/linux-x86_64-3.13/cmake-build/fbgemm_gpu_tbe_common.so (0x00007f56a9800000) 2025-05-07T20:11:14.6327947Z libtorch.so => not found 2025-05-07T20:11:14.6328193Z libtorch_cpu.so => not found 2025-05-07T20:11:14.6328674Z libtorch_cuda.so => not found 2025-05-07T20:11:14.6328934Z libcudart.so.12 => not found 2025-05-07T20:11:14.6329270Z libstdc++.so.6 => /lib64/libstdc++.so.6 (0x00007f56a959c000) 2025-05-07T20:11:14.6329681Z libgcc_s.so.1 => /lib64/libgcc_s.so.1 (0x00007f56bfb89000) 2025-05-07T20:11:14.6330072Z libc.so.6 => /lib64/libc.so.6 (0x00007f56a9394000) 2025-05-07T20:11:14.6330532Z /lib64/ld-linux-x86-64.so.2 (0x00007f56bfbbd000) 2025-05-07T20:11:14.6330953Z libc10.so => not found 2025-05-07T20:11:14.6331208Z libc10_cuda.so => not found 2025-05-07T20:11:14.6331851Z fbgemm.so => /__w/FBGEMM/FBGEMM/fbgemm_gpu/./_skbuild/linux-x86_64-3.13/cmake-build/fbgemm.so (0x00007f56a8e00000) 2025-05-07T20:11:14.6332781Z fbgemm_gpu_config.so => /__w/FBGEMM/FBGEMM/fbgemm_gpu/./_skbuild/linux-x86_64-3.13/cmake-build/fbgemm_gpu_config.so (0x00007f56bfb7b000) 2025-05-07T20:11:14.6333428Z libtorch.so => not found 2025-05-07T20:11:14.6333693Z libtorch_cpu.so => not found 2025-05-07T20:11:14.6333953Z libtorch_cuda.so => not found 2025-05-07T20:11:14.6334223Z libcudart.so.12 => not found 2025-05-07T20:11:14.6334507Z libm.so.6 => /lib64/libm.so.6 (0x00007f56a8d25000) 2025-05-07T20:11:14.6334826Z libc10.so => not found 2025-05-07T20:11:14.6335328Z asmjit.so => /__w/FBGEMM/FBGEMM/fbgemm_gpu/./_skbuild/linux-x86_64-3.13/cmake-build/asmjit.so (0x00007f56a9b85000) 2025-05-07T20:11:14.6335873Z libtorch.so => not found 2025-05-07T20:11:14.6336130Z libtorch_cpu.so => not found 2025-05-07T20:11:14.6336385Z libtorch_cuda.so => not found 2025-05-07T20:11:14.6336641Z libc10.so => not found 2025-05-07T20:11:14.6336867Z libtorch_cpu.so => not found 2025-05-07T20:11:14.6337134Z libtorch_cuda.so => not found 2025-05-07T20:11:14.6337387Z libtorch.so => not found 2025-05-07T20:11:14.6337640Z libtorch_cpu.so => not found 2025-05-07T20:11:14.6337892Z libtorch_cuda.so => not found 2025-05-07T20:11:14.6338155Z libtorch.so => not found 2025-05-07T20:11:14.6338306Z 2025-05-07T20:11:14.6338422Z [CHECK] Displaying ELF information: 2025-05-07T20:11:14.6339071Z + readelf -d ./_skbuild/linux-x86_64-3.13/cmake-build/fbgemm_gpu_tbe_training_forward.so 2025-05-07T20:11:14.6339453Z 2025-05-07T20:11:14.6377560Z 2025-05-07T20:11:14.6378066Z Dynamic section at offset 0x1569a110 contains 39 entries: 2025-05-07T20:11:14.6378491Z Tag Type Name/Value 2025-05-07T20:11:14.6379099Z 0x0000000000000001 (NEEDED) Shared library: [libc10.so] 2025-05-07T20:11:14.6379596Z 0x0000000000000001 (NEEDED) Shared library: [libc10_cuda.so] 2025-05-07T20:11:14.6380139Z 0x0000000000000001 (NEEDED) Shared library: [fbgemm_gpu_tbe_common.so] 2025-05-07T20:11:14.6380669Z 0x0000000000000001 (NEEDED) Shared library: [libtorch.so] 2025-05-07T20:11:14.6381163Z 0x0000000000000001 (NEEDED) Shared library: [libtorch_cpu.so] 2025-05-07T20:11:14.6381678Z 0x0000000000000001 (NEEDED) Shared library: [libtorch_cuda.so] 2025-05-07T20:11:14.6382184Z 0x0000000000000001 (NEEDED) Shared library: [libcudart.so.12] 2025-05-07T20:11:14.6382696Z 0x0000000000000001 (NEEDED) Shared library: [libstdc++.so.6] 2025-05-07T20:11:14.6383191Z 0x0000000000000001 (NEEDED) Shared library: [libgcc_s.so.1] 2025-05-07T20:11:14.6383683Z 0x0000000000000001 (NEEDED) Shared library: [libc.so.6] 2025-05-07T20:11:14.6384204Z 0x0000000000000001 (NEEDED) Shared library: [ld-linux-x86-64.so.2] 2025-05-07T20:11:14.6384776Z 0x000000000000000e (SONAME) Library soname: [fbgemm_gpu_tbe_training_forward.so] 2025-05-07T20:11:14.6385330Z 0x000000000000000f (RPATH) Library rpath: [$ORIGIN] 2025-05-07T20:11:14.6385719Z 0x000000000000000c (INIT) 0x44b000 2025-05-07T20:11:14.6386083Z 0x000000000000000d (FINI) 0x22530cc 2025-05-07T20:11:14.6386426Z 0x0000000000000019 (INIT_ARRAY) 0x15698508 2025-05-07T20:11:14.6386772Z 0x000000000000001b (INIT_ARRAYSZ) 752 (bytes) 2025-05-07T20:11:14.6387127Z 0x000000000000001a (FINI_ARRAY) 0x156987f8 2025-05-07T20:11:14.6387457Z 0x000000000000001c (FINI_ARRAYSZ) 8 (bytes) 2025-05-07T20:11:14.6387786Z 0x0000000000000004 (HASH) 0x200 2025-05-07T20:11:14.6388098Z 0x000000006ffffef5 (GNU_HASH) 0x10898 2025-05-07T20:11:14.6388426Z 0x0000000000000005 (STRTAB) 0x6f610 2025-05-07T20:11:14.6388745Z 0x0000000000000006 (SYMTAB) 0x24c70 2025-05-07T20:11:14.6389142Z 0x000000000000000a (STRSZ) 3691715 (bytes) 2025-05-07T20:11:14.6389532Z 0x000000000000000b (SYMENT) 24 (bytes) 2025-05-07T20:11:14.6389900Z 0x0000000000000003 (PLTGOT) 0x1569a3c0 2025-05-07T20:11:14.6390288Z 0x0000000000000002 (PLTRELSZ) 10920 (bytes) 2025-05-07T20:11:14.6390629Z 0x0000000000000014 (PLTREL) RELA 2025-05-07T20:11:14.6390964Z 0x0000000000000017 (JMPREL) 0x4484b0 2025-05-07T20:11:14.6391294Z 0x0000000000000007 (RELA) 0x3faf60 2025-05-07T20:11:14.6391661Z 0x0000000000000008 (RELASZ) 316752 (bytes) 2025-05-07T20:11:14.6392031Z 0x0000000000000009 (RELAENT) 24 (bytes) 2025-05-07T20:11:14.6392345Z 0x0000000000000018 (BIND_NOW) 2025-05-07T20:11:14.6392675Z 0x000000006ffffffb (FLAGS_1) Flags: NOW 2025-05-07T20:11:14.6393022Z 0x000000006ffffffe (VERNEED) 0x3fae50 2025-05-07T20:11:14.6393380Z 0x000000006fffffff (VERNEEDNUM) 5 2025-05-07T20:11:14.6393704Z 0x000000006ffffff0 (VERSYM) 0x3f4ad4 2025-05-07T20:11:14.6394057Z 0x000000006ffffff9 (RELACOUNT) 136 2025-05-07T20:11:14.6394375Z 0x0000000000000000 (NULL) 0x0 2025-05-07T20:11:14.6394573Z 2025-05-07T20:11:14.6394681Z ################################################################################ 2025-05-07T20:11:14.6394904Z 2025-05-07T20:11:14.6394908Z 2025-05-07T20:11:14.6395038Z ################################################################################ 2025-05-07T20:11:14.6395542Z [CHECK] BUILT LIBRARY: ./_skbuild/linux-x86_64-3.13/cmake-build/fbgemm_gpu_tbe_index_select.so 2025-05-07T20:11:14.6396050Z [CHECK] Listing out library size: 2025-05-07T20:11:14.6396510Z + du -h --block-size=1M ./_skbuild/linux-x86_64-3.13/cmake-build/fbgemm_gpu_tbe_index_select.so 2025-05-07T20:11:14.6396911Z 2025-05-07T20:11:14.6397232Z 35 ./_skbuild/linux-x86_64-3.13/cmake-build/fbgemm_gpu_tbe_index_select.so 2025-05-07T20:11:14.6397614Z 2025-05-07T20:11:14.6398013Z [CHECK] Listing out the GLIBC versions referenced by: ./_skbuild/linux-x86_64-3.13/cmake-build/fbgemm_gpu_tbe_index_select.so 2025-05-07T20:11:14.6399011Z + objdump -TC ./_skbuild/linux-x86_64-3.13/cmake-build/fbgemm_gpu_tbe_index_select.so | grep GLIBC_ | sed 's/.*GLIBC_\([.0-9]*\).*/GLIBC_\1/g' | sort -Vu | cat 2025-05-07T20:11:14.6399630Z 2025-05-07T20:11:14.6522135Z GLIBC_2.2.5 2025-05-07T20:11:14.6522469Z GLIBC_2.3 2025-05-07T20:11:14.6522982Z GLIBC_2.14 2025-05-07T20:11:14.6523152Z 2025-05-07T20:11:14.6523170Z 2025-05-07T20:11:14.6523616Z [CHECK] Listing out the GLIBCXX versions referenced by: ./_skbuild/linux-x86_64-3.13/cmake-build/fbgemm_gpu_tbe_index_select.so 2025-05-07T20:11:14.6524684Z + objdump -TC ./_skbuild/linux-x86_64-3.13/cmake-build/fbgemm_gpu_tbe_index_select.so | grep GLIBCXX_ | sed 's/.*GLIBCXX_\([.0-9]*\).*/GLIBCXX_\1/g' | sort -Vu | cat 2025-05-07T20:11:14.6525333Z 2025-05-07T20:11:14.6639522Z GLIBCXX_3.4 2025-05-07T20:11:14.6640173Z GLIBCXX_3.4.9 2025-05-07T20:11:14.6640756Z GLIBCXX_3.4.11 2025-05-07T20:11:14.6641469Z GLIBCXX_3.4.15 2025-05-07T20:11:14.6642093Z GLIBCXX_3.4.18 2025-05-07T20:11:14.6642724Z GLIBCXX_3.4.20 2025-05-07T20:11:14.6643255Z GLIBCXX_3.4.21 2025-05-07T20:11:14.6643823Z GLIBCXX_3.4.29 2025-05-07T20:11:14.6645350Z 2025-05-07T20:11:14.6645390Z 2025-05-07T20:11:14.6669951Z + nm -gDC ./_skbuild/linux-x86_64-3.13/cmake-build/fbgemm_gpu_tbe_index_select.so > /tmp/tmp.cnGWlKXpyh.symbols.txt 2025-05-07T20:11:14.6670477Z 2025-05-07T20:11:14.6748166Z 2025-05-07T20:11:14.6772911Z [CHECK] Total Number of symbols: 1477 2025-05-07T20:11:14.6791544Z [CHECK] Number of fbgemm symbols: 213 2025-05-07T20:11:14.6810081Z + nm -gDCu ./_skbuild/linux-x86_64-3.13/cmake-build/fbgemm_gpu_tbe_index_select.so > /tmp/tmp.aAeIOIbTzr.usymbols.txt 2025-05-07T20:11:14.6811874Z 2025-05-07T20:11:14.6830730Z 2025-05-07T20:11:14.6854741Z [CHECK] Listing out undefined symbols (270 total): 2025-05-07T20:11:14.6874078Z U VTT for std::__cxx11::basic_ostringstream, std::allocator >@GLIBCXX_3.4.21 2025-05-07T20:11:14.6875968Z U VTT for std::__cxx11::basic_stringstream, std::allocator >@GLIBCXX_3.4.21 2025-05-07T20:11:14.6876522Z U _Unwind_Resume@GCC_3.0 2025-05-07T20:11:14.6876912Z U __cudaPopCallConfiguration@libcudart.so.12 2025-05-07T20:11:14.6877321Z U __cudaPushCallConfiguration@libcudart.so.12 2025-05-07T20:11:14.6877731Z U __cudaRegisterFatBinary@libcudart.so.12 2025-05-07T20:11:14.6878119Z U __cudaRegisterFatBinaryEnd@libcudart.so.12 2025-05-07T20:11:14.6878497Z U __cudaRegisterFunction@libcudart.so.12 2025-05-07T20:11:14.6878890Z U __cudaRegisterVar@libcudart.so.12 2025-05-07T20:11:14.6879255Z U __cudaUnregisterFatBinary@libcudart.so.12 2025-05-07T20:11:14.6879647Z U __cxa_allocate_exception@CXXABI_1.3 2025-05-07T20:11:14.6879986Z U __cxa_atexit@GLIBC_2.2.5 2025-05-07T20:11:14.6880325Z U __cxa_begin_catch@CXXABI_1.3 2025-05-07T20:11:14.6880663Z U __cxa_end_catch@CXXABI_1.3 2025-05-07T20:11:14.6880974Z U __cxa_free_exception@CXXABI_1.3 2025-05-07T20:11:14.6881310Z U __cxa_guard_abort@CXXABI_1.3 2025-05-07T20:11:14.6881627Z U __cxa_guard_acquire@CXXABI_1.3 2025-05-07T20:11:14.6881960Z U __cxa_guard_release@CXXABI_1.3 2025-05-07T20:11:14.6882276Z U __cxa_rethrow@CXXABI_1.3 2025-05-07T20:11:14.6882612Z U __cxa_thread_atexit@CXXABI_1.3.7 2025-05-07T20:11:14.6882939Z U __cxa_throw@CXXABI_1.3 2025-05-07T20:11:14.6883274Z U __gxx_personality_v0@CXXABI_1.3 2025-05-07T20:11:14.6883602Z U __tls_get_addr@GLIBC_2.3 2025-05-07T20:11:14.6883913Z U adjust_info_B_num_bits(int, int) 2025-05-07T20:11:14.6884364Z U at::Context::deterministicAlgorithms() const 2025-05-07T20:11:14.6884797Z U at::RecordFunction::RecordFunction(at::StepCallbacks&&) 2025-05-07T20:11:14.6885253Z U at::RecordFunction::currentThreadId() 2025-05-07T20:11:14.6885622Z U at::RecordFunction::~RecordFunction() 2025-05-07T20:11:14.6886028Z U at::SavedTensorDefaultHooks::set_tracing(bool) 2025-05-07T20:11:14.6886500Z U at::TensorBase::__dispatch_contiguous(c10::MemoryFormat) const 2025-05-07T20:11:14.6886918Z U at::TensorMaker::make_tensor() 2025-05-07T20:11:14.6887279Z U at::_ops::all::call(at::Tensor const&) 2025-05-07T20:11:14.6887660Z U at::_ops::concat::call(c10::ArrayRef, long) 2025-05-07T20:11:14.6888103Z U at::_ops::copy_::call(at::Tensor&, at::Tensor const&, bool) 2025-05-07T20:11:14.6888987Z U at::_ops::empty_like::call(at::Tensor const&, std::optional, std::optional, std::optional, std::optional, std::optional) 2025-05-07T20:11:14.6890450Z U at::_ops::empty_memory_format::call(c10::ArrayRef, std::optional, std::optional, std::optional, std::optional, std::optional) 2025-05-07T20:11:14.6891818Z U at::_ops::eq_Scalar::call(at::Tensor const&, c10::Scalar const&) 2025-05-07T20:11:14.6892305Z U at::_ops::eq_Tensor::call(at::Tensor const&, at::Tensor const&) 2025-05-07T20:11:14.6892781Z U at::_ops::flatten_using_ints::call(at::Tensor const&, long, long) 2025-05-07T20:11:14.6893278Z U at::_ops::gt_Scalar::call(at::Tensor const&, c10::Scalar const&) 2025-05-07T20:11:14.6893890Z U at::_ops::index_add::call(at::Tensor const&, long, at::Tensor const&, at::Tensor const&, c10::Scalar const&) 2025-05-07T20:11:14.6894569Z U at::_ops::index_select::call(at::Tensor const&, long, at::Tensor const&) 2025-05-07T20:11:14.6895065Z U at::_ops::max::call(at::Tensor const&) 2025-05-07T20:11:14.6895506Z U at::_ops::mul_Tensor::call(at::Tensor const&, at::Tensor const&) 2025-05-07T20:11:14.6896030Z U at::_ops::narrow::call(at::Tensor const&, long, c10::SymInt, c10::SymInt) 2025-05-07T20:11:14.6896564Z U at::_ops::select_int::call(at::Tensor const&, long, c10::SymInt) 2025-05-07T20:11:14.6897101Z U at::_ops::split_with_sizes::call(at::Tensor const&, c10::ArrayRef, long) 2025-05-07T20:11:14.6897803Z U at::_ops::to_dtype::call(at::Tensor const&, c10::ScalarType, bool, bool, std::optional) 2025-05-07T20:11:14.6898889Z U at::_ops::to_dtype_layout::call(at::Tensor const&, std::optional, std::optional, std::optional, std::optional, bool, bool, std::optional) 2025-05-07T20:11:14.6899811Z U at::_ops::view::call(at::Tensor const&, c10::ArrayRef) 2025-05-07T20:11:14.6900304Z U at::_ops::view_as::call(at::Tensor const&, at::Tensor const&) 2025-05-07T20:11:14.6901083Z U at::_ops::zeros::call(c10::ArrayRef, std::optional, std::optional, std::optional, std::optional) 2025-05-07T20:11:14.6902254Z U at::_ops::zeros_like::call(at::Tensor const&, std::optional, std::optional, std::optional, std::optional, std::optional) 2025-05-07T20:11:14.6903206Z U at::cuda::getCurrentDeviceProperties() 2025-05-07T20:11:14.6903575Z U at::functorch::functorchTLSAccessor() 2025-05-07T20:11:14.6903995Z U at::getStepCallbacksUnlessEmpty(at::RecordScope) 2025-05-07T20:11:14.6904471Z U at::globalContext() 2025-05-07T20:11:14.6904815Z U at::has_internal_overlap(at::TensorBase const&) 2025-05-07T20:11:14.6905233Z U at::sequence_number::get_and_increment() 2025-05-07T20:11:14.6905589Z U bool at::Tensor::item() const 2025-05-07T20:11:14.6905943Z U c10::AnyType::get() 2025-05-07T20:11:14.6906339Z U c10::AutogradMetaInterface::~AutogradMetaInterface() 2025-05-07T20:11:14.6906818Z U c10::BFloat16* at::TensorBase::mutable_data_ptr() const 2025-05-07T20:11:14.6907256Z U c10::BoolType::get() 2025-05-07T20:11:14.6907616Z U c10::DeviceTypeName[abi:cxx11](c10::DeviceType, bool) 2025-05-07T20:11:14.6908092Z U c10::Dispatcher::findSchemaOrThrow(char const*, char const*) 2025-05-07T20:11:14.6908494Z U c10::Dispatcher::realSingleton() 2025-05-07T20:11:14.6909263Z U c10::Dispatcher::runRecordFunction(at::RecordFunction&, std::reference_wrapper, c10::DispatchKey, c10::DispatchKeySet) 2025-05-07T20:11:14.6910528Z U c10::Dispatcher::runRecordFunction(at::RecordFunction&, std::reference_wrapper, c10::DispatchKey, c10::DispatchKeySet, c10::ArrayRef) 2025-05-07T20:11:14.6911626Z U c10::Error::Error(c10::SourceLocation, std::__cxx11::basic_string, std::allocator >) 2025-05-07T20:11:14.6912245Z U c10::Error::what() const 2025-05-07T20:11:14.6912595Z U c10::GradMode::is_enabled() 2025-05-07T20:11:14.6934344Z U c10::GradMode::set_enabled(bool) 2025-05-07T20:11:14.6934910Z U c10::Half* at::TensorBase::mutable_data_ptr() const 2025-05-07T20:11:14.6935397Z U c10::IValue::TagType::get(c10::IValue const&) 2025-05-07T20:11:14.6935961Z U c10::IValue::hash(c10::IValue const&) 2025-05-07T20:11:14.6936330Z U c10::IValue::isBoolList() const 2025-05-07T20:11:14.6936743Z U c10::IValue::isIntList() const 2025-05-07T20:11:14.6937124Z U c10::IValue::isSymIntList() const 2025-05-07T20:11:14.6937483Z U c10::IValue::isTensorList() const 2025-05-07T20:11:14.6937856Z U c10::IValue::reportToTensorTypeError() const 2025-05-07T20:11:14.6938237Z U c10::IntType::get() 2025-05-07T20:11:14.6938928Z U c10::ListType::get(std::__cxx11::basic_string, std::allocator > const&, c10::Type::SingletonOrSharedTypePtr) 2025-05-07T20:11:14.6939708Z U c10::MessageLogger::MessageLogger(char const*, int, int) 2025-05-07T20:11:14.6940147Z U c10::MessageLogger::~MessageLogger() 2025-05-07T20:11:14.6940514Z U c10::OperatorHandle::~OperatorHandle() 2025-05-07T20:11:14.6940902Z U c10::OperatorHandle::~OperatorHandle() 2025-05-07T20:11:14.6941420Z U c10::SmallVectorBase::grow_pod(void const*, unsigned long, unsigned long) 2025-05-07T20:11:14.6942009Z U c10::StorageImpl::throw_data_ptr_access_error() const 2025-05-07T20:11:14.6942413Z U c10::StringType::get() 2025-05-07T20:11:14.6942767Z U c10::SymBool::guard_bool(char const*, long) const 2025-05-07T20:11:14.6943450Z U c10::SymInt::SymInt(c10::intrusive_ptr >) 2025-05-07T20:11:14.6944110Z U c10::SymInt::guard_int(char const*, long) const 2025-05-07T20:11:14.6944506Z U c10::SymInt::promote_to_negative() 2025-05-07T20:11:14.6944866Z U c10::SymInt::toSymNode() const 2025-05-07T20:11:14.6945186Z U c10::SymIntType::get() 2025-05-07T20:11:14.6945622Z U c10::SymbolicShapeMeta::init_is_contiguous() const 2025-05-07T20:11:14.6946015Z U c10::TensorImpl::requires_grad() const 2025-05-07T20:11:14.6946720Z U c10::TensorImpl::set_autograd_meta(std::unique_ptr >) 2025-05-07T20:11:14.6947457Z U c10::TensorImpl::throw_data_ptr_access_error() const 2025-05-07T20:11:14.6947826Z U c10::TensorType::get() 2025-05-07T20:11:14.6948245Z U c10::Type::isSubtypeOfExt(c10::Type const&, std::ostream*) const 2025-05-07T20:11:14.6948670Z U c10::Type::is_module() const 2025-05-07T20:11:14.6949037Z U c10::UndefinedTensorImpl::_singleton 2025-05-07T20:11:14.6950014Z U c10::Warning::Warning(std::variant, c10::SourceLocation const&, std::__cxx11::basic_string, std::allocator >, bool) 2025-05-07T20:11:14.6950991Z U c10::cuda::CUDACachingAllocator::allocator 2025-05-07T20:11:14.6951389Z U c10::cuda::CUDAStream::stream() const 2025-05-07T20:11:14.6951747Z U c10::cuda::ExchangeDevice(signed char) 2025-05-07T20:11:14.6952120Z U c10::cuda::GetDevice(signed char*) 2025-05-07T20:11:14.6952491Z U c10::cuda::MaybeSetDevice(signed char) 2025-05-07T20:11:14.6952841Z U c10::cuda::SetDevice(signed char) 2025-05-07T20:11:14.6953337Z U c10::cuda::c10_cuda_check_implementation(int, char const*, char const*, int, bool) 2025-05-07T20:11:14.6953800Z U c10::cuda::device_count() 2025-05-07T20:11:14.6954171Z U c10::cuda::getCurrentCUDAStream(signed char) 2025-05-07T20:11:14.6954643Z U c10::cuda::getDefaultCUDAStream(signed char) 2025-05-07T20:11:14.6955070Z U c10::cuda::getStreamFromPool(bool, signed char) 2025-05-07T20:11:14.6955530Z U c10::cuda::getStreamFromPool(int, signed char) 2025-05-07T20:11:14.6955949Z U c10::cuda::setCurrentCUDAStream(c10::cuda::CUDAStream) 2025-05-07T20:11:14.6956411Z U c10::cuda::warn_or_error_on_sync() 2025-05-07T20:11:14.6957062Z U c10::detail::ListImpl::ListImpl(std::vector >, c10::Type::SingletonOrSharedTypePtr) 2025-05-07T20:11:14.6958142Z U c10::detail::infer_schema::make_function_schema(c10::ArrayRef, c10::ArrayRef) 2025-05-07T20:11:14.6959048Z U c10::detail::torchCheckFail(char const*, char const*, unsigned int, char const*) 2025-05-07T20:11:14.6959921Z U c10::detail::torchCheckFail(char const*, char const*, unsigned int, std::__cxx11::basic_string, std::allocator > const&) 2025-05-07T20:11:14.6960906Z U c10::detail::torchInternalAssertFail(char const*, char const*, unsigned int, char const*, char const*) 2025-05-07T20:11:14.6961970Z U c10::detail::torchInternalAssertFail(char const*, char const*, unsigned int, char const*, std::__cxx11::basic_string, std::allocator > const&) 2025-05-07T20:11:14.6962945Z U c10::impl::ExcludeDispatchKeyGuard::ExcludeDispatchKeyGuard(c10::DispatchKeySet) 2025-05-07T20:11:14.6963568Z U c10::impl::ExcludeDispatchKeyGuard::~ExcludeDispatchKeyGuard() 2025-05-07T20:11:14.6964032Z U c10::impl::GPUTrace::gpuTraceState 2025-05-07T20:11:14.6964373Z U c10::impl::GPUTrace::haveState 2025-05-07T20:11:14.6964947Z U c10::impl::OperatorEntry::assertSignatureIsCorrect(c10::impl::CppSignature const&, bool) const 2025-05-07T20:11:14.6965571Z U c10::impl::OperatorEntry::reportError(c10::DispatchKey) const 2025-05-07T20:11:14.6967455Z U c10::impl::cow::is_cow_data_ptr(c10::DataPtr const&) 2025-05-07T20:11:14.6967899Z U c10::impl::cow::materialize_cow_storage(c10::StorageImpl&) 2025-05-07T20:11:14.6968313Z U c10::impl::device_guard_impl_registry 2025-05-07T20:11:14.6968669Z U c10::impl::raw_local_dispatch_key_set 2025-05-07T20:11:14.6969053Z U c10::initializeFunctionalityOffsetsAndMasks() 2025-05-07T20:11:14.6969725Z U c10::ivalue::ConstantString::create(std::__cxx11::basic_string, std::allocator >) 2025-05-07T20:11:14.6970477Z U c10::operator<<(std::ostream&, c10::Device const&) 2025-05-07T20:11:14.6970856Z U c10::operator<<(std::ostream&, c10::DeviceType) 2025-05-07T20:11:14.6971253Z U c10::operator<<(std::ostream&, c10::OperatorName const&) 2025-05-07T20:11:14.6971636Z U c10::throwNullDataPtrError() 2025-05-07T20:11:14.6972018Z U c10::typeKindToString(c10::TypeKind) 2025-05-07T20:11:14.6972356Z U c10::warn(c10::Warning const&) 2025-05-07T20:11:14.6972713Z U c10::warnDeprecatedDataPtr() 2025-05-07T20:11:14.6973145Z U caffe2::TypeMeta::error_unsupported_typemeta(caffe2::TypeMeta) 2025-05-07T20:11:14.6973571Z U caffe2::TypeMeta::typeMetaDatas() 2025-05-07T20:11:14.6973943Z U cudaDeviceGetAttribute@libcudart.so.12 2025-05-07T20:11:14.6974317Z U cudaDeviceSynchronize@libcudart.so.12 2025-05-07T20:11:14.6974710Z U cudaEventCreateWithFlags@libcudart.so.12 2025-05-07T20:11:14.6975079Z U cudaEventDestroy@libcudart.so.12 2025-05-07T20:11:14.6975442Z U cudaEventElapsedTime@libcudart.so.12 2025-05-07T20:11:14.6975800Z U cudaEventQuery@libcudart.so.12 2025-05-07T20:11:14.6976138Z U cudaEventRecord@libcudart.so.12 2025-05-07T20:11:14.6976548Z U cudaEventSynchronize@libcudart.so.12 2025-05-07T20:11:14.6976910Z U cudaFuncSetAttribute@libcudart.so.12 2025-05-07T20:11:14.6977326Z U cudaGetDeviceProperties_v2@libcudart.so.12 2025-05-07T20:11:14.6977896Z U cudaGetErrorString@libcudart.so.12 2025-05-07T20:11:14.6978260Z U cudaGetLastError@libcudart.so.12 2025-05-07T20:11:14.6978599Z U cudaLaunchKernel@libcudart.so.12 2025-05-07T20:11:14.6978950Z U cudaStreamQuery@libcudart.so.12 2025-05-07T20:11:14.6979296Z U cudaStreamSynchronize@libcudart.so.12 2025-05-07T20:11:14.6979678Z U cudaStreamWaitEvent@libcudart.so.12 2025-05-07T20:11:14.6980109Z U fbgemm_gpu::asynchronous_complete_cumsum_cpu(at::Tensor const&) 2025-05-07T20:11:14.6980591Z U float* at::TensorBase::mutable_data_ptr() const 2025-05-07T20:11:14.6980965Z U free@GLIBC_2.2.5 2025-05-07T20:11:14.6981301Z U int* at::TensorBase::mutable_data_ptr() const 2025-05-07T20:11:14.6981666Z U log2f@GLIBC_2.2.5 2025-05-07T20:11:14.6981960Z U long at::Tensor::item() const 2025-05-07T20:11:14.6982387Z U long c10::detail::maybe_wrap_dim_slow(long, long, bool) 2025-05-07T20:11:14.6982806Z U long* at::TensorBase::data_ptr() const 2025-05-07T20:11:14.6983222Z U long* at::TensorBase::mutable_data_ptr() const 2025-05-07T20:11:14.6983604Z U memcmp@GLIBC_2.2.5 2025-05-07T20:11:14.6983891Z U memcpy@GLIBC_2.14 2025-05-07T20:11:14.6984184Z U memmove@GLIBC_2.2.5 2025-05-07T20:11:14.6984463Z U memset@GLIBC_2.2.5 2025-05-07T20:11:14.6984825Z U operator delete(void*, unsigned long)@CXXABI_1.3.9 2025-05-07T20:11:14.6985200Z U operator new(unsigned long)@GLIBCXX_3.4 2025-05-07T20:11:14.6985578Z U realloc@GLIBC_2.2.5 2025-05-07T20:11:14.6986001Z U std::_Hash_bytes(void const*, unsigned long, unsigned long)@CXXABI_1.3.5 2025-05-07T20:11:14.6986669Z U std::__cxx11::basic_ostringstream, std::allocator >::basic_ostringstream() 2025-05-07T20:11:14.6987521Z U std::__cxx11::basic_ostringstream, std::allocator >::~basic_ostringstream()@GLIBCXX_3.4.21 2025-05-07T20:11:14.6988365Z U std::__cxx11::basic_stringbuf, std::allocator >::str() const &@GLIBCXX_3.4.29 2025-05-07T20:11:14.6989173Z U std::__detail::_Prime_rehash_policy::_M_need_rehash(unsigned long, unsigned long, unsigned long) const@GLIBCXX_3.4.18 2025-05-07T20:11:14.6989778Z U std::__throw_bad_alloc()@GLIBCXX_3.4 2025-05-07T20:11:14.6990113Z U std::__throw_bad_array_new_length() 2025-05-07T20:11:14.6990496Z U std::__throw_length_error(char const*)@GLIBCXX_3.4 2025-05-07T20:11:14.6990909Z U std::__throw_logic_error(char const*)@GLIBCXX_3.4 2025-05-07T20:11:14.6991334Z U std::__throw_out_of_range_fmt(char const*, ...)@GLIBCXX_3.4.20 2025-05-07T20:11:14.6991775Z U std::__throw_system_error(int)@GLIBCXX_3.4.11 2025-05-07T20:11:14.6992156Z U std::bad_weak_ptr::~bad_weak_ptr()@GLIBCXX_3.4.15 2025-05-07T20:11:14.6992664Z U std::basic_ios >::clear(std::_Ios_Iostate)@GLIBCXX_3.4 2025-05-07T20:11:14.6993376Z U std::basic_ios >::init(std::basic_streambuf >*)@GLIBCXX_3.4 2025-05-07T20:11:14.6994399Z U std::basic_ostream >& std::__ostream_insert >(std::basic_ostream >&, char const*, long)@GLIBCXX_3.4.9 2025-05-07T20:11:14.6995628Z U std::basic_ostream >& std::operator<< >(std::basic_ostream >&, char const*)@GLIBCXX_3.4 2025-05-07T20:11:14.6996458Z U std::exception::~exception()@GLIBCXX_3.4 2025-05-07T20:11:14.6996817Z U std::ios_base::Init::Init()@GLIBCXX_3.4 2025-05-07T20:11:14.6997184Z U std::ios_base::Init::~Init()@GLIBCXX_3.4 2025-05-07T20:11:14.6997529Z U std::ios_base::ios_base()@GLIBCXX_3.4 2025-05-07T20:11:14.6997881Z U std::ios_base::~ios_base()@GLIBCXX_3.4 2025-05-07T20:11:14.6998208Z U std::locale::locale()@GLIBCXX_3.4 2025-05-07T20:11:14.6998553Z U std::locale::~locale()@GLIBCXX_3.4 2025-05-07T20:11:14.6998980Z U std::ostream& std::ostream::_M_insert(long)@GLIBCXX_3.4.9 2025-05-07T20:11:14.6999509Z U std::ostream& std::ostream::_M_insert(unsigned long)@GLIBCXX_3.4.9 2025-05-07T20:11:14.7000009Z U std::ostream::operator<<(int)@GLIBCXX_3.4 2025-05-07T20:11:14.7000407Z U std::out_of_range::out_of_range(char const*)@GLIBCXX_3.4.21 2025-05-07T20:11:14.7000845Z U std::out_of_range::~out_of_range()@GLIBCXX_3.4 2025-05-07T20:11:14.7001537Z U std::runtime_error::runtime_error(std::__cxx11::basic_string, std::allocator > const&)@GLIBCXX_3.4.21 2025-05-07T20:11:14.7002208Z U std::runtime_error::~runtime_error()@GLIBCXX_3.4 2025-05-07T20:11:14.7002588Z U std::terminate()@GLIBCXX_3.4 2025-05-07T20:11:14.7002895Z U strcmp@GLIBC_2.2.5 2025-05-07T20:11:14.7003201Z U strlen@GLIBC_2.2.5 2025-05-07T20:11:14.7003531Z U torch::CppFunction::~CppFunction() 2025-05-07T20:11:14.7004343Z U torch::Library::Library(torch::Library::Kind, std::__cxx11::basic_string, std::allocator >, std::optional, char const*, unsigned int) 2025-05-07T20:11:14.7005557Z U torch::Library::_def(c10::FunctionSchema&&, c10::OperatorName*, std::vector > const&, torch::_RegisterOrVerify) & 2025-05-07T20:11:14.7006395Z U torch::Library::_impl(char const*, torch::CppFunction&&, torch::_RegisterOrVerify) & 2025-05-07T20:11:14.7006886Z U torch::autograd::AnomalyMode::_enabled 2025-05-07T20:11:14.7007184Z U torch::autograd::AutogradContext::AutogradContext(torch::dynamo::autograd::PackedArgs&) 2025-05-07T20:11:14.7007387Z U torch::autograd::AutogradContext::get_and_bump_dirty() const 2025-05-07T20:11:14.7007591Z U torch::autograd::AutogradContext::get_non_differentiable() const 2025-05-07T20:11:14.7007779Z U torch::autograd::AutogradContext::get_saved_variables() const 2025-05-07T20:11:14.7008152Z U torch::autograd::AutogradContext::save_for_backward(std::vector >) 2025-05-07T20:11:14.7008307Z U torch::autograd::AutogradContext::save_variables() 2025-05-07T20:11:14.7008500Z U torch::autograd::ForwardADLevel::try_get_by_idx(unsigned long) 2025-05-07T20:11:14.7008692Z U torch::autograd::InputMetadata::shape_as_dim_vector() const 2025-05-07T20:11:14.7008816Z U torch::autograd::Node::assign_parent() 2025-05-07T20:11:14.7008936Z U torch::autograd::Node::metadata() 2025-05-07T20:11:14.7009082Z U torch::autograd::Node::name[abi:cxx11]() const 2025-05-07T20:11:14.7009327Z U torch::autograd::SavedVariable::SavedVariable(at::Tensor const&, bool, bool) 2025-05-07T20:11:14.7009598Z U torch::autograd::SavedVariable::unpack(std::shared_ptr) const 2025-05-07T20:11:14.7009760Z U torch::autograd::VariableInfo::VariableInfo() 2025-05-07T20:11:14.7010000Z U torch::autograd::VariableInfo::VariableInfo(at::Tensor const&, bool) 2025-05-07T20:11:14.7010332Z U torch::autograd::VariableInfo::zeros(c10::OptionalDeviceGuard&) const 2025-05-07T20:11:14.7013041Z U torch::autograd::_wrap_outputs(std::vector > const&, std::unordered_set, std::equal_to, std::allocator > const&, std::unordered_set, std::equal_to, std::allocator > const&, c10::ArrayRef >, std::shared_ptr const&, std::function > (std::vector >, std::vector >)> const&, std::unordered_set, std::equal_to, std::allocator > const&, std::function const&) 2025-05-07T20:11:14.7013218Z U torch::autograd::deleteNode(torch::autograd::Node*) 2025-05-07T20:11:14.7013378Z U torch::autograd::get_current_graph_task_exec_info() 2025-05-07T20:11:14.7013540Z U torch::autograd::impl::gradient_edge(at::Tensor const&) 2025-05-07T20:11:14.7013717Z U torch::dynamo::autograd::getPyCompilerInterface() 2025-05-07T20:11:14.7014129Z U torch::dynamo::autograd::get_input_metadata(std::vector > const&) 2025-05-07T20:11:14.7014487Z U torch::jit::parseSchema(std::__cxx11::basic_string, std::allocator > const&, bool) 2025-05-07T20:11:14.7015049Z U transpose_embedding_input(at::Tensor, long, at::Tensor, at::Tensor, bool, std::optional const&, long, long, long, bool, std::optional const&, long, long) 2025-05-07T20:11:14.7015184Z U typeinfo for c10::Error 2025-05-07T20:11:14.7015288Z U typeinfo for c10::Type 2025-05-07T20:11:14.7015453Z U typeinfo for std::bad_weak_ptr@GLIBCXX_3.4.15 2025-05-07T20:11:14.7015591Z U typeinfo for std::out_of_range@GLIBCXX_3.4 2025-05-07T20:11:14.7015717Z U typeinfo for std::runtime_error@GLIBCXX_3.4 2025-05-07T20:11:14.7015846Z U typeinfo for torch::autograd::Node 2025-05-07T20:11:14.7015997Z U vtable for __cxxabiv1::__class_type_info@CXXABI_1.3 2025-05-07T20:11:14.7016160Z U vtable for __cxxabiv1::__function_type_info@CXXABI_1.3 2025-05-07T20:11:14.7016338Z U vtable for __cxxabiv1::__pointer_type_info@CXXABI_1.3 2025-05-07T20:11:14.7016500Z U vtable for __cxxabiv1::__si_class_type_info@CXXABI_1.3 2025-05-07T20:11:14.7016657Z U vtable for __cxxabiv1::__vmi_class_type_info@CXXABI_1.3 2025-05-07T20:11:14.7016775Z U vtable for c10::Error 2025-05-07T20:11:14.7016882Z U vtable for c10::ListType 2025-05-07T20:11:14.7017236Z U vtable for std::__cxx11::basic_ostringstream, std::allocator >@GLIBCXX_3.4.21 2025-05-07T20:11:14.7017583Z U vtable for std::__cxx11::basic_stringbuf, std::allocator >@GLIBCXX_3.4.21 2025-05-07T20:11:14.7017930Z U vtable for std::__cxx11::basic_stringstream, std::allocator >@GLIBCXX_3.4.21 2025-05-07T20:11:14.7018070Z U vtable for std::bad_weak_ptr@GLIBCXX_3.4.15 2025-05-07T20:11:14.7018280Z U vtable for std::basic_ios >@GLIBCXX_3.4 2025-05-07T20:11:14.7018511Z U vtable for std::basic_streambuf >@GLIBCXX_3.4 2025-05-07T20:11:14.7018675Z U vtable for torch::autograd::AutogradMeta 2025-05-07T20:11:14.7018811Z U vtable for torch::autograd::Node 2025-05-07T20:11:14.7019015Z w TLS init function for c10::impl::raw_local_dispatch_key_set 2025-05-07T20:11:14.7019155Z w _ITM_deregisterTMCloneTable 2025-05-07T20:11:14.7019267Z w _ITM_registerTMCloneTable 2025-05-07T20:11:14.7019390Z w __cxa_finalize@GLIBC_2.2.5 2025-05-07T20:11:14.7019501Z w __cxa_pure_virtual@CXXABI_1.3 2025-05-07T20:11:14.7019596Z w __gmon_start__ 2025-05-07T20:11:14.7019707Z w __pthread_key_create 2025-05-07T20:11:14.7019820Z w pthread_mutex_lock@GLIBC_2.2.5 2025-05-07T20:11:14.7019939Z w pthread_mutex_unlock@GLIBC_2.2.5 2025-05-07T20:11:14.7020104Z [CHECK] Listing out external shared libraries linked: 2025-05-07T20:11:14.7020330Z + ldd ./_skbuild/linux-x86_64-3.13/cmake-build/fbgemm_gpu_tbe_index_select.so 2025-05-07T20:11:14.7020340Z 2025-05-07T20:11:14.7020499Z linux-vdso.so.1 (0x00007ffff35f7000) 2025-05-07T20:11:14.7020609Z libc10.so => not found 2025-05-07T20:11:14.7020709Z libc10_cuda.so => not found 2025-05-07T20:11:14.7021273Z fbgemm_gpu_sparse_async_cumsum.so => /__w/FBGEMM/FBGEMM/fbgemm_gpu/./_skbuild/linux-x86_64-3.13/cmake-build/fbgemm_gpu_sparse_async_cumsum.so (0x00007f0a4aa50000) 2025-05-07T20:11:14.7021748Z fbgemm_gpu_tbe_utils.so => /__w/FBGEMM/FBGEMM/fbgemm_gpu/./_skbuild/linux-x86_64-3.13/cmake-build/fbgemm_gpu_tbe_utils.so (0x00007f0a49800000) 2025-05-07T20:11:14.7021845Z libtorch.so => not found 2025-05-07T20:11:14.7021945Z libtorch_cpu.so => not found 2025-05-07T20:11:14.7022064Z libtorch_cuda.so => not found 2025-05-07T20:11:14.7022158Z libcudart.so.12 => not found 2025-05-07T20:11:14.7022322Z libstdc++.so.6 => /lib64/libstdc++.so.6 (0x00007f0a4959c000) 2025-05-07T20:11:14.7022450Z libm.so.6 => /lib64/libm.so.6 (0x00007f0a4a975000) 2025-05-07T20:11:14.7022648Z libgcc_s.so.1 => /lib64/libgcc_s.so.1 (0x00007f0a4cfee000) 2025-05-07T20:11:14.7022769Z libc.so.6 => /lib64/libc.so.6 (0x00007f0a49394000) 2025-05-07T20:11:14.7022902Z /lib64/ld-linux-x86-64.so.2 (0x00007f0a4d022000) 2025-05-07T20:11:14.7023007Z libc10.so => not found 2025-05-07T20:11:14.7023097Z libc10_cuda.so => not found 2025-05-07T20:11:14.7023191Z libtorch.so => not found 2025-05-07T20:11:14.7023313Z libtorch_cpu.so => not found 2025-05-07T20:11:14.7023409Z libtorch_cuda.so => not found 2025-05-07T20:11:14.7023503Z libcudart.so.12 => not found 2025-05-07T20:11:14.7023596Z libtorch.so => not found 2025-05-07T20:11:14.7023697Z libc10.so => not found 2025-05-07T20:11:14.7023787Z libc10_cuda.so => not found 2025-05-07T20:11:14.7023885Z libtorch_cpu.so => not found 2025-05-07T20:11:14.7024004Z libtorch_cuda.so => not found 2025-05-07T20:11:14.7024091Z libcudart.so.12 => not found 2025-05-07T20:11:14.7024096Z 2025-05-07T20:11:14.7024205Z [CHECK] Displaying ELF information: 2025-05-07T20:11:14.7024460Z + readelf -d ./_skbuild/linux-x86_64-3.13/cmake-build/fbgemm_gpu_tbe_index_select.so 2025-05-07T20:11:14.7024486Z 2025-05-07T20:11:14.7024491Z 2025-05-07T20:11:14.7024652Z Dynamic section at offset 0x2201930 contains 41 entries: 2025-05-07T20:11:14.7024767Z Tag Type Name/Value 2025-05-07T20:11:14.7024962Z 0x0000000000000001 (NEEDED) Shared library: [libc10.so] 2025-05-07T20:11:14.7025181Z 0x0000000000000001 (NEEDED) Shared library: [libc10_cuda.so] 2025-05-07T20:11:14.7025425Z 0x0000000000000001 (NEEDED) Shared library: [fbgemm_gpu_sparse_async_cumsum.so] 2025-05-07T20:11:14.7025643Z 0x0000000000000001 (NEEDED) Shared library: [fbgemm_gpu_tbe_utils.so] 2025-05-07T20:11:14.7025857Z 0x0000000000000001 (NEEDED) Shared library: [libtorch.so] 2025-05-07T20:11:14.7026055Z 0x0000000000000001 (NEEDED) Shared library: [libtorch_cpu.so] 2025-05-07T20:11:14.7026259Z 0x0000000000000001 (NEEDED) Shared library: [libtorch_cuda.so] 2025-05-07T20:11:14.7026501Z 0x0000000000000001 (NEEDED) Shared library: [libcudart.so.12] 2025-05-07T20:11:14.7026731Z 0x0000000000000001 (NEEDED) Shared library: [libstdc++.so.6] 2025-05-07T20:11:14.7026939Z 0x0000000000000001 (NEEDED) Shared library: [libm.so.6] 2025-05-07T20:11:14.7027144Z 0x0000000000000001 (NEEDED) Shared library: [libgcc_s.so.1] 2025-05-07T20:11:14.7027333Z 0x0000000000000001 (NEEDED) Shared library: [libc.so.6] 2025-05-07T20:11:14.7027543Z 0x0000000000000001 (NEEDED) Shared library: [ld-linux-x86-64.so.2] 2025-05-07T20:11:14.7027787Z 0x000000000000000e (SONAME) Library soname: [fbgemm_gpu_tbe_index_select.so] 2025-05-07T20:11:14.7027985Z 0x000000000000000f (RPATH) Library rpath: [$ORIGIN] 2025-05-07T20:11:14.7028100Z 0x000000000000000c (INIT) 0x51000 2025-05-07T20:11:14.7028215Z 0x000000000000000d (FINI) 0x14a27c 2025-05-07T20:11:14.7028346Z 0x0000000000000019 (INIT_ARRAY) 0x2201bc8 2025-05-07T20:11:14.7028633Z 0x000000000000001b (INIT_ARRAYSZ) 144 (bytes) 2025-05-07T20:11:14.7028755Z 0x000000000000001a (FINI_ARRAY) 0x2201c58 2025-05-07T20:11:14.7028901Z 0x000000000000001c (FINI_ARRAYSZ) 8 (bytes) 2025-05-07T20:11:14.7029002Z 0x0000000000000004 (HASH) 0x238 2025-05-07T20:11:14.7029123Z 0x000000006ffffef5 (GNU_HASH) 0x2900 2025-05-07T20:11:14.7029230Z 0x0000000000000005 (STRTAB) 0xda10 2025-05-07T20:11:14.7029356Z 0x0000000000000006 (SYMTAB) 0x4f80 2025-05-07T20:11:14.7029559Z 0x000000000000000a (STRSZ) 224745 (bytes) 2025-05-07T20:11:14.7029675Z 0x000000000000000b (SYMENT) 24 (bytes) 2025-05-07T20:11:14.7029811Z 0x0000000000000003 (PLTGOT) 0x2202c00 2025-05-07T20:11:14.7029948Z 0x0000000000000002 (PLTRELSZ) 11784 (bytes) 2025-05-07T20:11:14.7030118Z 0x0000000000000014 (PLTREL) RELA 2025-05-07T20:11:14.7030233Z 0x0000000000000017 (JMPREL) 0x4da10 2025-05-07T20:11:14.7030369Z 0x0000000000000007 (RELA) 0x45508 2025-05-07T20:11:14.7030497Z 0x0000000000000008 (RELASZ) 34056 (bytes) 2025-05-07T20:11:14.7030612Z 0x0000000000000009 (RELAENT) 24 (bytes) 2025-05-07T20:11:14.7030730Z 0x0000000000000018 (BIND_NOW) 2025-05-07T20:11:14.7030849Z 0x000000006ffffffb (FLAGS_1) Flags: NOW 2025-05-07T20:11:14.7030967Z 0x000000006ffffffe (VERNEED) 0x45388 2025-05-07T20:11:14.7031097Z 0x000000006fffffff (VERNEEDNUM) 6 2025-05-07T20:11:14.7031205Z 0x000000006ffffff0 (VERSYM) 0x447fa 2025-05-07T20:11:14.7031317Z 0x000000006ffffff9 (RELACOUNT) 388 2025-05-07T20:11:14.7031420Z 0x0000000000000000 (NULL) 0x0 2025-05-07T20:11:14.7031424Z 2025-05-07T20:11:14.7031551Z ################################################################################ 2025-05-07T20:11:14.7031557Z 2025-05-07T20:11:14.7031562Z 2025-05-07T20:11:14.7031673Z ################################################################################ 2025-05-07T20:11:14.7031994Z [CHECK] BUILT LIBRARY: ./_skbuild/linux-x86_64-3.13/cmake-build/fbgemm_gpu_embedding_inplace_ops.so 2025-05-07T20:11:14.7032113Z [CHECK] Listing out library size: 2025-05-07T20:11:14.7032417Z + du -h --block-size=1M ./_skbuild/linux-x86_64-3.13/cmake-build/fbgemm_gpu_embedding_inplace_ops.so 2025-05-07T20:11:14.7032421Z 2025-05-07T20:11:14.7032650Z 1 ./_skbuild/linux-x86_64-3.13/cmake-build/fbgemm_gpu_embedding_inplace_ops.so 2025-05-07T20:11:14.7032668Z 2025-05-07T20:11:14.7033087Z [CHECK] Listing out the GLIBC versions referenced by: ./_skbuild/linux-x86_64-3.13/cmake-build/fbgemm_gpu_embedding_inplace_ops.so 2025-05-07T20:11:14.7033615Z + objdump -TC ./_skbuild/linux-x86_64-3.13/cmake-build/fbgemm_gpu_embedding_inplace_ops.so | grep GLIBC_ | sed 's/.*GLIBC_\([.0-9]*\).*/GLIBC_\1/g' | sort -Vu | cat 2025-05-07T20:11:14.7033621Z 2025-05-07T20:11:14.7033732Z GLIBC_2.2.5 2025-05-07T20:11:14.7033846Z GLIBC_2.3 2025-05-07T20:11:14.7033931Z GLIBC_2.14 2025-05-07T20:11:14.7033987Z 2025-05-07T20:11:14.7033991Z 2025-05-07T20:11:14.7034461Z [CHECK] Listing out the GLIBCXX versions referenced by: ./_skbuild/linux-x86_64-3.13/cmake-build/fbgemm_gpu_embedding_inplace_ops.so 2025-05-07T20:11:14.7035012Z + objdump -TC ./_skbuild/linux-x86_64-3.13/cmake-build/fbgemm_gpu_embedding_inplace_ops.so | grep GLIBCXX_ | sed 's/.*GLIBCXX_\([.0-9]*\).*/GLIBCXX_\1/g' | sort -Vu | cat 2025-05-07T20:11:14.7035018Z 2025-05-07T20:11:14.7085105Z GLIBCXX_3.4 2025-05-07T20:11:14.7085536Z GLIBCXX_3.4.9 2025-05-07T20:11:14.7085794Z GLIBCXX_3.4.18 2025-05-07T20:11:14.7086050Z GLIBCXX_3.4.20 2025-05-07T20:11:14.7086273Z GLIBCXX_3.4.21 2025-05-07T20:11:14.7086527Z GLIBCXX_3.4.29 2025-05-07T20:11:14.7086543Z 2025-05-07T20:11:14.7086556Z 2025-05-07T20:11:14.7101385Z + nm -gDC ./_skbuild/linux-x86_64-3.13/cmake-build/fbgemm_gpu_embedding_inplace_ops.so > /tmp/tmp.Fz4oK6XQSd.symbols.txt 2025-05-07T20:11:14.7101435Z 2025-05-07T20:11:14.7128105Z 2025-05-07T20:11:14.7156012Z [CHECK] Total Number of symbols: 356 2025-05-07T20:11:14.7168101Z [CHECK] Number of fbgemm symbols: 56 2025-05-07T20:11:14.7185829Z + nm -gDCu ./_skbuild/linux-x86_64-3.13/cmake-build/fbgemm_gpu_embedding_inplace_ops.so > /tmp/tmp.Lgcz1C5HFX.usymbols.txt 2025-05-07T20:11:14.7185873Z 2025-05-07T20:11:14.7203113Z 2025-05-07T20:11:14.7228377Z [CHECK] Listing out undefined symbols (123 total): 2025-05-07T20:11:14.7240805Z U VTT for std::__cxx11::basic_ostringstream, std::allocator >@GLIBCXX_3.4.21 2025-05-07T20:11:14.7241915Z U VTT for std::__cxx11::basic_stringstream, std::allocator >@GLIBCXX_3.4.21 2025-05-07T20:11:14.7242201Z U _Unwind_Resume@GCC_3.0 2025-05-07T20:11:14.7242646Z U __cudaPopCallConfiguration@libcudart.so.12 2025-05-07T20:11:14.7243365Z U __cudaPushCallConfiguration@libcudart.so.12 2025-05-07T20:11:14.7243749Z U __cudaRegisterFatBinary@libcudart.so.12 2025-05-07T20:11:14.7244175Z U __cudaRegisterFatBinaryEnd@libcudart.so.12 2025-05-07T20:11:14.7244548Z U __cudaRegisterFunction@libcudart.so.12 2025-05-07T20:11:14.7244889Z U __cudaRegisterVar@libcudart.so.12 2025-05-07T20:11:14.7245268Z U __cudaUnregisterFatBinary@libcudart.so.12 2025-05-07T20:11:14.7245571Z U __cxa_atexit@GLIBC_2.2.5 2025-05-07T20:11:14.7245870Z U __cxa_begin_catch@CXXABI_1.3 2025-05-07T20:11:14.7246149Z U __cxa_end_catch@CXXABI_1.3 2025-05-07T20:11:14.7246458Z U __cxa_guard_abort@CXXABI_1.3 2025-05-07T20:11:14.7246756Z U __cxa_guard_acquire@CXXABI_1.3 2025-05-07T20:11:14.7247041Z U __cxa_guard_release@CXXABI_1.3 2025-05-07T20:11:14.7247318Z U __cxa_rethrow@CXXABI_1.3 2025-05-07T20:11:14.7247656Z U __cxa_thread_atexit@CXXABI_1.3.7 2025-05-07T20:11:14.7247953Z U __gxx_personality_v0@CXXABI_1.3 2025-05-07T20:11:14.7248218Z U __tls_get_addr@GLIBC_2.3 2025-05-07T20:11:14.7249443Z U at::_ops::empty_like::call(at::Tensor const&, std::optional, std::optional, std::optional, std::optional, std::optional) 2025-05-07T20:11:14.7250081Z U at::_ops::empty_memory_format::call(c10::ArrayRef, std::optional, std::optional, std::optional, std::optional, std::optional) 2025-05-07T20:11:14.7250355Z U c10::DeviceTypeName[abi:cxx11](c10::DeviceType, bool) 2025-05-07T20:11:14.7250508Z U c10::IValue::reportToTensorTypeError() const 2025-05-07T20:11:14.7250613Z U c10::IntType::get() 2025-05-07T20:11:14.7250865Z U c10::MessageLogger::MessageLogger(char const*, int, int) 2025-05-07T20:11:14.7250989Z U c10::MessageLogger::~MessageLogger() 2025-05-07T20:11:14.7251294Z U c10::OptionalType::get(c10::Type::SingletonOrSharedTypePtr) 2025-05-07T20:11:14.7251720Z U c10::SymInt::SymInt(c10::intrusive_ptr >) 2025-05-07T20:11:14.7251860Z U c10::SymInt::guard_int(char const*, long) const 2025-05-07T20:11:14.7251963Z U c10::TensorType::get() 2025-05-07T20:11:14.7252085Z U c10::UndefinedTensorImpl::_singleton 2025-05-07T20:11:14.7252812Z U c10::Warning::Warning(std::variant, c10::SourceLocation const&, std::__cxx11::basic_string, std::allocator >, bool) 2025-05-07T20:11:14.7252949Z U c10::cuda::CUDACachingAllocator::allocator 2025-05-07T20:11:14.7253091Z U c10::cuda::CUDAStream::stream() const 2025-05-07T20:11:14.7253213Z U c10::cuda::ExchangeDevice(signed char) 2025-05-07T20:11:14.7253324Z U c10::cuda::GetDevice(signed char*) 2025-05-07T20:11:14.7253443Z U c10::cuda::MaybeSetDevice(signed char) 2025-05-07T20:11:14.7253571Z U c10::cuda::SetDevice(signed char) 2025-05-07T20:11:14.7253822Z U c10::cuda::c10_cuda_check_implementation(int, char const*, char const*, int, bool) 2025-05-07T20:11:14.7253924Z U c10::cuda::device_count() 2025-05-07T20:11:14.7254076Z U c10::cuda::getCurrentCUDAStream(signed char) 2025-05-07T20:11:14.7254215Z U c10::cuda::getDefaultCUDAStream(signed char) 2025-05-07T20:11:14.7254356Z U c10::cuda::getStreamFromPool(bool, signed char) 2025-05-07T20:11:14.7254518Z U c10::cuda::getStreamFromPool(int, signed char) 2025-05-07T20:11:14.7254703Z U c10::cuda::setCurrentCUDAStream(c10::cuda::CUDAStream) 2025-05-07T20:11:14.7254820Z U c10::cuda::warn_or_error_on_sync() 2025-05-07T20:11:14.7255351Z U c10::detail::infer_schema::make_function_schema(c10::ArrayRef, c10::ArrayRef) 2025-05-07T20:11:14.7255604Z U c10::detail::torchCheckFail(char const*, char const*, unsigned int, char const*) 2025-05-07T20:11:14.7256098Z U c10::detail::torchCheckFail(char const*, char const*, unsigned int, std::__cxx11::basic_string, std::allocator > const&) 2025-05-07T20:11:14.7256458Z U c10::detail::torchInternalAssertFail(char const*, char const*, unsigned int, char const*, char const*) 2025-05-07T20:11:14.7256576Z U c10::impl::GPUTrace::gpuTraceState 2025-05-07T20:11:14.7256684Z U c10::impl::GPUTrace::haveState 2025-05-07T20:11:14.7256825Z U c10::impl::device_guard_impl_registry 2025-05-07T20:11:14.7256965Z U c10::operator<<(std::ostream&, c10::Device const&) 2025-05-07T20:11:14.7257104Z U c10::operator<<(std::ostream&, c10::DeviceType) 2025-05-07T20:11:14.7257229Z U c10::warn(c10::Warning const&) 2025-05-07T20:11:14.7257425Z U caffe2::TypeMeta::error_unsupported_typemeta(caffe2::TypeMeta) 2025-05-07T20:11:14.7257555Z U cudaDeviceSynchronize@libcudart.so.12 2025-05-07T20:11:14.7257719Z U cudaEventCreateWithFlags@libcudart.so.12 2025-05-07T20:11:14.7257837Z U cudaEventDestroy@libcudart.so.12 2025-05-07T20:11:14.7257965Z U cudaEventElapsedTime@libcudart.so.12 2025-05-07T20:11:14.7258093Z U cudaEventQuery@libcudart.so.12 2025-05-07T20:11:14.7258214Z U cudaEventRecord@libcudart.so.12 2025-05-07T20:11:14.7258361Z U cudaEventSynchronize@libcudart.so.12 2025-05-07T20:11:14.7258528Z U cudaGetDeviceProperties_v2@libcudart.so.12 2025-05-07T20:11:14.7258677Z U cudaGetErrorString@libcudart.so.12 2025-05-07T20:11:14.7258816Z U cudaGetLastError@libcudart.so.12 2025-05-07T20:11:14.7258950Z U cudaLaunchKernel@libcudart.so.12 2025-05-07T20:11:14.7259062Z U cudaStreamQuery@libcudart.so.12 2025-05-07T20:11:14.7259191Z U cudaStreamSynchronize@libcudart.so.12 2025-05-07T20:11:14.7259332Z U cudaStreamWaitEvent@libcudart.so.12 2025-05-07T20:11:14.7259477Z U int* at::TensorBase::mutable_data_ptr() const 2025-05-07T20:11:14.7259651Z U long c10::detail::maybe_wrap_dim_slow(long, long, bool) 2025-05-07T20:11:14.7259823Z U long* at::TensorBase::mutable_data_ptr() const 2025-05-07T20:11:14.7259916Z U memcpy@GLIBC_2.14 2025-05-07T20:11:14.7260015Z U memmove@GLIBC_2.2.5 2025-05-07T20:11:14.7260112Z U memset@GLIBC_2.2.5 2025-05-07T20:11:14.7260283Z U operator delete(void*, unsigned long)@CXXABI_1.3.9 2025-05-07T20:11:14.7260413Z U operator new(unsigned long)@GLIBCXX_3.4 2025-05-07T20:11:14.7260757Z U std::__cxx11::basic_ostringstream, std::allocator >::basic_ostringstream() 2025-05-07T20:11:14.7261172Z U std::__cxx11::basic_ostringstream, std::allocator >::~basic_ostringstream()@GLIBCXX_3.4.21 2025-05-07T20:11:14.7261511Z U std::__cxx11::basic_stringbuf, std::allocator >::str() const &@GLIBCXX_3.4.29 2025-05-07T20:11:14.7261918Z U std::__detail::_Prime_rehash_policy::_M_need_rehash(unsigned long, unsigned long, unsigned long) const@GLIBCXX_3.4.18 2025-05-07T20:11:14.7262062Z U std::__throw_bad_alloc()@GLIBCXX_3.4 2025-05-07T20:11:14.7262176Z U std::__throw_bad_array_new_length() 2025-05-07T20:11:14.7262343Z U std::__throw_length_error(char const*)@GLIBCXX_3.4 2025-05-07T20:11:14.7262491Z U std::__throw_logic_error(char const*)@GLIBCXX_3.4 2025-05-07T20:11:14.7262776Z U std::__throw_out_of_range_fmt(char const*, ...)@GLIBCXX_3.4.20 2025-05-07T20:11:14.7263033Z U std::basic_ios >::clear(std::_Ios_Iostate)@GLIBCXX_3.4 2025-05-07T20:11:14.7263369Z U std::basic_ios >::init(std::basic_streambuf >*)@GLIBCXX_3.4 2025-05-07T20:11:14.7263934Z U std::basic_ostream >& std::__ostream_insert >(std::basic_ostream >&, char const*, long)@GLIBCXX_3.4.9 2025-05-07T20:11:14.7264449Z U std::basic_ostream >& std::operator<< >(std::basic_ostream >&, char const*)@GLIBCXX_3.4 2025-05-07T20:11:14.7264572Z U std::ios_base::Init::Init()@GLIBCXX_3.4 2025-05-07T20:11:14.7264700Z U std::ios_base::Init::~Init()@GLIBCXX_3.4 2025-05-07T20:11:14.7264828Z U std::ios_base::ios_base()@GLIBCXX_3.4 2025-05-07T20:11:14.7264945Z U std::ios_base::~ios_base()@GLIBCXX_3.4 2025-05-07T20:11:14.7265061Z U std::locale::locale()@GLIBCXX_3.4 2025-05-07T20:11:14.7265181Z U std::locale::~locale()@GLIBCXX_3.4 2025-05-07T20:11:14.7265365Z U std::ostream& std::ostream::_M_insert(long)@GLIBCXX_3.4.9 2025-05-07T20:11:14.7265603Z U std::ostream& std::ostream::_M_insert(unsigned long)@GLIBCXX_3.4.9 2025-05-07T20:11:14.7265742Z U std::ostream::operator<<(int)@GLIBCXX_3.4 2025-05-07T20:11:14.7265849Z U std::terminate()@GLIBCXX_3.4 2025-05-07T20:11:14.7265975Z U strlen@GLIBC_2.2.5 2025-05-07T20:11:14.7266139Z U torch::CppFunction::~CppFunction() 2025-05-07T20:11:14.7266734Z U torch::Library::Library(torch::Library::Kind, std::__cxx11::basic_string, std::allocator >, std::optional, char const*, unsigned int) 2025-05-07T20:11:14.7267192Z U torch::Library::_def(c10::FunctionSchema&&, c10::OperatorName*, std::vector > const&, torch::_RegisterOrVerify) & 2025-05-07T20:11:14.7267467Z U torch::Library::_impl(char const*, torch::CppFunction&&, torch::_RegisterOrVerify) & 2025-05-07T20:11:14.7267820Z U torch::jit::parseSchema(std::__cxx11::basic_string, std::allocator > const&, bool) 2025-05-07T20:11:14.7268032Z U unsigned char* at::TensorBase::mutable_data_ptr() const 2025-05-07T20:11:14.7268206Z U vtable for __cxxabiv1::__class_type_info@CXXABI_1.3 2025-05-07T20:11:14.7268365Z U vtable for __cxxabiv1::__function_type_info@CXXABI_1.3 2025-05-07T20:11:14.7268527Z U vtable for __cxxabiv1::__si_class_type_info@CXXABI_1.3 2025-05-07T20:11:14.7268880Z U vtable for std::__cxx11::basic_ostringstream, std::allocator >@GLIBCXX_3.4.21 2025-05-07T20:11:14.7269211Z U vtable for std::__cxx11::basic_stringbuf, std::allocator >@GLIBCXX_3.4.21 2025-05-07T20:11:14.7269568Z U vtable for std::__cxx11::basic_stringstream, std::allocator >@GLIBCXX_3.4.21 2025-05-07T20:11:14.7269768Z U vtable for std::basic_ios >@GLIBCXX_3.4 2025-05-07T20:11:14.7269993Z U vtable for std::basic_streambuf >@GLIBCXX_3.4 2025-05-07T20:11:14.7270148Z w _ITM_deregisterTMCloneTable 2025-05-07T20:11:14.7270260Z w _ITM_registerTMCloneTable 2025-05-07T20:11:14.7270365Z w __cxa_finalize@GLIBC_2.2.5 2025-05-07T20:11:14.7270451Z w __gmon_start__ 2025-05-07T20:11:14.7270562Z w __pthread_key_create 2025-05-07T20:11:14.7270713Z [CHECK] Listing out external shared libraries linked: 2025-05-07T20:11:14.7270952Z + ldd ./_skbuild/linux-x86_64-3.13/cmake-build/fbgemm_gpu_embedding_inplace_ops.so 2025-05-07T20:11:14.7270959Z 2025-05-07T20:11:14.7283326Z linux-vdso.so.1 (0x00007ffe149f9000) 2025-05-07T20:11:14.7283708Z libtorch.so => not found 2025-05-07T20:11:14.7283962Z libc10.so => not found 2025-05-07T20:11:14.7284264Z libc10_cuda.so => not found 2025-05-07T20:11:14.7284533Z libtorch_cpu.so => not found 2025-05-07T20:11:14.7284804Z libtorch_cuda.so => not found 2025-05-07T20:11:14.7285100Z libcudart.so.12 => not found 2025-05-07T20:11:14.7285578Z libstdc++.so.6 => /lib64/libstdc++.so.6 (0x00007fa9a4de9000) 2025-05-07T20:11:14.7286016Z libgcc_s.so.1 => /lib64/libgcc_s.so.1 (0x00007fa9a4dbb000) 2025-05-07T20:11:14.7286399Z libc.so.6 => /lib64/libc.so.6 (0x00007fa9a4bb3000) 2025-05-07T20:11:14.7286897Z /lib64/ld-linux-x86-64.so.2 (0x00007fa9a50c0000) 2025-05-07T20:11:14.7287024Z libm.so.6 => /lib64/libm.so.6 (0x00007fa9a4ad8000) 2025-05-07T20:11:14.7287029Z 2025-05-07T20:11:14.7287141Z [CHECK] Displaying ELF information: 2025-05-07T20:11:14.7287434Z + readelf -d ./_skbuild/linux-x86_64-3.13/cmake-build/fbgemm_gpu_embedding_inplace_ops.so 2025-05-07T20:11:14.7287439Z 2025-05-07T20:11:14.7315990Z 2025-05-07T20:11:14.7316361Z Dynamic section at offset 0x6a540 contains 37 entries: 2025-05-07T20:11:14.7316512Z Tag Type Name/Value 2025-05-07T20:11:14.7316878Z 0x0000000000000001 (NEEDED) Shared library: [libtorch.so] 2025-05-07T20:11:14.7317137Z 0x0000000000000001 (NEEDED) Shared library: [libc10.so] 2025-05-07T20:11:14.7317468Z 0x0000000000000001 (NEEDED) Shared library: [libc10_cuda.so] 2025-05-07T20:11:14.7317742Z 0x0000000000000001 (NEEDED) Shared library: [libtorch_cpu.so] 2025-05-07T20:11:14.7317983Z 0x0000000000000001 (NEEDED) Shared library: [libtorch_cuda.so] 2025-05-07T20:11:14.7318192Z 0x0000000000000001 (NEEDED) Shared library: [libcudart.so.12] 2025-05-07T20:11:14.7318410Z 0x0000000000000001 (NEEDED) Shared library: [libstdc++.so.6] 2025-05-07T20:11:14.7318608Z 0x0000000000000001 (NEEDED) Shared library: [libgcc_s.so.1] 2025-05-07T20:11:14.7318799Z 0x0000000000000001 (NEEDED) Shared library: [libc.so.6] 2025-05-07T20:11:14.7319043Z 0x0000000000000001 (NEEDED) Shared library: [ld-linux-x86-64.so.2] 2025-05-07T20:11:14.7319309Z 0x000000000000000e (SONAME) Library soname: [fbgemm_gpu_embedding_inplace_ops.so] 2025-05-07T20:11:14.7319429Z 0x000000000000000c (INIT) 0xf000 2025-05-07T20:11:14.7319569Z 0x000000000000000d (FINI) 0x2c63c 2025-05-07T20:11:14.7319684Z 0x0000000000000019 (INIT_ARRAY) 0x6b1f8 2025-05-07T20:11:14.7319814Z 0x000000000000001b (INIT_ARRAYSZ) 40 (bytes) 2025-05-07T20:11:14.7319953Z 0x000000000000001a (FINI_ARRAY) 0x6b220 2025-05-07T20:11:14.7320074Z 0x000000000000001c (FINI_ARRAYSZ) 8 (bytes) 2025-05-07T20:11:14.7320180Z 0x0000000000000004 (HASH) 0x238 2025-05-07T20:11:14.7320294Z 0x000000006ffffef5 (GNU_HASH) 0x12b0 2025-05-07T20:11:14.7320422Z 0x0000000000000005 (STRTAB) 0x3ff0 2025-05-07T20:11:14.7320530Z 0x0000000000000006 (SYMTAB) 0x1e78 2025-05-07T20:11:14.7320661Z 0x000000000000000a (STRSZ) 31425 (bytes) 2025-05-07T20:11:14.7320805Z 0x000000000000000b (SYMENT) 24 (bytes) 2025-05-07T20:11:14.7320915Z 0x0000000000000003 (PLTGOT) 0x6b7e0 2025-05-07T20:11:14.7321094Z 0x0000000000000002 (PLTRELSZ) 4320 (bytes) 2025-05-07T20:11:14.7321211Z 0x0000000000000014 (PLTREL) RELA 2025-05-07T20:11:14.7321345Z 0x0000000000000017 (JMPREL) 0xd0f8 2025-05-07T20:11:14.7321451Z 0x0000000000000007 (RELA) 0xbeb0 2025-05-07T20:11:14.7321583Z 0x0000000000000008 (RELASZ) 4680 (bytes) 2025-05-07T20:11:14.7321722Z 0x0000000000000009 (RELAENT) 24 (bytes) 2025-05-07T20:11:14.7321825Z 0x0000000000000018 (BIND_NOW) 2025-05-07T20:11:14.7321953Z 0x000000006ffffffb (FLAGS_1) Flags: NOW 2025-05-07T20:11:14.7322091Z 0x000000006ffffffe (VERNEED) 0xbd80 2025-05-07T20:11:14.7322199Z 0x000000006fffffff (VERNEEDNUM) 5 2025-05-07T20:11:14.7322312Z 0x000000006ffffff0 (VERSYM) 0xbab2 2025-05-07T20:11:14.7322422Z 0x000000006ffffff9 (RELACOUNT) 24 2025-05-07T20:11:14.7322545Z 0x0000000000000000 (NULL) 0x0 2025-05-07T20:11:14.7322552Z 2025-05-07T20:11:14.7322672Z ################################################################################ 2025-05-07T20:11:14.7322678Z 2025-05-07T20:11:14.7322694Z 2025-05-07T20:11:14.7322828Z ################################################################################ 2025-05-07T20:11:14.7323066Z [CHECK] BUILT LIBRARY: ./_skbuild/linux-x86_64-3.13/cmake-build/fbgemm_gpu_py.so 2025-05-07T20:11:14.7323177Z [CHECK] Listing out library size: 2025-05-07T20:11:14.7323415Z + du -h --block-size=1M ./_skbuild/linux-x86_64-3.13/cmake-build/fbgemm_gpu_py.so 2025-05-07T20:11:14.7323420Z 2025-05-07T20:11:14.7329262Z 74 ./_skbuild/linux-x86_64-3.13/cmake-build/fbgemm_gpu_py.so 2025-05-07T20:11:14.7329878Z 2025-05-07T20:11:14.7331405Z [CHECK] Listing out the GLIBC versions referenced by: ./_skbuild/linux-x86_64-3.13/cmake-build/fbgemm_gpu_py.so 2025-05-07T20:11:14.7332740Z + objdump -TC ./_skbuild/linux-x86_64-3.13/cmake-build/fbgemm_gpu_py.so | grep GLIBC_ | sed 's/.*GLIBC_\([.0-9]*\).*/GLIBC_\1/g' | sort -Vu | cat 2025-05-07T20:11:14.7332789Z 2025-05-07T20:11:14.7695868Z GLIBC_2.2.5 2025-05-07T20:11:14.7696424Z GLIBC_2.3 2025-05-07T20:11:14.7696770Z GLIBC_2.14 2025-05-07T20:11:14.7696910Z 2025-05-07T20:11:14.7697094Z 2025-05-07T20:11:14.7699111Z [CHECK] Listing out the GLIBCXX versions referenced by: ./_skbuild/linux-x86_64-3.13/cmake-build/fbgemm_gpu_py.so 2025-05-07T20:11:14.7699662Z + objdump -TC ./_skbuild/linux-x86_64-3.13/cmake-build/fbgemm_gpu_py.so | grep GLIBCXX_ | sed 's/.*GLIBCXX_\([.0-9]*\).*/GLIBCXX_\1/g' | sort -Vu | cat 2025-05-07T20:11:14.7699669Z 2025-05-07T20:11:14.8057326Z GLIBCXX_3.4 2025-05-07T20:11:14.8057604Z GLIBCXX_3.4.9 2025-05-07T20:11:14.8057959Z GLIBCXX_3.4.11 2025-05-07T20:11:14.8058076Z GLIBCXX_3.4.14 2025-05-07T20:11:14.8058181Z GLIBCXX_3.4.15 2025-05-07T20:11:14.8058297Z GLIBCXX_3.4.18 2025-05-07T20:11:14.8058395Z GLIBCXX_3.4.19 2025-05-07T20:11:14.8058487Z GLIBCXX_3.4.20 2025-05-07T20:11:14.8058599Z GLIBCXX_3.4.21 2025-05-07T20:11:14.8058762Z GLIBCXX_3.4.29 2025-05-07T20:11:14.8058780Z 2025-05-07T20:11:14.8058791Z 2025-05-07T20:11:14.8079597Z + nm -gDC ./_skbuild/linux-x86_64-3.13/cmake-build/fbgemm_gpu_py.so > /tmp/tmp.9rcN0leLSQ.symbols.txt 2025-05-07T20:11:14.8079869Z 2025-05-07T20:11:14.8376924Z 2025-05-07T20:11:14.8404219Z [CHECK] Total Number of symbols: 6350 2025-05-07T20:11:14.8437802Z [CHECK] Number of fbgemm symbols: 4411 2025-05-07T20:11:14.8455371Z + nm -gDCu ./_skbuild/linux-x86_64-3.13/cmake-build/fbgemm_gpu_py.so > /tmp/tmp.dmM2pCBfoX.usymbols.txt 2025-05-07T20:11:14.8455380Z 2025-05-07T20:11:14.8498440Z 2025-05-07T20:11:14.8530997Z [CHECK] Listing out undefined symbols (483 total): 2025-05-07T20:11:14.8557559Z U GOMP_parallel 2025-05-07T20:11:14.8558866Z U VTT for std::__cxx11::basic_ostringstream, std::allocator >@GLIBCXX_3.4.21 2025-05-07T20:11:14.8560065Z U VTT for std::__cxx11::basic_stringstream, std::allocator >@GLIBCXX_3.4.21 2025-05-07T20:11:14.8560514Z U VTT for std::basic_ifstream >@GLIBCXX_3.4 2025-05-07T20:11:14.8560738Z U VTT for std::basic_ofstream >@GLIBCXX_3.4 2025-05-07T20:11:14.8560889Z U _Unwind_Resume@GCC_3.0 2025-05-07T20:11:14.8561004Z U __assert_fail@GLIBC_2.2.5 2025-05-07T20:11:14.8561164Z U __cudaPopCallConfiguration@libcudart.so.12 2025-05-07T20:11:14.8561336Z U __cudaPushCallConfiguration@libcudart.so.12 2025-05-07T20:11:14.8561471Z U __cudaRegisterFatBinary@libcudart.so.12 2025-05-07T20:11:14.8561618Z U __cudaRegisterFatBinaryEnd@libcudart.so.12 2025-05-07T20:11:14.8561773Z U __cudaRegisterFunction@libcudart.so.12 2025-05-07T20:11:14.8561899Z U __cudaRegisterVar@libcudart.so.12 2025-05-07T20:11:14.8562041Z U __cudaUnregisterFatBinary@libcudart.so.12 2025-05-07T20:11:14.8562166Z U __cxa_allocate_exception@CXXABI_1.3 2025-05-07T20:11:14.8562300Z U __cxa_atexit@GLIBC_2.2.5 2025-05-07T20:11:14.8562413Z U __cxa_begin_catch@CXXABI_1.3 2025-05-07T20:11:14.8562522Z U __cxa_end_catch@CXXABI_1.3 2025-05-07T20:11:14.8562661Z U __cxa_free_exception@CXXABI_1.3 2025-05-07T20:11:14.8562770Z U __cxa_guard_abort@CXXABI_1.3 2025-05-07T20:11:14.8562882Z U __cxa_guard_acquire@CXXABI_1.3 2025-05-07T20:11:14.8562994Z U __cxa_guard_release@CXXABI_1.3 2025-05-07T20:11:14.8563121Z U __cxa_rethrow@CXXABI_1.3 2025-05-07T20:11:14.8563238Z U __cxa_thread_atexit@CXXABI_1.3.7 2025-05-07T20:11:14.8563342Z U __cxa_throw@CXXABI_1.3 2025-05-07T20:11:14.8563476Z U __gxx_personality_v0@CXXABI_1.3 2025-05-07T20:11:14.8563580Z U __once_proxy@GLIBCXX_3.4.11 2025-05-07T20:11:14.8563681Z U __tls_get_addr@GLIBC_2.3 2025-05-07T20:11:14.8563971Z U at::RecordFunction::RecordFunction(at::StepCallbacks&&) 2025-05-07T20:11:14.8564162Z U at::RecordFunction::currentThreadId() 2025-05-07T20:11:14.8564334Z U at::RecordFunction::~RecordFunction() 2025-05-07T20:11:14.8564511Z U at::SavedTensorDefaultHooks::set_tracing(bool) 2025-05-07T20:11:14.8564635Z U at::SplitUntil32Bit::begin() const 2025-05-07T20:11:14.8564756Z U at::SplitUntil32Bit::end() const 2025-05-07T20:11:14.8564971Z U at::SplitUntil32Bit::iterator::operator*() const 2025-05-07T20:11:14.8565125Z U at::SplitUntil32Bit::iterator::operator++() 2025-05-07T20:11:14.8565354Z U at::Tensor::index(std::initializer_list) const 2025-05-07T20:11:14.8565551Z U at::TensorBase::__dispatch_contiguous(c10::MemoryFormat) const 2025-05-07T20:11:14.8565746Z U at::TensorIteratorBase::build(at::TensorIteratorConfig&) 2025-05-07T20:11:14.8565917Z U at::TensorIteratorBase::can_use_32bit_indexing() const 2025-05-07T20:11:14.8566064Z U at::TensorIteratorBase::data_ptr(long) const 2025-05-07T20:11:14.8566227Z U at::TensorIteratorBase::is_contiguous() const 2025-05-07T20:11:14.8566353Z U at::TensorIteratorBase::numel() const 2025-05-07T20:11:14.8566512Z U at::TensorIteratorBase::with_32bit_indexing() const 2025-05-07T20:11:14.8566748Z U at::TensorIteratorConfig::add_borrowed_input(at::TensorBase const&) 2025-05-07T20:11:14.8566971Z U at::TensorIteratorConfig::add_borrowed_output(at::TensorBase const&) 2025-05-07T20:11:14.8567089Z U at::TensorMaker::make_tensor() 2025-05-07T20:11:14.8567251Z U at::_ops::_is_all_true::call(at::Tensor const&) 2025-05-07T20:11:14.8567399Z U at::_ops::_unique::call(at::Tensor const&, bool, bool) 2025-05-07T20:11:14.8567679Z U at::_ops::add_Tensor::call(at::Tensor const&, at::Tensor const&, c10::Scalar const&) 2025-05-07T20:11:14.8567919Z U at::_ops::add__Tensor::call(at::Tensor&, at::Tensor const&, c10::Scalar const&) 2025-05-07T20:11:14.8568046Z U at::_ops::all::call(at::Tensor const&) 2025-05-07T20:11:14.8568392Z U at::_ops::baddbmm::call(at::Tensor const&, at::Tensor const&, at::Tensor const&, c10::Scalar const&, c10::Scalar const&) 2025-05-07T20:11:14.8568624Z U at::_ops::broadcast_to::call(at::Tensor const&, c10::ArrayRef) 2025-05-07T20:11:14.8568781Z U at::_ops::cat::call(c10::IListRef const&, long) 2025-05-07T20:11:14.8568994Z U at::_ops::cat_out::call(c10::IListRef const&, long, at::Tensor&) 2025-05-07T20:11:14.8569187Z U at::_ops::clamp_max::call(at::Tensor const&, c10::Scalar const&) 2025-05-07T20:11:14.8569402Z U at::_ops::clone::call(at::Tensor const&, std::optional) 2025-05-07T20:11:14.8569590Z U at::_ops::contiguous::call(at::Tensor const&, c10::MemoryFormat) 2025-05-07T20:11:14.8569772Z U at::_ops::copy_::call(at::Tensor&, at::Tensor const&, bool) 2025-05-07T20:11:14.8569993Z U at::_ops::cumsum::call(at::Tensor const&, long, std::optional) 2025-05-07T20:11:14.8570531Z U at::_ops::diff::call(at::Tensor const&, long, long, std::optional const&, std::optional const&) 2025-05-07T20:11:14.8570713Z U at::_ops::div_Scalar::call(at::Tensor const&, c10::Scalar const&) 2025-05-07T20:11:14.8571312Z U at::_ops::empty_like::call(at::Tensor const&, std::optional, std::optional, std::optional, std::optional, std::optional) 2025-05-07T20:11:14.8571996Z U at::_ops::empty_memory_format::call(c10::ArrayRef, std::optional, std::optional, std::optional, std::optional, std::optional) 2025-05-07T20:11:14.8572228Z U at::_ops::fill__Scalar::call(at::Tensor&, c10::Scalar const&) 2025-05-07T20:11:14.8572428Z U at::_ops::flatten_using_ints::call(at::Tensor const&, long, long) 2025-05-07T20:11:14.8572556Z U at::_ops::floor::call(at::Tensor const&) 2025-05-07T20:11:14.8573089Z U at::_ops::full::call(c10::ArrayRef, c10::Scalar const&, std::optional, std::optional, std::optional, std::optional) 2025-05-07T20:11:14.8573270Z U at::_ops::ge_Scalar::call(at::Tensor const&, c10::Scalar const&) 2025-05-07T20:11:14.8573580Z U at::_ops::index_put_::call(at::Tensor&, c10::List > const&, at::Tensor const&, bool) 2025-05-07T20:11:14.8573787Z U at::_ops::index_select::call(at::Tensor const&, long, at::Tensor const&) 2025-05-07T20:11:14.8573924Z U at::_ops::item::call(at::Tensor const&) 2025-05-07T20:11:14.8574093Z U at::_ops::le_Tensor::call(at::Tensor const&, at::Tensor const&) 2025-05-07T20:11:14.8574214Z U at::_ops::max::call(at::Tensor const&) 2025-05-07T20:11:14.8574397Z U at::_ops::mul_Tensor::call(at::Tensor const&, at::Tensor const&) 2025-05-07T20:11:14.8574967Z U at::_ops::ones_like::call(at::Tensor const&, std::optional, std::optional, std::optional, std::optional, std::optional) 2025-05-07T20:11:14.8575147Z U at::_ops::permute::call(at::Tensor const&, c10::ArrayRef) 2025-05-07T20:11:14.8575677Z U at::_ops::range::call(c10::Scalar const&, c10::Scalar const&, std::optional, std::optional, std::optional, std::optional) 2025-05-07T20:11:14.8575958Z U at::_ops::reshape::call(at::Tensor const&, c10::ArrayRef) 2025-05-07T20:11:14.8576275Z U at::_ops::resize_::call(at::Tensor const&, c10::ArrayRef, std::optional) 2025-05-07T20:11:14.8576453Z U at::_ops::select_int::call(at::Tensor const&, long, c10::SymInt) 2025-05-07T20:11:14.8576888Z U at::_ops::set__source_Storage_storage_offset::call(at::Tensor&, c10::Storage, c10::SymInt, c10::ArrayRef, c10::ArrayRef) 2025-05-07T20:11:14.8577262Z U at::_ops::slice_Tensor::call(at::Tensor const&, long, std::optional, std::optional, c10::SymInt) 2025-05-07T20:11:14.8577411Z U at::_ops::sort::call(at::Tensor const&, long, bool) 2025-05-07T20:11:14.8577637Z U at::_ops::split_sizes::call(at::Tensor const&, c10::ArrayRef, long) 2025-05-07T20:11:14.8577804Z U at::_ops::squeeze_dim::call(at::Tensor const&, long) 2025-05-07T20:11:14.8578038Z U at::_ops::sub_Tensor::call(at::Tensor const&, at::Tensor const&, c10::Scalar const&) 2025-05-07T20:11:14.8578230Z U at::_ops::sum::call(at::Tensor const&, std::optional) 2025-05-07T20:11:14.8578504Z U at::_ops::tensor_split_indices::call(at::Tensor const&, c10::ArrayRef, long) 2025-05-07T20:11:14.8578810Z U at::_ops::to_dtype::call(at::Tensor const&, c10::ScalarType, bool, bool, std::optional) 2025-05-07T20:11:14.8579436Z U at::_ops::to_dtype_layout::call(at::Tensor const&, std::optional, std::optional, std::optional, std::optional, bool, bool, std::optional) 2025-05-07T20:11:14.8579620Z U at::_ops::transpose_int::call(at::Tensor const&, long, long) 2025-05-07T20:11:14.8579907Z U at::_ops::unique_consecutive::call(at::Tensor const&, bool, bool, std::optional) 2025-05-07T20:11:14.8580076Z U at::_ops::unsqueeze::call(at::Tensor const&, long) 2025-05-07T20:11:14.8580292Z U at::_ops::view::call(at::Tensor const&, c10::ArrayRef) 2025-05-07T20:11:14.8580457Z U at::_ops::view_as::call(at::Tensor const&, at::Tensor const&) 2025-05-07T20:11:14.8580572Z U at::_ops::zero_::call(at::Tensor&) 2025-05-07T20:11:14.8581066Z U at::_ops::zeros::call(c10::ArrayRef, std::optional, std::optional, std::optional, std::optional) 2025-05-07T20:11:14.8581639Z U at::_ops::zeros_like::call(at::Tensor const&, std::optional, std::optional, std::optional, std::optional, std::optional) 2025-05-07T20:11:14.8581936Z U at::checkScalarTypes(char const*, at::TensorArg const&, c10::ArrayRef) 2025-05-07T20:11:14.8582065Z U at::cuda::getCurrentCUDABlasHandle() 2025-05-07T20:11:14.8582199Z U at::cuda::getCurrentDeviceProperties() 2025-05-07T20:11:14.8582348Z U at::cuda::getDeviceProperties(signed char) 2025-05-07T20:11:14.8582491Z U at::cuda::get_p2p_access(signed char, signed char) 2025-05-07T20:11:14.8582828Z U at::detail::computeStorageNbytes(c10::ArrayRef, c10::ArrayRef, unsigned long, unsigned long) 2025-05-07T20:11:14.8582972Z U at::functorch::functorchTLSAccessor() 2025-05-07T20:11:14.8583134Z U at::getStepCallbacksUnlessEmpty(at::RecordScope) 2025-05-07T20:11:14.8583232Z U at::get_num_threads() 2025-05-07T20:11:14.8583327Z U at::get_thread_num() 2025-05-07T20:11:14.8583440Z U at::in_parallel_region() 2025-05-07T20:11:14.8583562Z U at::init_num_threads() 2025-05-07T20:11:14.8583777Z U at::internal::OpaqueOptionalTensorRef::~OpaqueOptionalTensorRef() 2025-05-07T20:11:14.8583904Z U at::internal::set_thread_num(int) 2025-05-07T20:11:14.8584141Z U at::native::_rowwise_prune(at::Tensor const&, at::Tensor const&, c10::ScalarType) 2025-05-07T20:11:14.8584711Z U at::native::empty_like(at::Tensor const&, std::optional, std::optional, std::optional, std::optional, std::optional) 2025-05-07T20:11:14.8585349Z U at::native::empty_meta_symint(c10::ArrayRef, std::optional, std::optional, std::optional, std::optional, std::optional) 2025-05-07T20:11:14.8585620Z U at::native::resize_(at::Tensor const&, c10::ArrayRef, std::optional) 2025-05-07T20:11:14.8585777Z U at::print(std::ostream&, at::Tensor const&, long) 2025-05-07T20:11:14.8585905Z U at::sequence_number::get_and_increment() 2025-05-07T20:11:14.8586070Z U at::tensor(c10::ArrayRef, c10::TensorOptions const&) 2025-05-07T20:11:14.8586193Z U bool at::Tensor::item() const 2025-05-07T20:11:14.8586325Z U bool* at::TensorBase::data_ptr() const 2025-05-07T20:11:14.8586473Z U bool* at::TensorBase::mutable_data_ptr() const 2025-05-07T20:11:14.8586570Z U c10::AnyType::get() 2025-05-07T20:11:14.8586744Z U c10::AutogradMetaInterface::~AutogradMetaInterface() 2025-05-07T20:11:14.8586918Z U c10::BFloat16* at::TensorBase::data_ptr() const 2025-05-07T20:11:14.8587116Z U c10::BFloat16* at::TensorBase::mutable_data_ptr() const 2025-05-07T20:11:14.8587219Z U c10::BoolType::get() 2025-05-07T20:11:14.8587330Z U c10::DeviceObjType::get() 2025-05-07T20:11:14.8587512Z U c10::DeviceTypeName[abi:cxx11](c10::DeviceType, bool) 2025-05-07T20:11:14.8587728Z U c10::Dispatcher::findSchemaOrThrow(char const*, char const*) 2025-05-07T20:11:14.8587871Z U c10::Dispatcher::realSingleton() 2025-05-07T20:11:14.8588389Z U c10::Dispatcher::runRecordFunction(at::RecordFunction&, std::reference_wrapper, c10::DispatchKey, c10::DispatchKeySet) 2025-05-07T20:11:14.8589029Z U c10::Dispatcher::runRecordFunction(at::RecordFunction&, std::reference_wrapper, c10::DispatchKey, c10::DispatchKeySet, c10::ArrayRef) 2025-05-07T20:11:14.8589404Z U c10::Error::Error(c10::SourceLocation, std::__cxx11::basic_string, std::allocator >) 2025-05-07T20:11:14.8589524Z U c10::Error::what() const 2025-05-07T20:11:14.8589620Z U c10::FloatType::get() 2025-05-07T20:11:14.8589720Z U c10::GradMode::is_enabled() 2025-05-07T20:11:14.8589826Z U c10::GradMode::set_enabled(bool) 2025-05-07T20:11:14.8589978Z U c10::Half* at::TensorBase::data_ptr() const 2025-05-07T20:11:14.8590144Z U c10::Half* at::TensorBase::mutable_data_ptr() const 2025-05-07T20:11:14.8590291Z U c10::IValue::TagType::get(c10::IValue const&) 2025-05-07T20:11:14.8590407Z U c10::IValue::hash(c10::IValue const&) 2025-05-07T20:11:14.8590508Z U c10::IValue::isBoolList() const 2025-05-07T20:11:14.8590626Z U c10::IValue::isIntList() const 2025-05-07T20:11:14.8590755Z U c10::IValue::isSymIntList() const 2025-05-07T20:11:14.8590866Z U c10::IValue::isTensorList() const 2025-05-07T20:11:14.8591003Z U c10::IValue::reportToTensorTypeError() const 2025-05-07T20:11:14.8591166Z U c10::InferenceMode::is_enabled() 2025-05-07T20:11:14.8591264Z U c10::IntType::get() 2025-05-07T20:11:14.8591744Z U c10::ListType::get(std::__cxx11::basic_string, std::allocator > const&, c10::Type::SingletonOrSharedTypePtr) 2025-05-07T20:11:14.8592045Z U c10::MessageLogger::MessageLogger(char const*, int, int) 2025-05-07T20:11:14.8592170Z U c10::MessageLogger::~MessageLogger() 2025-05-07T20:11:14.8592291Z U c10::OperatorHandle::~OperatorHandle() 2025-05-07T20:11:14.8592428Z U c10::OperatorHandle::~OperatorHandle() 2025-05-07T20:11:14.8592644Z U c10::OptionalType::get(c10::Type::SingletonOrSharedTypePtr) 2025-05-07T20:11:14.8592767Z U c10::ParallelGuard::ParallelGuard(bool) 2025-05-07T20:11:14.8592884Z U c10::ParallelGuard::~ParallelGuard() 2025-05-07T20:11:14.8593009Z U c10::ScalarTypeType::get() 2025-05-07T20:11:14.8593286Z U c10::SmallVectorBase::grow_pod(void const*, unsigned long, unsigned long) 2025-05-07T20:11:14.8593602Z U c10::SmallVectorBase::mallocForGrow(unsigned long, unsigned long, unsigned long&) 2025-05-07T20:11:14.8593776Z U c10::StorageImpl::throw_data_ptr_access_error() const 2025-05-07T20:11:14.8593874Z U c10::StringType::get() 2025-05-07T20:11:14.8594015Z U c10::SymBool::expect_true(char const*, long) const 2025-05-07T20:11:14.8594164Z U c10::SymBool::guard_bool(char const*, long) const 2025-05-07T20:11:14.8594310Z U c10::SymFloat::guard_float(char const*, long) const 2025-05-07T20:11:14.8594709Z U c10::SymInt::SymInt(c10::intrusive_ptr >) 2025-05-07T20:11:14.8594858Z U c10::SymInt::guard_int(char const*, long) const 2025-05-07T20:11:14.8595027Z U c10::SymInt::operator%(c10::SymInt const&) const 2025-05-07T20:11:14.8595188Z U c10::SymInt::operator*(c10::SymInt const&) const 2025-05-07T20:11:14.8595364Z U c10::SymInt::operator/(c10::SymInt const&) const 2025-05-07T20:11:14.8595479Z U c10::SymInt::promote_to_negative() 2025-05-07T20:11:14.8595608Z U c10::SymInt::sym_eq(c10::SymInt const&) const 2025-05-07T20:11:14.8595745Z U c10::SymInt::sym_ne(c10::SymInt const&) const 2025-05-07T20:11:14.8595849Z U c10::SymInt::toSymNode() const 2025-05-07T20:11:14.8595947Z U c10::SymIntType::get() 2025-05-07T20:11:14.8596176Z U c10::SymbolicShapeMeta::init_is_channels_last_3d_contiguous() const 2025-05-07T20:11:14.8596370Z U c10::SymbolicShapeMeta::init_is_channels_last_contiguous() const 2025-05-07T20:11:14.8596520Z U c10::SymbolicShapeMeta::init_is_contiguous() const 2025-05-07T20:11:14.8596656Z U c10::TensorImpl::requires_grad() const 2025-05-07T20:11:14.8597086Z U c10::TensorImpl::set_autograd_meta(std::unique_ptr >) 2025-05-07T20:11:14.8597232Z U c10::TensorImpl::throw_data_ptr_access_error() const 2025-05-07T20:11:14.8597398Z U c10::TensorImpl::throw_storage_access_error() const 2025-05-07T20:11:14.8597500Z U c10::TensorType::get() 2025-05-07T20:11:14.8598366Z U c10::TupleType::TupleType(std::vector, std::allocator > >, std::optional, std::shared_ptr) 2025-05-07T20:11:14.8598576Z U c10::Type::isSubtypeOfExt(c10::Type const&, std::ostream*) const 2025-05-07T20:11:14.8598713Z U c10::Type::is_module() const 2025-05-07T20:11:14.8598835Z U c10::UndefinedTensorImpl::_singleton 2025-05-07T20:11:14.8599550Z U c10::Warning::Warning(std::variant, c10::SourceLocation const&, std::__cxx11::basic_string, std::allocator >, bool) 2025-05-07T20:11:14.8599681Z U c10::cuda::CUDACachingAllocator::allocator 2025-05-07T20:11:14.8599850Z U c10::cuda::CUDAKernelLaunchRegistry::get_singleton_ref() 2025-05-07T20:11:14.8600130Z U c10::cuda::CUDAKernelLaunchRegistry::get_uvm_assertions_ptr_for_current_device() 2025-05-07T20:11:14.8600462Z U c10::cuda::CUDAKernelLaunchRegistry::insert(char const*, char const*, unsigned int, char const*, int) 2025-05-07T20:11:14.8600579Z U c10::cuda::CUDAStream::stream() const 2025-05-07T20:11:14.8600708Z U c10::cuda::ExchangeDevice(signed char) 2025-05-07T20:11:14.8600822Z U c10::cuda::GetDevice(signed char*) 2025-05-07T20:11:14.8600945Z U c10::cuda::MaybeSetDevice(signed char) 2025-05-07T20:11:14.8601065Z U c10::cuda::SetDevice(signed char) 2025-05-07T20:11:14.8601316Z U c10::cuda::c10_cuda_check_implementation(int, char const*, char const*, int, bool) 2025-05-07T20:11:14.8601423Z U c10::cuda::current_device() 2025-05-07T20:11:14.8601575Z U c10::cuda::device_count() 2025-05-07T20:11:14.8601713Z U c10::cuda::getCurrentCUDAStream(signed char) 2025-05-07T20:11:14.8601850Z U c10::cuda::getDefaultCUDAStream(signed char) 2025-05-07T20:11:14.8602005Z U c10::cuda::getStreamFromPool(bool, signed char) 2025-05-07T20:11:14.8602144Z U c10::cuda::getStreamFromPool(int, signed char) 2025-05-07T20:11:14.8602304Z U c10::cuda::setCurrentCUDAStream(c10::cuda::CUDAStream) 2025-05-07T20:11:14.8602456Z U c10::cuda::warn_or_error_on_sync() 2025-05-07T20:11:14.8602904Z U c10::detail::ListImpl::ListImpl(std::vector >, c10::Type::SingletonOrSharedTypePtr) 2025-05-07T20:11:14.8603456Z U c10::detail::infer_schema::make_function_schema(c10::ArrayRef, c10::ArrayRef) 2025-05-07T20:11:14.8603727Z U c10::detail::torchCheckFail(char const*, char const*, unsigned int, char const*) 2025-05-07T20:11:14.8604205Z U c10::detail::torchCheckFail(char const*, char const*, unsigned int, std::__cxx11::basic_string, std::allocator > const&) 2025-05-07T20:11:14.8604537Z U c10::detail::torchInternalAssertFail(char const*, char const*, unsigned int, char const*, char const*) 2025-05-07T20:11:14.8605122Z U c10::detail::torchInternalAssertFail(char const*, char const*, unsigned int, char const*, std::__cxx11::basic_string, std::allocator > const&) 2025-05-07T20:11:14.8605399Z U c10::impl::ExcludeDispatchKeyGuard::ExcludeDispatchKeyGuard(c10::DispatchKeySet) 2025-05-07T20:11:14.8605618Z U c10::impl::ExcludeDispatchKeyGuard::~ExcludeDispatchKeyGuard() 2025-05-07T20:11:14.8605734Z U c10::impl::GPUTrace::gpuTraceState 2025-05-07T20:11:14.8605841Z U c10::impl::GPUTrace::haveState 2025-05-07T20:11:14.8606157Z U c10::impl::OperatorEntry::assertSignatureIsCorrect(c10::impl::CppSignature const&, bool) const 2025-05-07T20:11:14.8606535Z U c10::impl::OperatorEntry::reportError(c10::DispatchKey) const 2025-05-07T20:11:14.8606660Z U c10::impl::PyObjectSlot::PyObjectSlot() 2025-05-07T20:11:14.8606787Z U c10::impl::PyObjectSlot::~PyObjectSlot() 2025-05-07T20:11:14.8606995Z U c10::impl::cow::is_cow_data_ptr(c10::DataPtr const&) 2025-05-07T20:11:14.8607165Z U c10::impl::cow::materialize_cow_storage(c10::StorageImpl&) 2025-05-07T20:11:14.8607288Z U c10::impl::device_guard_impl_registry 2025-05-07T20:11:14.8607433Z U c10::impl::raw_local_dispatch_key_set 2025-05-07T20:11:14.8607592Z U c10::initializeFunctionalityOffsetsAndMasks() 2025-05-07T20:11:14.8607961Z U c10::ivalue::ConstantString::create(std::__cxx11::basic_string, std::allocator >) 2025-05-07T20:11:14.8608108Z U c10::operator*(c10::SymInt const&, int) 2025-05-07T20:11:14.8608223Z U c10::operator+(c10::SymInt const&, int) 2025-05-07T20:11:14.8608364Z U c10::operator+(c10::SymInt const&, unsigned long) 2025-05-07T20:11:14.8608510Z U c10::operator-(c10::SymInt const&, int) 2025-05-07T20:11:14.8608656Z U c10::operator-(c10::SymInt const&, unsigned long) 2025-05-07T20:11:14.8608774Z U c10::operator/(c10::SymInt const&, int) 2025-05-07T20:11:14.8608914Z U c10::operator<(c10::SymInt const&, int) 2025-05-07T20:11:14.8609064Z U c10::operator<<(std::ostream&, c10::Device const&) 2025-05-07T20:11:14.8609196Z U c10::operator<<(std::ostream&, c10::DeviceType) 2025-05-07T20:11:14.8609375Z U c10::operator<<(std::ostream&, c10::OperatorName const&) 2025-05-07T20:11:14.8609523Z U c10::operator<<(std::ostream&, c10::SymInt const&) 2025-05-07T20:11:14.8609640Z U c10::operator==(c10::SymInt const&, int) 2025-05-07T20:11:14.8609782Z U c10::operator>(c10::SymInt const&, int) 2025-05-07T20:11:14.8609907Z U c10::operator>=(c10::SymInt const&, int) 2025-05-07T20:11:14.8610023Z U c10::report_overflow(char const*) 2025-05-07T20:11:14.8610231Z U c10::throwNullDataPtrError() 2025-05-07T20:11:14.8610435Z U c10::typeKindToString(c10::TypeKind) 2025-05-07T20:11:14.8610571Z U c10::warn(c10::Warning const&) 2025-05-07T20:11:14.8610686Z U c10::warnDeprecatedDataPtr() 2025-05-07T20:11:14.8610941Z U caffe2::TypeMeta::error_unsupported_typemeta(caffe2::TypeMeta) 2025-05-07T20:11:14.8611060Z U caffe2::TypeMeta::typeMetaDatas() 2025-05-07T20:11:14.8611177Z U cublasGemmStridedBatchedEx 2025-05-07T20:11:14.8611301Z U cublasSetStream_v2 2025-05-07T20:11:14.8611436Z U cudaDeviceGetAttribute@libcudart.so.12 2025-05-07T20:11:14.8611571Z U cudaDeviceGetByPCIBusId@libcudart.so.12 2025-05-07T20:11:14.8611723Z U cudaDeviceSynchronize@libcudart.so.12 2025-05-07T20:11:14.8611865Z U cudaEventCreateWithFlags@libcudart.so.12 2025-05-07T20:11:14.8611985Z U cudaEventDestroy@libcudart.so.12 2025-05-07T20:11:14.8612117Z U cudaEventElapsedTime@libcudart.so.12 2025-05-07T20:11:14.8612244Z U cudaEventQuery@libcudart.so.12 2025-05-07T20:11:14.8612363Z U cudaEventRecord@libcudart.so.12 2025-05-07T20:11:14.8612493Z U cudaEventSynchronize@libcudart.so.12 2025-05-07T20:11:14.8612617Z U cudaFree@libcudart.so.12 2025-05-07T20:11:14.8612742Z U cudaFuncGetAttributes@libcudart.so.12 2025-05-07T20:11:14.8612872Z U cudaFuncSetAttribute@libcudart.so.12 2025-05-07T20:11:14.8612988Z U cudaGetDevice@libcudart.so.12 2025-05-07T20:11:14.8613133Z U cudaGetDeviceCount@libcudart.so.12 2025-05-07T20:11:14.8613271Z U cudaGetDeviceProperties_v2@libcudart.so.12 2025-05-07T20:11:14.8613396Z U cudaGetErrorString@libcudart.so.12 2025-05-07T20:11:14.8613541Z U cudaGetLastError@libcudart.so.12 2025-05-07T20:11:14.8613710Z U cudaHostGetDevicePointer@libcudart.so.12 2025-05-07T20:11:14.8613828Z U cudaHostRegister@libcudart.so.12 2025-05-07T20:11:14.8613973Z U cudaHostUnregister@libcudart.so.12 2025-05-07T20:11:14.8614087Z U cudaLaunchKernel@libcudart.so.12 2025-05-07T20:11:14.8614208Z U cudaMallocManaged@libcudart.so.12 2025-05-07T20:11:14.8614324Z U cudaMemAdvise@libcudart.so.12 2025-05-07T20:11:14.8614466Z U cudaMemPrefetchAsync@libcudart.so.12 2025-05-07T20:11:14.8614589Z U cudaMemcpy2DAsync@libcudart.so.12 2025-05-07T20:11:14.8614708Z U cudaMemcpyAsync@libcudart.so.12 2025-05-07T20:11:14.8614841Z U cudaMemsetAsync@libcudart.so.12 2025-05-07T20:11:14.8615143Z U cudaOccupancyMaxActiveBlocksPerMultiprocessorWithFlags@libcudart.so.12 2025-05-07T20:11:14.8615272Z U cudaPeekAtLastError@libcudart.so.12 2025-05-07T20:11:14.8615406Z U cudaSetDevice@libcudart.so.12 2025-05-07T20:11:14.8615523Z U cudaStreamQuery@libcudart.so.12 2025-05-07T20:11:14.8615660Z U cudaStreamSynchronize@libcudart.so.12 2025-05-07T20:11:14.8615785Z U cudaStreamWaitEvent@libcudart.so.12 2025-05-07T20:11:14.8615956Z U double* at::TensorBase::data_ptr() const 2025-05-07T20:11:14.8616128Z U double* at::TensorBase::mutable_data_ptr() const 2025-05-07T20:11:14.8616235Z U exit@GLIBC_2.2.5 2025-05-07T20:11:14.8616365Z U exp10@GLIBC_2.2.5 2025-05-07T20:11:14.8616469Z U exp@GLIBC_2.2.5 2025-05-07T20:11:14.8616566Z U expf@GLIBC_2.2.5 2025-05-07T20:11:14.8616799Z U fbgemm_gpu::asynchronous_complete_cumsum_cpu(at::Tensor const&) 2025-05-07T20:11:14.8617007Z U fbgemm_gpu::asynchronous_complete_cumsum_gpu(at::Tensor const&) 2025-05-07T20:11:14.8617247Z U fbgemm_gpu::asynchronous_exclusive_cumsum_cpu(at::Tensor const&) 2025-05-07T20:11:14.8617484Z U fbgemm_gpu::asynchronous_exclusive_cumsum_gpu(at::Tensor const&) 2025-05-07T20:11:14.8617748Z U fbgemm_gpu::asynchronous_inclusive_cumsum_gpu(at::Tensor const&) 2025-05-07T20:11:14.8617900Z U float* at::TensorBase::data_ptr() const 2025-05-07T20:11:14.8618095Z U float* at::TensorBase::mutable_data_ptr() const 2025-05-07T20:11:14.8618199Z U fminf@GLIBC_2.2.5 2025-05-07T20:11:14.8618298Z U fmod@GLIBC_2.2.5 2025-05-07T20:11:14.8618394Z U free@GLIBC_2.2.5 2025-05-07T20:11:14.8618543Z U get_info_B_num_bits_from_T(int, int) 2025-05-07T20:11:14.8618664Z U int at::Tensor::item() const 2025-05-07T20:11:14.8618835Z U int const* at::TensorBase::const_data_ptr() const 2025-05-07T20:11:14.8618993Z U int* at::TensorBase::data_ptr() const 2025-05-07T20:11:14.8619144Z U int* at::TensorBase::mutable_data_ptr() const 2025-05-07T20:11:14.8619254Z U lgamma@GLIBC_2.2.5 2025-05-07T20:11:14.8619382Z U llrint@GLIBC_2.2.5 2025-05-07T20:11:14.8619485Z U log10@GLIBC_2.2.5 2025-05-07T20:11:14.8619582Z U log2@GLIBC_2.2.5 2025-05-07T20:11:14.8619682Z U log@GLIBC_2.2.5 2025-05-07T20:11:14.8619825Z U long at::Tensor::item() const 2025-05-07T20:11:14.8620005Z U long c10::detail::maybe_wrap_dim_slow(long, long, bool) 2025-05-07T20:11:14.8620180Z U long const* at::TensorBase::const_data_ptr() const 2025-05-07T20:11:14.8620347Z U long* at::TensorBase::data_ptr() const 2025-05-07T20:11:14.8620505Z U long* at::TensorBase::mutable_data_ptr() const 2025-05-07T20:11:14.8620634Z U lrint@GLIBC_2.2.5 2025-05-07T20:11:14.8620767Z U madvise@GLIBC_2.2.5 2025-05-07T20:11:14.8620868Z U malloc@GLIBC_2.2.5 2025-05-07T20:11:14.8620975Z U memcmp@GLIBC_2.2.5 2025-05-07T20:11:14.8621100Z U memcpy@GLIBC_2.14 2025-05-07T20:11:14.8621207Z U memmove@GLIBC_2.2.5 2025-05-07T20:11:14.8621306Z U memset@GLIBC_2.2.5 2025-05-07T20:11:14.8621414Z U nvmlDeviceGetCount_v2 2025-05-07T20:11:14.8621563Z U nvmlDeviceGetHandleByIndex_v2 2025-05-07T20:11:14.8621715Z U nvmlDeviceGetNvLinkRemotePciInfo_v2 2025-05-07T20:11:14.8621838Z U nvmlDeviceGetNvLinkState 2025-05-07T20:11:14.8621981Z U nvmlDeviceGetPciInfo_v3 2025-05-07T20:11:14.8622091Z U nvmlInit_v2 2025-05-07T20:11:14.8622196Z U omp_get_num_threads 2025-05-07T20:11:14.8622304Z U omp_get_thread_num 2025-05-07T20:11:14.8622495Z U operator delete(void*, unsigned long)@CXXABI_1.3.9 2025-05-07T20:11:14.8622629Z U operator new(unsigned long)@GLIBCXX_3.4 2025-05-07T20:11:14.8622771Z U operator new[](unsigned long)@GLIBCXX_3.4 2025-05-07T20:11:14.8622899Z U pow@GLIBC_2.2.5 2025-05-07T20:11:14.8623007Z U realloc@GLIBC_2.2.5 2025-05-07T20:11:14.8623173Z U short* at::TensorBase::mutable_data_ptr() const 2025-05-07T20:11:14.8623401Z U signed char* at::TensorBase::mutable_data_ptr() const 2025-05-07T20:11:14.8623502Z U sin@GLIBC_2.2.5 2025-05-07T20:11:14.8623724Z U std::_Hash_bytes(void const*, unsigned long, unsigned long)@CXXABI_1.3.5 2025-05-07T20:11:14.8623928Z U std::_Rb_tree_decrement(std::_Rb_tree_node_base*)@GLIBCXX_3.4 2025-05-07T20:11:14.8624125Z U std::_Rb_tree_increment(std::_Rb_tree_node_base const*)@GLIBCXX_3.4 2025-05-07T20:11:14.8624373Z U std::_Rb_tree_increment(std::_Rb_tree_node_base*)@GLIBCXX_3.4 2025-05-07T20:11:14.8624793Z U std::_Rb_tree_insert_and_rebalance(bool, std::_Rb_tree_node_base*, std::_Rb_tree_node_base*, std::_Rb_tree_node_base&)@GLIBCXX_3.4 2025-05-07T20:11:14.8625003Z U std::__basic_file::~__basic_file()@GLIBCXX_3.4 2025-05-07T20:11:14.8625358Z U std::__cxx11::basic_ostringstream, std::allocator >::basic_ostringstream() 2025-05-07T20:11:14.8625786Z U std::__cxx11::basic_ostringstream, std::allocator >::~basic_ostringstream()@GLIBCXX_3.4.21 2025-05-07T20:11:14.8626136Z U std::__cxx11::basic_stringbuf, std::allocator >::str() const &@GLIBCXX_3.4.29 2025-05-07T20:11:14.8626526Z U std::__cxx11::basic_stringstream, std::allocator >::~basic_stringstream()@GLIBCXX_3.4.21 2025-05-07T20:11:14.8626933Z U std::__detail::_Prime_rehash_policy::_M_need_rehash(unsigned long, unsigned long, unsigned long) const@GLIBCXX_3.4.18 2025-05-07T20:11:14.8627091Z U std::__exception_ptr::exception_ptr::_M_addref() 2025-05-07T20:11:14.8627248Z U std::__exception_ptr::exception_ptr::_M_release() 2025-05-07T20:11:14.8627394Z U std::__once_call@GLIBCXX_3.4.11 2025-05-07T20:11:14.8627523Z U std::__once_callable@GLIBCXX_3.4.11 2025-05-07T20:11:14.8627651Z U std::__throw_bad_alloc()@GLIBCXX_3.4 2025-05-07T20:11:14.8627799Z U std::__throw_bad_array_new_length() 2025-05-07T20:11:14.8627922Z U std::__throw_bad_cast()@GLIBCXX_3.4 2025-05-07T20:11:14.8628070Z U std::__throw_bad_function_call()@GLIBCXX_3.4.14 2025-05-07T20:11:14.8628249Z U std::__throw_length_error(char const*)@GLIBCXX_3.4 2025-05-07T20:11:14.8628418Z U std::__throw_logic_error(char const*)@GLIBCXX_3.4 2025-05-07T20:11:14.8628791Z U std::__throw_out_of_range(char const*)@GLIBCXX_3.4 2025-05-07T20:11:14.8629008Z U std::__throw_out_of_range_fmt(char const*, ...)@GLIBCXX_3.4.20 2025-05-07T20:11:14.8629155Z U std::__throw_system_error(int)@GLIBCXX_3.4.11 2025-05-07T20:11:14.8629342Z U std::bad_weak_ptr::~bad_weak_ptr()@GLIBCXX_3.4.15 2025-05-07T20:11:14.8629621Z U std::basic_filebuf >::basic_filebuf()@GLIBCXX_3.4 2025-05-07T20:11:14.8629888Z U std::basic_filebuf >::close()@GLIBCXX_3.4 2025-05-07T20:11:14.8630200Z U std::basic_filebuf >::open(char const*, std::_Ios_Openmode)@GLIBCXX_3.4 2025-05-07T20:11:14.8630466Z U std::basic_filebuf >::~basic_filebuf()@GLIBCXX_3.4 2025-05-07T20:11:14.8630711Z U std::basic_ifstream >::~basic_ifstream()@GLIBCXX_3.4 2025-05-07T20:11:14.8630956Z U std::basic_ios >::clear(std::_Ios_Iostate)@GLIBCXX_3.4 2025-05-07T20:11:14.8631341Z U std::basic_ios >::init(std::basic_streambuf >*)@GLIBCXX_3.4 2025-05-07T20:11:14.8631590Z U std::basic_ofstream >::~basic_ofstream()@GLIBCXX_3.4 2025-05-07T20:11:14.8632172Z U std::basic_ostream >& std::__ostream_insert >(std::basic_ostream >&, char const*, long)@GLIBCXX_3.4.9 2025-05-07T20:11:14.8632718Z U std::basic_ostream >& std::operator<< >(std::basic_ostream >&, char const*)@GLIBCXX_3.4 2025-05-07T20:11:14.8632883Z U std::chrono::_V2::system_clock::now()@GLIBCXX_3.4.19 2025-05-07T20:11:14.8633055Z U std::cout@GLIBCXX_3.4 2025-05-07T20:11:14.8633306Z U std::ctype::_M_widen_init() const@GLIBCXX_3.4.11 2025-05-07T20:11:14.8633479Z U std::current_exception()@CXXABI_1.3.3 2025-05-07T20:11:14.8633620Z U std::exception::~exception()@GLIBCXX_3.4 2025-05-07T20:11:14.8633777Z U std::ios_base::Init::Init()@GLIBCXX_3.4 2025-05-07T20:11:14.8633909Z U std::ios_base::Init::~Init()@GLIBCXX_3.4 2025-05-07T20:11:14.8634037Z U std::ios_base::ios_base()@GLIBCXX_3.4 2025-05-07T20:11:14.8634190Z U std::ios_base::~ios_base()@GLIBCXX_3.4 2025-05-07T20:11:14.8634315Z U std::locale::locale()@GLIBCXX_3.4 2025-05-07T20:11:14.8634443Z U std::locale::~locale()@GLIBCXX_3.4 2025-05-07T20:11:14.8634677Z U std::ostream& std::ostream::_M_insert(double)@GLIBCXX_3.4.9 2025-05-07T20:11:14.8634874Z U std::ostream& std::ostream::_M_insert(long)@GLIBCXX_3.4.9 2025-05-07T20:11:14.8635123Z U std::ostream& std::ostream::_M_insert(unsigned long)@GLIBCXX_3.4.9 2025-05-07T20:11:14.8635276Z U std::ostream::flush()@GLIBCXX_3.4 2025-05-07T20:11:14.8635411Z U std::ostream::operator<<(int)@GLIBCXX_3.4 2025-05-07T20:11:14.8635537Z U std::ostream::put(char)@GLIBCXX_3.4 2025-05-07T20:11:14.8635692Z U std::ostream::write(char const*, long)@GLIBCXX_3.4 2025-05-07T20:11:14.8635891Z U std::out_of_range::out_of_range(char const*)@GLIBCXX_3.4.21 2025-05-07T20:11:14.8636037Z U std::out_of_range::~out_of_range()@GLIBCXX_3.4 2025-05-07T20:11:14.8636277Z U std::rethrow_exception(std::__exception_ptr::exception_ptr)@CXXABI_1.3.3 2025-05-07T20:11:14.8636729Z U std::runtime_error::runtime_error(std::__cxx11::basic_string, std::allocator > const&)@GLIBCXX_3.4.21 2025-05-07T20:11:14.8638223Z U std::runtime_error::~runtime_error()@GLIBCXX_3.4 2025-05-07T20:11:14.8638350Z U std::terminate()@GLIBCXX_3.4 2025-05-07T20:11:14.8638486Z U strcmp@GLIBC_2.2.5 2025-05-07T20:11:14.8638591Z U strlen@GLIBC_2.2.5 2025-05-07T20:11:14.8638698Z U sysconf@GLIBC_2.2.5 2025-05-07T20:11:14.8638860Z U torch::CppFunction::~CppFunction() 2025-05-07T20:11:14.8639460Z U torch::Library::Library(torch::Library::Kind, std::__cxx11::basic_string, std::allocator >, std::optional, char const*, unsigned int) 2025-05-07T20:11:14.8639935Z U torch::Library::_def(c10::FunctionSchema&&, c10::OperatorName*, std::vector > const&, torch::_RegisterOrVerify) & 2025-05-07T20:11:14.8640473Z U torch::Library::_def(std::variant&&, torch::CppFunction&&, std::vector > const&) & 2025-05-07T20:11:14.8640746Z U torch::Library::_impl(char const*, torch::CppFunction&&, torch::_RegisterOrVerify) & 2025-05-07T20:11:14.8640904Z U torch::autograd::AnomalyMode::_enabled 2025-05-07T20:11:14.8641213Z U torch::autograd::AutogradContext::AutogradContext(torch::dynamo::autograd::PackedArgs&) 2025-05-07T20:11:14.8641518Z U torch::autograd::AutogradContext::get_and_bump_dirty() const 2025-05-07T20:11:14.8641742Z U torch::autograd::AutogradContext::get_non_differentiable() const 2025-05-07T20:11:14.8641925Z U torch::autograd::AutogradContext::get_saved_variables() const 2025-05-07T20:11:14.8642259Z U torch::autograd::AutogradContext::save_for_backward(std::vector >) 2025-05-07T20:11:14.8642435Z U torch::autograd::AutogradContext::save_variables() 2025-05-07T20:11:14.8642656Z U torch::autograd::ForwardADLevel::try_get_by_idx(unsigned long) 2025-05-07T20:11:14.8642882Z U torch::autograd::InputMetadata::shape_as_dim_vector() const 2025-05-07T20:11:14.8643028Z U torch::autograd::Node::assign_parent() 2025-05-07T20:11:14.8643138Z U torch::autograd::Node::metadata() 2025-05-07T20:11:14.8643274Z U torch::autograd::Node::name[abi:cxx11]() const 2025-05-07T20:11:14.8643693Z U torch::autograd::SavedVariable::SavedVariable(at::Tensor const&, bool, bool) 2025-05-07T20:11:14.8643994Z U torch::autograd::SavedVariable::unpack(std::shared_ptr) const 2025-05-07T20:11:14.8644143Z U torch::autograd::VariableInfo::VariableInfo() 2025-05-07T20:11:14.8644364Z U torch::autograd::VariableInfo::VariableInfo(at::Tensor const&, bool) 2025-05-07T20:11:14.8644620Z U torch::autograd::VariableInfo::zeros(c10::OptionalDeviceGuard&) const 2025-05-07T20:11:14.8647261Z U torch::autograd::_wrap_outputs(std::vector > const&, std::unordered_set, std::equal_to, std::allocator > const&, std::unordered_set, std::equal_to, std::allocator > const&, c10::ArrayRef >, std::shared_ptr const&, std::function > (std::vector >, std::vector >)> const&, std::unordered_set, std::equal_to, std::allocator > const&, std::function const&) 2025-05-07T20:11:14.8647462Z U torch::autograd::deleteNode(torch::autograd::Node*) 2025-05-07T20:11:14.8647621Z U torch::autograd::get_current_graph_task_exec_info() 2025-05-07T20:11:14.8647827Z U torch::autograd::impl::gradient_edge(at::Tensor const&) 2025-05-07T20:11:14.8647997Z U torch::dynamo::autograd::getPyCompilerInterface() 2025-05-07T20:11:14.8648403Z U torch::dynamo::autograd::get_input_metadata(std::vector > const&) 2025-05-07T20:11:14.8648791Z U torch::jit::parseSchema(std::__cxx11::basic_string, std::allocator > const&, bool) 2025-05-07T20:11:14.8649189Z U torch::jit::parseSchemaOrName(std::__cxx11::basic_string, std::allocator > const&, bool) 2025-05-07T20:11:14.8649393Z U torch::pickle_load(std::vector > const&) 2025-05-07T20:11:14.8649551Z U torch::pickle_save(c10::IValue const&) 2025-05-07T20:11:14.8650096Z U transpose_embedding_input(at::Tensor, long, at::Tensor, at::Tensor, bool, std::optional const&, long, long, long, bool, std::optional const&, long, long) 2025-05-07T20:11:14.8650335Z U typeinfo for c10::Error 2025-05-07T20:11:14.8650452Z U typeinfo for c10::Type 2025-05-07T20:11:14.8650781Z U typeinfo for std::bad_weak_ptr@GLIBCXX_3.4.15 2025-05-07T20:11:14.8650923Z U typeinfo for std::exception@GLIBCXX_3.4 2025-05-07T20:11:14.8651091Z U typeinfo for std::out_of_range@GLIBCXX_3.4 2025-05-07T20:11:14.8651301Z U typeinfo for std::runtime_error@GLIBCXX_3.4 2025-05-07T20:11:14.8651433Z U typeinfo for torch::autograd::Node 2025-05-07T20:11:14.8651667Z U unsigned char* at::TensorBase::data_ptr() const 2025-05-07T20:11:14.8651928Z U unsigned char* at::TensorBase::mutable_data_ptr() const 2025-05-07T20:11:14.8652577Z U void fbgemm::FloatOrHalfToFused8BitRowwiseQuantizedSBFloat(float const*, unsigned long, int, unsigned char*) 2025-05-07T20:11:14.8653203Z U void fbgemm::FloatOrHalfToFused8BitRowwiseQuantizedSBFloat(unsigned short const*, unsigned long, int, unsigned char*) 2025-05-07T20:11:14.8653671Z U void fbgemm::FloatOrHalfToFusedNBitRowwiseQuantizedSBHalf(int, float const*, unsigned long, int, unsigned char*) 2025-05-07T20:11:14.8654239Z U void fbgemm::FloatOrHalfToFusedNBitRowwiseQuantizedSBHalf(int, unsigned short const*, unsigned long, int, unsigned char*) 2025-05-07T20:11:14.8654694Z U void fbgemm::Fused8BitRowwiseQuantizedSBFloatToFloatOrHalf(unsigned char const*, unsigned long, int, float*) 2025-05-07T20:11:14.8655229Z U void fbgemm::Fused8BitRowwiseQuantizedSBFloatToFloatOrHalf(unsigned char const*, unsigned long, int, unsigned short*) 2025-05-07T20:11:14.8655750Z U void fbgemm::FusedNBitRowwiseQuantizedSBHalfToFloatOrHalf(int, unsigned char const*, unsigned long, int, float*, bool) 2025-05-07T20:11:14.8656306Z U void fbgemm::FusedNBitRowwiseQuantizedSBHalfToFloatOrHalf(int, unsigned char const*, unsigned long, int, unsigned short*, bool) 2025-05-07T20:11:14.8656834Z U void fbgemm::FusedNBitRowwiseQuantizedSBHalfToFloatOrHalfRef(int, unsigned char const*, unsigned long, int, float*, bool) 2025-05-07T20:11:14.8657467Z U void fbgemm::FusedNBitRowwiseQuantizedSBHalfToFloatOrHalfRef(int, unsigned char const*, unsigned long, int, unsigned short*, bool) 2025-05-07T20:11:14.8658071Z U void fbgemm::FusedNBitRowwiseQuantizedSBHalfToFloatOrHalfRef(int, unsigned char const*, unsigned long, int, unsigned short*, bool) 2025-05-07T20:11:14.8658277Z U vtable for __cxxabiv1::__class_type_info@CXXABI_1.3 2025-05-07T20:11:14.8658457Z U vtable for __cxxabiv1::__function_type_info@CXXABI_1.3 2025-05-07T20:11:14.8658623Z U vtable for __cxxabiv1::__pointer_type_info@CXXABI_1.3 2025-05-07T20:11:14.8658823Z U vtable for __cxxabiv1::__si_class_type_info@CXXABI_1.3 2025-05-07T20:11:14.8658995Z U vtable for __cxxabiv1::__vmi_class_type_info@CXXABI_1.3 2025-05-07T20:11:14.8659113Z U vtable for at::TensorIterator 2025-05-07T20:11:14.8659240Z U vtable for at::TensorIteratorBase 2025-05-07T20:11:14.8659387Z U vtable for c10::Error 2025-05-07T20:11:14.8659500Z U vtable for c10::ListType 2025-05-07T20:11:14.8659855Z U vtable for std::__cxx11::basic_ostringstream, std::allocator >@GLIBCXX_3.4.21 2025-05-07T20:11:14.8660222Z U vtable for std::__cxx11::basic_stringbuf, std::allocator >@GLIBCXX_3.4.21 2025-05-07T20:11:14.8660580Z U vtable for std::__cxx11::basic_stringstream, std::allocator >@GLIBCXX_3.4.21 2025-05-07T20:11:14.8660741Z U vtable for std::bad_weak_ptr@GLIBCXX_3.4.15 2025-05-07T20:11:14.8660964Z U vtable for std::basic_filebuf >@GLIBCXX_3.4 2025-05-07T20:11:14.8661195Z U vtable for std::basic_ifstream >@GLIBCXX_3.4 2025-05-07T20:11:14.8661423Z U vtable for std::basic_ios >@GLIBCXX_3.4 2025-05-07T20:11:14.8661657Z U vtable for std::basic_ofstream >@GLIBCXX_3.4 2025-05-07T20:11:14.8661890Z U vtable for std::basic_streambuf >@GLIBCXX_3.4 2025-05-07T20:11:14.8662078Z U vtable for torch::autograd::AutogradMeta 2025-05-07T20:11:14.8662203Z U vtable for torch::autograd::Node 2025-05-07T20:11:14.8662413Z w TLS init function for c10::impl::raw_local_dispatch_key_set 2025-05-07T20:11:14.8662554Z w _ITM_deregisterTMCloneTable 2025-05-07T20:11:14.8662689Z w _ITM_registerTMCloneTable 2025-05-07T20:11:14.8662794Z w __cxa_finalize@GLIBC_2.2.5 2025-05-07T20:11:14.8662914Z w __cxa_pure_virtual@CXXABI_1.3 2025-05-07T20:11:14.8663032Z w __gmon_start__ 2025-05-07T20:11:14.8663135Z w __pthread_key_create 2025-05-07T20:11:14.8663250Z w pthread_mutex_lock@GLIBC_2.2.5 2025-05-07T20:11:14.8663372Z w pthread_mutex_unlock@GLIBC_2.2.5 2025-05-07T20:11:14.8663479Z w pthread_once 2025-05-07T20:11:14.8663631Z [CHECK] Listing out external shared libraries linked: 2025-05-07T20:11:14.8663820Z + ldd ./_skbuild/linux-x86_64-3.13/cmake-build/fbgemm_gpu_py.so 2025-05-07T20:11:14.8663827Z 2025-05-07T20:11:14.8664008Z linux-vdso.so.1 (0x00007ffed32dd000) 2025-05-07T20:11:14.8664109Z libc10.so => not found 2025-05-07T20:11:14.8664239Z libc10_cuda.so => not found 2025-05-07T20:11:14.8664610Z fbgemm.so => /__w/FBGEMM/FBGEMM/fbgemm_gpu/./_skbuild/linux-x86_64-3.13/cmake-build/fbgemm.so (0x00007fa945200000) 2025-05-07T20:11:14.8664721Z libnvidia-ml.so.1 => not found 2025-05-07T20:11:14.8664819Z libtorch.so => not found 2025-05-07T20:11:14.8665393Z fbgemm_gpu_sparse_async_cumsum.so => /__w/FBGEMM/FBGEMM/fbgemm_gpu/./_skbuild/linux-x86_64-3.13/cmake-build/fbgemm_gpu_sparse_async_cumsum.so (0x00007fa945050000) 2025-05-07T20:11:14.8665853Z fbgemm_gpu_tbe_utils.so => /__w/FBGEMM/FBGEMM/fbgemm_gpu/./_skbuild/linux-x86_64-3.13/cmake-build/fbgemm_gpu_tbe_utils.so (0x00007fa943e00000) 2025-05-07T20:11:14.8665959Z libtorch_cpu.so => not found 2025-05-07T20:11:14.8666102Z libtorch_cuda.so => not found 2025-05-07T20:11:14.8666196Z libcudart.so.12 => not found 2025-05-07T20:11:14.8666358Z libstdc++.so.6 => /lib64/libstdc++.so.6 (0x00007fa943b9c000) 2025-05-07T20:11:14.8666511Z libm.so.6 => /lib64/libm.so.6 (0x00007fa944f75000) 2025-05-07T20:11:14.8666660Z libgcc_s.so.1 => /lib64/libgcc_s.so.1 (0x00007fa94a5f6000) 2025-05-07T20:11:14.8666786Z libc.so.6 => /lib64/libc.so.6 (0x00007fa943994000) 2025-05-07T20:11:14.8666942Z /lib64/ld-linux-x86-64.so.2 (0x00007fa94a62c000) 2025-05-07T20:11:14.8667028Z libc10.so => not found 2025-05-07T20:11:14.8667392Z asmjit.so => /__w/FBGEMM/FBGEMM/fbgemm_gpu/./_skbuild/linux-x86_64-3.13/cmake-build/asmjit.so (0x00007fa945785000) 2025-05-07T20:11:14.8667490Z libtorch.so => not found 2025-05-07T20:11:14.8667606Z libtorch_cpu.so => not found 2025-05-07T20:11:14.8667697Z libtorch_cuda.so => not found 2025-05-07T20:11:14.8667791Z libc10.so => not found 2025-05-07T20:11:14.8667912Z libc10_cuda.so => not found 2025-05-07T20:11:14.8668007Z libtorch.so => not found 2025-05-07T20:11:14.8668102Z libtorch_cpu.so => not found 2025-05-07T20:11:14.8668208Z libtorch_cuda.so => not found 2025-05-07T20:11:14.8668326Z libcudart.so.12 => not found 2025-05-07T20:11:14.8668419Z libtorch.so => not found 2025-05-07T20:11:14.8668506Z libc10.so => not found 2025-05-07T20:11:14.8668627Z libc10_cuda.so => not found 2025-05-07T20:11:14.8668722Z libtorch_cpu.so => not found 2025-05-07T20:11:14.8668819Z libtorch_cuda.so => not found 2025-05-07T20:11:14.8668924Z libcudart.so.12 => not found 2025-05-07T20:11:14.8669040Z libtorch_cpu.so => not found 2025-05-07T20:11:14.8669142Z libtorch_cuda.so => not found 2025-05-07T20:11:14.8669239Z libtorch.so => not found 2025-05-07T20:11:14.8669244Z 2025-05-07T20:11:14.8669371Z [CHECK] Displaying ELF information: 2025-05-07T20:11:14.8669576Z + readelf -d ./_skbuild/linux-x86_64-3.13/cmake-build/fbgemm_gpu_py.so 2025-05-07T20:11:14.8669581Z 2025-05-07T20:11:14.8669585Z 2025-05-07T20:11:14.8669754Z Dynamic section at offset 0x4953578 contains 43 entries: 2025-05-07T20:11:14.8669931Z Tag Type Name/Value 2025-05-07T20:11:14.8670128Z 0x0000000000000001 (NEEDED) Shared library: [libc10.so] 2025-05-07T20:11:14.8670380Z 0x0000000000000001 (NEEDED) Shared library: [libc10_cuda.so] 2025-05-07T20:11:14.8670576Z 0x0000000000000001 (NEEDED) Shared library: [fbgemm.so] 2025-05-07T20:11:14.8670801Z 0x0000000000000001 (NEEDED) Shared library: [libnvidia-ml.so.1] 2025-05-07T20:11:14.8670992Z 0x0000000000000001 (NEEDED) Shared library: [libtorch.so] 2025-05-07T20:11:14.8671243Z 0x0000000000000001 (NEEDED) Shared library: [fbgemm_gpu_sparse_async_cumsum.so] 2025-05-07T20:11:14.8671484Z 0x0000000000000001 (NEEDED) Shared library: [fbgemm_gpu_tbe_utils.so] 2025-05-07T20:11:14.8671689Z 0x0000000000000001 (NEEDED) Shared library: [libtorch_cpu.so] 2025-05-07T20:11:14.8671889Z 0x0000000000000001 (NEEDED) Shared library: [libtorch_cuda.so] 2025-05-07T20:11:14.8672118Z 0x0000000000000001 (NEEDED) Shared library: [libcudart.so.12] 2025-05-07T20:11:14.8672323Z 0x0000000000000001 (NEEDED) Shared library: [libstdc++.so.6] 2025-05-07T20:11:14.8672515Z 0x0000000000000001 (NEEDED) Shared library: [libm.so.6] 2025-05-07T20:11:14.8672719Z 0x0000000000000001 (NEEDED) Shared library: [libgcc_s.so.1] 2025-05-07T20:11:14.8672910Z 0x0000000000000001 (NEEDED) Shared library: [libc.so.6] 2025-05-07T20:11:14.8673133Z 0x0000000000000001 (NEEDED) Shared library: [ld-linux-x86-64.so.2] 2025-05-07T20:11:14.8673365Z 0x000000000000000e (SONAME) Library soname: [fbgemm_gpu_py.so] 2025-05-07T20:11:14.8673545Z 0x000000000000000f (RPATH) Library rpath: [$ORIGIN] 2025-05-07T20:11:14.8673660Z 0x000000000000000c (INIT) 0x18e000 2025-05-07T20:11:14.8673782Z 0x000000000000000d (FINI) 0x7e464c 2025-05-07T20:11:14.8673942Z 0x0000000000000019 (INIT_ARRAY) 0x494d470 2025-05-07T20:11:14.8674081Z 0x000000000000001b (INIT_ARRAYSZ) 1160 (bytes) 2025-05-07T20:11:14.8674206Z 0x000000000000001a (FINI_ARRAY) 0x494d8f8 2025-05-07T20:11:14.8674355Z 0x000000000000001c (FINI_ARRAYSZ) 8 (bytes) 2025-05-07T20:11:14.8674468Z 0x0000000000000004 (HASH) 0x238 2025-05-07T20:11:14.8674583Z 0x000000006ffffef5 (GNU_HASH) 0x8530 2025-05-07T20:11:14.8674695Z 0x0000000000000005 (STRTAB) 0x363a0 2025-05-07T20:11:14.8674823Z 0x0000000000000006 (SYMTAB) 0x11038 2025-05-07T20:11:14.8674970Z 0x000000000000000a (STRSZ) 1209140 (bytes) 2025-05-07T20:11:14.8675089Z 0x000000000000000b (SYMENT) 24 (bytes) 2025-05-07T20:11:14.8675230Z 0x0000000000000003 (PLTGOT) 0x4954868 2025-05-07T20:11:14.8675364Z 0x0000000000000002 (PLTRELSZ) 42168 (bytes) 2025-05-07T20:11:14.8675477Z 0x0000000000000014 (PLTREL) RELA 2025-05-07T20:11:14.8675617Z 0x0000000000000017 (JMPREL) 0x183378 2025-05-07T20:11:14.8675730Z 0x0000000000000007 (RELA) 0x160a28 2025-05-07T20:11:14.8675874Z 0x0000000000000008 (RELASZ) 141648 (bytes) 2025-05-07T20:11:14.8675999Z 0x0000000000000009 (RELAENT) 24 (bytes) 2025-05-07T20:11:14.8676118Z 0x0000000000000018 (BIND_NOW) 2025-05-07T20:11:14.8676239Z 0x000000006ffffffb (FLAGS_1) Flags: NOW 2025-05-07T20:11:14.8676357Z 0x000000006ffffffe (VERNEED) 0x160878 2025-05-07T20:11:14.8676493Z 0x000000006fffffff (VERNEEDNUM) 6 2025-05-07T20:11:14.8676603Z 0x000000006ffffff0 (VERSYM) 0x15d6d4 2025-05-07T20:11:14.8676709Z 0x000000006ffffff9 (RELACOUNT) 516 2025-05-07T20:11:14.8676833Z 0x0000000000000000 (NULL) 0x0 2025-05-07T20:11:14.8676838Z 2025-05-07T20:11:14.8676957Z ################################################################################ 2025-05-07T20:11:14.8676964Z 2025-05-07T20:11:14.8676968Z 2025-05-07T20:11:14.8677108Z ################################################################################ 2025-05-07T20:11:14.8677463Z [CHECK] BUILT LIBRARY: ./_skbuild/linux-x86_64-3.13/cmake-build/fbgemm_gpu_tbe_training_backward.so 2025-05-07T20:11:14.8677596Z [CHECK] Listing out library size: 2025-05-07T20:11:14.8677902Z + du -h --block-size=1M ./_skbuild/linux-x86_64-3.13/cmake-build/fbgemm_gpu_tbe_training_backward.so 2025-05-07T20:11:14.8677907Z 2025-05-07T20:11:14.8678500Z 908 ./_skbuild/linux-x86_64-3.13/cmake-build/fbgemm_gpu_tbe_training_backward.so 2025-05-07T20:11:14.8678513Z 2025-05-07T20:11:14.8678962Z [CHECK] Listing out the GLIBC versions referenced by: ./_skbuild/linux-x86_64-3.13/cmake-build/fbgemm_gpu_tbe_training_backward.so 2025-05-07T20:11:14.8679501Z + objdump -TC ./_skbuild/linux-x86_64-3.13/cmake-build/fbgemm_gpu_tbe_training_backward.so | grep GLIBC_ | sed 's/.*GLIBC_\([.0-9]*\).*/GLIBC_\1/g' | sort -Vu | cat 2025-05-07T20:11:14.8679534Z 2025-05-07T20:11:15.0538785Z GLIBC_2.2.5 2025-05-07T20:11:15.0539405Z GLIBC_2.3 2025-05-07T20:11:15.0539981Z GLIBC_2.14 2025-05-07T20:11:15.0540340Z 2025-05-07T20:11:15.0540377Z 2025-05-07T20:11:15.0541733Z [CHECK] Listing out the GLIBCXX versions referenced by: ./_skbuild/linux-x86_64-3.13/cmake-build/fbgemm_gpu_tbe_training_backward.so 2025-05-07T20:11:15.0545008Z + objdump -TC ./_skbuild/linux-x86_64-3.13/cmake-build/fbgemm_gpu_tbe_training_backward.so | grep GLIBCXX_ | sed 's/.*GLIBCXX_\([.0-9]*\).*/GLIBCXX_\1/g' | sort -Vu | cat 2025-05-07T20:11:15.0546926Z 2025-05-07T20:11:15.2376665Z GLIBCXX_3.4 2025-05-07T20:11:15.2377272Z GLIBCXX_3.4.9 2025-05-07T20:11:15.2377935Z GLIBCXX_3.4.11 2025-05-07T20:11:15.2378632Z GLIBCXX_3.4.14 2025-05-07T20:11:15.2378867Z GLIBCXX_3.4.15 2025-05-07T20:11:15.2379075Z GLIBCXX_3.4.18 2025-05-07T20:11:15.2379313Z GLIBCXX_3.4.20 2025-05-07T20:11:15.2379547Z GLIBCXX_3.4.21 2025-05-07T20:11:15.2379749Z GLIBCXX_3.4.29 2025-05-07T20:11:15.2379871Z 2025-05-07T20:11:15.2380069Z 2025-05-07T20:11:15.2400131Z + nm -gDC ./_skbuild/linux-x86_64-3.13/cmake-build/fbgemm_gpu_tbe_training_backward.so > /tmp/tmp.lAmfIeHecv.symbols.txt 2025-05-07T20:11:15.2400789Z 2025-05-07T20:11:15.4201335Z 2025-05-07T20:11:15.4284977Z [CHECK] Total Number of symbols: 12349 2025-05-07T20:11:15.4368463Z [CHECK] Number of fbgemm symbols: 2031 2025-05-07T20:11:15.4388192Z + nm -gDCu ./_skbuild/linux-x86_64-3.13/cmake-build/fbgemm_gpu_tbe_training_backward.so > /tmp/tmp.3qrK07YALg.usymbols.txt 2025-05-07T20:11:15.4388757Z 2025-05-07T20:11:15.4453330Z 2025-05-07T20:11:15.4481235Z [CHECK] Listing out undefined symbols (289 total): 2025-05-07T20:11:15.4494487Z U GOMP_parallel 2025-05-07T20:11:15.4496145Z U VTT for std::__cxx11::basic_ostringstream, std::allocator >@GLIBCXX_3.4.21 2025-05-07T20:11:15.4498465Z U VTT for std::__cxx11::basic_stringstream, std::allocator >@GLIBCXX_3.4.21 2025-05-07T20:11:15.4499874Z U _Unwind_Resume@GCC_3.0 2025-05-07T20:11:15.4500246Z U __cudaPopCallConfiguration@libcudart.so.12 2025-05-07T20:11:15.4500696Z U __cudaPushCallConfiguration@libcudart.so.12 2025-05-07T20:11:15.4501110Z U __cudaRegisterFatBinary@libcudart.so.12 2025-05-07T20:11:15.4501532Z U __cudaRegisterFatBinaryEnd@libcudart.so.12 2025-05-07T20:11:15.4501950Z U __cudaRegisterFunction@libcudart.so.12 2025-05-07T20:11:15.4502322Z U __cudaRegisterVar@libcudart.so.12 2025-05-07T20:11:15.4502732Z U __cudaUnregisterFatBinary@libcudart.so.12 2025-05-07T20:11:15.4503112Z U __cxa_allocate_exception@CXXABI_1.3 2025-05-07T20:11:15.4503463Z U __cxa_atexit@GLIBC_2.2.5 2025-05-07T20:11:15.4503777Z U __cxa_begin_catch@CXXABI_1.3 2025-05-07T20:11:15.4504109Z U __cxa_end_catch@CXXABI_1.3 2025-05-07T20:11:15.4504425Z U __cxa_free_exception@CXXABI_1.3 2025-05-07T20:11:15.4504978Z U __cxa_guard_abort@CXXABI_1.3 2025-05-07T20:11:15.4505326Z U __cxa_guard_acquire@CXXABI_1.3 2025-05-07T20:11:15.4505730Z U __cxa_guard_release@CXXABI_1.3 2025-05-07T20:11:15.4506129Z U __cxa_rethrow@CXXABI_1.3 2025-05-07T20:11:15.4506449Z U __cxa_thread_atexit@CXXABI_1.3.7 2025-05-07T20:11:15.4506787Z U __cxa_throw@CXXABI_1.3 2025-05-07T20:11:15.4507102Z U __gxx_personality_v0@CXXABI_1.3 2025-05-07T20:11:15.4507441Z U __tls_get_addr@GLIBC_2.3 2025-05-07T20:11:15.4507774Z U at::CUDAGeneratorImpl::device_type() 2025-05-07T20:11:15.4508196Z U at::CUDAGeneratorImpl::philox_cuda_state(unsigned long) 2025-05-07T20:11:15.4508644Z U at::Context::deterministicAlgorithms() const 2025-05-07T20:11:15.4509065Z U at::RecordFunction::RecordFunction(at::StepCallbacks&&) 2025-05-07T20:11:15.4509519Z U at::RecordFunction::currentThreadId() 2025-05-07T20:11:15.4509893Z U at::RecordFunction::~RecordFunction() 2025-05-07T20:11:15.4510310Z U at::SavedTensorDefaultHooks::set_tracing(bool) 2025-05-07T20:11:15.4510901Z U at::TensorBase::__dispatch_contiguous(c10::MemoryFormat) const 2025-05-07T20:11:15.4511519Z U at::_ops::clamp::call(at::Tensor const&, std::optional const&, std::optional const&) 2025-05-07T20:11:15.4512132Z U at::_ops::copy_::call(at::Tensor&, at::Tensor const&, bool) 2025-05-07T20:11:15.4512988Z U at::_ops::empty_like::call(at::Tensor const&, std::optional, std::optional, std::optional, std::optional, std::optional) 2025-05-07T20:11:15.4514528Z U at::_ops::empty_memory_format::call(c10::ArrayRef, std::optional, std::optional, std::optional, std::optional, std::optional) 2025-05-07T20:11:15.4515598Z U at::_ops::reshape::call(at::Tensor const&, c10::ArrayRef) 2025-05-07T20:11:15.4516692Z U at::_ops::sparse_coo_tensor_indices_size::call(at::Tensor const&, at::Tensor const&, c10::ArrayRef, std::optional, std::optional, std::optional, std::optional, std::optional) 2025-05-07T20:11:15.4517855Z U at::_ops::to_dtype::call(at::Tensor const&, c10::ScalarType, bool, bool, std::optional) 2025-05-07T20:11:15.4518459Z U at::_ops::unsqueeze::call(at::Tensor const&, long) 2025-05-07T20:11:15.4518896Z U at::_ops::view_as::call(at::Tensor const&, at::Tensor const&) 2025-05-07T20:11:15.4519688Z U at::_ops::zeros::call(c10::ArrayRef, std::optional, std::optional, std::optional, std::optional) 2025-05-07T20:11:15.4520977Z U at::_ops::zeros_like::call(at::Tensor const&, std::optional, std::optional, std::optional, std::optional, std::optional) 2025-05-07T20:11:15.4521800Z U at::cuda::detail::getDefaultCUDAGenerator(signed char) 2025-05-07T20:11:15.4522217Z U at::cuda::getCurrentDeviceProperties() 2025-05-07T20:11:15.4522595Z U at::functorch::functorchTLSAccessor() 2025-05-07T20:11:15.4522977Z U at::getStepCallbacksUnlessEmpty(at::RecordScope) 2025-05-07T20:11:15.4523352Z U at::get_num_threads() 2025-05-07T20:11:15.4523643Z U at::get_thread_num() 2025-05-07T20:11:15.4523944Z U at::globalContext() 2025-05-07T20:11:15.4524236Z U at::in_parallel_region() 2025-05-07T20:11:15.4524547Z U at::init_num_threads() 2025-05-07T20:11:15.4524913Z U at::internal::set_thread_num(int) 2025-05-07T20:11:15.4525252Z U at::sequence_number::get_and_increment() 2025-05-07T20:11:15.4525699Z U at::tensor(c10::ArrayRef, c10::TensorOptions const&) 2025-05-07T20:11:15.4526149Z U at::toAccumulateType(c10::ScalarType, bool) 2025-05-07T20:11:15.4526504Z U c10::AnyType::get() 2025-05-07T20:11:15.4526921Z U c10::BFloat16* at::TensorBase::mutable_data_ptr() const 2025-05-07T20:11:15.4527335Z U c10::BoolType::get() 2025-05-07T20:11:15.4527710Z U c10::DeviceTypeName[abi:cxx11](c10::DeviceType, bool) 2025-05-07T20:11:15.4528154Z U c10::Dispatcher::findSchemaOrThrow(char const*, char const*) 2025-05-07T20:11:15.4529077Z U c10::Dispatcher::realSingleton() 2025-05-07T20:11:15.4529844Z U c10::Dispatcher::runRecordFunction(at::RecordFunction&, std::reference_wrapper, c10::DispatchKey, c10::DispatchKeySet) 2025-05-07T20:11:15.4531197Z U c10::Dispatcher::runRecordFunction(at::RecordFunction&, std::reference_wrapper, c10::DispatchKey, c10::DispatchKeySet, c10::ArrayRef) 2025-05-07T20:11:15.4532337Z U c10::Error::Error(c10::SourceLocation, std::__cxx11::basic_string, std::allocator >) 2025-05-07T20:11:15.4532927Z U c10::Error::what() const 2025-05-07T20:11:15.4533258Z U c10::FloatType::get() 2025-05-07T20:11:15.4533605Z U c10::GeneratorImpl::device() const 2025-05-07T20:11:15.4533943Z U c10::GradMode::is_enabled() 2025-05-07T20:11:15.4534287Z U c10::GradMode::set_enabled(bool) 2025-05-07T20:11:15.4534658Z U c10::Half* at::TensorBase::data_ptr() const 2025-05-07T20:11:15.4535201Z U c10::Half* at::TensorBase::mutable_data_ptr() const 2025-05-07T20:11:15.4535679Z U c10::IValue::TagType::get(c10::IValue const&) 2025-05-07T20:11:15.4536089Z U c10::IValue::hash(c10::IValue const&) 2025-05-07T20:11:15.4536444Z U c10::IValue::isBoolList() const 2025-05-07T20:11:15.4536874Z U c10::IValue::isIntList() const 2025-05-07T20:11:15.4537198Z U c10::IValue::isSymIntList() const 2025-05-07T20:11:15.4537507Z U c10::IValue::isTensorList() const 2025-05-07T20:11:15.4537860Z U c10::IValue::reportToTensorTypeError() const 2025-05-07T20:11:15.4538204Z U c10::IntType::get() 2025-05-07T20:11:15.4538545Z U c10::MessageLogger::MessageLogger(char const*, int, int) 2025-05-07T20:11:15.4538933Z U c10::MessageLogger::~MessageLogger() 2025-05-07T20:11:15.4539264Z U c10::OperatorHandle::~OperatorHandle() 2025-05-07T20:11:15.4539622Z U c10::OperatorHandle::~OperatorHandle() 2025-05-07T20:11:15.4540053Z U c10::OptionalType::get(c10::Type::SingletonOrSharedTypePtr) 2025-05-07T20:11:15.4540580Z U c10::ParallelGuard::ParallelGuard(bool) 2025-05-07T20:11:15.4540921Z U c10::ParallelGuard::~ParallelGuard() 2025-05-07T20:11:15.4541409Z U c10::SmallVectorBase::grow_pod(void const*, unsigned long, unsigned long) 2025-05-07T20:11:15.4541962Z U c10::StorageImpl::throw_data_ptr_access_error() const 2025-05-07T20:11:15.4542335Z U c10::StringType::get() 2025-05-07T20:11:15.4542706Z U c10::SymBool::guard_bool(char const*, long) const 2025-05-07T20:11:15.4543096Z U c10::SymFloat::guard_float(char const*, long) const 2025-05-07T20:11:15.4543760Z U c10::SymInt::SymInt(c10::intrusive_ptr >) 2025-05-07T20:11:15.4544475Z U c10::SymInt::guard_int(char const*, long) const 2025-05-07T20:11:15.4545059Z U c10::SymInt::promote_to_negative() 2025-05-07T20:11:15.4545431Z U c10::SymInt::toSymNode() const 2025-05-07T20:11:15.4545818Z U c10::SymIntType::get() 2025-05-07T20:11:15.4546222Z U c10::SymbolicShapeMeta::init_is_contiguous() const 2025-05-07T20:11:15.4546621Z U c10::TensorImpl::requires_grad() const 2025-05-07T20:11:15.4547231Z U c10::TensorImpl::throw_data_ptr_access_error() const 2025-05-07T20:11:15.4547658Z U c10::TensorType::get() 2025-05-07T20:11:15.4548007Z U c10::UndefinedTensorImpl::_singleton 2025-05-07T20:11:15.4549011Z U c10::Warning::Warning(std::variant, c10::SourceLocation const&, std::__cxx11::basic_string, std::allocator >, bool) 2025-05-07T20:11:15.4550103Z U c10::cuda::CUDACachingAllocator::allocator 2025-05-07T20:11:15.4550494Z U c10::cuda::CUDAStream::stream() const 2025-05-07T20:11:15.4550890Z U c10::cuda::ExchangeDevice(signed char) 2025-05-07T20:11:15.4551252Z U c10::cuda::GetDevice(signed char*) 2025-05-07T20:11:15.4551642Z U c10::cuda::MaybeSetDevice(signed char) 2025-05-07T20:11:15.4552000Z U c10::cuda::SetDevice(signed char) 2025-05-07T20:11:15.4552512Z U c10::cuda::c10_cuda_check_implementation(int, char const*, char const*, int, bool) 2025-05-07T20:11:15.4553016Z U c10::cuda::device_count() 2025-05-07T20:11:15.4553375Z U c10::cuda::getCurrentCUDAStream(signed char) 2025-05-07T20:11:15.4553798Z U c10::cuda::getDefaultCUDAStream(signed char) 2025-05-07T20:11:15.4554287Z U c10::cuda::getStreamFromPool(bool, signed char) 2025-05-07T20:11:15.4554874Z U c10::cuda::getStreamFromPool(int, signed char) 2025-05-07T20:11:15.4555316Z U c10::cuda::setCurrentCUDAStream(c10::cuda::CUDAStream) 2025-05-07T20:11:15.4555719Z U c10::cuda::warn_or_error_on_sync() 2025-05-07T20:11:15.4556418Z U c10::detail::ListImpl::ListImpl(std::vector >, c10::Type::SingletonOrSharedTypePtr) 2025-05-07T20:11:15.4557485Z U c10::detail::infer_schema::make_function_schema(c10::ArrayRef, c10::ArrayRef) 2025-05-07T20:11:15.4558400Z U c10::detail::torchCheckFail(char const*, char const*, unsigned int, char const*) 2025-05-07T20:11:15.4559300Z U c10::detail::torchCheckFail(char const*, char const*, unsigned int, std::__cxx11::basic_string, std::allocator > const&) 2025-05-07T20:11:15.4560263Z U c10::detail::torchInternalAssertFail(char const*, char const*, unsigned int, char const*, char const*) 2025-05-07T20:11:15.4561333Z U c10::detail::torchInternalAssertFail(char const*, char const*, unsigned int, char const*, std::__cxx11::basic_string, std::allocator > const&) 2025-05-07T20:11:15.4562186Z U c10::impl::GPUTrace::gpuTraceState 2025-05-07T20:11:15.4562545Z U c10::impl::GPUTrace::haveState 2025-05-07T20:11:15.4563128Z U c10::impl::OperatorEntry::assertSignatureIsCorrect(c10::impl::CppSignature const&, bool) const 2025-05-07T20:11:15.4563804Z U c10::impl::OperatorEntry::reportError(c10::DispatchKey) const 2025-05-07T20:11:15.4564274Z U c10::impl::cow::is_cow_data_ptr(c10::DataPtr const&) 2025-05-07T20:11:15.4564747Z U c10::impl::cow::materialize_cow_storage(c10::StorageImpl&) 2025-05-07T20:11:15.4565171Z U c10::impl::device_guard_impl_registry 2025-05-07T20:11:15.4565591Z U c10::impl::raw_local_dispatch_key_set 2025-05-07T20:11:15.4566056Z U c10::initializeFunctionalityOffsetsAndMasks() 2025-05-07T20:11:15.4566779Z U c10::ivalue::ConstantString::create(std::__cxx11::basic_string, std::allocator >) 2025-05-07T20:11:15.4567510Z U c10::operator-(c10::SymInt const&, int) 2025-05-07T20:11:15.4567874Z U c10::operator<<(std::ostream&, c10::Device const&) 2025-05-07T20:11:15.4568277Z U c10::operator<<(std::ostream&, c10::DeviceType) 2025-05-07T20:11:15.4568669Z U c10::operator<<(std::ostream&, c10::OperatorName const&) 2025-05-07T20:11:15.4569101Z U c10::operator<<(std::ostream&, c10::SymInt const&) 2025-05-07T20:11:15.4569507Z U c10::operator<=(c10::SymInt const&, int) 2025-05-07T20:11:15.4569864Z U c10::operator>(c10::SymInt const&, int) 2025-05-07T20:11:15.4570341Z U c10::operator>=(c10::SymInt const&, int) 2025-05-07T20:11:15.4570882Z U c10::throwNullDataPtrError() 2025-05-07T20:11:15.4571340Z U c10::warn(c10::Warning const&) 2025-05-07T20:11:15.4571697Z U c10::warnDeprecatedDataPtr() 2025-05-07T20:11:15.4572166Z U caffe2::TypeMeta::error_unsupported_typemeta(caffe2::TypeMeta) 2025-05-07T20:11:15.4572633Z U caffe2::TypeMeta::typeMetaDatas() 2025-05-07T20:11:15.4573005Z U cudaDeviceGetAttribute@libcudart.so.12 2025-05-07T20:11:15.4573420Z U cudaDeviceSynchronize@libcudart.so.12 2025-05-07T20:11:15.4573812Z U cudaEventCreateWithFlags@libcudart.so.12 2025-05-07T20:11:15.4574217Z U cudaEventDestroy@libcudart.so.12 2025-05-07T20:11:15.4574590Z U cudaEventElapsedTime@libcudart.so.12 2025-05-07T20:11:15.4575020Z U cudaEventQuery@libcudart.so.12 2025-05-07T20:11:15.4575400Z U cudaEventRecord@libcudart.so.12 2025-05-07T20:11:15.4575766Z U cudaEventSynchronize@libcudart.so.12 2025-05-07T20:11:15.4576163Z U cudaFuncSetAttribute@libcudart.so.12 2025-05-07T20:11:15.4576557Z U cudaGetDeviceProperties_v2@libcudart.so.12 2025-05-07T20:11:15.4576968Z U cudaGetErrorString@libcudart.so.12 2025-05-07T20:11:15.4577331Z U cudaGetLastError@libcudart.so.12 2025-05-07T20:11:15.4577714Z U cudaLaunchKernel@libcudart.so.12 2025-05-07T20:11:15.4578095Z U cudaStreamQuery@libcudart.so.12 2025-05-07T20:11:15.4578468Z U cudaStreamSynchronize@libcudart.so.12 2025-05-07T20:11:15.4578869Z U cudaStreamWaitEvent@libcudart.so.12 2025-05-07T20:11:15.4579903Z U fbgemm::EmbeddingSpMDMKernelSignature::Type fbgemm::GenerateEmbeddingSpMDMWithStrides(long, bool, bool, int, bool, bool, long, long, bool, bool, bool, bool) 2025-05-07T20:11:15.4581176Z U fbgemm::SparseAdaGradSignature::Type fbgemm::GenerateSparseAdaGrad(int, bool, int, bool) 2025-05-07T20:11:15.4581792Z U fbgemm::fbgemmAlignedFree(void*) 2025-05-07T20:11:15.4582230Z U fbgemm_gpu::asynchronous_complete_cumsum_gpu(at::Tensor const&) 2025-05-07T20:11:15.4582695Z U float at::Tensor::item() const 2025-05-07T20:11:15.4583186Z U float* at::TensorBase::data_ptr() const 2025-05-07T20:11:15.4583618Z U float* at::TensorBase::mutable_data_ptr() const 2025-05-07T20:11:15.4584017Z U free@GLIBC_2.2.5 2025-05-07T20:11:15.4584342Z U int* at::TensorBase::data_ptr() const 2025-05-07T20:11:15.4584755Z U int* at::TensorBase::mutable_data_ptr() const 2025-05-07T20:11:15.4585191Z U long c10::detail::maybe_wrap_dim_slow(long, long, bool) 2025-05-07T20:11:15.4585686Z U long* at::TensorBase::data_ptr() const 2025-05-07T20:11:15.4586126Z U long* at::TensorBase::mutable_data_ptr() const 2025-05-07T20:11:15.4586556Z U memcmp@GLIBC_2.2.5 2025-05-07T20:11:15.4586889Z U memcpy@GLIBC_2.14 2025-05-07T20:11:15.4587192Z U memmove@GLIBC_2.2.5 2025-05-07T20:11:15.4587530Z U memset@GLIBC_2.2.5 2025-05-07T20:11:15.4587830Z U omp_get_num_threads 2025-05-07T20:11:15.4588268Z U omp_get_thread_num 2025-05-07T20:11:15.4588601Z U operator delete(void*, unsigned long)@CXXABI_1.3.9 2025-05-07T20:11:15.4589005Z U operator new(unsigned long)@GLIBCXX_3.4 2025-05-07T20:11:15.4589540Z U radix_sort_pairs(void*, unsigned long&, int const*, int*, float const*, float*, int, int, int, CUstream_st*) 2025-05-07T20:11:15.4590276Z U radix_sort_pairs(void*, unsigned long&, int const*, int*, int const*, int*, int, int, int, CUstream_st*) 2025-05-07T20:11:15.4590995Z U radix_sort_pairs(void*, unsigned long&, int const*, int*, long const*, long*, int, int, int, CUstream_st*) 2025-05-07T20:11:15.4591718Z U radix_sort_pairs(void*, unsigned long&, long const*, long*, float const*, float*, int, int, int, CUstream_st*) 2025-05-07T20:11:15.4592460Z U radix_sort_pairs(void*, unsigned long&, long const*, long*, int const*, int*, int, int, int, CUstream_st*) 2025-05-07T20:11:15.4593197Z U radix_sort_pairs(void*, unsigned long&, long const*, long*, long const*, long*, int, int, int, CUstream_st*) 2025-05-07T20:11:15.4593710Z U realloc@GLIBC_2.2.5 2025-05-07T20:11:15.4594347Z U split_embedding_codegen_forward_cpu(at::Tensor, at::Tensor, at::Tensor, c10::SymInt, at::Tensor, at::Tensor, at::Tensor, long, at::Tensor, long) 2025-05-07T20:11:15.4595303Z U split_embedding_codegen_grad_indice_weights_cpu(at::Tensor, at::Tensor, at::Tensor, at::Tensor, at::Tensor, at::Tensor, at::Tensor) 2025-05-07T20:11:15.4595928Z U sqrt@GLIBC_2.2.5 2025-05-07T20:11:15.4596247Z U sqrtf@GLIBC_2.2.5 2025-05-07T20:11:15.4596645Z U std::_Hash_bytes(void const*, unsigned long, unsigned long)@CXXABI_1.3.5 2025-05-07T20:11:15.4597309Z U std::__cxx11::basic_ostringstream, std::allocator >::basic_ostringstream() 2025-05-07T20:11:15.4598109Z U std::__cxx11::basic_ostringstream, std::allocator >::~basic_ostringstream()@GLIBCXX_3.4.21 2025-05-07T20:11:15.4598939Z U std::__cxx11::basic_stringbuf, std::allocator >::str() const &@GLIBCXX_3.4.29 2025-05-07T20:11:15.4599744Z U std::__detail::_Prime_rehash_policy::_M_need_rehash(unsigned long, unsigned long, unsigned long) const@GLIBCXX_3.4.18 2025-05-07T20:11:15.4600352Z U std::__exception_ptr::exception_ptr::_M_addref() 2025-05-07T20:11:15.4600786Z U std::__exception_ptr::exception_ptr::_M_release() 2025-05-07T20:11:15.4601190Z U std::__throw_bad_alloc()@GLIBCXX_3.4 2025-05-07T20:11:15.4601528Z U std::__throw_bad_array_new_length() 2025-05-07T20:11:15.4601902Z U std::__throw_bad_function_call()@GLIBCXX_3.4.14 2025-05-07T20:11:15.4602287Z U std::__throw_length_error(char const*)@GLIBCXX_3.4 2025-05-07T20:11:15.4602698Z U std::__throw_logic_error(char const*)@GLIBCXX_3.4 2025-05-07T20:11:15.4619042Z U std::__throw_out_of_range_fmt(char const*, ...)@GLIBCXX_3.4.20 2025-05-07T20:11:15.4619667Z U std::__throw_system_error(int)@GLIBCXX_3.4.11 2025-05-07T20:11:15.4620087Z U std::bad_weak_ptr::~bad_weak_ptr()@GLIBCXX_3.4.15 2025-05-07T20:11:15.4620756Z U std::basic_ios >::clear(std::_Ios_Iostate)@GLIBCXX_3.4 2025-05-07T20:11:15.4621590Z U std::basic_ios >::init(std::basic_streambuf >*)@GLIBCXX_3.4 2025-05-07T20:11:15.4622665Z U std::basic_ostream >& std::__ostream_insert >(std::basic_ostream >&, char const*, long)@GLIBCXX_3.4.9 2025-05-07T20:11:15.4623886Z U std::basic_ostream >& std::operator<< >(std::basic_ostream >&, char const*)@GLIBCXX_3.4 2025-05-07T20:11:15.4624639Z U std::current_exception()@CXXABI_1.3.3 2025-05-07T20:11:15.4625056Z U std::exception::~exception()@GLIBCXX_3.4 2025-05-07T20:11:15.4625462Z U std::ios_base::Init::Init()@GLIBCXX_3.4 2025-05-07T20:11:15.4625839Z U std::ios_base::Init::~Init()@GLIBCXX_3.4 2025-05-07T20:11:15.4626221Z U std::ios_base::ios_base()@GLIBCXX_3.4 2025-05-07T20:11:15.4626580Z U std::ios_base::~ios_base()@GLIBCXX_3.4 2025-05-07T20:11:15.4626964Z U std::locale::locale()@GLIBCXX_3.4 2025-05-07T20:11:15.4627309Z U std::locale::~locale()@GLIBCXX_3.4 2025-05-07T20:11:15.4627755Z U std::ostream& std::ostream::_M_insert(long)@GLIBCXX_3.4.9 2025-05-07T20:11:15.4628334Z U std::ostream& std::ostream::_M_insert(unsigned long)@GLIBCXX_3.4.9 2025-05-07T20:11:15.4629051Z U std::ostream::operator<<(int)@GLIBCXX_3.4 2025-05-07T20:11:15.4629514Z U std::out_of_range::out_of_range(char const*)@GLIBCXX_3.4.21 2025-05-07T20:11:15.4629953Z U std::out_of_range::~out_of_range()@GLIBCXX_3.4 2025-05-07T20:11:15.4630466Z U std::rethrow_exception(std::__exception_ptr::exception_ptr)@CXXABI_1.3.3 2025-05-07T20:11:15.4631355Z U std::runtime_error::runtime_error(std::__cxx11::basic_string, std::allocator > const&)@GLIBCXX_3.4.21 2025-05-07T20:11:15.4632051Z U std::runtime_error::~runtime_error()@GLIBCXX_3.4 2025-05-07T20:11:15.4632458Z U std::terminate()@GLIBCXX_3.4 2025-05-07T20:11:15.4632784Z U strcmp@GLIBC_2.2.5 2025-05-07T20:11:15.4633111Z U strlen@GLIBC_2.2.5 2025-05-07T20:11:15.4633464Z U torch::CppFunction::~CppFunction() 2025-05-07T20:11:15.4634301Z U torch::Library::Library(torch::Library::Kind, std::__cxx11::basic_string, std::allocator >, std::optional, char const*, unsigned int) 2025-05-07T20:11:15.4635496Z U torch::Library::_def(c10::FunctionSchema&&, c10::OperatorName*, std::vector > const&, torch::_RegisterOrVerify) & 2025-05-07T20:11:15.4636357Z U torch::Library::_impl(char const*, torch::CppFunction&&, torch::_RegisterOrVerify) & 2025-05-07T20:11:15.4636871Z U torch::autograd::AnomalyMode::_enabled 2025-05-07T20:11:15.4637443Z U torch::autograd::AutogradContext::AutogradContext(torch::dynamo::autograd::PackedArgs&) 2025-05-07T20:11:15.4638051Z U torch::autograd::AutogradContext::get_and_bump_dirty() const 2025-05-07T20:11:15.4638597Z U torch::autograd::AutogradContext::get_non_differentiable() const 2025-05-07T20:11:15.4639149Z U torch::autograd::AutogradContext::get_saved_variables() const 2025-05-07T20:11:15.4639816Z U torch::autograd::AutogradContext::save_for_backward(std::vector >) 2025-05-07T20:11:15.4640470Z U torch::autograd::AutogradContext::save_variables() 2025-05-07T20:11:15.4640944Z U torch::autograd::ForwardADLevel::try_get_by_idx(unsigned long) 2025-05-07T20:11:15.4641529Z U torch::autograd::InputMetadata::shape_as_dim_vector() const 2025-05-07T20:11:15.4642032Z U torch::autograd::Node::assign_parent() 2025-05-07T20:11:15.4644338Z U torch::autograd::Node::metadata() 2025-05-07T20:11:15.4644782Z U torch::autograd::Node::name[abi:cxx11]() const 2025-05-07T20:11:15.4645299Z U torch::autograd::SavedVariable::SavedVariable(at::Tensor const&, bool, bool) 2025-05-07T20:11:15.4645963Z U torch::autograd::SavedVariable::unpack(std::shared_ptr) const 2025-05-07T20:11:15.4646525Z U torch::autograd::VariableInfo::VariableInfo() 2025-05-07T20:11:15.4647011Z U torch::autograd::VariableInfo::VariableInfo(at::Tensor const&, bool) 2025-05-07T20:11:15.4647593Z U torch::autograd::VariableInfo::zeros(c10::OptionalDeviceGuard&) const 2025-05-07T20:11:15.4650773Z U torch::autograd::_wrap_outputs(std::vector > const&, std::unordered_set, std::equal_to, std::allocator > const&, std::unordered_set, std::equal_to, std::allocator > const&, c10::ArrayRef >, std::shared_ptr const&, std::function > (std::vector >, std::vector >)> const&, std::unordered_set, std::equal_to, std::allocator > const&, std::function const&) 2025-05-07T20:11:15.4653730Z U torch::autograd::deleteNode(torch::autograd::Node*) 2025-05-07T20:11:15.4654247Z U torch::autograd::get_current_graph_task_exec_info() 2025-05-07T20:11:15.4654695Z U torch::autograd::impl::gradient_edge(at::Tensor const&) 2025-05-07T20:11:15.4655170Z U torch::dynamo::autograd::getPyCompilerInterface() 2025-05-07T20:11:15.4655860Z U torch::dynamo::autograd::get_input_metadata(std::vector > const&) 2025-05-07T20:11:15.4656784Z U torch::jit::parseSchema(std::__cxx11::basic_string, std::allocator > const&, bool) 2025-05-07T20:11:15.4657845Z U transpose_embedding_input(at::Tensor, long, at::Tensor, at::Tensor, bool, std::optional const&, long, long, long, bool, std::optional const&, long, long) 2025-05-07T20:11:15.4658616Z U typeinfo for c10::Error 2025-05-07T20:11:15.4659008Z U typeinfo for std::bad_weak_ptr@GLIBCXX_3.4.15 2025-05-07T20:11:15.4659406Z U typeinfo for std::exception@GLIBCXX_3.4 2025-05-07T20:11:15.4659818Z U typeinfo for std::out_of_range@GLIBCXX_3.4 2025-05-07T20:11:15.4660229Z U typeinfo for std::runtime_error@GLIBCXX_3.4 2025-05-07T20:11:15.4660606Z U typeinfo for torch::autograd::Node 2025-05-07T20:11:15.4661939Z U void internal::csr2csc(internal::HyperCompressedSparseColumn&, int, at::TensorAccessor const&, at::TensorAccessor const&, at::TensorAccessor const&, long, int const*, long) 2025-05-07T20:11:15.4664196Z U void internal::csr2csc(internal::HyperCompressedSparseColumn&, int, at::TensorAccessor const&, at::TensorAccessor const&, at::TensorAccessor const&, long, int const*, long) 2025-05-07T20:11:15.4665643Z U vtable for __cxxabiv1::__class_type_info@CXXABI_1.3 2025-05-07T20:11:15.4666103Z U vtable for __cxxabiv1::__function_type_info@CXXABI_1.3 2025-05-07T20:11:15.4666755Z U vtable for __cxxabiv1::__pointer_type_info@CXXABI_1.3 2025-05-07T20:11:15.4667194Z U vtable for __cxxabiv1::__si_class_type_info@CXXABI_1.3 2025-05-07T20:11:15.4667708Z U vtable for c10::Error 2025-05-07T20:11:15.4668427Z U vtable for std::__cxx11::basic_ostringstream, std::allocator >@GLIBCXX_3.4.21 2025-05-07T20:11:15.4669237Z U vtable for std::__cxx11::basic_stringbuf, std::allocator >@GLIBCXX_3.4.21 2025-05-07T20:11:15.4670232Z U vtable for std::__cxx11::basic_stringstream, std::allocator >@GLIBCXX_3.4.21 2025-05-07T20:11:15.4670839Z U vtable for std::bad_weak_ptr@GLIBCXX_3.4.15 2025-05-07T20:11:15.4671326Z U vtable for std::basic_ios >@GLIBCXX_3.4 2025-05-07T20:11:15.4671907Z U vtable for std::basic_streambuf >@GLIBCXX_3.4 2025-05-07T20:11:15.4672368Z U vtable for torch::autograd::Node 2025-05-07T20:11:15.4672911Z w TLS init function for c10::impl::raw_local_dispatch_key_set 2025-05-07T20:11:15.4673321Z w _ITM_deregisterTMCloneTable 2025-05-07T20:11:15.4673680Z w _ITM_registerTMCloneTable 2025-05-07T20:11:15.4673999Z w __cxa_finalize@GLIBC_2.2.5 2025-05-07T20:11:15.4674322Z w __gmon_start__ 2025-05-07T20:11:15.4674625Z w __pthread_key_create 2025-05-07T20:11:15.4674939Z w pthread_mutex_lock@GLIBC_2.2.5 2025-05-07T20:11:15.4675290Z w pthread_mutex_unlock@GLIBC_2.2.5 2025-05-07T20:11:15.4675701Z [CHECK] Listing out external shared libraries linked: 2025-05-07T20:11:15.4676370Z + ldd ./_skbuild/linux-x86_64-3.13/cmake-build/fbgemm_gpu_tbe_training_backward.so 2025-05-07T20:11:15.4676736Z 2025-05-07T20:11:15.4676900Z linux-vdso.so.1 (0x00007ffc2eff7000) 2025-05-07T20:11:15.4677209Z libc10.so => not found 2025-05-07T20:11:15.4677447Z libc10_cuda.so => not found 2025-05-07T20:11:15.4678103Z fbgemm_gpu_tbe_common.so => /__w/FBGEMM/FBGEMM/fbgemm_gpu/./_skbuild/linux-x86_64-3.13/cmake-build/fbgemm_gpu_tbe_common.so (0x00007fc491800000) 2025-05-07T20:11:15.4679228Z fbgemm_gpu_sparse_async_cumsum.so => /__w/FBGEMM/FBGEMM/fbgemm_gpu/./_skbuild/linux-x86_64-3.13/cmake-build/fbgemm_gpu_sparse_async_cumsum.so (0x00007fc491650000) 2025-05-07T20:11:15.4679967Z libtorch.so => not found 2025-05-07T20:11:15.4680491Z fbgemm.so => /__w/FBGEMM/FBGEMM/fbgemm_gpu/./_skbuild/linux-x86_64-3.13/cmake-build/fbgemm.so (0x00007fc491000000) 2025-05-07T20:11:15.4681402Z fbgemm_gpu_tbe_utils.so => /__w/FBGEMM/FBGEMM/fbgemm_gpu/./_skbuild/linux-x86_64-3.13/cmake-build/fbgemm_gpu_tbe_utils.so (0x00007fc48fe00000) 2025-05-07T20:11:15.4682118Z libtorch_cpu.so => not found 2025-05-07T20:11:15.4682439Z libtorch_cuda.so => not found 2025-05-07T20:11:15.4682718Z libcudart.so.12 => not found 2025-05-07T20:11:15.4683078Z libstdc++.so.6 => /lib64/libstdc++.so.6 (0x00007fc48fb9c000) 2025-05-07T20:11:15.4683467Z libm.so.6 => /lib64/libm.so.6 (0x00007fc491575000) 2025-05-07T20:11:15.4683976Z libgcc_s.so.1 => /lib64/libgcc_s.so.1 (0x00007fc4cbd3b000) 2025-05-07T20:11:15.4684455Z libc.so.6 => /lib64/libc.so.6 (0x00007fc48f994000) 2025-05-07T20:11:15.4684818Z /lib64/ld-linux-x86-64.so.2 (0x00007fc4cbd71000) 2025-05-07T20:11:15.4685123Z libc10.so => not found 2025-05-07T20:11:15.4685352Z libc10_cuda.so => not found 2025-05-07T20:11:15.4685938Z fbgemm_gpu_config.so => /__w/FBGEMM/FBGEMM/fbgemm_gpu/./_skbuild/linux-x86_64-3.13/cmake-build/fbgemm_gpu_config.so (0x00007fc491bf4000) 2025-05-07T20:11:15.4686546Z libtorch.so => not found 2025-05-07T20:11:15.4686795Z libtorch_cpu.so => not found 2025-05-07T20:11:15.4687062Z libtorch_cuda.so => not found 2025-05-07T20:11:15.4687315Z libcudart.so.12 => not found 2025-05-07T20:11:15.4687758Z libc10.so => not found 2025-05-07T20:11:15.4688058Z libc10_cuda.so => not found 2025-05-07T20:11:15.4688372Z libtorch.so => not found 2025-05-07T20:11:15.4688644Z libtorch_cpu.so => not found 2025-05-07T20:11:15.4688913Z libtorch_cuda.so => not found 2025-05-07T20:11:15.4689211Z libcudart.so.12 => not found 2025-05-07T20:11:15.4689495Z libc10.so => not found 2025-05-07T20:11:15.4689998Z asmjit.so => /__w/FBGEMM/FBGEMM/fbgemm_gpu/./_skbuild/linux-x86_64-3.13/cmake-build/asmjit.so (0x00007fc491b75000) 2025-05-07T20:11:15.4690868Z libtorch.so => not found 2025-05-07T20:11:15.4691133Z libtorch_cpu.so => not found 2025-05-07T20:11:15.4691498Z libtorch_cuda.so => not found 2025-05-07T20:11:15.4691774Z libtorch.so => not found 2025-05-07T20:11:15.4692064Z libc10.so => not found 2025-05-07T20:11:15.4692324Z libc10_cuda.so => not found 2025-05-07T20:11:15.4692624Z libtorch_cpu.so => not found 2025-05-07T20:11:15.4692924Z libtorch_cuda.so => not found 2025-05-07T20:11:15.4693206Z libcudart.so.12 => not found 2025-05-07T20:11:15.4693489Z libc10.so => not found 2025-05-07T20:11:15.4693746Z libtorch_cpu.so => not found 2025-05-07T20:11:15.4694047Z libtorch_cuda.so => not found 2025-05-07T20:11:15.4694325Z libtorch.so => not found 2025-05-07T20:11:15.4694610Z libtorch_cpu.so => not found 2025-05-07T20:11:15.4694886Z libtorch_cuda.so => not found 2025-05-07T20:11:15.4695182Z libtorch.so => not found 2025-05-07T20:11:15.4695344Z 2025-05-07T20:11:15.4695461Z [CHECK] Displaying ELF information: 2025-05-07T20:11:15.4695964Z + readelf -d ./_skbuild/linux-x86_64-3.13/cmake-build/fbgemm_gpu_tbe_training_backward.so 2025-05-07T20:11:15.4696346Z 2025-05-07T20:11:15.4696350Z 2025-05-07T20:11:15.4696542Z Dynamic section at offset 0x38b44998 contains 43 entries: 2025-05-07T20:11:15.4696936Z Tag Type Name/Value 2025-05-07T20:11:15.4697429Z 0x0000000000000001 (NEEDED) Shared library: [libc10.so] 2025-05-07T20:11:15.4697943Z 0x0000000000000001 (NEEDED) Shared library: [libc10_cuda.so] 2025-05-07T20:11:15.4698512Z 0x0000000000000001 (NEEDED) Shared library: [fbgemm_gpu_tbe_common.so] 2025-05-07T20:11:15.4699120Z 0x0000000000000001 (NEEDED) Shared library: [fbgemm_gpu_sparse_async_cumsum.so] 2025-05-07T20:11:15.4699684Z 0x0000000000000001 (NEEDED) Shared library: [libtorch.so] 2025-05-07T20:11:15.4700222Z 0x0000000000000001 (NEEDED) Shared library: [fbgemm.so] 2025-05-07T20:11:15.4700758Z 0x0000000000000001 (NEEDED) Shared library: [fbgemm_gpu_tbe_utils.so] 2025-05-07T20:11:15.4701323Z 0x0000000000000001 (NEEDED) Shared library: [libtorch_cpu.so] 2025-05-07T20:11:15.4701861Z 0x0000000000000001 (NEEDED) Shared library: [libtorch_cuda.so] 2025-05-07T20:11:15.4702419Z 0x0000000000000001 (NEEDED) Shared library: [libcudart.so.12] 2025-05-07T20:11:15.4702966Z 0x0000000000000001 (NEEDED) Shared library: [libstdc++.so.6] 2025-05-07T20:11:15.4703593Z 0x0000000000000001 (NEEDED) Shared library: [libm.so.6] 2025-05-07T20:11:15.4704104Z 0x0000000000000001 (NEEDED) Shared library: [libgcc_s.so.1] 2025-05-07T20:11:15.4704595Z 0x0000000000000001 (NEEDED) Shared library: [libc.so.6] 2025-05-07T20:11:15.4705125Z 0x0000000000000001 (NEEDED) Shared library: [ld-linux-x86-64.so.2] 2025-05-07T20:11:15.4705710Z 0x000000000000000e (SONAME) Library soname: [fbgemm_gpu_tbe_training_backward.so] 2025-05-07T20:11:15.4706277Z 0x000000000000000f (RPATH) Library rpath: [$ORIGIN] 2025-05-07T20:11:15.4706703Z 0x000000000000000c (INIT) 0x611000 2025-05-07T20:11:15.4707046Z 0x000000000000000d (FINI) 0x32390cc 2025-05-07T20:11:15.4707415Z 0x0000000000000019 (INIT_ARRAY) 0x38b425f8 2025-05-07T20:11:15.4707809Z 0x000000000000001b (INIT_ARRAYSZ) 1824 (bytes) 2025-05-07T20:11:15.4708198Z 0x000000000000001a (FINI_ARRAY) 0x38b42d18 2025-05-07T20:11:15.4708746Z 0x000000000000001c (FINI_ARRAYSZ) 8 (bytes) 2025-05-07T20:11:15.4709122Z 0x0000000000000004 (HASH) 0x238 2025-05-07T20:11:15.4709474Z 0x000000006ffffef5 (GNU_HASH) 0x10330 2025-05-07T20:11:15.4709812Z 0x0000000000000005 (STRTAB) 0x69580 2025-05-07T20:11:15.4710162Z 0x0000000000000006 (SYMTAB) 0x20fb0 2025-05-07T20:11:15.4710630Z 0x000000000000000a (STRSZ) 4919620 (bytes) 2025-05-07T20:11:15.4711001Z 0x000000000000000b (SYMENT) 24 (bytes) 2025-05-07T20:11:15.4711337Z 0x0000000000000003 (PLTGOT) 0x38b44c88 2025-05-07T20:11:15.4711709Z 0x0000000000000002 (PLTRELSZ) 50064 (bytes) 2025-05-07T20:11:15.4712038Z 0x0000000000000014 (PLTREL) RELA 2025-05-07T20:11:15.4712370Z 0x0000000000000017 (JMPREL) 0x603da0 2025-05-07T20:11:15.4712709Z 0x0000000000000007 (RELA) 0x5208e0 2025-05-07T20:11:15.4713045Z 0x0000000000000008 (RELASZ) 931008 (bytes) 2025-05-07T20:11:15.4713407Z 0x0000000000000009 (RELAENT) 24 (bytes) 2025-05-07T20:11:15.4713716Z 0x0000000000000018 (BIND_NOW) 2025-05-07T20:11:15.4714049Z 0x000000006ffffffb (FLAGS_1) Flags: NOW 2025-05-07T20:11:15.4714384Z 0x000000006ffffffe (VERNEED) 0x520740 2025-05-07T20:11:15.4714719Z 0x000000006fffffff (VERNEEDNUM) 6 2025-05-07T20:11:15.4715029Z 0x000000006ffffff0 (VERSYM) 0x51a6c4 2025-05-07T20:11:15.4715384Z 0x000000006ffffff9 (RELACOUNT) 26208 2025-05-07T20:11:15.4715715Z 0x0000000000000000 (NULL) 0x0 2025-05-07T20:11:15.4715909Z 2025-05-07T20:11:15.4716025Z ################################################################################ 2025-05-07T20:11:15.4716269Z 2025-05-07T20:11:15.4716273Z 2025-05-07T20:11:15.4716417Z ################################################################################ 2025-05-07T20:11:15.4716939Z [CHECK] BUILT LIBRARY: ./_skbuild/linux-x86_64-3.13/cmake-build/fbgemm_gpu_tbe_training_backward_gwd.so 2025-05-07T20:11:15.4717462Z [CHECK] Listing out library size: 2025-05-07T20:11:15.4717965Z + du -h --block-size=1M ./_skbuild/linux-x86_64-3.13/cmake-build/fbgemm_gpu_tbe_training_backward_gwd.so 2025-05-07T20:11:15.4718361Z 2025-05-07T20:11:15.4718598Z 142 ./_skbuild/linux-x86_64-3.13/cmake-build/fbgemm_gpu_tbe_training_backward_gwd.so 2025-05-07T20:11:15.4718969Z 2025-05-07T20:11:15.4719384Z [CHECK] Listing out the GLIBC versions referenced by: ./_skbuild/linux-x86_64-3.13/cmake-build/fbgemm_gpu_tbe_training_backward_gwd.so 2025-05-07T20:11:15.4720386Z + objdump -TC ./_skbuild/linux-x86_64-3.13/cmake-build/fbgemm_gpu_tbe_training_backward_gwd.so | grep GLIBC_ | sed 's/.*GLIBC_\([.0-9]*\).*/GLIBC_\1/g' | sort -Vu | cat 2025-05-07T20:11:15.4721002Z 2025-05-07T20:11:15.4914503Z GLIBC_2.2.5 2025-05-07T20:11:15.4915178Z GLIBC_2.3 2025-05-07T20:11:15.4915714Z GLIBC_2.14 2025-05-07T20:11:15.4916092Z 2025-05-07T20:11:15.4916106Z 2025-05-07T20:11:15.4917193Z [CHECK] Listing out the GLIBCXX versions referenced by: ./_skbuild/linux-x86_64-3.13/cmake-build/fbgemm_gpu_tbe_training_backward_gwd.so 2025-05-07T20:11:15.4918362Z + objdump -TC ./_skbuild/linux-x86_64-3.13/cmake-build/fbgemm_gpu_tbe_training_backward_gwd.so | grep GLIBCXX_ | sed 's/.*GLIBCXX_\([.0-9]*\).*/GLIBCXX_\1/g' | sort -Vu | cat 2025-05-07T20:11:15.4919030Z 2025-05-07T20:11:15.5179208Z GLIBCXX_3.4 2025-05-07T20:11:15.5179655Z GLIBCXX_3.4.9 2025-05-07T20:11:15.5179904Z GLIBCXX_3.4.11 2025-05-07T20:11:15.5180171Z GLIBCXX_3.4.18 2025-05-07T20:11:15.5180518Z GLIBCXX_3.4.20 2025-05-07T20:11:15.5180771Z GLIBCXX_3.4.21 2025-05-07T20:11:15.5181044Z GLIBCXX_3.4.29 2025-05-07T20:11:15.5182257Z 2025-05-07T20:11:15.5182262Z 2025-05-07T20:11:15.5200577Z + nm -gDC ./_skbuild/linux-x86_64-3.13/cmake-build/fbgemm_gpu_tbe_training_backward_gwd.so > /tmp/tmp.EX7msajhFP.symbols.txt 2025-05-07T20:11:15.5201159Z 2025-05-07T20:11:15.5436596Z 2025-05-07T20:11:15.5461267Z [CHECK] Total Number of symbols: 1624 2025-05-07T20:11:15.5481588Z [CHECK] Number of fbgemm symbols: 228 2025-05-07T20:11:15.5497018Z + nm -gDCu ./_skbuild/linux-x86_64-3.13/cmake-build/fbgemm_gpu_tbe_training_backward_gwd.so > /tmp/tmp.Cavu28TjfW.usymbols.txt 2025-05-07T20:11:15.5497827Z 2025-05-07T20:11:15.5521433Z 2025-05-07T20:11:15.5548042Z [CHECK] Listing out undefined symbols (178 total): 2025-05-07T20:11:15.5564152Z U VTT for std::__cxx11::basic_ostringstream, std::allocator >@GLIBCXX_3.4.21 2025-05-07T20:11:15.5566670Z U VTT for std::__cxx11::basic_stringstream, std::allocator >@GLIBCXX_3.4.21 2025-05-07T20:11:15.5568280Z U _Unwind_Resume@GCC_3.0 2025-05-07T20:11:15.5569333Z U __cudaPopCallConfiguration@libcudart.so.12 2025-05-07T20:11:15.5570633Z U __cudaPushCallConfiguration@libcudart.so.12 2025-05-07T20:11:15.5571072Z U __cudaRegisterFatBinary@libcudart.so.12 2025-05-07T20:11:15.5571473Z U __cudaRegisterFatBinaryEnd@libcudart.so.12 2025-05-07T20:11:15.5571899Z U __cudaRegisterFunction@libcudart.so.12 2025-05-07T20:11:15.5572273Z U __cudaRegisterVar@libcudart.so.12 2025-05-07T20:11:15.5572678Z U __cudaUnregisterFatBinary@libcudart.so.12 2025-05-07T20:11:15.5573041Z U __cxa_atexit@GLIBC_2.2.5 2025-05-07T20:11:15.5573390Z U __cxa_begin_catch@CXXABI_1.3 2025-05-07T20:11:15.5573743Z U __cxa_end_catch@CXXABI_1.3 2025-05-07T20:11:15.5574062Z U __cxa_rethrow@CXXABI_1.3 2025-05-07T20:11:15.5574415Z U __cxa_thread_atexit@CXXABI_1.3.7 2025-05-07T20:11:15.5574758Z U __gxx_personality_v0@CXXABI_1.3 2025-05-07T20:11:15.5575119Z U __tls_get_addr@GLIBC_2.3 2025-05-07T20:11:15.5575657Z U at::CUDAGeneratorImpl::device_type() 2025-05-07T20:11:15.5576110Z U at::CUDAGeneratorImpl::philox_cuda_state(unsigned long) 2025-05-07T20:11:15.5576575Z U at::Context::deterministicAlgorithms() const 2025-05-07T20:11:15.5577032Z U at::TensorBase::__dispatch_contiguous(c10::MemoryFormat) const 2025-05-07T20:11:15.5577526Z U at::_ops::copy_::call(at::Tensor&, at::Tensor const&, bool) 2025-05-07T20:11:15.5578396Z U at::_ops::empty_like::call(at::Tensor const&, std::optional, std::optional, std::optional, std::optional, std::optional) 2025-05-07T20:11:15.5579754Z U at::_ops::empty_memory_format::call(c10::ArrayRef, std::optional, std::optional, std::optional, std::optional, std::optional) 2025-05-07T20:11:15.5580844Z U at::_ops::reshape::call(at::Tensor const&, c10::ArrayRef) 2025-05-07T20:11:15.5581505Z U at::_ops::to_dtype::call(at::Tensor const&, c10::ScalarType, bool, bool, std::optional) 2025-05-07T20:11:15.5582414Z U at::_ops::zeros::call(c10::ArrayRef, std::optional, std::optional, std::optional, std::optional) 2025-05-07T20:11:15.5583818Z U at::_ops::zeros_like::call(at::Tensor const&, std::optional, std::optional, std::optional, std::optional, std::optional) 2025-05-07T20:11:15.5584640Z U at::cuda::detail::getDefaultCUDAGenerator(signed char) 2025-05-07T20:11:15.5585043Z U at::cuda::getCurrentDeviceProperties() 2025-05-07T20:11:15.5585397Z U at::globalContext() 2025-05-07T20:11:15.5585785Z U c10::BFloat16* at::TensorBase::mutable_data_ptr() const 2025-05-07T20:11:15.5586279Z U c10::BoolType::get() 2025-05-07T20:11:15.5586654Z U c10::DeviceTypeName[abi:cxx11](c10::DeviceType, bool) 2025-05-07T20:11:15.5587066Z U c10::FloatType::get() 2025-05-07T20:11:15.5587463Z U c10::GeneratorImpl::device() const 2025-05-07T20:11:15.5587855Z U c10::Half* at::TensorBase::mutable_data_ptr() const 2025-05-07T20:11:15.5588297Z U c10::IValue::reportToTensorTypeError() const 2025-05-07T20:11:15.5588644Z U c10::IntType::get() 2025-05-07T20:11:15.5589017Z U c10::MessageLogger::MessageLogger(char const*, int, int) 2025-05-07T20:11:15.5589428Z U c10::MessageLogger::~MessageLogger() 2025-05-07T20:11:15.5589814Z U c10::StorageImpl::throw_data_ptr_access_error() const 2025-05-07T20:11:15.5590236Z U c10::SymBool::guard_bool(char const*, long) const 2025-05-07T20:11:15.5590626Z U c10::SymFloat::guard_float(char const*, long) const 2025-05-07T20:11:15.5591271Z U c10::SymInt::SymInt(c10::intrusive_ptr >) 2025-05-07T20:11:15.5591918Z U c10::SymInt::guard_int(char const*, long) const 2025-05-07T20:11:15.5592270Z U c10::SymInt::promote_to_negative() 2025-05-07T20:11:15.5592610Z U c10::SymIntType::get() 2025-05-07T20:11:15.5592955Z U c10::SymbolicShapeMeta::init_is_contiguous() const 2025-05-07T20:11:15.5593374Z U c10::TensorImpl::throw_data_ptr_access_error() const 2025-05-07T20:11:15.5593749Z U c10::TensorType::get() 2025-05-07T20:11:15.5594060Z U c10::UndefinedTensorImpl::_singleton 2025-05-07T20:11:15.5594970Z U c10::Warning::Warning(std::variant, c10::SourceLocation const&, std::__cxx11::basic_string, std::allocator >, bool) 2025-05-07T20:11:15.5595905Z U c10::cuda::CUDACachingAllocator::allocator 2025-05-07T20:11:15.5596277Z U c10::cuda::CUDAStream::stream() const 2025-05-07T20:11:15.5596636Z U c10::cuda::ExchangeDevice(signed char) 2025-05-07T20:11:15.5596984Z U c10::cuda::GetDevice(signed char*) 2025-05-07T20:11:15.5597311Z U c10::cuda::MaybeSetDevice(signed char) 2025-05-07T20:11:15.5597662Z U c10::cuda::SetDevice(signed char) 2025-05-07T20:11:15.5598136Z U c10::cuda::c10_cuda_check_implementation(int, char const*, char const*, int, bool) 2025-05-07T20:11:15.5598585Z U c10::cuda::device_count() 2025-05-07T20:11:15.5598943Z U c10::cuda::getCurrentCUDAStream(signed char) 2025-05-07T20:11:15.5599315Z U c10::cuda::getDefaultCUDAStream(signed char) 2025-05-07T20:11:15.5599718Z U c10::cuda::getStreamFromPool(bool, signed char) 2025-05-07T20:11:15.5600091Z U c10::cuda::getStreamFromPool(int, signed char) 2025-05-07T20:11:15.5600520Z U c10::cuda::setCurrentCUDAStream(c10::cuda::CUDAStream) 2025-05-07T20:11:15.5600915Z U c10::cuda::warn_or_error_on_sync() 2025-05-07T20:11:15.5601612Z U c10::detail::infer_schema::make_function_schema(c10::ArrayRef, c10::ArrayRef) 2025-05-07T20:11:15.5602672Z U c10::detail::torchCheckFail(char const*, char const*, unsigned int, char const*) 2025-05-07T20:11:15.5603556Z U c10::detail::torchCheckFail(char const*, char const*, unsigned int, std::__cxx11::basic_string, std::allocator > const&) 2025-05-07T20:11:15.5604496Z U c10::detail::torchInternalAssertFail(char const*, char const*, unsigned int, char const*, char const*) 2025-05-07T20:11:15.5605583Z U c10::detail::torchInternalAssertFail(char const*, char const*, unsigned int, char const*, std::__cxx11::basic_string, std::allocator > const&) 2025-05-07T20:11:15.5606490Z U c10::impl::GPUTrace::gpuTraceState 2025-05-07T20:11:15.5606838Z U c10::impl::GPUTrace::haveState 2025-05-07T20:11:15.5607236Z U c10::impl::cow::is_cow_data_ptr(c10::DataPtr const&) 2025-05-07T20:11:15.5607667Z U c10::impl::cow::materialize_cow_storage(c10::StorageImpl&) 2025-05-07T20:11:15.5608104Z U c10::impl::device_guard_impl_registry 2025-05-07T20:11:15.5608485Z U c10::operator<<(std::ostream&, c10::Device const&) 2025-05-07T20:11:15.5608917Z U c10::operator<<(std::ostream&, c10::DeviceType) 2025-05-07T20:11:15.5609317Z U c10::throwNullDataPtrError() 2025-05-07T20:11:15.5609665Z U c10::warn(c10::Warning const&) 2025-05-07T20:11:15.5610034Z U c10::warnDeprecatedDataPtr() 2025-05-07T20:11:15.5610746Z U caffe2::TypeMeta::error_unsupported_typemeta(caffe2::TypeMeta) 2025-05-07T20:11:15.5611236Z U caffe2::TypeMeta::typeMetaDatas() 2025-05-07T20:11:15.5611617Z U cudaDeviceGetAttribute@libcudart.so.12 2025-05-07T20:11:15.5612047Z U cudaDeviceSynchronize@libcudart.so.12 2025-05-07T20:11:15.5612472Z U cudaEventCreateWithFlags@libcudart.so.12 2025-05-07T20:11:15.5612868Z U cudaEventDestroy@libcudart.so.12 2025-05-07T20:11:15.5613275Z U cudaEventElapsedTime@libcudart.so.12 2025-05-07T20:11:15.5613640Z U cudaEventQuery@libcudart.so.12 2025-05-07T20:11:15.5614021Z U cudaEventRecord@libcudart.so.12 2025-05-07T20:11:15.5614389Z U cudaEventSynchronize@libcudart.so.12 2025-05-07T20:11:15.5614822Z U cudaFuncSetAttribute@libcudart.so.12 2025-05-07T20:11:15.5615237Z U cudaGetDeviceProperties_v2@libcudart.so.12 2025-05-07T20:11:15.5615632Z U cudaGetErrorString@libcudart.so.12 2025-05-07T20:11:15.5616215Z U cudaGetLastError@libcudart.so.12 2025-05-07T20:11:15.5616575Z U cudaLaunchKernel@libcudart.so.12 2025-05-07T20:11:15.5616959Z U cudaStreamQuery@libcudart.so.12 2025-05-07T20:11:15.5617330Z U cudaStreamSynchronize@libcudart.so.12 2025-05-07T20:11:15.5617728Z U cudaStreamWaitEvent@libcudart.so.12 2025-05-07T20:11:15.5620206Z U embedding_ops::split_embedding_backward_codegen_find_long_segments(at::GenericPackedTensorAccessor, at::GenericPackedTensorAccessor, at::GenericPackedTensorAccessor, at::GenericPackedTensorAccessor, at::GenericPackedTensorAccessor, at::GenericPackedTensorAccessor, at::GenericPackedTensorAccessor, int, int, bool) 2025-05-07T20:11:15.5622735Z U fbgemm_gpu::asynchronous_complete_cumsum_gpu(at::Tensor const&) 2025-05-07T20:11:15.5623182Z U float at::Tensor::item() const 2025-05-07T20:11:15.5623583Z U float* at::TensorBase::data_ptr() const 2025-05-07T20:11:15.5624025Z U float* at::TensorBase::mutable_data_ptr() const 2025-05-07T20:11:15.5624437Z U int* at::TensorBase::data_ptr() const 2025-05-07T20:11:15.5624857Z U int* at::TensorBase::mutable_data_ptr() const 2025-05-07T20:11:15.5625296Z U long c10::detail::maybe_wrap_dim_slow(long, long, bool) 2025-05-07T20:11:15.5625753Z U long* at::TensorBase::data_ptr() const 2025-05-07T20:11:15.5626217Z U long* at::TensorBase::mutable_data_ptr() const 2025-05-07T20:11:15.5626641Z U memcpy@GLIBC_2.14 2025-05-07T20:11:15.5627080Z U memmove@GLIBC_2.2.5 2025-05-07T20:11:15.5627427Z U memset@GLIBC_2.2.5 2025-05-07T20:11:15.5627795Z U operator delete(void*, unsigned long)@CXXABI_1.3.9 2025-05-07T20:11:15.5628185Z U operator new(unsigned long)@GLIBCXX_3.4 2025-05-07T20:11:15.5629252Z U radix_sort_pairs(void*, unsigned long&, int const*, int*, float const*, float*, int, int, int, CUstream_st*) 2025-05-07T20:11:15.5630035Z U radix_sort_pairs(void*, unsigned long&, int const*, int*, int const*, int*, int, int, int, CUstream_st*) 2025-05-07T20:11:15.5630796Z U radix_sort_pairs(void*, unsigned long&, long const*, long*, float const*, float*, int, int, int, CUstream_st*) 2025-05-07T20:11:15.5631588Z U radix_sort_pairs(void*, unsigned long&, long const*, long*, int const*, int*, int, int, int, CUstream_st*) 2025-05-07T20:11:15.5632377Z U std::__cxx11::basic_ostringstream, std::allocator >::basic_ostringstream() 2025-05-07T20:11:15.5633274Z U std::__cxx11::basic_ostringstream, std::allocator >::~basic_ostringstream()@GLIBCXX_3.4.21 2025-05-07T20:11:15.5634143Z U std::__cxx11::basic_stringbuf, std::allocator >::str() const &@GLIBCXX_3.4.29 2025-05-07T20:11:15.5634976Z U std::__detail::_Prime_rehash_policy::_M_need_rehash(unsigned long, unsigned long, unsigned long) const@GLIBCXX_3.4.18 2025-05-07T20:11:15.5635609Z U std::__throw_bad_alloc()@GLIBCXX_3.4 2025-05-07T20:11:15.5635988Z U std::__throw_bad_array_new_length() 2025-05-07T20:11:15.5636367Z U std::__throw_length_error(char const*)@GLIBCXX_3.4 2025-05-07T20:11:15.5636874Z U std::__throw_logic_error(char const*)@GLIBCXX_3.4 2025-05-07T20:11:15.5637311Z U std::__throw_out_of_range_fmt(char const*, ...)@GLIBCXX_3.4.20 2025-05-07T20:11:15.5637787Z U std::__throw_system_error(int)@GLIBCXX_3.4.11 2025-05-07T20:11:15.5638322Z U std::basic_ios >::clear(std::_Ios_Iostate)@GLIBCXX_3.4 2025-05-07T20:11:15.5639036Z U std::basic_ios >::init(std::basic_streambuf >*)@GLIBCXX_3.4 2025-05-07T20:11:15.5640109Z U std::basic_ostream >& std::__ostream_insert >(std::basic_ostream >&, char const*, long)@GLIBCXX_3.4.9 2025-05-07T20:11:15.5641322Z U std::basic_ostream >& std::operator<< >(std::basic_ostream >&, char const*)@GLIBCXX_3.4 2025-05-07T20:11:15.5642073Z U std::ios_base::Init::Init()@GLIBCXX_3.4 2025-05-07T20:11:15.5642457Z U std::ios_base::Init::~Init()@GLIBCXX_3.4 2025-05-07T20:11:15.5642814Z U std::ios_base::ios_base()@GLIBCXX_3.4 2025-05-07T20:11:15.5643183Z U std::ios_base::~ios_base()@GLIBCXX_3.4 2025-05-07T20:11:15.5643559Z U std::locale::locale()@GLIBCXX_3.4 2025-05-07T20:11:15.5643898Z U std::locale::~locale()@GLIBCXX_3.4 2025-05-07T20:11:15.5644334Z U std::ostream& std::ostream::_M_insert(long)@GLIBCXX_3.4.9 2025-05-07T20:11:15.5644880Z U std::ostream& std::ostream::_M_insert(unsigned long)@GLIBCXX_3.4.9 2025-05-07T20:11:15.5645391Z U std::ostream::operator<<(int)@GLIBCXX_3.4 2025-05-07T20:11:15.5645743Z U std::terminate()@GLIBCXX_3.4 2025-05-07T20:11:15.5646090Z U strlen@GLIBC_2.2.5 2025-05-07T20:11:15.5646612Z U torch::CppFunction::~CppFunction() 2025-05-07T20:11:15.5647741Z U torch::Library::Library(torch::Library::Kind, std::__cxx11::basic_string, std::allocator >, std::optional, char const*, unsigned int) 2025-05-07T20:11:15.5649137Z U torch::Library::_def(c10::FunctionSchema&&, c10::OperatorName*, std::vector > const&, torch::_RegisterOrVerify) & 2025-05-07T20:11:15.5650005Z U torch::Library::_impl(char const*, torch::CppFunction&&, torch::_RegisterOrVerify) & 2025-05-07T20:11:15.5650832Z U torch::jit::parseSchema(std::__cxx11::basic_string, std::allocator > const&, bool) 2025-05-07T20:11:15.5651894Z U transpose_embedding_input(at::Tensor, long, at::Tensor, at::Tensor, bool, std::optional const&, long, long, long, bool, std::optional const&, long, long) 2025-05-07T20:11:15.5654038Z U void embedding_ops::grad_mean_kernel(at::GenericPackedTensorAccessor, at::GenericPackedTensorAccessor, at::GenericPackedTensorAccessor, at::GenericPackedTensorAccessor, fbgemm_gpu::FixedDivisor) 2025-05-07T20:11:15.5657032Z U void embedding_ops::grad_mean_kernel(at::GenericPackedTensorAccessor, at::GenericPackedTensorAccessor, at::GenericPackedTensorAccessor, at::GenericPackedTensorAccessor, fbgemm_gpu::FixedDivisor) 2025-05-07T20:11:15.5659951Z U void embedding_ops::grad_mean_kernel(at::GenericPackedTensorAccessor, at::GenericPackedTensorAccessor, at::GenericPackedTensorAccessor, at::GenericPackedTensorAccessor, fbgemm_gpu::FixedDivisor) 2025-05-07T20:11:15.5662848Z U void embedding_ops::grad_mean_kernel(at::GenericPackedTensorAccessor, at::GenericPackedTensorAccessor, at::GenericPackedTensorAccessor, at::GenericPackedTensorAccessor, fbgemm_gpu::FixedDivisor) 2025-05-07T20:11:15.5665727Z U void embedding_ops::grad_mean_kernel(at::GenericPackedTensorAccessor, at::GenericPackedTensorAccessor, at::GenericPackedTensorAccessor, at::GenericPackedTensorAccessor, fbgemm_gpu::FixedDivisor) 2025-05-07T20:11:15.5668576Z U void embedding_ops::grad_mean_kernel(at::GenericPackedTensorAccessor, at::GenericPackedTensorAccessor, at::GenericPackedTensorAccessor, at::GenericPackedTensorAccessor, fbgemm_gpu::FixedDivisor) 2025-05-07T20:11:15.5672059Z U void embedding_ops::grad_mean_vbe_kernel(at::GenericPackedTensorAccessor, at::GenericPackedTensorAccessor, at::GenericPackedTensorAccessor, at::GenericPackedTensorAccessor, at::GenericPackedTensorAccessor, at::GenericPackedTensorAccessor, int, unsigned int) 2025-05-07T20:11:15.5676161Z U void embedding_ops::grad_mean_vbe_kernel(at::GenericPackedTensorAccessor, at::GenericPackedTensorAccessor, at::GenericPackedTensorAccessor, at::GenericPackedTensorAccessor, at::GenericPackedTensorAccessor, at::GenericPackedTensorAccessor, int, unsigned int) 2025-05-07T20:11:15.5680381Z U void embedding_ops::grad_mean_vbe_kernel(at::GenericPackedTensorAccessor, at::GenericPackedTensorAccessor, at::GenericPackedTensorAccessor, at::GenericPackedTensorAccessor, at::GenericPackedTensorAccessor, at::GenericPackedTensorAccessor, int, unsigned int) 2025-05-07T20:11:15.5684006Z U void embedding_ops::grad_mean_vbe_kernel(at::GenericPackedTensorAccessor, at::GenericPackedTensorAccessor, at::GenericPackedTensorAccessor, at::GenericPackedTensorAccessor, at::GenericPackedTensorAccessor, at::GenericPackedTensorAccessor, int, unsigned int) 2025-05-07T20:11:15.5687837Z U void embedding_ops::grad_mean_vbe_kernel(at::GenericPackedTensorAccessor, at::GenericPackedTensorAccessor, at::GenericPackedTensorAccessor, at::GenericPackedTensorAccessor, at::GenericPackedTensorAccessor, at::GenericPackedTensorAccessor, int, unsigned int) 2025-05-07T20:11:15.5691950Z U void embedding_ops::grad_mean_vbe_kernel(at::GenericPackedTensorAccessor, at::GenericPackedTensorAccessor, at::GenericPackedTensorAccessor, at::GenericPackedTensorAccessor, at::GenericPackedTensorAccessor, at::GenericPackedTensorAccessor, int, unsigned int) 2025-05-07T20:11:15.5695719Z U void embedding_ops::split_embedding_backward_count_unique_indices_kernel(at::GenericPackedTensorAccessor, at::GenericPackedTensorAccessor, at::GenericPackedTensorAccessor, at::GenericPackedTensorAccessor, at::GenericPackedTensorAccessor, int) 2025-05-07T20:11:15.5697719Z U vtable for __cxxabiv1::__class_type_info@CXXABI_1.3 2025-05-07T20:11:15.5698178Z U vtable for __cxxabiv1::__function_type_info@CXXABI_1.3 2025-05-07T20:11:15.5698628Z U vtable for __cxxabiv1::__si_class_type_info@CXXABI_1.3 2025-05-07T20:11:15.5699240Z U vtable for std::__cxx11::basic_ostringstream, std::allocator >@GLIBCXX_3.4.21 2025-05-07T20:11:15.5700040Z U vtable for std::__cxx11::basic_stringbuf, std::allocator >@GLIBCXX_3.4.21 2025-05-07T20:11:15.5700883Z U vtable for std::__cxx11::basic_stringstream, std::allocator >@GLIBCXX_3.4.21 2025-05-07T20:11:15.5701527Z U vtable for std::basic_ios >@GLIBCXX_3.4 2025-05-07T20:11:15.5703556Z U vtable for std::basic_streambuf >@GLIBCXX_3.4 2025-05-07T20:11:15.5704025Z w _ITM_deregisterTMCloneTable 2025-05-07T20:11:15.5704351Z w _ITM_registerTMCloneTable 2025-05-07T20:11:15.5704685Z w __cxa_finalize@GLIBC_2.2.5 2025-05-07T20:11:15.5704981Z w __gmon_start__ 2025-05-07T20:11:15.5705277Z w __pthread_key_create 2025-05-07T20:11:15.5705700Z w pthread_mutex_lock@GLIBC_2.2.5 2025-05-07T20:11:15.5706036Z w pthread_mutex_unlock@GLIBC_2.2.5 2025-05-07T20:11:15.5706392Z [CHECK] Listing out external shared libraries linked: 2025-05-07T20:11:15.5706921Z + ldd ./_skbuild/linux-x86_64-3.13/cmake-build/fbgemm_gpu_tbe_training_backward_gwd.so 2025-05-07T20:11:15.5707277Z 2025-05-07T20:11:15.5707436Z linux-vdso.so.1 (0x00007ffdf3fbb000) 2025-05-07T20:11:15.5707743Z libc10.so => not found 2025-05-07T20:11:15.5707987Z libc10_cuda.so => not found 2025-05-07T20:11:15.5708721Z fbgemm_gpu_tbe_training_backward.so => /__w/FBGEMM/FBGEMM/fbgemm_gpu/./_skbuild/linux-x86_64-3.13/cmake-build/fbgemm_gpu_tbe_training_backward.so (0x00007f9b14800000) 2025-05-07T20:11:15.5709478Z libtorch.so => not found 2025-05-07T20:11:15.5709748Z libtorch_cpu.so => not found 2025-05-07T20:11:15.5710015Z libtorch_cuda.so => not found 2025-05-07T20:11:15.5710407Z libcudart.so.12 => not found 2025-05-07T20:11:15.5710706Z libstdc++.so.6 => /lib64/libstdc++.so.6 (0x00007f9b1459c000) 2025-05-07T20:11:15.5711108Z libgcc_s.so.1 => /lib64/libgcc_s.so.1 (0x00007f9b4e9d2000) 2025-05-07T20:11:15.5711473Z libc.so.6 => /lib64/libc.so.6 (0x00007f9b14394000) 2025-05-07T20:11:15.5711810Z /lib64/ld-linux-x86-64.so.2 (0x00007f9b57a7d000) 2025-05-07T20:11:15.5712158Z libc10.so => not found 2025-05-07T20:11:15.5712384Z libc10_cuda.so => not found 2025-05-07T20:11:15.5712990Z fbgemm_gpu_tbe_common.so => /__w/FBGEMM/FBGEMM/fbgemm_gpu/./_skbuild/linux-x86_64-3.13/cmake-build/fbgemm_gpu_tbe_common.so (0x00007f9b14000000) 2025-05-07T20:11:15.5714020Z fbgemm_gpu_sparse_async_cumsum.so => /__w/FBGEMM/FBGEMM/fbgemm_gpu/./_skbuild/linux-x86_64-3.13/cmake-build/fbgemm_gpu_sparse_async_cumsum.so (0x00007f9b13e50000) 2025-05-07T20:11:15.5714730Z libtorch.so => not found 2025-05-07T20:11:15.5715228Z fbgemm.so => /__w/FBGEMM/FBGEMM/fbgemm_gpu/./_skbuild/linux-x86_64-3.13/cmake-build/fbgemm.so (0x00007f9b13800000) 2025-05-07T20:11:15.5716072Z fbgemm_gpu_tbe_utils.so => /__w/FBGEMM/FBGEMM/fbgemm_gpu/./_skbuild/linux-x86_64-3.13/cmake-build/fbgemm_gpu_tbe_utils.so (0x00007f9b12600000) 2025-05-07T20:11:15.5716701Z libtorch_cpu.so => not found 2025-05-07T20:11:15.5716948Z libtorch_cuda.so => not found 2025-05-07T20:11:15.5717220Z libcudart.so.12 => not found 2025-05-07T20:11:15.5717499Z libm.so.6 => /lib64/libm.so.6 (0x00007f9b13d75000) 2025-05-07T20:11:15.5717809Z libc10.so => not found 2025-05-07T20:11:15.5718055Z libc10_cuda.so => not found 2025-05-07T20:11:15.5718617Z fbgemm_gpu_config.so => /__w/FBGEMM/FBGEMM/fbgemm_gpu/./_skbuild/linux-x86_64-3.13/cmake-build/fbgemm_gpu_config.so (0x00007f9b57a65000) 2025-05-07T20:11:15.5719230Z libtorch.so => not found 2025-05-07T20:11:15.5719463Z libtorch_cpu.so => not found 2025-05-07T20:11:15.5719718Z libtorch_cuda.so => not found 2025-05-07T20:11:15.5719966Z libcudart.so.12 => not found 2025-05-07T20:11:15.5720264Z libc10.so => not found 2025-05-07T20:11:15.5720492Z libc10_cuda.so => not found 2025-05-07T20:11:15.5720743Z libtorch.so => not found 2025-05-07T20:11:15.5720985Z libtorch_cpu.so => not found 2025-05-07T20:11:15.5721404Z libtorch_cuda.so => not found 2025-05-07T20:11:15.5721660Z libcudart.so.12 => not found 2025-05-07T20:11:15.5721902Z libc10.so => not found 2025-05-07T20:11:15.5722410Z asmjit.so => /__w/FBGEMM/FBGEMM/fbgemm_gpu/./_skbuild/linux-x86_64-3.13/cmake-build/asmjit.so (0x00007f9b4e957000) 2025-05-07T20:11:15.5722927Z libtorch.so => not found 2025-05-07T20:11:15.5723415Z libtorch_cpu.so => not found 2025-05-07T20:11:15.5723668Z libtorch_cuda.so => not found 2025-05-07T20:11:15.5723973Z libtorch.so => not found 2025-05-07T20:11:15.5724223Z libc10.so => not found 2025-05-07T20:11:15.5724652Z libc10_cuda.so => not found 2025-05-07T20:11:15.5724987Z libtorch_cpu.so => not found 2025-05-07T20:11:15.5725252Z libtorch_cuda.so => not found 2025-05-07T20:11:15.5725535Z libcudart.so.12 => not found 2025-05-07T20:11:15.5725786Z libc10.so => not found 2025-05-07T20:11:15.5726041Z libtorch_cpu.so => not found 2025-05-07T20:11:15.5726293Z libtorch_cuda.so => not found 2025-05-07T20:11:15.5726567Z libtorch.so => not found 2025-05-07T20:11:15.5726811Z libtorch_cpu.so => not found 2025-05-07T20:11:15.5727083Z libtorch_cuda.so => not found 2025-05-07T20:11:15.5727335Z libtorch.so => not found 2025-05-07T20:11:15.5727514Z 2025-05-07T20:11:15.5727620Z [CHECK] Displaying ELF information: 2025-05-07T20:11:15.5728119Z + readelf -d ./_skbuild/linux-x86_64-3.13/cmake-build/fbgemm_gpu_tbe_training_backward_gwd.so 2025-05-07T20:11:15.5728689Z 2025-05-07T20:11:15.5728694Z 2025-05-07T20:11:15.5728858Z Dynamic section at offset 0x8dbfdd8 contains 39 entries: 2025-05-07T20:11:15.5729338Z Tag Type Name/Value 2025-05-07T20:11:15.5729813Z 0x0000000000000001 (NEEDED) Shared library: [libc10.so] 2025-05-07T20:11:15.5730416Z 0x0000000000000001 (NEEDED) Shared library: [libc10_cuda.so] 2025-05-07T20:11:15.5730995Z 0x0000000000000001 (NEEDED) Shared library: [fbgemm_gpu_tbe_training_backward.so] 2025-05-07T20:11:15.5731551Z 0x0000000000000001 (NEEDED) Shared library: [libtorch.so] 2025-05-07T20:11:15.5732076Z 0x0000000000000001 (NEEDED) Shared library: [libtorch_cpu.so] 2025-05-07T20:11:15.5732586Z 0x0000000000000001 (NEEDED) Shared library: [libtorch_cuda.so] 2025-05-07T20:11:15.5733198Z 0x0000000000000001 (NEEDED) Shared library: [libcudart.so.12] 2025-05-07T20:11:15.5733711Z 0x0000000000000001 (NEEDED) Shared library: [libstdc++.so.6] 2025-05-07T20:11:15.5734232Z 0x0000000000000001 (NEEDED) Shared library: [libgcc_s.so.1] 2025-05-07T20:11:15.5734755Z 0x0000000000000001 (NEEDED) Shared library: [libc.so.6] 2025-05-07T20:11:15.5735268Z 0x0000000000000001 (NEEDED) Shared library: [ld-linux-x86-64.so.2] 2025-05-07T20:11:15.5735880Z 0x000000000000000e (SONAME) Library soname: [fbgemm_gpu_tbe_training_backward_gwd.so] 2025-05-07T20:11:15.5736438Z 0x000000000000000f (RPATH) Library rpath: [$ORIGIN] 2025-05-07T20:11:15.5736857Z 0x000000000000000c (INIT) 0xbf000 2025-05-07T20:11:15.5737180Z 0x000000000000000d (FINI) 0x62dd0c 2025-05-07T20:11:15.5737531Z 0x0000000000000019 (INIT_ARRAY) 0x8dbf998 2025-05-07T20:11:15.5737895Z 0x000000000000001b (INIT_ARRAYSZ) 200 (bytes) 2025-05-07T20:11:15.5738244Z 0x000000000000001a (FINI_ARRAY) 0x8dbfa60 2025-05-07T20:11:15.5738608Z 0x000000000000001c (FINI_ARRAYSZ) 8 (bytes) 2025-05-07T20:11:15.5738929Z 0x0000000000000004 (HASH) 0x238 2025-05-07T20:11:15.5739263Z 0x000000006ffffef5 (GNU_HASH) 0x2b38 2025-05-07T20:11:15.5739581Z 0x0000000000000005 (STRTAB) 0xee00 2025-05-07T20:11:15.5739906Z 0x0000000000000006 (SYMTAB) 0x55a8 2025-05-07T20:11:15.5740247Z 0x000000000000000a (STRSZ) 594745 (bytes) 2025-05-07T20:11:15.5740619Z 0x000000000000000b (SYMENT) 24 (bytes) 2025-05-07T20:11:15.5740989Z 0x0000000000000003 (PLTGOT) 0x8dc0088 2025-05-07T20:11:15.5741340Z 0x0000000000000002 (PLTRELSZ) 11400 (bytes) 2025-05-07T20:11:15.5741707Z 0x0000000000000014 (PLTREL) RELA 2025-05-07T20:11:15.5742024Z 0x0000000000000017 (JMPREL) 0xbba08 2025-05-07T20:11:15.5742408Z 0x0000000000000007 (RELA) 0xa0f30 2025-05-07T20:11:15.5742879Z 0x0000000000000008 (RELASZ) 109272 (bytes) 2025-05-07T20:11:15.5743284Z 0x0000000000000009 (RELAENT) 24 (bytes) 2025-05-07T20:11:15.5743646Z 0x0000000000000018 (BIND_NOW) 2025-05-07T20:11:15.5743983Z 0x000000006ffffffb (FLAGS_1) Flags: NOW 2025-05-07T20:11:15.5744337Z 0x000000006ffffffe (VERNEED) 0xa0df0 2025-05-07T20:11:15.5744660Z 0x000000006fffffff (VERNEEDNUM) 5 2025-05-07T20:11:15.5744982Z 0x000000006ffffff0 (VERSYM) 0xa013a 2025-05-07T20:11:15.5745304Z 0x000000006ffffff9 (RELACOUNT) 3126 2025-05-07T20:11:15.5745631Z 0x0000000000000000 (NULL) 0x0 2025-05-07T20:11:15.5745827Z 2025-05-07T20:11:15.5745936Z ################################################################################ 2025-05-07T20:11:15.5746173Z 2025-05-07T20:11:15.5746177Z 2025-05-07T20:11:15.5746293Z ################################################################################ 2025-05-07T20:11:15.5746847Z [CHECK] BUILT LIBRARY: ./_skbuild/linux-x86_64-3.13/cmake-build/fbgemm_gpu_tbe_training_backward_dense.so 2025-05-07T20:11:15.5747373Z [CHECK] Listing out library size: 2025-05-07T20:11:15.5747882Z + du -h --block-size=1M ./_skbuild/linux-x86_64-3.13/cmake-build/fbgemm_gpu_tbe_training_backward_dense.so 2025-05-07T20:11:15.5748298Z 2025-05-07T20:11:15.5748538Z 59 ./_skbuild/linux-x86_64-3.13/cmake-build/fbgemm_gpu_tbe_training_backward_dense.so 2025-05-07T20:11:15.5748906Z 2025-05-07T20:11:15.5749349Z [CHECK] Listing out the GLIBC versions referenced by: ./_skbuild/linux-x86_64-3.13/cmake-build/fbgemm_gpu_tbe_training_backward_dense.so 2025-05-07T20:11:15.5750448Z + objdump -TC ./_skbuild/linux-x86_64-3.13/cmake-build/fbgemm_gpu_tbe_training_backward_dense.so | grep GLIBC_ | sed 's/.*GLIBC_\([.0-9]*\).*/GLIBC_\1/g' | sort -Vu | cat 2025-05-07T20:11:15.5751083Z 2025-05-07T20:11:15.5813123Z GLIBC_2.2.5 2025-05-07T20:11:15.5813823Z GLIBC_2.3 2025-05-07T20:11:15.5814390Z GLIBC_2.14 2025-05-07T20:11:15.5814732Z 2025-05-07T20:11:15.5814756Z 2025-05-07T20:11:15.5816164Z [CHECK] Listing out the GLIBCXX versions referenced by: ./_skbuild/linux-x86_64-3.13/cmake-build/fbgemm_gpu_tbe_training_backward_dense.so 2025-05-07T20:11:15.5818997Z + objdump -TC ./_skbuild/linux-x86_64-3.13/cmake-build/fbgemm_gpu_tbe_training_backward_dense.so | grep GLIBCXX_ | sed 's/.*GLIBCXX_\([.0-9]*\).*/GLIBCXX_\1/g' | sort -Vu | cat 2025-05-07T20:11:15.5819626Z 2025-05-07T20:11:15.5965598Z GLIBCXX_3.4 2025-05-07T20:11:15.5966257Z GLIBCXX_3.4.9 2025-05-07T20:11:15.5966971Z GLIBCXX_3.4.11 2025-05-07T20:11:15.5967189Z GLIBCXX_3.4.15 2025-05-07T20:11:15.5967430Z GLIBCXX_3.4.18 2025-05-07T20:11:15.5967648Z GLIBCXX_3.4.20 2025-05-07T20:11:15.5967887Z GLIBCXX_3.4.21 2025-05-07T20:11:15.5968221Z GLIBCXX_3.4.29 2025-05-07T20:11:15.5968371Z 2025-05-07T20:11:15.5968376Z 2025-05-07T20:11:15.5988291Z + nm -gDC ./_skbuild/linux-x86_64-3.13/cmake-build/fbgemm_gpu_tbe_training_backward_dense.so > /tmp/tmp.4GoopQZBCK.symbols.txt 2025-05-07T20:11:15.5989950Z 2025-05-07T20:11:15.6099459Z 2025-05-07T20:11:15.6127157Z [CHECK] Total Number of symbols: 1791 2025-05-07T20:11:15.6148967Z [CHECK] Number of fbgemm symbols: 94 2025-05-07T20:11:15.6166414Z + nm -gDCu ./_skbuild/linux-x86_64-3.13/cmake-build/fbgemm_gpu_tbe_training_backward_dense.so > /tmp/tmp.x33QpNrVO7.usymbols.txt 2025-05-07T20:11:15.6167026Z 2025-05-07T20:11:15.6189328Z 2025-05-07T20:11:15.6213980Z [CHECK] Listing out undefined symbols (266 total): 2025-05-07T20:11:15.6231784Z U VTT for std::__cxx11::basic_ostringstream, std::allocator >@GLIBCXX_3.4.21 2025-05-07T20:11:15.6234360Z U VTT for std::__cxx11::basic_stringstream, std::allocator >@GLIBCXX_3.4.21 2025-05-07T20:11:15.6235965Z U _Unwind_Resume@GCC_3.0 2025-05-07T20:11:15.6236989Z U __cudaPopCallConfiguration@libcudart.so.12 2025-05-07T20:11:15.6238507Z U __cudaPushCallConfiguration@libcudart.so.12 2025-05-07T20:11:15.6239177Z U __cudaRegisterFatBinary@libcudart.so.12 2025-05-07T20:11:15.6239730Z U __cudaRegisterFatBinaryEnd@libcudart.so.12 2025-05-07T20:11:15.6240150Z U __cudaRegisterFunction@libcudart.so.12 2025-05-07T20:11:15.6240525Z U __cudaRegisterVar@libcudart.so.12 2025-05-07T20:11:15.6240938Z U __cudaUnregisterFatBinary@libcudart.so.12 2025-05-07T20:11:15.6241327Z U __cxa_allocate_exception@CXXABI_1.3 2025-05-07T20:11:15.6241705Z U __cxa_atexit@GLIBC_2.2.5 2025-05-07T20:11:15.6242028Z U __cxa_begin_catch@CXXABI_1.3 2025-05-07T20:11:15.6242382Z U __cxa_end_catch@CXXABI_1.3 2025-05-07T20:11:15.6242717Z U __cxa_free_exception@CXXABI_1.3 2025-05-07T20:11:15.6243081Z U __cxa_guard_abort@CXXABI_1.3 2025-05-07T20:11:15.6243441Z U __cxa_guard_acquire@CXXABI_1.3 2025-05-07T20:11:15.6243778Z U __cxa_guard_release@CXXABI_1.3 2025-05-07T20:11:15.6244134Z U __cxa_rethrow@CXXABI_1.3 2025-05-07T20:11:15.6244465Z U __cxa_thread_atexit@CXXABI_1.3.7 2025-05-07T20:11:15.6244931Z U __cxa_throw@CXXABI_1.3 2025-05-07T20:11:15.6245246Z U __gxx_personality_v0@CXXABI_1.3 2025-05-07T20:11:15.6245595Z U __tls_get_addr@GLIBC_2.3 2025-05-07T20:11:15.6246021Z U adjust_info_B_num_bits(int, int) 2025-05-07T20:11:15.6246401Z U at::Context::deterministicAlgorithms() const 2025-05-07T20:11:15.6246840Z U at::RecordFunction::RecordFunction(at::StepCallbacks&&) 2025-05-07T20:11:15.6247238Z U at::RecordFunction::currentThreadId() 2025-05-07T20:11:15.6247598Z U at::RecordFunction::end() 2025-05-07T20:11:15.6247925Z U at::RecordFunction::~RecordFunction() 2025-05-07T20:11:15.6248375Z U at::SavedTensorDefaultHooks::set_tracing(bool) 2025-05-07T20:11:15.6248834Z U at::TensorBase::__dispatch_contiguous(c10::MemoryFormat) const 2025-05-07T20:11:15.6249277Z U at::_ops::copy_::call(at::Tensor&, at::Tensor const&, bool) 2025-05-07T20:11:15.6250106Z U at::_ops::empty_like::call(at::Tensor const&, std::optional, std::optional, std::optional, std::optional, std::optional) 2025-05-07T20:11:15.6251747Z U at::_ops::empty_memory_format::call(c10::ArrayRef, std::optional, std::optional, std::optional, std::optional, std::optional) 2025-05-07T20:11:15.6252687Z U at::_ops::view_as::call(at::Tensor const&, at::Tensor const&) 2025-05-07T20:11:15.6253489Z U at::_ops::zeros::call(c10::ArrayRef, std::optional, std::optional, std::optional, std::optional) 2025-05-07T20:11:15.6254689Z U at::_ops::zeros_like::call(at::Tensor const&, std::optional, std::optional, std::optional, std::optional, std::optional) 2025-05-07T20:11:15.6255529Z U at::cuda::getCurrentDeviceProperties() 2025-05-07T20:11:15.6255941Z U at::functorch::functorchTLSAccessor() 2025-05-07T20:11:15.6256352Z U at::getStepCallbacksUnlessEmpty(at::RecordScope) 2025-05-07T20:11:15.6256764Z U at::globalContext() 2025-05-07T20:11:15.6257225Z U at::sequence_number::get_and_increment() 2025-05-07T20:11:15.6257553Z U c10::AnyType::get() 2025-05-07T20:11:15.6257966Z U c10::BFloat16* at::TensorBase::mutable_data_ptr() const 2025-05-07T20:11:15.6258378Z U c10::BoolType::get() 2025-05-07T20:11:15.6258803Z U c10::DeviceTypeName[abi:cxx11](c10::DeviceType, bool) 2025-05-07T20:11:15.6259277Z U c10::Dispatcher::findSchemaOrThrow(char const*, char const*) 2025-05-07T20:11:15.6259740Z U c10::Dispatcher::realSingleton() 2025-05-07T20:11:15.6260473Z U c10::Dispatcher::runRecordFunction(at::RecordFunction&, std::reference_wrapper, c10::DispatchKey, c10::DispatchKeySet) 2025-05-07T20:11:15.6261644Z U c10::Dispatcher::runRecordFunction(at::RecordFunction&, std::reference_wrapper, c10::DispatchKey, c10::DispatchKeySet, c10::ArrayRef) 2025-05-07T20:11:15.6262721Z U c10::Error::Error(c10::SourceLocation, std::__cxx11::basic_string, std::allocator >) 2025-05-07T20:11:15.6263305Z U c10::Error::what() const 2025-05-07T20:11:15.6263612Z U c10::FloatType::get() 2025-05-07T20:11:15.6263946Z U c10::GradMode::is_enabled() 2025-05-07T20:11:15.6264281Z U c10::GradMode::set_enabled(bool) 2025-05-07T20:11:15.6264693Z U c10::Half* at::TensorBase::mutable_data_ptr() const 2025-05-07T20:11:15.6265153Z U c10::IValue::TagType::get(c10::IValue const&) 2025-05-07T20:11:15.6265720Z U c10::IValue::hash(c10::IValue const&) 2025-05-07T20:11:15.6266086Z U c10::IValue::isBoolList() const 2025-05-07T20:11:15.6266425Z U c10::IValue::isIntList() const 2025-05-07T20:11:15.6266790Z U c10::IValue::isSymIntList() const 2025-05-07T20:11:15.6267135Z U c10::IValue::isTensorList() const 2025-05-07T20:11:15.6267537Z U c10::IValue::reportToTensorTypeError() const 2025-05-07T20:11:15.6267922Z U c10::IntType::get() 2025-05-07T20:11:15.6268299Z U c10::MessageLogger::MessageLogger(char const*, int, int) 2025-05-07T20:11:15.6268766Z U c10::MessageLogger::~MessageLogger() 2025-05-07T20:11:15.6269140Z U c10::OperatorHandle::~OperatorHandle() 2025-05-07T20:11:15.6269540Z U c10::OperatorHandle::~OperatorHandle() 2025-05-07T20:11:15.6270005Z U c10::OptionalType::get(c10::Type::SingletonOrSharedTypePtr) 2025-05-07T20:11:15.6270650Z U c10::SmallVectorBase::grow_pod(void const*, unsigned long, unsigned long) 2025-05-07T20:11:15.6271335Z U c10::StorageImpl::throw_data_ptr_access_error() const 2025-05-07T20:11:15.6271706Z U c10::StringType::get() 2025-05-07T20:11:15.6272075Z U c10::SymBool::expect_true(char const*, long) const 2025-05-07T20:11:15.6272465Z U c10::SymBool::guard_bool(char const*, long) const 2025-05-07T20:11:15.6272905Z U c10::SymBool::guard_size_oblivious(char const*, long) const 2025-05-07T20:11:15.6273350Z U c10::SymFloat::guard_float(char const*, long) const 2025-05-07T20:11:15.6273745Z U c10::SymFloat::operator/(c10::SymFloat const&) const 2025-05-07T20:11:15.6274408Z U c10::SymInt::SymInt(c10::intrusive_ptr >) 2025-05-07T20:11:15.6275029Z U c10::SymInt::guard_int(char const*, long) const 2025-05-07T20:11:15.6275411Z U c10::SymInt::operator c10::SymFloat() const 2025-05-07T20:11:15.6275798Z U c10::SymInt::operator/(c10::SymInt const&) const 2025-05-07T20:11:15.6276146Z U c10::SymInt::promote_to_negative() 2025-05-07T20:11:15.6276517Z U c10::SymInt::sym_eq(c10::SymInt const&) const 2025-05-07T20:11:15.6276877Z U c10::SymInt::sym_gt(c10::SymInt const&) const 2025-05-07T20:11:15.6277255Z U c10::SymInt::sym_le(c10::SymInt const&) const 2025-05-07T20:11:15.6277648Z U c10::SymInt::toSymNode() const 2025-05-07T20:11:15.6278196Z U c10::SymIntType::get() 2025-05-07T20:11:15.6278620Z U c10::SymbolicShapeMeta::init_is_contiguous() const 2025-05-07T20:11:15.6279045Z U c10::TensorImpl::requires_grad() const 2025-05-07T20:11:15.6279458Z U c10::TensorImpl::throw_data_ptr_access_error() const 2025-05-07T20:11:15.6279880Z U c10::TensorType::get() 2025-05-07T20:11:15.6280424Z U c10::UndefinedTensorImpl::_singleton 2025-05-07T20:11:15.6281414Z U c10::Warning::Warning(std::variant, c10::SourceLocation const&, std::__cxx11::basic_string, std::allocator >, bool) 2025-05-07T20:11:15.6282390Z U c10::cuda::CUDACachingAllocator::allocator 2025-05-07T20:11:15.6282812Z U c10::cuda::CUDAStream::stream() const 2025-05-07T20:11:15.6283188Z U c10::cuda::ExchangeDevice(signed char) 2025-05-07T20:11:15.6283570Z U c10::cuda::GetDevice(signed char*) 2025-05-07T20:11:15.6283949Z U c10::cuda::MaybeSetDevice(signed char) 2025-05-07T20:11:15.6284308Z U c10::cuda::SetDevice(signed char) 2025-05-07T20:11:15.6284819Z U c10::cuda::c10_cuda_check_implementation(int, char const*, char const*, int, bool) 2025-05-07T20:11:15.6285307Z U c10::cuda::device_count() 2025-05-07T20:11:15.6285682Z U c10::cuda::getCurrentCUDAStream(signed char) 2025-05-07T20:11:15.6286114Z U c10::cuda::getDefaultCUDAStream(signed char) 2025-05-07T20:11:15.6286529Z U c10::cuda::getStreamFromPool(bool, signed char) 2025-05-07T20:11:15.6286961Z U c10::cuda::getStreamFromPool(int, signed char) 2025-05-07T20:11:15.6287385Z U c10::cuda::setCurrentCUDAStream(c10::cuda::CUDAStream) 2025-05-07T20:11:15.6287919Z U c10::cuda::warn_or_error_on_sync() 2025-05-07T20:11:15.6288850Z U c10::detail::ListImpl::ListImpl(std::vector >, c10::Type::SingletonOrSharedTypePtr) 2025-05-07T20:11:15.6289864Z U c10::detail::infer_schema::make_function_schema(c10::ArrayRef, c10::ArrayRef) 2025-05-07T20:11:15.6291230Z U c10::detail::torchCheckFail(char const*, char const*, unsigned int, char const*) 2025-05-07T20:11:15.6292157Z U c10::detail::torchCheckFail(char const*, char const*, unsigned int, std::__cxx11::basic_string, std::allocator > const&) 2025-05-07T20:11:15.6293147Z U c10::detail::torchInternalAssertFail(char const*, char const*, unsigned int, char const*, char const*) 2025-05-07T20:11:15.6294229Z U c10::detail::torchInternalAssertFail(char const*, char const*, unsigned int, char const*, std::__cxx11::basic_string, std::allocator > const&) 2025-05-07T20:11:15.6295059Z U c10::impl::GPUTrace::gpuTraceState 2025-05-07T20:11:15.6295442Z U c10::impl::GPUTrace::haveState 2025-05-07T20:11:15.6296023Z U c10::impl::OperatorEntry::assertSignatureIsCorrect(c10::impl::CppSignature const&, bool) const 2025-05-07T20:11:15.6296661Z U c10::impl::OperatorEntry::reportError(c10::DispatchKey) const 2025-05-07T20:11:15.6297262Z U c10::impl::cow::is_cow_data_ptr(c10::DataPtr const&) 2025-05-07T20:11:15.6297694Z U c10::impl::cow::materialize_cow_storage(c10::StorageImpl&) 2025-05-07T20:11:15.6298241Z U c10::impl::device_guard_impl_registry 2025-05-07T20:11:15.6298598Z U c10::impl::raw_local_dispatch_key_set 2025-05-07T20:11:15.6298951Z U c10::initializeFunctionalityOffsetsAndMasks() 2025-05-07T20:11:15.6299585Z U c10::ivalue::ConstantString::create(std::__cxx11::basic_string, std::allocator >) 2025-05-07T20:11:15.6300215Z U c10::operator-(c10::SymInt const&, int) 2025-05-07T20:11:15.6300604Z U c10::operator<<(std::ostream&, c10::Device const&) 2025-05-07T20:11:15.6301011Z U c10::operator<<(std::ostream&, c10::DeviceType) 2025-05-07T20:11:15.6301400Z U c10::operator<<(std::ostream&, c10::OperatorName const&) 2025-05-07T20:11:15.6301832Z U c10::operator<<(std::ostream&, c10::SymFloat const&) 2025-05-07T20:11:15.6302223Z U c10::operator<<(std::ostream&, c10::SymInt const&) 2025-05-07T20:11:15.6302602Z U c10::throwNullDataPtrError() 2025-05-07T20:11:15.6302948Z U c10::warn(c10::Warning const&) 2025-05-07T20:11:15.6303266Z U c10::warnDeprecatedDataPtr() 2025-05-07T20:11:15.6303699Z U caffe2::TypeMeta::error_unsupported_typemeta(caffe2::TypeMeta) 2025-05-07T20:11:15.6304121Z U caffe2::TypeMeta::typeMetaDatas() 2025-05-07T20:11:15.6304496Z U cudaDeviceGetAttribute@libcudart.so.12 2025-05-07T20:11:15.6304860Z U cudaDeviceSynchronize@libcudart.so.12 2025-05-07T20:11:15.6305244Z U cudaEventCreateWithFlags@libcudart.so.12 2025-05-07T20:11:15.6305621Z U cudaEventDestroy@libcudart.so.12 2025-05-07T20:11:15.6305966Z U cudaEventElapsedTime@libcudart.so.12 2025-05-07T20:11:15.6306334Z U cudaEventQuery@libcudart.so.12 2025-05-07T20:11:15.6306671Z U cudaEventRecord@libcudart.so.12 2025-05-07T20:11:15.6307049Z U cudaEventSynchronize@libcudart.so.12 2025-05-07T20:11:15.6307410Z U cudaFuncSetAttribute@libcudart.so.12 2025-05-07T20:11:15.6307811Z U cudaGetDeviceProperties_v2@libcudart.so.12 2025-05-07T20:11:15.6308210Z U cudaGetErrorString@libcudart.so.12 2025-05-07T20:11:15.6308559Z U cudaGetLastError@libcudart.so.12 2025-05-07T20:11:15.6308895Z U cudaLaunchKernel@libcudart.so.12 2025-05-07T20:11:15.6309215Z U cudaStreamQuery@libcudart.so.12 2025-05-07T20:11:15.6309560Z U cudaStreamSynchronize@libcudart.so.12 2025-05-07T20:11:15.6309902Z U cudaStreamWaitEvent@libcudart.so.12 2025-05-07T20:11:15.6312165Z U embedding_ops::split_embedding_backward_codegen_find_long_segments(at::GenericPackedTensorAccessor, at::GenericPackedTensorAccessor, at::GenericPackedTensorAccessor, at::GenericPackedTensorAccessor, at::GenericPackedTensorAccessor, at::GenericPackedTensorAccessor, at::GenericPackedTensorAccessor, int, int, bool) 2025-05-07T20:11:15.6314536Z U fbgemm_gpu::config::is_feature_enabled(fbgemm_gpu::config::FeatureGateName const&) 2025-05-07T20:11:15.6315036Z U float* at::TensorBase::data_ptr() const 2025-05-07T20:11:15.6315417Z U float* at::TensorBase::mutable_data_ptr() const 2025-05-07T20:11:15.6315956Z U free@GLIBC_2.2.5 2025-05-07T20:11:15.6316256Z U int* at::TensorBase::data_ptr() const 2025-05-07T20:11:15.6316634Z U int* at::TensorBase::mutable_data_ptr() const 2025-05-07T20:11:15.6317041Z U long c10::detail::maybe_wrap_dim_slow(long, long, bool) 2025-05-07T20:11:15.6317460Z U long* at::TensorBase::data_ptr() const 2025-05-07T20:11:15.6318033Z U long* at::TensorBase::mutable_data_ptr() const 2025-05-07T20:11:15.6318380Z U memcmp@GLIBC_2.2.5 2025-05-07T20:11:15.6318726Z U memcpy@GLIBC_2.14 2025-05-07T20:11:15.6319092Z U memmove@GLIBC_2.2.5 2025-05-07T20:11:15.6319389Z U memset@GLIBC_2.2.5 2025-05-07T20:11:15.6319761Z U operator delete(void*, unsigned long)@CXXABI_1.3.9 2025-05-07T20:11:15.6320180Z U operator new(unsigned long)@GLIBCXX_3.4 2025-05-07T20:11:15.6320767Z U radix_sort_pairs(void*, unsigned long&, int const*, int*, float const*, float*, int, int, int, CUstream_st*) 2025-05-07T20:11:15.6321538Z U radix_sort_pairs(void*, unsigned long&, long const*, long*, float const*, float*, int, int, int, CUstream_st*) 2025-05-07T20:11:15.6322094Z U realloc@GLIBC_2.2.5 2025-05-07T20:11:15.6322505Z U std::_Hash_bytes(void const*, unsigned long, unsigned long)@CXXABI_1.3.5 2025-05-07T20:11:15.6323193Z U std::__cxx11::basic_ostringstream, std::allocator >::basic_ostringstream() 2025-05-07T20:11:15.6324054Z U std::__cxx11::basic_ostringstream, std::allocator >::~basic_ostringstream()@GLIBCXX_3.4.21 2025-05-07T20:11:15.6324898Z U std::__cxx11::basic_stringbuf, std::allocator >::str() const &@GLIBCXX_3.4.29 2025-05-07T20:11:15.6325679Z U std::__cxx11::basic_stringstream, std::allocator >::basic_stringstream() 2025-05-07T20:11:15.6326509Z U std::__cxx11::basic_stringstream, std::allocator >::~basic_stringstream()@GLIBCXX_3.4.21 2025-05-07T20:11:15.6327355Z U std::__detail::_Prime_rehash_policy::_M_need_rehash(unsigned long, unsigned long, unsigned long) const@GLIBCXX_3.4.18 2025-05-07T20:11:15.6327960Z U std::__throw_bad_alloc()@GLIBCXX_3.4 2025-05-07T20:11:15.6328318Z U std::__throw_bad_array_new_length() 2025-05-07T20:11:15.6328876Z U std::__throw_length_error(char const*)@GLIBCXX_3.4 2025-05-07T20:11:15.6329278Z U std::__throw_logic_error(char const*)@GLIBCXX_3.4 2025-05-07T20:11:15.6329689Z U std::__throw_out_of_range_fmt(char const*, ...)@GLIBCXX_3.4.20 2025-05-07T20:11:15.6330116Z U std::__throw_system_error(int)@GLIBCXX_3.4.11 2025-05-07T20:11:15.6330574Z U std::bad_weak_ptr::~bad_weak_ptr()@GLIBCXX_3.4.15 2025-05-07T20:11:15.6331064Z U std::basic_ios >::clear(std::_Ios_Iostate)@GLIBCXX_3.4 2025-05-07T20:11:15.6331762Z U std::basic_ios >::init(std::basic_streambuf >*)@GLIBCXX_3.4 2025-05-07T20:11:15.6332783Z U std::basic_ostream >& std::__ostream_insert >(std::basic_ostream >&, char const*, long)@GLIBCXX_3.4.9 2025-05-07T20:11:15.6333966Z U std::basic_ostream >& std::operator<< >(std::basic_ostream >&, char const*)@GLIBCXX_3.4 2025-05-07T20:11:15.6334715Z U std::exception::~exception()@GLIBCXX_3.4 2025-05-07T20:11:15.6335076Z U std::ios_base::Init::Init()@GLIBCXX_3.4 2025-05-07T20:11:15.6335445Z U std::ios_base::Init::~Init()@GLIBCXX_3.4 2025-05-07T20:11:15.6335801Z U std::ios_base::ios_base()@GLIBCXX_3.4 2025-05-07T20:11:15.6336163Z U std::ios_base::~ios_base()@GLIBCXX_3.4 2025-05-07T20:11:15.6336529Z U std::locale::locale()@GLIBCXX_3.4 2025-05-07T20:11:15.6336870Z U std::locale::~locale()@GLIBCXX_3.4 2025-05-07T20:11:15.6337284Z U std::ostream& std::ostream::_M_insert(long)@GLIBCXX_3.4.9 2025-05-07T20:11:15.6337899Z U std::ostream& std::ostream::_M_insert(unsigned long)@GLIBCXX_3.4.9 2025-05-07T20:11:15.6338408Z U std::ostream::operator<<(int)@GLIBCXX_3.4 2025-05-07T20:11:15.6338923Z U std::out_of_range::out_of_range(char const*)@GLIBCXX_3.4.21 2025-05-07T20:11:15.6339346Z U std::out_of_range::~out_of_range()@GLIBCXX_3.4 2025-05-07T20:11:15.6340045Z U std::runtime_error::runtime_error(std::__cxx11::basic_string, std::allocator > const&)@GLIBCXX_3.4.21 2025-05-07T20:11:15.6340715Z U std::runtime_error::~runtime_error()@GLIBCXX_3.4 2025-05-07T20:11:15.6341096Z U std::terminate()@GLIBCXX_3.4 2025-05-07T20:11:15.6341435Z U strcmp@GLIBC_2.2.5 2025-05-07T20:11:15.6341719Z U strlen@GLIBC_2.2.5 2025-05-07T20:11:15.6342044Z U torch::CppFunction::~CppFunction() 2025-05-07T20:11:15.6342855Z U torch::Library::Library(torch::Library::Kind, std::__cxx11::basic_string, std::allocator >, std::optional, char const*, unsigned int) 2025-05-07T20:11:15.6344181Z U torch::Library::_def(c10::FunctionSchema&&, c10::OperatorName*, std::vector > const&, torch::_RegisterOrVerify) & 2025-05-07T20:11:15.6345211Z U torch::Library::_impl(char const*, torch::CppFunction&&, torch::_RegisterOrVerify) & 2025-05-07T20:11:15.6345656Z U torch::autograd::AnomalyMode::_enabled 2025-05-07T20:11:15.6346150Z U torch::autograd::AutogradContext::AutogradContext(torch::dynamo::autograd::PackedArgs&) 2025-05-07T20:11:15.6346688Z U torch::autograd::AutogradContext::get_and_bump_dirty() const 2025-05-07T20:11:15.6347162Z U torch::autograd::AutogradContext::get_non_differentiable() const 2025-05-07T20:11:15.6347636Z U torch::autograd::AutogradContext::get_saved_variables() const 2025-05-07T20:11:15.6348268Z U torch::autograd::AutogradContext::save_for_backward(std::vector >) 2025-05-07T20:11:15.6348843Z U torch::autograd::AutogradContext::save_variables() 2025-05-07T20:11:15.6349256Z U torch::autograd::ForwardADLevel::try_get_by_idx(unsigned long) 2025-05-07T20:11:15.6349715Z U torch::autograd::InputMetadata::shape_as_dim_vector() const 2025-05-07T20:11:15.6350107Z U torch::autograd::Node::assign_parent() 2025-05-07T20:11:15.6350422Z U torch::autograd::Node::metadata() 2025-05-07T20:11:15.6350757Z U torch::autograd::Node::name[abi:cxx11]() const 2025-05-07T20:11:15.6351205Z U torch::autograd::SavedVariable::SavedVariable(at::Tensor const&, bool, bool) 2025-05-07T20:11:15.6352069Z U torch::autograd::SavedVariable::unpack(std::shared_ptr) const 2025-05-07T20:11:15.6352584Z U torch::autograd::VariableInfo::VariableInfo() 2025-05-07T20:11:15.6353201Z U torch::autograd::VariableInfo::VariableInfo(at::Tensor const&, bool) 2025-05-07T20:11:15.6353755Z U torch::autograd::VariableInfo::zeros(c10::OptionalDeviceGuard&) const 2025-05-07T20:11:15.6356781Z U torch::autograd::_wrap_outputs(std::vector > const&, std::unordered_set, std::equal_to, std::allocator > const&, std::unordered_set, std::equal_to, std::allocator > const&, c10::ArrayRef >, std::shared_ptr const&, std::function > (std::vector >, std::vector >)> const&, std::unordered_set, std::equal_to, std::allocator > const&, std::function const&) 2025-05-07T20:11:15.6359757Z U torch::autograd::deleteNode(torch::autograd::Node*) 2025-05-07T20:11:15.6360165Z U torch::autograd::get_current_graph_task_exec_info() 2025-05-07T20:11:15.6360592Z U torch::autograd::impl::gradient_edge(at::Tensor const&) 2025-05-07T20:11:15.6361667Z U torch::autograd::profiler::record_function_enter_new(std::__cxx11::basic_string, std::allocator > const&, std::optional, std::allocator > > const&) 2025-05-07T20:11:15.6362707Z U torch::dynamo::autograd::getPyCompilerInterface() 2025-05-07T20:11:15.6363500Z U torch::dynamo::autograd::get_input_metadata(std::vector > const&) 2025-05-07T20:11:15.6364366Z U torch::jit::parseSchema(std::__cxx11::basic_string, std::allocator > const&, bool) 2025-05-07T20:11:15.6365436Z U transpose_embedding_input(at::Tensor, long, at::Tensor, at::Tensor, bool, std::optional const&, long, long, long, bool, std::optional const&, long, long) 2025-05-07T20:11:15.6366146Z U typeinfo for c10::Error 2025-05-07T20:11:15.6366472Z U typeinfo for std::bad_weak_ptr@GLIBCXX_3.4.15 2025-05-07T20:11:15.6366813Z U typeinfo for std::exception@GLIBCXX_3.4 2025-05-07T20:11:15.6367158Z U typeinfo for std::out_of_range@GLIBCXX_3.4 2025-05-07T20:11:15.6367495Z U typeinfo for std::runtime_error@GLIBCXX_3.4 2025-05-07T20:11:15.6367839Z U typeinfo for torch::autograd::Node 2025-05-07T20:11:15.6369376Z U void embedding_ops::grad_mean_kernel(at::GenericPackedTensorAccessor, at::GenericPackedTensorAccessor, at::GenericPackedTensorAccessor, at::GenericPackedTensorAccessor, fbgemm_gpu::FixedDivisor) 2025-05-07T20:11:15.6372551Z U void embedding_ops::grad_mean_kernel(at::GenericPackedTensorAccessor, at::GenericPackedTensorAccessor, at::GenericPackedTensorAccessor, at::GenericPackedTensorAccessor, fbgemm_gpu::FixedDivisor) 2025-05-07T20:11:15.6375435Z U void embedding_ops::grad_mean_kernel(at::GenericPackedTensorAccessor, at::GenericPackedTensorAccessor, at::GenericPackedTensorAccessor, at::GenericPackedTensorAccessor, fbgemm_gpu::FixedDivisor) 2025-05-07T20:11:15.6378280Z U void embedding_ops::grad_mean_kernel(at::GenericPackedTensorAccessor, at::GenericPackedTensorAccessor, at::GenericPackedTensorAccessor, at::GenericPackedTensorAccessor, fbgemm_gpu::FixedDivisor) 2025-05-07T20:11:15.6381108Z U void embedding_ops::grad_mean_kernel(at::GenericPackedTensorAccessor, at::GenericPackedTensorAccessor, at::GenericPackedTensorAccessor, at::GenericPackedTensorAccessor, fbgemm_gpu::FixedDivisor) 2025-05-07T20:11:15.6384003Z U void embedding_ops::grad_mean_kernel(at::GenericPackedTensorAccessor, at::GenericPackedTensorAccessor, at::GenericPackedTensorAccessor, at::GenericPackedTensorAccessor, fbgemm_gpu::FixedDivisor) 2025-05-07T20:11:15.6385522Z U vtable for __cxxabiv1::__class_type_info@CXXABI_1.3 2025-05-07T20:11:15.6385901Z U vtable for __cxxabiv1::__function_type_info@CXXABI_1.3 2025-05-07T20:11:15.6386501Z U vtable for __cxxabiv1::__pointer_type_info@CXXABI_1.3 2025-05-07T20:11:15.6386934Z U vtable for __cxxabiv1::__si_class_type_info@CXXABI_1.3 2025-05-07T20:11:15.6387348Z U vtable for c10::Error 2025-05-07T20:11:15.6387901Z U vtable for std::__cxx11::basic_ostringstream, std::allocator >@GLIBCXX_3.4.21 2025-05-07T20:11:15.6388686Z U vtable for std::__cxx11::basic_stringbuf, std::allocator >@GLIBCXX_3.4.21 2025-05-07T20:11:15.6389446Z U vtable for std::__cxx11::basic_stringstream, std::allocator >@GLIBCXX_3.4.21 2025-05-07T20:11:15.6390024Z U vtable for std::bad_weak_ptr@GLIBCXX_3.4.15 2025-05-07T20:11:15.6390455Z U vtable for std::basic_ios >@GLIBCXX_3.4 2025-05-07T20:11:15.6390994Z U vtable for std::basic_streambuf >@GLIBCXX_3.4 2025-05-07T20:11:15.6391457Z U vtable for torch::autograd::Node 2025-05-07T20:11:15.6391849Z w TLS init function for c10::impl::raw_local_dispatch_key_set 2025-05-07T20:11:15.6392265Z w _ITM_deregisterTMCloneTable 2025-05-07T20:11:15.6392614Z w _ITM_registerTMCloneTable 2025-05-07T20:11:15.6392946Z w __cxa_finalize@GLIBC_2.2.5 2025-05-07T20:11:15.6393244Z w __gmon_start__ 2025-05-07T20:11:15.6393540Z w __pthread_key_create 2025-05-07T20:11:15.6393844Z w pthread_mutex_lock@GLIBC_2.2.5 2025-05-07T20:11:15.6394182Z w pthread_mutex_unlock@GLIBC_2.2.5 2025-05-07T20:11:15.6394560Z [CHECK] Listing out external shared libraries linked: 2025-05-07T20:11:15.6395067Z + ldd ./_skbuild/linux-x86_64-3.13/cmake-build/fbgemm_gpu_tbe_training_backward_dense.so 2025-05-07T20:11:15.6395449Z 2025-05-07T20:11:15.6395593Z linux-vdso.so.1 (0x00007ffe2f9aa000) 2025-05-07T20:11:15.6395896Z libc10.so => not found 2025-05-07T20:11:15.6396142Z libc10_cuda.so => not found 2025-05-07T20:11:15.6396701Z fbgemm_gpu_tbe_training_backward.so => /__w/FBGEMM/FBGEMM/fbgemm_gpu/./_skbuild/linux-x86_64-3.13/cmake-build/fbgemm_gpu_tbe_training_backward.so (0x00007fa266400000) 2025-05-07T20:11:15.6396813Z libtorch.so => not found 2025-05-07T20:11:15.6396913Z libtorch_cpu.so => not found 2025-05-07T20:11:15.6397011Z libtorch_cuda.so => not found 2025-05-07T20:11:15.6397134Z libcudart.so.12 => not found 2025-05-07T20:11:15.6397300Z libstdc++.so.6 => /lib64/libstdc++.so.6 (0x00007fa26619c000) 2025-05-07T20:11:15.6397445Z libgcc_s.so.1 => /lib64/libgcc_s.so.1 (0x00007fa2a4324000) 2025-05-07T20:11:15.6397588Z libc.so.6 => /lib64/libc.so.6 (0x00007fa265f94000) 2025-05-07T20:11:15.6397712Z /lib64/ld-linux-x86-64.so.2 (0x00007fa2a4358000) 2025-05-07T20:11:15.6397795Z libc10.so => not found 2025-05-07T20:11:15.6397886Z libc10_cuda.so => not found 2025-05-07T20:11:15.6398465Z fbgemm_gpu_tbe_common.so => /__w/FBGEMM/FBGEMM/fbgemm_gpu/./_skbuild/linux-x86_64-3.13/cmake-build/fbgemm_gpu_tbe_common.so (0x00007fa265c00000) 2025-05-07T20:11:15.6398970Z fbgemm_gpu_sparse_async_cumsum.so => /__w/FBGEMM/FBGEMM/fbgemm_gpu/./_skbuild/linux-x86_64-3.13/cmake-build/fbgemm_gpu_sparse_async_cumsum.so (0x00007fa265a50000) 2025-05-07T20:11:15.6399059Z libtorch.so => not found 2025-05-07T20:11:15.6399428Z fbgemm.so => /__w/FBGEMM/FBGEMM/fbgemm_gpu/./_skbuild/linux-x86_64-3.13/cmake-build/fbgemm.so (0x00007fa265400000) 2025-05-07T20:11:15.6399908Z fbgemm_gpu_tbe_utils.so => /__w/FBGEMM/FBGEMM/fbgemm_gpu/./_skbuild/linux-x86_64-3.13/cmake-build/fbgemm_gpu_tbe_utils.so (0x00007fa264200000) 2025-05-07T20:11:15.6400003Z libtorch_cpu.so => not found 2025-05-07T20:11:15.6400108Z libtorch_cuda.so => not found 2025-05-07T20:11:15.6400197Z libcudart.so.12 => not found 2025-05-07T20:11:15.6400313Z libm.so.6 => /lib64/libm.so.6 (0x00007fa2a4245000) 2025-05-07T20:11:15.6400410Z libc10.so => not found 2025-05-07T20:11:15.6400493Z libc10_cuda.so => not found 2025-05-07T20:11:15.6400899Z fbgemm_gpu_config.so => /__w/FBGEMM/FBGEMM/fbgemm_gpu/./_skbuild/linux-x86_64-3.13/cmake-build/fbgemm_gpu_config.so (0x00007fa2a4237000) 2025-05-07T20:11:15.6400988Z libtorch.so => not found 2025-05-07T20:11:15.6401091Z libtorch_cpu.so => not found 2025-05-07T20:11:15.6401187Z libtorch_cuda.so => not found 2025-05-07T20:11:15.6401279Z libcudart.so.12 => not found 2025-05-07T20:11:15.6401376Z libc10.so => not found 2025-05-07T20:11:15.6401461Z libc10_cuda.so => not found 2025-05-07T20:11:15.6401547Z libtorch.so => not found 2025-05-07T20:11:15.6401636Z libtorch_cpu.so => not found 2025-05-07T20:11:15.6401736Z libtorch_cuda.so => not found 2025-05-07T20:11:15.6401828Z libcudart.so.12 => not found 2025-05-07T20:11:15.6401922Z libc10.so => not found 2025-05-07T20:11:15.6402279Z asmjit.so => /__w/FBGEMM/FBGEMM/fbgemm_gpu/./_skbuild/linux-x86_64-3.13/cmake-build/asmjit.so (0x00007fa2a0585000) 2025-05-07T20:11:15.6402372Z libtorch.so => not found 2025-05-07T20:11:15.6402478Z libtorch_cpu.so => not found 2025-05-07T20:11:15.6402568Z libtorch_cuda.so => not found 2025-05-07T20:11:15.6402681Z libtorch.so => not found 2025-05-07T20:11:15.6402768Z libc10.so => not found 2025-05-07T20:11:15.6402857Z libc10_cuda.so => not found 2025-05-07T20:11:15.6403000Z libtorch_cpu.so => not found 2025-05-07T20:11:15.6403098Z libtorch_cuda.so => not found 2025-05-07T20:11:15.6403194Z libcudart.so.12 => not found 2025-05-07T20:11:15.6403286Z libc10.so => not found 2025-05-07T20:11:15.6403409Z libtorch_cpu.so => not found 2025-05-07T20:11:15.6403509Z libtorch_cuda.so => not found 2025-05-07T20:11:15.6403604Z libtorch.so => not found 2025-05-07T20:11:15.6403720Z libtorch_cpu.so => not found 2025-05-07T20:11:15.6403815Z libtorch_cuda.so => not found 2025-05-07T20:11:15.6403911Z libtorch.so => not found 2025-05-07T20:11:15.6403915Z 2025-05-07T20:11:15.6404048Z [CHECK] Displaying ELF information: 2025-05-07T20:11:15.6404330Z + readelf -d ./_skbuild/linux-x86_64-3.13/cmake-build/fbgemm_gpu_tbe_training_backward_dense.so 2025-05-07T20:11:15.6404336Z 2025-05-07T20:11:15.6404340Z 2025-05-07T20:11:15.6404497Z Dynamic section at offset 0x3a22e50 contains 39 entries: 2025-05-07T20:11:15.6404625Z Tag Type Name/Value 2025-05-07T20:11:15.6404808Z 0x0000000000000001 (NEEDED) Shared library: [libc10.so] 2025-05-07T20:11:15.6404999Z 0x0000000000000001 (NEEDED) Shared library: [libc10_cuda.so] 2025-05-07T20:11:15.6405246Z 0x0000000000000001 (NEEDED) Shared library: [fbgemm_gpu_tbe_training_backward.so] 2025-05-07T20:11:15.6405458Z 0x0000000000000001 (NEEDED) Shared library: [libtorch.so] 2025-05-07T20:11:15.6405653Z 0x0000000000000001 (NEEDED) Shared library: [libtorch_cpu.so] 2025-05-07T20:11:15.6405849Z 0x0000000000000001 (NEEDED) Shared library: [libtorch_cuda.so] 2025-05-07T20:11:15.6406060Z 0x0000000000000001 (NEEDED) Shared library: [libcudart.so.12] 2025-05-07T20:11:15.6406249Z 0x0000000000000001 (NEEDED) Shared library: [libstdc++.so.6] 2025-05-07T20:11:15.6406438Z 0x0000000000000001 (NEEDED) Shared library: [libgcc_s.so.1] 2025-05-07T20:11:15.6406645Z 0x0000000000000001 (NEEDED) Shared library: [libc.so.6] 2025-05-07T20:11:15.6406879Z 0x0000000000000001 (NEEDED) Shared library: [ld-linux-x86-64.so.2] 2025-05-07T20:11:15.6407143Z 0x000000000000000e (SONAME) Library soname: [fbgemm_gpu_tbe_training_backward_dense.so] 2025-05-07T20:11:15.6407397Z 0x000000000000000f (RPATH) Library rpath: [$ORIGIN] 2025-05-07T20:11:15.6407515Z 0x000000000000000c (INIT) 0x7a000 2025-05-07T20:11:15.6407630Z 0x000000000000000d (FINI) 0x26a70c 2025-05-07T20:11:15.6407747Z 0x0000000000000019 (INIT_ARRAY) 0x3a23350 2025-05-07T20:11:15.6407892Z 0x000000000000001b (INIT_ARRAYSZ) 184 (bytes) 2025-05-07T20:11:15.6408004Z 0x000000000000001a (FINI_ARRAY) 0x3a23408 2025-05-07T20:11:15.6408122Z 0x000000000000001c (FINI_ARRAYSZ) 8 (bytes) 2025-05-07T20:11:15.6408252Z 0x0000000000000004 (HASH) 0x238 2025-05-07T20:11:15.6408368Z 0x000000006ffffef5 (GNU_HASH) 0x2e00 2025-05-07T20:11:15.6408477Z 0x0000000000000005 (STRTAB) 0x101c8 2025-05-07T20:11:15.6408590Z 0x0000000000000006 (SYMTAB) 0x59c8 2025-05-07T20:11:15.6408738Z 0x000000000000000a (STRSZ) 353759 (bytes) 2025-05-07T20:11:15.6408856Z 0x000000000000000b (SYMENT) 24 (bytes) 2025-05-07T20:11:15.6408976Z 0x0000000000000003 (PLTGOT) 0x3a24100 2025-05-07T20:11:15.6409131Z 0x0000000000000002 (PLTRELSZ) 13056 (bytes) 2025-05-07T20:11:15.6409240Z 0x0000000000000014 (PLTREL) RELA 2025-05-07T20:11:15.6409351Z 0x0000000000000017 (JMPREL) 0x75e68 2025-05-07T20:11:15.6409481Z 0x0000000000000007 (RELA) 0x67708 2025-05-07T20:11:15.6409602Z 0x0000000000000008 (RELASZ) 59232 (bytes) 2025-05-07T20:11:15.6409712Z 0x0000000000000009 (RELAENT) 24 (bytes) 2025-05-07T20:11:15.6409812Z 0x0000000000000018 (BIND_NOW) 2025-05-07T20:11:15.6409947Z 0x000000006ffffffb (FLAGS_1) Flags: NOW 2025-05-07T20:11:15.6410086Z 0x000000006ffffffe (VERNEED) 0x675a8 2025-05-07T20:11:15.6410310Z 0x000000006fffffff (VERNEEDNUM) 5 2025-05-07T20:11:15.6410454Z 0x000000006ffffff0 (VERSYM) 0x667a8 2025-05-07T20:11:15.6410745Z 0x000000006ffffff9 (RELACOUNT) 1167 2025-05-07T20:11:15.6410858Z 0x0000000000000000 (NULL) 0x0 2025-05-07T20:11:15.6410863Z 2025-05-07T20:11:15.6411012Z ################################################################################ 2025-05-07T20:11:15.6411017Z 2025-05-07T20:11:15.6411021Z 2025-05-07T20:11:15.6411138Z ################################################################################ 2025-05-07T20:11:15.6411568Z [CHECK] BUILT LIBRARY: ./_skbuild/linux-x86_64-3.13/cmake-build/fbgemm_gpu_tbe_training_backward_vbe.so 2025-05-07T20:11:15.6411703Z [CHECK] Listing out library size: 2025-05-07T20:11:15.6412024Z + du -h --block-size=1M ./_skbuild/linux-x86_64-3.13/cmake-build/fbgemm_gpu_tbe_training_backward_vbe.so 2025-05-07T20:11:15.6412028Z 2025-05-07T20:11:15.6412292Z 329 ./_skbuild/linux-x86_64-3.13/cmake-build/fbgemm_gpu_tbe_training_backward_vbe.so 2025-05-07T20:11:15.6412297Z 2025-05-07T20:11:15.6412766Z [CHECK] Listing out the GLIBC versions referenced by: ./_skbuild/linux-x86_64-3.13/cmake-build/fbgemm_gpu_tbe_training_backward_vbe.so 2025-05-07T20:11:15.6413306Z + objdump -TC ./_skbuild/linux-x86_64-3.13/cmake-build/fbgemm_gpu_tbe_training_backward_vbe.so | grep GLIBC_ | sed 's/.*GLIBC_\([.0-9]*\).*/GLIBC_\1/g' | sort -Vu | cat 2025-05-07T20:11:15.6413311Z 2025-05-07T20:11:15.6915034Z GLIBC_2.2.5 2025-05-07T20:11:15.6916059Z GLIBC_2.3 2025-05-07T20:11:15.6916311Z GLIBC_2.14 2025-05-07T20:11:15.6916328Z 2025-05-07T20:11:15.6916341Z 2025-05-07T20:11:15.6917739Z [CHECK] Listing out the GLIBCXX versions referenced by: ./_skbuild/linux-x86_64-3.13/cmake-build/fbgemm_gpu_tbe_training_backward_vbe.so 2025-05-07T20:11:15.6918775Z + objdump -TC ./_skbuild/linux-x86_64-3.13/cmake-build/fbgemm_gpu_tbe_training_backward_vbe.so | grep GLIBCXX_ | sed 's/.*GLIBCXX_\([.0-9]*\).*/GLIBCXX_\1/g' | sort -Vu | cat 2025-05-07T20:11:15.6918794Z 2025-05-07T20:11:15.7494643Z GLIBCXX_3.4 2025-05-07T20:11:15.7494979Z GLIBCXX_3.4.9 2025-05-07T20:11:15.7495350Z GLIBCXX_3.4.11 2025-05-07T20:11:15.7495577Z GLIBCXX_3.4.18 2025-05-07T20:11:15.7495801Z GLIBCXX_3.4.20 2025-05-07T20:11:15.7496157Z GLIBCXX_3.4.21 2025-05-07T20:11:15.7496388Z GLIBCXX_3.4.29 2025-05-07T20:11:15.7496416Z 2025-05-07T20:11:15.7496429Z 2025-05-07T20:11:15.7518394Z + nm -gDC ./_skbuild/linux-x86_64-3.13/cmake-build/fbgemm_gpu_tbe_training_backward_vbe.so > /tmp/tmp.c70BoTldXe.symbols.txt 2025-05-07T20:11:15.7518436Z 2025-05-07T20:11:15.8065089Z 2025-05-07T20:11:15.8098829Z [CHECK] Total Number of symbols: 3670 2025-05-07T20:11:15.8133114Z [CHECK] Number of fbgemm symbols: 456 2025-05-07T20:11:15.8152637Z + nm -gDCu ./_skbuild/linux-x86_64-3.13/cmake-build/fbgemm_gpu_tbe_training_backward_vbe.so > /tmp/tmp.VPujZXTesy.usymbols.txt 2025-05-07T20:11:15.8152683Z 2025-05-07T20:11:15.8177098Z 2025-05-07T20:11:15.8204604Z [CHECK] Listing out undefined symbols (185 total): 2025-05-07T20:11:15.8222577Z U VTT for std::__cxx11::basic_ostringstream, std::allocator >@GLIBCXX_3.4.21 2025-05-07T20:11:15.8222992Z U VTT for std::__cxx11::basic_stringstream, std::allocator >@GLIBCXX_3.4.21 2025-05-07T20:11:15.8223152Z U _Unwind_Resume@GCC_3.0 2025-05-07T20:11:15.8223312Z U __cudaPopCallConfiguration@libcudart.so.12 2025-05-07T20:11:15.8223478Z U __cudaPushCallConfiguration@libcudart.so.12 2025-05-07T20:11:15.8223648Z U __cudaRegisterFatBinary@libcudart.so.12 2025-05-07T20:11:15.8223801Z U __cudaRegisterFatBinaryEnd@libcudart.so.12 2025-05-07T20:11:15.8223944Z U __cudaRegisterFunction@libcudart.so.12 2025-05-07T20:11:15.8224099Z U __cudaRegisterVar@libcudart.so.12 2025-05-07T20:11:15.8224247Z U __cudaUnregisterFatBinary@libcudart.so.12 2025-05-07T20:11:15.8225860Z U __cxa_atexit@GLIBC_2.2.5 2025-05-07T20:11:15.8225979Z U __cxa_begin_catch@CXXABI_1.3 2025-05-07T20:11:15.8226128Z U __cxa_end_catch@CXXABI_1.3 2025-05-07T20:11:15.8226244Z U __cxa_rethrow@CXXABI_1.3 2025-05-07T20:11:15.8240887Z U __cxa_thread_atexit@CXXABI_1.3.7 2025-05-07T20:11:15.8241278Z U __gxx_personality_v0@CXXABI_1.3 2025-05-07T20:11:15.8241405Z U __tls_get_addr@GLIBC_2.3 2025-05-07T20:11:15.8241519Z U adjust_info_B_num_bits(int, int) 2025-05-07T20:11:15.8241664Z U at::CUDAGeneratorImpl::device_type() 2025-05-07T20:11:15.8241868Z U at::CUDAGeneratorImpl::philox_cuda_state(unsigned long) 2025-05-07T20:11:15.8242014Z U at::Context::deterministicAlgorithms() const 2025-05-07T20:11:15.8242211Z U at::TensorBase::__dispatch_contiguous(c10::MemoryFormat) const 2025-05-07T20:11:15.8242397Z U at::_ops::copy_::call(at::Tensor&, at::Tensor const&, bool) 2025-05-07T20:11:15.8242993Z U at::_ops::empty_like::call(at::Tensor const&, std::optional, std::optional, std::optional, std::optional, std::optional) 2025-05-07T20:11:15.8243629Z U at::_ops::empty_memory_format::call(c10::ArrayRef, std::optional, std::optional, std::optional, std::optional, std::optional) 2025-05-07T20:11:15.8243840Z U at::_ops::reshape::call(at::Tensor const&, c10::ArrayRef) 2025-05-07T20:11:15.8244154Z U at::_ops::to_dtype::call(at::Tensor const&, c10::ScalarType, bool, bool, std::optional) 2025-05-07T20:11:15.8244630Z U at::_ops::zeros::call(c10::ArrayRef, std::optional, std::optional, std::optional, std::optional) 2025-05-07T20:11:15.8245404Z U at::_ops::zeros_like::call(at::Tensor const&, std::optional, std::optional, std::optional, std::optional, std::optional) 2025-05-07T20:11:15.8245624Z U at::cuda::detail::getDefaultCUDAGenerator(signed char) 2025-05-07T20:11:15.8245783Z U at::cuda::getCurrentDeviceProperties() 2025-05-07T20:11:15.8245893Z U at::globalContext() 2025-05-07T20:11:15.8246113Z U c10::BFloat16* at::TensorBase::mutable_data_ptr() const 2025-05-07T20:11:15.8246245Z U c10::BoolType::get() 2025-05-07T20:11:15.8246414Z U c10::DeviceTypeName[abi:cxx11](c10::DeviceType, bool) 2025-05-07T20:11:15.8246522Z U c10::FloatType::get() 2025-05-07T20:11:15.8246647Z U c10::GeneratorImpl::device() const 2025-05-07T20:11:15.8246847Z U c10::Half* at::TensorBase::mutable_data_ptr() const 2025-05-07T20:11:15.8246994Z U c10::IValue::reportToTensorTypeError() const 2025-05-07T20:11:15.8247104Z U c10::IntType::get() 2025-05-07T20:11:15.8247297Z U c10::MessageLogger::MessageLogger(char const*, int, int) 2025-05-07T20:11:15.8247417Z U c10::MessageLogger::~MessageLogger() 2025-05-07T20:11:15.8247576Z U c10::StorageImpl::throw_data_ptr_access_error() const 2025-05-07T20:11:15.8247745Z U c10::SymBool::expect_true(char const*, long) const 2025-05-07T20:11:15.8247890Z U c10::SymBool::guard_bool(char const*, long) const 2025-05-07T20:11:15.8248070Z U c10::SymBool::guard_size_oblivious(char const*, long) const 2025-05-07T20:11:15.8248243Z U c10::SymFloat::guard_float(char const*, long) const 2025-05-07T20:11:15.8248647Z U c10::SymInt::SymInt(c10::intrusive_ptr >) 2025-05-07T20:11:15.8248826Z U c10::SymInt::guard_int(char const*, long) const 2025-05-07T20:11:15.8248997Z U c10::SymInt::operator/(c10::SymInt const&) const 2025-05-07T20:11:15.8249116Z U c10::SymInt::promote_to_negative() 2025-05-07T20:11:15.8249251Z U c10::SymInt::sym_eq(c10::SymInt const&) const 2025-05-07T20:11:15.8249409Z U c10::SymInt::sym_gt(c10::SymInt const&) const 2025-05-07T20:11:15.8249536Z U c10::SymInt::sym_le(c10::SymInt const&) const 2025-05-07T20:11:15.8249646Z U c10::SymInt::toSymNode() const 2025-05-07T20:11:15.8249765Z U c10::SymIntType::get() 2025-05-07T20:11:15.8249926Z U c10::SymbolicShapeMeta::init_is_contiguous() const 2025-05-07T20:11:15.8250081Z U c10::TensorImpl::throw_data_ptr_access_error() const 2025-05-07T20:11:15.8250320Z U c10::TensorType::get() 2025-05-07T20:11:15.8250478Z U c10::UndefinedTensorImpl::_singleton 2025-05-07T20:11:15.8251202Z U c10::Warning::Warning(std::variant, c10::SourceLocation const&, std::__cxx11::basic_string, std::allocator >, bool) 2025-05-07T20:11:15.8251360Z U c10::cuda::CUDACachingAllocator::allocator 2025-05-07T20:11:15.8251485Z U c10::cuda::CUDAStream::stream() const 2025-05-07T20:11:15.8251604Z U c10::cuda::ExchangeDevice(signed char) 2025-05-07T20:11:15.8251720Z U c10::cuda::GetDevice(signed char*) 2025-05-07T20:11:15.8251862Z U c10::cuda::MaybeSetDevice(signed char) 2025-05-07T20:11:15.8251980Z U c10::cuda::SetDevice(signed char) 2025-05-07T20:11:15.8252230Z U c10::cuda::c10_cuda_check_implementation(int, char const*, char const*, int, bool) 2025-05-07T20:11:15.8252390Z U c10::cuda::device_count() 2025-05-07T20:11:15.8252540Z U c10::cuda::getCurrentCUDAStream(signed char) 2025-05-07T20:11:15.8252742Z U c10::cuda::getDefaultCUDAStream(signed char) 2025-05-07T20:11:15.8252909Z U c10::cuda::getStreamFromPool(bool, signed char) 2025-05-07T20:11:15.8253050Z U c10::cuda::getStreamFromPool(int, signed char) 2025-05-07T20:11:15.8253206Z U c10::cuda::setCurrentCUDAStream(c10::cuda::CUDAStream) 2025-05-07T20:11:15.8253342Z U c10::cuda::warn_or_error_on_sync() 2025-05-07T20:11:15.8253858Z U c10::detail::infer_schema::make_function_schema(c10::ArrayRef, c10::ArrayRef) 2025-05-07T20:11:15.8254115Z U c10::detail::torchCheckFail(char const*, char const*, unsigned int, char const*) 2025-05-07T20:11:15.8254633Z U c10::detail::torchCheckFail(char const*, char const*, unsigned int, std::__cxx11::basic_string, std::allocator > const&) 2025-05-07T20:11:15.8254986Z U c10::detail::torchInternalAssertFail(char const*, char const*, unsigned int, char const*, char const*) 2025-05-07T20:11:15.8255580Z U c10::detail::torchInternalAssertFail(char const*, char const*, unsigned int, char const*, std::__cxx11::basic_string, std::allocator > const&) 2025-05-07T20:11:15.8255705Z U c10::impl::GPUTrace::gpuTraceState 2025-05-07T20:11:15.8255819Z U c10::impl::GPUTrace::haveState 2025-05-07T20:11:15.8255991Z U c10::impl::cow::is_cow_data_ptr(c10::DataPtr const&) 2025-05-07T20:11:15.8256163Z U c10::impl::cow::materialize_cow_storage(c10::StorageImpl&) 2025-05-07T20:11:15.8256292Z U c10::impl::device_guard_impl_registry 2025-05-07T20:11:15.8256450Z U c10::operator+(c10::SymInt const&, int) 2025-05-07T20:11:15.8256589Z U c10::operator-(c10::SymInt const&, int) 2025-05-07T20:11:15.8256736Z U c10::operator<<(std::ostream&, c10::Device const&) 2025-05-07T20:11:15.8256881Z U c10::operator<<(std::ostream&, c10::DeviceType) 2025-05-07T20:11:15.8257037Z U c10::operator<<(std::ostream&, c10::SymInt const&) 2025-05-07T20:11:15.8257153Z U c10::throwNullDataPtrError() 2025-05-07T20:11:15.8257333Z U c10::warn(c10::Warning const&) 2025-05-07T20:11:15.8257447Z U c10::warnDeprecatedDataPtr() 2025-05-07T20:11:15.8257648Z U caffe2::TypeMeta::error_unsupported_typemeta(caffe2::TypeMeta) 2025-05-07T20:11:15.8257790Z U caffe2::TypeMeta::typeMetaDatas() 2025-05-07T20:11:15.8257926Z U cudaDeviceGetAttribute@libcudart.so.12 2025-05-07T20:11:15.8258062Z U cudaDeviceSynchronize@libcudart.so.12 2025-05-07T20:11:15.8258206Z U cudaEventCreateWithFlags@libcudart.so.12 2025-05-07T20:11:15.8258358Z U cudaEventDestroy@libcudart.so.12 2025-05-07T20:11:15.8258485Z U cudaEventElapsedTime@libcudart.so.12 2025-05-07T20:11:15.8258602Z U cudaEventQuery@libcudart.so.12 2025-05-07T20:11:15.8258751Z U cudaEventRecord@libcudart.so.12 2025-05-07T20:11:15.8258881Z U cudaEventSynchronize@libcudart.so.12 2025-05-07T20:11:15.8259007Z U cudaFuncSetAttribute@libcudart.so.12 2025-05-07T20:11:15.8259182Z U cudaGetDeviceProperties_v2@libcudart.so.12 2025-05-07T20:11:15.8259305Z U cudaGetErrorString@libcudart.so.12 2025-05-07T20:11:15.8259423Z U cudaGetLastError@libcudart.so.12 2025-05-07T20:11:15.8259544Z U cudaLaunchKernel@libcudart.so.12 2025-05-07T20:11:15.8259676Z U cudaStreamQuery@libcudart.so.12 2025-05-07T20:11:15.8259840Z U cudaStreamSynchronize@libcudart.so.12 2025-05-07T20:11:15.8259962Z U cudaStreamWaitEvent@libcudart.so.12 2025-05-07T20:11:15.8262250Z U embedding_ops::split_embedding_backward_codegen_find_long_segments(at::GenericPackedTensorAccessor, at::GenericPackedTensorAccessor, at::GenericPackedTensorAccessor, at::GenericPackedTensorAccessor, at::GenericPackedTensorAccessor, at::GenericPackedTensorAccessor, at::GenericPackedTensorAccessor, int, int, bool) 2025-05-07T20:11:15.8262460Z U fbgemm_gpu::asynchronous_complete_cumsum_gpu(at::Tensor const&) 2025-05-07T20:11:15.8262598Z U float at::Tensor::item() const 2025-05-07T20:11:15.8262871Z U float* at::TensorBase::data_ptr() const 2025-05-07T20:11:15.8263034Z U float* at::TensorBase::mutable_data_ptr() const 2025-05-07T20:11:15.8263174Z U int* at::TensorBase::data_ptr() const 2025-05-07T20:11:15.8263321Z U int* at::TensorBase::mutable_data_ptr() const 2025-05-07T20:11:15.8263498Z U long c10::detail::maybe_wrap_dim_slow(long, long, bool) 2025-05-07T20:11:15.8263641Z U long* at::TensorBase::data_ptr() const 2025-05-07T20:11:15.8263787Z U long* at::TensorBase::mutable_data_ptr() const 2025-05-07T20:11:15.8263888Z U memcpy@GLIBC_2.14 2025-05-07T20:11:15.8264001Z U memmove@GLIBC_2.2.5 2025-05-07T20:11:15.8264100Z U memset@GLIBC_2.2.5 2025-05-07T20:11:15.8264253Z U operator delete(void*, unsigned long)@CXXABI_1.3.9 2025-05-07T20:11:15.8264412Z U operator new(unsigned long)@GLIBCXX_3.4 2025-05-07T20:11:15.8264748Z U radix_sort_pairs(void*, unsigned long&, int const*, int*, float const*, float*, int, int, int, CUstream_st*) 2025-05-07T20:11:15.8265062Z U radix_sort_pairs(void*, unsigned long&, int const*, int*, int const*, int*, int, int, int, CUstream_st*) 2025-05-07T20:11:15.8265377Z U radix_sort_pairs(void*, unsigned long&, int const*, int*, long const*, long*, int, int, int, CUstream_st*) 2025-05-07T20:11:15.8265712Z U radix_sort_pairs(void*, unsigned long&, long const*, long*, float const*, float*, int, int, int, CUstream_st*) 2025-05-07T20:11:15.8266022Z U radix_sort_pairs(void*, unsigned long&, long const*, long*, int const*, int*, int, int, int, CUstream_st*) 2025-05-07T20:11:15.8266335Z U radix_sort_pairs(void*, unsigned long&, long const*, long*, long const*, long*, int, int, int, CUstream_st*) 2025-05-07T20:11:15.8266691Z U std::__cxx11::basic_ostringstream, std::allocator >::basic_ostringstream() 2025-05-07T20:11:15.8267077Z U std::__cxx11::basic_ostringstream, std::allocator >::~basic_ostringstream()@GLIBCXX_3.4.21 2025-05-07T20:11:15.8267436Z U std::__cxx11::basic_stringbuf, std::allocator >::str() const &@GLIBCXX_3.4.29 2025-05-07T20:11:15.8267796Z U std::__detail::_Prime_rehash_policy::_M_need_rehash(unsigned long, unsigned long, unsigned long) const@GLIBCXX_3.4.18 2025-05-07T20:11:15.8267914Z U std::__throw_bad_alloc()@GLIBCXX_3.4 2025-05-07T20:11:15.8268049Z U std::__throw_bad_array_new_length() 2025-05-07T20:11:15.8268192Z U std::__throw_length_error(char const*)@GLIBCXX_3.4 2025-05-07T20:11:15.8268334Z U std::__throw_logic_error(char const*)@GLIBCXX_3.4 2025-05-07T20:11:15.8268524Z U std::__throw_out_of_range_fmt(char const*, ...)@GLIBCXX_3.4.20 2025-05-07T20:11:15.8268683Z U std::__throw_system_error(int)@GLIBCXX_3.4.11 2025-05-07T20:11:15.8268998Z U std::basic_ios >::clear(std::_Ios_Iostate)@GLIBCXX_3.4 2025-05-07T20:11:15.8269360Z U std::basic_ios >::init(std::basic_streambuf >*)@GLIBCXX_3.4 2025-05-07T20:11:15.8269913Z U std::basic_ostream >& std::__ostream_insert >(std::basic_ostream >&, char const*, long)@GLIBCXX_3.4.9 2025-05-07T20:11:15.8270503Z U std::basic_ostream >& std::operator<< >(std::basic_ostream >&, char const*)@GLIBCXX_3.4 2025-05-07T20:11:15.8270643Z U std::ios_base::Init::Init()@GLIBCXX_3.4 2025-05-07T20:11:15.8270762Z U std::ios_base::Init::~Init()@GLIBCXX_3.4 2025-05-07T20:11:15.8270878Z U std::ios_base::ios_base()@GLIBCXX_3.4 2025-05-07T20:11:15.8271012Z U std::ios_base::~ios_base()@GLIBCXX_3.4 2025-05-07T20:11:15.8271127Z U std::locale::locale()@GLIBCXX_3.4 2025-05-07T20:11:15.8271231Z U std::locale::~locale()@GLIBCXX_3.4 2025-05-07T20:11:15.8271437Z U std::ostream& std::ostream::_M_insert(long)@GLIBCXX_3.4.9 2025-05-07T20:11:15.8271666Z U std::ostream& std::ostream::_M_insert(unsigned long)@GLIBCXX_3.4.9 2025-05-07T20:11:15.8271787Z U std::ostream::operator<<(int)@GLIBCXX_3.4 2025-05-07T20:11:15.8271925Z U std::terminate()@GLIBCXX_3.4 2025-05-07T20:11:15.8272021Z U strlen@GLIBC_2.2.5 2025-05-07T20:11:15.8272136Z U torch::CppFunction::~CppFunction() 2025-05-07T20:11:15.8272711Z U torch::Library::Library(torch::Library::Kind, std::__cxx11::basic_string, std::allocator >, std::optional, char const*, unsigned int) 2025-05-07T20:11:15.8273179Z U torch::Library::_def(c10::FunctionSchema&&, c10::OperatorName*, std::vector > const&, torch::_RegisterOrVerify) & 2025-05-07T20:11:15.8273419Z U torch::Library::_impl(char const*, torch::CppFunction&&, torch::_RegisterOrVerify) & 2025-05-07T20:11:15.8273776Z U torch::jit::parseSchema(std::__cxx11::basic_string, std::allocator > const&, bool) 2025-05-07T20:11:15.8274285Z U transpose_embedding_input(at::Tensor, long, at::Tensor, at::Tensor, bool, std::optional const&, long, long, long, bool, std::optional const&, long, long) 2025-05-07T20:11:15.8276126Z U void embedding_ops::grad_mean_vbe_kernel(at::GenericPackedTensorAccessor, at::GenericPackedTensorAccessor, at::GenericPackedTensorAccessor, at::GenericPackedTensorAccessor, at::GenericPackedTensorAccessor, at::GenericPackedTensorAccessor, int, unsigned int) 2025-05-07T20:11:15.8277956Z U void embedding_ops::grad_mean_vbe_kernel(at::GenericPackedTensorAccessor, at::GenericPackedTensorAccessor, at::GenericPackedTensorAccessor, at::GenericPackedTensorAccessor, at::GenericPackedTensorAccessor, at::GenericPackedTensorAccessor, int, unsigned int) 2025-05-07T20:11:15.8280003Z U void embedding_ops::grad_mean_vbe_kernel(at::GenericPackedTensorAccessor, at::GenericPackedTensorAccessor, at::GenericPackedTensorAccessor, at::GenericPackedTensorAccessor, at::GenericPackedTensorAccessor, at::GenericPackedTensorAccessor, int, unsigned int) 2025-05-07T20:11:15.8281917Z U void embedding_ops::grad_mean_vbe_kernel(at::GenericPackedTensorAccessor, at::GenericPackedTensorAccessor, at::GenericPackedTensorAccessor, at::GenericPackedTensorAccessor, at::GenericPackedTensorAccessor, at::GenericPackedTensorAccessor, int, unsigned int) 2025-05-07T20:11:15.8283964Z U void embedding_ops::grad_mean_vbe_kernel(at::GenericPackedTensorAccessor, at::GenericPackedTensorAccessor, at::GenericPackedTensorAccessor, at::GenericPackedTensorAccessor, at::GenericPackedTensorAccessor, at::GenericPackedTensorAccessor, int, unsigned int) 2025-05-07T20:11:15.8285887Z U void embedding_ops::grad_mean_vbe_kernel(at::GenericPackedTensorAccessor, at::GenericPackedTensorAccessor, at::GenericPackedTensorAccessor, at::GenericPackedTensorAccessor, at::GenericPackedTensorAccessor, at::GenericPackedTensorAccessor, int, unsigned int) 2025-05-07T20:11:15.8287664Z U void embedding_ops::split_embedding_backward_count_unique_indices_kernel(at::GenericPackedTensorAccessor, at::GenericPackedTensorAccessor, at::GenericPackedTensorAccessor, at::GenericPackedTensorAccessor, at::GenericPackedTensorAccessor, int) 2025-05-07T20:11:15.8287820Z U vtable for __cxxabiv1::__class_type_info@CXXABI_1.3 2025-05-07T20:11:15.8287982Z U vtable for __cxxabiv1::__function_type_info@CXXABI_1.3 2025-05-07T20:11:15.8288168Z U vtable for __cxxabiv1::__si_class_type_info@CXXABI_1.3 2025-05-07T20:11:15.8288522Z U vtable for std::__cxx11::basic_ostringstream, std::allocator >@GLIBCXX_3.4.21 2025-05-07T20:11:15.8288849Z U vtable for std::__cxx11::basic_stringbuf, std::allocator >@GLIBCXX_3.4.21 2025-05-07T20:11:15.8289215Z U vtable for std::__cxx11::basic_stringstream, std::allocator >@GLIBCXX_3.4.21 2025-05-07T20:11:15.8289411Z U vtable for std::basic_ios >@GLIBCXX_3.4 2025-05-07T20:11:15.8289641Z U vtable for std::basic_streambuf >@GLIBCXX_3.4 2025-05-07T20:11:15.8289762Z w _ITM_deregisterTMCloneTable 2025-05-07T20:11:15.8289890Z w _ITM_registerTMCloneTable 2025-05-07T20:11:15.8289996Z w __cxa_finalize@GLIBC_2.2.5 2025-05-07T20:11:15.8290087Z w __gmon_start__ 2025-05-07T20:11:15.8290303Z w __pthread_key_create 2025-05-07T20:11:15.8290454Z w pthread_mutex_lock@GLIBC_2.2.5 2025-05-07T20:11:15.8290567Z w pthread_mutex_unlock@GLIBC_2.2.5 2025-05-07T20:11:15.8290743Z [CHECK] Listing out external shared libraries linked: 2025-05-07T20:11:15.8291133Z + ldd ./_skbuild/linux-x86_64-3.13/cmake-build/fbgemm_gpu_tbe_training_backward_vbe.so 2025-05-07T20:11:15.8291142Z 2025-05-07T20:11:15.8291299Z linux-vdso.so.1 (0x00007ffc259f4000) 2025-05-07T20:11:15.8291398Z libc10.so => not found 2025-05-07T20:11:15.8291498Z libc10_cuda.so => not found 2025-05-07T20:11:15.8292079Z fbgemm_gpu_tbe_training_backward.so => /__w/FBGEMM/FBGEMM/fbgemm_gpu/./_skbuild/linux-x86_64-3.13/cmake-build/fbgemm_gpu_tbe_training_backward.so (0x00007ff869400000) 2025-05-07T20:11:15.8292172Z libtorch.so => not found 2025-05-07T20:11:15.8292282Z libtorch_cpu.so => not found 2025-05-07T20:11:15.8292379Z libtorch_cuda.so => not found 2025-05-07T20:11:15.8292471Z libcudart.so.12 => not found 2025-05-07T20:11:15.8292667Z libstdc++.so.6 => /lib64/libstdc++.so.6 (0x00007ff86919c000) 2025-05-07T20:11:15.8292828Z libgcc_s.so.1 => /lib64/libgcc_s.so.1 (0x00007ff8b8727000) 2025-05-07T20:11:15.8292963Z libc.so.6 => /lib64/libc.so.6 (0x00007ff868f94000) 2025-05-07T20:11:15.8293096Z /lib64/ld-linux-x86-64.so.2 (0x00007ff8b875b000) 2025-05-07T20:11:15.8293198Z libc10.so => not found 2025-05-07T20:11:15.8293290Z libc10_cuda.so => not found 2025-05-07T20:11:15.8293767Z fbgemm_gpu_tbe_common.so => /__w/FBGEMM/FBGEMM/fbgemm_gpu/./_skbuild/linux-x86_64-3.13/cmake-build/fbgemm_gpu_tbe_common.so (0x00007ff868c00000) 2025-05-07T20:11:15.8294324Z fbgemm_gpu_sparse_async_cumsum.so => /__w/FBGEMM/FBGEMM/fbgemm_gpu/./_skbuild/linux-x86_64-3.13/cmake-build/fbgemm_gpu_sparse_async_cumsum.so (0x00007ff868a50000) 2025-05-07T20:11:15.8294419Z libtorch.so => not found 2025-05-07T20:11:15.8294779Z fbgemm.so => /__w/FBGEMM/FBGEMM/fbgemm_gpu/./_skbuild/linux-x86_64-3.13/cmake-build/fbgemm.so (0x00007ff868400000) 2025-05-07T20:11:15.8295284Z fbgemm_gpu_tbe_utils.so => /__w/FBGEMM/FBGEMM/fbgemm_gpu/./_skbuild/linux-x86_64-3.13/cmake-build/fbgemm_gpu_tbe_utils.so (0x00007ff867200000) 2025-05-07T20:11:15.8295383Z libtorch_cpu.so => not found 2025-05-07T20:11:15.8295489Z libtorch_cuda.so => not found 2025-05-07T20:11:15.8295605Z libcudart.so.12 => not found 2025-05-07T20:11:15.8295733Z libm.so.6 => /lib64/libm.so.6 (0x00007ff8b8648000) 2025-05-07T20:11:15.8295827Z libc10.so => not found 2025-05-07T20:11:15.8295919Z libc10_cuda.so => not found 2025-05-07T20:11:15.8296372Z fbgemm_gpu_config.so => /__w/FBGEMM/FBGEMM/fbgemm_gpu/./_skbuild/linux-x86_64-3.13/cmake-build/fbgemm_gpu_config.so (0x00007ff8b863a000) 2025-05-07T20:11:15.8296465Z libtorch.so => not found 2025-05-07T20:11:15.8296559Z libtorch_cpu.so => not found 2025-05-07T20:11:15.8296669Z libtorch_cuda.so => not found 2025-05-07T20:11:15.8296763Z libcudart.so.12 => not found 2025-05-07T20:11:15.8296853Z libc10.so => not found 2025-05-07T20:11:15.8296948Z libc10_cuda.so => not found 2025-05-07T20:11:15.8297053Z libtorch.so => not found 2025-05-07T20:11:15.8297145Z libtorch_cpu.so => not found 2025-05-07T20:11:15.8297237Z libtorch_cuda.so => not found 2025-05-07T20:11:15.8297350Z libcudart.so.12 => not found 2025-05-07T20:11:15.8297429Z libc10.so => not found 2025-05-07T20:11:15.8297784Z asmjit.so => /__w/FBGEMM/FBGEMM/fbgemm_gpu/./_skbuild/linux-x86_64-3.13/cmake-build/asmjit.so (0x00007ff8a3585000) 2025-05-07T20:11:15.8297894Z libtorch.so => not found 2025-05-07T20:11:15.8297981Z libtorch_cpu.so => not found 2025-05-07T20:11:15.8298075Z libtorch_cuda.so => not found 2025-05-07T20:11:15.8298169Z libtorch.so => not found 2025-05-07T20:11:15.8298282Z libc10.so => not found 2025-05-07T20:11:15.8298371Z libc10_cuda.so => not found 2025-05-07T20:11:15.8298468Z libtorch_cpu.so => not found 2025-05-07T20:11:15.8298591Z libtorch_cuda.so => not found 2025-05-07T20:11:15.8298873Z libcudart.so.12 => not found 2025-05-07T20:11:15.8298959Z libc10.so => not found 2025-05-07T20:11:15.8299054Z libtorch_cpu.so => not found 2025-05-07T20:11:15.8299214Z libtorch_cuda.so => not found 2025-05-07T20:11:15.8299333Z libtorch.so => not found 2025-05-07T20:11:15.8299428Z libtorch_cpu.so => not found 2025-05-07T20:11:15.8299571Z libtorch_cuda.so => not found 2025-05-07T20:11:15.8299664Z libtorch.so => not found 2025-05-07T20:11:15.8299669Z 2025-05-07T20:11:15.8299777Z [CHECK] Displaying ELF information: 2025-05-07T20:11:15.8300065Z + readelf -d ./_skbuild/linux-x86_64-3.13/cmake-build/fbgemm_gpu_tbe_training_backward_vbe.so 2025-05-07T20:11:15.8300089Z 2025-05-07T20:11:15.8327941Z 2025-05-07T20:11:15.8328380Z Dynamic section at offset 0x148571f8 contains 39 entries: 2025-05-07T20:11:15.8328730Z Tag Type Name/Value 2025-05-07T20:11:15.8329172Z 0x0000000000000001 (NEEDED) Shared library: [libc10.so] 2025-05-07T20:11:15.8329396Z 0x0000000000000001 (NEEDED) Shared library: [libc10_cuda.so] 2025-05-07T20:11:15.8329682Z 0x0000000000000001 (NEEDED) Shared library: [fbgemm_gpu_tbe_training_backward.so] 2025-05-07T20:11:15.8329919Z 0x0000000000000001 (NEEDED) Shared library: [libtorch.so] 2025-05-07T20:11:15.8330216Z 0x0000000000000001 (NEEDED) Shared library: [libtorch_cpu.so] 2025-05-07T20:11:15.8330460Z 0x0000000000000001 (NEEDED) Shared library: [libtorch_cuda.so] 2025-05-07T20:11:15.8330680Z 0x0000000000000001 (NEEDED) Shared library: [libcudart.so.12] 2025-05-07T20:11:15.8330881Z 0x0000000000000001 (NEEDED) Shared library: [libstdc++.so.6] 2025-05-07T20:11:15.8331087Z 0x0000000000000001 (NEEDED) Shared library: [libgcc_s.so.1] 2025-05-07T20:11:15.8331295Z 0x0000000000000001 (NEEDED) Shared library: [libc.so.6] 2025-05-07T20:11:15.8331516Z 0x0000000000000001 (NEEDED) Shared library: [ld-linux-x86-64.so.2] 2025-05-07T20:11:15.8331795Z 0x000000000000000e (SONAME) Library soname: [fbgemm_gpu_tbe_training_backward_vbe.so] 2025-05-07T20:11:15.8332151Z 0x000000000000000f (RPATH) Library rpath: [$ORIGIN] 2025-05-07T20:11:15.8332298Z 0x000000000000000c (INIT) 0x1c3000 2025-05-07T20:11:15.8332412Z 0x000000000000000d (FINI) 0xf0879c 2025-05-07T20:11:15.8332546Z 0x0000000000000019 (INIT_ARRAY) 0x14856518 2025-05-07T20:11:15.8332704Z 0x000000000000001b (INIT_ARRAYSZ) 680 (bytes) 2025-05-07T20:11:15.8332828Z 0x000000000000001a (FINI_ARRAY) 0x148567c0 2025-05-07T20:11:15.8332957Z 0x000000000000001c (FINI_ARRAYSZ) 8 (bytes) 2025-05-07T20:11:15.8333102Z 0x0000000000000004 (HASH) 0x238 2025-05-07T20:11:15.8333215Z 0x000000006ffffef5 (GNU_HASH) 0x4b88 2025-05-07T20:11:15.8333327Z 0x0000000000000005 (STRTAB) 0x1fa30 2025-05-07T20:11:15.8333439Z 0x0000000000000006 (SYMTAB) 0xa208 2025-05-07T20:11:15.8333603Z 0x000000000000000a (STRSZ) 1419969 (bytes) 2025-05-07T20:11:15.8333728Z 0x000000000000000b (SYMENT) 24 (bytes) 2025-05-07T20:11:15.8333852Z 0x0000000000000003 (PLTGOT) 0x148574a8 2025-05-07T20:11:15.8334007Z 0x0000000000000002 (PLTRELSZ) 18120 (bytes) 2025-05-07T20:11:15.8334116Z 0x0000000000000014 (PLTREL) RELA 2025-05-07T20:11:15.8334231Z 0x0000000000000017 (JMPREL) 0x1bded8 2025-05-07T20:11:15.8334364Z 0x0000000000000007 (RELA) 0x17c2e0 2025-05-07T20:11:15.8334497Z 0x0000000000000008 (RELASZ) 269304 (bytes) 2025-05-07T20:11:15.8334610Z 0x0000000000000009 (RELAENT) 24 (bytes) 2025-05-07T20:11:15.8334709Z 0x0000000000000018 (BIND_NOW) 2025-05-07T20:11:15.8334855Z 0x000000006ffffffb (FLAGS_1) Flags: NOW 2025-05-07T20:11:15.8334968Z 0x000000006ffffffe (VERNEED) 0x17c1a0 2025-05-07T20:11:15.8335075Z 0x000000006fffffff (VERNEEDNUM) 5 2025-05-07T20:11:15.8335216Z 0x000000006ffffff0 (VERSYM) 0x17a4f2 2025-05-07T20:11:15.8335331Z 0x000000006ffffff9 (RELACOUNT) 7406 2025-05-07T20:11:15.8335492Z 0x0000000000000000 (NULL) 0x0 2025-05-07T20:11:15.8335570Z 2025-05-07T20:11:15.8335712Z ################################################################################ 2025-05-07T20:11:15.8335760Z 2025-05-07T20:11:15.8335765Z 2025-05-07T20:11:15.8335981Z [CHECK] Verifying sample subset of symbols in the built libraries ... 2025-05-07T20:11:15.8453884Z [CHECK] Symbol NOT found in ./_skbuild/linux-x86_64-3.13/cmake-build/asmjit.so: fbgemm_gpu::asynchronous_inclusive_cumsum_cpu 2025-05-07T20:11:15.8478601Z [CHECK] Symbol NOT found in ./_skbuild/linux-x86_64-3.13/cmake-build/fbgemm_gpu_config.so: fbgemm_gpu::asynchronous_inclusive_cumsum_cpu 2025-05-07T20:11:15.8695193Z [CHECK] Symbol NOT found in ./_skbuild/linux-x86_64-3.13/cmake-build/fbgemm.so: fbgemm_gpu::asynchronous_inclusive_cumsum_cpu 2025-05-07T20:11:15.8732178Z [CHECK] Symbol NOT found in ./_skbuild/linux-x86_64-3.13/cmake-build/fbgemm_gpu_tbe_common.so: fbgemm_gpu::asynchronous_inclusive_cumsum_cpu 2025-05-07T20:11:15.8765197Z [CHECK] Symbol NOT found in ./_skbuild/linux-x86_64-3.13/cmake-build/fbgemm_gpu_tbe_optimizers.so: fbgemm_gpu::asynchronous_inclusive_cumsum_cpu 2025-05-07T20:11:15.8813838Z [CHECK] Symbol NOT found in ./_skbuild/linux-x86_64-3.13/cmake-build/fbgemm_gpu_tbe_cache.so: fbgemm_gpu::asynchronous_inclusive_cumsum_cpu 2025-05-07T20:11:15.8847738Z [CHECK] Symbol NOT found in ./_skbuild/linux-x86_64-3.13/cmake-build/fbgemm_gpu_tbe_utils.so: fbgemm_gpu::asynchronous_inclusive_cumsum_cpu 2025-05-07T20:11:15.8876031Z [CHECK] Found symbol in ./_skbuild/linux-x86_64-3.13/cmake-build/fbgemm_gpu_sparse_async_cumsum.so: fbgemm_gpu::asynchronous_inclusive_cumsum_cpu 2025-05-07T20:11:15.8987064Z [CHECK] Symbol NOT found in ./_skbuild/linux-x86_64-3.13/cmake-build/asmjit.so: fbgemm_gpu::jagged_2d_to_dense 2025-05-07T20:11:15.9012833Z [CHECK] Symbol NOT found in ./_skbuild/linux-x86_64-3.13/cmake-build/fbgemm_gpu_config.so: fbgemm_gpu::jagged_2d_to_dense 2025-05-07T20:11:15.9230966Z [CHECK] Symbol NOT found in ./_skbuild/linux-x86_64-3.13/cmake-build/fbgemm.so: fbgemm_gpu::jagged_2d_to_dense 2025-05-07T20:11:15.9262967Z [CHECK] Symbol NOT found in ./_skbuild/linux-x86_64-3.13/cmake-build/fbgemm_gpu_tbe_common.so: fbgemm_gpu::jagged_2d_to_dense 2025-05-07T20:11:15.9295067Z [CHECK] Symbol NOT found in ./_skbuild/linux-x86_64-3.13/cmake-build/fbgemm_gpu_tbe_optimizers.so: fbgemm_gpu::jagged_2d_to_dense 2025-05-07T20:11:15.9343635Z [CHECK] Symbol NOT found in ./_skbuild/linux-x86_64-3.13/cmake-build/fbgemm_gpu_tbe_cache.so: fbgemm_gpu::jagged_2d_to_dense 2025-05-07T20:11:15.9377158Z [CHECK] Symbol NOT found in ./_skbuild/linux-x86_64-3.13/cmake-build/fbgemm_gpu_tbe_utils.so: fbgemm_gpu::jagged_2d_to_dense 2025-05-07T20:11:15.9407362Z [CHECK] Symbol NOT found in ./_skbuild/linux-x86_64-3.13/cmake-build/fbgemm_gpu_sparse_async_cumsum.so: fbgemm_gpu::jagged_2d_to_dense 2025-05-07T20:11:15.9801238Z [CHECK] Symbol NOT found in ./_skbuild/linux-x86_64-3.13/cmake-build/fbgemm_gpu_tbe_training_backward_pt2.so: fbgemm_gpu::jagged_2d_to_dense 2025-05-07T20:11:16.0154827Z [CHECK] Symbol NOT found in ./_skbuild/linux-x86_64-3.13/cmake-build/fbgemm_gpu_tbe_inference.so: fbgemm_gpu::jagged_2d_to_dense 2025-05-07T20:11:16.0339241Z [CHECK] Symbol NOT found in ./_skbuild/linux-x86_64-3.13/cmake-build/fbgemm_gpu_tbe_training_backward_split_host.so: fbgemm_gpu::jagged_2d_to_dense 2025-05-07T20:11:16.1257289Z [CHECK] Symbol NOT found in ./_skbuild/linux-x86_64-3.13/cmake-build/fbgemm_gpu_tbe_training_forward.so: fbgemm_gpu::jagged_2d_to_dense 2025-05-07T20:11:16.1337597Z [CHECK] Symbol NOT found in ./_skbuild/linux-x86_64-3.13/cmake-build/fbgemm_gpu_tbe_index_select.so: fbgemm_gpu::jagged_2d_to_dense 2025-05-07T20:11:16.1369371Z [CHECK] Symbol NOT found in ./_skbuild/linux-x86_64-3.13/cmake-build/fbgemm_gpu_embedding_inplace_ops.so: fbgemm_gpu::jagged_2d_to_dense 2025-05-07T20:11:16.1675700Z [CHECK] Found symbol in ./_skbuild/linux-x86_64-3.13/cmake-build/fbgemm_gpu_py.so: fbgemm_gpu::jagged_2d_to_dense 2025-05-07T20:11:16.1677835Z ################################################################################ 2025-05-07T20:11:16.1679362Z [BUILD] Wheel Audit: dist/fbgemm_gpu_nightly-2025.5.7-cp313-cp313-manylinux_2_28_x86_64.whl 2025-05-07T20:11:16.1679771Z 2025-05-07T20:11:16.1680298Z + conda run --no-capture-output -n build_binary auditwheel show dist/fbgemm_gpu_nightly-2025.5.7-cp313-cp313-manylinux_2_28_x86_64.whl 2025-05-07T20:11:16.1680886Z 2025-05-07T20:11:27.5960890Z 2025-05-07T20:11:27.5961351Z fbgemm_gpu_nightly-2025.5.7-cp313-cp313-manylinux_2_28_x86_64.whl is 2025-05-07T20:11:27.5961918Z consistent with the following platform tag: "linux_x86_64". 2025-05-07T20:11:27.5962237Z 2025-05-07T20:11:27.5962409Z The wheel references external versioned symbols in these 2025-05-07T20:11:27.5962895Z system-provided shared libraries: libgcc_s.so.1 with versions 2025-05-07T20:11:27.5963361Z {'GCC_3.4', 'GCC_3.0'}, libstdc++.so.6 with versions 2025-05-07T20:11:27.5963791Z {'GLIBCXX_3.4.14', 'CXXABI_1.3', 'GLIBCXX_3.4', 'GLIBCXX_3.4.18', 2025-05-07T20:11:27.5964272Z 'CXXABI_1.3.11', 'CXXABI_1.3.8', 'CXXABI_1.3.5', 'GLIBCXX_3.4.9', 2025-05-07T20:11:27.5964698Z 'GLIBCXX_3.4.20', 'GLIBCXX_3.4.29', 'GLIBCXX_3.4.11', 2025-05-07T20:11:27.5965113Z 'GLIBCXX_3.4.21', 'GLIBCXX_3.4.15', 'CXXABI_1.3.9', 'CXXABI_1.3.3', 2025-05-07T20:11:27.5965526Z 'GLIBCXX_3.4.19', 'CXXABI_1.3.7'}, libc.so.6 with versions 2025-05-07T20:11:27.5965922Z {'GLIBC_2.14', 'GLIBC_2.2.5'}, libm.so.6 with versions 2025-05-07T20:11:27.5966343Z {'GLIBC_2.2.5'}, libcudart.so.12 with versions {'libcudart.so.12'} 2025-05-07T20:11:27.5966637Z 2025-05-07T20:11:27.5966836Z This constrains the platform tag to "manylinux_2_34_x86_64". In order 2025-05-07T20:11:27.5967362Z to achieve a more compatible tag, you would need to recompile a new 2025-05-07T20:11:27.5967942Z wheel from source on a system with earlier versions of these 2025-05-07T20:11:27.5968344Z libraries, such as a recent manylinux image. 2025-05-07T20:11:27.6776202Z 2025-05-07T20:11:27.6776218Z 2025-05-07T20:11:27.6776762Z ################################################################################ 2025-05-07T20:11:27.6777157Z [BUILD] Enumerating the built wheels ... 2025-05-07T20:11:27.6777691Z + ls -lth dist/fbgemm_gpu_nightly-2025.5.7-cp313-cp313-manylinux_2_28_x86_64.whl 2025-05-07T20:11:27.6778056Z 2025-05-07T20:11:27.6796621Z -rw-r--r--. 1 root root 511M May 7 20:11 dist/fbgemm_gpu_nightly-2025.5.7-cp313-cp313-manylinux_2_28_x86_64.whl 2025-05-07T20:11:27.6797431Z 2025-05-07T20:11:27.6797561Z [BUILD] Enumerating the wheel SHAs ... 2025-05-07T20:11:27.6798067Z + sha1sum dist/fbgemm_gpu_nightly-2025.5.7-cp313-cp313-manylinux_2_28_x86_64.whl 2025-05-07T20:11:27.6798435Z 2025-05-07T20:11:28.6429196Z f40dfdb3d81094fd93da19c8d3d81ac3b2035255 dist/fbgemm_gpu_nightly-2025.5.7-cp313-cp313-manylinux_2_28_x86_64.whl 2025-05-07T20:11:28.6429762Z 2025-05-07T20:11:28.6430031Z + sha256sum dist/fbgemm_gpu_nightly-2025.5.7-cp313-cp313-manylinux_2_28_x86_64.whl 2025-05-07T20:11:28.6430445Z 2025-05-07T20:11:30.8716623Z 6795258f725fca3c1246a2ffa7ffcbba6e1358857b40fdcfe861a866e2aa7911 dist/fbgemm_gpu_nightly-2025.5.7-cp313-cp313-manylinux_2_28_x86_64.whl 2025-05-07T20:11:30.8717355Z 2025-05-07T20:11:30.8717612Z + md5sum dist/fbgemm_gpu_nightly-2025.5.7-cp313-cp313-manylinux_2_28_x86_64.whl 2025-05-07T20:11:30.8717970Z 2025-05-07T20:11:31.7303036Z dae32a62eef74a301f66ed57f9c80634 dist/fbgemm_gpu_nightly-2025.5.7-cp313-cp313-manylinux_2_28_x86_64.whl 2025-05-07T20:11:31.7304529Z 2025-05-07T20:11:31.7304927Z [BUILD] FBGEMM-GPU build + package completed 2025-05-07T20:11:31.7398117Z ##[group]Run actions/upload-artifact@v4 2025-05-07T20:11:31.7398447Z with: 2025-05-07T20:11:31.7398673Z name: fbgemm_default_x86_gcc_py3.13_cu12.6.3.whl 2025-05-07T20:11:31.7399007Z path: fbgemm_gpu/dist/*.whl 2025-05-07T20:11:31.7399264Z if-no-files-found: error 2025-05-07T20:11:31.7399522Z compression-level: 6 2025-05-07T20:11:31.7399744Z overwrite: false 2025-05-07T20:11:31.7399989Z include-hidden-files: false 2025-05-07T20:11:31.7400225Z env: 2025-05-07T20:11:31.7400443Z PRELUDE: .github/scripts/setup_env.bash 2025-05-07T20:11:31.7400841Z BUILD_ENV: build_binary 2025-05-07T20:11:31.7401072Z BUILD_TARGET: default 2025-05-07T20:11:31.7401310Z BUILD_VARIANT: cuda 2025-05-07T20:11:31.7401536Z BUILD_CUDA_VERSION: 12.6.3 2025-05-07T20:11:31.7401785Z ##[endgroup] 2025-05-07T20:11:31.7404832Z ##[command]/usr/bin/docker exec 180e7cabfdf5006d0fba1b706a3ca06949f33b672d353cfee7af27b6030ff987 sh -c "cat /etc/*release | grep ^ID" 2025-05-07T20:11:32.2004754Z With the provided path, there will be 1 file uploaded 2025-05-07T20:11:32.2005811Z Artifact name is valid! 2025-05-07T20:11:32.2006353Z Root directory input is valid! 2025-05-07T20:11:32.2820882Z Beginning upload of artifact content to blob storage 2025-05-07T20:11:32.9110664Z Uploaded bytes 8388608 2025-05-07T20:11:33.2412290Z Uploaded bytes 16777216 2025-05-07T20:11:33.4758282Z Uploaded bytes 25165824 2025-05-07T20:11:33.8216882Z Uploaded bytes 33554432 2025-05-07T20:11:34.1661511Z Uploaded bytes 41943040 2025-05-07T20:11:34.4162442Z Uploaded bytes 50331648 2025-05-07T20:11:34.7448091Z Uploaded bytes 58720256 2025-05-07T20:11:34.9931852Z Uploaded bytes 67108864 2025-05-07T20:11:35.3562748Z Uploaded bytes 75497472 2025-05-07T20:11:35.6442156Z Uploaded bytes 83886080 2025-05-07T20:11:35.9531272Z Uploaded bytes 92274688 2025-05-07T20:11:36.2880994Z Uploaded bytes 100663296 2025-05-07T20:11:36.6140427Z Uploaded bytes 109051904 2025-05-07T20:11:36.9078221Z Uploaded bytes 117440512 2025-05-07T20:11:37.3011979Z Uploaded bytes 125829120 2025-05-07T20:11:37.5050288Z Uploaded bytes 134217728 2025-05-07T20:11:37.8257144Z Uploaded bytes 142606336 2025-05-07T20:11:38.1977385Z Uploaded bytes 150994944 2025-05-07T20:11:38.4769768Z Uploaded bytes 159383552 2025-05-07T20:11:38.8370188Z Uploaded bytes 167772160 2025-05-07T20:11:39.1079722Z Uploaded bytes 176160768 2025-05-07T20:11:39.5232245Z Uploaded bytes 184549376 2025-05-07T20:11:39.8228098Z Uploaded bytes 192937984 2025-05-07T20:11:40.0629145Z Uploaded bytes 201326592 2025-05-07T20:11:40.3973563Z Uploaded bytes 209715200 2025-05-07T20:11:40.7265919Z Uploaded bytes 218103808 2025-05-07T20:11:41.0369382Z Uploaded bytes 226492416 2025-05-07T20:11:41.2951319Z Uploaded bytes 234881024 2025-05-07T20:11:41.6081873Z Uploaded bytes 243269632 2025-05-07T20:11:41.9162412Z Uploaded bytes 251658240 2025-05-07T20:11:42.2695108Z Uploaded bytes 260046848 2025-05-07T20:11:42.5495390Z Uploaded bytes 268435456 2025-05-07T20:11:42.8793651Z Uploaded bytes 276824064 2025-05-07T20:11:43.0873523Z Uploaded bytes 285212672 2025-05-07T20:11:43.4730227Z Uploaded bytes 293601280 2025-05-07T20:11:43.7790299Z Uploaded bytes 301989888 2025-05-07T20:11:44.0070339Z Uploaded bytes 310378496 2025-05-07T20:11:44.3548719Z Uploaded bytes 318767104 2025-05-07T20:11:44.7414118Z Uploaded bytes 327155712 2025-05-07T20:11:45.0703689Z Uploaded bytes 335544320 2025-05-07T20:11:45.3500585Z Uploaded bytes 343932928 2025-05-07T20:11:45.6875677Z Uploaded bytes 352321536 2025-05-07T20:11:45.9841102Z Uploaded bytes 360710144 2025-05-07T20:11:46.3670430Z Uploaded bytes 369098752 2025-05-07T20:11:46.6174767Z Uploaded bytes 377487360 2025-05-07T20:11:46.9661013Z Uploaded bytes 385875968 2025-05-07T20:11:47.2209270Z Uploaded bytes 394264576 2025-05-07T20:11:47.5431228Z Uploaded bytes 402653184 2025-05-07T20:11:47.8703292Z Uploaded bytes 411041792 2025-05-07T20:11:48.1739829Z Uploaded bytes 419430400 2025-05-07T20:11:48.5048876Z Uploaded bytes 427819008 2025-05-07T20:11:48.8458131Z Uploaded bytes 436207616 2025-05-07T20:11:49.1947397Z Uploaded bytes 444596224 2025-05-07T20:11:49.4660377Z Uploaded bytes 452984832 2025-05-07T20:11:49.7900455Z Uploaded bytes 461373440 2025-05-07T20:11:50.0107614Z Uploaded bytes 469762048 2025-05-07T20:11:50.3737956Z Uploaded bytes 478150656 2025-05-07T20:11:50.6167914Z Uploaded bytes 486539264 2025-05-07T20:11:50.8732251Z Uploaded bytes 494927872 2025-05-07T20:11:51.1813625Z Uploaded bytes 503316480 2025-05-07T20:11:51.5673243Z Uploaded bytes 511705088 2025-05-07T20:11:51.8361021Z Uploaded bytes 520093696 2025-05-07T20:11:51.9790984Z Uploaded bytes 524550168 2025-05-07T20:11:52.0015967Z Finished uploading artifact content to blob storage! 2025-05-07T20:11:52.0018077Z SHA256 digest of uploaded artifact zip is 384f735fd6a39aaa85616e86f63c067760e5f487da9671a456a5b8ca97038449 2025-05-07T20:11:52.0018923Z Finalizing artifact upload 2025-05-07T20:11:52.0805298Z Artifact fbgemm_default_x86_gcc_py3.13_cu12.6.3.whl.zip successfully finalized. Artifact ID 3081459692 2025-05-07T20:11:52.0808024Z Artifact fbgemm_default_x86_gcc_py3.13_cu12.6.3.whl has been successfully uploaded! Final size is 524550168 bytes. Artifact ID is 3081459692 2025-05-07T20:11:52.0814577Z Artifact download URL: https://github.com/pytorch/FBGEMM/actions/runs/14891846252/artifacts/3081459692 2025-05-07T20:11:52.1062641Z Post job cleanup. 2025-05-07T20:11:52.1067928Z ##[command]/usr/bin/docker exec 180e7cabfdf5006d0fba1b706a3ca06949f33b672d353cfee7af27b6030ff987 sh -c "cat /etc/*release | grep ^ID" 2025-05-07T20:11:52.4717501Z [command]/usr/bin/git version 2025-05-07T20:11:52.4755097Z git version 2.47.1 2025-05-07T20:11:52.4784570Z Copying '/github/home/.gitconfig' to '/__w/_temp/97276e15-2589-46ae-a3b9-307cb847dc1a/.gitconfig' 2025-05-07T20:11:52.4792411Z Temporarily overriding HOME='/__w/_temp/97276e15-2589-46ae-a3b9-307cb847dc1a' before making global git config changes 2025-05-07T20:11:52.4793239Z Adding repository directory to the temporary git global config as a safe directory 2025-05-07T20:11:52.4797868Z [command]/usr/bin/git config --global --add safe.directory /__w/FBGEMM/FBGEMM 2025-05-07T20:11:52.4829307Z [command]/usr/bin/git config --local --name-only --get-regexp core\.sshCommand 2025-05-07T20:11:52.4854879Z [command]/usr/bin/git submodule foreach --recursive sh -c "git config --local --name-only --get-regexp 'core\.sshCommand' && git config --local --unset-all 'core.sshCommand' || :" 2025-05-07T20:11:52.5146769Z Entering 'external/asmjit' 2025-05-07T20:11:52.5212616Z Entering 'external/composable_kernel' 2025-05-07T20:11:52.5277838Z Entering 'external/cpuinfo' 2025-05-07T20:11:52.5330610Z Entering 'external/cutlass' 2025-05-07T20:11:52.5405515Z Entering 'external/googletest' 2025-05-07T20:11:52.5470641Z Entering 'external/hipify_torch' 2025-05-07T20:11:52.5532303Z Entering 'external/json' 2025-05-07T20:11:52.5597876Z [command]/usr/bin/git config --local --name-only --get-regexp http\.https\:\/\/github\.com\/\.extraheader 2025-05-07T20:11:52.5613421Z http.https://github.com/.extraheader 2025-05-07T20:11:52.5620535Z [command]/usr/bin/git config --local --unset-all http.https://github.com/.extraheader 2025-05-07T20:11:52.5647272Z [command]/usr/bin/git submodule foreach --recursive sh -c "git config --local --name-only --get-regexp 'http\.https\:\/\/github\.com\/\.extraheader' && git config --local --unset-all 'http.https://github.com/.extraheader' || :" 2025-05-07T20:11:52.5916571Z Entering 'external/asmjit' 2025-05-07T20:11:52.5950201Z http.https://github.com/.extraheader 2025-05-07T20:11:52.5991598Z Entering 'external/composable_kernel' 2025-05-07T20:11:52.6021744Z http.https://github.com/.extraheader 2025-05-07T20:11:52.6056273Z Entering 'external/cpuinfo' 2025-05-07T20:11:52.6099975Z http.https://github.com/.extraheader 2025-05-07T20:11:52.6135442Z Entering 'external/cutlass' 2025-05-07T20:11:52.6176223Z http.https://github.com/.extraheader 2025-05-07T20:11:52.6230158Z Entering 'external/googletest' 2025-05-07T20:11:52.6259359Z http.https://github.com/.extraheader 2025-05-07T20:11:52.6292886Z Entering 'external/hipify_torch' 2025-05-07T20:11:52.6340029Z http.https://github.com/.extraheader 2025-05-07T20:11:52.6375615Z Entering 'external/json' 2025-05-07T20:11:52.6416055Z http.https://github.com/.extraheader 2025-05-07T20:11:52.6619777Z Stop and remove container: c54895c4960e410caa647b9f8dfd47b1_amazonlinux2023_5ed145 2025-05-07T20:11:52.6625726Z ##[command]/usr/bin/docker rm --force 180e7cabfdf5006d0fba1b706a3ca06949f33b672d353cfee7af27b6030ff987 2025-05-07T20:11:54.4236901Z 180e7cabfdf5006d0fba1b706a3ca06949f33b672d353cfee7af27b6030ff987 2025-05-07T20:11:54.4273182Z Remove container network: github_network_a087cb1554ec44a393d5ce1ddb30c3db 2025-05-07T20:11:54.4277571Z ##[command]/usr/bin/docker network rm github_network_a087cb1554ec44a393d5ce1ddb30c3db 2025-05-07T20:11:55.2667680Z github_network_a087cb1554ec44a393d5ce1ddb30c3db 2025-05-07T20:11:55.2704508Z A job completed hook has been configured by the self-hosted runner administrator 2025-05-07T20:11:55.2725425Z ##[group]Run '/home/ec2-user/runner-scripts/after_job.sh' 2025-05-07T20:11:55.2732361Z shell: /usr/bin/bash --noprofile --norc -e -o pipefail {0} 2025-05-07T20:11:55.2732782Z ##[endgroup] 2025-05-07T20:11:55.2838710Z [!ALERT!] Swap in detected! [!ALERT!] 2025-05-07T20:12:05.4312310Z [!ALERT!] Swap out detected [!ALERT!] 2025-05-07T20:12:21.5019852Z Cleaning up orphan processes